Skip to main content

Showing 1–50 of 84 results for author: Mertikopoulos, P

.
  1. arXiv:2406.09241  [pdf, other

    math.OC cs.LG math.PR stat.ML

    What is the long-run distribution of stochastic gradient descent? A large deviations analysis

    Authors: Waïss Azizian, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: In this paper, we examine the long-run distribution of stochastic gradient descent (SGD) in general, non-convex problems. Specifically, we seek to understand which regions of the problem's state space are more likely to be visited by SGD, and by how much. Using an approach based on the theory of large deviations and randomly perturbed dynamical systems, we show that the long-run distribution of SG… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 70 pages, 3 figures; to be published in the proceedings of ICML 2024

    MSC Class: Primary 90C15; 90C26; 60F10; secondary 90C30; 68Q32

  2. arXiv:2405.17693  [pdf, other

    stat.ML cs.LG math.NA math.OC math.PR

    Tamed Langevin sampling under weaker conditions

    Authors: Iosif Lytras, Panayotis Mertikopoulos

    Abstract: Motivated by applications to deep learning which often fail standard Lipschitz smoothness requirements, we examine the problem of sampling from distributions that are not log-concave and are only weakly dissipative, with log-gradients allowed to grow superlinearly at infinity. In terms of structure, we only assume that the target distribution satisfies either a log-Sobolev or a Poincaré inequality… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 32 pages, 2 figures

    MSC Class: Primary 65C05; 60H10; secondary 68Q32

  3. arXiv:2405.07224  [pdf, other

    cs.GT cs.LG math.OC

    A geometric decomposition of finite games: Convergence vs. recurrence under exponential weights

    Authors: Davide Legacci, Panayotis Mertikopoulos, Bary Pradelski

    Abstract: In view of the complexity of the dynamics of learning in games, we seek to decompose a game into simpler components where the dynamics' long-run behavior is well understood. A natural starting point for this is Helmholtz's theorem, which decomposes a vector field into a potential and an incompressible component. However, the geometry of game dynamics - and, in particular, the dynamics of exponenti… ▽ More

    Submitted 18 May, 2024; v1 submitted 12 May, 2024; originally announced May 2024.

    Comments: 49 pages, 16 figures

    MSC Class: Primary 91A10; 91A26; secondary 91A68; 68Q32; 68T05

  4. arXiv:2402.09824  [pdf, other

    math.DS cs.GT

    On the discrete-time origins of the replicator dynamics: From convergence to instability and chaos

    Authors: Fryderyk Falniowski, Panayotis Mertikopoulos

    Abstract: We consider three distinct discrete-time models of learning and evolution in games: a biological model based on intra-species selective pressure, the dynamics induced by pairwise proportional imitation, and the exponential / multiplicative weights (EW) algorithm for online learning. Even though these models share the same continuous-time limit - the replicator dynamics - we show that second-order… ▽ More

    Submitted 26 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 22 pages, 8 figures

    MSC Class: Primary 91A22; 91A26; 37E05; secondary 37N40; 91A14

  5. arXiv:2312.16609  [pdf, other

    cs.GT cs.LG

    Exploiting hidden structures in non-convex games for convergence to Nash equilibrium

    Authors: Iosif Sakos, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Panayotis Mertikopoulos, Georgios Piliouras

    Abstract: A wide array of modern machine learning applications - from adversarial models to multi-agent reinforcement learning - can be formulated as non-cooperative games whose Nash equilibria represent the system's desired operational states. Despite having a highly non-convex loss landscape, many cases of interest possess a latent convex structure that could potentially be leveraged to yield convergence… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: 32 pages, 18 figures

    MSC Class: Primary 91A10; 91A26; secondary 68Q32

  6. arXiv:2311.10859  [pdf, other

    quant-ph cs.GT cs.LG math.OC

    A Quadratic Speedup in Finding Nash Equilibria of Quantum Zero-Sum Games

    Authors: Francisca Vasconcelos, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Panayotis Mertikopoulos, Georgios Piliouras, Michael I. Jordan

    Abstract: Recent developments in domains such as non-local games, quantum interactive proofs, and quantum generative adversarial networks have renewed interest in quantum game theory and, specifically, quantum zero-sum games. Central to classical game theory is the efficient algorithmic computation of Nash equilibria, which represent optimal strategies for both players. In 2008, Jain and Watrous proposed th… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 53 pages, 7 figures, QTML 2023 (Accepted (Long Talk))

    MSC Class: primary 91A05; 81Q93; secondary 68Q32; 91A26; 37N40;

  7. arXiv:2311.02423  [pdf, other

    cs.GT cs.LG math.OC quant-ph

    Payoff-based learning with matrix multiplicative weights in quantum games

    Authors: Kyriakos Lotidis, Panayotis Mertikopoulos, Nicholas Bambos, Jose Blanchet

    Abstract: In this paper, we study the problem of learning in quantum games - and other classes of semidefinite games - with scalar, payoff-based feedback. For concreteness, we focus on the widely used matrix multiplicative weights (MMW) algorithm and, instead of requiring players to have full knowledge of the game (and/or each other's chosen states), we introduce a suite of minimal-information matrix multip… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 39 pages, 21 figures, 2 tables

    MSC Class: Primary 91A10; 91A26; 37N40; secondary 68Q32; 81Q93

  8. arXiv:2311.02407  [pdf, other

    cs.GT cs.LG math.OC

    The equivalence of dynamic and strategic stability under regularized learning in games

    Authors: Victor Boone, Panayotis Mertikopoulos

    Abstract: In this paper, we examine the long-run behavior of regularized, no-regret learning in finite games. A well-known result in the field states that the empirical frequencies of no-regret play converge to the game's set of coarse correlated equilibria; however, our understanding of how the players' actual strategies evolve over time is much more limited - and, in many cases, non-existent. This issue i… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 31 pages, 8 figures, 2 tables

    MSC Class: Primary 91A10; 91A26; secondary 68Q32; 62L20

  9. arXiv:2311.02374  [pdf, other

    math.OC cs.LG

    Riemannian stochastic optimization methods avoid strict saddle points

    Authors: Ya-** Hsieh, Mohammad Reza Karimi, Andreas Krause, Panayotis Mertikopoulos

    Abstract: Many modern machine learning applications - from online principal component analysis to covariance matrix identification and dictionary learning - can be formulated as minimization problems on Riemannian manifolds, and are typically solved with a Riemannian stochastic gradient method (or some variant thereof). However, in many cases of interest, the resulting minimization problem is not geodesical… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: 27 pages, 3 figures

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C48

  10. arXiv:2302.02333  [pdf, other

    cs.GT cs.LG math.OC quant-ph

    Learning in quantum games

    Authors: Kyriakos Lotidis, Panayotis Mertikopoulos, Nicholas Bambos

    Abstract: In this paper, we introduce a class of learning dynamics for general quantum games, that we call "follow the quantum regularized leader" (FTQL), in reference to the classical "follow the regularized leader" (FTRL) template for learning in finite games. We show that the induced quantum state dynamics decompose into (i) a classical, commutative component which governs the dynamics of the system's ei… ▽ More

    Submitted 5 February, 2023; originally announced February 2023.

    Comments: 30 pages, 4 figures

    MSC Class: Primary 91A81; 37N40; 68Q32; secondary 68T05; 81Q93; 91B80

  11. arXiv:2211.08043  [pdf, ps, other

    math.OC cs.LG

    The rate of convergence of Bregman proximal methods: Local geometry vs. regularity vs. sharpness

    Authors: Waïss Azizian, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: We examine the last-iterate convergence rate of Bregman proximal methods - from mirror descent to mirror-prox and its optimistic variants - as a function of the local geometry induced by the prox-map** defining the method. For generality, we focus on local solutions of constrained, non-monotone variational inequalities, and we show that the convergence rate of a given method depends sharply on i… ▽ More

    Submitted 2 August, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 31 pages, 2 tables, 2 figures

    MSC Class: Primary 65K15; 90C33; secondary 68Q25; 68Q32

  12. arXiv:2210.12860  [pdf, ps, other

    math.OC cs.CC cs.LG

    Explicit Second-Order Min-Max Optimization Methods with Optimal Convergence Guarantee

    Authors: Tianyi Lin, Panayotis Mertikopoulos, Michael I. Jordan

    Abstract: We propose and analyze several inexact regularized Newton-type methods for finding a global saddle point of \emph{convex-concave} unconstrained min-max optimization problems. Compared to first-order methods, our understanding of second-order methods for min-max optimization is relatively limited, as obtaining global rates of convergence with second-order information is much more involved. In this… ▽ More

    Submitted 23 April, 2024; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Provide a simple subroutine with a detailed complexity analysis; 30 pages, 9 figures

  13. arXiv:2210.08857  [pdf, ps, other

    cs.GT cs.LG math.OC

    On the convergence of policy gradient methods to Nash equilibria in general stochastic games

    Authors: Angeliki Giannou, Kyriakos Lotidis, Panayotis Mertikopoulos, Emmanouil-Vasileios Vlatakis-Gkaragkounis

    Abstract: Learning in stochastic games is a notoriously difficult problem because, in addition to each other's strategic decisions, the players must also contend with the fact that the game itself evolves over time, possibly in a very complicated manner. Because of this, the convergence properties of popular learning algorithms - like policy gradient and its variants - are poorly understood, except in speci… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 43 pages, 2 tables; to appear in the proceedings of NeurIPS 2022

    MSC Class: Primary 91A15; 91A26; secondary 68Q32; 68T05; 90C40

  14. Survival of dominated strategies under imitation dynamics

    Authors: Panayotis Mertikopoulos, Yannick Viossat

    Abstract: The literature on evolutionary game theory suggests that pure strategies that are strictly dominated by other pure strategies always become extinct under imitative game dynamics, but they can survive under innovative dynamics. As we explain, this is because innovative dynamics favour rare strategies while standard imitative dynamics do not. However, as we also show, there are reasonable imitation… ▽ More

    Submitted 17 September, 2022; originally announced September 2022.

    Comments: 27 pages, 7 figures

    MSC Class: Primary: 91A22; 91A26

  15. arXiv:2209.04926  [pdf, other

    cs.GT

    Learning in Games with Quantized Payoff Observations

    Authors: Kyriakos Lotidis, Panayotis Mertikopoulos, Nicholas Bambos

    Abstract: This paper investigates the impact of feedback quantization on multi-agent learning. In particular, we analyze the equilibrium convergence properties of the well-known "follow the regularized leader" (FTRL) class of algorithms when players can only observe a quantized (and possibly noisy) version of their payoffs. In this information-constrained setting, we show that coarser quantization triggers… ▽ More

    Submitted 11 September, 2022; originally announced September 2022.

  16. arXiv:2207.07543  [pdf, other

    math.OC cs.DC cs.LG

    Pick your Neighbor: Local Gauss-Southwell Rule for Fast Asynchronous Decentralized Optimization

    Authors: Marina Costantini, Nikolaos Liakopoulos, Panayotis Mertikopoulos, Thrasyvoulos Spyropoulos

    Abstract: In decentralized optimization environments, each agent $i$ in a network of $n$ nodes has its own private function $f_i$, and nodes communicate with their neighbors to cooperatively minimize the aggregate objective $\sum_{i=1}^n f_i$. In this setting, synchronizing the nodes' updates incurs significant communication overhead and computational costs, so much of the recent literature has focused on t… ▽ More

    Submitted 15 September, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: Revised writing, added references

    MSC Class: 90C25; 68T99 ACM Class: G.1.6; I.2.11; I.2.6; C.2.4

  17. arXiv:2206.09352  [pdf, other

    math.OC

    A universal black-box optimization method with almost dimension-free convergence rate guarantees

    Authors: Kimon Antonakopoulos, Dong Quan Vu, Vokan Cevher, Kfir Y. Levy, Panayotis Mertikopoulos

    Abstract: Universal methods for optimization are designed to achieve theoretically optimal convergence rates without any prior knowledge of the problem's regularity parameters or the accurarcy of the gradient oracle employed by the optimizer. In this regard, existing state-of-the-art algorithms achieve an $\mathcal{O}(1/T^2)$ value convergence rate in Lipschitz smooth problems with a perfect gradient oracle… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 31 pages, 4 figures, 1 table; to appear in ICML 2022

    MSC Class: Primary 90C25; 90C15; secondary 68Q32; 68T05

  18. arXiv:2206.09348  [pdf, other

    cs.LG cs.GT math.OC

    Nested bandits

    Authors: Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier, Houssam Zenati

    Abstract: In many online decision processes, the optimizing agent is called to choose between large numbers of alternatives with many inherent similarities; in turn, these similarities imply closely correlated losses that may confound standard discrete choice models and bandit algorithms. We study this question in the context of nested bandits, a class of adversarial multi-armed bandit problems where the le… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 35 pages, 14 figures; to appear in ICML 2022

    MSC Class: Primary 68Q32; secondary 91B06

  19. arXiv:2206.06795  [pdf, other

    math.OC cs.LG math.DS

    Riemannian stochastic approximation algorithms

    Authors: Mohammad Reza Karimi, Ya-** Hsieh, Panayotis Mertikopoulos, Andreas Krause

    Abstract: We examine a wide class of stochastic approximation algorithms for solving (stochastic) nonlinear problems on Riemannian manifolds. Such algorithms arise naturally in the study of Riemannian optimization, game theory and optimal transport, but their behavior is much less understood compared to the Euclidean case because of the lack of a global linear structure on the manifold. We overcome this dif… ▽ More

    Submitted 27 December, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: 33 pages, 2 figures; a one-page abstract of this paper was presented in COLT 2022

    MSC Class: Primary 62L20; 37N40; secondary 90C15; 90C47; 90C48

  20. arXiv:2206.06015  [pdf, other

    cs.GT cs.LG

    No-Regret Learning in Games with Noisy Feedback: Faster Rates and Adaptivity via Learning Rate Separation

    Authors: Yu-Guan Hsieh, Kimon Antonakopoulos, Volkan Cevher, Panayotis Mertikopoulos

    Abstract: We examine the problem of regret minimization when the learner is involved in a continuous game with other optimizing agents: in this case, if all players follow a no-regret algorithm, it is possible to achieve significantly lower regret relative to fully adversarial environments. We study this problem in the context of variationally stable games (a class of continuous games which includes all con… ▽ More

    Submitted 17 March, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: In Advances in Neural Information Processing Systems 35 (NeurIPS 2022)

  21. arXiv:2206.03922  [pdf, other

    cs.GT cs.LG math.OC

    A unified stochastic approximation framework for learning in games

    Authors: Panayotis Mertikopoulos, Ya-** Hsieh, Volkan Cevher

    Abstract: We develop a flexible stochastic approximation framework for analyzing the long-run behavior of learning in games (both continuous and finite). The proposed analysis template incorporates a wide array of popular learning algorithms, including gradient-based methods, the exponential/multiplicative weights algorithm for learning in finite games, optimistic and bandit variants of the above, etc. In a… ▽ More

    Submitted 3 July, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: 40 pages, 5 figures, 2 tables

    MSC Class: Primary 91A10; 91A26; secondary 68Q32; 68T02

  22. arXiv:2201.02985  [pdf, other

    cs.GT

    Routing in an Uncertain World: Adaptivity, Efficiency, and Equilibrium

    Authors: Dong Quan Vu, Kimon Antonakopoulos, Panayotis Mertikopoulos

    Abstract: We consider the traffic assignment problem in nonatomic routing games where the players' cost functions may be subject to random fluctuations (e.g., weather disturbances, perturbations in the underlying network, etc.). We tackle this problem from the viewpoint of a control interface that makes routing recommendations based solely on observed costs and without any further knowledge of the system's… ▽ More

    Submitted 9 January, 2022; originally announced January 2022.

  23. arXiv:2109.05829  [pdf, other

    cs.LG math.OC

    Zeroth-order non-convex learning via hierarchical dual averaging

    Authors: Amélie Héliou, Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier

    Abstract: We propose a hierarchical version of dual averaging for zeroth-order online non-convex optimization - i.e., learning processes where, at each stage, the optimizer is facing an unknown non-convex loss function and only receives the incurred loss as feedback. The proposed class of policies relies on the construction of an online model that aggregates loss information as it arrives, and it consists o… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: 40 pages, 14 figures

    MSC Class: Primary 68Q32; 90C56; secondary 90C15; 90C26

  24. arXiv:2108.04506  [pdf, other

    cs.GT math.OC

    A heuristic for estimating Nash equilibria in first-price auctions with correlated values

    Authors: Benjamin Heymann, Panayotis Mertikopoulos

    Abstract: Our paper concerns the computation of Nash equilibria of first-price auctions with correlated values. While there exist several equilibrium computation methods for auctions with independent values, the correlation of the bidders' values introduces significant complications that render existing methods unsatisfactory in practice. Our contribution is a step towards filling this gap: inspired by the… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

  25. arXiv:2107.08011  [pdf, other

    math.OC cs.LG

    Adaptive first-order methods revisited: Convex optimization without Lipschitz requirements

    Authors: Kimon Antonakopoulos, Panayotis Mertikopoulos

    Abstract: We propose a new family of adaptive first-order methods for a class of convex minimization problems that may fail to be Lipschitz continuous or smooth in the standard sense. Specifically, motivated by a recent flurry of activity on non-Lipschitz (NoLips) optimization, we consider problems that are continuous or smooth relative to a reference Bregman function - as opposed to a global, ambient norm… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: 34 pages, 4 figures

    MSC Class: Primary 90C25; 90C15; 90C30; secondary 68Q25; 90C60

  26. arXiv:2107.02919  [pdf, other

    math.OC cs.LG

    Distributed stochastic optimization with large delays

    Authors: Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Peter W. Glynn, Yinyu Ye

    Abstract: One of the most widely used methods for solving large-scale stochastic optimization problems is distributed asynchronous stochastic gradient descent (DASGD), a family of algorithms that result from parallelizing stochastic gradient descent on distributed computing architectures (possibly) asychronously. However, a key obstacle in the efficient implementation of DASGD is the issue of delays: when a… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: 41 pages, 8 figures; to be published in Mathematics of Operations Research

    MSC Class: Primary 90C15; 90C26; secondary 90C25; 90C06

  27. arXiv:2107.01906  [pdf, ps, other

    math.OC cs.LG

    The Last-Iterate Convergence Rate of Optimistic Mirror Descent in Stochastic Variational Inequalities

    Authors: Waïss Azizian, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: In this paper, we analyze the local convergence rate of optimistic mirror descent methods in stochastic variational inequalities, a class of optimization problems with important applications to learning theory and machine learning. Our analysis reveals an intricate relation between the algorithm's rate of convergence and the local geometry induced by the method's underlying Bregman function. We qu… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 31 pages, 3 figures, 1 table; to be presented at the 34th Annual Conference on Learning Theory (COLT 2021)

    MSC Class: 65K15; 90C33 (Primary) 68Q25; 68Q32 (Secondary)

  28. arXiv:2107.01595  [pdf, ps, other

    cs.GT cs.LG math.OC

    Learning in nonatomic games, Part I: Finite action spaces and population games

    Authors: Saeed Hadikhanloo, Rida Laraki, Panayotis Mertikopoulos, Sylvain Sorin

    Abstract: We examine the long-run behavior of a wide range of dynamics for learning in nonatomic games, in both discrete and continuous time. The class of dynamics under consideration includes fictitious play and its regularized variants, the best-reply dynamics (again, possibly regularized), as well as the dynamics of dual averaging / "follow the regularized leader" (which themselves include as special cas… ▽ More

    Submitted 4 July, 2021; originally announced July 2021.

    Comments: 27 pages

    MSC Class: Primary 91A22; 91A26; secondary 37N40; 68Q32

  29. arXiv:2106.14636  [pdf, other

    cs.GT

    Asymptotic Degradation of Linear Regression Estimates With Strategic Data Sources

    Authors: Benjamin Roussillon, Nicolas Gast, Patrick Loiseau, Panayotis Mertikopoulos

    Abstract: We consider the problem of linear regression from strategic data sources with a public good component, i.e., when data is provided by strategic agents who seek to minimize an individual provision cost for increasing their data's precision while benefiting from the model's overall precision. In contrast to previous works, our model tackles the case where there is uncertainty on the attributes chara… ▽ More

    Submitted 11 March, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: 31 pages, 7 figures

  30. arXiv:2105.13348  [pdf, other

    math.OC cs.LG cs.MA

    Optimization in Open Networks via Dual Averaging

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: In networks of autonomous agents (e.g., fleets of vehicles, scattered sensors), the problem of minimizing the sum of the agents' local functions has received a lot of interest. We tackle here this distributed optimization problem in the case of open networks when agents can join and leave the network at any time. Leveraging recent online optimization techniques, we propose and analyze the converge… ▽ More

    Submitted 16 October, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

    Comments: In 60th IEEE Conference on Decision and Control (CDC 2021); 7 pages, 1 figure

  31. arXiv:2104.12761  [pdf, other

    cs.GT cs.LG math.OC

    Adaptive Learning in Continuous Games: Optimal Regret Bounds and Convergence to Nash Equilibrium

    Authors: Yu-Guan Hsieh, Kimon Antonakopoulos, Panayotis Mertikopoulos

    Abstract: In game-theoretic learning, several agents are simultaneously following their individual interests, so the environment is non-stationary from each player's perspective. In this context, the performance of a learning algorithm is often measured by its regret. However, no-regret algorithms are not created equal in terms of game-theoretic guarantees: depending on how they are tuned, some of them may… ▽ More

    Submitted 16 October, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: In the 34th Annual Conference on Learning Theory (COLT 2021); 35 pages, 2 figures

  32. arXiv:2101.04667  [pdf, ps, other

    cs.GT cs.LG cs.MA math.OC

    Survival of the strictest: Stable and unstable equilibria under regularized learning with partial information

    Authors: Angeliki Giannou, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Panayotis Mertikopoulos

    Abstract: In this paper, we examine the Nash equilibrium convergence properties of no-regret learning in general N-player games. For concreteness, we focus on the archetypal follow the regularized leader (FTRL) family of algorithms, and we consider the full spectrum of uncertainty that the players may encounter - from noisy, oracle-based feedback, to bandit, payoff-based information. In this general context… ▽ More

    Submitted 4 February, 2021; v1 submitted 12 January, 2021; originally announced January 2021.

  33. arXiv:2012.11579  [pdf, ps, other

    cs.LG cs.MA math.OC

    Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: In this paper, we provide a general framework for studying multi-agent online learning problems in the presence of delays and asynchronicities. Specifically, we propose and analyze a class of adaptive dual averaging schemes in which agents only need to accumulate gradient feedback received from the whole system, without requiring any between-agent coordination. In the single-agent case, the adapti… ▽ More

    Submitted 16 April, 2022; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: Accepted by Journal of Machine Learning Research (JMLR)

  34. arXiv:2010.12100  [pdf, other

    math.OC cs.GT cs.LG

    Adaptive extra-gradient methods for min-max optimization and games

    Authors: Kimon Antonakopoulos, E. Veronica Belmega, Panayotis Mertikopoulos

    Abstract: We present a new family of min-max optimization algorithms that automatically exploit the geometry of the gradient data observed at earlier iterations to perform more informative extra-gradient steps in later ones. Thanks to this adaptation mechanism, the proposed method automatically detects whether the problem is smooth or not, without requiring any prior tuning by the optimizer. As a result, th… ▽ More

    Submitted 19 November, 2020; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 28 pages, 5 figures, 1 table

    MSC Class: Primary 90C47; 91A68; secondary 49J40; 90C33

  35. arXiv:2010.09514  [pdf, ps, other

    cs.GT cs.LG math.OC

    No-regret learning and mixed Nash equilibria: They do not mix

    Authors: Lampros Flokas, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Thanasis Lianeas, Panayotis Mertikopoulos, Georgios Piliouras

    Abstract: Understanding the behavior of no-regret dynamics in general $N$-player games is a fundamental question in online learning and game theory. A folk result in the field states that, in finite games, the empirical frequency of play under no-regret learning converges to the game's set of coarse correlated equilibria. By contrast, our understanding of how the day-to-day behavior of the dynamics correlat… ▽ More

    Submitted 20 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: 24 pages, 7 figures, 1 table

    MSC Class: Primary 91A26; 37N40; secondary 91A68; 68Q32; 68T05

  36. arXiv:2010.08496  [pdf, other

    cs.LG math.OC

    Online non-convex optimization with imperfect feedback

    Authors: Amélie Héliou, Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier

    Abstract: We consider the problem of online learning with non-convex losses. In terms of feedback, we assume that the learner observes - or otherwise constructs - an inexact model for the loss function encountered at each stage, and we propose a mixed-strategy learning policy based on dual averaging. In this general context, we derive a series of tight regret minimization guarantees, both for the learner's… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: 30 pages, 2 figures, 1 table

    MSC Class: Primary 68Q32; secondary 90C26; 91A26

  37. arXiv:2010.06250  [pdf, ps, other

    cs.LG cs.GT math.OC

    Regret minimization in stochastic non-convex learning via a proximal-gradient approach

    Authors: Nadav Hallak, Panayotis Mertikopoulos, Volkan Cevher

    Abstract: Motivated by applications in machine learning and operations research, we study regret minimization with stochastic first-order oracle feedback in online constrained, and possibly non-smooth, non-convex problems. In this setting, the minimization of external regret is beyond reach for first-order methods, so we focus on a local regret measure defined via a proximal-gradient map**. To achieve no… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

  38. arXiv:2006.11144  [pdf, other

    math.OC cs.LG math.PR stat.ML

    On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems

    Authors: Panayotis Mertikopoulos, Nadav Hallak, Ali Kavis, Volkan Cevher

    Abstract: This paper analyzes the trajectories of stochastic gradient descent (SGD) to help understand the algorithm's convergence properties in non-convex problems. We first show that the sequence of iterates generated by SGD remains bounded and converges with probability $1$ under a very broad range of step-size schedules. Subsequently, going beyond existing positive probability guarantees, we show that S… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

    Comments: 32 pages, 8 figures

    MSC Class: Primary 90C26; 62L20; secondary 90C30; 90C15; 37N40

  39. arXiv:2006.10911  [pdf, ps, other

    cs.GT math.OC

    Gradient-free Online Learning in Games with Delayed Rewards

    Authors: Amélie Héliou, Panayotis Mertikopoulos, Zhengyuan Zhou

    Abstract: Motivated by applications to online advertising and recommender systems, we consider a game-theoretic model with delayed rewards and asynchronous, payoff-based feedback. In contrast to previous work on delayed multi-armed bandits, we focus on multi-player games with continuous action spaces, and we examine the long-run behavior of strategic agents that follow a no-regret learning policy (but are o… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: 26 pages, 4 figures; to appear in ICML 2020

    MSC Class: Primary 91A10; 91A68; 68Q32; secondary 91A20; 91A26; 68T05

  40. arXiv:2006.09065  [pdf, other

    math.OC cs.LG stat.ML

    The limits of min-max optimization algorithms: convergence to spurious non-critical sets

    Authors: Ya-** Hsieh, Panayotis Mertikopoulos, Volkan Cevher

    Abstract: Compared to ordinary function minimization problems, min-max optimization algorithms encounter far greater challenges because of the existence of periodic cycles and similar phenomena. Even though some of these behaviors can be overcome in the convex-concave regime, the general case is considerably more difficult. On that account, we take an in-depth look at a comprehensive class of state-of-the a… ▽ More

    Submitted 14 February, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

  41. arXiv:2006.05445  [pdf, other

    cs.IT eess.SP math.OC

    Fast Optimization with Zeroth-Order Feedback in Distributed, Multi-User MIMO Systems

    Authors: Olivier Bilenne, Panayotis Mertikopoulos, E. Veronica Belmega

    Abstract: In this paper, we develop a gradient-free optimization methodology for efficient resource allocation in Gaussian MIMO multiple access channels. Our approach combines two main ingredients: (i) an entropic semidefinite optimization based on matrix exponential learning (MXL); and (ii) a one-shot gradient estimator which achieves low variance through the reuse of past information. This novel algorithm… ▽ More

    Submitted 18 October, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: Final version; to appear in IEEE Transactions on Signal Processing; 16 pages, 4 figures

    ACM Class: C.2.1; G.1.6

    Journal ref: IEEE Trans. Signal Process., vol. 68, pp. 6085-6100, 2020

  42. arXiv:2003.10162  [pdf, other

    math.OC cs.GT cs.LG

    Explore Aggressively, Update Conservatively: Stochastic Extragradient Methods with Variable Stepsize Scaling

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: Owing to their stability and convergence speed, extragradient methods have become a staple for solving large-scale saddle-point problems in machine learning. The basic premise of these algorithms is the use of an extrapolation step before performing an update; thanks to this exploration step, extra-gradient methods overcome many of the non-convergence issues that plague gradient descent/ascent sch… ▽ More

    Submitted 5 November, 2020; v1 submitted 23 March, 2020; originally announced March 2020.

    Comments: In Advances in Neural Information Processing Systems 33 (NeurIPS 2020); 29 pages, 5 figures

    MSC Class: 65K15; 62L20; 90C15; 90C33

  43. arXiv:2003.09729  [pdf, other

    stat.ML cs.LG math.OC

    A new regret analysis for Adam-type algorithms

    Authors: Ahmet Alacaoglu, Yura Malitsky, Panayotis Mertikopoulos, Volkan Cevher

    Abstract: In this paper, we focus on a theory-practice gap for Adam and its variants (AMSgrad, AdamNC, etc.). In practice, these algorithms are used with a constant first-order moment parameter $β_{1}$ (typically between $0.9$ and $0.99$). In theory, regret guarantees for online convex optimization require a rapidly decaying $β_{1}\to0$ schedule. We show that this is an artifact of the standard analysis and… ▽ More

    Submitted 21 March, 2020; originally announced March 2020.

  44. arXiv:2002.09806  [pdf, ps, other

    math.OC cs.GT stat.ML

    Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games

    Authors: Tianyi Lin, Zhengyuan Zhou, Panayotis Mertikopoulos, Michael I. Jordan

    Abstract: In this paper, we consider multi-agent learning via online gradient descent in a class of games called $λ$-cocoercive games, a fairly broad class of games that admits many Nash equilibria and that properly includes unconstrained strongly monotone games. We characterize the finite-time last-iterate convergence rate for joint OGD learning on $λ$-cocoercive games; further, building on this result, we… ▽ More

    Submitted 17 July, 2021; v1 submitted 22 February, 2020; originally announced February 2020.

    Comments: Accepted by ICML 2020; The first two authors contributed equally to this work

  45. arXiv:2001.00468  [pdf, ps, other

    cs.GT

    Quick or cheap? Breaking points in dynamic markets

    Authors: Panayotis Mertikopoulos, Heinrich H. Nax, Bary S. R. Pradelski

    Abstract: We examine two-sided markets where players arrive stochastically over time and are drawn from a continuum of types. The cost of matching a client and provider varies, so a social planner is faced with two contending objectives: a) to reduce players' waiting time before getting matched; and b) to form efficient pairs in order to reduce matching costs. We show that such markets are characterized by… ▽ More

    Submitted 3 January, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: 32 pages, 2 tables

  46. arXiv:1908.08465  [pdf, other

    math.OC cs.GT cs.LG

    On the convergence of single-call stochastic extra-gradient methods

    Authors: Yu-Guan Hsieh, Franck Iutzeler, Jérôme Malick, Panayotis Mertikopoulos

    Abstract: Variational inequalities have recently attracted considerable interest in machine learning as a flexible paradigm for models that go beyond ordinary loss function minimization (such as generative adversarial networks and related deep learning systems). In this setting, the optimal $\mathcal{O}(1/t)$ convergence rate for solving smooth monotone variational inequalities is achieved by the Extra-Grad… ▽ More

    Submitted 11 February, 2020; v1 submitted 22 August, 2019; originally announced August 2019.

    Comments: In Advances in Neural Information Processing Systems 32 (NeurIPS 2019); 24 pages, 3 figures

    MSC Class: 65K15; 62L20; 90C15; 90C33

  47. arXiv:1902.03355  [pdf, other

    math.OC cs.LG

    Forward-backward-forward methods with variance reduction for stochastic variational inequalities

    Authors: Radu Ioan Bot, Panayotis Mertikopoulos, Mathias Staudigl, Phan Tu Vuong

    Abstract: We develop a new stochastic algorithm with variance reduction for solving pseudo-monotone stochastic variational inequalities. Our method builds on Tseng's forward-backward-forward (FBF) algorithm, which is known in the deterministic literature to be a valuable alternative to Korpelevich's extragradient method when solving variational inequalities over a convex and closed set governed by pseudo-mo… ▽ More

    Submitted 8 February, 2019; originally announced February 2019.

    Comments: 34 pages, 11 figures

    MSC Class: Primary 65K15; 62L20; secondary 90C15; 90C33

  48. arXiv:1810.01925  [pdf, ps, other

    cs.GT cs.LG math.OC

    Bandit learning in concave $N$-person games

    Authors: Mario Bravo, David S. Leslie, Panayotis Mertikopoulos

    Abstract: This paper examines the long-run behavior of learning with bandit feedback in non-cooperative concave games. The bandit framework accounts for extremely low-information environments where the agents may not even know they are playing a game; as such, the agents' most sensible choice in this setting would be to employ a no-regret learning algorithm. In general, this does not mean that the players'… ▽ More

    Submitted 3 October, 2018; originally announced October 2018.

    Comments: 24 pages, 1 figure

    MSC Class: Primary 91A10; 91A26; secondary 68Q32; 68T02

  49. arXiv:1809.09449  [pdf, other

    math.OC cs.LG

    Hessian barrier algorithms for linearly constrained optimization problems

    Authors: Immanuel M. Bomze, Panayotis Mertikopoulos, Werner Schachinger, Mathias Staudigl

    Abstract: In this paper, we propose an interior-point method for linearly constrained optimization problems (possibly nonconvex). The method - which we call the Hessian barrier algorithm (HBA) - combines a forward Euler discretization of Hessian Riemannian gradient flows with an Armijo backtracking step-size policy. In this way, HBA can be seen as an alternative to mirror descent (MD), and contains as speci… ▽ More

    Submitted 8 May, 2019; v1 submitted 25 September, 2018; originally announced September 2018.

    Comments: 27 pages, 6 figures

    MSC Class: Primary: 90C51; 90C30; secondary: 90C25; 90C26

    Journal ref: SIAM Journal on Optimization 29 (2019), 2100-2127

  50. arXiv:1809.03066  [pdf, ps, other

    cs.GT cs.LG math.OC

    Multi-agent online learning in time-varying games

    Authors: Benoit Duvocelle, Panayotis Mertikopoulos, Mathias Staudigl, Dries Vermeulen

    Abstract: We examine the long-run behavior of multi-agent online learning in games that evolve over time. Specifically, we focus on a wide class of policies based on mirror descent, and we show that the induced sequence of play (a) converges to Nash equilibrium in time-varying games that stabilize in the long run to a strictly monotone limit; and (b) it stays asymptotically close to the evolving equilibrium… ▽ More

    Submitted 4 September, 2021; v1 submitted 9 September, 2018; originally announced September 2018.

    Comments: 35 pages

    MSC Class: Primary 91A10; 91A26; secondary 68Q32; 68T02

    Journal ref: Mathematics of Operations Research, 2022