Skip to main content

Showing 1–25 of 25 results for author: Anagnostides, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.09670  [pdf, ps, other

    cs.GT

    Efficient $Φ$-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form Games

    Authors: Brian Hu Zhang, Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm

    Abstract: Recent breakthrough results by Dagan, Daskalakis, Fishelson and Golowich [2023] and Peng and Rubinstein [2023] established an efficient algorithm attaining at most $ε$ swap regret over extensive-form strategy spaces of dimension $N$ in $N^{\tilde O(1/ε)}$ rounds. On the other extreme, Farina and Pipis [2023] developed an efficient algorithm for minimizing the weaker notion of linear-swap regret in… ▽ More

    Submitted 17 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  2. arXiv:2312.12067  [pdf, other

    cs.GT cs.LG

    Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property

    Authors: Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina, Tuomas Sandholm

    Abstract: Policy gradient methods enjoy strong practical performance in numerous tasks in reinforcement learning. Their theoretical understanding in multiagent settings, however, remains limited, especially beyond two-player competitive and potential Markov games. In this paper, we develop a new framework to characterize optimistic policy gradient methods in multi-player Markov games with a single controlle… ▽ More

    Submitted 21 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: To appear at AAAI 2024

  3. arXiv:2311.14869  [pdf, ps, other

    cs.GT

    On the Complexity of Computing Sparse Equilibria and Lower Bounds for No-Regret Learning in Games

    Authors: Ioannis Anagnostides, Alkis Kalavasis, Tuomas Sandholm, Manolis Zampetakis

    Abstract: Characterizing the performance of no-regret dynamics in multi-player games is a foundational problem at the interface of online learning and game theory. Recent results have revealed that when all players adopt specific learning algorithms, it is possible to improve exponentially over what is predicted by the overly pessimistic no-regret framework in the traditional adversarial regime, thereby lea… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: To appear at ITCS 2024

  4. arXiv:2310.16976  [pdf, other

    cs.GT

    On the Interplay between Social Welfare and Tractability of Equilibria

    Authors: Ioannis Anagnostides, Tuomas Sandholm

    Abstract: Computational tractability and social welfare (aka. efficiency) of equilibria are two fundamental but in general orthogonal considerations in algorithmic game theory. Nevertheless, we show that when (approximate) full efficiency can be guaranteed via a smoothness argument à la Roughgarden, Nash equilibria are approachable under a family of no-regret learning algorithms, thereby enabling fast and d… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: To appear at NeurIPS 2023

  5. arXiv:2306.05221  [pdf, other

    cs.GT

    Steering No-Regret Learners to a Desired Equilibrium

    Authors: Brian Hu Zhang, Gabriele Farina, Ioannis Anagnostides, Federico Cacciamani, Stephen Marcus McAleer, Andreas Alexander Haupt, Andrea Celli, Nicola Gatti, Vincent Conitzer, Tuomas Sandholm

    Abstract: A mediator observes no-regret learners playing an extensive-form game repeatedly across $T$ rounds. The mediator attempts to steer players toward some desirable predetermined equilibrium by giving (nonnegative) payments to players. We call this the steering problem. The steering problem captures problems several problems of interest, among them equilibrium selection and information design (persuas… ▽ More

    Submitted 17 February, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

  6. arXiv:2306.05216  [pdf, ps, other

    cs.GT

    Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games

    Authors: Brian Hu Zhang, Gabriele Farina, Ioannis Anagnostides, Federico Cacciamani, Stephen Marcus McAleer, Andreas Alexander Haupt, Andrea Celli, Nicola Gatti, Vincent Conitzer, Tuomas Sandholm

    Abstract: We introduce a new approach for computing optimal equilibria via learning in games. It applies to extensive-form settings with any number of players, including mechanism design, information design, and solution concepts such as correlated, communication, and certification equilibria. We observe that optimal equilibria are minimax equilibrium strategies of a player in an extensive-form zero-sum gam… ▽ More

    Submitted 23 May, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

  7. arXiv:2301.11241  [pdf, other

    cs.LG cs.GT

    On the Convergence of No-Regret Learning Dynamics in Time-Varying Games

    Authors: Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina, Tuomas Sandholm

    Abstract: Most of the literature on learning in games has focused on the restrictive setting where the underlying repeated game does not change over time. Much less is known about the convergence of no-regret learning algorithms in dynamic multiagent settings. In this paper, we characterize the convergence of optimistic gradient descent (OGD) in time-varying games. Our framework yields sharp convergence bou… ▽ More

    Submitted 18 October, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: To appear at NeurIPS 2023; V3 incorporates reviewers' feedback and minor corrections

  8. arXiv:2301.02129  [pdf, ps, other

    cs.GT cs.CC cs.DS

    Algorithms and Complexity for Computing Nash Equilibria in Adversarial Team Games

    Authors: Ioannis Anagnostides, Fivos Kalogiannis, Ioannis Panageas, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Stephen McAleer

    Abstract: Adversarial team games model multiplayer strategic interactions in which a team of identically-interested players is competing against an adversarial player in a zero-sum game. Such games capture many well-studied settings in game theory, such as congestion games, but go well-beyond to environments wherein the cooperation of one team -- in the absence of explicit communication -- is obstructed by… ▽ More

    Submitted 30 May, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: To appear at the conference on Economics and Computation (EC) 2023

  9. arXiv:2209.14110  [pdf, other

    cs.GT

    Meta-Learning in Games

    Authors: Keegan Harris, Ioannis Anagnostides, Gabriele Farina, Mikhail Khodak, Zhiwei Steven Wu, Tuomas Sandholm

    Abstract: In the literature on game-theoretic equilibrium finding, focus has mainly been on solving a single game in isolation. In practice, however, strategic interactions -- ranging from routing problems to online advertising auctions -- evolve dynamically, thereby leading to many similar games to be solved. To address this gap, we introduce meta-learning for equilibrium finding and learning to play games… ▽ More

    Submitted 1 March, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: In the eleventh Conference on Learning Representations (ICLR 2023)

  10. arXiv:2208.11787  [pdf, other

    cs.GT

    Sampling and Optimal Preference Elicitation in Simple Mechanisms

    Authors: Ioannis Anagnostides, Dimitris Fotakis, Panagiotis Patsilinakos

    Abstract: In this work we are concerned with the design of efficient mechanisms while eliciting limited information from the agents. First, we study the performance of sampling approximations in facility location games. Our key result is to show that for any $ε> 0$, a sample of size $c(ε) = Θ(1/ε^2)$ yields in expectation a $1 + ε$ approximation with respect to the optimal social cost of the generalized med… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: Preliminary version appeared at SAGT 2020

  11. arXiv:2208.09747  [pdf, ps, other

    cs.GT cs.LG

    Near-Optimal $Φ$-Regret Learning in Extensive-Form Games

    Authors: Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm

    Abstract: In this paper, we establish efficient and uncoupled learning dynamics so that, when employed by all players in multiplayer perfect-recall imperfect-information extensive-form games, the trigger regret of each player grows as $O(\log T)$ after $T$ repetitions of play. This improves exponentially over the prior best known trigger-regret bound of $O(T^{1/4})$, and settles a recent open question by Ba… ▽ More

    Submitted 19 September, 2023; v1 submitted 20 August, 2022; originally announced August 2022.

    Comments: Appearing at ICML 2023. V3 corrects a statement

  12. arXiv:2208.02204  [pdf, ps, other

    cs.GT cs.LG cs.MA

    Efficiently Computing Nash Equilibria in Adversarial Team Markov Games

    Authors: Fivos Kalogiannis, Ioannis Anagnostides, Ioannis Panageas, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Vaggos Chatziafratis, Stelios Stavroulakis

    Abstract: Computing Nash equilibrium policies is a central problem in multi-agent reinforcement learning that has received extensive attention both in theory and in practice. However, provable guarantees have been thus far either limited to fully competitive or cooperative scenarios or impose strong assumptions that are difficult to meet in most practical applications. In this work, we depart from those pri… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

  13. arXiv:2206.08742  [pdf, other

    cs.GT cs.LG

    Near-Optimal No-Regret Learning Dynamics for General Convex Games

    Authors: Gabriele Farina, Ioannis Anagnostides, Haipeng Luo, Chung-Wei Lee, Christian Kroer, Tuomas Sandholm

    Abstract: A recent line of work has established uncoupled learning dynamics such that, when employed by all players in a game, each player's \emph{regret} after $T$ repetitions grows polylogarithmically in $T$, an exponential improvement over the traditional guarantees within the no-regret framework. However, so far these results have only been limited to certain classes of games with structured strategy sp… ▽ More

    Submitted 16 October, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: To appear at NeurIPS 2022. V2 incorporates reviewers' feedback

  14. arXiv:2204.11417  [pdf, other

    cs.GT cs.LG

    Uncoupled Learning Dynamics with $O(\log T)$ Swap Regret in Multiplayer Games

    Authors: Ioannis Anagnostides, Gabriele Farina, Christian Kroer, Chung-Wei Lee, Haipeng Luo, Tuomas Sandholm

    Abstract: In this paper we establish efficient and \emph{uncoupled} learning dynamics so that, when employed by all players in a general-sum multiplayer game, the \emph{swap regret} of each player after $T$ repetitions of the game is bounded by $O(\log T)$, improving over the prior best bounds of $O(\log^4 (T))$. At the same time, we guarantee optimal $O(\sqrt{T})$ swap regret in the adversarial regime as w… ▽ More

    Submitted 5 October, 2022; v1 submitted 24 April, 2022; originally announced April 2022.

    Comments: To appear at NeurIPS 2022. V2 incorporates reviewers' feedback and minor corrections

  15. arXiv:2203.12074  [pdf, other

    cs.GT

    Optimistic Mirror Descent Either Converges to Nash or to Strong Coarse Correlated Equilibria in Bimatrix Games

    Authors: Ioannis Anagnostides, Gabriele Farina, Ioannis Panageas, Tuomas Sandholm

    Abstract: We show that, for any sufficiently small fixed $ε> 0$, when both players in a general-sum two-player (bimatrix) game employ optimistic mirror descent (OMD) with smooth regularization, learning rate $η= O(ε^2)$ and $T = Ω(\text{poly}(1/ε))$ repetitions, either the dynamics reach an $ε$-approximate Nash equilibrium (NE), or the average correlated distribution of play is an $Ω(\text{poly}(ε))$-strong… ▽ More

    Submitted 6 October, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: To appear at NeurIPS 2022. V2 incorporates reviewers' feedback

  16. arXiv:2203.12056  [pdf, other

    cs.GT

    On Last-Iterate Convergence Beyond Zero-Sum Games

    Authors: Ioannis Anagnostides, Ioannis Panageas, Gabriele Farina, Tuomas Sandholm

    Abstract: Most existing results about \emph{last-iterate convergence} of learning dynamics are limited to two-player zero-sum games, and only apply under rigid assumptions about what dynamics the players follow. In this paper we provide new results and techniques that apply to broader families of games and learning dynamics. First, we use a regret-based analysis to show that in a class of games that include… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

  17. arXiv:2202.05446  [pdf, other

    cs.GT

    Faster No-Regret Learning Dynamics for Extensive-Form Correlated and Coarse Correlated Equilibria

    Authors: Ioannis Anagnostides, Gabriele Farina, Christian Kroer, Andrea Celli, Tuomas Sandholm

    Abstract: A recent emerging trend in the literature on learning in games has been concerned with providing faster learning dynamics for correlated and coarse correlated equilibria in normal-form games. Much less is known about the significantly more challenging setting of extensive-form games, which can capture both sequential and simultaneous moves, as well as imperfect information. In this paper we establ… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: Preliminary parts of this paper will appear at the AAAI-22 Workshop on Reinforcement Learning in Games. This version also contains results from an earlier preprint published by a subset of the authors (arXiv:2109.08138)

  18. Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-Player General-Sum Games

    Authors: Ioannis Anagnostides, Constantinos Daskalakis, Gabriele Farina, Maxwell Fishelson, Noah Golowich, Tuomas Sandholm

    Abstract: Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS`21) showed that if all agents in a multi-player general-sum normal-form game employ Optimistic Multiplicative Weights Update (OMWU), the external regret of every player is $O(\textrm{polylog}(T))$ after $T$ repetitions of the game. We extend their result from external regret to internal regret and swap regret, thereby establishing uncoup… ▽ More

    Submitted 24 January, 2023; v1 submitted 10 November, 2021; originally announced November 2021.

    Comments: Appeared at STOC 2022

  19. arXiv:2109.05151  [pdf, other

    cs.DC cs.DS

    Almost Universally Optimal Distributed Laplacian Solvers via Low-Congestion Shortcuts

    Authors: Ioannis Anagnostides, Christoph Lenzen, Bernhard Haeupler, Goran Zuzic, Themis Gouleakis

    Abstract: In this paper, we refine the (almost) \emph{existentially optimal} distributed Laplacian solver recently developed by Forster, Goranci, Liu, Peng, Sun, and Ye (FOCS `21) into an (almost) \emph{universally optimal} distributed Laplacian solver. Specifically, when the topology is known, we show that any Laplacian system on an $n$-node graph with \emph{shortcut quality} $\text{SQ}(G)$ can be solved… ▽ More

    Submitted 14 May, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

  20. arXiv:2109.02184  [pdf, other

    cs.GT

    Dimensionality, Coordination, and Robustness in Voting

    Authors: Ioannis Anagnostides, Dimitris Fotakis, Panagiotis Patsilinakos

    Abstract: We study the performance of voting mechanisms from a utilitarian standpoint, under the recently introduced framework of metric-distortion, offering new insights along three main lines. First, if $d$ represents the doubling dimension of the metric space, we show that the distortion of STV is $O(d \log \log m)$, where $m$ represents the number of candidates. For doubling metrics this implies an expo… ▽ More

    Submitted 24 March, 2022; v1 submitted 5 September, 2021; originally announced September 2021.

  21. arXiv:2108.01740  [pdf, other

    cs.DC

    Deterministic Distributed Algorithms and Lower Bounds in the Hybrid Model

    Authors: Ioannis Anagnostides, Themis Gouleakis

    Abstract: The $\hybrid$ model was recently introduced by Augustine et al. \cite{DBLP:conf/soda/AugustineHKSS20} in order to characterize from an algorithmic standpoint the capabilities of networks which combine multiple communication modes. Concretely, it is assumed that the standard $\local$ model of distributed computing is enhanced with the feature of all-to-all communication, but with very limited bandw… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

  22. arXiv:2107.02489  [pdf, other

    cs.GT

    Metric-Distortion Bounds under Limited Information

    Authors: Ioannis Anagnostides, Dimitris Fotakis, Panagiotis Patsilinakos

    Abstract: In this work we study the metric distortion problem in voting theory under a limited amount of ordinal information. Our primary contribution is threefold. First, we consider mechanisms which perform a sequence of pairwise comparisons between candidates. We show that a widely-popular deterministic mechanism employed in most knockout phases yields distortion $\mathcal{O}(\log m)$ while eliciting onl… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  23. arXiv:2010.09106  [pdf, other

    stat.ML cs.LG

    Robust Learning under Strong Noise via SQs

    Authors: Ioannis Anagnostides, Themis Gouleakis, Ali Marashian

    Abstract: This work provides several new insights on the robustness of Kearns' statistical query framework against challenging label-noise models. First, we build on a recent result by \cite{DBLP:journals/corr/abs-2006-04787} that showed noise tolerance of distribution-independently evolvable concept classes under Massart noise. Specifically, we extend their characterization to more general noise models, in… ▽ More

    Submitted 18 October, 2020; originally announced October 2020.

  24. arXiv:2010.03211  [pdf, other

    math.OC cs.GT

    A Robust Framework for Analyzing Gradient-Based Dynamics in Bilinear Games

    Authors: Ioannis Anagnostides, Paolo Penna

    Abstract: In this work, we establish a frequency-domain framework for analyzing gradient-based algorithms in linear minimax optimization problems; specifically, our approach is based on the Z-transform, a powerful tool applied in Control Theory and Signal Processing in order to characterize linear discrete-time systems. We employ our framework to obtain the first tight analysis of stability of Optimistic Gr… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  25. arXiv:2010.00109  [pdf, other

    math.OC cs.GT

    Solving Zero-Sum Games through Alternating Projections

    Authors: Ioannis Anagnostides, Paolo Penna

    Abstract: In this work, we establish near-linear and strong convergence for a natural first-order iterative algorithm that simulates Von Neumann's Alternating Projections method in zero-sum games. First, we provide a precise analysis of Optimistic Gradient Descent/Ascent (OGDA) -- an optimistic variant of Gradient Descent/Ascent -- for \emph{unconstrained} bilinear games, extending and strengthening prior r… ▽ More

    Submitted 17 August, 2021; v1 submitted 30 September, 2020; originally announced October 2020.