Skip to main content

Showing 1–19 of 19 results for author: Zhang, B H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15970  [pdf, ps, other

    cs.GT cs.AI cs.CC

    Imperfect-Recall Games: Equilibrium Concepts and Their Complexity

    Authors: Emanuel Tewolde, Brian Hu Zhang, Caspar Oesterheld, Manolis Zampetakis, Tuomas Sandholm, Paul W. Goldberg, Vincent Conitzer

    Abstract: We investigate optimal decision making under imperfect recall, that is, when an agent forgets information it once held before. An example is the absentminded driver game, as well as team games in which the members have limited communication capabilities. In the framework of extensive-form games with imperfect recall, we analyze the computational complexities of finding equilibria in multiplayer se… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Long version of the paper that got accepted to the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI 2024). 35 pages, 10 figures, 1 table

    MSC Class: 91A05; 91A06; 91A10; 91A11; 91A18; 91A35; 91A68; 68T37; 68Q17; 68Q25 ACM Class: I.2; J.4; F.2

  2. arXiv:2406.13116  [pdf, ps, other

    cs.GT

    A Lower Bound on Swap Regret in Extensive-Form Games

    Authors: Constantinos Daskalakis, Gabriele Farina, Noah Golowich, Tuomas Sandholm, Brian Hu Zhang

    Abstract: Recent simultaneous works by Peng and Rubinstein [2024] and Dagan et al. [2024] have demonstrated the existence of a no-swap-regret learning algorithm that can reach $ε$ average swap regret against an adversary in any extensive-form game within $m^{\tilde{\mathcal O}(1/ε)}$ rounds, where $m$ is the number of nodes in the game tree. However, the question of whether a $\mathrm{poly}(m, 1/ε)$-round a… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2405.06797  [pdf, ps, other

    cs.GT

    Exponential Lower Bounds on the Double Oracle Algorithm in Zero-Sum Games

    Authors: Brian Hu Zhang, Tuomas Sandholm

    Abstract: The double oracle algorithm is a popular method of solving games, because it is able to reduce computing equilibria to computing a series of best responses. However, its theoretical properties are not well understood. In this paper, we provide exponential lower bounds on the performance of the double oracle algorithm in both partially-observable stochastic games (POSGs) and extensive-form games (E… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  4. arXiv:2402.09670  [pdf, ps, other

    cs.GT

    Efficient $Φ$-Regret Minimization with Low-Degree Swap Deviations in Extensive-Form Games

    Authors: Brian Hu Zhang, Ioannis Anagnostides, Gabriele Farina, Tuomas Sandholm

    Abstract: Recent breakthrough results by Dagan, Daskalakis, Fishelson and Golowich [2023] and Peng and Rubinstein [2023] established an efficient algorithm attaining at most $ε$ swap regret over extensive-form strategy spaces of dimension $N$ in $N^{\tilde O(1/ε)}$ rounds. On the other extreme, Farina and Pipis [2023] developed an efficient algorithm for minimizing the weaker notion of linear-swap regret in… ▽ More

    Submitted 17 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  5. arXiv:2402.05245  [pdf, ps, other

    cs.GT

    On the Outcome Equivalence of Extensive-Form and Behavioral Correlated Equilibria

    Authors: Brian Hu Zhang, Tuomas Sandholm

    Abstract: We investigate two notions of correlated equilibrium for extensive-form games: extensive-form correlated equilibrium (EFCE) and behavioral correlated equilibrium (BCE). We show that the two are outcome-equivalent, in the sense that every outcome distribution achievable under one notion is achievable under the other. Our result implies, to our knowledge, the first polynomial-time algorithm for comp… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  6. arXiv:2310.15935  [pdf, other

    cs.GT

    Mediator Interpretation and Faster Learning Algorithms for Linear Correlated Equilibria in General Extensive-Form Games

    Authors: Brian Hu Zhang, Gabriele Farina, Tuomas Sandholm

    Abstract: A recent paper by Farina & Pipis (2023) established the existence of uncoupled no-linear-swap regret dynamics with polynomial-time iterations in extensive-form games. The equilibrium points reached by these dynamics, known as linear correlated equilibria, are currently the tightest known relaxation of correlated equilibrium that can be learned in polynomial time in any finite extensive-form game.… ▽ More

    Submitted 15 March, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  7. arXiv:2308.16017  [pdf, ps, other

    cs.GT

    Hidden-Role Games: Equilibrium Concepts and Computation

    Authors: Luca Carminati, Brian Hu Zhang, Gabriele Farina, Nicola Gatti, Tuomas Sandholm

    Abstract: In this paper, we study the class of games known as hidden-role games in which players are assigned privately to teams and are faced with the challenge of recognizing and cooperating with teammates. This model includes both popular recreational games such as the Mafia/Werewolf family and The Resistance (Avalon) and many real-world settings, such as distributed systems where nodes need to work toge… ▽ More

    Submitted 17 February, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

  8. arXiv:2306.05221  [pdf, other

    cs.GT

    Steering No-Regret Learners to a Desired Equilibrium

    Authors: Brian Hu Zhang, Gabriele Farina, Ioannis Anagnostides, Federico Cacciamani, Stephen Marcus McAleer, Andreas Alexander Haupt, Andrea Celli, Nicola Gatti, Vincent Conitzer, Tuomas Sandholm

    Abstract: A mediator observes no-regret learners playing an extensive-form game repeatedly across $T$ rounds. The mediator attempts to steer players toward some desirable predetermined equilibrium by giving (nonnegative) payments to players. We call this the steering problem. The steering problem captures problems several problems of interest, among them equilibrium selection and information design (persuas… ▽ More

    Submitted 17 February, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

  9. arXiv:2306.05216  [pdf, ps, other

    cs.GT

    Computing Optimal Equilibria and Mechanisms via Learning in Zero-Sum Extensive-Form Games

    Authors: Brian Hu Zhang, Gabriele Farina, Ioannis Anagnostides, Federico Cacciamani, Stephen Marcus McAleer, Andreas Alexander Haupt, Andrea Celli, Nicola Gatti, Vincent Conitzer, Tuomas Sandholm

    Abstract: We introduce a new approach for computing optimal equilibria via learning in games. It applies to extensive-form settings with any number of players, including mechanism design, information design, and solution concepts such as correlated, communication, and certification equilibria. We observe that optimal equilibria are minimax equilibrium strategies of a player in an extensive-form zero-sum gam… ▽ More

    Submitted 23 May, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

  10. arXiv:2206.15395  [pdf, other

    cs.GT

    Polynomial-Time Optimal Equilibria with a Mediator in Extensive-Form Games

    Authors: Brian Hu Zhang, Tuomas Sandholm

    Abstract: For common notions of correlated equilibrium in extensive-form games, computing an optimal (e.g., welfare-maximizing) equilibrium is NP-hard. Other equilibrium notions -- communication (Forges 1986) and certification (Forges & Koessler 2005) equilibria -- augment the game with a mediator that has the power to both send and receive messages to and from the players -- and, in particular, to remember… ▽ More

    Submitted 30 November, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022

  11. arXiv:2202.00789  [pdf, other

    cs.GT

    Team Belief DAG: Generalizing the Sequence Form to Team Games for Fast Computation of Correlated Team Max-Min Equilibria via Regret Minimization

    Authors: Brian Hu Zhang, Gabriele Farina, Tuomas Sandholm

    Abstract: A classic result in the theory of extensive-form games asserts that the set of strategies available to any perfect-recall player is strategically equivalent to a low-dimensional convex polytope, called the sequence-form polytope. Online convex optimization tools operating on this polytope are the current state-of-the-art for computing several notions of equilibria in games, and have been crucial i… ▽ More

    Submitted 17 February, 2024; v1 submitted 1 February, 2022; originally announced February 2022.

  12. arXiv:2110.11853  [pdf, ps, other

    cs.DS math.ST

    Polynomial-Time Sum-of-Squares Can Robustly Estimate Mean and Covariance of Gaussians Optimally

    Authors: Pravesh K. Kothari, Peter Manohar, Brian Hu Zhang

    Abstract: In this work, we revisit the problem of estimating the mean and covariance of an unknown $d$-dimensional Gaussian distribution in the presence of an $\varepsilon$-fraction of adversarial outliers. The pioneering work of [DKK+16] gave a polynomial time algorithm for this task with optimal $\tilde{O}(\varepsilon)$ error using $n = \textrm{poly}(d, 1/\varepsilon)$ samples. On the other hand, [KS17b… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

  13. arXiv:2109.05284  [pdf, ps, other

    cs.GT

    Team Correlated Equilibria in Zero-Sum Extensive-Form Games via Tree Decompositions

    Authors: Brian Hu Zhang, Tuomas Sandholm

    Abstract: Despite the many recent practical and theoretical breakthroughs in computational game theory, equilibrium finding in extensive-form team games remains a significant challenge. While NP-hard in the worst case, there are provably efficient algorithms for certain families of team game. In particular, if the game has common external information, also known as A-loss recall -- informally, actions playe… ▽ More

    Submitted 16 January, 2022; v1 submitted 11 September, 2021; originally announced September 2021.

  14. arXiv:2106.06068  [pdf, ps, other

    cs.GT

    Subgame solving without common knowledge

    Authors: Brian Hu Zhang, Tuomas Sandholm

    Abstract: In imperfect-information games, subgame solving is significantly more challenging than in perfect-information games, but in the last few years, such techniques have been developed. They were the key ingredient to the milestone of superhuman play in no-limit Texas hold'em poker. Current subgame-solving techniques analyze the entire common-knowledge closure of the player's current information set, t… ▽ More

    Submitted 2 December, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

  15. arXiv:2009.07384  [pdf, other

    cs.GT

    Finding and Certifying (Near-)Optimal Strategies in Black-Box Extensive-Form Games

    Authors: Brian Hu Zhang, Tuomas Sandholm

    Abstract: Often -- for example in war games, strategy video games, and financial simulations -- the game is given to us only as a black-box simulator in which we can play it. In these settings, since the game may have unknown nature action distributions (from which we can only obtain samples) and/or be too large to expand fully, it can be difficult to compute strategies with guarantees on exploitability. Re… ▽ More

    Submitted 17 March, 2021; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: AAAI 2021

  16. arXiv:2006.16387  [pdf, ps, other

    cs.GT

    Small Nash Equilibrium Certificates in Very Large Games

    Authors: Brian Hu Zhang, Tuomas Sandholm

    Abstract: In many game settings, the game is not explicitly given but is only accessible by playing it. While there have been impressive demonstrations in such settings, prior techniques have not offered safety guarantees, that is, guarantees on the game-theoretic exploitability of the computed strategies. In this paper we introduce an approach that shows that it is possible to provide exploitability guaran… ▽ More

    Submitted 15 December, 2020; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  17. arXiv:2006.03451  [pdf, other

    cs.GT cs.LG

    Sparsified Linear Programming for Zero-Sum Equilibrium Finding

    Authors: Brian Hu Zhang, Tuomas Sandholm

    Abstract: Computational equilibrium finding in large zero-sum extensive-form imperfect-information games has led to significant recent AI breakthroughs. The fastest algorithms for the problem are new forms of counterfactual regret minimization [Brown and Sandholm, 2019]. In this paper we present a totally different approach to the problem, which is competitive and often orders of magnitude better than the p… ▽ More

    Submitted 29 June, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: Accepted for publication at ICML 2020

  18. arXiv:1811.06609  [pdf, other

    cs.LG stat.ML

    A Spectral View of Adversarially Robust Features

    Authors: Shivam Garg, Vatsal Sharan, Brian Hu Zhang, Gregory Valiant

    Abstract: Given the apparent difficulty of learning models that are robust to adversarial perturbations, we propose tackling the simpler problem of develo** adversarially robust features. Specifically, given a dataset and metric of interest, the goal is to return a function (or multiple functions) that 1) is robust to adversarial perturbations, and 2) has significant variation across the datapoints. We es… ▽ More

    Submitted 15 November, 2018; originally announced November 2018.

    Comments: To appear at NIPS 2018

  19. arXiv:1801.07593  [pdf, other

    cs.LG cs.AI cs.CY

    Mitigating Unwanted Biases with Adversarial Learning

    Authors: Brian Hu Zhang, Blake Lemoine, Margaret Mitchell

    Abstract: Machine learning is a tool for building models that accurately represent input training data. When undesired biases concerning demographic groups are in the training data, well-trained models will reflect those biases. We present a framework for mitigating such biases by including a variable for the group of interest and simultaneously learning a predictor and an adversary. The input to the networ… ▽ More

    Submitted 22 January, 2018; originally announced January 2018.