Skip to main content

Showing 1–50 of 53 results for author: Savani, R

.
  1. arXiv:2403.02227  [pdf, other

    cs.GT cs.AI cs.MA

    Policy Space Response Oracles: A Survey

    Authors: Ariyan Bighashdel, Yongzhao Wang, Stephen McAleer, Rahul Savani, Frans A. Oliehoek

    Abstract: Game theory provides a mathematical way to study the interaction between multiple decision makers. However, classical game-theoretic analysis is limited in scalability due to the large number of strategies, precluding direct application to more complex scenarios. This survey provides a comprehensive overview of a framework for large games, known as Policy Space Response Oracles (PSRO), which holds… ▽ More

    Submitted 27 May, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Ariyan Bighashdel and Yongzhao Wang contributed equally

    Journal ref: The 33rd International Joint Conference on Artificial Intelligence, 2024

  2. arXiv:2402.07683  [pdf, other

    cs.CC cs.CG math.OC

    Two Choices are Enough for P-LCPs, USOs, and Colorful Tangents

    Authors: Michaela Borzechowski, John Fearnley, Spencer Gordon, Rahul Savani, Patrick Schnider, Simon Weber

    Abstract: We provide polynomial-time reductions between three search problems from three distinct areas: the P-matrix linear complementarity problem (P-LCP), finding the sink of a unique sink orientation (USO), and a variant of the $α$-Ham Sandwich problem. For all three settings, we show that "two choices are enough", meaning that the general non-binary version of the problem can be reduced in polynomial t… ▽ More

    Submitted 21 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 29 pages, 9 figures

  3. arXiv:2311.13738  [pdf, other

    cs.CC math.OC

    The Complexity of Computing KKT Solutions of Quadratic Programs

    Authors: John Fearnley, Paul W. Goldberg, Alexandros Hollender, Rahul Savani

    Abstract: It is well known that solving a (non-convex) quadratic program is NP-hard. We show that the problem remains hard even if we are only looking for a Karush-Kuhn-Tucker (KKT) point, instead of a global optimum. Namely, we prove that computing a KKT point of a quadratic polynomial over the domain $[0,1]^n$ is complete for the class CLS = PPAD$\cap$PLS.

    Submitted 22 November, 2023; originally announced November 2023.

  4. arXiv:2306.12806  [pdf, other

    q-fin.TR cs.AI cs.CE cs.LG

    Conditional Generators for Limit Order Book Environments: Explainability, Challenges, and Robustness

    Authors: Andrea Coletta, Joseph Jerome, Rahul Savani, Svitlana Vyetrenko

    Abstract: Limit order books are a fundamental and widespread market mechanism. This paper investigates the use of conditional generative models for order book simulation. For develo** a trading agent, this approach has drawn recent attention as an alternative to traditional backtesting due to its ability to react to the presence of the trading agent. Using a state-of-the-art CGAN (from Coletta et al. (202… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

  5. arXiv:2306.05366  [pdf, other

    cs.GT cs.LG

    Ordinal Potential-based Player Rating

    Authors: Nelson Vadori, Rahul Savani

    Abstract: It was recently observed that Elo ratings fail at preserving transitive relations among strategies and therefore cannot correctly extract the transitive component of a game. We provide a characterization of transitive games as a weak variant of ordinal potential games and show that Elo ratings actually do preserve transitivity when computed in the right space, using suitable invertible map**s. L… ▽ More

    Submitted 6 March, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

  6. arXiv:2301.12941  [pdf, other

    math.OC cs.CE

    First Order Methods for Geometric Optimization of Crystal Structures

    Authors: Antonia Tsili, Matthew Dyer, Vladimir Gusev, Piotr Krysta, Rahul Savani

    Abstract: The geometric optimization of crystal structures is a procedure widely used in Chemistry that changes the geometrical placement of the particles inside a structure. It is called structural relaxation and constitutes a local minimization problem with a non-convex objective function whose domain complexity increases according to the number of particles involved. In this work we study the performance… ▽ More

    Submitted 22 May, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  7. arXiv:2209.07823  [pdf, other

    q-fin.TR cs.AI cs.LG q-fin.CP q-fin.MF

    Model-based gym environments for limit order book trading

    Authors: Joseph Jerome, Leandro Sanchez-Betancourt, Rahul Savani, Martin Herdegen

    Abstract: Within the mathematical finance literature there is a rich catalogue of mathematical models for studying algorithmic trading problems -- such as market-making and optimal execution -- in limit order books. This paper introduces \mbtgym, a Python module that provides a suite of gym environments for training reinforcement learning (RL) agents to solve such model-based trading problems. The module is… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

  8. arXiv:2207.03352  [pdf, other

    q-fin.TR cs.AI cs.LG

    Market Making with Scaled Beta Policies

    Authors: Joseph Jerome, Gregory Palmer, Rahul Savani

    Abstract: This paper introduces a new representation for the actions of a market maker in an order-driven market. This representation uses scaled beta distributions, and generalises three approaches taken in the artificial intelligence for market making literature: single price-level selection, ladder strategies and "market making at the touch". Ladder strategies place uniform volume across an interval of c… ▽ More

    Submitted 27 September, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

  9. Trading via Selective Classification

    Authors: Nestoras Chalkidis, Rahul Savani

    Abstract: A binary classifier that tries to predict if the price of an asset will increase or decrease naturally gives rise to a trading strategy that follows the prediction and thus always has a position in the market. Selective classification extends a binary or many-class classifier to allow it to abstain from making a prediction for certain inputs, thereby allowing a trade-off between the accuracy of th… ▽ More

    Submitted 31 October, 2021; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: (8 pages, 6 figures, 4 tables, ICAIF'21)

  10. arXiv:2106.02615  [pdf, other

    cs.GT cs.LG

    Consensus Multiplicative Weights Update: Learning to Learn using Projector-based Game Signatures

    Authors: Nelson Vadori, Rahul Savani, Thomas Spooner, Sumitra Ganesh

    Abstract: Cheung and Piliouras (2020) recently showed that two variants of the Multiplicative Weights Update method - OMWU and MWU - display opposite convergence properties depending on whether the game is zero-sum or cooperative. Inspired by this work and the recent literature on learning to optimize for single functions, we introduce a new framework for learning last-iterate convergence to Nash Equilibria… ▽ More

    Submitted 11 June, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: ICML 2022, the 39th International Conference on Machine Learning

  11. arXiv:2106.01285  [pdf, other

    cs.GT

    Sample-based Approximation of Nash in Large Many-Player Games via Gradient Descent

    Authors: Ian Gemp, Rahul Savani, Marc Lanctot, Yoram Bachrach, Thomas Anthony, Richard Everett, Andrea Tacchetti, Tom Eccles, János Kramár

    Abstract: Nash equilibrium is a central concept in game theory. Several Nash solvers exist, yet none scale to normal-form games with many actions and many players, especially those with payoff tensors too big to be stored in memory. In this work, we propose an approach that iteratively improves an approximation to a Nash equilibrium through joint play. It accomplishes this by tracing a previously establishe… ▽ More

    Submitted 4 February, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: Published in AAMAS 2022 (code available as part of open_spiel on github -- search ADIDAS in repo)

  12. Difference Rewards Policy Gradients

    Authors: Jacopo Castellini, Sam Devlin, Frans A. Oliehoek, Rahul Savani

    Abstract: Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however, that is not addressed by many of these methods is multi-agent credit assignment: assessing an agent's contribution to the overall performance, which is crucial for learning good policies. We propose a novel algorithm called Dr.Reinforce that explicitly… ▽ More

    Submitted 9 November, 2023; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: This work as been accepted as an Extended Abstract in Proc. of the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021), U. Endriss, A. Nowé, F. Dignum, A. Lomuscio (eds.), May 3-7 2021, Online

    ACM Class: I.2.6; I.2.11

    Journal ref: Neural Comput & Applic (2022)

  13. arXiv:2011.01929  [pdf, other

    cs.CC cs.LG math.OC

    The Complexity of Gradient Descent: CLS = PPAD $\cap$ PLS

    Authors: John Fearnley, Paul W. Goldberg, Alexandros Hollender, Rahul Savani

    Abstract: We study search problems that can be solved by performing Gradient Descent on a bounded convex polytopal domain and show that this class is equal to the intersection of two well-known classes: PPAD and PLS. As our main underlying technical contribution, we show that computing a Karush-Kuhn-Tucker (KKT) point of a continuously differentiable function over the domain $[0,1]^2$ is PPAD $\cap$ PLS-com… ▽ More

    Submitted 3 March, 2023; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: Journal version

    Journal ref: Journal of the ACM, 70(1):Article 7 (2022)

  14. arXiv:2010.02618  [pdf, other

    cs.DS cs.GT

    A faster algorithm for finding Tarski fixed points

    Authors: John Fearnley, Dömötör Pálvölgyi, Rahul Savani

    Abstract: Dang et al. have given an algorithm that can find a Tarski fixed point in a $k$-dimensional lattice of width $n$ using $O(\log^{k} n)$ queries. Multiple authors have conjectured that this algorithm is optimal [Dang et al., Etessami et al.], and indeed this has been proven for two-dimensional instances [Etessami et al.]. We show that these conjectures are false in dimension three or higher by givin… ▽ More

    Submitted 20 March, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

  15. arXiv:2007.04611  [pdf, other

    cs.CY cs.LG

    A deep learning approach to identify unhealthy advertisements in street view images

    Authors: Gregory Palmer, Mark Green, Emma Boyland, Yales Stefano Rios Vasconcelos, Rahul Savani, Alex Singleton

    Abstract: While outdoor advertisements are common features within towns and cities, they may reinforce social inequalities in health. Vulnerable populations in deprived areas may have greater exposure to fast food, gambling and alcohol advertisements encouraging their consumption. Understanding who is exposed and evaluating potential policy restrictions requires a substantial manual data collection effort.… ▽ More

    Submitted 7 February, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: 13 pages, 5 figures, 3 table. To appear in Nature Scientific Reports

  16. arXiv:2007.04203  [pdf, other

    cs.LG cs.AI q-fin.CP q-fin.PM stat.ML

    A Natural Actor-Critic Algorithm with Downside Risk Constraints

    Authors: Thomas Spooner, Rahul Savani

    Abstract: Existing work on risk-sensitive reinforcement learning - both for symmetric and downside risk measures - has typically used direct Monte-Carlo estimation of policy gradients. While this approach yields unbiased gradient estimates, it also suffers from high variance and decreased sample efficiency compared to temporal-difference methods. In this paper, we study prediction and control with aversion… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    Comments: 14 pages, 5 figures

  17. arXiv:2003.01820  [pdf, other

    q-fin.TR cs.AI cs.LG stat.ML

    Robust Market Making via Adversarial Reinforcement Learning

    Authors: Thomas Spooner, Rahul Savani

    Abstract: We show that adversarial reinforcement learning (ARL) can be used to produce market marking agents that are robust to adversarial and adaptively-chosen market conditions. To apply ARL, we turn the well-studied single-agent model of Avellaneda and Stoikov [2008] into a discrete-time zero-sum game between a market maker and adversary. The adversary acts as a proxy for other market participants that… ▽ More

    Submitted 8 July, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 7 pages, 3 figures; IJCAI-PRICAI '20 Conference Proceedings

  18. arXiv:2002.12119  [pdf, other

    cs.GT cs.CC

    Tree Polymatrix Games are PPAD-hard

    Authors: Argyrios Deligkas, John Fearnley, Rahul Savani

    Abstract: We prove that it is PPAD-hard to compute a Nash equilibrium in a tree polymatrix game with twenty actions per player. This is the first PPAD hardness result for a game with a constant number of actions per player where the interaction graph is acyclic. Along the way we show PPAD-hardness for finding an $ε$-fixed point of a 2D LinearFIXP instance, when $ε$ is any constant less than… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

  19. arXiv:2002.09406  [pdf, other

    cs.CV

    The Automated Inspection of Opaque Liquid Vaccines

    Authors: Gregory Palmer, Benjamin Schnieders, Rahul Savani, Karl Tuyls, Joscha-David Fossel, Harry Flore

    Abstract: In the pharmaceutical industry the screening of opaque vaccines containing suspensions is currently a manual task carried out by trained human visual inspectors. We show that deep learning can be used to effectively automate this process. A moving contrast is required to distinguish anomalies from other particles, reflections and dust resting on a vial's surface. We train 3D-ConvNets to predict th… ▽ More

    Submitted 21 February, 2020; originally announced February 2020.

    Comments: 8 pages, 5 Figures, 3 Tables, ECAI 2020 Conference Proceedings

  20. arXiv:2001.04458  [pdf, ps, other

    cs.GT cs.FL cs.LO

    One-Clock Priced Timed Games are PSPACE-hard

    Authors: John Fearnley, Rasmus Ibsen-Jensen, Rahul Savani

    Abstract: The main result of this paper is that computing the value of a one-clock priced timed game (OCPTG) is PSPACE-hard. Along the way, we provide a family of OCPTGs that have an exponential number of event points. Both results hold even in very restricted classes of games such as DAGs with treewidth three. Finally, we provide a number of positive results, including polynomial-time algorithms for even m… ▽ More

    Submitted 4 March, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

  21. arXiv:1904.06239  [pdf, other

    cs.NE

    Evolving Indoor Navigational Strategies Using Gated Recurrent Units In NEAT

    Authors: James Butterworth, Rahul Savani, Karl Tuyls

    Abstract: Simultaneous Localisation and Map** (SLAM) algorithms are expensive to run on smaller robotic platforms such as Micro-Aerial Vehicles. Bug algorithms are an alternative that use relatively little processing power, and avoid high memory consumption by not building an explicit map of the environment. Bug Algorithms achieve relatively good performance in simulated and robotic maze solving domains.… ▽ More

    Submitted 12 April, 2019; originally announced April 2019.

  22. Analysing Factorizations of Action-Value Networks for Cooperative Multi-Agent Reinforcement Learning

    Authors: Jacopo Castellini, Frans A. Oliehoek, Rahul Savani, Shimon Whiteson

    Abstract: Recent years have seen the application of deep reinforcement learning techniques to cooperative multi-agent systems, with great empirical success. However, given the lack of theoretical insight, it remains unclear what the employed neural networks are learning, or how we should enhance their learning power to address the problems on which they fail. In this work, we empirically investigate the lea… ▽ More

    Submitted 9 November, 2023; v1 submitted 20 February, 2019; originally announced February 2019.

    Comments: This work as been accepted as an Extended Abstract in Proc. of the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019), N. Agmon, M. E. Taylor, E. Elkind, M. Veloso (eds.), May 2019, Montreal, Canada

    ACM Class: I.2.6; I.2.11

    Journal ref: Auton Agent Multi-Agent Syst 35, 25 (2021)

  23. arXiv:1811.03841  [pdf, ps, other

    cs.CC cs.DS

    Unique End of Potential Line

    Authors: John Fearnley, Spencer Gordon, Ruta Mehta, Rahul Savani

    Abstract: This paper studies the complexity of problems in PPAD $\cap$ PLS that have unique solutions. Three well-known examples of such problems are the problem of finding a fixpoint of a contraction map, finding the unique sink of a Unique Sink Orientation (USO), and solving the P-matrix Linear Complementarity Problem (P-LCP). Each of these are promise-problems, and when the promise holds, they always pos… ▽ More

    Submitted 9 November, 2018; originally announced November 2018.

    Comments: This paper substantially revises and extends the work described in our previous preprint "End of Potential Line'' (arXiv:1804.03450). The abstract has been shortened to meet the arXiv character limit

  24. arXiv:1809.05096  [pdf, other

    cs.MA cs.AI cs.LG

    Negative Update Intervals in Deep Multi-Agent Reinforcement Learning

    Authors: Gregory Palmer, Rahul Savani, Karl Tuyls

    Abstract: In Multi-Agent Reinforcement Learning (MA-RL), independent cooperative learners must overcome a number of pathologies to learn optimal joint policies. Addressing one pathology often leaves approaches vulnerable towards others. For instance, hysteretic Q-learning addresses miscoordination while leaving agents vulnerable towards misleading stochastic rewards. Other methods, such as leniency, have pr… ▽ More

    Submitted 7 May, 2019; v1 submitted 13 September, 2018; originally announced September 2018.

    Comments: 11 Pages, 6 Figures, AAMAS2019 Conference Proceedings

  25. arXiv:1806.07268  [pdf, other

    cs.LG cs.GT stat.ML

    Beyond Local Nash Equilibria for Adversarial Networks

    Authors: Frans A. Oliehoek, Rahul Savani, Jose Gallego, Elise van der Pol, Roderich Groß

    Abstract: Save for some special cases, current training methods for Generative Adversarial Networks (GANs) are at best guaranteed to converge to a `local Nash equilibrium` (LNE). Such LNEs, however, can be arbitrarily far from an actual Nash equilibrium (NE), which implies that there are no guarantees on the quality of the found generator or classifier. This paper proposes to model GANs explicitly as finite… ▽ More

    Submitted 26 July, 2018; v1 submitted 18 June, 2018; originally announced June 2018.

    Comments: Supersedes arXiv:1712.00679; v2 includes Fictitious GAN in the related work and refers to Danskin (1981)

    Journal ref: Published in Benelearn/BANIC 2018

  26. arXiv:1804.04216  [pdf, other

    cs.AI q-fin.TR

    Market Making via Reinforcement Learning

    Authors: Thomas Spooner, John Fearnley, Rahul Savani, Andreas Koukorinis

    Abstract: Market making is a fundamental trading problem in which an agent provides liquidity by continually offering to buy and sell a security. The problem is challenging due to inventory risk, the risk of accumulating an unfavourable position and ultimately losing money. In this paper, we develop a high-fidelity simulation of limit order book markets, and use it to design a market making agent using temp… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: 10 pages, 5 figures, AAMAS2018 Conference Proceedings

  27. arXiv:1804.03450  [pdf, ps, other

    cs.CC cs.GT

    End of Potential Line

    Authors: John Fearnley, Spencer Gordon, Ruta Mehta, Rahul Savani

    Abstract: We introduce the problem EndOfPotentialLine and the corresponding complexity class EOPL of all problems that can be reduced to it in polynomial time. This class captures problems that admit a single combinatorial proof of their joint membership in the complexity classes PPAD of fixpoint problems and PLS of local search problems. EOPL is a combinatorially-defined alternative to the class CLS (for C… ▽ More

    Submitted 18 April, 2018; v1 submitted 10 April, 2018; originally announced April 2018.

    Comments: v2 includes runtimes for P-LCP algorithms based on USOs in related work

  28. arXiv:1712.00679  [pdf, other

    stat.ML cs.GT cs.LG

    GANGs: Generative Adversarial Network Games

    Authors: Frans A. Oliehoek, Rahul Savani, Jose Gallego-Posada, Elise van der Pol, Edwin D. de Jong, Roderich Gross

    Abstract: Generative Adversarial Networks (GAN) have become one of the most successful frameworks for unsupervised generative modeling. As GANs are difficult to train much research has focused on this. However, very little of this research has directly exploited game-theoretic techniques. We introduce Generative Adversarial Network Games (GANGs), which explicitly model a finite zero-sum game between a gener… ▽ More

    Submitted 17 December, 2017; v1 submitted 2 December, 2017; originally announced December 2017.

    Comments: 9 pages, 5 figures

  29. arXiv:1711.05074  [pdf, other

    cs.GT cs.MA

    Symmetric Decomposition of Asymmetric Games

    Authors: Karl Tuyls, Julien Perolat, Marc Lanctot, Georg Ostrovski, Rahul Savani, Joel Leibo, Toby Ord, Thore Graepel, Shane Legg

    Abstract: We introduce new theoretical insights into two-population asymmetric games allowing for an elegant symmetric decomposition into two single population symmetric games. Specifically, we show how an asymmetric bimatrix game (A,B) can be decomposed into its symmetric counterparts by envisioning and investigating the payoff tables (A and B) that constitute the asymmetric game, as two independent, singl… ▽ More

    Submitted 17 January, 2018; v1 submitted 14 November, 2017; originally announced November 2017.

    Comments: Paper is published in Scientific Reports; https://www.nature.com/articles/s41598-018-19194-4, 2018

  30. Reachability Switching Games

    Authors: John Fearnley, Martin Gairing, Matthias Mnich, Rahul Savani

    Abstract: We study the problem of deciding the winner of reachability switching games for zero-, one-, and two-player variants. Switching games provide a deterministic analogue of stochastic games. We show that the zero-player case is NL-hard, the one-player case is NP-complete, and that the two-player case is PSPACE-hard and in EXPTIME. For the zero-player case, we also show P-hardness for a succinctly-rep… ▽ More

    Submitted 21 April, 2021; v1 submitted 26 September, 2017; originally announced September 2017.

    Journal ref: Logical Methods in Computer Science, Volume 17, Issue 2 (April 22, 2021) lmcs:5425

  31. arXiv:1707.04402  [pdf, other

    cs.MA cs.AI cs.LG

    Lenient Multi-Agent Deep Reinforcement Learning

    Authors: Gregory Palmer, Karl Tuyls, Daan Bloembergen, Rahul Savani

    Abstract: Much of the success of single agent deep reinforcement learning (DRL) in recent years can be attributed to the use of experience replay memories (ERM), which allow Deep Q-Networks (DQNs) to be trained efficiently through sampling stored state transitions. However, care is required when using ERMs for multi-agent deep reinforcement learning (MA-DRL), as stored transitions can become outdated becaus… ▽ More

    Submitted 27 February, 2018; v1 submitted 14 July, 2017; originally announced July 2017.

    Comments: 9 pages, 6 figures, AAMAS2018 Conference Proceedings

  32. arXiv:1705.02266  [pdf, ps, other

    cs.GT

    Computing Constrained Approximate Equilibria in Polymatrix Games

    Authors: Argyrios Deligkas, John Fearnley, Rahul Savani

    Abstract: This paper is about computing constrained approximate Nash equilibria in polymatrix games, which are succinctly represented many-player games defined by an interaction graph between the players. In a recent breakthrough, Rubinstein showed that there exists a small constant $ε$, such that it is PPAD-complete to find an (unconstrained) $ε$-Nash equilibrium of a polymatrix game. In the first part of… ▽ More

    Submitted 8 May, 2017; v1 submitted 5 May, 2017; originally announced May 2017.

  33. arXiv:1704.06549  [pdf, other

    cs.CY

    LiftUpp: Support to develop learner performance

    Authors: Frans A. Oliehoek, Rahul Savani, Elliot Adderton, Xia Cui, David Jackson, Phil Jimmieson, John Christopher Jones, Keith Kennedy, Ben Mason, Adam Plumbley, Luke Dawson

    Abstract: Various motivations exist to move away from the simple assessment of knowledge towards the more complex assessment and development of competence. However, to accommodate such a change, high demands are put on the supporting e-infrastructure in terms of intelligently collecting and analysing data. In this paper, we discuss these challenges and how they are being addressed by LiftUpp, a system that… ▽ More

    Submitted 21 April, 2017; originally announced April 2017.

    Comments: Short 4-page version to appear at AIED 2017

  34. arXiv:1702.06017  [pdf, other

    cs.CC

    CLS: New Problems and Completeness

    Authors: John Fearnley, Spencer Gordon, Ruta Mehta, Rahul Savani

    Abstract: The complexity class CLS was introduced by Daskalakis and Papadimitriou with the goal of capturing the complexity of some well-known problems in PPAD$~\cap~$PLS that have resisted, in some cases for decades, attempts to put them in polynomial time. No complete problem was known for CLS, and in previous work, the problems ContractionMap, i.e., the problem of finding an approximate fixpoint of a con… ▽ More

    Submitted 7 April, 2017; v1 submitted 20 February, 2017; originally announced February 2017.

  35. arXiv:1608.03574  [pdf, ps, other

    cs.GT

    Inapproximability Results for Approximate Nash Equilibria

    Authors: Argyrios Deligkas, John Fearnley, Rahul Savani

    Abstract: We study the problem of finding approximate Nash equilibria that satisfy certain conditions, such as providing good social welfare. In particular, we study the problem $ε$-NE $δ$-SW: find an $ε$-approximate Nash equilibrium ($ε$-NE) that is within $δ$ of the best social welfare achievable by an $ε$-NE. Our main result is that, if the exponential-time hypothesis (ETH) is true, then solving… ▽ More

    Submitted 25 April, 2017; v1 submitted 11 August, 2016; originally announced August 2016.

    Comments: A short (14-page) version of this paper appeared at WINE 2016. Compared to that conference version, this new version improves the conditional lower bounds, which now rely on ETH rather than RETH (Randomized ETH)

  36. arXiv:1602.06865  [pdf, other

    cs.GT

    An Empirical Study on Computing Equilibria in Polymatrix Games

    Authors: Argyrios Deligkas, John Fearnley, Tobenna Peter Igwe, Rahul Savani

    Abstract: The Nash equilibrium is an important benchmark for behaviour in systems of strategic autonomous agents. Polymatrix games are a succinct and expressive representation of multiplayer games that model pairwise interactions between players. The empirical performance of algorithms to solve these games has received little attention, despite their wide-ranging applications. In this paper we carry out a c… ▽ More

    Submitted 16 March, 2016; v1 submitted 22 February, 2016; originally announced February 2016.

    Comments: To appear at AAMAS 2016 (without the appendix)

  37. Distributed Methods for Computing Approximate Equilibria

    Authors: Artur Czumaj, Argyrios Deligkas, Michail Fasoulakis, John Fearnley, Marcin Jurdziński, Rahul Savani

    Abstract: We present a new, distributed method to compute approximate Nash equilibria in bimatrix games. In contrast to previous approaches that analyze the two payoff matrices at the same time (for example, by solving a single LP that combines the two players payoffs), our algorithm first solves two independent LPs, each of which is derived from one of the two payoff matrices, and then compute approximate… ▽ More

    Submitted 10 December, 2015; originally announced December 2015.

  38. arXiv:1509.05322  [pdf, other

    cs.GT

    Computing stable outcomes in symmetric additively-separable hedonic games

    Authors: Martin Gairing, Rahul Savani

    Abstract: We study the computational complexity of finding stable outcomes in hedonic games, which are a class of coalition formation games. We restrict our attention to symmetric additively-separable hedonic games, which are a nontrivial subclass of such games that are guaranteed to possess stable outcomes. These games are specified by an undirected edge- weighted graph: nodes are players, an outcome of th… ▽ More

    Submitted 17 September, 2015; originally announced September 2015.

    Comments: Combines a SAGT 2010 paper and a AAMAS 2011 paper by the same authors

  39. arXiv:1507.04500  [pdf, other

    cs.DS cs.CC cs.GT cs.LO

    The Complexity of All-switches Strategy Improvement

    Authors: John Fearnley, Rahul Savani

    Abstract: Strategy improvement is a widely-used and well-studied class of algorithms for solving graph-based infinite games. These algorithms are parameterized by a switching rule, and one of the most natural rules is "all switches" which switches as many edges as possible in each iteration. Continuing a recent line of work, we study all-switches strategy improvement from the perspective of computational co… ▽ More

    Submitted 29 October, 2018; v1 submitted 16 July, 2015; originally announced July 2015.

    Journal ref: Logical Methods in Computer Science, Volume 14, Issue 4 (October 31, 2018) lmcs:3794

  40. arXiv:1502.04980  [pdf, other

    cs.GT

    An Empirical Study of Finding Approximate Equilibria in Bimatrix Games

    Authors: John Fearnley, Tobenna Peter Igwe, Rahul Savani

    Abstract: While there have been a number of studies about the efficacy of methods to find exact Nash equilibria in bimatrix games, there has been little empirical work on finding approximate Nash equilibria. Here we provide such a study that compares a number of approximation methods and exact methods. In particular, we explore the trade-off between the quality of approximate equilibrium and the required ru… ▽ More

    Submitted 9 April, 2015; v1 submitted 17 February, 2015; originally announced February 2015.

  41. Unit Vector Games

    Authors: Rahul Savani, Bernhard von Stengel

    Abstract: McLennan and Tourky (2010) showed that "imitation games" provide a new view of the computation of Nash equilibria of bimatrix games with the Lemke-Howson algorithm. In an imitation game, the payoff matrix of one of the players is the identity matrix. We study the more general "unit vector games", which are already known, where the payoff matrix of one player is composed of unit vectors. Our main a… ▽ More

    Submitted 14 February, 2016; v1 submitted 9 January, 2015; originally announced January 2015.

    Comments: final version as published in IJET

    MSC Class: 91A05

    Journal ref: International Journal of Economic Theory 12 (2016), 7-27

  42. arXiv:1409.3741  [pdf, ps, other

    cs.GT

    Computing Approximate Nash Equilibria in Polymatrix Games

    Authors: Argyrios Deligkas, John Fearnley, Rahul Savani, Paul Spirakis

    Abstract: In an $ε$-Nash equilibrium, a player can gain at most $ε$ by unilaterally changing his behaviour. For two-player (bimatrix) games with payoffs in $[0,1]$, the best-known$ε$ achievable in polynomial time is 0.3393. In general, for $n$-player games an $ε$-Nash equilibrium can be computed in polynomial time for an $ε$ that is an increasing function of $n$ but does not depend on the number of strategi… ▽ More

    Submitted 1 October, 2014; v1 submitted 12 September, 2014; originally announced September 2014.

  43. arXiv:1404.0605  [pdf, ps, other

    cs.DS

    The Complexity of the Simplex Method

    Authors: John Fearnley, Rahul Savani

    Abstract: The simplex method is a well-studied and widely-used pivoting method for solving linear programs. When Dantzig originally formulated the simplex method, he gave a natural pivot rule that pivots into the basis a variable with the most violated reduced cost. In their seminal work, Klee and Minty showed that this pivot rule takes exponential time in the worst case. We prove two main results on the si… ▽ More

    Submitted 17 April, 2014; v1 submitted 2 April, 2014; originally announced April 2014.

  44. Game Theory Explorer - Software for the Applied Game Theorist

    Authors: Rahul Savani, Bernhard von Stengel

    Abstract: This paper presents the "Game Theory Explorer" software tool to create and analyze games as models of strategic interaction. A game in extensive or strategic form is created and nicely displayed with a graphical user interface in a web browser. State-of-the-art algorithms then compute all Nash equilibria of the game after a mouseclick. In tutorial fashion, we present how the program is used, and t… ▽ More

    Submitted 16 March, 2014; originally announced March 2014.

    MSC Class: 91-08 ACM Class: G.1.10; G.4

    Journal ref: Computational Management Science 12:1, 5-33 (2015)

  45. arXiv:1310.7419  [pdf, ps, other

    cs.GT

    Finding Approximate Nash Equilibria of Bimatrix Games via Payoff Queries

    Authors: John Fearnley, Rahul Savani

    Abstract: We study the deterministic and randomized query complexity of finding approximate equilibria in bimatrix games. We show that the deterministic query complexity of finding an $ε$-Nash equilibrium when $ε< \frac{1}{2}$ is $Ω(k^2)$, even in zero-one constant-sum games. In combination with previous results \cite{FGGS13}, this provides a complete characterization of the deterministic query complexity o… ▽ More

    Submitted 12 February, 2014; v1 submitted 28 October, 2013; originally announced October 2013.

  46. arXiv:1309.7258  [pdf, other

    cs.GT

    Polylogarithmic Supports are required for Approximate Well-Supported Nash Equilibria below 2/3

    Authors: Yogesh Anbalagan, Sergey Norin, Rahul Savani, Adrian Vetta

    Abstract: In an epsilon-approximate Nash equilibrium, a player can gain at most epsilon in expectation by unilateral deviation. An epsilon well-supported approximate Nash equilibrium has the stronger requirement that every pure strategy used with positive probability must have payoff within epsilon of the best response payoff. Daskalakis, Mehta and Papadimitriou conjectured that every win-lose bimatrix game… ▽ More

    Submitted 21 March, 2014; v1 submitted 27 September, 2013; originally announced September 2013.

    Comments: Added details on related work (footnote 7 expanded)

  47. arXiv:1302.3116  [pdf, ps, other

    cs.GT

    Learning Equilibria of Games via Payoff Queries

    Authors: John Fearnley, Martin Gairing, Paul Goldberg, Rahul Savani

    Abstract: A recent body of experimental literature has studied empirical game-theoretical analysis, in which we have partial knowledge of a game, consisting of observations of a subset of the pure-strategy profiles and their associated payoffs to players. The aim is to find an exact or approximate Nash equilibrium of the game, based on these observations. It is usually assumed that the strategy profiles may… ▽ More

    Submitted 12 February, 2014; v1 submitted 13 February, 2013; originally announced February 2013.

  48. arXiv:1204.0707  [pdf, ps, other

    cs.GT

    Approximate Well-supported Nash Equilibria below Two-thirds

    Authors: John Fearnley, Paul W. Goldberg, Rahul Savani, Troels Bjerre Sørensen

    Abstract: In an epsilon-Nash equilibrium, a player can gain at most epsilon by changing his behaviour. Recent work has addressed the question of how best to compute epsilon-Nash equilibria, and for what values of epsilon a polynomial-time algorithm exists. An epsilon-well-supported Nash equilibrium (epsilon-WSNE) has the additional requirement that any strategy that is used with non-zero probability by a pl… ▽ More

    Submitted 2 December, 2014; v1 submitted 3 April, 2012; originally announced April 2012.

  49. arXiv:1103.1040  [pdf, ps, other

    cs.GT

    On the Approximation Performance of Fictitious Play in Finite Games

    Authors: Paul W. Goldberg, Rahul Savani, Troels Bjerre Sorensen, Carmine Ventre

    Abstract: We study the performance of Fictitious Play, when used as a heuristic for finding an approximate Nash equilibrium of a 2-player game. We exhibit a class of 2-player games having payoffs in the range [0,1] that show that Fictitious Play fails to find a solution having an additive approximation guarantee significantly better than 1/2. Our construction shows that for n times n games, in the worst cas… ▽ More

    Submitted 19 March, 2011; v1 submitted 5 March, 2011; originally announced March 2011.

  50. arXiv:1006.5352  [pdf, other

    cs.GT cs.CC

    The Complexity of the Homotopy Method, Equilibrium Selection, and Lemke-Howson Solutions

    Authors: Paul W. Goldberg, Christos H. Papadimitriou, Rahul Savani

    Abstract: We show that the widely used homotopy method for solving fixpoint problems, as well as the Harsanyi-Selten equilibrium selection process for games, are PSPACE-complete to implement. Extending our result for the Harsanyi-Selten process, we show that several other homotopy-based algorithms for finding equilibria of games are also PSPACE-complete to implement. A further application of our techniques… ▽ More

    Submitted 4 August, 2011; v1 submitted 28 June, 2010; originally announced June 2010.

    Comments: 23 pages, 1 figure; to appear in FOCS 2011 conference