-
BAR Nash Equilibrium and Application to Blockchain Design
Authors:
Maxime Reynouard,
Rida Laraki,
Olga Gorelkina
Abstract:
This paper presents a novel solution concept, called BAR Nash Equilibrium (BARNE) and apply it to analyse the Verifier's dilemma, a fundamental problem in blockchain. Our solution concept adapts the Nash equilibrium (NE) to accommodate interactions among Byzantine, altruistic and rational agents, which became known as the BAR setting in the literature. We prove the existence of BARNE in a large cl…
▽ More
This paper presents a novel solution concept, called BAR Nash Equilibrium (BARNE) and apply it to analyse the Verifier's dilemma, a fundamental problem in blockchain. Our solution concept adapts the Nash equilibrium (NE) to accommodate interactions among Byzantine, altruistic and rational agents, which became known as the BAR setting in the literature. We prove the existence of BARNE in a large class of games and introduce two natural refinements, global and local stability. Using this equilibrium and its refinement, we analyse the free-rider problem in the context of byzantine consensus. We demonstrate that by incorporating fines and forced errors into a standard quorum-based blockchain protocol, we can effectively reestablish honest behavior as a globally stable BARNE.
△ Less
Submitted 31 January, 2024; v1 submitted 30 January, 2024;
originally announced January 2024.
-
O'Neill's Theorem for Games
Authors:
Srihari Govindan,
Rida Laraki,
Lucas Pahl
Abstract:
We present an analog of O'Neill's Theorem (Theorem 5.2 in [17]) for finite games, which reveals a picture of the structure of equilibria under payoff perturbations in finite games.
We present an analog of O'Neill's Theorem (Theorem 5.2 in [17]) for finite games, which reveals a picture of the structure of equilibria under payoff perturbations in finite games.
△ Less
Submitted 6 December, 2023;
originally announced December 2023.
-
Grading and Ranking Large number of candidates
Authors:
Rida Laraki,
Estelle Varloot
Abstract:
It is common that a jury must grade a set of candidates in a cardinal scale such as {1,2,3,4,5} or an ordinal scale such as {Great, Good, Average, Bad }. When the number of candidates is very large such as hotels (BOOKING), restaurants (GOOGLE), apartments (AIRBNB), drivers (UBER), or papers (EC), it is unreasonable to assume that each jury member will provide a separate grade for each candidate.…
▽ More
It is common that a jury must grade a set of candidates in a cardinal scale such as {1,2,3,4,5} or an ordinal scale such as {Great, Good, Average, Bad }. When the number of candidates is very large such as hotels (BOOKING), restaurants (GOOGLE), apartments (AIRBNB), drivers (UBER), or papers (EC), it is unreasonable to assume that each jury member will provide a separate grade for each candidate. Each jury member is more likely to abstain for some candidates, cast a blank vote, or be associated at random, or as a function of its expertise, with only a small subset of the candidates and is asked to grade each of those. Extending the classical theory, we study aggregation methods in which a voter will not be eligible to grade all the candidates, and the candidates are not eligible for the same sets of voters. Moreover, each candidate on which they are eligible, the voter will have the choice between: a blank vote, grade the candidate, or abstain. Assuming single-peaked preferences over the grades, we axiomatically characterise a broad class of strategy-proof grading mechanisms satisfying axioms such as unanimity, anonymity, neutrality, participation or consistency. Finally, when a strict ranking is necessary (to distinguish let say between two borderline papers in a conference), some tie-breaking rules, extending the leximin and majority judgment, are defined and are shown to be equivalent to some strategy-proof grading functions on a richer space of outcome. Our paper will propose new rules, called phantom-proxy mechanisms, to aggregate the votes in the examples above or others, which differ from the usual average mark, that are easily manipulable. Moreover, the phantom-proxy are able to reduce the injustices caused by some candidates juries too generous or severe.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Smooth Fictitious Play in Stochastic Games with Perturbed Payoffs and Unknown Transitions
Authors:
Lucas Baudin,
Rida Laraki
Abstract:
Recent extensions to dynamic games of the well-known fictitious play learning procedure in static games were proved to globally converge to stationary Nash equilibria in two important classes of dynamic games (zero-sum and identical-interest discounted stochastic games). However, those decentralized algorithms need the players to know exactly the model (the transition probabilities and their payof…
▽ More
Recent extensions to dynamic games of the well-known fictitious play learning procedure in static games were proved to globally converge to stationary Nash equilibria in two important classes of dynamic games (zero-sum and identical-interest discounted stochastic games). However, those decentralized algorithms need the players to know exactly the model (the transition probabilities and their payoffs at every stage). To overcome these strong assumptions, our paper introduces regularizations of the systems in (Leslie 2020; Baudin 2022) to construct a family of new decentralized learning algorithms which are model-free (players don't know the transitions and their payoffs are perturbed at every stage). Our procedures can be seen as extensions to stochastic games of the classical smooth fictitious play learning procedures in static games (where the players best responses are regularized, thanks to a smooth strictly concave perturbation of their payoff functions). We prove the convergence of our family of procedures to stationary regularized Nash equilibria in zero-sum and identical-interest discounted stochastic games. The proof uses the continuous smooth best-response dynamics counterparts, and stochastic approximation methods. When there is only one player, our problem is an instance of Reinforcement Learning and our procedures are proved to globally converge to the optimal stationary policy of the regularized MDP. In that sense, they can be seen as an alternative to the well known Q-learning procedure.
△ Less
Submitted 7 July, 2022;
originally announced July 2022.
-
An $α$-No-Regret Algorithm For Graphical Bilinear Bandits
Authors:
Geovani Rizk,
Igor Colin,
Albert Thomas,
Rida Laraki,
Yann Chevaleyre
Abstract:
We propose the first regret-based approach to the Graphical Bilinear Bandits problem, where $n$ agents in a graph play a stochastic bilinear bandit game with each of their neighbors. This setting reveals a combinatorial NP-hard problem that prevents the use of any existing regret-based algorithm in the (bi-)linear bandit literature. In this paper, we fill this gap and present the first regret-base…
▽ More
We propose the first regret-based approach to the Graphical Bilinear Bandits problem, where $n$ agents in a graph play a stochastic bilinear bandit game with each of their neighbors. This setting reveals a combinatorial NP-hard problem that prevents the use of any existing regret-based algorithm in the (bi-)linear bandit literature. In this paper, we fill this gap and present the first regret-based algorithm for graphical bilinear bandits using the principle of optimism in the face of uncertainty. Theoretical analysis of this new method yields an upper bound of $\tilde{O}(\sqrt{T})$ on the $α$-regret and evidences the impact of the graph structure on the rate of convergence. Finally, we show through various experiments the validity of our approach.
△ Less
Submitted 12 October, 2022; v1 submitted 1 June, 2022;
originally announced June 2022.
-
Best-Response Dynamics and Fictitious Play in Identical-Interest and Zero-Sum Stochastic Games
Authors:
Lucas Baudin,
Rida Laraki
Abstract:
This paper combines ideas from Q-learning and fictitious play to define three reinforcement learning procedures which converge to the set of stationary mixed Nash equilibria in identical interest discounted stochastic games. First, we analyse three continuous-time systems that generalize the best-response dynamics defined by Leslie et al. for zero-sum discounted stochastic games. Under some assump…
▽ More
This paper combines ideas from Q-learning and fictitious play to define three reinforcement learning procedures which converge to the set of stationary mixed Nash equilibria in identical interest discounted stochastic games. First, we analyse three continuous-time systems that generalize the best-response dynamics defined by Leslie et al. for zero-sum discounted stochastic games. Under some assumptions depending on the system, the dynamics are shown to converge to the set of stationary equilibria in identical interest discounted stochastic games. Then, we introduce three analog discrete-time procedures in the spirit of Sayin et al. and demonstrate their convergence to the set of stationary equilibria using our results in continuous time together with stochastic approximation techniques. Some numerical experiments complement our theoretical findings.
△ Less
Submitted 16 May, 2022; v1 submitted 8 November, 2021;
originally announced November 2021.
-
Level-strategyproof Belief Aggregation Mechanisms
Authors:
Rida Laraki,
Estelle Varloot
Abstract:
In the problem of aggregating experts' probabilistic predictions over an ordered set of outcomes, we introduce the axiom of level-strategy\-proofness (level-SP) and prove that it is a natural notion with several applications. Moreover, it is a robust concept as it implies incentive compatibility in a rich domain of single-peakedness over the space of cumulative distribution functions (CDFs). This…
▽ More
In the problem of aggregating experts' probabilistic predictions over an ordered set of outcomes, we introduce the axiom of level-strategy\-proofness (level-SP) and prove that it is a natural notion with several applications. Moreover, it is a robust concept as it implies incentive compatibility in a rich domain of single-peakedness over the space of cumulative distribution functions (CDFs). This contrasts with the literature which assumes single-peaked preferences over the space of probability distributions. Our main results are: (1) a reduction of our problem to the aggregation of CDFs; (2) the axiomatic characterization of level-SP probability aggregation functions with and without the addition of other axioms; (3) impossibility results which provide bounds for our characterization; (4) the axiomatic characterization of two new and practical level-SP methods: the proportional-cumulative method and the middlemost-cumulative method; and (5) the application of proportional-cumulative to extend approval voting, majority rule, and majority judgment methods to situations where voters/experts are uncertain about how to grade the candidates/alternatives to be ranked.\footnote{We are grateful to Thomas Boyer-Kassem, Roger Cooke, Aris Filos-Ratsikas, Hervé Moulin, Clemens Puppe and some anonymous EC2021 referees for their helpful comments and suggestions.}
\keywords{Probability Aggregation Functions \and ordered Set of Alternatives \and Level Strategy-Proofness \and Proportional-Cumulative \and Middlemost-Cumulative}
△ Less
Submitted 13 September, 2022; v1 submitted 10 August, 2021;
originally announced August 2021.
-
EPTAS for stable allocations in matching games
Authors:
Felipe Garrido-Lucero,
Rida Laraki
Abstract:
Gale-Shapley introduced a matching problem between two sets of agents where each agent on one side has a preference over the agents of the other side and proved algorithmically the existence of a pairwise stable matching (i.e. no uncoupled pair can be better off by matching). Shapley-Shubik, Demange-Gale, and many others extended the model by allowing monetary transfers. In this paper, we study an…
▽ More
Gale-Shapley introduced a matching problem between two sets of agents where each agent on one side has a preference over the agents of the other side and proved algorithmically the existence of a pairwise stable matching (i.e. no uncoupled pair can be better off by matching). Shapley-Shubik, Demange-Gale, and many others extended the model by allowing monetary transfers. In this paper, we study an extension where matched couples obtain their payoffs as the outcome of a strategic game and more particularly a solution concept that combines Gale-Shapley pairwise stability with a constrained Nash equilibrium notion (no player can increase its payoff by playing a different strategy without violating the participation constraint of the partner). Whenever all couples play zero-sum matrix games, strictly competitive bi-matrix games, or infinitely repeated bi-matrix games, we can prove that a modification of some algorithms in the literature converge to an $\varepsilon$-stable allocation in at most $O(\frac{1}{\varepsilon})$ steps where each step is polynomial (linear with respect to the number of players and polynomial of degree at most 5 with respect to the number of pure actions per player).
△ Less
Submitted 20 July, 2021; v1 submitted 15 July, 2021;
originally announced July 2021.
-
Learning in nonatomic games, Part I: Finite action spaces and population games
Authors:
Saeed Hadikhanloo,
Rida Laraki,
Panayotis Mertikopoulos,
Sylvain Sorin
Abstract:
We examine the long-run behavior of a wide range of dynamics for learning in nonatomic games, in both discrete and continuous time. The class of dynamics under consideration includes fictitious play and its regularized variants, the best-reply dynamics (again, possibly regularized), as well as the dynamics of dual averaging / "follow the regularized leader" (which themselves include as special cas…
▽ More
We examine the long-run behavior of a wide range of dynamics for learning in nonatomic games, in both discrete and continuous time. The class of dynamics under consideration includes fictitious play and its regularized variants, the best-reply dynamics (again, possibly regularized), as well as the dynamics of dual averaging / "follow the regularized leader" (which themselves include as special cases the replicator dynamics and Friedman's projection dynamics). Our analysis concerns both the actual trajectory of play and its time-average, and we cover potential and monotone games, as well as games with an evolutionarily stable state (global or otherwise). We focus exclusively on games with finite action spaces; nonatomic games with continuous action spaces are treated in detail in Part II of this paper.
△ Less
Submitted 4 July, 2021;
originally announced July 2021.
-
New Characterizations of Strategy-Proofness under Single-Peakedness
Authors:
Andrew Jennings,
Rida Laraki,
Clemens Puppe,
Estelle Varloot
Abstract:
We provide novel simple representations of strategy-proof voting rules when voters have uni-dimensional single-peaked preferences (as well as multi-dimensional separable preferences). The analysis recovers, links and unifies existing results in the literature such as Moulin's classic characterization in terms of phantom voters and Barberà, Gul and Stacchetti's in terms of winning coalitions ("gene…
▽ More
We provide novel simple representations of strategy-proof voting rules when voters have uni-dimensional single-peaked preferences (as well as multi-dimensional separable preferences). The analysis recovers, links and unifies existing results in the literature such as Moulin's classic characterization in terms of phantom voters and Barberà, Gul and Stacchetti's in terms of winning coalitions ("generalized median voter schemes"). First, we compare the computational properties of the various representations and show that the grading curve representation is superior in terms of computational complexity. Moreover, the new approach allows us to obtain new characterizations when strategy-proofness is combined with other desirable properties such as anonymity, responsiveness, ordinality, participation, consistency, or proportionality. In the anonymous case, two methods are single out: the -- well know -- ordinal median and the -- most recent -- linear median.
△ Less
Submitted 16 June, 2022; v1 submitted 23 February, 2021;
originally announced February 2021.
-
Best Arm Identification in Graphical Bilinear Bandits
Authors:
Geovani Rizk,
Albert Thomas,
Igor Colin,
Rida Laraki,
Yann Chevaleyre
Abstract:
We introduce a new graphical bilinear bandit problem where a learner (or a \emph{central entity}) allocates arms to the nodes of a graph and observes for each edge a noisy bilinear reward representing the interaction between the two end nodes. We study the best arm identification problem in which the learner wants to find the graph allocation maximizing the sum of the bilinear rewards. By efficien…
▽ More
We introduce a new graphical bilinear bandit problem where a learner (or a \emph{central entity}) allocates arms to the nodes of a graph and observes for each edge a noisy bilinear reward representing the interaction between the two end nodes. We study the best arm identification problem in which the learner wants to find the graph allocation maximizing the sum of the bilinear rewards. By efficiently exploiting the geometry of this bandit problem, we propose a \emph{decentralized} allocation strategy based on random sampling with theoretical guarantees. In particular, we characterize the influence of the graph structure (e.g. star, complete or circle) on the convergence rate and propose empirical experiments that confirm this dependency.
△ Less
Submitted 10 June, 2021; v1 submitted 14 December, 2020;
originally announced December 2020.
-
Stable Matching Games
Authors:
Felipe Garrido-Lucero,
Rida Laraki
Abstract:
Gale and Shapley introduced a matching problem between two sets of agents where each agent on one side has an exogenous preference ordering over the agents on the other side. They defined a matching as stable if no unmatched pair can both improve their utility by forming a new pair. They proved, algorithmically, the existence of a stable matching. Shapley and Shubik, Demange and Gale, and many oth…
▽ More
Gale and Shapley introduced a matching problem between two sets of agents where each agent on one side has an exogenous preference ordering over the agents on the other side. They defined a matching as stable if no unmatched pair can both improve their utility by forming a new pair. They proved, algorithmically, the existence of a stable matching. Shapley and Shubik, Demange and Gale, and many others extended the model by allowing monetary transfers. We offer a further extension by assuming that matched couples obtain their payoff endogenously as the outcome of a strategic game they have to play in a usual non-cooperative sense (without commitment) or in a semi-cooperative way (with commitment, as the outcome of a bilateral binding contract in which each player is responsible for her part of the contract). Depending on whether the players can commit or not, we define in each case a solution concept that combines Gale-Shapley pairwise stability with a (generalized) Nash equilibrium stability. In each case we give necessary and sufficient conditions for the set of solutions to be non-empty and provide an algorithm to compute a solution.
△ Less
Submitted 15 March, 2024; v1 submitted 4 August, 2020;
originally announced August 2020.
-
On Sustainable Equilibria
Authors:
Srihari Govindan,
Rida Laraki,
Lucas Pahl
Abstract:
Following the ideas laid out in Myerson (1996), Hofbauer (2000) defined a Nash equilibrium of a finite game as sustainable if it can be made the unique Nash equilibrium of a game obtained by deleting/adding a subset of the strategies that are inferior replies to it. This paper proves two results about sustainable equilibria. The first concerns the Hofbauer-Myerson conjecture about the relationship…
▽ More
Following the ideas laid out in Myerson (1996), Hofbauer (2000) defined a Nash equilibrium of a finite game as sustainable if it can be made the unique Nash equilibrium of a game obtained by deleting/adding a subset of the strategies that are inferior replies to it. This paper proves two results about sustainable equilibria. The first concerns the Hofbauer-Myerson conjecture about the relationship between the sustainability of an equilibrium and its index: for a generic class of games, an equilibrium is sustainable iff its index is $+1$. Von Schemde and von Stengel (2008) proved this conjecture for bimatrix games; we show that the conjecture is true for all finite games. More precisely, we prove that an isolated equilibrium has index +1 if and only if it can be made unique in a larger game obtained by adding finitely many strategies that are inferior replies to that equilibrium. Our second result gives an axiomatic extension of sustainability to all games and shows that only the Nash components with positive index can be sustainable.
△ Less
Submitted 10 August, 2021; v1 submitted 28 May, 2020;
originally announced May 2020.
-
NGO-GM: Natural Gradient Optimization for Graphical Models
Authors:
Eric Benhamou,
Jamal Atif,
Rida Laraki,
David Saltiel
Abstract:
This paper deals with estimating model parameters in graphical models. We reformulate it as an information geometric optimization problem and introduce a natural gradient descent strategy that incorporates additional meta parameters. We show that our approach is a strong alternative to the celebrated EM approach for learning in graphical models. Actually, our natural gradient based strategy leads…
▽ More
This paper deals with estimating model parameters in graphical models. We reformulate it as an information geometric optimization problem and introduce a natural gradient descent strategy that incorporates additional meta parameters. We show that our approach is a strong alternative to the celebrated EM approach for learning in graphical models. Actually, our natural gradient based strategy leads to learning optimal parameters for the final objective function without artificially trying to fit a distribution that may not correspond to the real one. We support our theoretical findings with the question of trend detection in financial markets and show that the learned model performs better than traditional practitioner methods and is less prone to overfitting.
△ Less
Submitted 14 May, 2019;
originally announced May 2019.
-
A discrete version of CMA-ES
Authors:
Eric Benhamou,
Jamal Atif,
Rida Laraki
Abstract:
Modern machine learning uses more and more advanced optimization techniques to find optimal hyper parameters. Whenever the objective function is non-convex, non continuous and with potentially multiple local minima, standard gradient descent optimization methods fail. A last resource and very different method is to assume that the optimum(s), not necessarily unique, is/are distributed according to…
▽ More
Modern machine learning uses more and more advanced optimization techniques to find optimal hyper parameters. Whenever the objective function is non-convex, non continuous and with potentially multiple local minima, standard gradient descent optimization methods fail. A last resource and very different method is to assume that the optimum(s), not necessarily unique, is/are distributed according to a distribution and iteratively to adapt the distribution according to tested points. These strategies originated in the early 1960s, named Evolution Strategy (ES) have culminated with the CMA-ES (Covariance Matrix Adaptation) ES. It relies on a multi variate normal distribution and is supposed to be state of the art for general optimization program. However, it is far from being optimal for discrete variables. In this paper, we extend the method to multivariate binomial correlated distributions. For such a distribution, we show that it shares similar features to the multi variate normal: independence and correlation is equivalent and correlation is efficiently modeled by interaction between different variables. We discuss this distribution in the framework of the exponential family. We prove that the model can estimate not only pairwise interactions among the two variables but also is capable of modeling higher order interactions. This allows creating a version of CMA ES that can accommodate efficiently discrete variables. We provide the corresponding algorithm and conclude.
△ Less
Submitted 11 February, 2019; v1 submitted 27 December, 2018;
originally announced December 2018.
-
Operator norm upper bound for sub-Gaussian tailed random matrices
Authors:
Eric Benhamou,
Jamal Atif,
Rida Laraki
Abstract:
This paper investigates an upper bound of the operator norm for sub-Gaussian tailed random matrices. A lot of attention has been put on uniformly bounded sub-Gaussian tailed random matrices with independent coefficients. However, little has been done for sub-Gaussian tailed random matrices whose matrix coefficients variance are not equal or for matrix for which coefficients are not independent. Th…
▽ More
This paper investigates an upper bound of the operator norm for sub-Gaussian tailed random matrices. A lot of attention has been put on uniformly bounded sub-Gaussian tailed random matrices with independent coefficients. However, little has been done for sub-Gaussian tailed random matrices whose matrix coefficients variance are not equal or for matrix for which coefficients are not independent. This is precisely the subject of this paper. After proving that random matrices with uniform sub-Gaussian tailed independent coefficients satisfy the Tracy Widom bound, that is, their matrix operator norm remains bounded by $O(\sqrt n )$ with overwhelming probability, we prove that a less stringent condition is that the matrix rows are independent and uniformly sub-Gaussian. This does not impose in particular that all matrix coefficients are independent, but only their rows, which is a weaker condition.
△ Less
Submitted 19 January, 2019; v1 submitted 22 December, 2018;
originally announced December 2018.
-
A new approach to learning in Dynamic Bayesian Networks (DBNs)
Authors:
E. Benhamou,
J. Atif,
R. Laraki
Abstract:
In this paper, we revisit the parameter learning problem, namely the estimation of model parameters for Dynamic Bayesian Networks (DBNs). DBNs are directed graphical models of stochastic processes that encompasses and generalize Hidden Markov models (HMMs) and Linear Dynamical Systems (LDSs). Whenever we apply these models to economics and finance, we are forced to make some modeling assumptions a…
▽ More
In this paper, we revisit the parameter learning problem, namely the estimation of model parameters for Dynamic Bayesian Networks (DBNs). DBNs are directed graphical models of stochastic processes that encompasses and generalize Hidden Markov models (HMMs) and Linear Dynamical Systems (LDSs). Whenever we apply these models to economics and finance, we are forced to make some modeling assumptions about the state dynamics and the graph topology (the DBN structure). These assumptions may be incorrectly specified and contain some additional noise compared to reality. Trying to use a best fit approach through maximum likelihood estimation may miss this point and try to fit at any price these models on data. We present here a new methodology that takes a radical point of view and instead focus on the final efficiency of our model. Parameters are hence estimated in terms of their efficiency rather than their distributional fit to the data. The resulting optimization problem that consists in finding the optimal parameters is a hard problem. We rely on Covariance Matrix Adaptation Evolution Strategy (CMA-ES) method to tackle this issue. We apply this method to the seminal problem of trend detection in financial markets. We see on numerical results that the resulting parameters seem less error prone to over fitting than traditional moving average cross over trend detection and perform better. The method developed here for algorithmic trading is general. It can be applied to other real case applications whenever there is no physical law underlying our DBNs.
△ Less
Submitted 11 February, 2019; v1 submitted 21 December, 2018;
originally announced December 2018.
-
Acyclic Gambling Games
Authors:
Rida Laraki,
Jérôme Renault
Abstract:
We consider 2-player zero-sum stochastic games where each player controls his own state variable living in a compact metric space. The terminology comes from gambling problems where the state of a player represents its wealth in a casino. Under natural assumptions (such as continuous running payoff and non expansive transitions), we consider for each discount factor the value v $λ$ of the $λ$-disc…
▽ More
We consider 2-player zero-sum stochastic games where each player controls his own state variable living in a compact metric space. The terminology comes from gambling problems where the state of a player represents its wealth in a casino. Under natural assumptions (such as continuous running payoff and non expansive transitions), we consider for each discount factor the value v $λ$ of the $λ$-discounted stochastic game and investigate its limit when $λ$ goes to 0. We show that under a strong acyclicity condition, the limit exists and is characterized as the unique solution of a system of functional equations: the limit is the unique continuous excessive and depressive function such that each player, if his opponent does not move, can reach the zone when the current payoff is at least as good than the limit value, without degrading the limit value. The approach generalizes and provides a new viewpoint on the Mertens-Zamir system coming from the study of zero-sum repeated games with lack of information on both sides. A counterexample shows that under a slightly weaker notion of acyclicity, convergence of (v $λ$) may fail.
△ Less
Submitted 22 February, 2017;
originally announced February 2017.
-
Approachability of convex sets in generalized quitting games
Authors:
János Flesch,
Rida Laraki,
Vianney Perchet
Abstract:
We consider Blackwell approachability, a very powerful and geometric tool in game theory, used for example to design strategies of the uninformed player in repeated games with incomplete information. We extend this theory to "generalized quitting games" , a class of repeated stochastic games in which each player may have quitting actions, such as the Big-Match. We provide three simple geometric an…
▽ More
We consider Blackwell approachability, a very powerful and geometric tool in game theory, used for example to design strategies of the uninformed player in repeated games with incomplete information. We extend this theory to "generalized quitting games" , a class of repeated stochastic games in which each player may have quitting actions, such as the Big-Match. We provide three simple geometric and strongly related conditions for the weak approachability of a convex target set. The first is sufficient: it guarantees that, for any fixed horizon, a player has a strategy ensuring that the expected time-average payoff vector converges to the target set as horizon goes to infinity. The third is necessary: if it is not satisfied, the opponent can weakly exclude the target set. In the special case where only the approaching player can quit the game (Big-Match of type I), the three conditions are equivalent and coincide with Blackwell's condition. Consequently, we obtain a full characterization and prove that the game is weakly determined-every convex set is either weakly approachable or weakly excludable. In games where only the opponent can quit (Big-Match of type II), none of our conditions is both sufficient and necessary for weak approachability. We provide a continuous time sufficient condition using techniques coming from differential games, and show its usefulness in practice, in the spirit of Vieille's seminal work for weak approachability.Finally, we study uniform approachability where the strategy should not depend on the horizon and demonstrate that, in contrast with classical Blackwell approacha-bility for convex sets, weak approachability does not imply uniform approachability.
△ Less
Submitted 28 September, 2016;
originally announced September 2016.
-
Inertial game dynamics and applications to constrained optimization
Authors:
Rida Laraki,
Panayotis Mertikopoulos
Abstract:
Aiming to provide a new class of game dynamics with good long-term rationality properties, we derive a second-order inertial system that builds on the widely studied "heavy ball with friction" optimization method. By exploiting a well-known link between the replicator dynamics and the Shahshahani geometry on the space of mixed strategies, the dynamics are stated in a Riemannian geometric framework…
▽ More
Aiming to provide a new class of game dynamics with good long-term rationality properties, we derive a second-order inertial system that builds on the widely studied "heavy ball with friction" optimization method. By exploiting a well-known link between the replicator dynamics and the Shahshahani geometry on the space of mixed strategies, the dynamics are stated in a Riemannian geometric framework where trajectories are accelerated by the players' unilateral payoff gradients and they slow down near Nash equilibria. Surprisingly (and in stark contrast to another second-order variant of the replicator dynamics), the inertial replicator dynamics are not well-posed; on the other hand, it is possible to obtain a well-posed system by endowing the mixed strategy space with a different Hessian-Riemannian (HR) metric structure, and we characterize those HR geometries that do so. In the single-agent version of the dynamics (corresponding to constrained optimization over simplex-like objects), we show that regular maximum points of smooth functions attract all nearby solution orbits with low initial speed. More generally, we establish an inertial variant of the so-called "folk theorem" of evolutionary game theory and we show that strict equilibria are attracting in asymmetric (multi-population) games - provided of course that the dynamics are well-posed. A similar asymptotic stability result is obtained for evolutionarily stable strategies in symmetric (single- population) games.
△ Less
Submitted 2 March, 2015; v1 submitted 4 May, 2013;
originally announced May 2013.
-
Higher Order Game Dynamics
Authors:
Rida Laraki,
Panayotis Mertikopoulos
Abstract:
Continuous-time game dynamics are typically first order systems where payoffs determine the growth rate of the players' strategy shares. In this paper, we investigate what happens beyond first order by viewing payoffs as higher order forces of change, specifying e.g. the acceleration of the players' evolution instead of its velocity (a viewpoint which emerges naturally when it comes to aggregating…
▽ More
Continuous-time game dynamics are typically first order systems where payoffs determine the growth rate of the players' strategy shares. In this paper, we investigate what happens beyond first order by viewing payoffs as higher order forces of change, specifying e.g. the acceleration of the players' evolution instead of its velocity (a viewpoint which emerges naturally when it comes to aggregating empirical data of past instances of play). To that end, we derive a wide class of higher order game dynamics, generalizing first order imitative dynamics, and, in particular, the replicator dynamics. We show that strictly dominated strategies become extinct in n-th order payoff-monotonic dynamics n orders as fast as in the corresponding first order dynamics; furthermore, in stark contrast to first order, weakly dominated strategies also become extinct for n>1. All in all, higher order payoff-monotonic dynamics lead to the elimination of weakly dominated strategies, followed by the iterated deletion of strictly dominated strategies, thus providing a dynamic justification of the well-known epistemic rationalizability process of Dekel and Fudenberg (1990). Finally, we also establish a higher order analogue of the folk theorem of evolutionary game theory, and we show that con- vergence to strict equilibria in n-th order dynamics is n orders as fast as in first order.
△ Less
Submitted 31 July, 2013; v1 submitted 19 June, 2012;
originally announced June 2012.
-
Equilibrium in Two-Player Non-Zero-Sum Dynkin Games in Continuous Time
Authors:
Rida Laraki,
Eilon Solan
Abstract:
We prove that every two-player non-zero-sum Dynkin game in continuous time admits an epsilon-equilibrium in randomized stop** times. We provide a condition that ensures the existence of an epsilon-equilibrium in non-randomized stop** times.
We prove that every two-player non-zero-sum Dynkin game in continuous time admits an epsilon-equilibrium in randomized stop** times. We provide a condition that ensures the existence of an epsilon-equilibrium in non-randomized stop** times.
△ Less
Submitted 28 September, 2010;
originally announced September 2010.
-
Semidefinite Programming for Min-Max Problems and Games
Authors:
Rida Laraki,
Jean B. Lasserre
Abstract:
We introduce two min-max problems: the first problem is to minimize the supremum of finitely many rational functions over a compact basic semi-algebraic set whereas the second problem is a 2-player zero-sum polynomial game in randomized strategies and with compact basic semi-algebraic pure strategy sets. It is proved that their optimal solution can be approximated by solving a hierarchy of semid…
▽ More
We introduce two min-max problems: the first problem is to minimize the supremum of finitely many rational functions over a compact basic semi-algebraic set whereas the second problem is a 2-player zero-sum polynomial game in randomized strategies and with compact basic semi-algebraic pure strategy sets. It is proved that their optimal solution can be approximated by solving a hierarchy of semidefinite relaxations, in the spirit of the moment approach developed in Lasserre. This provides a unified approach and a class of algorithms to approximate all Nash equilibria and min-max strategies of many static and dynamic games. Each semidefinite relaxation can be solved in time which is polynomial in its input size and practice from global optimization suggests that very often few relaxations are needed for a good approximation (and sometimes even finite convergence).
△ Less
Submitted 16 December, 2009; v1 submitted 17 October, 2008;
originally announced October 2008.
-
Stop** games in continuous time
Authors:
Rida Laraki,
Eilon Solan
Abstract:
We study two-player zero-sum stop** games in continuous time and infinite horizon. We prove that the value in randomized stop** times exists as soon as the payoff processes are right-continuous. In particular, as opposed to existing literature, we do not assume any conditions on the relations between the payoff processes. We also show that both players have simple epsilon- optimal randomized…
▽ More
We study two-player zero-sum stop** games in continuous time and infinite horizon. We prove that the value in randomized stop** times exists as soon as the payoff processes are right-continuous. In particular, as opposed to existing literature, we do not assume any conditions on the relations between the payoff processes. We also show that both players have simple epsilon- optimal randomized stop** times; namely, randomized stop** times which are small perturbations of non-randomized stop** times.
△ Less
Submitted 19 June, 2003;
originally announced June 2003.