Rhoban Football Club: RoboCup Humanoid Kid-Size 2023 Champion Team Paper

Authors: Julien Allali, Adrien Boussicault, Cyprien Brocaire, Céline Dobigeon, Marc Duclusaud, Clément Gaspard, Hugo Gimbert, Loïc Gondry, Olivier Ly, Grégoire Passault, Antoine Pirrone

Abstract: In 2023, Rhoban Football Club reached first place in the KidSize soccer competition for the fifth time, and received the best humanoid award. This paper presents and reviews important points in robot architecture and workflow, with hindsight from the competition.

Submitted 1 February, 2024; originally announced February 2024.

Journal ref: RoboCup Symposium 2023

arXiv:2305.10546 [pdf, other]

Games on Graphs

Authors: Nathanaël Fijalkow, Nathalie Bertrand, Patricia Bouyer-Decitre, Romain Brenguier, Arnaud Carayol, John Fearnley, Hugo Gimbert, Florian Horn, Rasmus Ibsen-Jensen, Nicolas Markey, Benjamin Monmege, Petr Novotný, Mickael Randour, Ocan Sankur, Sylvain Schmitz, Olivier Serre, Mateusz Skomra

Abstract: The objective of this collaborative textbook is to present the state of the art on games on graphs, which is part of a larger research topic called game theory. Games on graphs is the field concerned with games whose rules and evolution are represented by a graph.

Submitted 17 May, 2023; originally announced May 2023.

Comments: 490 pages. Coordinator: Nathanaël Fijalkow

arXiv:2204.12409 [pdf, ps, other]

Distributed controller synthesis for deadlock avoidance

Authors: Hugo Gimbert, Corto Mascle, Anca Muscholl, Igor Walukiewicz

Abstract: We consider the distributed control synthesis problem for systems with locks. The goal is to find local controllers so that the global system does not deadlock. With no restriction this problem is undecidable even for three processes each using a fixed number of locks. We propose two restrictions that make distributed control decidable. The first one is to allow each process to use at most two locks. The problem then becomes Sigma2P-complete, and even in PTIME under some additional assumptions. The dining philosophers problem satisfies these assumptions. The second restriction is a nested usage of locks. In this case the synthesis problem is NEXPTIME-complete. The drinking philosophers problem falls in this case.

Submitted 13 February, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

Comments: Journal version of a paper published at ICALP 2022

arXiv:2110.14768 [pdf, other]

doi 10.46298/lmcs-18(3:30)2022

Distributed Asynchronous Games With Causal Memory are Undecidable

Authors: Hugo Gimbert

Abstract: We show the undecidability of the distributed control problem when the plant is an asynchronous automaton, the controllers use causal memory and the goal of the controllers is to put each process in a local accepting state.

Submitted 7 September, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

Journal ref: Logical Methods in Computer Science, Volume 18, Issue 3 (September 8, 2022) lmcs:8641

arXiv:2109.09089 [pdf, other]

Constrained School Choice with Incomplete Information

Authors: Hugo Gimbert, Claire Mathieu, Simon Mauras

Abstract: School choice is the two-sided matching market where students (on one side) are to be matched with schools (on the other side) based on their mutual preferences. The classical algorithm to solve this problem is the celebrated deferred acceptance procedure, proposed by Gale and Shapley. After both sides have revealed their mutual preferences, the algorithm computes an optimal stable matching. Most often in practice, notably when the process is implemented by a national clearinghouse and thousands of schools enter the market, there is a quota on the number of applications that a student can submit: students have to perform a partial revelation of their preferences, based on partial information on the market. We model this situation by drawing each student type from a publicly known distribution and study Nash equilibria of the corresponding Bayesian game. We focus on symmetric equilibria, in which all students play the same strategy. We show existence of these equilibria in the general case, and provide two algorithms to compute such equilibria under additional assumptions, including the case where schools have identical preferences over students.

Submitted 19 September, 2021; originally announced September 2021.

arXiv:2002.09941 [pdf, other]

A Bridge between Polynomial Optimization and Games with Imperfect Recall

Authors: Hugo Gimbert, Soumyajit Paul, B. Srivathsan

Abstract: We provide several positive and negative complexity results for solving games with imperfect recall. Using a one-to-one correspondence between these games on one side and multivariate polynomials on the other side, we show that solving games with imperfect recall is as hard as solving certain problems of the first order theory of reals. We establish square root sum hardness even for the specific class of A-loss games. On the positive side, we find restrictions on games and strategies motivated by Bridge bidding that give polynomial-time complexity.

Submitted 25 February, 2020; v1 submitted 23 February, 2020; originally announced February 2020.

arXiv:1904.03890 [pdf, ps, other]

doi 10.1007/978-3-030-86593-1_1

Two-Sided Matching Markets with Correlated Random Preferences

Authors: Hugo Gimbert, Claire Mathieu, Simon Mauras

Abstract: Stable matching in a community consisting of men and women is a classical combinatorial problem that has been the subject of intense theoretical and empirical study since its introduction in 1962 in a seminal paper by Gale and Shapley, who designed the celebrated "deferred acceptance" algorithm for the problem. In the input, each participant ranks participants of the opposite type, so the input consists of a collection of permutations, representing the preference lists. A bipartite matching is unstable if some man-woman pair is blocking: both strictly prefer each other to their partner in the matching. Stability is an important economics concept in matching markets from the viewpoint of manipulability. The unicity of a stable matching implies non-manipulability, and near-unicity implies limited manipulability, thus these are mathematical properties related to the quality of stable matching algorithms. This paper is a theoretical study of the effect of correlations on approximate manipulability of stable matching algorithms. Our approach is to go beyond worst case, assuming that some of the input preference lists are drawn from a distribution. Our model encompasses a discrete probabilistic process inspired by a popularity model introduced by Immorlica and Mahdian, that provides a way to capture correlation between preference lists. Approximate manipulability is approached from several angles: when all stable partners of a person have approximately the same rank; or when most persons have a unique stable partner. Another quantity of interest is a person's number of stable partners. Our results aim to paint a picture of the manipulability of stable matchings in a "beyond worst case" setting.

Submitted 8 March, 2021; v1 submitted 8 April, 2019; originally announced April 2019.

arXiv:1807.00893 [pdf, other]

doi 10.23638/LMCS-15(3:6)2019

Controlling a population

Authors: Nathalie Bertrand, Miheer Dewaskar, Blaise Genest, Hugo Gimbert, Adwait Amit Godbole

Abstract: We introduce a new setting where a population of agents, each modelled by a finite-state system, are controlled uniformly: the controller applies the same action to every agent. The framework is largely inspired by the control of a biological system, namely a population of yeasts, where the controller may only change the environment common to all cells. We study a synchronisation problem for such populations: no matter how individual agents react to the actions of the controller, the controller aims at driving all agents synchronously to a target state. The agents are naturally represented by a non-deterministic finite state automaton (NFA), the same for every agent, and the whole system is encoded as a 2-player game. The first player (Controller) chooses actions, and the second player (Agents) resolves non-determinism for each agent. The game with m agents is called the m-population game. This gives rise to a parameterized control problem (where control refers to 2-player games), namely the population control problem: can Controller control the m-population game for all m in N whatever Agents does?

Submitted 26 July, 2019; v1 submitted 2 July, 2018; originally announced July 2018.

Comments: This is a journal version of the extended abstract arXiv:1707.02058 which appeared in Concur 2017, together with proofs

Journal ref: Logical Methods in Computer Science, Volume 15, Issue 3 (July 29, 2019) lmcs:4662

arXiv:1802.04067 [pdf, ps, other]

Alternating Nonzero Automata

Authors: Paulin Fournier, Hugo Gimbert

Abstract: We introduce a new class of automata on infinite trees called alternating nonzero automata, which extends the class of non-deterministic nonzero automata. We reduce the emptiness problem for alternating nonzero automata to the same problem for non-deterministic ones, which implies decidability. We obtain as a corollary algorithms for the satisfiability of a probabilistic temporal logic extending both CTL* and the qualitative fragment of pCTL*.

Submitted 12 February, 2018; originally announced February 2018.

arXiv:1802.03336

The Monadic Second Order Theory of Grid-Free 1-Safe Petri Nets is Decidable

Authors: Hugo Gimbert

Abstract: Finite 1-safe Petri nets, also called net systems, are natural models of asynchronous concurrency. The event structure of a net system describes all its possible executions and their concurrent nature: two events may be causally ordered, occur in parallel or be conflicting. Monadic second order logic (MSO) can be used to specify behavioural properties of net systems. Thiagarajan's conjecture states that MSO is decidable if and only if the net system is grid-free. The present paper gives a positive answer to this conjecture.

Submitted 12 April, 2022; v1 submitted 9 February, 2018; originally announced February 2018.

Comments: buggy proof and probably false result

arXiv:1709.03122 [pdf, ps, other]

Two Recursively Inseparable Problems for Probabilistic Automata

Authors: Nathanaël Fijalkow, Hugo Gimbert, Florian Horn, Youssouf Oualhadj

Abstract: This paper introduces and investigates decision problems for numberless probabilistic automata, i.e. probabilistic automata where the support of each probabilistic transition is specified, but the exact values of the probabilities are not. A numberless probabilistic automaton can be instantiated into a probabilistic automaton by specifying the exact values of the non-zero probabilistic transitions. We show that the following two properties of numberless probabilistic automata are recursively inseparable: (1) all instances of the numberless automaton have value 1; (2) no instance of the numberless automaton has value 1.

Submitted 10 September, 2017; originally announced September 2017.

Comments: Conference version: MFCS'14

arXiv:1707.02058 [pdf, other]

Controlling a Population

Authors: Nathalie Bertrand, Miheer Dewaskar, Blaise Genest, Hugo Gimbert

Abstract: We introduce a new setting where a population of agents, each modelled by a finite-state system, are controlled uniformly: the controller applies the same action to every agent. The framework is largely inspired by the control of a biological system, namely a population of yeasts, where the controller may only change the environment common to all cells. We study a synchronisation problem for such populations: no matter how individual agents react to the actions of the controller, the controller aims at driving all agents synchronously to a target state. The agents are naturally represented by a non-deterministic finite state automaton (NFA), the same for every agent, and the whole system is encoded as a 2-player game. The first player (Controller) chooses actions, and the second player (Agents) resolves non-determinism for each agent. The game with m agents is called the m-population game. This gives rise to a parameterized control problem (where control refers to 2-player games), namely the population control problem: can Controller control the m-population game for all $m \in N$ whatever Agents does? In this paper, we prove that the population control problem is decidable, and that it is EXPTIME-complete. As far as we know, this is one of the first results on parameterized control. Our algorithm, not based on cutoff techniques, produces winning strategies which are symbolic, that is, they do not need to count precisely how the population is spread between states. We also show that if there is no winning strategy, then there is a population size M such that Controller wins the m-population game if and only if $m \le M$. Surprisingly, M can be doubly exponential in the number of states of the NFA, with tight upper and lower bounds.

Submitted 7 July, 2017; originally announced July 2017.

arXiv:1702.06858 [pdf, ps, other]

Emptiness of zero automata is decidable

Authors: Mikolaj Bojańczyk, Hugo Gimbert, Edon Kelmendi

Abstract: Zero automata are a probabilistic extension of parity automata on infinite trees. The satisfiability of a certain probabilistic variant of mso, called tmso + zero, reduces to the emptiness problem for zero automata. We introduce a variant of zero automata called nonzero automata. We prove that for every zero automaton there is an equivalent nonzero automaton of quadratic size and the emptiness problem of nonzero automata is decidable and both in NP and in coNP. These results imply that tmso + zero has decidable satisfiability.

Submitted 28 March, 2017; v1 submitted 22 February, 2017; originally announced February 2017.

arXiv:1702.01953 [pdf, ps, other]

A short proof of correctness of the quasi-polynomial time algorithm for parity games

Authors: Hugo Gimbert, Rasmus Ibsen-Jensen

Abstract: Recently Cristian S. Calude, Sanjay Jain, Bakhadyr Khoussainov, Wei Li and Frank Stephan proposed a quasi-polynomial time algorithm for parity games. This paper proposes a short proof of correctness of their algorithm.

Submitted 24 April, 2017; v1 submitted 7 February, 2017; originally announced February 2017.

arXiv:1612.03780 [pdf, other]

Online Reinforcement Learning for Real-Time Exploration in Continuous State and Action Markov Decision Processes

Authors: Ludovic Hofer, Hugo Gimbert

Abstract: This paper presents a new method to learn online policies in continuous state, continuous action, model-free Markov decision processes, with two properties that are crucial for practical applications. First, the policies are implementable with a very low computational cost: once the policy is computed, the action corresponding to a given state is obtained in logarithmic time with respect to the number of samples used. Second, our method is versatile: it does not rely on any a priori knowledge of the structure of optimal policies. We build upon the Fitted Q-iteration algorithm which represents the $Q$-value as the average of several regression trees. Our algorithm, the Fitted Policy Forest algorithm (FPF), computes a regression forest representing the $Q$-value and transforms it into a single tree representing the policy, while keeping control on the size of the policy using resampling and leaf merging. We introduce an adaptation of Multi-Resolution Exploration (MRE) which is particularly suited to FPF. We assess the performance of FPF on three classical benchmarks for reinforcement learning: the "Inverted Pendulum", the "Double Integrator" and "Car on the Hill" and show that FPF equals or outperforms other algorithms, although these algorithms rely on the use of particular representations of the policies, especially chosen in order to fit each of the three problems. Finally, we show that the combination of FPF and MRE makes it possible to find nearly optimal solutions in problems where $ε$-greedy approaches would fail.

Submitted 12 December, 2016; originally announced December 2016.

Journal ref: ICAPS 26th, PlanRob 4th (Workshop) (2016) 37-48

arXiv:1611.08487 [pdf, ps, other]

Pure and Stationary Optimal Strategies in Perfect-Information Stochastic Games with Global Preferences

Authors: Hugo Gimbert, Wieslaw Zielonka

Abstract: We examine the problem of the existence of optimal deterministic stationary strategies in two-player antagonistic (zero-sum) perfect information stochastic games with finitely many states and actions. We show that the existence of such strategies follows from the existence of optimal deterministic stationary strategies for some derived one-player games. Thus we reduce the problem from two-player to one-player games (Markov decision problems), where usually it is much easier to tackle. The reduction is very general: it holds not only for all possible payoff mappings but also in more general situations where players' preferences are not expressed by payoffs.

Submitted 25 November, 2016; originally announced November 2016.

arXiv:1605.07753 [pdf, ps, other]

Deciding Maxmin Reachability in Half-Blind Stochastic Games

Authors: Edon Kelmendi, Hugo Gimbert

Abstract: Two-player, turn-based, stochastic games with reachability conditions are considered, where the maximizer has no information (he is blind) and is restricted to deterministic strategies whereas the minimizer is perfectly informed. We ask the question of whether the game has maxmin 1, in other words we ask whether for all $ε>0$ there exists a deterministic strategy for the (blind) maximizer such that against all the strategies of the minimizer, it is possible to reach the set of final states with probability larger than $1-ε$. This problem is undecidable in general, but we define a class of games, called leaktight half-blind games, where the problem becomes decidable. We also show that mixed strategies in general are stronger for both players and that optimal strategies for the minimizer might require infinite memory.

Submitted 25 May, 2016; originally announced May 2016.

arXiv:1601.05176 [pdf, ps, other]

On the Control of Asynchronous Automata

Authors: Hugo Gimbert

Abstract: The decidability of the distributed version of the Ramadge and Wonham controller synthesis problem, where both the plant and the controllers are modeled as asynchronous automata and the controllers have causal memory, is a challenging open problem. There exist three classes of plants for which the existence of a correct controller with causal memory has been shown decidable: when the dependency graph of actions is series-parallel, when the processes are connectedly communicating and when the dependency graph of processes is a tree. We design a class of plants, called decomposable games, with a decidable controller synthesis problem. This provides a unified proof of the three existing decidability results as well as new examples of decidable plants.

Submitted 4 August, 2017; v1 submitted 20 January, 2016; originally announced January 2016.

arXiv:1504.04136 [pdf, ps, other]

doi 10.2168/LMCS-11(2:12)2015

Deciding the value 1 problem for probabilistic leaktight automata

Authors: Nathanaël Fijalkow, Hugo Gimbert, Edon Kelmendi, Youssouf Oualhadj

Abstract: The value 1 problem is a decision problem for probabilistic automata over finite words: given a probabilistic automaton, are there words accepted with probability arbitrarily close to 1? This problem was proved undecidable recently; to overcome this, several classes of probabilistic automata of different nature were proposed, for which the value 1 problem has been shown decidable. In this paper, we introduce yet another class of probabilistic automata, called leaktight automata, which strictly subsumes all classes of probabilistic automata whose value 1 problem is known to be decidable. We prove that for leaktight automata, the value 1 problem is decidable (in fact, PSPACE-complete) by constructing a saturation algorithm based on the computation of a monoid abstracting the behaviours of the automaton. We rely on algebraic techniques developed by Simon to prove that this abstraction is complete. Furthermore, we adapt this saturation algorithm to decide whether an automaton is leaktight. Finally, we show a reduction that extends our decidability results from finite words to infinite ones, implying that the value 1 problem for probabilistic leaktight parity automata is decidable.

Submitted 21 June, 2015; v1 submitted 16 April, 2015; originally announced April 2015.

Journal ref: Logical Methods in Computer Science, Volume 11, Issue 2 (June 23, 2015) lmcs:1572

arXiv:1406.4248 [pdf, ps, other]

doi 10.1214/14-AAP1095

On values of repeated games with signals

Authors: Hugo Gimbert, Jérôme Renault, Sylvain Sorin, Xavier Venel, Wiesław Zielonka

Abstract: We study the existence of different notions of value in two-person zero-sum repeated games where the state evolves and players receive signals. We provide some examples showing that the limsup value (and the uniform value) may not exist in general. Then we show the existence of the value for any Borel payoff function if the players observe a public signal including the actions played. We also prove two other positive results without assumptions on the signaling structure: the existence of the $\sup$ value in any game and the existence of the uniform value in recursive games with nonnegative payoffs.

Submitted 7 January, 2016; v1 submitted 17 June, 2014; originally announced June 2014.

Comments: Published at http://dx.doi.org/10.1214/14-AAP1095 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AAP-AAP1095

Journal ref: Annals of Applied Probability 2016, Vol. 26, No. 1, pp. 402-424

arXiv:1401.6575 [pdf, other]

Submixing and Shift-invariant Stochastic Games

Authors: Hugo Gimbert, Edon Kelmendi

Abstract: We consider zero-sum stochastic games with perfect information and finitely many states and actions. The payoff is computed by a function which associates to each infinite sequence of states and actions a real number. We prove that if the payoff function is both shift-invariant and submixing, then the game is half-positional, i.e. the first player has an optimal strategy which is both deterministic and stationary. This result relies on the existence of epsilon-subgame-perfect strategies in shift-invariant games, a second contribution of the paper. The techniques can be used to establish a third result: for shift-invariant and submixing payoff functions, the existence of finite-memory strategies for player 2 in one-player games implies the same property for two-player games as well.

Submitted 28 March, 2022; v1 submitted 25 January, 2014; originally announced January 2014.

arXiv:1205.6346 [pdf, ps, other]

doi 10.2168/LMCS-9(1:7)2013

On (Subgame Perfect) Secure Equilibrium in Quantitative Reachability Games

Authors: Thomas Brihaye, Véronique Bruyère, Julie De Pril, Hugo Gimbert

Abstract: We study turn-based quantitative multiplayer non zero-sum games played on finite graphs with reachability objectives. In such games, each player aims at reaching his own goal set of states as soon as possible. A previous work on this model showed that Nash equilibria (resp. secure equilibria) are guaranteed to exist in the multiplayer (resp. two-player) case. The existence of secure equilibria in the multiplayer case remained and is still an open problem. In this paper, we focus our study on the concept of subgame perfect equilibrium, a refinement of Nash equilibrium well-suited in the framework of games played on graphs. We also introduce the new concept of subgame perfect secure equilibrium. We prove the existence of subgame perfect equilibria (resp. subgame perfect secure equilibria) in multiplayer (resp. two-player) quantitative reachability games. Moreover, we provide an algorithm deciding the existence of secure equilibria in the multiplayer case.

Submitted 26 February, 2013; v1 submitted 29 May, 2012; originally announced May 2012.

Comments: 32 pages. Full version of the FoSSaCS 2012 proceedings paper

ACM Class: D.2.4

Journal ref: Logical Methods in Computer Science, Volume 9, Issue 1 (February 28, 2013) lmcs:790

arXiv:1204.0077 [pdf, other]

Asynchronous Games over Tree Architectures

Authors: Blaise Genest, Hugo Gimbert, Anca Muscholl, Igor Walukiewicz

Abstract: We consider the task of controlling in a distributed way a Zielonka asynchronous automaton. Every process of a controller has access to its causal past to determine the next set of actions it proposes to play. An action can be played only if every process controlling this action proposes to play it. We consider reachability objectives: every process should reach its set of final states. We show that this control problem is decidable for tree architectures, where every process can communicate with its parent, its children, and with the environment. The complexity of our algorithm is l-fold exponential with l being the height of the tree representing the architecture. We show that this is unavoidable by showing that even for three processes the problem is EXPTIME-complete, and that it is non-elementary in general.

Submitted 15 February, 2013; v1 submitted 31 March, 2012; originally announced April 2012.

arXiv:1104.3055 [pdf, ps, other]

Deciding the Value 1 Problem of Probabilistic Leaktight Automata

Authors: Nathanaël Fijalkow, Hugo Gimbert, Youssouf Oualhadj

Abstract: The value 1 problem is a decision problem for probabilistic automata over finite words: given a probabilistic automaton A, are there words accepted by A with probability arbitrarily close to 1? This problem was proved undecidable recently. We sharpen this result, showing that the undecidability result holds even if the probabilistic automata have only one probabilistic transition. Our main contribution is to introduce a new class of probabilistic automata, called leaktight automata, for which the value 1 problem is shown decidable (and PSPACE-complete). We construct an algorithm based on the computation of a monoid abstracting the behaviours of the automaton, and rely on algebraic techniques developed by Simon for the correctness proof. The class of leaktight automata is decidable in PSPACE, subsumes all subclasses of probabilistic automata whose value 1 problem is known to be decidable (in particular deterministic automata), and is closed under two natural composition operators.

Submitted 26 January, 2012; v1 submitted 14 April, 2011; originally announced April 2011.

Comments: arXiv admin note: significant text overlap with arXiv:1104.3054

arXiv:1104.3054 [pdf, ps, other]

Pushing undecidability of the isolation problem for probabilistic automata

Authors: Nathanaël Fijalkow, Hugo Gimbert, Youssouf Oualhadj

Abstract: This short note aims at proving that the isolation problem is undecidable for probabilistic automata with only one probabilistic transition. This problem is known to be undecidable for general probabilistic automata, without restriction on the number of probabilistic transitions. In this note, we develop a simulation technique that allows to simulate any probabilistic automaton with one having only one probabilistic transition.

Submitted 14 April, 2011; originally announced April 2011.

arXiv:1006.1402 [pdf, ps, other]

doi 10.4204/EPTCS.25.5

Blackwell-Optimal Strategies in Priority Mean-Payoff Games

Authors: Hugo Gimbert, Wiesław Zielonka

Abstract: We examine perfect information stochastic mean-payoff games - a class of games containing as special sub-classes the usual mean-payoff games and parity games. We show that deterministic memoryless strategies that are optimal for discounted games with state-dependent discount factors close to 1 are optimal for priority mean-payoff games establishing a strong link between these two classes.

Submitted 7 June, 2010; originally announced June 2010.

Journal ref: EPTCS 25, 2010, pp. 7-21

arXiv:1006.0673 [pdf, ps, other]

doi 10.1007/978-3-642-15155-2_23

Randomness for Free

Authors: Krishnendu Chatterjee, Laurent Doyen, Hugo Gimbert, Thomas A. Henzinger

Abstract: We consider two-player zero-sum games on graphs. These games can be classified on the basis of the information of the players and on the mode of interaction between them. On the basis of information the classification is as follows: (a) partial-observation (both players have partial view of the game); (b) one-sided complete-observation (one player has complete observation); and (c) complete-observation (both players have complete view of the game). On the basis of mode of interaction we have the following classification: (a) concurrent (both players interact simultaneously); and (b) turn-based (both players interact in turn). The two sources of randomness in these games are randomness in transition function and randomness in strategies. In general, randomized strategies are more powerful than deterministic strategies, and randomness in transitions gives more general classes of games. In this work we present a complete characterization for the classes of games where randomness is not helpful in: (a) the transition function (probabilistic transition can be simulated by deterministic transition); and (b) strategies (pure strategies are as powerful as randomized strategies). As consequence of our characterization we obtain new undecidability results for these games.

Submitted 30 September, 2014; v1 submitted 3 June, 2010; originally announced June 2010.

arXiv:0811.3978 [pdf, ps, other]

Optimal Strategies in Perfect-Information Stochastic Games with Tail Winning Conditions

Authors: Hugo Gimbert, Florian Horn

Abstract: We prove that optimal strategies exist in every perfect-information stochastic game with finitely many states and actions and a tail winning condition.

Submitted 19 November, 2013; v1 submitted 24 November, 2008; originally announced November 2008.

arXiv:0811.3975 [pdf, ps, other]

Determinacy and Decidability of Reachability Games with Partial Observation on Both Sides

Authors: Nathalie Bertrand, Blaise Genest, Hugo Gimbert

Abstract: We prove two determinacy and decidability results about two-players stochastic reachability games with partial observation on both sides and finitely many states, signals and actions.

Submitted 24 November, 2008; originally announced November 2008.

arXiv:0712.1765 [pdf, ps, other]

doi 10.2168/LMCS-5(2:9)2009

Solving Simple Stochastic Games with Few Random Vertices

Authors: Hugo Gimbert, Florian Horn

Abstract: Simple stochastic games are two-player zero-sum stochastic games with turn-based moves, perfect information, and reachability winning conditions. We present two new algorithms computing the values of simple stochastic games. Both of them rely on the existence of optimal permutation strategies, a class of positional strategies derived from permutations of the random vertices. The "permutation-enumeration" algorithm performs an exhaustive search among these strategies, while the "permutation-improvement" algorithm is based on successive improvements, à la Hoffman-Karp. Our algorithms improve previously known algorithms in several aspects. First they run in polynomial time when the number of random vertices is fixed, so the problem of solving simple stochastic games is fixed-parameter tractable when the parameter is the number of random vertices. Furthermore, our algorithms do not require the input game to be transformed into a stopping game. Finally, the permutation-enumeration algorithm does not use linear programming, while the permutation-improvement algorithm may run in polynomial time.

Submitted 25 May, 2009; v1 submitted 11 December, 2007; originally announced December 2007.

ACM Class: I.2.1; G.3

Journal ref: Logical Methods in Computer Science, Volume 5, Issue 2 (May 25, 2009) lmcs:1119

Showing 1–30 of 30 results for author: Gimbert, H