Skip to main content

Showing 1–22 of 22 results for author: Ziliotto, B

.
  1. arXiv:2406.18404  [pdf, ps, other

    math.AP

    Stochastic homogenization of HJ equations: a differential game approach

    Authors: Andrea Davini, Raimundo Saona, Bruno Ziliotto

    Abstract: We prove stochastic homogenization for a class of non-convex and non-coercive first-order Hamilton-Jacobi equations in a finite-range of dependence environment for Hamiltonians that can be expressed by a max-min formula. We make use of the representation of the solution as a value function of a differential game to implement a game-theoretic approach to the homogenization problem.

    Submitted 26 June, 2024; originally announced June 2024.

    MSC Class: 35B27; 35F21; 91A23

  2. arXiv:2405.12583  [pdf, ps, other

    math.OC cs.CC

    Ergodic Unobservable MDPs: Decidability of Approximation

    Authors: Krishnendu Chatterjee, David Lurie, Raimundo Saona, Bruno Ziliotto

    Abstract: Unobservable Markov decision processes (UMDPs) serve as a prominent mathematical framework for modeling sequential decision-making problems. A key aspect in computational analysis is the consideration of decidability, which concerns the existence of algorithms. In general, the computation of the exact and approximated values is undecidable for UMDPs with the long-run average objective. Building on… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    MSC Class: 90C40; 49M25; 90C59; 91A68; 68W25

  3. arXiv:2401.17696  [pdf, ps, other

    math.OC math.AP

    Bayesian Learning in Mean Field Games

    Authors: Eran Shmaya, Bruno Ziliotto

    Abstract: We consider a mean-field game model where the cost functions depend on a fixed parameter, called \textit{state}, which is unknown to players. Players learn about the state from a a stream of private signals they receive throughout the game. We derive a mean field system satisfied by the equilibrium payoff of the game and prove existence of a solution under standard regularity assumptions. Addition… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    MSC Class: 91A16; 91A27; 91A26

  4. arXiv:2401.16252  [pdf, other

    math.OC

    Zero-sum Random Games on Directed Graphs

    Authors: Luc Attia, Lyuben Lichev, Dieter Mitsche, Raimundo Saona, Bruno Ziliotto

    Abstract: This paper considers a class of two-player zero-sum games on directed graphs whose vertices are equipped with random payoffs of bounded support known by both players. Starting from a fixed vertex, players take turns to move a token along the edges of the graph. On the one hand, for acyclic directed graphs of bounded degree and sub-exponential expansion, we show that the value of the game conve… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    MSC Class: 91A15; 91A43

  5. arXiv:2311.09141  [pdf, ps, other

    cs.DS

    Prophet Inequalities Require Only a Constant Number of Samples

    Authors: Andrés Cristi, Bruno Ziliotto

    Abstract: In a prophet inequality problem, $n$ independent random variables are presented to a gambler one by one. The gambler decides when to stop the sequence and obtains the most recent value as reward. We evaluate a stop** rule by the worst-case ratio between its expected reward and the expectation of the maximum variable. In the classic setting, the order is fixed, and the optimal ratio is known to b… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  6. arXiv:2303.04956  [pdf, ps, other

    math.OC cs.GT cs.LG

    Blackwell's Approachability with Time-Dependent Outcome Functions and Dot Products. Application to the Big Match

    Authors: Joon Kwon, Bruno Ziliotto

    Abstract: Blackwell's approachability is a very general sequential decision framework where a Decision Maker obtains vector-valued outcomes, and aims at the convergence of the average outcome to a given "target" set. Blackwell gave a sufficient condition for the decision maker having a strategy guaranteeing such a convergence against an adversarial environment, as well as what we now call the Blackwell's al… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

    MSC Class: 91A15; 68W27

  7. arXiv:2110.12227  [pdf, other

    math.OC math.AP

    Percolation games

    Authors: Guillaume Garnier, Bruno Ziliotto

    Abstract: This paper introduces a discrete-time stochastic game class on $\mathbb{Z}^d$, which plays the role of a toy model for the well-known problem of stochastic homogenization of Hamilton-Jacobi equations. Conditions are provided under which the $n$-stage game value converges as $n$ tends to infinity, and connections with homogenization theory is discussed.

    Submitted 17 December, 2021; v1 submitted 23 October, 2021; originally announced October 2021.

    MSC Class: 91A15; 35F21; 49L12; 60K35; 35B27

  8. arXiv:2106.09405  [pdf, other

    math.OC

    Mertens conjectures in absorbing games with incomplete information

    Authors: Bruno Ziliotto

    Abstract: In a zero-sum stochastic game with signals, at each stage, two adversary players take decisions and receive a stage payoff determined by these decisions and a variable called state. The state follows a Markov chain, that is controlled by both players. Actions and states are imperfectly observed by players, who receive a private signal at each stage. Mertens (ICM 1986) conjectured two properties re… ▽ More

    Submitted 1 December, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    MSC Class: 91A15; 60G42

  9. arXiv:2007.06110  [pdf, ps, other

    cs.DS cs.GT

    Unknown I.I.D. Prophets: Better Bounds, Streaming Algorithms, and a New Impossibility

    Authors: José Correa, Paul Dütting, Felix Fischer, Kevin Schewior, Bruno Ziliotto

    Abstract: A prophet inequality states, for some $α\in[0,1]$, that the expected value achievable by a gambler who sequentially observes random variables $X_1,\dots,X_n$ and selects one of them is at least an $α$ fraction of the maximum value in the sequence. We obtain three distinct improvements for a setting that was first studied by Correa et al. (EC, 2019) and is particularly relevant to modern applicatio… ▽ More

    Submitted 20 November, 2020; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted to ITCS 2021

  10. arXiv:2004.08844  [pdf, ps, other

    math.OC

    History-dependent evaluations in POMDPs

    Authors: Xavier Venel, Bruno Ziliotto

    Abstract: We consider POMDPs in which the weight of the stage payoff depends on the past sequence of signals and actions occurring in the infinitely repeated problem. We prove that for all epsilon>0, there exists a strategy that is epsilon-optimal for any sequence of weights satisfying a property that interprets as "the decision-maker is patient enough". This unifies and generalizes several results of the l… ▽ More

    Submitted 19 April, 2020; originally announced April 2020.

    MSC Class: 90C39; 90C40; 37A50; 60J20

  11. arXiv:1905.07295  [pdf, other

    math.AP math.OC

    An example of failure of stochastic homogenization for viscous Hamilton-Jacobi equations without convexity

    Authors: William M. Feldman, Jean-Baptiste Fermanian, Bruno Ziliotto

    Abstract: We give an example of the failure of homogenization for a viscous Hamilton-Jacobi equation with non-convex Hamiltonian.

    Submitted 17 May, 2019; originally announced May 2019.

    MSC Class: 35B27; 35F21; 91A23

  12. Finite-Memory Strategies in POMDPs with Long-Run Average Objectives

    Authors: Krishnendu Chatterjee, Raimundo Saona, Bruno Ziliotto

    Abstract: Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with probabilistic and nondeterministic behaviour in uncertain environments. We prove that in POMDPs with long-run average objective, the decision maker has approximately optimal strategies with finite memory. This implies notably that approximating the long-run value is recursively enumerable, as well… ▽ More

    Submitted 28 September, 2022; v1 submitted 30 April, 2019; originally announced April 2019.

    MSC Class: 90C39; 90C40; 37A50

  13. Constant payoff in zero-sum stochastic games

    Authors: Olivier Catoni, Miquel Oliu-Barton, Bruno Ziliotto

    Abstract: In a zero-sum stochastic game, at each stage, two adversary players take decisions and receive a stage payoff determined by them and by a controlled random variable representing the state of nature. The total payoff is the normalized discounted sum of the stage payoffs. In this paper we solve the "constant payoff" conjecture formulated by Sorin, Vigeral and Venel (2010): if both players use optima… ▽ More

    Submitted 5 May, 2022; v1 submitted 11 November, 2018; originally announced November 2018.

    MSC Class: 91A05; 91A15; 91A50; 60J10

    Journal ref: Ann. Inst. H. Poincaré Probab. Statist. 57(4): 1888-1900, 2021

  14. arXiv:1807.07483  [pdf, ps, other

    cs.DS

    Prophet Secretary Through Blind Strategies

    Authors: Jose Correa, Raimundo Saona, Bruno Ziliotto

    Abstract: In the classic prophet inequality, samples from independent random variables arrive online. A gambler that knows the distributions must decide at each point in time whether to stop and pick the current sample or to continue and lose that sample forever. The goal of the gambler is to maximize the expected value of what she picks and the performance measure is the worst case ratio between the expect… ▽ More

    Submitted 12 March, 2019; v1 submitted 19 July, 2018; originally announced July 2018.

  15. arXiv:1801.06008  [pdf, ps, other

    math.AP math.OC

    Convergence of the solutions of the discounted Hamilton-Jacobi equation: a counterexample

    Authors: Bruno Ziliotto

    Abstract: This paper provides a counterexample about the asymptotic behavior of the solutions of a discounted Hamilton-Jacobi equation, as the discount factor vanishes. The Hamiltonian of the equation is a 1-dimensional continuous and coercive Hamiltonian.

    Submitted 19 January, 2018; v1 submitted 18 January, 2018; originally announced January 2018.

    MSC Class: 35B40; 49L25; 91A15; 91A25; 91A50; 91A23

  16. arXiv:1609.02175  [pdf, ps, other

    math.OC

    Tauberian theorems for general iterations of operators: applications to zero-sum stochastic games

    Authors: Bruno Ziliotto

    Abstract: This paper proves several Tauberian theorems for general iterations of operators, and provides two applications to zero-sum stochastic games where the total payoff is a weighted sum of the stage payoffs. The first application is to provide conditions under which the existence of the asymptotic value implies the convergence of the values of the weighted game, as players get more and more patient. T… ▽ More

    Submitted 7 September, 2016; originally announced September 2016.

    MSC Class: 47N10; 91A05; 91A15; 91A20; 91A25

  17. Stochastic homogenization of nonconvex Hamilton-Jacobi equations: a counterexample

    Authors: Bruno Ziliotto

    Abstract: We provide an example of a Hamilton-Jacobi equation in which stochastic homogenization does not occur. The Hamiltonian involved in this example satisfies the standard assumptions of the literature, except that it is not convex.

    Submitted 7 September, 2016; v1 submitted 20 December, 2015; originally announced December 2015.

    MSC Class: 35B27; 35F21; 91A23

  18. arXiv:1505.07495  [pdf, ps, other

    math.OC

    Pathwise uniform value in gambling houses and Partially Observable Markov Decision Processes

    Authors: Xavier Venel, Bruno Ziliotto

    Abstract: In several standard models of dynamic programming (gambling houses, MDPs, POMDPs), we prove the existence of a very robust notion of value for the infinitely repeated problem, namely the pathwise uniform value. This solves two open problems. First, this shows that for any epsilon>0, the decision-maker has a pure strategy sigma which is epsilon-optimal in any n-stage game, provided that n is big en… ▽ More

    Submitted 8 September, 2015; v1 submitted 27 May, 2015; originally announced May 2015.

    MSC Class: 90C39; 90C40; 37A50; 47N10

  19. arXiv:1501.06525  [pdf, ps, other

    math.OC

    A Tauberian theorem for nonexpansive operators and applications to zero-sum stochastic games

    Authors: Bruno Ziliotto

    Abstract: We prove a Tauberian theorem for nonexpansive operators, and apply it to the model of zero-sum stochastic game. Under mild assumptions, we prove that the value of the lambda-discounted game v_{lambda} converges uniformly when lambda goes to 0 if and only if the value of the n-stage game v_n converges uniformly when n goes to infinity. This generalizes the Tauberian theorem of Lehrer and Sorin (199… ▽ More

    Submitted 23 February, 2015; v1 submitted 26 January, 2015; originally announced January 2015.

    MSC Class: 47N10; 91A05; 91A15; 91A20; 91A25;

  20. arXiv:1410.5231  [pdf, ps, other

    math.OC

    General limit value in zero-sum stochastic games

    Authors: Bruno Ziliotto

    Abstract: Bewley and Kohlberg (1976) and Mertens and Neyman (1981) have proved, respectively, the existence of the asymptotic value and the uniform value in zero-sum stochastic games with finite state space and finite action sets. In their work, the total payoff in a stochastic game is defined either as a Cesaro mean or an Abel mean of the stage payoffs. This paper presents two findings: first, we generaliz… ▽ More

    Submitted 11 November, 2015; v1 submitted 20 October, 2014; originally announced October 2014.

    MSC Class: 91A05; 91A10; 91A15; 91A20; 91A25; 91A50

  21. arXiv:1407.3028  [pdf, ps, other

    math.OC cs.GT

    Hidden Stochastic Games and Limit Equilibrium Payoffs

    Authors: Jérôme Renault, Bruno Ziliotto

    Abstract: We consider 2-player stochastic games with perfectly observed actions, and study the limit, as the discount factor goes to one, of the equilibrium payoffs set. In the usual setup where current states are observed by the players, we show that the set of stationary equilibrium payoffs always converges, and provide a simple example where the set of equilibrium payoffs has no limit. We then introduce… ▽ More

    Submitted 10 December, 2014; v1 submitted 11 July, 2014; originally announced July 2014.

    MSC Class: 91A05; 91A15; 91A20; 91A28

  22. arXiv:1305.4778  [pdf, ps, other

    math.OC cs.LG

    Zero-sum repeated games: Counterexamples to the existence of the asymptotic value and the conjecture $\operatorname{maxmin}=\operatorname{lim}v_n$

    Authors: Bruno Ziliotto

    Abstract: Mertens [In Proceedings of the International Congress of Mathematicians (Berkeley, Calif., 1986) (1987) 1528-1577 Amer. Math. Soc.] proposed two general conjectures about repeated games: the first one is that, in any two-person zero-sum repeated game, the asymptotic value exists, and the second one is that, when Player 1 is more informed than Player 2, in the long run Player 1 is able to guarantee… ▽ More

    Submitted 15 March, 2016; v1 submitted 21 May, 2013; originally announced May 2013.

    Comments: Published at http://dx.doi.org/10.1214/14-AOP997 in the Annals of Probability (http://www.imstat.org/aop/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOP-AOP997 MSC Class: 91A20 (Primary); 91A05; 91A15 (Secondary)

    Journal ref: Annals of Probability 2016, Vol. 44, No. 2, 1107-1133