Showing 1–8 of 8 results for author: Ziliotto, B

  1. arXiv:2405.12583

    math.OC cs.CC

    Ergodic Unobservable MDPs: Decidability of Approximation

    Authors: Krishnendu Chatterjee, David Lurie, Raimundo Saona, Bruno Ziliotto

    Abstract: Unobservable Markov decision processes (UMDPs) serve as a prominent mathematical framework for modeling sequential decision-making problems. A key aspect in computational analysis is the consideration of decidability, which concerns the existence of algorithms. In general, the computation of the exact and approximated values is undecidable for UMDPs with the long-run average objective. Building on…

    Submitted 21 May, 2024; originally announced May 2024.

    MSC Class: 90C40; 49M25; 90C59; 91A68; 68W25

  2. arXiv:2311.09141

    cs.DS

    Prophet Inequalities Require Only a Constant Number of Samples

    Authors: Andrés Cristi, Bruno Ziliotto

    Abstract: In a prophet inequality problem, $n$ independent random variables are presented to a gambler one by one. The gambler decides when to stop the sequence and obtains the most recent value as reward. We evaluate a stopping rule by the worst-case ratio between its expected reward and the expectation of the maximum variable. In the classic setting, the order is fixed, and the optimal ratio is known to b…

    Submitted 15 November, 2023; originally announced November 2023.

  3. arXiv:2303.04956

    math.OC cs.GT cs.LG

    Blackwell's Approachability with Time-Dependent Outcome Functions and Dot Products. Application to the Big Match

    Authors: Joon Kwon, Bruno Ziliotto

    Abstract: Blackwell's approachability is a very general sequential decision framework where a Decision Maker obtains vector-valued outcomes, and aims at the convergence of the average outcome to a given "target" set. Blackwell gave a sufficient condition for the decision maker having a strategy guaranteeing such a convergence against an adversarial environment, as well as what we now call the Blackwell's al…

    Submitted 8 March, 2023; originally announced March 2023.

    MSC Class: 91A15; 68W27

  4. arXiv:2007.06110

    cs.DS cs.GT

    Unknown I.I.D. Prophets: Better Bounds, Streaming Algorithms, and a New Impossibility

    Authors: José Correa, Paul Dütting, Felix Fischer, Kevin Schewior, Bruno Ziliotto

    Abstract: A prophet inequality states, for some $α\in[0,1]$, that the expected value achievable by a gambler who sequentially observes random variables $X_1,\dots,X_n$ and selects one of them is at least an $α$ fraction of the maximum value in the sequence. We obtain three distinct improvements for a setting that was first studied by Correa et al. (EC, 2019) and is particularly relevant to modern applicatio…

    Submitted 20 November, 2020; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted to ITCS 2021

  5. Finite-Memory Strategies in POMDPs with Long-Run Average Objectives

    Authors: Krishnendu Chatterjee, Raimundo Saona, Bruno Ziliotto

    Abstract: Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with probabilistic and nondeterministic behaviour in uncertain environments. We prove that in POMDPs with long-run average objective, the decision maker has approximately optimal strategies with finite memory. This implies notably that approximating the long-run value is recursively enumerable, as well…

    Submitted 28 September, 2022; v1 submitted 30 April, 2019; originally announced April 2019.

    MSC Class: 90C39; 90C40; 37A50

  6. arXiv:1807.07483

    cs.DS

    Prophet Secretary Through Blind Strategies

    Authors: Jose Correa, Raimundo Saona, Bruno Ziliotto

    Abstract: In the classic prophet inequality, samples from independent random variables arrive online. A gambler that knows the distributions must decide at each point in time whether to stop and pick the current sample or to continue and lose that sample forever. The goal of the gambler is to maximize the expected value of what she picks and the performance measure is the worst case ratio between the expect…

    Submitted 12 March, 2019; v1 submitted 19 July, 2018; originally announced July 2018.

  7. arXiv:1407.3028

    math.OC cs.GT

    Hidden Stochastic Games and Limit Equilibrium Payoffs

    Authors: Jérôme Renault, Bruno Ziliotto

    Abstract: We consider 2-player stochastic games with perfectly observed actions, and study the limit, as the discount factor goes to one, of the equilibrium payoffs set. In the usual setup where current states are observed by the players, we show that the set of stationary equilibrium payoffs always converges, and provide a simple example where the set of equilibrium payoffs has no limit. We then introduce…

    Submitted 10 December, 2014; v1 submitted 11 July, 2014; originally announced July 2014.

    MSC Class: 91A05; 91A15; 91A20; 91A28

  8. arXiv:1305.4778

    math.OC cs.LG

    Zero-sum repeated games: Counterexamples to the existence of the asymptotic value and the conjecture $\operatorname{maxmin}=\operatorname{lim}v_n$

    Authors: Bruno Ziliotto

    Abstract: Mertens [In Proceedings of the International Congress of Mathematicians (Berkeley, Calif., 1986) (1987) 1528-1577 Amer. Math. Soc.] proposed two general conjectures about repeated games: the first one is that, in any two-person zero-sum repeated game, the asymptotic value exists, and the second one is that, when Player 1 is more informed than Player 2, in the long run Player 1 is able to guarantee…

    Submitted 15 March, 2016; v1 submitted 21 May, 2013; originally announced May 2013.

    Comments: Published at http://dx.doi.org/10.1214/14-AOP997 in the Annals of Probability (http://www.imstat.org/aop/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOP-AOP997

    MSC Class: 91A20 (Primary); 91A05; 91A15 (Secondary)

    Journal ref: Annals of Probability 2016, Vol. 44, No. 2, 1107-1133