Showing 1–4 of 4 results for author: Boone, V

Searching in archive eess.
  1. arXiv:2406.01234  [pdf, other]

    cs.LG eess.SY math.OC stat.ML

    Achieving Tractable Minimax Optimal Regret in Average Reward MDPs

    Authors: Victor Boone, Zihan Zhang

    Abstract: In recent years, significant attention has been directed towards learning average-reward Markov Decision Processes (MDPs). However, existing algorithms either suffer from sub-optimal regret guarantees or computational inefficiencies. In this paper, we present the first tractable algorithm with minimax optimal regret of $\widetilde{\mathrm{O}}(\sqrt{\mathrm{sp}(h^*) S A T})$, where…

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2311.18437  [pdf, other]

    cs.LG eess.SY math.OC stat.ML

    The Sliding Regret in Stochastic Bandits: Discriminating Index and Randomized Policies

    Authors: Victor Boone

    Abstract: This paper studies the one-shot behavior of no-regret algorithms for stochastic bandits. Although many algorithms are known to be asymptotically optimal with respect to the expected regret, over a single run, their pseudo-regret seems to follow one of two tendencies: it is either smooth or bumpy. To measure this tendency, we introduce a new notion: the sliding regret, that measures the worst pseud…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 31 pages

  3. arXiv:2304.08048  [pdf, ps, other]

    eess.SY math.OC

    When do discounted-optimal policies also optimize the gain?

    Authors: Victor Boone

    Abstract: In this technical note, we establish an upper bound on the threshold on the discount factor starting from which all discounted-optimal deterministic policies are gain-optimal, which we prove to be tight on an example. To address computability issues of that theoretical threshold, we provide a weaker bound that can be computed in polynomial time on ergodic MDPs.

    Submitted 17 April, 2023; originally announced April 2023.

  4. arXiv:2108.12127  [pdf, other]

    eess.SY

    Towards model predictive control of supercritical CO2 cycles

    Authors: Viv Bone, Michael Kearney, Ingo Jahn

    Abstract: Control of non-condensing non-ideal-gas power cycles is challenging because their output power dynamics depend on complex system interactions, non-ideal-gas effects complicate turbomachinery behavior, and state constraints must be respected. This article presents a control methodology for these systems, comprising a control modeling approach and model predictive control (MPC) strategy. This method…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: 26 pages, 11 figures