Skip to main content

Showing 1–6 of 6 results for author: Lecarpentier, É

.
  1. arXiv:2106.11655  [pdf, other

    cs.LG cs.AI

    DARTS-PRIME: Regularization and Scheduling Improve Constrained Optimization in Differentiable NAS

    Authors: Kaitlin Maile, Erwan Lecarpentier, Hervé Luga, Dennis G. Wilson

    Abstract: Differentiable Architecture Search (DARTS) is a recent neural architecture search (NAS) method based on a differentiable relaxation. Due to its success, numerous variants analyzing and improving parts of the DARTS framework have recently been proposed. By considering the problem as a constrained bilevel optimization, we present and analyze DARTS-PRIME, a variant including improvements to architect… ▽ More

    Submitted 18 October, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

  2. arXiv:2005.14186  [pdf, other

    stat.AP cs.SI math.DS physics.soc-ph q-bio.PE

    Understanding and monitoring the evolution of the Covid-19 epidemic from medical emergency calls: the example of the Paris area

    Authors: Stéphane Gaubert, Marianne Akian, Xavier Allamigeon, Marin Boyet, Baptiste Colin, Théotime Grohens, Laurent Massoulié, David P. Parsons, Frédéric Adnet, Érick Chanzy, Laurent Goix, Frédéric Lapostolle, Éric Lecarpentier, Christophe Leroy, Thomas Loeb, Jean-Sébastien Marx, Caroline Télion, Laurent Tréluyer, Pierre Carli

    Abstract: We portray the evolution of the Covid-19 epidemic during the crisis of March-April 2020 in the Paris area, by analyzing the medical emergency calls received by the EMS of the four central departments of this area (Centre 15 of SAMU 75, 92, 93 and 94). Our study reveals strong dissimilarities between these departments. We show that the logarithm of each epidemic observable can be approximated by a… ▽ More

    Submitted 20 July, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: Changes v1->v2: Section 7 expanded. Changes v2->v3: bibliography expanded; minor improvements and corrections

    Journal ref: Comptes Rendus -- Mathématique, Volume 358, issue 7 (2020), p. 843-875

  3. arXiv:2001.05411  [pdf, other

    cs.LG cs.AI stat.ML

    Lipschitz Lifelong Reinforcement Learning

    Authors: Erwan Lecarpentier, David Abel, Kavosh Asadi, Yuu **nai, Emmanuel Rachelson, Michael L. Littman

    Abstract: We consider the problem of knowledge transfer when an agent is facing a series of Reinforcement Learning (RL) tasks. We introduce a novel metric between Markov Decision Processes (MDPs) and establish that close MDPs have close optimal value functions. Formally, the optimal value functions are Lipschitz continuous with respect to the tasks space. These theoretical results lead us to a value-transfe… ▽ More

    Submitted 22 March, 2021; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: In proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), 21 pages, 11 figures

  4. arXiv:1904.10090  [pdf, other

    cs.LG stat.ML

    Non-Stationary Markov Decision Processes, a Worst-Case Approach using Model-Based Reinforcement Learning, Extended version

    Authors: Erwan Lecarpentier, Emmanuel Rachelson

    Abstract: This work tackles the problem of robust zero-shot planning in non-stationary stochastic environments. We study Markov Decision Processes (MDPs) evolving over time and consider Model-Based Reinforcement Learning algorithms in this setting. We make two hypotheses: 1) the environment evolves continuously with a bounded evolution rate; 2) a current model is known at each decision epoch but not its evo… ▽ More

    Submitted 15 January, 2020; v1 submitted 22 April, 2019; originally announced April 2019.

    Comments: Published at NeurIPS 2019, 17 pages, 3 figures

    Journal ref: year: 2019; page range: 7214--7223

  5. arXiv:1805.01367  [pdf, other

    cs.LG stat.ML

    Open Loop Execution of Tree-Search Algorithms, extended version

    Authors: Erwan Lecarpentier, Guillaume Infantes, Charles Lesire, Emmanuel Rachelson

    Abstract: In the context of tree-search stochastic planning algorithms where a generative model is available, we consider on-line planning algorithms building trees in order to recommend an action. We investigate the question of avoiding re-planning in subsequent decision steps by directly using sub-trees as action recommender. Firstly, we propose a method for open loop control via a new algorithm taking th… ▽ More

    Submitted 12 February, 2019; v1 submitted 3 May, 2018; originally announced May 2018.

    Comments: 10 pages, 10 figures

    Journal ref: 27th International Joint Conference on Artificial Intelligence (IJCAI 2018)

  6. arXiv:1707.05668  [pdf, other

    cs.LG

    Empirical evaluation of a Q-Learning Algorithm for Model-free Autonomous Soaring

    Authors: Erwan Lecarpentier, Sebastian Rapp, Marc Melo, Emmanuel Rachelson

    Abstract: Autonomous unpowered flight is a challenge for control and guidance systems: all the energy the aircraft might use during flight has to be harvested directly from the atmosphere. We investigate the design of an algorithm that optimizes the closed-loop control of a glider's bank and sideslip angles, while flying in the lower convective layer of the atmosphere in order to increase its mission endura… ▽ More

    Submitted 18 July, 2017; originally announced July 2017.