Skip to main content

Showing 1–13 of 13 results for author: Perrin-Gilbert, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16159  [pdf, other

    cs.LG cs.AI

    AFU: Actor-Free critic Updates in off-policy RL for continuous control

    Authors: Nicolas Perrin-Gilbert

    Abstract: This paper presents AFU, an off-policy deep RL algorithm addressing in a new way the challenging "max-Q problem" in Q-learning for continuous action spaces, with a solution based on regression and conditional gradient scaling. AFU has an actor but its critic updates are entirely independent from it. As a consequence, the actor can be chosen freely. In the initial version, AFU-alpha, we employ the… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 19 pages, 13 figures

  2. arXiv:2403.01078  [pdf, other

    cs.LG cs.AI physics.bio-ph q-bio.GN

    $Γ$-VAE: Curvature regularized variational autoencoders for uncovering emergent low dimensional geometric structure in high dimensional data

    Authors: Jason Z. Kim, Nicolas Perrin-Gilbert, Erkan Narmanli, Paul Klein, Christopher R. Myers, Itai Cohen, Joshua J. Waterfall, James P. Sethna

    Abstract: Natural systems with emergent behaviors often organize along low-dimensional subsets of high-dimensional spaces. For example, despite the tens of thousands of genes in the human genome, the principled study of genomics is fruitful because biological processes rely on coordinated organization that results in lower dimensional phenotypes. To uncover this organization, many nonlinear dimensionality r… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures

  3. arXiv:2402.09355  [pdf, other

    cs.RO cs.AI

    Single-Reset Divide & Conquer Imitation Learning

    Authors: Alexandre Chenu, Olivier Serris, Olivier Sigaud, Nicolas Perrin-Gilbert

    Abstract: Demonstrations are commonly used to speed up the learning process of Deep Reinforcement Learning algorithms. To cope with the difficulty of accessing multiple demonstrations, some algorithms have been developed to learn from a single demonstration. In particular, the Divide & Conquer Imitation Learning algorithms leverage a sequential bias to learn a control policy for complex robotic tasks using… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  4. arXiv:2311.00344  [pdf, other

    cs.AI

    A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

    Authors: Olivier Sigaud, Gianluca Baldassarre, Cedric Colas, Stephane Doncieux, Richard Duro, Pierre-Yves Oudeyer, Nicolas Perrin-Gilbert, Vieri Giuliano Santucci

    Abstract: A lot of recent machine learning research papers have ``open-ended learning'' in their title. But very few of them attempt to define what they mean when using the term. Even worse, when looking more closely there seems to be no consensus on what distinguishes open-ended learning from related concepts such as continual learning, lifelong learning or autotelic learning. In this paper, we contribute… ▽ More

    Submitted 7 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  5. arXiv:2307.06758  [pdf, other

    cs.AI cs.GT cs.MA

    Layered controller synthesis for dynamic multi-agent systems

    Authors: Emily Clement, Nicolas Perrin-Gilbert, Philipp Schlehuber-Caissier

    Abstract: In this paper we present a layered approach for multi-agent control problem, decomposed into three stages, each building upon the results of the previous one. First, a high-level plan for a coarse abstraction of the system is computed, relying on parametric timed automata augmented with stopwatches as they allow to efficiently model simplified dynamics of such systems. In the second stage, the hig… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  6. The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers

    Authors: Valentin Macé, Raphaël Boige, Felix Chalumeau, Thomas Pierrot, Guillaume Richard, Nicolas Perrin-Gilbert

    Abstract: In the context of neuroevolution, Quality-Diversity algorithms have proven effective in generating repertoires of diverse and efficient policies by relying on the definition of a behavior space. A natural goal induced by the creation of such a repertoire is trying to achieve behaviors on demand, which can be done by running the corresponding policy from the repertoire. However, in uncertain enviro… ▽ More

    Submitted 13 September, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 10+7 pages

  7. arXiv:2211.13742  [pdf, other

    cs.NE cs.AI

    Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in Hard Exploration Problems

    Authors: Felix Chalumeau, Thomas Pierrot, Valentin Macé, Arthur Flajolet, Karim Beguir, Antoine Cully, Nicolas Perrin-Gilbert

    Abstract: A fascinating aspect of nature lies in its ability to produce a collection of organisms that are all high-performing in their niche. Quality-Diversity (QD) methods are evolutionary algorithms inspired by this observation, that obtained great results in many applications, from wing design to robot adaptation. Recently, several works demonstrated that these methods could be applied to perform neuro-… ▽ More

    Submitted 8 September, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: GECCO 2022 Workshop on Quality Diversity Algorithm Benchmarks

  8. arXiv:2211.04786  [pdf, other

    cs.RO cs.AI

    Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration

    Authors: Alexandre Chenu, Olivier Serris, Olivier Sigaud, Nicolas Perrin-Gilbert

    Abstract: Deep Reinforcement Learning has been successfully applied to learn robotic control. However, the corresponding algorithms struggle when applied to problems where the agent is only rewarded after achieving a complex task. In this context, using demonstrations can significantly speed up the learning process, but demonstrations can be costly to acquire. In this paper, we propose to leverage a sequent… ▽ More

    Submitted 17 April, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  9. arXiv:2204.07404  [pdf, other

    cs.AI cs.RO

    Divide & Conquer Imitation Learning

    Authors: Alexandre Chenu, Nicolas Perrin-Gilbert, Olivier Sigaud

    Abstract: When cast into the Deep Reinforcement Learning framework, many robotics tasks require solving a long horizon and sparse reward problem, where learning algorithms struggle. In such context, Imitation Learning (IL) can be a powerful approach to bootstrap the learning process. However, most IL methods require several expert demonstrations which can be prohibitively difficult to acquire. Only a handfu… ▽ More

    Submitted 13 April, 2023; v1 submitted 15 April, 2022; originally announced April 2022.

  10. Exploratory State Representation Learning

    Authors: Astrid Merckling, Nicolas Perrin-Gilbert, Alex Coninx, Stéphane Doncieux

    Abstract: Not having access to compact and meaningful representations is known to significantly increase the complexity of reinforcement learning (RL). For this reason, it can be useful to perform state representation learning (SRL) before tackling RL tasks. However, obtaining a good state representation can only be done if a large diversity of transitions is observed, which can require a difficult explorat… ▽ More

    Submitted 15 February, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

    Journal ref: Frontiers in Robotics and AI, 14 February 2022

  11. arXiv:2104.04768  [pdf, other

    cs.AI cs.LG cs.NE

    Selection-Expansion: A Unifying Framework for Motion-Planning and Diversity Search Algorithms

    Authors: Alexandre Chenu, Nicolas Perrin-Gilbert, Stéphane Doncieux, Olivier Sigaud

    Abstract: Reinforcement learning agents need a reward signal to learn successful policies. When this signal is sparse or the corresponding gradient is deceptive, such agents need a dedicated mechanism to efficiently explore their search space without relying on the reward. Looking for a large diversity of behaviors or using Motion Planning (MP) algorithms are two options in this context. In this paper, we b… ▽ More

    Submitted 10 April, 2021; originally announced April 2021.

  12. Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization

    Authors: Thomas Pierrot, Valentin Macé, Félix Chalumeau, Arthur Flajolet, Geoffrey Cideron, Karim Beguir, Antoine Cully, Olivier Sigaud, Nicolas Perrin-Gilbert

    Abstract: A fascinating aspect of nature lies in its ability to produce a large and diverse collection of organisms that are all high-performing in their niche. By contrast, most AI algorithms focus on finding a single efficient solution to a given problem. Aiming for diversity in addition to performance is a convenient way to deal with the exploration-exploitation trade-off that plays a central role in lea… ▽ More

    Submitted 31 May, 2022; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: Add several baselines (Policy Gradient assisted MAP Elites, DIAYN, AGAC) Change writing to take the point of view of the evo community Change style, writing, explanation, figures

  13. State Representation Learning from Demonstration

    Authors: Astrid Merckling, Alexandre Coninx, Loic Cressot, Stéphane Doncieux, Nicolas Perrin-Gilbert

    Abstract: Robots could learn their own state and world representation from perception and experience without supervision. This desirable goal is the main focus of our field of interest, state representation learning (SRL). Indeed, a compact representation of such a state is beneficial to help robots grasp onto their environment for interacting. The properties of this representation have a strong impact on t… ▽ More

    Submitted 26 September, 2021; v1 submitted 15 September, 2019; originally announced October 2019.

    Comments: Published as a conference paper at LOD 2020