Skip to main content

Showing 1–30 of 30 results for author: Doncieux, S

.
  1. arXiv:2403.06173  [pdf, other

    cs.RO cs.LG

    Speeding up 6-DoF Grasp Sampling with Quality-Diversity

    Authors: Johann Huber, François Hélénon, Mathilde Kappel, Elie Chelly, Mahdi Khoramshahi, Faïz Ben Amar, Stéphane Doncieux

    Abstract: Recent advances in AI have led to significant results in robotic learning, including natural language-conditioned planning and efficient optimization of controllers using generative models. However, the interaction data remains the bottleneck for generalization. Getting data for gras** is a critical challenge, as this skill is required to complete many manipulation tasks. Quality-Diversity (QD)… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: 7 pages, 8 figures. Preprint version

  2. arXiv:2311.00344  [pdf, other

    cs.AI

    A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

    Authors: Olivier Sigaud, Gianluca Baldassarre, Cedric Colas, Stephane Doncieux, Richard Duro, Pierre-Yves Oudeyer, Nicolas Perrin-Gilbert, Vieri Giuliano Santucci

    Abstract: A lot of recent machine learning research papers have ``open-ended learning'' in their title. But very few of them attempt to define what they mean when using the term. Even worse, when looking more closely there seems to be no consensus on what distinguishes open-ended learning from related concepts such as continual learning, lifelong learning or autotelic learning. In this paper, we contribute… ▽ More

    Submitted 7 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  3. arXiv:2310.04517  [pdf, other

    cs.RO cs.LG

    Domain Randomization for Sim2real Transfer of Automatically Generated Gras** Datasets

    Authors: Johann Huber, François Hélénon, Hippolyte Watrelot, Faiz Ben Amar, Stéphane Doncieux

    Abstract: Robotic gras** refers to making a robotic system pick an object by applying forces and torques on its surface. Many recent studies use data-driven approaches to address gras**, but the sparse reward nature of this task made the learning process challenging to bootstrap. To avoid constraining the operational space, an increasing number of works propose gras** datasets to learn from. But most… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: 6 pages, 7 figures, draft version

  4. arXiv:2310.04349  [pdf, other

    cs.RO cs.LG

    Toward a Plug-and-Play Vision-Based Gras** Module for Robotics

    Authors: François Hélénon, Johann Huber, Faïz Ben Amar, Stéphane Doncieux

    Abstract: Despite recent advancements in AI for robotics, gras** remains a partially solved challenge, hindered by the lack of benchmarks and reproducibility constraints. This paper introduces a vision-based gras** framework that can easily be transferred across multiple manipulators. Leveraging Quality-Diversity (QD) algorithms, the framework generates diverse repertoires of open-loop gras** trajecto… ▽ More

    Submitted 12 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: 6 pages, 9 figures

  5. arXiv:2308.13278  [pdf, other

    cs.LG cs.AI cs.RO

    Integrating LLMs and Decision Transformers for Language Grounded Generative Quality-Diversity

    Authors: Achkan Salehi, Stephane Doncieux

    Abstract: Quality-Diversity is a branch of stochastic optimization that is often applied to problems from the Reinforcement Learning and control domains in order to construct repertoires of well-performing policies/skills that exhibit diversity with respect to a behavior space. Such archives are usually composed of a finite number of reactive agents which are each associated to a unique behavior descriptor,… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 16 pages, 9 figures, 2 tables

  6. arXiv:2308.05483  [pdf, other

    cs.RO cs.LG

    Quality Diversity under Sparse Reward and Sparse Interaction: Application to Gras** in Robotics

    Authors: J. Huber, F. Hélénon, M. Coninx, F. Ben Amar, S. Doncieux

    Abstract: Quality-Diversity (QD) methods are algorithms that aim to generate a set of diverse and high-performing solutions to a given problem. Originally developed for evolutionary robotics, most QD studies are conducted on a limited set of domains - mainly applied to locomotion, where the fitness and the behavior signal are dense. Gras** is a crucial task for manipulation in robotics. Despite the effort… ▽ More

    Submitted 31 October, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: 37 pages, 17 figures. Draft version

  7. arXiv:2303.01563  [pdf, other

    cs.RO cs.LG

    Data-efficient, Explainable and Safe Box Manipulation: Illustrating the Advantages of Physical Priors in Model-Predictive Control

    Authors: Achkan Salehi, Stephane Doncieux

    Abstract: Model-based RL/control have gained significant traction in robotics. Yet, these approaches often remain data-inefficient and lack the explainability of hand-engineered solutions. This makes them difficult to debug/integrate in safety-critical settings. However, in many systems, prior knowledge of environment kinematics/dynamics is available. Incorporating such priors can help address the aforement… ▽ More

    Submitted 28 March, 2024; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: accepted for publication by l4dc 2024, 12 pages (with references), 4 figures, 2 tables

  8. arXiv:2210.11801  [pdf, other

    cs.LG

    Random Actions vs Random Policies: Bootstrap** Model-Based Direct Policy Search

    Authors: Elias Hanna, Alex Coninx, Stéphane Doncieux

    Abstract: This paper studies the impact of the initial data gathering method on the subsequent learning of a dynamics model. Dynamics models approximate the true transition function of a given task, in order to perform policy search directly on the model rather than on the costly real system. This study aims to determine how to bootstrap a model as efficiently as possible, by comparing initialization method… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: ICML 2022 Workshop Adaptive Experimental Design and Active Learning in the Real World

  9. arXiv:2210.07887  [pdf, other

    cs.RO cs.LG

    E2R: a Hierarchical-Learning inspired Novelty-Search method to generate diverse repertoires of gras** trajectories

    Authors: Johann Huber, Oumar Sane, Alex Coninx, Faiz Ben Amar, Stephane Doncieux

    Abstract: Robotics gras** refers to the task of making a robotic system pick an object by applying forces and torques on its surface. Despite the recent advances in data-driven approaches, gras** remains an unsolved problem. Most of the works on this task are relying on priors and heavy constraints to avoid the exploration problem. Novelty Search (NS) refers to evolutionary algorithms that replace selec… ▽ More

    Submitted 17 April, 2024; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: 7 pages, 6 figures. Preprint version

  10. Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations

    Authors: Achkan Salehi, Steffen Rühl, Stephane Doncieux

    Abstract: Model-based Reinforcement Learning and Control have demonstrated great potential in various sequential decision making problem domains, including in robotics settings. However, real-world robotics systems often present challenges that limit the applicability of those methods. In particular, we note two problems that jointly happen in many industrial systems: 1) Irregular/asynchronous observations… ▽ More

    Submitted 23 October, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: 16 double column pages, 14 figures, 3 tables

    Journal ref: IEEE Transactions on Robotics; Print ISSN: 1552-3098; Online ISSN: 1941-0468

  11. arXiv:2205.08189  [pdf, other

    cs.RO cs.LG

    Automatic Acquisition of a Repertoire of Diverse Gras** Trajectories through Behavior Sha** and Novelty Search

    Authors: Aurélien Morel, Yakumo Kunimoto, Alex Coninx, Stéphane Doncieux

    Abstract: Gras** a particular object may require a dedicated gras** movement that may also be specific to the robot end-effector. No generic and autonomous method does exist to generate these movements without making hypotheses on the robot or on the object. Learning methods could help to autonomously discover relevant gras** movements, but they face an important issue: gras** movements are so rare… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 7 pages, 9 figures, accepted at ICRA 2022. Annex video available at https://youtu.be/bqqQepJAOKQ

  12. arXiv:2205.03207  [pdf, other

    cs.LG cs.NE

    Towards QD-suite: develo** a set of benchmarks for Quality-Diversity algorithms

    Authors: Achkan Salehi, Stephane Doncieux

    Abstract: While the field of Quality-Diversity (QD) has grown into a distinct branch of stochastic optimization, a few problems, in particular locomotion and navigation tasks, have become de facto standards. Are such benchmarks sufficient? Are they representative of the key challenges faced by QD algorithms? Do they provide the ability to focus on one particular challenge by properly disentangling it from o… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: 6 pages, 8 figures, Written for and presented at the GECCO 22 QD-benchmarking workshop (https://quality-diversity.github.io/workshop)

  13. arXiv:2205.03162  [pdf, other

    cs.LG cs.AI cs.NE

    Geodesics, Non-linearities and the Archive of Novelty Search

    Authors: Achkan Salehi, Alexandre Coninx, Stephane Doncieux

    Abstract: The Novelty Search (NS) algorithm was proposed more than a decade ago. However, the mechanisms behind its empirical success are still not well formalized/understood. This short note focuses on the effects of the archive on exploration. Experimental evidence from a few application domains suggests that archive-based NS performs in general better than when Novelty is solely computed with respect to… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

    Comments: 4 pages, 3 figures

    Journal ref: GECCO 22 Companion, July 9-13, 2022, Boston, MA, USA

  14. arXiv:2111.01919  [pdf, other

    cs.LG cs.AI cs.NE cs.RO

    Discovering and Exploiting Sparse Rewards in a Learned Behavior Space

    Authors: Giuseppe Paolo, Miranda Coninx, Alban Laflaquière, Stephane Doncieux

    Abstract: Learning optimal policies in sparse rewards settings is difficult as the learning agent has little to no feedback on the quality of its actions. In these situations, a good strategy is to focus on exploration, hopefully leading to the discovery of a reward signal to improve on. A learning algorithm capable of dealing with this kind of settings has to be able to (1) explore possible agent behaviors… ▽ More

    Submitted 26 September, 2023; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: 25 pages. Published by the Evolutionary Computation Journal, MIT Press

  15. Exploratory State Representation Learning

    Authors: Astrid Merckling, Nicolas Perrin-Gilbert, Alex Coninx, Stéphane Doncieux

    Abstract: Not having access to compact and meaningful representations is known to significantly increase the complexity of reinforcement learning (RL). For this reason, it can be useful to perform state representation learning (SRL) before tackling RL tasks. However, obtaining a good state representation can only be done if a large diversity of transitions is observed, which can require a difficult explorat… ▽ More

    Submitted 15 February, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

    Journal ref: Frontiers in Robotics and AI, 14 February 2022

  16. arXiv:2109.06826  [pdf, other

    cs.LG cs.AI cs.NE

    Few-shot Quality-Diversity Optimization

    Authors: Achkan Salehi, Alexandre Coninx, Stephane Doncieux

    Abstract: In the past few years, a considerable amount of research has been dedicated to the exploitation of previous learning experiences and the design of Few-shot and Meta Learning approaches, in problem domains ranging from Computer Vision to Reinforcement Learning based control. A notable exception, where to the best of our knowledge, little to no effort has been made in this direction is Quality-Diver… ▽ More

    Submitted 18 January, 2024; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Accepted for publication in the IEEE Robotics and Automation Letters (RA-L) journal

    Journal ref: IEEE Robotics and Automation Letters 7.2 (2022): 4424-4431

  17. arXiv:2104.04768  [pdf, other

    cs.AI cs.LG cs.NE

    Selection-Expansion: A Unifying Framework for Motion-Planning and Diversity Search Algorithms

    Authors: Alexandre Chenu, Nicolas Perrin-Gilbert, Stéphane Doncieux, Olivier Sigaud

    Abstract: Reinforcement learning agents need a reward signal to learn successful policies. When this signal is sparse or the corresponding gradient is deceptive, such agents need a dedicated mechanism to efficiently explore their search space without relying on the reward. Looking for a large diversity of behaviors or using Motion Planning (MP) algorithms are two options in this context. In this paper, we b… ▽ More

    Submitted 10 April, 2021; originally announced April 2021.

  18. arXiv:2104.03936  [pdf, other

    cs.AI cs.LG cs.NE

    BR-NS: an Archive-less Approach to Novelty Search

    Authors: Achkan Salehi, Alexandre Coninx, Stephane Doncieux

    Abstract: As open-ended learning based on divergent search algorithms such as Novelty Search (NS) draws more and more attention from the research community, it is natural to expect that its application to increasingly complex real-world problems will require the exploration to operate in higher dimensional Behavior Spaces which will not necessarily be Euclidean. Novelty Search traditionally relies on k-near… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: Author version of the paper accepted at GECCO 21

  19. arXiv:2102.03140  [pdf, other

    cs.NE cs.AI cs.LG cs.RO

    Sparse Reward Exploration via Novelty Search and Emitters

    Authors: Giuseppe Paolo, Alexandre Coninx, Stephane Doncieux, Alban Laflaquière

    Abstract: Reward-based optimization algorithms require both exploration, to find rewards, and exploitation, to maximize performance. The need for efficient exploration is even more significant in sparse reward settings, in which performance feedback is given sparingly, thus rendering it unsuitable for guiding the search process. In this work, we introduce the SparsE Reward Exploration via Novelty and Emitte… ▽ More

    Submitted 16 April, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: In 2021 Genetic and Evolutionary Computation Conference (GECCO 21), July, 2021, Lille, France. ACM, New York, NY, USA, 11 pages

  20. arXiv:2005.06224  [pdf, other

    cs.NE cs.AI cs.LG cs.RO

    Novelty Search makes Evolvability Inevitable

    Authors: Stephane Doncieux, Giuseppe Paolo, Alban Laflaquière, Alexandre Coninx

    Abstract: Evolvability is an important feature that impacts the ability of evolutionary processes to find interesting novel solutions and to deal with changing conditions of the problem to solve. The estimation of evolvability is not straightforward and is generally too expensive to be directly used as selective pressure in the evolutionary process. Indirectly promoting evolvability as a side effect of othe… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  21. arXiv:2005.06223  [pdf, other

    cs.AI cs.LG cs.NE cs.RO

    DREAM Architecture: a Developmental Approach to Open-Ended Learning in Robotics

    Authors: Stephane Doncieux, Nicolas Bredeche, Léni Le Goff, Benoît Girard, Alexandre Coninx, Olivier Sigaud, Mehdi Khamassi, Natalia Díaz-Rodríguez, David Filliat, Timothy Hospedales, A. Eiben, Richard Duro

    Abstract: Robots are still limited to controlled conditions, that the robot designer knows with enough details to endow the robot with the appropriate models or behaviors. Learning algorithms add some flexibility with the ability to discover the appropriate behavior given either some demonstrations or a reward to guide its exploration with a reinforcement learning algorithm. Reinforcement learning algorithm… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  22. State Representation Learning from Demonstration

    Authors: Astrid Merckling, Alexandre Coninx, Loic Cressot, Stéphane Doncieux, Nicolas Perrin-Gilbert

    Abstract: Robots could learn their own state and world representation from perception and experience without supervision. This desirable goal is the main focus of our field of interest, state representation learning (SRL). Indeed, a compact representation of such a state is beneficial to help robots grasp onto their environment for interacting. The properties of this representation have a strong impact on t… ▽ More

    Submitted 26 September, 2021; v1 submitted 15 September, 2019; originally announced October 2019.

    Comments: Published as a conference paper at LOD 2020

  23. arXiv:1909.05508  [pdf

    cs.RO cs.AI cs.LG cs.NE

    Unsupervised Learning and Exploration of Reachable Outcome Space

    Authors: Giuseppe Paolo, Alban Laflaquière, Alexandre Coninx, Stephane Doncieux

    Abstract: Performing Reinforcement Learning in sparse rewards settings, with very little prior knowledge, is a challenging problem since there is no signal to properly guide the learning process. In such situations, a good search strategy is fundamental. At the same time, not having to adapt the algorithm to every single problem is very desirable. Here we introduce TAXONS, a Task Agnostic eXploration of Out… ▽ More

    Submitted 4 May, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: Published at IEEE International Conference on Robotics and Automation (ICRA) 2020

  24. arXiv:1903.04413  [pdf, other

    cs.RO cs.AI cs.LG

    Building an Affordances Map with Interactive Perception

    Authors: Leni K. Le Goff, Oussama Yaakoubi, Alexandre Coninx, Stephane Doncieux

    Abstract: Robots need to understand their environment to perform their task. If it is possible to pre-program a visual scene analysis process in closed environments, robots operating in an open environment would benefit from the ability to learn it through their interaction with their environment. This ability furthermore opens the way to the acquisition of affordances maps in which the action capabilities… ▽ More

    Submitted 11 March, 2019; originally announced March 2019.

    Comments: 14 pages, 15 figures

  25. arXiv:1901.10968  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Bootstrap** Robotic Ecological Perception from a Limited Set of Hypotheses Through Interactive Perception

    Authors: Léni K. Le Goff, Ghanim Mukhtar, Alexandre Coninx, Stéphane Doncieux

    Abstract: To solve its task, a robot needs to have the ability to interpret its perceptions. In vision, this interpretation is particularly difficult and relies on the understanding of the structure of the scene, at least to the extent of its task and sensorimotor abilities. A robot with the ability to build and adapt this interpretation process according to its own tasks and capabilities would push away th… ▽ More

    Submitted 30 January, 2019; originally announced January 2019.

    Comments: 21 pages, 21 figures

  26. From exploration to control: learning object manipulation skills through novelty search and local adaptation

    Authors: Seungsu Kim, Alexandre Coninx, Stephane Doncieux

    Abstract: Programming a robot to deal with open-ended tasks remains a challenge, in particular if the robot has to manipulate objects. Launching, gras**, pushing or any other object interaction can be simulated but the corresponding models are not reversible and the robot behavior thus cannot be directly deduced. These behaviors are hard to learn without a demonstration as the search space is large and th… ▽ More

    Submitted 16 November, 2020; v1 submitted 3 January, 2019; originally announced January 2019.

    Comments: 30 pages, 18 figures, accepted for publication in Robotics and Autonomous Systems

  27. arXiv:1811.02945  [pdf

    cs.LG cs.AI cs.RO stat.ML

    Behavioural Repertoire via Generative Adversarial Policy Networks

    Authors: Marija Jegorova, Stéphane Doncieux, Timothy Hospedales

    Abstract: Learning algorithms are enabling robots to solve increasingly challenging real-world tasks. These approaches often rely on demonstrations and reproduce the behavior shown. Unexpected changes in the environment may require using different behaviors to achieve the same effect, for instance to reach and grasp an object in changing clutter. An emerging paradigm addressing this robustness issue is to l… ▽ More

    Submitted 18 February, 2020; v1 submitted 7 November, 2018; originally announced November 2018.

    Comments: In Proceedings of 2019 Joint IEEE 9th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob), pages 320 - 326

    Journal ref: 2019 Joint IEEE 9th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)

  28. arXiv:1803.03453  [pdf, other

    cs.NE

    The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities

    Authors: Joel Lehman, Jeff Clune, Dusan Misevic, Christoph Adami, Lee Altenberg, Julie Beaulieu, Peter J. Bentley, Samuel Bernard, Guillaume Beslon, David M. Bryson, Patryk Chrabaszcz, Nick Cheney, Antoine Cully, Stephane Doncieux, Fred C. Dyer, Kai Olav Ellefsen, Robert Feldt, Stephan Fischer, Stephanie Forrest, Antoine Frénoy, Christian Gagné, Leni Le Goff, Laura M. Grabowski, Babak Hodjat, Frank Hutter , et al. (28 additional authors not shown)

    Abstract: Biological evolution provides a creative fount of complex and subtle adaptations, often surprising the scientists who discover them. However, because evolution is an algorithmic process that transcends the substrate in which it occurs, evolution's creativity is not limited to nature. Indeed, many researchers in the field of digital evolution have observed their evolving algorithms and organisms su… ▽ More

    Submitted 21 November, 2019; v1 submitted 9 March, 2018; originally announced March 2018.

  29. arXiv:1507.06877  [pdf, other

    cs.NE

    Multi-objective analysis of computational models

    Authors: Stéphane Doncieux, Jean Liénard, Benoît Girard, Mohamed Hamdaoui, Joël Chaskalovic

    Abstract: Computational models are of increasing complexity and their behavior may in particular emerge from the interaction of different parts. Studying such models becomes then more and more difficult and there is a need for methods and tools supporting this process. Multi-objective evolutionary algorithms generate a set of trade-off solutions instead of a single optimal solution. The availability of a se… ▽ More

    Submitted 24 July, 2015; originally announced July 2015.

  30. arXiv:1307.1870  [pdf, other

    cs.RO

    Crossing the Reality Gap: a Short Introduction to the Transferability Approach

    Authors: Jean-Baptiste Mouret, Sylvain Koos, Stéphane Doncieux

    Abstract: In robotics, gradient-free optimization algorithms (e.g. evolutionary algorithms) are often used only in simulation because they require the evaluation of many candidate solutions. Nevertheless, solutions obtained in simulation often do not work well on the real device. The transferability approach aims at crossing this gap between simulation and reality by \emph{making the optimization algorithm… ▽ More

    Submitted 7 July, 2013; originally announced July 2013.