Showing 1–7 of 7 results for author: Chatzilygeroudis, K

  1. arXiv:2012.04322

    cs.NE cs.LG math.OC stat.ML

    Quality-Diversity Optimization: a novel branch of stochastic optimization

    Authors: Konstantinos Chatzilygeroudis, Antoine Cully, Vassilis Vassiliades, Jean-Baptiste Mouret

    Abstract: Traditional optimization algorithms search for a single global optimum that maximizes (or minimizes) the objective function. Multimodal optimization algorithms search for the highest peaks in the search space that can be more than one. Quality-Diversity algorithms are a recent addition to the evolutionary computation toolbox that do not only search for a single set of local optima, but instead try…

    Submitted 16 December, 2020; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: 13 pages, 4 figures, 3 algorithms, to be published in "Black Box Optimization, Machine Learning and No-Free Lunch Theorems", P. Pardalos, V. Rasskazova, M.N. Vrahatis, Ed., Springer

  2. arXiv:1807.02303

    cs.RO cs.AI cs.LG stat.ML

    A survey on policy search algorithms for learning robot controllers in a handful of trials

    Authors: Konstantinos Chatzilygeroudis, Vassilis Vassiliades, Freek Stulp, Sylvain Calinon, Jean-Baptiste Mouret

    Abstract: Most policy search algorithms require thousands of training episodes to find an effective policy, which is often infeasible with a physical robot. This survey article focuses on the extreme other end of the spectrum: how can a robot adapt with only a handful of trials (a dozen) and a few minutes? By analogy with the word "big-data", we refer to this challenge as "micro-data reinforcement learning"…

    Submitted 4 December, 2019; v1 submitted 6 July, 2018; originally announced July 2018.

    Comments: 21 pages, 3 figures, 4 algorithms, accepted at IEEE Transactions on Robotics

  3. arXiv:1806.09351

    cs.LG cs.AI cs.NE cs.RO stat.ML

    Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards

    Authors: Rituraj Kaushik, Konstantinos Chatzilygeroudis, Jean-Baptiste Mouret

    Abstract: The most data-efficient algorithms for reinforcement learning in robotics are model-based policy search algorithms, which alternate between learning a dynamical model of the robot and optimizing a policy to maximize the expected return given the model and its uncertainties. However, the current algorithms lack an effective exploration strategy to deal with sparse or misleading reward scenarios: if…

    Submitted 3 March, 2020; v1 submitted 25 June, 2018; originally announced June 2018.

    Comments: Conference on Robot Learning (CoRL) 2018; Code at https://github.com/resibots/kaushik_2018_multi-dex ; Video at https://youtu.be/9ZLwUxAAq6M

    Journal ref: Proceedings of the Conference on Robot Learning, PMLR 87:839-855, 2018

  4. arXiv:1709.06919

    cs.RO cs.AI cs.LG cs.NE stat.ML

    Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search

    Authors: Rémi Pautrat, Konstantinos Chatzilygeroudis, Jean-Baptiste Mouret

    Abstract: One of the most interesting features of Bayesian optimization for direct policy search is that it can leverage priors (e.g., from simulation or from previous tasks) to accelerate learning on a robot. In this paper, we are interested in situations for which several priors exist but we do not know in advance which one fits best the current situation. We tackle this problem by introducing a novel acq…

    Submitted 13 March, 2018; v1 submitted 20 September, 2017; originally announced September 2017.

    Comments: Accepted at ICRA 2018; 8 pages, 4 figures, 1 algorithm; Video at https://youtu.be/xo8mUIZTvNE ; Spotlight ICRA presentation at https://youtu.be/iiVaV-U6Kqo

  5. arXiv:1709.06917

    cs.RO cs.AI cs.LG cs.NE stat.ML

    Using Parameterized Black-Box Priors to Scale Up Model-Based Policy Search for Robotics

    Authors: Konstantinos Chatzilygeroudis, Jean-Baptiste Mouret

    Abstract: The most data-efficient algorithms for reinforcement learning in robotics are model-based policy search algorithms, which alternate between learning a dynamical model of the robot and optimizing a policy to maximize the expected return given the model and its uncertainties. Among the few proposed approaches, the recently introduced Black-DROPS algorithm exploits a black-box optimization algorithm…

    Submitted 13 March, 2018; v1 submitted 20 September, 2017; originally announced September 2017.

    Comments: Accepted at ICRA 2018; 8 pages, 4 figures, 2 algorithms, 1 table; Video at https://youtu.be/HFkZkhGGzTo ; Spotlight ICRA presentation at https://youtu.be/_MZYDhfWeLc

  6. arXiv:1611.07343

    cs.LG cs.AI cs.RO stat.ML

    Limbo: A Fast and Flexible Library for Bayesian Optimization

    Authors: Antoine Cully, Konstantinos Chatzilygeroudis, Federico Allocati, Jean-Baptiste Mouret

    Abstract: Limbo is an open-source C++11 library for Bayesian optimization which is designed to be both highly flexible and very fast. It can be used to optimize functions for which the gradient is unknown, evaluations are expensive, and runtime cost matters (e.g., on embedded systems or robots). Benchmarks on standard functions show that Limbo is about 2 times faster than BayesOpt (another C++ library) for…

    Submitted 22 November, 2016; originally announced November 2016.

  7. arXiv:1605.07496

    cs.LG cs.AI stat.ML

    Alternating Optimisation and Quadrature for Robust Control

    Authors: Supratik Paul, Konstantinos Chatzilygeroudis, Kamil Ciosek, Jean-Baptiste Mouret, Michael A. Osborne, Shimon Whiteson

    Abstract: Bayesian optimisation has been successfully applied to a variety of reinforcement learning problems. However, the traditional approach for learning optimal policies in simulators does not utilise the opportunity to improve learning by adjusting certain environment variables: state features that are unobservable and randomly determined by the environment in a physical setting but are controllable i…

    Submitted 18 December, 2017; v1 submitted 24 May, 2016; originally announced May 2016.

    Comments: To appear in AAAI 2018. For a video of the policy learnt in simulation and deployed on a real hexapod, see https://youtu.be/ME90xtIPsKk