Skip to main content

Showing 1–15 of 15 results for author: Stulp, F

.
  1. arXiv:2404.15001  [pdf, other

    cs.RO

    Unknown Object Gras** for Assistive Robotics

    Authors: Elle Miller, Maximilian Durner, Matthias Humt, Gabriel Quere, Wout Boerdijk, Ashok M. Sundaram, Freek Stulp, Jorn Vogel

    Abstract: We propose a novel pipeline for unknown object gras** in shared robotic autonomy scenarios. State-of-the-art methods for fully autonomous scenarios are typically learning-based approaches optimised for a specific end-effector, that generate grasp poses directly from sensor input. In the domain of assistive robotics, we seek instead to utilise the user's cognitive abilities for enhanced satisfact… ▽ More

    Submitted 4 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: 7 pages, 9 figures

  2. AI-enabled Cyber-Physical In-Orbit Factory -- AI approaches based on digital twin technology for robotic small satellite production

    Authors: Florian Leutert, David Bohlig, Florian Kempf, Klaus Schilling, Maximilian Mühlbauer, Bengisu Ayan, Thomas Hulin, Freek Stulp, Alin Albu-Schäffer, Vladimir Kutscher, Christian Plesker, Thomas Dasbach, Stephan Damm, Reiner Anderl, Benjamin Schleich

    Abstract: With the ever increasing number of active satellites in space, the rising demand for larger formations of small satellites and the commercialization of the space industry (so-called New Space), the realization of manufacturing processes in orbit comes closer to reality. Reducing launch costs and risks, allowing for faster on-demand deployment of individually configured satellites as well as the pr… ▽ More

    Submitted 5 February, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Journal ref: Acta Astronautica (2024), vol. 217, page 1-17

  3. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, A**kya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  4. arXiv:2310.05808  [pdf, other

    cs.RO

    An Open-Loop Baseline for Reinforcement Learning Locomotion Tasks

    Authors: Antonin Raffin, Olivier Sigaud, Jens Kober, Alin Albu-Schäffer, João Silvério, Freek Stulp

    Abstract: In search of a simple baseline for Deep Reinforcement Learning in locomotion tasks, we propose a model-free open-loop strategy. By leveraging prior knowledge and the elegance of simple oscillators to generate periodic joint motions, it achieves respectable performance in five different locomotion environments, with a number of tunable parameters that is a tiny fraction of the thousands typically r… ▽ More

    Submitted 4 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: video: https://b2drop.eudat.eu/s/ykDPMM7F9KFyLgi minimal code: https://gist.github.com/araffin/1fb77a8f290ac248b2e76e01164f21e0

  5. arXiv:2209.07171  [pdf, other

    cs.RO cs.LG

    Learning to Exploit Elastic Actuators for Quadruped Locomotion

    Authors: Antonin Raffin, Daniel Seidel, Jens Kober, Alin Albu-Schäffer, João Silvério, Freek Stulp

    Abstract: Spring-based actuators in legged locomotion provide energy-efficiency and improved performance, but increase the difficulty of controller design. While previous work has focused on extensive modeling and simulation to find optimal controllers for such systems, we propose to learn model-free controllers directly on the real robot. In our approach, gaits are first synthesized by central pattern gene… ▽ More

    Submitted 20 August, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

  6. arXiv:2010.07208  [pdf, other

    cs.SD eess.AS q-bio.NC

    Emergent Jaw Predominance in Vocal Development through Stochastic Optimization

    Authors: Clément Moulin-Frier, Jules Brochard, Freek Stulp, Pierre-Yves Oudeyer

    Abstract: Infant vocal babbling strongly relies on jaw oscillations, especially at the stage of canonical babbling, which underlies the syllabic structure of world languages. In this paper, we propose, model and analyze an hypothesis to explain this predominance of the jaw in early babbling. This hypothesis states that general stochastic optimization principles, when applied to learning sensorimotor control… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Journal ref: IEEE Transactions on Cognitive and Developmental Systems (Volume: 12 , Issue: 3 , Sept. 2020)

  7. arXiv:2005.05719  [pdf, other

    cs.LG cs.RO stat.ML

    Smooth Exploration for Robotic Reinforcement Learning

    Authors: Antonin Raffin, Jens Kober, Freek Stulp

    Abstract: Reinforcement learning (RL) enables robots to learn skills from interactions with the real world. In practice, the unstructured step-based exploration used in Deep RL -- often very successful in simulation -- leads to jerky motion patterns on real robots. Consequences of the resulting shaky behavior are poor exploration, or even damage to the robot. We address these issues by adapting state-depend… ▽ More

    Submitted 20 June, 2021; v1 submitted 12 May, 2020; originally announced May 2020.

    Comments: Code: https://github.com/DLR-RM/stable-baselines3/ Training scripts: https://github.com/DLR-RM/rl-baselines3-zoo/

    Journal ref: Proceedings of the 5th Conference on Robot Learning, PMLR 164:1634-1644, 2022

  8. arXiv:1906.11909  [pdf, other

    stat.ML cs.LG cs.RO

    Comparing Semi-Parametric Model Learning Algorithms for Dynamic Model Estimation in Robotics

    Authors: Sebastian Riedel, Freek Stulp

    Abstract: Physical modeling of robotic system behavior is the foundation for controlling many robotic mechanisms to a satisfactory degree. Mechanisms are also typically designed in a way that good model accuracy can be achieved with relatively simple models and model identification strategies. If the modeling accuracy using physically based models is not enough or too complex, model-free methods based on ma… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

  9. arXiv:1902.07015  [pdf, other

    cs.LG cs.AI stat.ML

    Investigating Generalisation in Continuous Deep Reinforcement Learning

    Authors: Chenyang Zhao, Olivier Sigaud, Freek Stulp, Timothy M. Hospedales

    Abstract: Deep Reinforcement Learning has shown great success in a variety of control tasks. However, it is unclear how close we are to the vision of putting Deep RL into practice to solve real world problems. In particular, common practice in the field is to train policies on largely deterministic simulators and to evaluate algorithms through training performance alone, without a train/test distinction to… ▽ More

    Submitted 20 February, 2019; v1 submitted 19 February, 2019; originally announced February 2019.

  10. arXiv:1807.02303  [pdf, other

    cs.RO cs.AI cs.LG stat.ML

    A survey on policy search algorithms for learning robot controllers in a handful of trials

    Authors: Konstantinos Chatzilygeroudis, Vassilis Vassiliades, Freek Stulp, Sylvain Calinon, Jean-Baptiste Mouret

    Abstract: Most policy search algorithms require thousands of training episodes to find an effective policy, which is often infeasible with a physical robot. This survey article focuses on the extreme other end of the spectrum: how can a robot adapt with only a handful of trials (a dozen) and a few minutes? By analogy with the word "big-data", we refer to this challenge as "micro-data reinforcement learning"… ▽ More

    Submitted 4 December, 2019; v1 submitted 6 July, 2018; originally announced July 2018.

    Comments: 21 pages, 3 figures, 4 algorithms, accepted at IEEE Transactions on Robotics

  11. arXiv:1803.04706  [pdf, other

    cs.LG

    Policy Search in Continuous Action Domains: an Overview

    Authors: Olivier Sigaud, Freek Stulp

    Abstract: Continuous action policy search is currently the focus of intensive research, driven both by the recent success of deep reinforcement learning algorithms and the emergence of competitors based on evolutionary algorithms. In this paper, we present a broad survey of policy search methods, providing a unified perspective on very different approaches, including also Bayesian Optimization and directed… ▽ More

    Submitted 13 June, 2019; v1 submitted 13 March, 2018; originally announced March 2018.

    Comments: Accepted in the Neural Networks Journal (Volume 113, May 2019)

  12. arXiv:1712.05249  [pdf, other

    cs.NE cs.AI cs.LG cs.RO

    Proximodistal Exploration in Motor Learning as an Emergent Property of Optimization

    Authors: Freek Stulp, Pierre-Yves Oudeyer

    Abstract: To harness the complexity of their high-dimensional bodies during sensorimotor development, infants are guided by patterns of freezing and freeing of degrees of freedom. For instance, when learning to reach, infants free the degrees of freedom in their arm proximodistally, i.e. from joints that are closer to the body to those that are more distant. Here, we formulate and study computationally the… ▽ More

    Submitted 14 December, 2017; originally announced December 2017.

  13. arXiv:1512.03201  [pdf, ps, other

    cs.LG

    Gated networks: an inventory

    Authors: Olivier Sigaud, Clément Masson, David Filliat, Freek Stulp

    Abstract: Gated networks are networks that contain gating connections, in which the outputs of at least two neurons are multiplied. Initially, gated networks were used to learn relationships between two input sources, such as pixels from two images. More recently, they have been applied to learning activity recognition or multi-modal representations. The aims of this paper are threefold: 1) to explain the b… ▽ More

    Submitted 10 December, 2015; originally announced December 2015.

    Comments: Unpublished manuscript, 17 pages

  14. arXiv:1401.4599  [pdf

    cs.RO cs.AI

    Learning and Reasoning with Action-Related Places for Robust Mobile Manipulation

    Authors: Freek Stulp, Andreas Fedrizzi, Lorenz Mösenlechner, Michael Beetz

    Abstract: We propose the concept of Action-Related Place (ARPlace) as a powerful and flexible representation of task-related place in the context of mobile manipulation. ARPlace represents robot base locations not as a single position, but rather as a collection of positions, each with an associated probability that the manipulation action will succeed when located there. ARPlaces are generated using a pred… ▽ More

    Submitted 18 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 43, pages 1-42, 2012

  15. arXiv:1206.4621  [pdf

    cs.LG

    Path Integral Policy Improvement with Covariance Matrix Adaptation

    Authors: Freek Stulp, Olivier Sigaud

    Abstract: There has been a recent focus in reinforcement learning on addressing continuous state and action problems by optimizing parameterized policies. PI2 is a recent example of this approach. It combines a derivation from first principles of stochastic optimal control with tools from statistical estimation theory. In this paper, we consider PI2 as a member of the wider family of methods which share the… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012