Skip to main content

Showing 1–15 of 15 results for author: Bohez, S

.
  1. arXiv:2305.14654  [pdf, other

    cs.RO cs.AI

    Barkour: Benchmarking Animal-level Agility with Quadruped Robots

    Authors: Ken Caluwaerts, Atil Iscen, J. Chase Kew, Wenhao Yu, Tingnan Zhang, Daniel Freeman, Kuang-Huei Lee, Lisa Lee, Stefano Saliceti, Vincent Zhuang, Nathan Batchelor, Steven Bohez, Federico Casarini, Jose Enrique Chen, Omar Cortes, Erwin Coumans, Adil Dostmohamed, Gabriel Dulac-Arnold, Alejandro Escontrela, Erik Frey, Roland Hafner, Deepali Jain, Bauyrjan Jyenis, Yuheng Kuang, Edward Lee , et al. (19 additional authors not shown)

    Abstract: Animals have evolved various agile locomotion strategies, such as sprinting, lea**, and jum**. There is a growing interest in develo** legged robots that move like their biological counterparts and show various agile skills to navigate complex environments quickly. Despite the interest, the field lacks systematic benchmarks to measure the performance of control policies and hardware in agili… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 17 pages, 19 figures

  2. arXiv:2210.04932  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    NeRF2Real: Sim2real Transfer of Vision-guided Bipedal Motion Skills using Neural Radiance Fields

    Authors: Arunkumar Byravan, Jan Humplik, Leonard Hasenclever, Arthur Brussee, Francesco Nori, Tuomas Haarnoja, Ben Moran, Steven Bohez, Fereshteh Sadeghi, Bojan Vujatovic, Nicolas Heess

    Abstract: We present a system for applying sim2real approaches to "in the wild" scenes with realistic visuals, and to policies which rely on active perception using RGB cameras. Given a short video of a static scene collected using a generic phone, we learn the scene's contact geometry and a function for novel view synthesis using a Neural Radiance Field (NeRF). We augment the NeRF rendering of the static s… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

  3. arXiv:2204.05893  [pdf, other

    cs.RO cs.AI cs.LG

    Forgetting and Imbalance in Robot Lifelong Learning with Off-policy Data

    Authors: Wenxuan Zhou, Steven Bohez, Jan Humplik, Abbas Abdolmaleki, Dushyant Rao, Markus Wulfmeier, Tuomas Haarnoja, Nicolas Heess

    Abstract: Robots will experience non-stationary environment dynamics throughout their lifetime: the robot dynamics can change due to wear and tear, or its surroundings may change over time. Eventually, the robots should perform well in all of the environment variations it has encountered. At the same time, it should still be able to learn fast in a new environment. We identify two challenges in Reinforcemen… ▽ More

    Submitted 18 August, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

    Comments: Published at 1st Conference on Lifelong Learning Agents, 2022

  4. arXiv:2203.17138  [pdf, other

    cs.RO cs.AI cs.LG

    Imitate and Repurpose: Learning Reusable Robot Movement Skills From Human and Animal Behaviors

    Authors: Steven Bohez, Saran Tunyasuvunakool, Philemon Brakel, Fereshteh Sadeghi, Leonard Hasenclever, Yuval Tassa, Emilio Parisotto, Jan Humplik, Tuomas Haarnoja, Roland Hafner, Markus Wulfmeier, Michael Neunert, Ben Moran, Noah Siegel, Andrea Huber, Francesco Romano, Nathan Batchelor, Federico Casarini, Josh Merel, Raia Hadsell, Nicolas Heess

    Abstract: We investigate the use of prior knowledge of human and animal movement to learn reusable locomotion skills for real legged robots. Our approach builds upon previous work on imitating human or dog Motion Capture (MoCap) data to learn a movement skill module. Once learned, this skill module can be reused for complex downstream tasks. Importantly, due to the prior imposed by the MoCap data, our appro… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: 30 pages, 9 figures, 8 tables, 14 videos at https://bit.ly/robot-npmp , submitted to Science Robotics

  5. arXiv:2111.00262  [pdf, other

    cs.RO cs.LG

    Learning Coordinated Terrain-Adaptive Locomotion by Imitating a Centroidal Dynamics Planner

    Authors: Philemon Brakel, Steven Bohez, Leonard Hasenclever, Nicolas Heess, Konstantinos Bousmalis

    Abstract: Dynamic quadruped locomotion over challenging terrains with precise foot placements is a hard problem for both optimal control methods and Reinforcement Learning (RL). Non-linear solvers can produce coordinated constraint satisfying motions, but often take too long to converge for online application. RL methods can learn dynamic reactive controllers but require carefully tuned sha** rewards to p… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: A shorter version without appendix was submitted to ICRA 2022

  6. dm_control: Software and Tasks for Continuous Control

    Authors: Yuval Tassa, Saran Tunyasuvunakool, Alistair Muldal, Yotam Doron, Piotr Trochim, Siqi Liu, Steven Bohez, Josh Merel, Tom Erez, Timothy Lillicrap, Nicolas Heess

    Abstract: The dm_control software package is a collection of Python libraries and task suites for reinforcement learning agents in an articulated-body simulation. A MuJoCo wrapper provides convenient bindings to functions and data structures. The PyMJCF and Composer libraries enable procedural model manipulation and task authoring. The Control Suite is a fixed set of tasks with standardised structure, inten… ▽ More

    Submitted 7 September, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: arXiv admin note: text overlap with arXiv:1801.00690

  7. arXiv:1902.04623  [pdf, other

    cs.RO

    Value constrained model-free continuous control

    Authors: Steven Bohez, Abbas Abdolmaleki, Michael Neunert, Jonas Buchli, Nicolas Heess, Raia Hadsell

    Abstract: The naive application of Reinforcement Learning algorithms to continuous control problems -- such as locomotion and manipulation -- often results in policies which rely on high-amplitude, high-frequency control signals, known colloquially as bang-bang control. Although such solutions may indeed maximize task reward, they can be unsuitable for real world systems. Bang-bang control may lead to incre… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

  8. arXiv:1812.02256  [pdf, other

    cs.LG stat.ML

    Relative Entropy Regularized Policy Iteration

    Authors: Abbas Abdolmaleki, Jost Tobias Springenberg, Jonas Degrave, Steven Bohez, Yuval Tassa, Dan Belov, Nicolas Heess, Martin Riedmiller

    Abstract: We present an off-policy actor-critic algorithm for Reinforcement Learning (RL) that combines ideas from gradient-free optimization via stochastic search with learned action-value function. The result is a simple procedure consisting of three steps: i) policy evaluation by estimating a parametric action-value function; ii) policy improvement via the estimation of a local non-parametric policy; and… ▽ More

    Submitted 5 December, 2018; originally announced December 2018.

  9. arXiv:1804.10332  [pdf, other

    cs.RO cs.AI

    Sim-to-Real: Learning Agile Locomotion For Quadruped Robots

    Authors: Jie Tan, Tingnan Zhang, Erwin Coumans, Atil Iscen, Yunfei Bai, Danijar Hafner, Steven Bohez, Vincent Vanhoucke

    Abstract: Designing agile locomotion for quadruped robots often requires extensive expertise and tedious manual tuning. In this paper, we present a system to automate this process by leveraging deep reinforcement learning techniques. Our system can learn quadruped locomotion from scratch using simple reward signals. In addition, users can provide an open loop reference to guide the learning process when mor… ▽ More

    Submitted 16 May, 2018; v1 submitted 26 April, 2018; originally announced April 2018.

    Comments: Accompanying video: https://www.youtube.com/watch?v=lUZUr7jxoqM

  10. arXiv:1711.10761  [pdf, other

    cs.NE cs.CV

    Transfer Learning with Binary Neural Networks

    Authors: Sam Leroux, Steven Bohez, Tim Verbelen, Bert Vankeirsbilck, Pieter Simoens, Bart Dhoedt

    Abstract: Previous work has shown that it is possible to train deep neural networks with low precision weights and activations. In the extreme case it is even possible to constrain the network to binary values. The costly floating point multiplications are then reduced to fast logical operations. High end smart phones such as Google's Pixel 2 and Apple's iPhone X are already equipped with specialised hardwa… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

    Comments: Machine Learning on the Phone and other Consumer Devices, NIPS2017 Workshop

  11. arXiv:1708.02838  [pdf, other

    cs.AI

    Decoupled Learning of Environment Characteristics for Safe Exploration

    Authors: Pieter Van Molle, Tim Verbelen, Steven Bohez, Sam Leroux, Pieter Simoens, Bart Dhoedt

    Abstract: Reinforcement learning is a proven technique for an agent to learn a task. However, when learning a task using reinforcement learning, the agent cannot distinguish the characteristics of the environment from those of the task. This makes it harder to transfer skills between tasks in the same environment. Furthermore, this does not reduce risk when training for a new task. In this paper, we introdu… ▽ More

    Submitted 9 August, 2017; originally announced August 2017.

    Comments: 4 pages, 4 figures, ICML 2017 workshop on Reliable Machine Learning in the Wild

  12. arXiv:1703.04550  [pdf, other

    cs.RO cs.LG cs.NE eess.SY

    Sensor Fusion for Robot Control through Deep Reinforcement Learning

    Authors: Steven Bohez, Tim Verbelen, Elias De Coninck, Bert Vankeirsbilck, Pieter Simoens, Bart Dhoedt

    Abstract: Deep reinforcement learning is becoming increasingly popular for robot control algorithms, with the aim for a robot to self-learn useful feature representations from unstructured sensory input leading to the optimal actuation policy. In addition to sensors mounted on the robot, sensors might also be deployed in the environment, although these might need to be accessed via an unreliable wireless co… ▽ More

    Submitted 13 March, 2017; originally announced March 2017.

    Comments: 6 pages, 6 figures, submitted to IROS 2017

  13. arXiv:1605.08543  [pdf, other

    cs.CV cs.NE

    Lazy Evaluation of Convolutional Filters

    Authors: Sam Leroux, Steven Bohez, Cedric De Boom, Elias De Coninck, Tim Verbelen, Bert Vankeirsbilck, Pieter Simoens, Bart Dhoedt

    Abstract: In this paper we propose a technique which avoids the evaluation of certain convolutional filters in a deep neural network. This allows to trade-off the accuracy of a deep neural network with the computational and memory requirements. This is especially important on a constrained device unable to hold all the weights of the network in memory.

    Submitted 27 May, 2016; originally announced May 2016.

  14. arXiv:1605.02486  [pdf, other

    cs.NE

    Efficiency Evaluation of Character-level RNN Training Schedules

    Authors: Cedric De Boom, Sam Leroux, Steven Bohez, Pieter Simoens, Thomas Demeester, Bart Dhoedt

    Abstract: We present four training and prediction schedules from the same character-level recurrent neural network. The efficiency of these schedules is tested in terms of model effectiveness as a function of training time and amount of training data seen. We show that the choice of training and prediction schedule potentially has a considerable impact on the prediction effectiveness for a given training bu… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

    Comments: 3 pages, 3 figures

  15. Learning Semantic Similarity for Very Short Texts

    Authors: Cedric De Boom, Steven Van Canneyt, Steven Bohez, Thomas Demeester, Bart Dhoedt

    Abstract: Levering data on social media, such as Twitter and Facebook, requires information retrieval algorithms to become able to relate very short text fragments to each other. Traditional text similarity methods such as tf-idf cosine-similarity, based on word overlap, mostly fail to produce good results in this case, since word overlap is little or non-existent. Recently, distributed word representations… ▽ More

    Submitted 2 December, 2015; originally announced December 2015.

    Comments: 6 pages, 5 figures, 3 tables, ReLSD workshop at ICDM 15