Skip to main content

Showing 1–26 of 26 results for author: Buchli, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.06102  [pdf, other

    cs.RO cs.LG

    Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning

    Authors: Mohak Bhardwaj, Thomas Lampe, Michael Neunert, Francesco Romano, Abbas Abdolmaleki, Arunkumar Byravan, Markus Wulfmeier, Martin Riedmiller, Jonas Buchli

    Abstract: Recent advances in real-world applications of reinforcement learning (RL) have relied on the ability to accurately simulate systems at scale. However, domains such as fluid dynamical systems exhibit complex dynamic phenomena that are hard to simulate at high integration rates, limiting the direct application of modern deep RL algorithms to often expensive or safety critical hardware. In this work,… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  2. arXiv:2307.11546  [pdf, other

    physics.plasm-ph cs.LG

    Towards practical reinforcement learning for tokamak magnetic control

    Authors: Brendan D. Tracey, Andrea Michi, Yuri Chervonyi, Ian Davies, Cosmin Paduraru, Nevena Lazic, Federico Felici, Timo Ewalds, Craig Donner, Cristian Galperti, Jonas Buchli, Michael Neunert, Andrea Huber, Jonathan Evens, Paula Kurylowicz, Daniel J. Mankowitz, Martin Riedmiller, The TCV Team

    Abstract: Reinforcement learning (RL) has shown promising results for real-time control systems, including the domain of plasma magnetic control. However, there are still significant drawbacks compared to traditional feedback control approaches for magnetic confinement. In this work, we address key drawbacks of the RL method; achieving higher control accuracy for desired plasma properties, reducing the stea… ▽ More

    Submitted 5 October, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  3. arXiv:2110.10819  [pdf, other

    cs.LG cs.AI

    Shaking the foundations: delusions in sequence models for interaction and control

    Authors: Pedro A. Ortega, Markus Kunesch, Grégoire Delétang, Tim Genewein, Jordi Grau-Moya, Joel Veness, Jonas Buchli, Jonas Degrave, Bilal Piot, Julien Perolat, Tom Everitt, Corentin Tallec, Emilio Parisotto, Tom Erez, Yutian Chen, Scott Reed, Marcus Hutter, Nando de Freitas, Shane Legg

    Abstract: The recent phenomenal success of language models has reinvigorated machine learning research, and large sequence models such as transformers are being applied to a variety of domains. One important problem class that has remained relatively elusive however is purposeful adaptive behavior. Currently there is a common perception that sequence models "lack the understanding of the cause and effect of… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: DeepMind Tech Report, 16 pages, 4 figures

  4. arXiv:2010.05545  [pdf, other

    cs.LG cs.AI stat.ML

    Local Search for Policy Iteration in Continuous Control

    Authors: Jost Tobias Springenberg, Nicolas Heess, Daniel Mankowitz, Josh Merel, Arunkumar Byravan, Abbas Abdolmaleki, Jackie Kay, Jonas Degrave, Julian Schrittwieser, Yuval Tassa, Jonas Buchli, Dan Belov, Martin Riedmiller

    Abstract: We present an algorithm for local, regularized, policy improvement in reinforcement learning (RL) that allows us to formulate model-based and model-free variants in a single framework. Our algorithm can be interpreted as a natural extension of work on KL-regularized RL and introduces a form of tree search for continuous action spaces. We demonstrate that additional computation spent on model-based… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

  5. arXiv:2001.00449  [pdf, other

    cs.LG cs.RO stat.ML

    Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics

    Authors: Michael Neunert, Abbas Abdolmaleki, Markus Wulfmeier, Thomas Lampe, Jost Tobias Springenberg, Roland Hafner, Francesco Romano, Jonas Buchli, Nicolas Heess, Martin Riedmiller

    Abstract: Many real-world control problems involve both discrete decision variables - such as the choice of control modes, gear switching or digital outputs - as well as continuous decision variables - such as velocity setpoints, control gains or analogue outputs. However, when defining the corresponding optimal control or reinforcement learning problem, it is commonly approximated with fully continuous or… ▽ More

    Submitted 2 January, 2020; originally announced January 2020.

    Comments: Presented at the 3rd Conference on Robot Learning (CoRL 2019), Osaka, Japan. Video: https://youtu.be/eUqQDLQXb7I

  6. arXiv:1902.04623  [pdf, other

    cs.RO

    Value constrained model-free continuous control

    Authors: Steven Bohez, Abbas Abdolmaleki, Michael Neunert, Jonas Buchli, Nicolas Heess, Raia Hadsell

    Abstract: The naive application of Reinforcement Learning algorithms to continuous control problems -- such as locomotion and manipulation -- often results in policies which rely on high-amplitude, high-frequency control signals, known colloquially as bang-bang control. Although such solutions may indeed maximize task reward, they can be unsuitable for real world systems. Bang-bang control may lead to incre… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

  7. Nonlinear disturbance attenuation control of hydraulic robotics

    Authors: Peng Lu, Timothy Sandy, Jonas Buchli

    Abstract: This paper presents a novel nonlinear disturbance rejection control for hydraulic robots. This method requires two third-order filters as well as inverse dynamics in order to estimate the disturbances. All the parameters for the third-order filters are pre-defined. The proposed method is nonlinear, which does not require the linearization of the rigid body dynamics. The estimated disturbances are… ▽ More

    Submitted 4 August, 2018; originally announced August 2018.

  8. ConFusion: Sensor Fusion for Complex Robotic Systems using Nonlinear Optimization

    Authors: Timothy Sandy, Lukas Stadelmann, Simon Kerscher, Jonas Buchli

    Abstract: We present ConFusion, an open-source package for online sensor fusion for robotic applications. ConFusion is a modular framework for fusing measurements from many heterogeneous sensors within a moving horizon estimator. ConFusion offers greater flexibility in sensor fusion problem design than filtering-based systems and the ability to scale the online estimate quality with the available computing… ▽ More

    Submitted 1 March, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

    Journal ref: IEEE Robotics and Automation Letters, 2019, Volume 4, Number 2, Pages 1093-1100

  9. A Projection Approach to Equality Constrained Iterative Linear Quadratic Optimal Control

    Authors: Markus Giftthaler, Jonas Buchli

    Abstract: This paper presents a state and state-input constrained variant of the discrete-time iterative Linear Quadratic Regulator (iLQR) algorithm, with linear time-complexity in the number of time steps. The approach is based on a projection of the control input onto the nullspace of the linearized constraints. We derive a fully constraint-compliant feedforward-feedback control update rule, for which we… ▽ More

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: Corrected version, fixes a typo in Eq. (11)-(12)

    Journal ref: 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)

  10. The Control Toolbox - An Open-Source C++ Library for Robotics, Optimal and Model Predictive Control

    Authors: Markus Giftthaler, Michael Neunert, Markus Stäuble, Jonas Buchli

    Abstract: We introduce the Control Toolbox (CT), an open-source C++ library for efficient modeling, control, estimation, trajectory optimization and Model Predictive Control. The CT is applicable to a broad class of dynamic systems but features interfaces to modeling tools specifically designed for robotic applications. This paper outlines the general concept of the toolbox, its main building blocks, and hi… ▽ More

    Submitted 26 March, 2018; v1 submitted 12 January, 2018; originally announced January 2018.

  11. Whole-Body Nonlinear Model Predictive Control Through Contacts for Quadrupeds

    Authors: Michael Neunert, Markus Stäuble, Markus Giftthaler, Carmine D. Bellicoso, Jan Carius, Christian Gehring, Marco Hutter, Jonas Buchli

    Abstract: In this work we present a whole-body Nonlinear Model Predictive Control approach for Rigid Body Systems subject to contacts. We use a full dynamic system model which also includes explicit contact dynamics. Therefore, contact locations, sequences and timings are not prespecified but optimized by the solver. Yet, thorough numerical and software engineering allows for running the nonlinear Optimal C… ▽ More

    Submitted 7 December, 2017; originally announced December 2017.

    Comments: Submitted to "Robotics and Automation: Letters" / "International Conference on Robotics and Automation 2018"

  12. arXiv:1711.11006  [pdf, other

    eess.SY cs.RO math.OC

    A Family of Iterative Gauss-Newton Shooting Methods for Nonlinear Optimal Control

    Authors: Markus Giftthaler, Michael Neunert, Markus Stäuble, Jonas Buchli, Moritz Diehl

    Abstract: This paper introduces a family of iterative algorithms for unconstrained nonlinear optimal control. We generalize the well-known iLQR algorithm to different multiple-shooting variants, combining advantages like straight-forward initialization and a closed-loop forward integration. All algorithms have similar computational complexity, i.e. linear complexity in the time horizon, and can be derived i… ▽ More

    Submitted 11 December, 2017; v1 submitted 29 November, 2017; originally announced November 2017.

    Comments: 8 pages

  13. Real-Time Motion Planning of Legged Robots: A Model Predictive Control Approach

    Authors: Farbod Farshidian, Edo Jelavić, Asutosh Satapathy, Markus Giftthaler, Jonas Buchli

    Abstract: We introduce a real-time, constrained, nonlinear Model Predictive Control for the motion planning of legged robots. The proposed approach uses a constrained optimal control algorithm known as SLQ. We improve the efficiency of this algorithm by introducing a multi-processing scheme for estimating value function in its backward pass. This pass has been often calculated as a single process. This para… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.

    Comments: 8 pages

  14. Automatic Differentiation of Rigid Body Dynamics for Optimal Control and Estimation

    Authors: Markus Giftthaler, Michael Neunert, Markus Stäuble, Marco Frigerio, Claudio Semini, Jonas Buchli

    Abstract: Many algorithms for control, optimization and estimation in robotics depend on derivatives of the underlying system dynamics, e.g. to compute linearizations, sensitivities or gradient directions. However, we show that when dealing with Rigid Body Dynamics, these derivatives are difficult to derive analytically and to implement efficiently. To overcome this issue, we extend the modelling tool `RobC… ▽ More

    Submitted 16 January, 2018; v1 submitted 12 September, 2017; originally announced September 2017.

    Journal ref: Advanced Robotics, November 2017, Taylor and Francis

  15. arXiv:1708.09342  [pdf, ps, other

    eess.SY cs.LG cs.RO math.OC

    Optimal and Learning Control for Autonomous Robots

    Authors: Jonas Buchli, Farbod Farshidian, Alexander Winkler, Timothy Sandy, Markus Giftthaler

    Abstract: Optimal and Learning Control for Autonomous Robots has been taught in the Robotics, Systems and Controls Masters at ETH Zurich with the aim to teach optimal control and reinforcement learning for closed loop control problems from a unified point of view. The starting point is the formulation of of an optimal control problem and deriving the different types of solutions and algorithms from there. T… ▽ More

    Submitted 30 August, 2017; originally announced August 2017.

    Comments: Lecture Notes, 101 pages

  16. arXiv:1705.10313  [pdf, other

    cs.RO math.OC

    Fast Trajectory Optimization for Legged Robots using Vertex-based ZMP Constraints

    Authors: Alexander W Winkler, Farbod Farshidian, Diego Pardo, Michael Neunert, Jonas Buchli

    Abstract: This paper combines the fast Zero-Moment-Point (ZMP) approaches that work well in practice with the broader range of capabilities of a Trajectory Optimization formulation, by optimizing over body motion, footholds and Center of Pressure simultaneously. We introduce a vertex-based representation of the support-area constraint, which can treat arbitrarily oriented point-, line-, and area-contacts un… ▽ More

    Submitted 27 May, 2017; originally announced May 2017.

    Comments: currently under review for IEEE RA-L

  17. Robust Whole-Body Motion Control of Legged Robots

    Authors: Farbod Farshidian, Edo Jelavić, Alexander W. Winkler, Jonas Buchli

    Abstract: We introduce a robust control architecture for the whole-body motion control of torque controlled robots with arms and legs. The method is based on the robust control of contact forces in order to track a planned Center of Mass trajectory. Its appeal lies in the ability to guarantee robust stability and performance despite rigid body model mismatch, actuator dynamics, delays, contact surface stiff… ▽ More

    Submitted 7 March, 2017; originally announced March 2017.

    Comments: 8 Pages

  18. Efficient Kinematic Planning for Mobile Manipulators with Non-holonomic Constraints Using Optimal Control

    Authors: Markus Giftthaler, Farbod Farshidian, Timothy Sandy, Lukas Stadelmann, Jonas Buchli

    Abstract: This work addresses the problem of kinematic trajectory planning for mobile manipulators with non-holonomic constraints, and holonomic operational-space tracking constraints. We obtain whole-body trajectories and time-varying kinematic feedback controllers by solving a Constrained Sequential Linear Quadratic Optimal Control problem. The employed algorithm features high efficiency through a continu… ▽ More

    Submitted 16 January, 2018; v1 submitted 27 January, 2017; originally announced January 2017.

    Comments: 7 pages

  19. Mobile Robotic Fabrication at 1:1 scale: the In situ Fabricator

    Authors: Markus Giftthaler, Timothy Sandy, Kathrin Dörfler, Ian Brooks, Mark Buckingham, Gonzalo Rey, Matthias Kohler, Fabio Gramazio, Jonas Buchli

    Abstract: This paper presents the concept of an In situ Fabricator, a mobile robot intended for on-site manufacturing, assembly and digital fabrication. We present an overview of a prototype system, its capabilities, and highlight the importance of high-performance control, estimation and planning algorithms for achieving desired construction goals. Next, we detail on two architectural application scenarios… ▽ More

    Submitted 13 January, 2017; originally announced January 2017.

  20. An Efficient Optimal Planning and Control Framework For Quadrupedal Locomotion

    Authors: Farbod Farshidian, Michael Neunert, Alexander W. Winkler, Gonzalo Rey, Jonas Buchli

    Abstract: In this paper, we present an efficient Dynamic Programing framework for optimal planning and control of legged robots. First we formulate this problem as an optimal control problem for switched systems. Then we propose a multi--level optimization approach to find the optimal switching times and the optimal continuous control inputs. Through this scheme, the decomposed optimization can potentially… ▽ More

    Submitted 4 March, 2017; v1 submitted 30 September, 2016; originally announced September 2016.

    Comments: 8 Pages

  21. arXiv:1607.04537  [pdf, other

    cs.RO

    Trajectory Optimization Through Contacts and Automatic Gait Discovery for Quadrupeds

    Authors: Michael Neunert, Farbod Farshidian, Alexander W. Winkler, Jonas Buchli

    Abstract: In this work we present a trajectory Optimization framework for whole-body motion planning through contacts. We demonstrate how the proposed approach can be applied to automatically discover different gaits and dynamic motions on a quadruped robot. In contrast to most previous methods, we do not pre-specify contact switches, timings, points or gait patterns, but they are a direct outcome of the op… ▽ More

    Submitted 15 July, 2016; originally announced July 2016.

    Comments: Video: https://youtu.be/sILuqJBsyKs

  22. arXiv:1510.01625  [pdf, other

    cs.RO

    Projection based whole body motion planning for legged robots

    Authors: Diego Pardo, Michael Neunert, Alexander W. Winkler, Jonas Buchli

    Abstract: In this paper we present a new approach for dynamic motion planning for legged robots. We formulate a trajectory optimization problem based on a compact form of the robot dynamics. Such a form is obtained by projecting the rigid body dynamics onto the null space of the Constraint Jacobian. As consequence of the projection, contact forces are removed from the model but their effects are still taken… ▽ More

    Submitted 6 October, 2015; originally announced October 2015.

  23. arXiv:1507.02081  [pdf, other

    cs.RO

    An Open Source, Fiducial Based, Visual-Inertial Motion Capture System

    Authors: Michael Neunert, Michael Bloesch, Jonas Buchli

    Abstract: Many robotic tasks rely on the accurate localization of moving objects within a given workspace. This information about the objects' poses and velocities are used for control,motion planning, navigation, interaction with the environment or verification. Often motion capture systems are used to obtain such a state estimate. However, these systems are often costly, limited in workspace size and not… ▽ More

    Submitted 13 June, 2016; v1 submitted 8 July, 2015; originally announced July 2015.

    Comments: To appear in The International Conference on Information Fusion (FUSION) 2016

  24. Evaluating direct transcription and nonlinear optimization methods for robot motion planning

    Authors: Diego Pardo, Lukas Möller, Michael Neunert, Alexander W. Winkler, Jonas Buchli

    Abstract: This paper studies existing direct transcription methods for trajectory optimization applied to robot motion planning. There are diverse alternatives for the implementation of direct transcription. In this study we analyze the effects of such alternatives when solving a robotics problem. Different parameters such as integration scheme, number of discretization nodes, initialization strategies and… ▽ More

    Submitted 29 January, 2016; v1 submitted 22 April, 2015; originally announced April 2015.

  25. Robot Impedance Control and Passivity Analysis with Inner Torque and Velocity Feedback Loops

    Authors: Michele Focchi, Gustavo A. Medrano-Cerda, Thiago Boaventura, Marco Frigerio, Jonas Buchli, Darwin G. Caldwell, Claudio Semini

    Abstract: Impedance control is a well-established technique to control interaction forces in robotics. However, real implementations of impedance control with an inner loop may suffer from several limitations. Although common practice in designing nested control systems is to maximize the bandwidth of the inner loop to improve tracking performance, it may not be the most suitable approach when a certain ran… ▽ More

    Submitted 23 May, 2016; v1 submitted 16 June, 2014; originally announced June 2014.

    Comments: 14 pages in Control Theory and Technology (2016)

  26. arXiv:1301.7190  [pdf, other

    cs.RO cs.PL

    A Domain Specific Language for kinematic models and fast implementations of robot dynamics algorithms

    Authors: Marco Frigerio, Jonas Buchli, Darwin G. Caldwell

    Abstract: Rigid body dynamics algorithms play a crucial role in several components of a robot controller and simulations. Real time constraints in high frequency control loops and time requirements of specific applications demand these functions to be very efficient. Despite the availability of established algorithms, their efficient implementation for a specific robot still is a tedious and error-prone tas… ▽ More

    Submitted 30 January, 2013; originally announced January 2013.

    Comments: Presented at DSLRob 2011 (arXiv:1212.3308)

    Report number: DSLRob/2011/02