-
Translation of Nagumo's Foundational Work on Barrier Functions: On the Location of Integral Curves of Ordinary Differential Equations
Authors:
Marcel Menner,
Eugene Lavretsky
Abstract:
In 1942, Prof. Mitio Nagumo published his seminal paper on the location of integral curves of ordinary differential equations. Nagumo's paper provides the foundation of the set invariance of ordinary differential equations and barrier functions, which have recently gained popularity for the control design of safety-critical dynamical systems. This translation shall serve the community with an easily accessible version of the original 1942 paper in English. A copy of Nagumo's paper in German is also attached as a reference. That copy was created by the Boeing Company, Germany, in an attempt to improve the PDF format readability of the original paper.
Submitted 25 June, 2024;
originally announced June 2024.
-
Simultaneous State Estimation and Contact Detection for Legged Robots by Multiple-Model Kalman Filtering
Authors:
Marcel Menner,
Karl Berntorp
Abstract:
This paper proposes an algorithm for combined contact detection and state estimation for legged robots. The proposed algorithm models the robot's movement as a switched system, in which different modes relate to different feet being in contact with the ground. The key element in the proposed algorithm is an interacting multiple-model Kalman filter, which identifies the currently active mode defining contacts while estimating the state. The rationale for the proposed estimation framework is that contacts (and contact forces) impact the robot's state and vice versa. This paper presents validation studies with a quadruped using (i) the high-fidelity simulator Gazebo for a comparison with ground truth values and a baseline estimator, and (ii) hardware experiments with the Unitree A1 robot. The simulation study shows that the proposed algorithm outperforms the baseline estimator, which does not simultaneously detect contacts. The hardware experiments showcase the applicability of the proposed algorithm and highlight its ability to detect contacts.
Submitted 4 April, 2024;
originally announced April 2024.
-
Gaussian Processes with State-Dependent Noise for Stochastic Control
Authors:
Marcel Menner,
Karl Berntorp
Abstract:
This paper considers a stochastic control framework, in which the residual model uncertainty of the dynamical system is learned using a Gaussian Process (GP). In the proposed formulation, the residual model uncertainty consists of a nonlinear function and state-dependent noise. The proposed formulation uses a posterior-GP to approximate the residual model uncertainty and a prior-GP to account for state-dependent noise. The two GPs are interdependent and are thus learned jointly using an iterative algorithm. Theoretical properties of the iterative algorithm are established. Advantages of the proposed state-dependent formulation include (i) faster convergence of the GP estimate to the unknown function as the GP learns which data samples are more trustworthy and (ii) an accurate estimate of state-dependent noise, which can, e.g., be useful for a controller or decision-maker to determine the uncertainty of an action. Simulation studies highlight these two advantages.
Submitted 25 May, 2023;
originally announced May 2023.
-
Friction-Adaptive Stochastic Nonlinear Model Predictive Control for Autonomous Vehicles
Authors:
Sean Vaskov,
Rien Quirynen,
Marcel Menner,
Karl Berntorp
Abstract:
This paper addresses the trajectory-tracking problem under uncertain road-surface conditions for autonomous vehicles. We propose a stochastic nonlinear model predictive controller (SNMPC) that learns a tire-road friction model online using standard automotive-grade sensors. Learning the entire tire-road friction model in real time requires driving in the nonlinear, potentially unstable regime of the vehicle dynamics, using a prediction model that may not have fully converged. To handle this, we formulate the tire-friction model learning in a Bayesian framework, and propose two estimators that learn different aspects of the tire-road friction. The estimators output the estimate of the tire-friction model as well as the uncertainty of the estimate, which expresses the confidence in the model for different driving regimes. The SNMPC exploits the uncertainty estimate in its prediction model to take proper action when the uncertainty is large. We validate the approach in an extensive Monte-Carlo study using real vehicle parameters and in CarSim. The results when comparing to various MPC approaches indicate a substantial reduction in constraint violations, as well as a reduction in closed-loop cost. We also demonstrate the real-time feasibility in automotive-grade processors using a dSPACE MicroAutoBox-II rapid prototyping unit, showing a worst-case computation time of roughly 40 ms.
Submitted 5 May, 2023;
originally announced May 2023.
-
Real-to-Sim: Predicting Residual Errors of Robotic Systems with Sparse Data using a Learning-based Unscented Kalman Filter
Authors:
Alexander Schperberg,
Yusuke Tanaka,
Feng Xu,
Marcel Menner,
Dennis Hong
Abstract:
Achieving highly accurate dynamic or simulator models that are close to the real robot can facilitate model-based controls (e.g., model predictive control or linear-quadratic regulators), model-based trajectory planning (e.g., trajectory optimization), and decrease the amount of learning time necessary for reinforcement learning methods. Thus, the objective of this work is to learn the residual errors between a dynamic and/or simulator model and the real robot. This is achieved using a neural network whose parameters are updated through an Unscented Kalman Filter (UKF) formulation. Using this method, we model these residual errors with only small amounts of data -- a necessity as we improve the simulator/dynamic model by learning directly from real-world operation. We demonstrate our method on robotic hardware (e.g., a manipulator arm and a wheeled robot), and show that with the learned residual errors, we can further close the reality gap between dynamic models, simulations, and actual hardware.
Submitted 7 May, 2023; v1 submitted 7 September, 2022;
originally announced September 2022.
-
Mobility, Communication and Computation Aware Federated Learning for Internet of Vehicles
Authors:
Md Ferdous Pervej,
Jianlin Guo,
Kyeong Jin Kim,
Kieran Parsons,
Philip Orlik,
Stefano Di Cairano,
Marcel Menner,
Karl Berntorp,
Yukimasa Nagai,
Huaiyu Dai
Abstract:
While privacy concerns entice connected and automated vehicles to incorporate on-board federated learning (FL) solutions, an integrated vehicle-to-everything communication with heterogeneous computation power aware learning platform is urgently necessary to make it a reality. Motivated by this, we propose a novel mobility, communication and computation aware online FL platform that uses on-road vehicles as learning agents. Thanks to the advanced features of modern vehicles, the on-board sensors can collect data as vehicles travel along their trajectories, while the on-board processors can train machine learning models using the collected data. To take the high mobility of vehicles into account, we consider the delay as a learning parameter and restrict it to be less than a tolerable threshold. To satisfy this threshold, the central server accepts partially trained models, the distributed roadside units (a) perform downlink multicast beamforming to minimize global model distribution delay and (b) allocate optimal uplink radio resources to minimize local model offloading delay, and the vehicle agents conduct heterogeneous local model training. Using real-world vehicle trace datasets, we validate our FL solutions. Simulation shows that the proposed integrated FL platform is robust and outperforms baseline models. With reasonable local training episodes, it can effectively satisfy all constraints and deliver near ground truth multi-horizon velocity and vehicle-specific power predictions.
Submitted 17 May, 2022;
originally announced May 2022.
-
Automated Controller Calibration by Kalman Filtering
Authors:
Marcel Menner,
Karl Berntorp,
Stefano Di Cairano
Abstract:
This paper proposes a method for calibrating control parameters. Examples of such control parameters are gains of PID controllers, weights of a cost function for optimal control, filter coefficients, the sliding surface of a sliding mode controller, or weights of a neural network. Hence, the proposed method can be applied to a wide range of controllers. The method uses a Kalman filter that estimates control parameters, using data of closed-loop system operation. The control parameter calibration is driven by a training objective, which encompasses specifications on the performance of the dynamical system. The performance-driven calibration method tunes the parameters online and robustly, is computationally efficient, has low data storage requirements, and is easy to implement, making it appealing for many real-time applications. Simulation results show that the method is able to learn control parameters quickly, is able to tune the parameters to compensate for disturbances, and is robust to noise. A simulation study with the high-fidelity vehicle simulator CarSim shows that the method can calibrate controllers of a complex dynamical system online, which indicates its applicability to a real-world system. We also verify the real-time feasibility on an embedded platform with automotive-grade processors by implementing our method on a dSPACE MicroAutoBox-II rapid prototyping unit.
Submitted 8 March, 2023; v1 submitted 21 November, 2021;
originally announced November 2021.
-
Volume Control of Low-Cost Ventilator with Automatic Set-Point Adaptation
Authors:
Lukas Hewing,
Marcel Menner,
Nikolaos Tachatos,
Marianne Schmid Daners,
Cosima du Pasquier,
Thomas S. Lumpe,
Kristina Shea,
Andrea Carron,
Melanie N. Zeilinger
Abstract:
This paper considers the control design for a low-cost ventilator that is based on a manual resuscitator bag (also known as AmbuBag) to pump air into the lungs of a patient who is physically unable to breathe. First, it experimentally shows that for accurately tracking tidal volumes, the controller needs to be adapted to the individual patient and the different configurations, e.g., hardware or operation modes. Second, it proposes a set-point adaptation algorithm that uses sensor measurements of a flow meter to automatically adapt the controller to the setup at hand. Third, it experimentally shows that such an adaptive solution improves the performance of the ventilator for various setups. One objective of this paper is to increase awareness of the need for feedback control using sensor measurements in low-cost ventilator solutions in order to automatically adapt to the specific scenario.
Submitted 3 September, 2020;
originally announced September 2020.
-
Maximum Likelihood Methods for Inverse Learning of Optimal Controllers
Authors:
Marcel Menner,
Melanie N. Zeilinger
Abstract:
This paper presents a framework for inverse learning of objective functions for constrained optimal control problems, which is based on the Karush-Kuhn-Tucker (KKT) conditions. We discuss three variants corresponding to different model assumptions and computational complexities. The first method uses a convex relaxation of the KKT conditions and serves as the benchmark. The main contribution of this paper is the proposition of two learning methods that combine the KKT conditions with maximum likelihood estimation. The key benefit of this combination is the systematic treatment of constraints for learning from noisy data with a branch-and-bound algorithm using likelihood arguments. This paper discusses theoretic properties of the learning methods and presents simulation results that highlight the advantages of using the maximum likelihood formulation for learning objective functions.
Submitted 6 May, 2020;
originally announced May 2020.
-
Knee Compliance Reduces Peak Swing Phase Collision Forces in a Lower-Limb Exoskeleton Leg: A Test Bench Evaluation
Authors:
Stefan O. Schrade,
Marcel Menner,
Camila Shirota,
Peter Winiger,
Alex Stutz,
Melanie N. Zeilinger,
Olivier Lambercy,
Roger Gassert
Abstract:
Powered lower limb exoskeletons are a viable solution for people with a spinal cord injury to regain mobility for their daily activities. However, the commonly employed rigid actuation and pre-programmed trajectories increase the risk of falling in case of collisions with external objects. Compliant actuation may reduce forces during collisions, thus protecting hardware and user. However, experimental data of collisions specific to lower limb exoskeletons are not available. In this work, we investigated how a variable stiffness actuator at the knee joint influences collision forces transmitted to the user via the exoskeleton. In a test bench experiment, we compared three configurations of an exoskeleton leg: a variable stiffness knee actuator in (i) compliant or (ii) stiff configurations, and (iii) a rigid actuator. The peak torque observed at the pelvis was reduced from 260.2 Nm to 116.2 Nm as stiffness decreased. In addition, the mechanical impulse was reduced by a factor of three. These results indicate that compliance in the knee joint of an exoskeleton can be favorable in case of collision and should be considered when designing powered lower limb exoskeletons. Overall, this could decrease the effort necessary to maintain balance after a collision, and improved collision handling in exoskeletons could result in safer use and benefit their usefulness in daily life.
Submitted 12 September, 2019;
originally announced September 2019.
-
Using Human Ratings for Feedback Control: A Supervised Learning Approach with Application to Rehabilitation Robotics
Authors:
Marcel Menner,
Lukas Neuner,
Lars Lünenburger,
Melanie N. Zeilinger
Abstract:
This paper presents a method for tailoring a parametric controller based on human ratings. The method leverages supervised learning concepts in order to train a reward model from data. It is applied to a gait rehabilitation robot with the goal of teaching the robot how to walk patients physiologically. In this context, the reward model judges the physiology of the gait cycle (instead of therapists) using sensor measurements provided by the robot, and the automatic feedback controller chooses the input settings of the robot to maximize the reward. The key advantage of the proposed method is that only a few input adaptations are necessary to achieve a physiological gait cycle. Experiments with nondisabled subjects show that the proposed method permits incorporating human expertise into a control law and automatically walking patients physiologically.
Submitted 14 January, 2020; v1 submitted 24 June, 2019;
originally announced June 2019.
-
Constrained Inverse Optimal Control with Application to a Human Manipulation Task
Authors:
Marcel Menner,
Peter Worsnop,
Melanie N. Zeilinger
Abstract:
This paper presents an inverse optimal control methodology and its application to training a predictive model of human motor control from a manipulation task. It introduces a convex formulation for learning both objective function and constraints of an infinite-horizon constrained optimal control problem with nonlinear system dynamics. The inverse approach utilizes Bellman's principle of optimality to formulate the infinite-horizon optimal control problem as a shortest path problem and Lagrange multipliers to identify constraints. We highlight the key benefit of using the shortest path formulation, i.e., the possibility of training the predictive model with short and selected trajectory segments. The method is applied to training a predictive model of movements of a human subject from a manipulation task. The study indicates that individual human movements can be predicted with low error using an infinite-horizon optimal control problem with constraints on shoulder movement.
Submitted 3 December, 2019; v1 submitted 30 December, 2018;
originally announced December 2018.