-
Computationally Efficient System Level Tube-MPC for Uncertain Systems
Authors:
Jerome Sieber,
Alexandre Didier,
Melanie N. Zeilinger
Abstract:
Tube-based model predictive control (MPC) is the principal robust control technique for constrained linear systems affected by additive disturbances. While tube-based methods that compute the tubes online have been successfully applied to systems with additive disturbances, their application to systems affected by additional model uncertainties is challenging. This paper introduces a new tube-base…
▽ More
Tube-based model predictive control (MPC) is the principal robust control technique for constrained linear systems affected by additive disturbances. While tube-based methods that compute the tubes online have been successfully applied to systems with additive disturbances, their application to systems affected by additional model uncertainties is challenging. This paper introduces a new tube-based MPC method - named filter-based system level tube-MPC (SLTMPC) - which overapproximates both uncertainties with an online optimized disturbance set, while simultaneously computing the tube controller online. Extending prior work, we generalize the method to polytopic disturbance sets and for the first time provide rigorous closed-loop guarantees for the receding horizon controller. These guarantees are obtained by virtue of a new terminal controller design and an online optimized terminal set. To reduce the computational complexity of the proposed method, we additionally introduce an asynchronous computation scheme that separates the optimization of the tube controller and the nominal trajectory. Finally, we provide a comprehensive numerical evaluation of the proposed methods to demonstrate their effectiveness.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Model predictive control for tracking using artificial references: Fundamentals, recent results and practical implementation
Authors:
Pablo Krupa,
Johannes Köhler,
Antonio Ferramosca,
Ignacio Alvarado,
Melanie N. Zeilinger,
Teodoro Alamo,
Daniel Limon
Abstract:
This paper provides a comprehensive tutorial on a family of Model Predictive Control (MPC) formulations, known as MPC for tracking, which are characterized by including an artificial reference as part of the decision variables in the optimization problem. These formulations have several benefits with respect to the classical MPC formulations, including guaranteed recursive feasibility under online…
▽ More
This paper provides a comprehensive tutorial on a family of Model Predictive Control (MPC) formulations, known as MPC for tracking, which are characterized by including an artificial reference as part of the decision variables in the optimization problem. These formulations have several benefits with respect to the classical MPC formulations, including guaranteed recursive feasibility under online reference changes, as well as asymptotic stability and an increased domain of attraction. This tutorial paper introduces the concept of using an artificial reference in MPC, presenting the benefits and theoretical guarantees obtained by its use. We then provide a survey of the main advances and extensions of the original linear MPC for tracking, including its non-linear extension. Additionally, we discuss its application to learning-based MPC, and discuss optimization aspects related to its implementation.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks
Authors:
Jerome Sieber,
Carmen Amo Alonso,
Alexandre Didier,
Melanie N. Zeilinger,
Antonio Orvieto
Abstract:
Softmax attention is the principle backbone of foundation models for various artificial intelligence applications, yet its quadratic complexity in sequence length can limit its inference throughput in long-context settings. To address this challenge, alternative architectures such as linear attention, State Space Models (SSMs), and Recurrent Neural Networks (RNNs) have been considered as more effi…
▽ More
Softmax attention is the principle backbone of foundation models for various artificial intelligence applications, yet its quadratic complexity in sequence length can limit its inference throughput in long-context settings. To address this challenge, alternative architectures such as linear attention, State Space Models (SSMs), and Recurrent Neural Networks (RNNs) have been considered as more efficient alternatives. While connections between these approaches exist, such models are commonly developed in isolation and there is a lack of theoretical understanding of the shared principles underpinning these architectures and their subtle differences, greatly influencing performance and scalability. In this paper, we introduce the Dynamical Systems Framework (DSF), which allows a principled investigation of all these architectures in a common representation. Our framework facilitates rigorous comparisons, providing new insights on the distinctive characteristics of each model class. For instance, we compare linear attention and selective SSMs, detailing their differences and conditions under which both are equivalent. We also provide principled comparisons between softmax attention and other model classes, discussing the theoretical conditions under which softmax attention can be approximated. Additionally, we substantiate these new insights with empirical validations and mathematical arguments. This shows the DSF's potential to guide the systematic development of future more efficient and scalable foundation models.
△ Less
Submitted 3 June, 2024; v1 submitted 24 May, 2024;
originally announced May 2024.
-
Optimization-Based System Identification and Moving Horizon Estimation Using Low-Cost Sensors for a Miniature Car-Like Robot
Authors:
Sabrina Bodmer,
Lukas Vogel,
Simon Muntwiler,
Alexander Hansson,
Tobias Bodewig,
Jonas Wahlen,
Melanie N. Zeilinger,
Andrea Carron
Abstract:
This paper presents an open-source miniature car-like robot with low-cost sensing and a pipeline for optimization-based system identification, state estimation, and control. The overall robotics platform comes at a cost of less than $700 and thus significantly simplifies the verification of advanced algorithms in a realistic setting. We present a modified bicycle model with Pacejka tire forces to…
▽ More
This paper presents an open-source miniature car-like robot with low-cost sensing and a pipeline for optimization-based system identification, state estimation, and control. The overall robotics platform comes at a cost of less than $700 and thus significantly simplifies the verification of advanced algorithms in a realistic setting. We present a modified bicycle model with Pacejka tire forces to model the dynamics of the considered all-wheel drive vehicle and to prevent singularities of the model at low velocities. Furthermore, we provide an optimization-based system identification approach and a moving horizon estimation (MHE) scheme. In extensive hardware experiments, we show that the presented system identification approach results in a model with high prediction accuracy, while the MHE results in accurate state estimates. Finally, the overall closed-loop system is shown to perform well even in the presence of sensor failure for limited time intervals. All hardware, firmware, and control and estimation software is released under a BSD 2-clause license to promote widespread adoption and collaboration within the community.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
Perfecting Periodic Trajectory Tracking: Model Predictive Control with a Periodic Observer ($Π$-MPC)
Authors:
Luis Pabon,
Johannes Köhler,
John Irvin Alora,
Patrick Benito Eberhard,
Andrea Carron,
Melanie N. Zeilinger,
Marco Pavone
Abstract:
In Model Predictive Control (MPC), discrepancies between the actual system and the predictive model can lead to substantial tracking errors and significantly degrade performance and reliability. While such discrepancies can be alleviated with more complex models, this often complicates controller design and implementation. By leveraging the fact that many trajectories of interest are periodic, we…
▽ More
In Model Predictive Control (MPC), discrepancies between the actual system and the predictive model can lead to substantial tracking errors and significantly degrade performance and reliability. While such discrepancies can be alleviated with more complex models, this often complicates controller design and implementation. By leveraging the fact that many trajectories of interest are periodic, we show that perfect tracking is possible when incorporating a simple observer that estimates and compensates for periodic disturbances. We present the design of the observer and the accompanying tracking MPC scheme, proving that their combination achieves zero tracking error asymptotically, regardless of the complexity of the unmodelled dynamics. We validate the effectiveness of our method, demonstrating asymptotically perfect tracking on a high-dimensional soft robot with nearly 10,000 states and a fivefold reduction in tracking errors compared to a baseline MPC on small-scale autonomous race car experiments.
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
Adaptive Economic Model Predictive Control for linear systems with performance guarantees
Authors:
Maximilian Degner,
Raffaele Soloperto,
Melanie N. Zeilinger,
John Lygeros,
Johannes Köhler
Abstract:
We present a model predictive control (MPC) formulation to directly optimize economic criteria for linear constrained systems subject to disturbances and uncertain model parameters. The proposed formulation combines a certainty equivalent economic MPC with a simple least-squares parameter adaptation. For the resulting adaptive economic MPC scheme, we derive strong asymptotic and transient performa…
▽ More
We present a model predictive control (MPC) formulation to directly optimize economic criteria for linear constrained systems subject to disturbances and uncertain model parameters. The proposed formulation combines a certainty equivalent economic MPC with a simple least-squares parameter adaptation. For the resulting adaptive economic MPC scheme, we derive strong asymptotic and transient performance guarantees. We provide a numerical example involving building temperature control and demonstrate performance benefits of online parameter adaptation.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
State Space Models as Foundation Models: A Control Theoretic Overview
Authors:
Carmen Amo Alonso,
Jerome Sieber,
Melanie N. Zeilinger
Abstract:
In recent years, there has been a growing interest in integrating linear state-space models (SSM) in deep neural network architectures of foundation models. This is exemplified by the recent success of Mamba, showing better performance than the state-of-the-art Transformer architectures in language tasks. Foundation models, like e.g. GPT-4, aim to encode sequential data into a latent space in orde…
▽ More
In recent years, there has been a growing interest in integrating linear state-space models (SSM) in deep neural network architectures of foundation models. This is exemplified by the recent success of Mamba, showing better performance than the state-of-the-art Transformer architectures in language tasks. Foundation models, like e.g. GPT-4, aim to encode sequential data into a latent space in order to learn a compressed representation of the data. The same goal has been pursued by control theorists using SSMs to efficiently model dynamical systems. Therefore, SSMs can be naturally connected to deep sequence modeling, offering the opportunity to create synergies between the corresponding research areas. This paper is intended as a gentle introduction to SSM-based architectures for control theorists and summarizes the latest research developments. It provides a systematic review of the most successful SSM proposals and highlights their main features from a control theoretic perspective. Additionally, we present a comparative analysis of these models, evaluating their performance on a standardized benchmark designed for assessing a model's efficiency at learning long sequences.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Safe Guaranteed Exploration for Non-linear Systems
Authors:
Manish Prajapat,
Johannes Köhler,
Matteo Turchetta,
Andreas Krause,
Melanie N. Zeilinger
Abstract:
Safely exploring environments with a-priori unknown constraints is a fundamental challenge that restricts the autonomy of robots. While safety is paramount, guarantees on sufficient exploration are also crucial for ensuring autonomous task completion. To address these challenges, we propose a novel safe guaranteed exploration framework using optimal control, which achieves first-of-its-kind result…
▽ More
Safely exploring environments with a-priori unknown constraints is a fundamental challenge that restricts the autonomy of robots. While safety is paramount, guarantees on sufficient exploration are also crucial for ensuring autonomous task completion. To address these challenges, we propose a novel safe guaranteed exploration framework using optimal control, which achieves first-of-its-kind results: guaranteed exploration for non-linear systems with finite time sample complexity bounds, while being provably safe with arbitrarily high probability. The framework is general and applicable to many real-world scenarios with complex non-linear dynamics and unknown domains. Based on this framework we propose an efficient algorithm, SageMPC, SAfe Guaranteed Exploration using Model Predictive Control. SageMPC improves efficiency by incorporating three techniques: i) exploiting a Lipschitz bound, ii) goal-directed exploration, and iii) receding horizon style re-planning, all while maintaining the desired sample complexity, safety and exploration guarantees of the framework. Lastly, we demonstrate safe efficient exploration in challenging unknown environments using SageMPC with a car model.
△ Less
Submitted 9 February, 2024;
originally announced February 2024.
-
Fast System Level Synthesis: Robust Model Predictive Control using Riccati Recursions
Authors:
Antoine P. Leeman,
Johannes Köhler,
Florian Messerer,
Amon Lahr,
Moritz Diehl,
Melanie N. Zeilinger
Abstract:
System Level Synthesis (SLS) enables improved robust MPC formulations by allowing for joint optimization of the nominal trajectory and controller. This paper introduces a tailored algorithm for solving the corresponding disturbance feedback optimization problem. The proposed algorithm builds on a recently proposed joint optimization scheme and iterates between optimizing the controller and the nom…
▽ More
System Level Synthesis (SLS) enables improved robust MPC formulations by allowing for joint optimization of the nominal trajectory and controller. This paper introduces a tailored algorithm for solving the corresponding disturbance feedback optimization problem. The proposed algorithm builds on a recently proposed joint optimization scheme and iterates between optimizing the controller and the nominal trajectory while converging q-linearly to an optimal solution. We show that the controller optimization can be solved through Riccati recursions leading to a horizon-length, state, and input scalability of $\mathcal{O}(N^2 ( n_x^3 + n_u ^3 ) )$ for each iterate. On a numerical example, the proposed algorithm exhibits computational speedups of order $10$ to $10^3$ compared to general-purpose commercial solvers.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
Predictive stability filters for nonlinear dynamical systems affected by disturbances
Authors:
Alexandre Didier,
Andrea Zanelli,
Kim P. Wabersich,
Melanie N. Zeilinger
Abstract:
Predictive safety filters provide a way of projecting potentially unsafe inputs, proposed, e.g. by a human or learning-based controller, onto the set of inputs that guarantee recursive state and input constraint satisfaction by leveraging model predictive control techniques. In this paper, we extend this framework such that in addition, robust asymptotic stability of the closed-loop system can be…
▽ More
Predictive safety filters provide a way of projecting potentially unsafe inputs, proposed, e.g. by a human or learning-based controller, onto the set of inputs that guarantee recursive state and input constraint satisfaction by leveraging model predictive control techniques. In this paper, we extend this framework such that in addition, robust asymptotic stability of the closed-loop system can be guaranteed by enforcing a decrease of an implicit Lyapunov function which is constructed using a predicted system trajectory. Differently from previous results, we show robust asymptotic stability with respect to a predefined disturbance set on an extended state consisting of the system state and a warmstart input sequence. The proposed strategy is applied to an automotive lane kee** example in simulation.
△ Less
Submitted 29 April, 2024; v1 submitted 20 January, 2024;
originally announced January 2024.
-
MHE under parametric uncertainty -- Robust state estimation without informative data
Authors:
Simon Muntwiler,
Johannes Köhler,
Melanie N. Zeilinger
Abstract:
In this paper, we study state estimation for general nonlinear systems with unknown parameters and persistent process and measurement noise. In particular, we are interested in stability properties of the state estimate in the absence of persistency of excitation (PE). With a simple academic example, we show that existing moving horizon estimation (MHE) approaches as well as classical adaptive obs…
▽ More
In this paper, we study state estimation for general nonlinear systems with unknown parameters and persistent process and measurement noise. In particular, we are interested in stability properties of the state estimate in the absence of persistency of excitation (PE). With a simple academic example, we show that existing moving horizon estimation (MHE) approaches as well as classical adaptive observers can result in diverging state estimates in the absence of PE, even if the noise is small. We propose a novel MHE formulation involving a regularization based on a constant prior estimate of the unknown system parameters. Only assuming the existence of a stable estimator, we prove that the proposed MHE results in practically robustly stable state estimates even in the absence of PE. We discuss the relation of the proposed MHE formulation to state-of-the-art results from MHE, adaptive estimation, and functional estimation. The properties of the proposed MHE approach are illustrated with a numerical example of a car with unknown tire friction parameters.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Nonlinear Functional Estimation: Functional Detectability and Full Information Estimation
Authors:
Simon Muntwiler,
Johannes Köhler,
Melanie N. Zeilinger
Abstract:
We consider the design of functional estimators, i.e., approaches to compute an estimate of a nonlinear function of the state of a general nonlinear dynamical system subject to process noise based on noisy output measurements. To this end, we introduce a novel functional detectability notion in the form of incremental input/output-to-output stability ($δ$-IOOS). We show that $δ$-IOOS is a necessar…
▽ More
We consider the design of functional estimators, i.e., approaches to compute an estimate of a nonlinear function of the state of a general nonlinear dynamical system subject to process noise based on noisy output measurements. To this end, we introduce a novel functional detectability notion in the form of incremental input/output-to-output stability ($δ$-IOOS). We show that $δ$-IOOS is a necessary condition for the existence of a functional estimator satisfying an input-to-output type stability property. Additionally, we prove that a system is functional detectable if and only if it admits a corresponding $δ$-IOOS Lyapunov function. Furthermore, $δ$-IOOS is shown to be a sufficient condition for the design of a stable functional estimator by introducing the design of a full information estimation (FIE) approach for functional estimation. Together, we present a unified framework to study functional estimation with a detectability condition, which is necessary and sufficient for the existence of a stable functional estimator, and a corresponding functional estimator design. The practical need for and applicability of the proposed functional estimator design is illustrated with a numerical example of a power system.
△ Less
Submitted 3 May, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
Automatic nonlinear MPC approximation with closed-loop guarantees
Authors:
Abdullah Tokmak,
Christian Fiedler,
Melanie N. Zeilinger,
Sebastian Trimpe,
Johannes Köhler
Abstract:
Safety guarantees are vital in many control applications, such as robotics. Model predictive control (MPC) provides a constructive framework for controlling safety-critical systems, but is limited by its computational complexity. We address this problem by presenting a novel algorithm that automatically computes an explicit approximation to nonlinear MPC schemes while retaining closed-loop guarant…
▽ More
Safety guarantees are vital in many control applications, such as robotics. Model predictive control (MPC) provides a constructive framework for controlling safety-critical systems, but is limited by its computational complexity. We address this problem by presenting a novel algorithm that automatically computes an explicit approximation to nonlinear MPC schemes while retaining closed-loop guarantees. Specifically, the problem can be reduced to a function approximation problem, which we then tackle by proposing ALKIA-X, the Adaptive and Localized Kernel Interpolation Algorithm with eXtrapolated reproducing kernel Hilbert space norm. ALKIA-X is a non-iterative algorithm that ensures numerically well-conditioned computations, a fast-to-evaluate approximating function, and the guaranteed satisfaction of any desired bound on the approximation error. Hence, ALKIA-X automatically computes an explicit function that approximates the MPC, yielding a controller suitable for safety-critical systems and high sampling rates. We apply ALKIA-X to approximate two nonlinear MPC schemes, demonstrating reduced computational demand and applicability to realistic problems.
△ Less
Submitted 11 April, 2024; v1 submitted 15 December, 2023;
originally announced December 2023.
-
Inverse Optimal Control as an Errors-in-Variables Problem
Authors:
Rahel Rickenbach,
Anna Scampicchio,
Melanie N. Zeilinger
Abstract:
Inverse optimal control (IOC) is about estimating an unknown objective of interest given its optimal control sequence. However, truly optimal demonstrations are often difficult to obtain, e.g., due to human errors or inaccurate measurements. This paper presents an IOC framework for objective estimation from multiple sub-optimal demonstrations in constrained environments. It builds upon the Karush-…
▽ More
Inverse optimal control (IOC) is about estimating an unknown objective of interest given its optimal control sequence. However, truly optimal demonstrations are often difficult to obtain, e.g., due to human errors or inaccurate measurements. This paper presents an IOC framework for objective estimation from multiple sub-optimal demonstrations in constrained environments. It builds upon the Karush-Kuhn-Tucker optimality conditions, and addresses the Errors-In-Variables problem that emerges from the use of sub-optimal data. The approach presented is applied to various systems in simulation, and consistency guarantees are provided for linear systems with zero mean additive noise, polytopic constraints, and objectives with quadratic features.
△ Less
Submitted 6 December, 2023;
originally announced December 2023.
-
A Stiffness-Oriented Model Order Reduction Method for Low-Inertia Power Systems
Authors:
Simon Muntwiler,
Ognjen Stanojev,
Andrea Zanelli,
Gabriela Hug,
Melanie N. Zeilinger
Abstract:
This paper presents a novel model order reduction technique tailored for power systems with a large share of inverter-based energy resources. Such systems exhibit an increased level of dynamic stiffness compared to traditional power systems, posing challenges for time-domain simulations and control design. Our approach involves rotation of the coordinate system of a linearized system using a trans…
▽ More
This paper presents a novel model order reduction technique tailored for power systems with a large share of inverter-based energy resources. Such systems exhibit an increased level of dynamic stiffness compared to traditional power systems, posing challenges for time-domain simulations and control design. Our approach involves rotation of the coordinate system of a linearized system using a transformation matrix derived from the real Jordan canonical form, leading to mode decoupling. The fast modes are then truncated in the rotated coordinate system to obtain a lower-order model with reduced stiffness. Applying the same transformation to the original nonlinear system results in an approximate separation of slow and fast states, which can be truncated to reduce the stiffness. The resulting reduced-order model demonstrates an accurate time-domain performance, the slow eigenvalues of the linearized system are correctly preserved, and a reduction in the model stiffness is achieved, allowing for accurate integration with increased step size. Our methodology is assessed in detail for a 3-bus system with generation units involving grid-forming/following converters and synchronous machines, where it allows for a computational speed-up of up to 100x compared to the original system. Several standard larger test systems are also considered.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
Homothetic tube model predictive control with multi-step predictors
Authors:
Danilo Saccani,
Giancarlo Ferrari-Trecate,
Melanie N. Zeilinger,
Johannes Köhler
Abstract:
We present a robust model predictive control (MPC) framework for linear systems facing bounded parametric uncertainty and bounded disturbances. Our approach deviates from standard MPC formulations by integrating multi-step predictors, which provide reduced error bounds. These bounds, derived from multi-step predictors, are utilized in a homothetic tube formulation to mitigate conservatism. Lastly,…
▽ More
We present a robust model predictive control (MPC) framework for linear systems facing bounded parametric uncertainty and bounded disturbances. Our approach deviates from standard MPC formulations by integrating multi-step predictors, which provide reduced error bounds. These bounds, derived from multi-step predictors, are utilized in a homothetic tube formulation to mitigate conservatism. Lastly, a multi-rate formulation is adopted to handle the incompatibilities of multi-step predictors. We provide a theoretical analysis, guaranteeing robust recursive feasibility, constraint satisfaction, and (practical) stability of the desired setpoint. We use a simulation example to compare it to existing literature and demonstrate advantages in terms of conservatism and computational complexity.
△ Less
Submitted 20 November, 2023; v1 submitted 12 September, 2023;
originally announced September 2023.
-
Robust Nonlinear Reduced-Order Model Predictive Control
Authors:
John Irvin Alora,
Luis A. Pabon,
Johannes Köhler,
Mattia Cenedese,
Ed Schmerling,
Melanie N. Zeilinger,
George Haller,
Marco Pavone
Abstract:
Real-world systems are often characterized by high-dimensional nonlinear dynamics, making them challenging to control in real time. While reduced-order models (ROMs) are frequently employed in model-based control schemes, dimensionality reduction introduces model uncertainty which can potentially compromise the stability and safety of the original high-dimensional system. In this work, we propose…
▽ More
Real-world systems are often characterized by high-dimensional nonlinear dynamics, making them challenging to control in real time. While reduced-order models (ROMs) are frequently employed in model-based control schemes, dimensionality reduction introduces model uncertainty which can potentially compromise the stability and safety of the original high-dimensional system. In this work, we propose a novel reduced-order model predictive control (ROMPC) scheme to solve constrained optimal control problems for nonlinear, high-dimensional systems. To address the challenges of using ROMs in predictive control schemes, we derive an error bounding system that dynamically accounts for model reduction error. Using these bounds, we design a robust MPC scheme that ensures robust constraint satisfaction, recursive feasibility, and asymptotic stability. We demonstrate the effectiveness of our proposed method in simulations on a high-dimensional soft robot with nearly 10,000 states.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
Physics-Informed Machine Learning for Modeling and Control of Dynamical Systems
Authors:
Truong X. Nghiem,
Ján Drgoňa,
Colin Jones,
Zoltan Nagy,
Roland Schwan,
Biswadip Dey,
Ankush Chakrabarty,
Stefano Di Cairano,
Joel A. Paulson,
Andrea Carron,
Melanie N. Zeilinger,
Wenceslao Shaw Cortez,
Draguna L. Vrabie
Abstract:
Physics-informed machine learning (PIML) is a set of methods and tools that systematically integrate machine learning (ML) algorithms with physical constraints and abstract mathematical models developed in scientific and engineering domains. As opposed to purely data-driven methods, PIML models can be trained from additional information obtained by enforcing physical laws such as energy and mass c…
▽ More
Physics-informed machine learning (PIML) is a set of methods and tools that systematically integrate machine learning (ML) algorithms with physical constraints and abstract mathematical models developed in scientific and engineering domains. As opposed to purely data-driven methods, PIML models can be trained from additional information obtained by enforcing physical laws such as energy and mass conservation. More broadly, PIML models can include abstract properties and conditions such as stability, convexity, or invariance. The basic premise of PIML is that the integration of ML and physics can yield more effective, physically consistent, and data-efficient models. This paper aims to provide a tutorial-like overview of the recent advances in PIML for dynamical system modeling and control. Specifically, the paper covers an overview of the theory, fundamental concepts and methods, tools, and applications on topics of: 1) physics-informed learning for system identification; 2) physics-informed learning for control; 3) analysis and verification of PIML models; and 4) physics-informed digital twins. The paper is concluded with a perspective on open challenges and future research opportunities.
△ Less
Submitted 24 June, 2023;
originally announced June 2023.
-
Time Dependent Inverse Optimal Control using Trigonometric Basis Functions
Authors:
Rahel Rickenbach,
Elena Arcari,
Melanie N. Zeilinger
Abstract:
The choice of objective is critical for the performance of an optimal controller. When control requirements vary during operation, e.g. due to changes in the environment with which the system is interacting, these variations should be reflected in the cost function. In this paper we consider the problem of identifying a time dependent cost function from given trajectories. We propose a strategy fo…
▽ More
The choice of objective is critical for the performance of an optimal controller. When control requirements vary during operation, e.g. due to changes in the environment with which the system is interacting, these variations should be reflected in the cost function. In this paper we consider the problem of identifying a time dependent cost function from given trajectories. We propose a strategy for explicitly representing time dependency in the cost function, i.e. decomposing it into the product of an unknown time dependent parameter vector and a known state and input dependent vector, modelling the former via a linear combination of trigonometric basis functions. These are incorporated within an inverse optimal control framework that uses the Karush-Kuhn-Tucker (KKT) conditions for ensuring optimality, and allows for formulating an optimization problem with respect to a finite set of basis function hyperparameters. Results are shown for two systems in simulation and evaluated against state-of-the-art approaches.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
Approximate non-linear model predictive control with safety-augmented neural networks
Authors:
Henrik Hose,
Johannes Köhler,
Melanie N. Zeilinger,
Sebastian Trimpe
Abstract:
Model predictive control (MPC) achieves stability and constraint satisfaction for general nonlinear systems, but requires computationally expensive online optimization. This paper studies approximations of such MPC controllers via neural networks (NNs) to achieve fast online evaluation. We propose safety augmentation that yields deterministic guarantees for convergence and constraint satisfaction…
▽ More
Model predictive control (MPC) achieves stability and constraint satisfaction for general nonlinear systems, but requires computationally expensive online optimization. This paper studies approximations of such MPC controllers via neural networks (NNs) to achieve fast online evaluation. We propose safety augmentation that yields deterministic guarantees for convergence and constraint satisfaction despite approximation inaccuracies. We approximate the entire input sequence of the MPC with NNs, which allows us to verify online if it is a feasible solution to the MPC problem. We replace the NN solution by a safe candidate based on standard MPC techniques whenever it is infeasible or has worse cost. Our method requires a single evaluation of the NN and forward integration of the input sequence online, which is fast to compute on resource-constrained systems. The proposed control framework is illustrated on three non-linear MPC benchmarks of different complexity, demonstrating computational speedups orders of magnitudes higher than online optimization. In the examples, we achieve deterministic safety through the safety-augmented NNs, where naive NN implementation fails.
△ Less
Submitted 19 April, 2023;
originally announced April 2023.
-
Model Predictive Control for Multi-Agent Systems under Limited Communication and Time-Varying Network Topology
Authors:
Danilo Saccani,
Lorenzo Fagiano,
Melanie N. Zeilinger,
Andrea Carron
Abstract:
In control system networks, reconfiguration of the controller when agents are leaving or joining the network is still an open challenge, in particular when operation constraints that depend on each agent's behavior must be met. Drawing our motivation from mobile robot swarms, in this paper, we address this problem by optimizing individual agent performance while guaranteeing persistent constraint…
▽ More
In control system networks, reconfiguration of the controller when agents are leaving or joining the network is still an open challenge, in particular when operation constraints that depend on each agent's behavior must be met. Drawing our motivation from mobile robot swarms, in this paper, we address this problem by optimizing individual agent performance while guaranteeing persistent constraint satisfaction in presence of bounded communication range and time-varying network topology. The approach we propose is a model predictive control (MPC) formulation, building on multi-trajectory MPC (mt-MPC) concepts. To enable plug and play operations when the system is in closed-loop without the need of a request, the proposed MPC scheme predicts two different state trajectories in the same finite horizon optimal control problem. One trajectory drives the system to the desired target, assuming that the network topology will not change in the prediction horizon, while the second one ensures constraint satisfaction assuming a worst-case scenario in terms of new agents joining the network in the planning horizon. Recursive feasibility and stability of the closed-loop system during plug and play operations are shown. The approach effectiveness is illustrated with a numerical simulation.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Robust Optimal Control for Nonlinear Systems with Parametric Uncertainties via System Level Synthesis
Authors:
Antoine P. Leeman,
Jerome Sieber,
Samir Bennani,
Melanie N. Zeilinger
Abstract:
This paper addresses the problem of optimally controlling nonlinear systems with norm-bounded disturbances and parametric uncertainties while robustly satisfying constraints. The proposed approach jointly optimizes a nominal nonlinear trajectory and an error feedback, requiring minimal offline design effort and offering low conservatism. This is achieved by decomposing the affine-in-the-parameter…
▽ More
This paper addresses the problem of optimally controlling nonlinear systems with norm-bounded disturbances and parametric uncertainties while robustly satisfying constraints. The proposed approach jointly optimizes a nominal nonlinear trajectory and an error feedback, requiring minimal offline design effort and offering low conservatism. This is achieved by decomposing the affine-in-the-parameter uncertain nonlinear system into a nominal $\textit{nonlinear}$ system and an uncertain linear time-varying system. Using this decomposition, we can apply established tools from system level synthesis to $\textit{convexly}$ over-bound all uncertainties in the nonlinear optimization problem. Moreover, it enables tight joint optimization of the linearization error bounds, parametric uncertainties bounds, nonlinear trajectory, and error feedback. With this novel controller parameterization, we can formulate a convex constraint to ensure robust performance guarantees for the nonlinear system. The presented method is relevant for numerous applications related to trajectory optimization, e.g., in robotics and aerospace engineering. We demonstrate the performance of the approach and its low conservatism through the simulation example of a post-capture satellite stabilization.
△ Less
Submitted 13 September, 2023; v1 submitted 3 April, 2023;
originally announced April 2023.
-
On stochastic MPC formulations with closed-loop guarantees: Analysis and a unifying framework
Authors:
Johannes Köhler,
Ferdinand Geuss,
Melanie N. Zeilinger
Abstract:
We investigate model predictive control (MPC) formulations for linear systems subject to i.i.d. stochastic disturbances with bounded support and chance constraints. Existing stochastic MPC formulations with closed-loop guarantees can be broadly classified in two separate frameworks: i) using robust techniques; ii) feasibility preserving algorithms. We investigate two particular MPC formulations re…
▽ More
We investigate model predictive control (MPC) formulations for linear systems subject to i.i.d. stochastic disturbances with bounded support and chance constraints. Existing stochastic MPC formulations with closed-loop guarantees can be broadly classified in two separate frameworks: i) using robust techniques; ii) feasibility preserving algorithms. We investigate two particular MPC formulations representative for these two frameworks called robust-stochastic MPC and indirect feedback stochastic MPC. We provide a qualitative analysis, highlighting intrinsic limitations of both approaches in different edge cases. Then, we derive a unifying stochastic MPC framework that naturally includes these two formulations as limit cases. This qualitative analysis is complemented with numerical results, showcasing the advantages and limitations of each method.
△ Less
Submitted 7 August, 2023; v1 submitted 31 March, 2023;
originally announced April 2023.
-
Active Learning-based Model Predictive Coverage Control
Authors:
Rahel Rickenbach,
Johannes Köhler,
Anna Scampicchio,
Melanie N. Zeilinger,
Andrea Carron
Abstract:
The problem of coverage control, i.e., of coordinating multiple agents to optimally cover an area, arises in various applications. However, coverage applications face two major challenges: (1) dealing with nonlinear dynamics while respecting system and safety critical constraints, and (2) performing the task in an initially unknown environment. We solve the coverage problem by using a hierarchical…
▽ More
The problem of coverage control, i.e., of coordinating multiple agents to optimally cover an area, arises in various applications. However, coverage applications face two major challenges: (1) dealing with nonlinear dynamics while respecting system and safety critical constraints, and (2) performing the task in an initially unknown environment. We solve the coverage problem by using a hierarchical framework, in which references are calculated at a central server and passed to the agents' local model predictive control (MPC) tracking schemes. Furthermore, to ensure that the environment is actively explored by the agents a probabilistic exploration-exploitation trade-off is deployed. In addition, we derive a control framework that avoids the hierarchical structure by integrating the reference optimization in the MPC formulation. Active learning is then performed drawing inspiration from Upper Confidence Bound (UCB) approaches. For all developed control architectures, we guarantee closed-loop constraint satisfaction and convergence to an optimal configuration. Furthermore, all methods are tested and compared on hardware using a miniature car platform.
△ Less
Submitted 29 March, 2024; v1 submitted 17 March, 2023;
originally announced March 2023.
-
Robust Nonlinear Optimal Control via System Level Synthesis
Authors:
Antoine P. Leeman,
Johannes Köhler,
Andrea Zanelli,
Samir Bennani,
Melanie N. Zeilinger
Abstract:
This paper addresses the problem of finite horizon constrained robust optimal control for nonlinear systems subject to norm-bounded disturbances. To this end, the underlying uncertain nonlinear system is decomposed based on a first-order Taylor series expansion into a nominal system and an error (deviation) described as an uncertain linear time-varying system. This decomposition allows us to lever…
▽ More
This paper addresses the problem of finite horizon constrained robust optimal control for nonlinear systems subject to norm-bounded disturbances. To this end, the underlying uncertain nonlinear system is decomposed based on a first-order Taylor series expansion into a nominal system and an error (deviation) described as an uncertain linear time-varying system. This decomposition allows us to leverage System Level Synthesis to jointly optimize an affine error feedback, a nominal nonlinear trajectory, and, most importantly, a dynamic linearization error over-bound used to ensure robust constraint satisfaction for the nonlinear system. The proposed approach thereby results in less conservative planning compared with state-of-the-art techniques. We demonstrate the benefits of the proposed approach to control the rotational motion of a rigid body subject to state and input constraints.
△ Less
Submitted 14 February, 2024; v1 submitted 12 January, 2023;
originally announced January 2023.
-
Predictive safety filter using system level synthesis
Authors:
Antoine P. Leeman,
Johannes Köhler,
Samir Benanni,
Melanie N. Zeilinger
Abstract:
Safety filters provide modular techniques to augment potentially unsafe control inputs (e.g. from learning-based controllers or humans) with safety guarantees in the form of constraint satisfaction. In this paper, we present an improved model predictive safety filter (MPSF) formulation, which incorporates system level synthesis techniques in the design. The resulting SL-MPSF scheme ensures safety…
▽ More
Safety filters provide modular techniques to augment potentially unsafe control inputs (e.g. from learning-based controllers or humans) with safety guarantees in the form of constraint satisfaction. In this paper, we present an improved model predictive safety filter (MPSF) formulation, which incorporates system level synthesis techniques in the design. The resulting SL-MPSF scheme ensures safety for linear systems subject to bounded disturbances in an enlarged safe set. It requires less severe and frequent modifications of potentially unsafe control inputs compared to existing MPSF formulations to certify safety. In addition, we propose an explicit variant of the SL-MPSF formulation, which maintains scalability, and reduces the required online computational effort - the main drawback of the MPSF. The benefits of the proposed system level safety filter formulations compared to state-of-the-art MPSF formulations are demonstrated using a numerical example.
△ Less
Submitted 9 June, 2023; v1 submitted 5 December, 2022;
originally announced December 2022.
-
LQG for Constrained Linear Systems: Indirect Feedback Stochastic MPC with Kalman Filtering
Authors:
Simon Muntwiler,
Kim P. Wabersich,
Robert Miklos,
Melanie N. Zeilinger
Abstract:
We present an output feedback stochastic model predictive control (SMPC) approach for linear systems subject to Gaussian disturbances and measurement noise and probabilistic constraints on system states and inputs. The presented approach combines a linear Kalman filter for state estimation with an indirect feedback SMPC, which is initialized with a predicted nominal state, while feedback of the cu…
▽ More
We present an output feedback stochastic model predictive control (SMPC) approach for linear systems subject to Gaussian disturbances and measurement noise and probabilistic constraints on system states and inputs. The presented approach combines a linear Kalman filter for state estimation with an indirect feedback SMPC, which is initialized with a predicted nominal state, while feedback of the current state estimate enters through the objective of the SMPC problem. For this combination, we establish recursive feasibility of the SMPC problem due to the chosen initialization, and closed-loop chance constraint satisfaction thanks to an appropriate tightening of the constraints in the SMPC problem also considering the state estimation uncertainty. Additionally, we show that for specific design choices in the SMPC problem, the unconstrained linear-quadratic-Gaussian (LQG) solution is recovered if it is feasible for a given initial condition and the considered constraints. We demonstrate this fact for a numerical example, and show that the resulting output feedback controller can provide non-conservative constraint satisfaction.
△ Less
Submitted 17 November, 2023; v1 submitted 1 December, 2022;
originally announced December 2022.
-
Approximate Predictive Control Barrier Functions using Neural Networks: A Computationally Cheap and Permissive Safety Filter
Authors:
Alexandre Didier,
Robin C. Jacobs,
Jerome Sieber,
Kim P. Wabersich,
Melanie N. Zeilinger
Abstract:
A predictive control barrier function (PCBF) based safety filter is a modular framework to verify safety of a control input by predicting a future trajectory. The approach relies on the solution of two optimization problems, first computing the minimal state constraint violation given the current state in the form of slacks on the constraint, and then computing the minimal deviation from a propose…
▽ More
A predictive control barrier function (PCBF) based safety filter is a modular framework to verify safety of a control input by predicting a future trajectory. The approach relies on the solution of two optimization problems, first computing the minimal state constraint violation given the current state in the form of slacks on the constraint, and then computing the minimal deviation from a proposed input given the previously computed minimal slacks. This paper presents an approximation procedure that uses a neural network to approximate the optimal value function of the first optimization problem, which defines a control barrier function (CBF). By including this explicit approximation in a CBF-based safety filter formulation, the online computation becomes independent of the prediction horizon. It is shown that this approximation guarantees convergence to a neighborhood of the feasible set of the PCBF safety filter problem with zero constraint violation. The convergence result relies on a novel class $\mathcal{K}$ lower bound on the PCBF decrease and depends on the approximation error of the neural network. Lastly, we demonstrate our approach in simulation for an autonomous driving example and show that the proposed approximation leads to a significant decrease in computation time compared to the original approach.
△ Less
Submitted 24 July, 2023; v1 submitted 28 November, 2022;
originally announced November 2022.
-
Asynchronous Computation of Tube-based Model Predictive Control
Authors:
Jerome Sieber,
Andrea Zanelli,
Antoine P. Leeman,
Samir Bennani,
Melanie N. Zeilinger
Abstract:
Tube-based model predictive control (MPC) methods bound deviations from a nominal trajectory due to uncertainties in order to ensure constraint satisfaction. While techniques that compute the tubes online reduce conservativeness and increase performance, they suffer from high and potentially prohibitive computational complexity. This paper presents an asynchronous computation mechanism for system…
▽ More
Tube-based model predictive control (MPC) methods bound deviations from a nominal trajectory due to uncertainties in order to ensure constraint satisfaction. While techniques that compute the tubes online reduce conservativeness and increase performance, they suffer from high and potentially prohibitive computational complexity. This paper presents an asynchronous computation mechanism for system level tube-MPC (SLTMPC), a recently proposed tube-based MPC method which optimizes over both the nominal trajectory and the tubes. Computations are split into a primary and a secondary process, computing the nominal trajectory and the tubes, respectively. This enables running the primary process at a high frequency and moving the computationally complex tube computations to the secondary process. We show that the secondary process can continuously update the tubes, while retaining recursive feasibility of the primary process.
△ Less
Submitted 20 April, 2023; v1 submitted 24 November, 2022;
originally announced November 2022.
-
Bayesian Multi-Task Learning MPC for Robotic Mobile Manipulation
Authors:
Elena Arcari,
Maria Vittoria Minniti,
Anna Scampicchio,
Andrea Carron,
Farbod Farshidian,
Marco Hutter,
Melanie N. Zeilinger
Abstract:
Mobile manipulation in robotics is challenging due to the need of solving many diverse tasks, such as opening a door or picking-and-placing an object. Typically, a basic first-principles system description of the robot is available, thus motivating the use of model-based controllers. However, the robot dynamics and its interaction with an object are affected by uncertainty, limiting the controller…
▽ More
Mobile manipulation in robotics is challenging due to the need of solving many diverse tasks, such as opening a door or picking-and-placing an object. Typically, a basic first-principles system description of the robot is available, thus motivating the use of model-based controllers. However, the robot dynamics and its interaction with an object are affected by uncertainty, limiting the controller's performance. To tackle this problem, we propose a Bayesian multi-task learning model that uses trigonometric basis functions to identify the error in the dynamics. In this way, data from different but related tasks can be leveraged to provide a descriptive error model that can be efficiently updated online for new, unseen tasks. We combine this learning scheme with a model predictive controller, and extensively test the effectiveness of the proposed approach, including comparisons with available baseline controllers. We present simulation tests with a ball-balancing robot, and door-opening hardware experiments with a quadrupedal manipulator.
△ Less
Submitted 21 March, 2023; v1 submitted 18 November, 2022;
originally announced November 2022.
-
Generalised Regret Optimal Controller Synthesis for Constrained Systems
Authors:
Alexandre Didier,
Melanie N. Zeilinger
Abstract:
This paper presents a synthesis method for the generalised dynamic regret problem, comparing the performance of a strictly causal controller to the optimal non-causal controller under a weighted disturbance. This framework encompasses both the dynamic regret problem, considering the difference of the incurred costs, as well as the competitive ratio, which considers their ratio, and which have both…
▽ More
This paper presents a synthesis method for the generalised dynamic regret problem, comparing the performance of a strictly causal controller to the optimal non-causal controller under a weighted disturbance. This framework encompasses both the dynamic regret problem, considering the difference of the incurred costs, as well as the competitive ratio, which considers their ratio, and which have both been proposed as inherently adaptive alternatives to classical control methods. Furthermore, we extend the synthesis to the case of pointwise-in-time bounds on the disturbance and show that the optimal solution is no worse than the bounded energy optimal solution and is lower bounded by a constant factor, which is only dependent on the disturbance weight. The proposed optimisation-based synthesis allows considering systems subject to state and input constraints. Finally, we provide a numerical example which compares the synthesised controller performance to $\mathcal{H}_2$- and $\mathcal{H}_\infty$-controllers.
△ Less
Submitted 24 July, 2023; v1 submitted 15 November, 2022;
originally announced November 2022.
-
Robust adaptive MPC using control contraction metrics
Authors:
András Sasfi,
Melanie N. Zeilinger,
Johannes Köhler
Abstract:
We present a robust adaptive model predictive control (MPC) framework for nonlinear continuous-time systems with bounded parametric uncertainty and additive disturbance. We utilize general control contraction metrics (CCMs) to parameterize a homothetic tube around a nominal prediction that contains all uncertain trajectories. Furthermore, we incorporate model adaptation using set-membership estima…
▽ More
We present a robust adaptive model predictive control (MPC) framework for nonlinear continuous-time systems with bounded parametric uncertainty and additive disturbance. We utilize general control contraction metrics (CCMs) to parameterize a homothetic tube around a nominal prediction that contains all uncertain trajectories. Furthermore, we incorporate model adaptation using set-membership estimation. As a result, the proposed MPC formulation is applicable to a large class of nonlinear systems, reduces conservatism during online operation, and guarantees robust constraint satisfaction and convergence to a neighborhood of the desired setpoint. One of the main technical contributions is the derivation of corresponding tube dynamics based on CCMs that account for the state and input dependent nature of the model mismatch. Furthermore, we online optimize over the nominal parameter, which enables general set-membership updates for the parametric uncertainty in the MPC. Benefits of the proposed homothetic tube MPC and online adaptation are demonstrated using a numerical example involving a planar quadrotor.
△ Less
Submitted 11 July, 2023; v1 submitted 23 September, 2022;
originally announced September 2022.
-
Globally stable and locally optimal model predictive control using a softened initial state constraint -- extended version
Authors:
Johannes Köhler,
Melanie N. Zeilinger
Abstract:
To address feasibility issues in model predictive control (MPC), most implementations relax hard state constraints using additional slack variables with a suitable penalty. We propose an alternative strategy for open-loop asymptotically/Lyapunov stable nonlinear systems by relaxing the initial state constraint with a suitable penalty. The proposed MPC framework is globally feasible, ensures (semi-…
▽ More
To address feasibility issues in model predictive control (MPC), most implementations relax hard state constraints using additional slack variables with a suitable penalty. We propose an alternative strategy for open-loop asymptotically/Lyapunov stable nonlinear systems by relaxing the initial state constraint with a suitable penalty. The proposed MPC framework is globally feasible, ensures (semi-)global asymptotic stability, and (approximately) recovers the closed-loop properties of the nominal MPC on the feasible set. The proposed framework can be naturally combined with a robust formulation to ensure robustness subject to bounded disturbances while retaining input-ot-state stability in case of arbitrarily large disturbances. We also show how the overall design can be simplified in case the nonlinear system is exponentially stable. In the special case of linear systems, the proposed MPC formulation reduces to a quadratic program and the offline design and online computational complexity is only marginally increased compared to anominal design. Benefits compared to classical soft contrained MPC formulations are demonstrated with numerical examples.
△ Less
Submitted 20 July, 2022;
originally announced July 2022.
-
Stochastic MPC with robustness to bounded parametric uncertainty
Authors:
Elena Arcari,
Andrea Iannelli,
Andrea Carron,
Melanie N. Zeilinger
Abstract:
The performance of model-based control techniques strongly depends on the quality of the employed dynamics model. If strong guarantees are desired, it is therefore common to robustly treat all possible sources of uncertainty, such as model inaccuracies or external disturbances. This, however, can result in overly conservative control strategies. In this paper, we present a stochastic model predict…
▽ More
The performance of model-based control techniques strongly depends on the quality of the employed dynamics model. If strong guarantees are desired, it is therefore common to robustly treat all possible sources of uncertainty, such as model inaccuracies or external disturbances. This, however, can result in overly conservative control strategies. In this paper, we present a stochastic model predictive control approach for discrete-time LTI systems subject to bounded parametric uncertainty and potentially unbounded stochastic additive noise. The proposed scheme makes use of homothetic tubes along the prediction horizon for a robust treatment of parametric uncertainty. Stochastic noise is handled by non-conservatively tightening constraints using the concept of probabilistic reachable sets (PRS). In order to accommodate all possible parametric uncertainties, we provide a strategy for generating "robustified" PRS based only on first and second moments of the noise sequence. In the case of quadratic cost functions, and under a further i.i.d. assumption on the noise distribution, we also provide an average asymptotic performance bound for the l2-norm of the closed-loop state. Finally, we demonstrate our scheme on both an illustrative example, and in a building temperature control problem.
△ Less
Submitted 20 May, 2022;
originally announced May 2022.
-
State space models vs. multi-step predictors in predictive control: Are state space models complicating safe data-driven designs?
Authors:
Johannes Köhler,
Kim P. Wabersich,
Julian Berberich,
Melanie N. Zeilinger
Abstract:
This paper contrasts recursive state space models and direct multi-step predictors for linear predictive control. We provide a tutorial exposition for both model structures to solve the following problems: 1. stochastic optimal control; 2. system identification; 3. stochastic optimal control based on the estimated model. Throughout the paper, we provide detailed discussions of the benefits and lim…
▽ More
This paper contrasts recursive state space models and direct multi-step predictors for linear predictive control. We provide a tutorial exposition for both model structures to solve the following problems: 1. stochastic optimal control; 2. system identification; 3. stochastic optimal control based on the estimated model. Throughout the paper, we provide detailed discussions of the benefits and limitations of these two model parametrizations for predictive control and highlight the relation to existing works. Additionally, we derive a novel (partially tight) constraint tightening for stochastic predictive control with parametric uncertainty in the multi-step predictor.
△ Less
Submitted 6 October, 2023; v1 submitted 29 March, 2022;
originally announced March 2022.
-
Recursively feasible stochastic predictive control using an interpolating initial state constraint -- extended version
Authors:
Johannes Köhler,
Melanie N. Zeilinger
Abstract:
We present a stochastic model predictive control (SMPC) framework for linear systems subject to possibly unbounded disturbances. State of the art SMPC approaches with closed-loop chance constraint satisfaction recursively initialize the nominal state based on the previously predicted nominal state or possibly the measured state under some case distinction. We improve these initialization strategie…
▽ More
We present a stochastic model predictive control (SMPC) framework for linear systems subject to possibly unbounded disturbances. State of the art SMPC approaches with closed-loop chance constraint satisfaction recursively initialize the nominal state based on the previously predicted nominal state or possibly the measured state under some case distinction. We improve these initialization strategies by allowing for a continuous optimization over the nominal initial state in an interpolation of these two extremes. The resulting SMPC scheme can be implemented as one standard quadratic program and is more flexible compared to state-of-the-art initialization strategies. As the main technical contribution, we show that the proposed SMPC framework also ensures closed-loop satisfaction of chance constraints and suitable performance bounds.
△ Less
Submitted 20 June, 2022; v1 submitted 2 March, 2022;
originally announced March 2022.
-
A System Level Approach to Regret Optimal Control
Authors:
Alexandre Didier,
Jerome Sieber,
Melanie N. Zeilinger
Abstract:
We present an optimisation-based method for synthesising a dynamic regret optimal controller for linear systems with potentially adversarial disturbances and known or adversarial initial conditions. The dynamic regret is defined as the difference between the true incurred cost of the system and the cost which could have optimally been achieved under any input sequence having full knowledge of all…
▽ More
We present an optimisation-based method for synthesising a dynamic regret optimal controller for linear systems with potentially adversarial disturbances and known or adversarial initial conditions. The dynamic regret is defined as the difference between the true incurred cost of the system and the cost which could have optimally been achieved under any input sequence having full knowledge of all future disturbances for a given disturbance energy. This problem formulation can be seen as an alternative to classical $\mathcal{H}_2$- or $\mathcal{H}_\infty$-control. The proposed controller synthesis is based on the system level parametrisation, which allows reformulating the dynamic regret problem as a semi-definite problem. This yields a new framework that allows to consider structured dynamic regret problems, which have not yet been considered in the literature. For known pointwise ellipsoidal bounds on the disturbance, we show that the dynamic regret bound can be improved compared to using only a bounded energy assumption and that the optimal dynamic regret bound differs by at most a factor of $\frac{2}π$ from the computed solution. Furthermore, the proposed framework allows guaranteeing state and input constraint satisfaction.
△ Less
Submitted 30 May, 2022; v1 submitted 28 February, 2022;
originally announced February 2022.
-
A Lyapunov function for robust stability of moving horizon estimation
Authors:
Julian D. Schiller,
Simon Muntwiler,
Johannes Köhler,
Melanie N. Zeilinger,
Matthias A. Müller
Abstract:
We provide a novel robust stability analysis for moving horizon estimation (MHE) using a Lyapunov function. Additionally, we introduce linear matrix inequalities (LMIs) to verify the necessary incremental input/output-to-state stability ($δ$-IOSS) detectability condition. We consider an MHE formulation with time-discounted quadratic objective for nonlinear systems admitting an exponential $δ$-IOSS…
▽ More
We provide a novel robust stability analysis for moving horizon estimation (MHE) using a Lyapunov function. Additionally, we introduce linear matrix inequalities (LMIs) to verify the necessary incremental input/output-to-state stability ($δ$-IOSS) detectability condition. We consider an MHE formulation with time-discounted quadratic objective for nonlinear systems admitting an exponential $δ$-IOSS Lyapunov function. We show that with a suitable parameterization of the MHE objective, the $δ$-IOSS Lyapunov function serves as an $M$-step Lyapunov function for MHE. Provided that the estimation horizon is chosen large enough, this directly implies exponential stability of MHE. The stability analysis is also applicable to full information estimation, where the restriction to exponential $δ$-IOSS can be relaxed. Moreover, we provide simple LMI conditions to systematically derive $δ$-IOSS Lyapunov functions, which allows us to easily verify $δ$-IOSS for a large class of nonlinear detectable systems. This is useful in the context of MHE in general, since most of the existing nonlinear (robust) stability results for MHE depend on the system being $δ$-IOSS (detectable). In combination, we thus provide a framework for designing MHE schemes with guaranteed robust exponential stability. The applicability of the proposed methods is demonstrated with a nonlinear chemical reactor process and a 12-state quadrotor model.
△ Less
Submitted 8 June, 2023; v1 submitted 25 February, 2022;
originally announced February 2022.
-
System Level Disturbance Reachable Sets and their Application to Tube-based MPC
Authors:
Jerome Sieber,
Andrea Zanelli,
Samir Bennani,
Melanie N. Zeilinger
Abstract:
Tube-based model predictive control (MPC) methods leverage tubes to bound deviations from a nominal trajectory due to uncertainties in order to ensure constraint satisfaction. This paper presents a novel tube-based MPC formulation based on system level disturbance reachable sets (SL-DRS), which leverage the affine system level parameterization (SLP). We show that imposing a finite impulse response…
▽ More
Tube-based model predictive control (MPC) methods leverage tubes to bound deviations from a nominal trajectory due to uncertainties in order to ensure constraint satisfaction. This paper presents a novel tube-based MPC formulation based on system level disturbance reachable sets (SL-DRS), which leverage the affine system level parameterization (SLP). We show that imposing a finite impulse response (FIR) constraint on the affine SLP guarantees containment of all future deviations in a finite sequence of SL-DRS. This allows us to formulate a system level tube-MPC (SLTMPC) method using the SL-DRS as tubes, which enables concurrent optimization of the nominal trajectory and the tubes, while using a positively invariant terminal set. Finally, we show that the SL-DRS tubes can also be computed offline.
△ Less
Submitted 5 November, 2021;
originally announced November 2021.
-
Stability and performance analysis of NMPC: Detectable stage costs and general terminal costs
Authors:
Johannes Köhler,
Melanie N. Zeilinger,
Lars Grüne
Abstract:
We provide a stability and performance analysis for nonlinear model predictive control (NMPC) schemes subject to input constraints. Given an exponential stabilizability and detectability condition w.r.t. the employed state cost, we provide a sufficiently long prediction horizon to ensure asymptotic stability and a desired performance bound w.r.t. the infinite-horizon optimal controller. Compared t…
▽ More
We provide a stability and performance analysis for nonlinear model predictive control (NMPC) schemes subject to input constraints. Given an exponential stabilizability and detectability condition w.r.t. the employed state cost, we provide a sufficiently long prediction horizon to ensure asymptotic stability and a desired performance bound w.r.t. the infinite-horizon optimal controller. Compared to existing results, the provided analysis is applicable to positive semi-definite (detectable) cost functions, provides tight bounds using a linear programming analysis, and allows for a seamless integration of general positive-definite terminal cost functions in the analysis. The practical applicability of the derived theoretical results are demonstrated with numerical examples.
△ Less
Submitted 5 January, 2023; v1 submitted 21 October, 2021;
originally announced October 2021.
-
On-Policy Model Errors in Reinforcement Learning
Authors:
Lukas P. Fröhlich,
Maksym Lefarov,
Melanie N. Zeilinger,
Felix Berkenkamp
Abstract:
Model-free reinforcement learning algorithms can compute policy gradients given sampled environment transitions, but require large amounts of data. In contrast, model-based methods can use the learned model to generate new data, but model errors and bias can render learning unstable or suboptimal. In this paper, we present a novel method that combines real-world data and a learned model in order t…
▽ More
Model-free reinforcement learning algorithms can compute policy gradients given sampled environment transitions, but require large amounts of data. In contrast, model-based methods can use the learned model to generate new data, but model errors and bias can render learning unstable or suboptimal. In this paper, we present a novel method that combines real-world data and a learned model in order to get the best of both worlds. The core idea is to exploit the real-world data for on-policy predictions and use the learned model only to generalize to different actions. Specifically, we use the data as time-dependent on-policy correction terms on top of a learned model, to retain the ability to generate data without accumulating errors over long prediction horizons. We motivate this method theoretically and show that it counteracts an error term for model-based policy improvement. Experiments on MuJoCo- and PyBullet-benchmarks show that our method can drastically improve existing model-based approaches without introducing additional tuning parameters.
△ Less
Submitted 3 March, 2022; v1 submitted 15 October, 2021;
originally announced October 2021.
-
Model Learning and Contextual Controller Tuning for Autonomous Racing
Authors:
Lukas P. Fröhlich,
Christian Küttel,
Elena Arcari,
Lukas Hewing,
Melanie N. Zeilinger,
Andrea Carron
Abstract:
Model predictive control has been widely used in the field of autonomous racing and many data-driven approaches have been proposed to improve the closed-loop performance and to minimize lap time. However, it is often overlooked that a change in the environmental conditions, e.g., when it starts raining, it is not only required to adapt the predictive model but also the controller parameters need t…
▽ More
Model predictive control has been widely used in the field of autonomous racing and many data-driven approaches have been proposed to improve the closed-loop performance and to minimize lap time. However, it is often overlooked that a change in the environmental conditions, e.g., when it starts raining, it is not only required to adapt the predictive model but also the controller parameters need to be adjusted. In this paper, we address this challenge with the goal of requiring only few data. The key novelty of the proposed approach is that we leverage the learned dynamics model to encode the environmental condition as context. This insight allows us to employ contextual Bayesian optimization, thus accelerating the controller tuning problem when the environment changes and to transfer knowledge across different cars. The proposed framework is validated on an experimental platform with 1:28 scale RC race cars. We perform an extensive evaluation with more than 2'000 driven laps demonstrating that our approach successfully optimizes the lap time across different contexts faster compared to standard Bayesian optimization.
△ Less
Submitted 6 October, 2021;
originally announced October 2021.
-
Adaptive Model Predictive Safety Certification for Learning-based Control -- Extended Version
Authors:
Alexandre Didier,
Kim P. Wabersich,
Melanie N. Zeilinger
Abstract:
We propose an adaptive Model Predictive Safety Certification (MPSC) scheme for learning-based control of linear systems with bounded disturbances and uncertain parameters where the true parameters are contained within an a priori known set of parameters. An MPSC is a modular framework which can be used in combination with any learning-based controller to ensure state and input constraint satisfact…
▽ More
We propose an adaptive Model Predictive Safety Certification (MPSC) scheme for learning-based control of linear systems with bounded disturbances and uncertain parameters where the true parameters are contained within an a priori known set of parameters. An MPSC is a modular framework which can be used in combination with any learning-based controller to ensure state and input constraint satisfaction of a dynamical system by solving an online optimisation problem. By continuously connecting the current system state with a safe terminal set using a robust tube, safety can be ensured. Thereby, the main sources of conservative safety interventions are model uncertainties and short planning horizons. We develop an adaptive mechanism to improve the system model, which leverages set-membership estimation to guarantee recursively feasible and non-decreasing safety performance improvements. In order to accommodate short prediction horizons, iterative safe set enlargements using previously computed robust backup plans are proposed. Finally, we illustrate the increase of the safety performance through the parameter and safe set adaptation for numerical examples with up to 16 state dimensions.
△ Less
Submitted 29 September, 2021; v1 submitted 27 September, 2021;
originally announced September 2021.
-
Learning-based Moving Horizon Estimation through Differentiable Convex Optimization Layers
Authors:
Simon Muntwiler,
Kim P. Wabersich,
Melanie N. Zeilinger
Abstract:
To control a dynamical system it is essential to obtain an accurate estimate of the current system state based on uncertain sensor measurements and existing system knowledge. An optimization-based moving horizon estimation (MHE) approach uses a dynamical model of the system, and further allows for integration of physical constraints on system states and uncertainties, to obtain a trajectory of sta…
▽ More
To control a dynamical system it is essential to obtain an accurate estimate of the current system state based on uncertain sensor measurements and existing system knowledge. An optimization-based moving horizon estimation (MHE) approach uses a dynamical model of the system, and further allows for integration of physical constraints on system states and uncertainties, to obtain a trajectory of state estimates. In this work, we address the problem of state estimation in the case of constrained linear systems with parametric uncertainty. The proposed approach makes use of differentiable convex optimization layers to formulate an MHE state estimator for systems with uncertain parameters. This formulation allows us to obtain the gradient of a squared and regularized output error, based on sensor measurements and state estimates, with respect to the current belief of the unknown system parameters. The parameters within the MHE problem can then be updated online using stochastic gradient descent (SGD) to improve the performance of the MHE. In a numerical example of estimating temperatures of a group of manufacturing machines, we show the performance of tuning the unknown system parameters and the benefits of integrating physical state constraints in the MHE formulation.
△ Less
Submitted 2 May, 2022; v1 submitted 8 September, 2021;
originally announced September 2021.
-
Predictive control barrier functions: Enhanced safety mechanisms for learning-based control
Authors:
Kim P. Wabersich,
Melanie N. Zeilinger
Abstract:
While learning-based control techniques often outperform classical controller designs, safety requirements limit the acceptance of such methods in many applications. Recent developments address this issue through so-called predictive safety filters, which assess if a proposed learning-based control input can lead to constraint violations and modifies it if necessary to ensure safety for all future…
▽ More
While learning-based control techniques often outperform classical controller designs, safety requirements limit the acceptance of such methods in many applications. Recent developments address this issue through so-called predictive safety filters, which assess if a proposed learning-based control input can lead to constraint violations and modifies it if necessary to ensure safety for all future time steps. The theoretical guarantees of such predictive safety filters rely on the model assumptions and minor deviations can lead to failure of the filter putting the system at risk. This paper introduces an auxiliary soft-constrained predictive control problem that is always feasible at each time step and asymptotically stabilizes the feasible set of the original safety filter, thereby providing a recovery mechanism in safety-critical situations. This is achieved by a simple constraint tightening in combination with a terminal control barrier function. By extending discrete-time control barrier function theory, we establish that the proposed auxiliary problem provides a `predictive' control barrier function. The resulting algorithm is demonstrated using numerical examples.
△ Less
Submitted 13 May, 2022; v1 submitted 21 May, 2021;
originally announced May 2021.
-
A System Level Approach to Tube-based Model Predictive Control
Authors:
Jerome Sieber,
Samir Bennani,
Melanie N. Zeilinger
Abstract:
Robust tube-based model predictive control (MPC) methods address constraint satisfaction by leveraging an a priori determined tube controller in the prediction to tighten the constraints. This paper presents a system level tube-MPC (SLTMPC) method derived from the system level parameterization (SLP), which allows optimization over the tube controller online when solving the MPC problem, which can…
▽ More
Robust tube-based model predictive control (MPC) methods address constraint satisfaction by leveraging an a priori determined tube controller in the prediction to tighten the constraints. This paper presents a system level tube-MPC (SLTMPC) method derived from the system level parameterization (SLP), which allows optimization over the tube controller online when solving the MPC problem, which can significantly reduce conservativeness. We derive the SLTMPC method by establishing an equivalence relation between a class of robust MPC methods and the SLP. Finally, we show that the SLTMPC formulation naturally arises from an extended SLP formulation and show its merits in a numerical example.
△ Less
Submitted 30 April, 2021; v1 submitted 3 March, 2021;
originally announced March 2021.
-
A predictive safety filter for learning-based racing control
Authors:
Ben Tearle,
Kim P. Wabersich,
Andrea Carron,
Melanie N. Zeilinger
Abstract:
The growing need for high-performance controllers in safety-critical applications like autonomous driving has been motivating the development of formal safety verification techniques. In this paper, we design and implement a predictive safety filter that is able to maintain vehicle safety with respect to track boundaries when paired alongside any potentially unsafe control signal, such as those fo…
▽ More
The growing need for high-performance controllers in safety-critical applications like autonomous driving has been motivating the development of formal safety verification techniques. In this paper, we design and implement a predictive safety filter that is able to maintain vehicle safety with respect to track boundaries when paired alongside any potentially unsafe control signal, such as those found in learning-based methods. A model predictive control (MPC) framework is used to create a minimally invasive algorithm that certifies whether a desired control input is safe and can be applied to the vehicle, or that provides an alternate input to keep the vehicle in bounds. To this end, we provide a principled procedure to compute a safe and invariant set for nonlinear dynamic bicycle models using efficient convex approximation techniques. To fully support an aggressive racing performance without conservative safety interventions, the safe set is extended in real-time through predictive control backup trajectories. Applications for assisted manual driving and deep imitation learning on a miniature remote-controlled vehicle demonstrate the safety filter's ability to ensure vehicle safety during aggressive maneuvers.
△ Less
Submitted 23 February, 2021;
originally announced February 2021.
-
Volume Control of Low-Cost Ventilator with Automatic Set-Point Adaptation
Authors:
Lukas Hewing,
Marcel Menner,
Nikolaos Tachatos,
Marianne Schmid Daners,
Cosima du Pasquier,
Thomas S. Lumpe,
Kristina Shea,
Andrea Carron,
Melanie N. Zeilinger
Abstract:
This paper considers the control design for a low-cost ventilator that is based on a manual resuscitator bag (also known as AmbuBag) to pump air into the lungs of a patient who is physically unable to breathe. First, it experimentally shows that for accurately tracking tidal volumes, the controller needs to be adapted to the individual patient and the different configurations, e.g., hardware or op…
▽ More
This paper considers the control design for a low-cost ventilator that is based on a manual resuscitator bag (also known as AmbuBag) to pump air into the lungs of a patient who is physically unable to breathe. First, it experimentally shows that for accurately tracking tidal volumes, the controller needs to be adapted to the individual patient and the different configurations, e.g., hardware or operation modes. Second, it proposes a set-point adaptation algorithm that uses sensor measurements of a flow meter to automatically adapt the controller to the setup at hand. Third, it experimentally shows that such an adaptive solution improves the performance of the ventilator for various setups. One objective of this paper is to increase awareness of the need for feedback control using sensor measurements in low-cost ventilator solutions in order to automatically adapt to the specific scenario.
△ Less
Submitted 3 September, 2020;
originally announced September 2020.
-
Meta Learning MPC using Finite-Dimensional Gaussian Process Approximations
Authors:
Elena Arcari,
Andrea Carron,
Melanie N. Zeilinger
Abstract:
Data availability has dramatically increased in recent years, driving model-based control methods to exploit learning techniques for improving the system description, and thus control performance. Two key factors that hinder the practical applicability of learning methods in control are their high computational complexity and limited generalization capabilities to unseen conditions. Meta-learning…
▽ More
Data availability has dramatically increased in recent years, driving model-based control methods to exploit learning techniques for improving the system description, and thus control performance. Two key factors that hinder the practical applicability of learning methods in control are their high computational complexity and limited generalization capabilities to unseen conditions. Meta-learning is a powerful tool that enables efficient learning across a finite set of related tasks, easing adaptation to new unseen tasks. This paper makes use of a meta-learning approach for adaptive model predictive control, by learning a system model that leverages data from previous related tasks, while enabling fast fine-tuning to the current task during closed-loop operation. The dynamics is modeled via Gaussian process regression and, building on the Karhunen-Lo{è}ve expansion, can be approximately reformulated as a finite linear combination of kernel eigenfunctions. Using data collected over a set of tasks, the eigenfunction hyperparameters are optimized in a meta-training phase by maximizing a variational bound for the log-marginal likelihood. During meta-testing, the eigenfunctions are fixed, so that only the linear parameters are adapted to the new unseen task in an online adaptive fashion via Bayesian linear regression, providing a simple and efficient inference scheme. Simulation results are provided for autonomous racing with miniature race cars adapting to unseen road conditions.
△ Less
Submitted 21 November, 2022; v1 submitted 13 August, 2020;
originally announced August 2020.
-
Distributed Safe Learning using an Invariance-based Safety Framework
Authors:
Andrea Carron,
Jerome Sieber,
Melanie N. Zeilinger
Abstract:
In large-scale networks of uncertain dynamical systems, where communication is limited and there is a strong interaction among subsystems, learning local models and control policies offers great potential for designing high-performance controllers. At the same time, the lack of safety guarantees, here considered in the form of constraint satisfaction, prevents the use of data-driven techniques to…
▽ More
In large-scale networks of uncertain dynamical systems, where communication is limited and there is a strong interaction among subsystems, learning local models and control policies offers great potential for designing high-performance controllers. At the same time, the lack of safety guarantees, here considered in the form of constraint satisfaction, prevents the use of data-driven techniques to safety-critical distributed systems. This paper presents a safety framework that guarantees constraint satisfaction for uncertain distributed systems while learning. The framework considers linear systems with coupling in the dynamics and subject to bounded parametric uncertainty, and makes use of robust invariance to guarantee safety. In particular, a robust non-convex invariant set, given by the union of multiple ellipsoidal invariant sets, and a nonlinear backup control law, given by the combination of multiple stabilizing linear feedbacks, are computed offline. In presence of unsafe inputs, the safety framework applies the backup control law, preventing the system to violate the constraints. As the robust invariant set and the backup stabilizing controller are computed offline, the online operations reduce to simple function evaluations, which enables the use of the proposed framework on systems with limited computational resources. The capabilities of the safety framework are illustrated by three numerical examples.
△ Less
Submitted 1 July, 2020;
originally announced July 2020.