-
Configuration-Constrained Tube MPC for Tracking
Authors:
Filippo Badalamenti,
Sampath Kumar Mulagaleti,
Alberto Bemporad,
Boris Houska,
Mario Eduardo Villanueva
Abstract:
This paper proposes a novel tube-based Model Predictive Control (MPC) framework for tracking varying setpoint references with linear systems subject to additive and multiplicative uncertainties. The MPC controllers designed using this framework exhibit recursive feasibility for changing references, and robust asymptotic stability for piecewise constant references. The framework leverages configuration-constrained polytopes to parameterize the tubes, offering flexibility to optimize their shape. The efficacy of the approach is demonstrated through two numerical examples. The first example illustrates the theoretical results, and the second uses the framework to design a lane-change controller for an autonomous vehicle.
Submitted 6 May, 2024;
originally announced May 2024.
-
A Long-Short-Term Mixed-Integer Formulation for Highway Lane Change Planning
Authors:
Rudolf Reiter,
Armin Nurkanovic,
Daniele Bernadini,
Moritz Diehl,
Alberto Bemporad
Abstract:
This work considers the problem of optimal lane changing in a structured multi-agent road environment. A novel motion planning algorithm that can capture long-horizon dependencies as well as short-horizon dynamics is presented. Pivotal to our approach is a geometric approximation of the long-horizon combinatorial transition problem which we formulate in the continuous time-space domain. Moreover, a discrete-time formulation of a short-horizon optimal motion planning problem is formulated and combined with the long-horizon planner. Both individual problems, as well as their combination, are formulated as MIQP and solved in real-time by using state-of-the-art solvers. We show how the presented algorithm outperforms two other state-of-the-art motion planning algorithms in closed-loop performance and computation time in lane changing problems. Evaluations are performed using the traffic simulator SUMO, a custom low-level tracking model predictive controller, and high-fidelity vehicle models and scenarios, provided by the CommonRoad environment.
Submitted 5 May, 2024;
originally announced May 2024.
-
Linear and nonlinear system identification under $\ell_1$- and group-Lasso regularization via L-BFGS-B
Authors:
Alberto Bemporad
Abstract:
In this paper, we propose an approach for identifying linear and nonlinear discrete-time state-space models, possibly under $\ell_1$- and group-Lasso regularization, based on the L-BFGS-B algorithm. For the identification of linear models, we show that, compared to classical linear subspace methods, the approach often provides better results, is much more general in terms of the loss and regularization terms used, and is also more stable from a numerical point of view. The proposed method not only enriches the existing set of linear system identification tools but can be also applied to identifying a very broad class of parametric nonlinear state-space models, including recurrent neural networks. We illustrate the approach on synthetic and experimental datasets and apply it to solve the challenging industrial robot benchmark for nonlinear multi-input/multi-output system identification proposed by Weigand et al. (2022). A Python implementation of the proposed identification method is available in the package \texttt{jax-sysid}, available at \url{https://github.com/bemporad/jax-sysid}.
Submitted 6 March, 2024;
originally announced March 2024.
-
Learning disturbance models for offset-free reference tracking
Authors:
Pablo Krupa,
Mario Zanon,
Alberto Bemporad
Abstract:
This work presents a nonlinear MPC framework that guarantees asymptotic offset-free tracking of generic reference trajectories by learning a nonlinear disturbance model, which compensates for input disturbances and model-plant mismatch. Our approach generalizes the well-established method of using an observer to estimate a constant disturbance to allow tracking constant setpoints with zero steady-state error. In this paper, the disturbance model is generalized to a nonlinear static function of the plant's state and command input, learned online, so as to perfectly track time-varying reference trajectories under certain assumptions on the model and provided that future reference samples are available. We compare our approach with the classical constant disturbance model in numerical simulations, showing its superiority.
Submitted 18 December, 2023;
originally announced December 2023.
-
Harmonic model predictive control for tracking periodic references
Authors:
Pablo Krupa,
Daniel Limon,
Alberto Bemporad,
Teodoro Alamo
Abstract:
Harmonic model predictive control (HMPC) is a recent model predictive control (MPC) formulation for tracking piece-wise constant references that includes a parameterized artificial harmonic reference as a decision variable, resulting in an increased performance and domain of attraction with respect to other MPC formulations. This article presents an extension of the HMPC formulation to track periodic harmonic references and discusses its use to track arbitrary references. The proposed formulation inherits the benefits of its predecessor, namely its good performance and large domain of attraction when using small prediction horizons, and that the complexity of its optimization problem does not depend on the period of the periodic reference. We show closed-loop results discussing its performance and comparing it to other MPC formulations.
Submitted 25 October, 2023;
originally announced October 2023.
-
Data-Driven Synthesis of Configuration-Constrained Robust Invariant Sets for Linear Parameter-Varying Systems
Authors:
Manas Mejari,
Sampath Kumar Mulagaleti,
Alberto Bemporad
Abstract:
We present a data-driven method to synthesize robust control invariant (RCI) sets for linear parameter-varying (LPV) systems subject to unknown but bounded disturbances. A finite-length data set consisting of state, input, and scheduling signal measurements is used to compute an RCI set and invariance-inducing controller, without identifying an LPV model of the system. We parameterize the RCI set as a configuration-constrained polytope whose facets have a fixed orientation and variable offset. This allows us to define the vertices of the polytopic set in terms of its offset. By exploiting this property, an RCI set and associated vertex control inputs are computed by solving a single linear programming (LP) problem, formulated based on a data-based invariance condition and system constraints. We illustrate the effectiveness of our approach via two numerical examples. The proposed method can generate RCI sets that are of comparable size to those obtained by a model-based method in which exact knowledge of the system matrices is assumed. We show that RCI sets can be synthesized even with a relatively small number of data samples, if the gathered data satisfy certain excitation conditions.
Submitted 13 September, 2023;
originally announced September 2023.
-
Computation of safe disturbance sets using implicit RPI sets
Authors:
Sampath Kumar Mulagaleti,
Alberto Bemporad,
Mario Zanon
Abstract:
Given a stable linear time-invariant (LTI) system subject to output constraints, we present a method to compute a set of disturbances such that the reachable set of outputs matches as closely as possible the output constraint set, while being included in it. This problem finds application in several control design problems, such as the development of hierarchical control loops, decentralized control, supervisory control, robustness verification, etc. We first characterize the set of disturbance sets satisfying the output constraint inclusion using corresponding minimal robust positive invariant (mRPI) sets, following which we formulate an optimization problem that minimizes the distance between the reachable output set and the output constraint set. We tackle the optimization problem using an implicit RPI set approach that provides a priori approximation error guarantees, and adopt a novel disturbance set parameterization that permits the encoding of the set of feasible disturbance sets as a polyhedron. Through extensive numerical examples, we demonstrate that the proposed approach computes disturbance sets with reduced conservativeness and improved computational efficiency compared to state-of-the-art methods.
Submitted 12 September, 2023;
originally announced September 2023.
-
Parameter Dependent Robust Control Invariant Sets for LPV Systems with Bounded Parameter Variation Rate
Authors:
Sampath Kumar Mulagaleti,
Manas Mejari,
Alberto Bemporad
Abstract:
Real-time measurements of the scheduling parameter of linear parameter-varying (LPV) systems enable the synthesis of robust control invariant (RCI) sets and parameter-dependent controllers inducing invariance. We present a method to synthesize parameter-dependent robust control invariant (PD-RCI) sets for LPV systems with bounded parameter variation, in which invariance is induced using PD-vertex control laws. The PD-RCI sets are parameterized as configuration-constrained polytopes that admit a joint parameterization of their facets and vertices. The proposed sets and associated control laws are computed by solving a single semidefinite programming (SDP) problem. Through numerical examples, we demonstrate that the proposed method outperforms state-of-the-art methods for synthesizing PD-RCI sets, both with respect to conservativeness and computational load.
Submitted 5 September, 2023;
originally announced September 2023.
-
Certification of the proximal gradient method under fixed-point arithmetic for box-constrained QP problems
Authors:
Pablo Krupa,
Omar Inverso,
Mirco Tribastone,
Alberto Bemporad
Abstract:
In safety-critical applications that rely on the solution of an optimization problem, the certification of the optimization algorithm is of vital importance. Certification and suboptimality results are available for a wide range of optimization algorithms. However, a typical underlying assumption is that the operations performed by the algorithm are exact, i.e., that there is no numerical error during the mathematical operations, which is hardly a valid assumption in a real hardware implementation. This is particularly true in the case of fixed-point hardware, where computational inaccuracies are not uncommon. This article presents a certification procedure for the proximal gradient method for box-constrained QP problems implemented in fixed-point arithmetic. The procedure provides a method to select the minimal fractional precision required to obtain a certain suboptimality bound, indicating the maximum number of iterations of the optimization method required to obtain it. The procedure makes use of formal verification methods to provide arbitrarily tight bounds on the suboptimality guarantee. We apply the proposed certification procedure on the implementation of a non-trivial model predictive controller on 32-bit fixed-point hardware.
Submitted 5 December, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
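As a rough illustration of the method named in the entry above, the following Python sketch runs the proximal gradient iteration on a box-constrained QP in ordinary floating-point arithmetic. For a box constraint, the proximal step reduces to clipping onto the bounds. The function name `prox_grad_box_qp`, the fixed iteration count, and the step size 1/L (with L taken as the infinity norm of H, an upper bound on its largest eigenvalue) are assumptions made for this sketch; it does not model fixed-point arithmetic or the certification procedure described in the abstract.

```python
def prox_grad_box_qp(H, q, lb, ub, iters=200):
    """Minimize 0.5*x'Hx + q'x subject to lb <= x <= ub.

    Assumes H is symmetric, positive semidefinite, and nonzero
    (illustrative sketch only, plain floating-point arithmetic).
    """
    n = len(q)
    # The infinity norm of a symmetric matrix upper-bounds its largest
    # eigenvalue, so 1/L is a safe (possibly conservative) step size.
    L = max(sum(abs(v) for v in row) for row in H)
    # Start from the point of the box closest to the origin.
    x = [min(max(0.0, lb[i]), ub[i]) for i in range(n)]
    for _ in range(iters):
        # Gradient of the quadratic cost: H x + q.
        grad = [sum(H[i][j] * x[j] for j in range(n)) + q[i] for i in range(n)]
        # Gradient step followed by the proximal step (projection onto the box).
        x = [min(max(x[i] - grad[i] / L, lb[i]), ub[i]) for i in range(n)]
    return x


if __name__ == "__main__":
    # Small hypothetical problem: the unconstrained minimizer (1, 4)
    # lies outside the unit box in its second coordinate.
    print(prox_grad_box_qp([[2.0, 0.0], [0.0, 2.0]], [-2.0, -8.0],
                           [0.0, 0.0], [1.0, 1.0]))
```

Using the infinity norm keeps the sketch free of library dependencies; a tighter estimate of the largest eigenvalue would allow a larger step.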
-
Counter-example guided inductive synthesis of control Lyapunov functions for uncertain systems
Authors:
Daniele Masti,
Filippo Fabiani,
Giorgio Gnecco,
Alberto Bemporad
Abstract:
We propose a counter-example guided inductive synthesis (CEGIS) scheme for the design of control Lyapunov functions and associated state-feedback controllers for linear systems affected by parametric uncertainty with arbitrary shape. In the CEGIS framework, a learner iteratively proposes a candidate control Lyapunov function and a tailored controller by solving a linear matrix inequality (LMI) feasibility problem, while a verifier either falsifies the current candidate by producing a counter-example to be considered at the next iteration, or it certifies that the tentative control Lyapunov function actually enjoys such feature. We investigate the Lipschitz continuity of the objective function of the global optimization problem solved by the verifier, which is key to establish the convergence of our method in a finite number of iterations. Numerical simulations confirm the effectiveness of the proposed approach.
Submitted 22 May, 2023; v1 submitted 17 March, 2023;
originally announced March 2023.
-
An active learning method for solving competitive multi-agent decision-making and control problems
Authors:
Filippo Fabiani,
Alberto Bemporad
Abstract:
To identify a stationary action profile for a population of competitive agents, each executing private strategies, we introduce a novel active-learning scheme where a centralized external observer (or entity) can probe the agents' reactions and recursively update simple local parametric estimates of the action-reaction mappings. Under very general working assumptions (not even assuming that a stationary profile exists), sufficient conditions are established to assess the asymptotic properties of the proposed active learning methodology so that, if the parameters characterizing the action-reaction mappings converge, a stationary action profile is achieved. Such conditions hence act also as certificates for the existence of such a profile. Extensive numerical simulations involving typical competitive multi-agent control and decision-making problems illustrate the practical effectiveness of the proposed learning-based approach.
Submitted 3 April, 2024; v1 submitted 23 December, 2022;
originally announced December 2022.
-
Learning Critical Scenarios in Feedback Control Systems for Automated Driving
Authors:
Mengjia Zhu,
Alberto Bemporad,
Maximilian Kneissl,
Hasan Esen
Abstract:
Testing is essential for verifying and validating control designs, especially in safety-critical applications. In particular, the control system governing an automated driving vehicle must be proven reliable enough for its acceptance on the market. Recently, much research has focused on scenario-based methods. However, the number of possible driving scenarios to test is in principle infinite. In this paper, we formalize a learning-based optimization framework to generate corner test-cases, where we take into account the operational design domain. We examine the approach on the case of a feedback control system for automated driving, for which we suggest the design of the objective function expressing the criticality of scenarios. Numerical tests on two logical scenarios of the case study demonstrate that the approach can identify critical scenarios within a limited number of closed-loop experiments.
Submitted 8 September, 2023; v1 submitted 26 September, 2022;
originally announced September 2022.
-
A construction-free coordinate-descent augmented-Lagrangian method for embedded linear MPC based on ARX models
Authors:
Liang Wu,
Alberto Bemporad
Abstract:
This paper proposes a construction-free algorithm for solving linear MPC problems based on autoregressive with exogenous terms (ARX) input-output models. The solution algorithm relies on a coordinate-descent augmented Lagrangian (CDAL) method previously proposed by the authors, which we adapt here to exploit the special structure of ARX-based MPC. The CDAL-ARX algorithm is construction-free, in that it avoids explicitly constructing the quadratic programming (QP) problem associated with MPC, thereby eliminating the construction cost when the ARX model changes or adapts online. For example, the ARX model parameters may depend on linear parameter-varying (LPV) scheduling signals, or be recursively adapted from streaming input-output data at low computational cost, which is why ARX models are widely used in adaptive control. Moreover, the implementation of the resulting CDAL-ARX algorithm is matrix-free and library-free, and hence amenable to deployment in industrial embedded platforms. We show the efficiency of CDAL-ARX in two numerical examples, also in comparison with MPC implementations based on other general-purpose quadratic programming solvers.
Submitted 27 September, 2022; v1 submitted 13 July, 2022;
originally announced July 2022.
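For readers unfamiliar with the ARX models mentioned in the entry above, a minimal Python sketch of the input-output recursion may help. It assumes the sign convention y(t+1) = a[0] y(t) + a[1] y(t-1) + ... + b[0] u(t) + b[1] u(t-1) + ..., with zero initial conditions unless stated otherwise. The function `arx_simulate` and its arguments are hypothetical names for illustration and are not part of the CDAL-ARX solver.

```python
def arx_simulate(a, b, u, y_init=None):
    """Simulate an ARX model under the assumed convention
    y(t+1) = sum_i a[i]*y(t-i) + sum_j b[j]*u(t-j).

    a, b   : lists of autoregressive and input coefficients
    u      : input sequence u(0), u(1), ...
    y_init : past outputs, most recent first (defaults to zeros)
    Returns the outputs y(1), y(2), ..., one per input sample.
    """
    na, nb = len(a), len(b)
    y_past = list(y_init) if y_init is not None else [0.0] * na
    u_past = [0.0] * nb
    outputs = []
    for u_t in u:
        # Shift the newest input into the regressor (most recent first).
        u_past = ([u_t] + u_past)[:nb]
        y_next = (sum(ai * yi for ai, yi in zip(a, y_past))
                  + sum(bj * uj for bj, uj in zip(b, u_past)))
        # Shift the newest output into the regressor.
        y_past = ([y_next] + y_past)[:na]
        outputs.append(y_next)
    return outputs


if __name__ == "__main__":
    # First-order example with a unit step input.
    print(arx_simulate([0.5], [1.0], [1.0, 1.0, 1.0]))
```

Because the model is fully described by the two coefficient lists, changing the model online amounts to passing new values of `a` and `b`, which is the kind of model update the abstract refers to.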
-
Training Recurrent Neural Networks by Sequential Least Squares and the Alternating Direction Method of Multipliers
Authors:
Alberto Bemporad
Abstract:
This paper proposes a novel algorithm for training recurrent neural network models of nonlinear dynamical systems from an input/output training dataset. Arbitrary convex and twice-differentiable loss functions and regularization terms are handled by sequential least squares and either a line-search (LS) or a trust-region method of Levenberg-Marquardt (LM) type for ensuring convergence. In addition, to handle non-smooth regularization terms such as $\ell_1$, $\ell_0$, and group-Lasso regularizers, as well as to impose possibly non-convex constraints such as integer and mixed-integer constraints, we combine sequential least squares with the alternating direction method of multipliers (ADMM). We call the resulting algorithm NAILS (nonconvex ADMM iterations and least squares) in the case line search (LS) is used, or NAILM if a trust-region method (LM) is employed instead. The training method, which is also applicable to feedforward neural networks as a special case, is tested in three nonlinear system identification problems.
Submitted 15 October, 2022; v1 submitted 31 December, 2021;
originally announced December 2021.
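The non-smooth $\ell_1$ term mentioned in the entry above is commonly handled inside ADMM through its proximal operator, the soft-thresholding map. A minimal Python sketch follows; the function name `soft_threshold` is hypothetical, and the code shows only this standard operator, not the NAILS or NAILM algorithm itself.

```python
def soft_threshold(v, t):
    """Proximal operator of t*||.||_1 applied elementwise to the list v.

    Each entry is shrunk toward zero by t, and entries with magnitude
    at most t are set to zero (t is assumed to be non-negative).
    """
    return [max(vi - t, 0.0) - max(-vi - t, 0.0) for vi in v]


if __name__ == "__main__":
    # The middle entry has magnitude below the threshold and is zeroed out.
    print(soft_threshold([3.0, -0.5, -2.0], 1.0))
```

The zeroing of small entries is what makes $\ell_1$ regularization produce sparse parameter vectors.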
-
Data-driven synthesis of Robust Invariant Sets and Controllers
Authors:
Sampath Kumar Mulagaleti,
Alberto Bemporad,
Mario Zanon
Abstract:
This paper presents a method to identify an uncertain linear time-invariant (LTI) prediction model for tube-based Robust Model Predictive Control (RMPC). The uncertain model is determined from a given state-input dataset by formulating and solving a Semidefinite Programming problem (SDP), that also determines a static linear feedback gain and corresponding invariant sets satisfying the inclusions required to guarantee recursive feasibility and stability of the RMPC scheme, while minimizing an identification criterion. As demonstrated through an example, the proposed concurrent approach provides less conservative invariant sets than a sequential approach.
Submitted 18 November, 2021;
originally announced November 2021.
-
Computation of Input Disturbance Sets for Constrained Output Reachability
Authors:
Sampath Kumar Mulagaleti,
Alberto Bemporad,
Mario Zanon
Abstract:
Linear models with additive unknown-but-bounded input disturbances are extensively used to model uncertainty in robust control systems design. Typically, the disturbance set is either assumed to be known a priori or estimated from data through set-membership identification. However, the problem of computing a suitable input disturbance set in case the set of possible output values is assigned a priori has received relatively little attention. This problem arises in many contexts, such as in supervisory control, actuator design, decentralized control, and others. In this paper, we propose a method to compute input disturbance sets (and the corresponding set of states) such that the resulting set of outputs matches as closely as possible a given set of outputs, while additionally satisfying strict (inner or outer) inclusion constraints. We formulate the problem as an optimization problem by relying on the concept of robust invariance. The effectiveness of the approach is demonstrated in numerical examples that illustrate how to solve safe reference set and input-constraint set computation problems.
Submitted 4 November, 2021;
originally announced November 2021.
-
Recurrent Neural Network Training with Convex Loss and Regularization Functions by Extended Kalman Filtering
Authors:
Alberto Bemporad
Abstract:
This paper investigates the use of extended Kalman filtering to train recurrent neural networks with rather general convex loss functions and regularization terms on the network parameters, including $\ell_1$-regularization. We show that the learning method is competitive with respect to stochastic gradient descent in a nonlinear system identification benchmark and in training a linear system with binary outputs. We also explore the use of the algorithm in data-driven nonlinear model predictive control and its relation with disturbance models for offset-free closed-loop tracking.
Submitted 2 November, 2022; v1 submitted 4 November, 2021;
originally announced November 2021.
-
A Simple and Fast Coordinate-Descent Augmented-Lagrangian Solver for Model Predictive Control
Authors:
Liang Wu,
Alberto Bemporad
Abstract:
This paper proposes a novel Coordinate-Descent Augmented-Lagrangian (CDAL) solver for linear, possibly parameter-varying, model predictive control (MPC) problems. At each iteration, an augmented Lagrangian (AL) subproblem is solved by coordinate descent (CD), exploiting the structure of the MPC problem. The CDAL solver enjoys three main properties: (i) it is construction-free, in that it avoids explicitly constructing the quadratic programming (QP) problem associated with MPC; (ii) it is matrix-free, as it avoids multiplications and factorizations of matrices; and (iii) it is library-free, as it can be simply coded without any library dependency (90 lines of C code in our implementation). To favor convergence speed, CDAL employs a reverse cyclic rule for the CD method, Nesterov's accelerated scheme for updating the dual variables, a simple diagonal preconditioner, and an efficient coupling scheme between the CD and AL methods. We show that CDAL competes with other state-of-the-art methods, both in the case of unstable linear time-invariant and linear parameter-varying prediction models.
Submitted 27 September, 2022; v1 submitted 21 September, 2021;
originally announced September 2021.
-
C-GLISp: Preference-Based Global Optimization under Unknown Constraints with Applications to Controller Calibration
Authors:
Mengjia Zhu,
Dario Piga,
Alberto Bemporad
Abstract:
Preference-based global optimization algorithms minimize an unknown objective function only based on whether the function is better, worse, or similar for given pairs of candidate optimization vectors. Such optimization problems arise in many real-life examples, such as finding the optimal calibration of the parameters of a control law. The calibrator can judge whether a particular combination of parameters leads to a better, worse, or similar closed-loop performance. Often, the search for the optimal parameters is also subject to unknown constraints. For example, the vector of calibration parameters must not lead to closed-loop instability. This paper extends an active preference learning algorithm introduced recently by the authors to handle unknown constraints. The proposed method, called C-GLISp, looks for an optimizer of the problem only based on preferences expressed on pairs of candidate vectors, and on whether a given vector is reported feasible and/or satisfactory. C-GLISp learns a surrogate of the underlying objective function based on the expressed preferences, and a surrogate of the probability that a sample is feasible and/or satisfactory based on whether each of the tested vectors was judged as such. The surrogate functions are used iteratively to propose a new candidate vector to test and judge. Numerical benchmarks and a semi-automated control calibration task demonstrate the effectiveness of C-GLISp, showing that it can reach near-optimal solutions within a small number of iterations.
Submitted 18 December, 2021; v1 submitted 10 June, 2021;
originally announced June 2021.
-
Model Predictive Control with Environment Adaptation for Legged Locomotion
Authors:
Niraj Rathod,
Angelo Bratta,
Michele Focchi,
Mario Zanon,
Octavio Villarreal,
Claudio Semini,
Alberto Bemporad
Abstract:
Re-planning in legged locomotion is crucial to track the desired user velocity while adapting to the terrain and rejecting external disturbances. In this work, we propose and test in experiments a real-time Nonlinear Model Predictive Control (NMPC) tailored to a legged robot for achieving dynamic locomotion on a variety of terrains. We introduce a mobility-based criterion to define an NMPC cost that enhances the locomotion of quadruped robots while maximizing leg mobility and improves adaptation to the terrain features. Our NMPC is based on the real-time iteration scheme that allows us to re-plan online at $25\,\mathrm{Hz}$ with a prediction horizon of $2$ seconds. We use the single rigid body dynamic model defined in the center of mass frame in order to increase the computational efficiency. In simulations, the NMPC is tested to traverse a set of pallets of different sizes, to walk into a V-shaped chimney, and to locomote over rough terrain. In real experiments, we demonstrate the effectiveness of our NMPC with the mobility feature that allowed IIT's $87\, \mathrm{kg}$ quadruped robot HyQ to achieve an omni-directional walk on flat terrain, to traverse a static pallet, and to adapt to a repositioned pallet during a walk.
Submitted 25 November, 2021; v1 submitted 12 May, 2021;
originally announced May 2021.
-
A Dual Active-Set Solver for Embedded Quadratic Programming Using Recursive LDL' Updates
Authors:
Daniel Arnström,
Alberto Bemporad,
Daniel Axehill
Abstract:
In this paper we present a dual active-set solver for quadratic programming which has properties suitable for use in embedded model predictive control applications. In particular, the solver is efficient, can easily be warm-started, and is simple to code. Moreover, the exact worst-case computational complexity of the solver can be determined offline and, by using outer proximal-point iterations, ill-conditioned problems can be handled in a robust manner.
Submitted 12 October, 2021; v1 submitted 30 March, 2021;
originally announced March 2021.
-
A machine-learning approach to synthesize virtual sensors for parameter-varying systems
Authors:
Daniele Masti,
Daniele Bernardini,
Alberto Bemporad
Abstract:
This paper introduces a novel model-free approach to synthesize virtual sensors for the estimation of dynamical quantities that are unmeasurable at runtime but are available for design purposes on test benches. After collecting a dataset of measurements of such quantities, together with other variables that are also available during on-line operations, the virtual sensor is obtained using machine learning techniques by training a predictor whose inputs are the measured variables and the features extracted by a bank of linear observers fed with the same measures. The approach is applicable to infer the value of quantities such as physical states and other time-varying parameters that affect the dynamics of the system. The proposed virtual sensor architecture - whose structure can be related to the Multiple Model Adaptive Estimation framework - is conceived to keep computational and memory requirements as low as possible, so that it can be efficiently implemented in embedded hardware platforms.
The effectiveness of the approach is shown in different numerical examples, involving the estimation of the scheduling parameter of a nonlinear parameter-varying system, the reconstruction of the mode of a switching linear system, and the estimation of the state of charge (SoC) of a lithium-ion battery.
Submitted 23 March, 2021;
originally announced March 2021.
-
Exact and Heuristic Methods with Warm-start for Embedded Mixed-Integer Quadratic Programming Based on Accelerated Dual Gradient Projection
Authors:
Vihangkumar V. Naik,
Alberto Bemporad
Abstract:
Small-scale Mixed-Integer Quadratic Programming (MIQP) problems often arise in embedded control and estimation applications. Driven by the need for algorithmic simplicity to target computing platforms with limited memory and computing resources, this paper proposes a few approaches to solving MIQPs, either to optimality or suboptimally. We specialize an existing Accelerated Dual Gradient Projection (GPAD) algorithm to effectively solve the Quadratic Programming (QP) relaxations that arise during Branch and Bound (B&B) and propose a generic framework to warm-start the binary variables which reduces the number of QP relaxations. Moreover, in order to find an integer feasible combination of the binary variables upfront, two heuristic approaches are presented: ($i$) without using B&B, and ($ii$) using B&B with a significantly reduced number of QP relaxations. Both heuristic approaches return an integer feasible solution that may be suboptimal but involve a much reduced computational effort. Such a feasible solution can be either implemented directly or used to set an initial upper bound on the optimal cost in B&B. Through different hybrid control and estimation examples involving binary decision variables, we show that the performance of the proposed methods, although very simple to code, is comparable to that of state-of-the-art MIQP solvers.
Submitted 22 January, 2021;
originally announced January 2021.
-
Reduction of the Number of Variables in Parametric Constrained Least-Squares Problems
Authors:
Alberto Bemporad,
Gionata Cimini
Abstract:
For linearly constrained least-squares problems that depend on a vector of parameters, this paper proposes techniques for reducing the number of involved optimization variables. After first eliminating equality constraints in a numerically robust way by QR factorization, we propose a technique based on singular value decomposition (SVD) and unsupervised learning, that we call $K$-SVD, and neural classifiers to automatically partition the set of parameter vectors in $K$ nonlinear regions in which the original problem is approximated by using a smaller set of variables. For the special case of parametric constrained least-squares problems that arise from model predictive control (MPC) formulations, we propose a novel and very efficient QR factorization method for equality constraint elimination. Together with SVD or $K$-SVD, the method provides a numerically robust alternative to standard condensing and move blocking, and to other complexity reduction methods for MPC based on basis functions. We show the good performance of the proposed techniques in numerical tests and in a linearized MPC problem of a nonlinear benchmark process.
Submitted 18 December, 2020;
originally announced December 2020.
-
Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality?
Authors:
Sebastien Gros,
Mario Zanon,
Alberto Bemporad
Abstract:
For all its successes, Reinforcement Learning (RL) still struggles to deliver formal guarantees on the closed-loop behavior of the learned policy. Among other things, guaranteeing the safety of RL with respect to safety-critical systems is a very active research topic. Some recent contributions propose to rely on projections of the inputs delivered by the learned policy into a safe set, ensuring that the system safety is never jeopardized. Unfortunately, it is unclear whether this operation can be performed without disrupting the learning process. This paper addresses this issue. The problem is analysed in the context of $Q$-learning and policy gradient techniques. We show that the projection approach is generally disruptive in the context of $Q$-learning though a simple alternative solves the issue, while simple corrections can be used in the context of policy gradient methods in order to ensure that the policy gradients are unbiased. The proposed results extend to safe projections based on robust MPC techniques.
Submitted 2 April, 2020;
originally announced April 2020.
-
Constrained Controller and Observer Design by Inverse Optimality
Authors:
Mario Zanon,
Alberto Bemporad
Abstract:
Model Predictive Control (MPC) is often tuned by trial and error. When a baseline linear controller exists that is already well tuned in the absence of constraints and MPC is introduced to enforce them, one would like to avoid altering the original linear feedback law whenever they are not active. We formulate this problem as a controller matching similar to [1]-[3], which we extend to a more general framework. We prove that a positive-definite stage cost matrix yielding this matching property can be computed for all stabilizing linear controllers. Additionally, we prove that the constrained estimation problem can also be solved similarly, by matching a linear observer with a Moving Horizon Estimator (MHE). Finally, we discuss various aspects of the practical implementation of the proposed technique in some examples.
Submitted 29 October, 2021; v1 submitted 23 March, 2020;
originally announced March 2020.
-
Efficient Calibration of Embedded MPC
Authors:
Marco Forgione,
Dario Piga,
Alberto Bemporad
Abstract:
Model Predictive Control (MPC) is a powerful and flexible design tool of high-performance controllers for physical systems in the presence of input and output constraints. A challenge for the practitioner applying MPC is the need to tune a large number of parameters such as prediction and control horizons, weight matrices of the MPC cost function, and observer gains, according to different trade-offs. The MPC design task is even more involved when the control law has to be deployed to an embedded hardware unit endowed with limited computational resources. In this case, real-time system requirements limit the complexity of the applicable MPC configuration, engendering additional design trade-offs and requiring the tuning of further parameters, such as the sampling time and the tolerances used in the on-line numerical solver. To take into account closed-loop performance and real-time requirements, in this paper we tackle the embedded MPC design problem using a global, data-driven, optimization approach. We showcase the potential of this approach by tuning an MPC controller on two hardware platforms characterized by largely different computational capabilities.
Submitted 17 January, 2021; v1 submitted 29 November, 2019;
originally announced November 2019.
-
An efficient bounded-variable nonlinear least-squares algorithm for embedded MPC
Authors:
Nilay Saraf,
Alberto Bemporad
Abstract:
This paper presents a new approach to solve linear and nonlinear model predictive control (MPC) problems that requires a small memory footprint and throughput and is particularly suitable when the model and/or controller parameters change at runtime. Typically MPC requires two phases: 1) construct an optimization problem based on the given MPC parameters (prediction model, tuning weights, prediction horizon, and constraints), which results in a quadratic or nonlinear programming problem, and then 2) call an optimization algorithm to solve the resulting problem. In the proposed approach the problem construction step is systematically eliminated, as in the optimization algorithm the problem matrices are expressed in terms of abstract functions of the MPC parameters. We present a unifying algorithmic framework based on active-set methods with bounded variables that can cope with linear, nonlinear, and adaptive MPC variants based on a broad class of prediction models and a sum-of-squares cost function. The theoretical and numerical results demonstrate the potential, applicability, and efficiency of the proposed framework for practical real-time embedded MPC.
Submitted 24 March, 2021; v1 submitted 20 August, 2019;
originally announced August 2019.
-
Practical Reinforcement Learning of Stabilizing Economic MPC
Authors:
Mario Zanon,
Sébastien Gros,
Alberto Bemporad
Abstract:
Reinforcement Learning (RL) has demonstrated a huge potential in learning optimal policies without any prior knowledge of the process to be controlled. Model Predictive Control (MPC) is a popular control technique which is able to deal with nonlinear dynamics and state and input constraints. The main drawback of MPC is the need of identifying an accurate model, which in many cases cannot be easily obtained. Because of model inaccuracy, MPC can fail at delivering satisfactory closed-loop performance. Using RL to tune the MPC formulation or, conversely, using MPC as a function approximator in RL allows one to combine the advantages of the two techniques. This approach has important advantages, but it requires an adaptation of the existing algorithms. We therefore propose an improved RL algorithm for MPC and test it in simulations on a rather challenging example.
Submitted 9 April, 2019;
originally announced April 2019.
-
Fitting Jump Models
Authors:
A. Bemporad,
V. Breschi,
D. Piga,
S. Boyd
Abstract:
We describe a new framework for fitting jump models to a sequence of data. The key idea is to alternate between minimizing a loss function to fit multiple model parameters, and minimizing a discrete loss function to determine which set of model parameters is active at each data point. The framework is quite general and encompasses popular classes of models, such as hidden Markov models and piecewise affine models. The shape of the loss functions chosen for minimization determines the shape of the resulting jump model.
Submitted 21 May, 2018; v1 submitted 25 November, 2017;
originally announced November 2017.
-
Cloud-aided collaborative estimation by ADMM-RLS algorithms for connected vehicle prognostics
Authors:
Valentina Breschi,
Ilya Kolmanovsky,
Alberto Bemporad
Abstract:
As the connectivity of consumer devices is rapidly growing and cloud computing technologies are becoming more widespread, cloud-aided techniques for parameter estimation can be designed to exploit the theoretically unlimited storage memory and computational power of the cloud, while relying on information provided by multiple sources. With the ultimate goal of developing monitoring and diagnostic strategies, this report focuses on the design of a Recursive Least-Squares (RLS) based estimator for identification over a group of devices connected to the cloud. The proposed approach, which relies on Node-to-Cloud-to-Node (N2C2N) transmissions, is designed so that: (i) estimates of the unknown parameters are computed locally and (ii) the local estimates are refined on the cloud. The proposed approach requires minimal changes to local (pre-existing) RLS estimators.
Submitted 22 September, 2017;
originally announced September 2017.
-
Trajectory Planning Under Vehicle Dimension Constraints Using Sequential Linear Programming
Authors:
Mogens Graf Plessen,
Pedro F. Lima,
Jonas Martensson,
Alberto Bemporad,
Bo Wahlberg
Abstract:
This paper presents a spatial-based trajectory planning method for automated vehicles under actuator, obstacle avoidance, and vehicle dimension constraints. Starting from a nonlinear kinematic bicycle model, vehicle dynamics are transformed to a road-aligned coordinate frame with path along the road centerline replacing time as the dependent variable. Space-varying vehicle dimension constraints are linearized around a reference path to pose convex optimization problems. Such constraints do not require inflating obstacles by safety margins and therefore maximize performance in very constrained environments. A sequential linear programming (SLP) algorithm is motivated. A linear program (LP) is solved at each SLP iteration. The relation between LP formulation and maximum admissible traveling speeds within vehicle tire friction limits is discussed. The proposed method is evaluated in a roomy and in a tight maneuvering driving scenario, whereby a comparison to a semi-analytical clothoid-based path planner is given. Effectiveness is demonstrated particularly for very constrained environments, which require accounting for constraints and planning over the entire obstacle constellation space.
Submitted 21 July, 2017; v1 submitted 20 April, 2017;
originally announced April 2017.
-
A Convex Feasibility Approach to Anytime Model Predictive Control
Authors:
Alberto Bemporad,
Daniele Bernardini,
Panagiotis Patrinos
Abstract:
This paper proposes to decouple performance optimization and enforcement of asymptotic convergence in Model Predictive Control (MPC) so that convergence to a given terminal set is achieved independently of how much performance is optimized at each sampling step. By embedding an explicit decreasing condition in the MPC constraints and thanks to a novel and very easy-to-implement convex feasibility solver proposed in the paper, it is possible to run an outer performance optimization algorithm on top of the feasibility solver and optimize for an amount of time that depends on the available CPU resources within the current sampling step (possibly going open-loop at a given sampling step in the extreme case that no resources are available) and still guarantee convergence to the terminal set. While the MPC setup and the solver proposed in the paper can deal with quite general classes of functions, we highlight the synthesis method and show numerical results in the case of linear MPC and ellipsoidal and polyhedral terminal sets.
Submitted 27 February, 2015;
originally announced February 2015.