-
In-situ Controller Autotuning by Bayesian Optimization for Closed-loop Feedback Control of Laser Powder Bed Fusion Process
Authors:
Baris Kavas,
Efe C. Balta,
Michael R. Tucker,
Raamadaas Krishnadas,
Alisa Rupenyan,
John Lygeros,
Markus Bambach
Abstract:
Open-loop control of laser powder bed fusion (LPBF) additive manufacturing (AM) has enabled the production of complex, high-criticality parts for various industries. This method relies on static parameter sets from extensive experimentation and simulations, hoping they remain stable and defect-free in production. Closed-loop control of LPBF can further enhance process stability and reduce defects despite complex thermal histories, process noise, hardware drift, and unexpected perturbations. Controller performance depends on parameter tuning, traditionally a manual, expertise-driven process with no guarantee of optimal performance and limited transferability between systems. This study proposes Bayesian Optimization (BO) to automate in-layer controller tuning by leveraging LPBF's layer-to-layer repetitive nature. Two approaches are introduced: online tuning, adjusting parameters iteratively during the process, and offline tuning, conducted in a setup such as laser exposures on a bare metal plate. These methods are experimentally implemented on an in-layer PI controller, and the performance is investigated on two wedge geometries prone to overheating. Results show that BO effectively tunes controllers using either method, significantly reducing overheating in controlled wedge specimens compared to uncontrolled ones. This study presents the first printed parts controlled by an in-layer controller subjected to microstructural analysis. Findings reveal partial presence of lack-of-fusion porosities due to insufficient laser power assigned by the controller, highlighting a significant challenge for utilizing laser power controllers. In summary, BO presents a promising method for automatic in-layer controller tuning in LPBF, enhancing control precision and mitigating overheating in production parts.
Submitted 27 June, 2024;
originally announced June 2024.
-
Data-Driven Distributionally Robust System Level Synthesis
Authors:
Francesco Micheli,
Anastasios Tsiamis,
John Lygeros
Abstract:
We present a novel approach for the control of uncertain, linear time-invariant systems, which are perturbed by potentially unbounded, additive disturbances. We propose a \emph{doubly robust} data-driven state-feedback controller to ensure reliable performance against both model mismatch and disturbance distribution uncertainty. Our controller, which leverages the System Level Synthesis parameterization, is designed as the solution to a distributionally robust finite-horizon optimal control problem. The goal is to minimize a cost function while satisfying constraints against the worst-case realization of the uncertainty, which is quantified using distributional ambiguity sets. The latter are defined as balls in the Wasserstein metric centered on the predictive empirical distribution computed from a set of collected trajectory data. By harnessing techniques from robust control and distributionally robust optimization, we characterize the distributional shift between the predictive and the actual closed-loop distributions, and highlight its dependency on the model mismatch and the uncertainty about the disturbance distribution. We also provide bounds on the number of samples required to achieve a desired confidence level and propose a tractable approximate formulation for the doubly robust data-driven controller. To demonstrate the effectiveness of our approach, we present a numerical example showcasing the performance of the proposed algorithm.
Submitted 28 May, 2024;
originally announced May 2024.
-
Adaptive Learning-based Model Predictive Control for Uncertain Interconnected Systems: A Set Membership Identification Approach
Authors:
Ahmed Aboudonia,
John Lygeros
Abstract:
We propose a novel adaptive learning-based model predictive control (MPC) scheme for interconnected systems which can be decomposed into several smaller dynamically coupled subsystems with uncertain coupling. The proposed scheme is divided into two main online phases: a learning phase and an adaptation phase. Set membership identification is used in the learning phase to learn an uncertainty set that contains the coupling strength using online data. In the adaptation phase, rigid tube-based robust MPC is used to compute the optimal predicted states and inputs. Besides computing the optimal trajectories, the MPC ingredients are adapted in the adaptation phase taking the learnt uncertainty set into account. These MPC ingredients include the prestabilizing controller, the rigid tube, the tightened constraints and the terminal ingredients. The recursive feasibility of the proposed scheme as well as the stability of the corresponding closed-loop system are discussed. The developed scheme is compared in simulations to existing schemes including robust, adaptive and learning-based MPC.
Submitted 25 April, 2024;
originally announced April 2024.
-
MPC of Uncertain Nonlinear Systems with Meta-Learning for Fast Adaptation of Neural Predictive Models
Authors:
Jiaqi Yan,
Ankush Chakrabarty,
Alisa Rupenyan,
John Lygeros
Abstract:
In this paper, we consider the problem of reference tracking in uncertain nonlinear systems. A neural State-Space Model (NSSM) is used to approximate the nonlinear system, where a deep encoder network learns the nonlinearity from data, and a state-space component captures the temporal relationship. This transforms the nonlinear system into a linear system in a latent space, enabling the application of model predictive control (MPC) to determine effective control actions. Our objective is to design the optimal controller using limited data from the \textit{target system} (the system of interest). To this end, we employ an implicit model-agnostic meta-learning (iMAML) framework that leverages information from \textit{source systems} (systems that share similarities with the target system) to expedite training in the target system and enhance its control performance. The framework consists of two phases: the (offline) meta-training phase learns an aggregated NSSM using data from source systems, and the (online) meta-inference phase quickly adapts this aggregated model to the target system using only a few data points and few online training iterations, based on local loss function gradients. The iMAML algorithm exploits the implicit function theorem to exactly compute the gradient during training, without relying on the entire optimization path. By focusing solely on the optimal solution, rather than the path, we can meta-train with less storage complexity and fewer approximations than other contemporary meta-learning algorithms. We demonstrate through numerical examples that our proposed method can yield accurate predictive models by adaptation, resulting in a downstream MPC that outperforms several baselines.
Submitted 18 April, 2024;
originally announced April 2024.
-
On the Regret of Recursive Methods for Discrete-Time Adaptive Control with Matched Uncertainty
Authors:
Aren Karapetyan,
Efe C. Balta,
Anastasios Tsiamis,
Andrea Iannelli,
John Lygeros
Abstract:
Continuous-time adaptive controllers for systems with a matched uncertainty often comprise an online parameter estimator and a corresponding parameterized controller to cancel the uncertainty. However, such methods are often unimplementable, as they depend on an unobserved estimation error. We consider the equivalent discrete-time setting with a causal information structure. We propose a novel, online proximal point method-based adaptive controller, that under a weak persistence of excitation (PE) condition is asymptotically stable and achieves finite regret, scaling only with the time required to fulfill the PE condition. We show the same also for the widely-used recursive least squares with exponential forgetting controller under a stronger PE condition.
Submitted 2 April, 2024;
originally announced April 2024.
-
Finite Sample Frequency Domain Identification
Authors:
Anastasios Tsiamis,
Mohamed Abdalmoaty,
Roy S. Smith,
John Lygeros
Abstract:
We study non-parametric frequency-domain system identification from a finite-sample perspective. We assume an open loop scenario where the excitation input is periodic and consider the Empirical Transfer Function Estimate (ETFE), where the goal is to estimate the frequency response at certain desired (evenly-spaced) frequencies, given input-output samples. We show that under sub-Gaussian colored noise (in time-domain) and stability assumptions, the ETFE estimates are concentrated around the true values. The error rate is of the order of $\mathcal{O}((d_{\mathrm{u}}+\sqrt{d_{\mathrm{u}}d_{\mathrm{y}}})\sqrt{M/N_{\mathrm{tot}}})$, where $N_{\mathrm{tot}}$ is the total number of samples, $M$ is the number of desired frequencies, and $d_{\mathrm{u}},\,d_{\mathrm{y}}$ are the dimensions of the input and output signals respectively. This rate remains valid for general irrational transfer functions and does not require a finite order state-space representation. By tuning $M$, we obtain a $N_{\mathrm{tot}}^{-1/3}$ finite-sample rate for learning the frequency response over all frequencies in the $ \mathcal{H}_{\infty}$ norm. Our result draws upon an extension of the Hanson-Wright inequality to semi-infinite matrices. We study the finite-sample behavior of ETFE in simulations.
Submitted 1 April, 2024;
originally announced April 2024.
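The Empirical Transfer Function Estimate (ETFE) named in the abstract above can be illustrated with a minimal sketch. This is not the paper's code: the FIR test system `y[t] = 0.5 u[t] + 0.3 u[t-1]`, the input sequence, the noise-free data, and the helper names `dft` and `etfe` are all assumptions chosen for illustration. The sketch averages the data over input periods and divides the output DFT by the input DFT at each excited frequency bin.

```python
import cmath
import math


def dft(x):
    """Plain O(n^2) discrete Fourier transform of a sequence."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def etfe(u, y, period):
    """Hypothetical helper: ETFE on the grid of frequencies 2*pi*k/period.

    u, y are input/output samples covering a whole number of input periods.
    Returns {k: estimate} for the bins k where the input has energy.
    """
    n_periods = len(u) // period
    # Average over periods (this reduces noise when noise is present).
    u_avg = [sum(u[p * period + t] for p in range(n_periods)) / n_periods
             for t in range(period)]
    y_avg = [sum(y[p * period + t] for p in range(n_periods)) / n_periods
             for t in range(period)]
    U, Y = dft(u_avg), dft(y_avg)
    # Ratio of output to input spectrum, skipping unexcited bins.
    return {k: Y[k] / U[k] for k in range(period) if abs(U[k]) > 1e-9}


# Assumed example: periodic input, FIR system in periodic steady state.
period = 8
u_period = [1.0, 0.5, -1.0, 0.25, 0.0, -0.5, 1.0, -0.75]
u = u_period * 4
y = [0.5 * u[t] + 0.3 * u[(t - 1) % len(u)] for t in range(len(u))]

estimate = etfe(u, y, period)
for k in sorted(estimate):
    w = 2 * math.pi * k / period
    true_value = 0.5 + 0.3 * cmath.exp(-1j * w)
    print(k, abs(estimate[k] - true_value))
```

With noise-free data the estimate matches the true frequency response up to rounding; the finite-sample error rates quoted in the abstract concern the noisy case, which this sketch does not simulate.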
-
Adaptive Economic Model Predictive Control for linear systems with performance guarantees
Authors:
Maximilian Degner,
Raffaele Soloperto,
Melanie N. Zeilinger,
John Lygeros,
Johannes Köhler
Abstract:
We present a model predictive control (MPC) formulation to directly optimize economic criteria for linear constrained systems subject to disturbances and uncertain model parameters. The proposed formulation combines a certainty equivalent economic MPC with a simple least-squares parameter adaptation. For the resulting adaptive economic MPC scheme, we derive strong asymptotic and transient performance guarantees. We provide a numerical example involving building temperature control and demonstrate performance benefits of online parameter adaptation.
Submitted 27 March, 2024;
originally announced March 2024.
-
Guided Bayesian Optimization: Data-Efficient Controller Tuning with Digital Twin
Authors:
Mahdi Nobar,
Jürg Keller,
Alisa Rupenyan,
Mohammad Khosravi,
John Lygeros
Abstract:
This article presents the guided Bayesian optimization algorithm as an efficient data-driven method for iteratively tuning closed-loop controller parameters using an event-triggered digital twin of the system based on available closed-loop data. We define a controller tuning framework independent of the controller or the plant structure. Our proposed methodology is model-free, making it suitable for nonlinear and unmodelled plants with measurement noise. The objective function consists of performance metrics modeled by Gaussian processes. We utilize the available information in the closed-loop system to identify and progressively maintain a digital twin that guides the optimizer, improving the data efficiency of our method. Switching the digital twin on and off is triggered by data-driven criteria related to the digital twin's uncertainty estimations in the BO tuning framework. Effectively, it replaces much of the exploration of the real system with exploration performed on the digital twin. We analyze the properties of our method in simulation and demonstrate its performance on two real closed-loop systems with different plant and controller structures. The experimental results show that our method requires fewer experiments on the physical plant than Bayesian optimization to find the optimal controller parameters.
Submitted 25 March, 2024;
originally announced March 2024.
-
Data-Driven Extrusion Force Control Tuning for 3D Printing
Authors:
Xavier Guidetti,
Ankita Mukne,
Marvin Rueppel,
Yannick Nagel,
Efe C. Balta,
John Lygeros
Abstract:
The quality of 3D prints often varies due to different conditions inherent to each print, such as filament type, print speed, and nozzle size. Closed-loop process control methods improve the accuracy and repeatability of 3D prints. However, optimal tuning of controllers for given process parameters and design geometry is often a challenge with manually tuned controllers resulting in inconsistent and suboptimal results. This work employs Bayesian optimization to identify the optimal controller parameters. Additionally, we explore transfer learning in the context of 3D printing by leveraging prior information from past trials. By integrating optimized extrusion force control and transfer learning, we provide a novel framework for closed-loop 3D printing and propose an automated calibration routine that produces high-quality prints for a desired combination of print settings, material, and shape.
Submitted 25 March, 2024;
originally announced March 2024.
-
Force Controlled Printing for Material Extrusion Additive Manufacturing
Authors:
Xavier Guidetti,
Nathan Mingard,
Raul Cruz-Oliver,
Yannick Nagel,
Marvin Rueppel,
Alisa Rupenyan,
Efe C. Balta,
John Lygeros
Abstract:
In material extrusion additive manufacturing, the extrusion process is commonly controlled in a feed-forward fashion. The amount of material to be extruded at each printing location is pre-computed by a planning software. This approach is inherently unable to adapt the extrusion to external and unexpected disturbances, and the quality of the results strongly depends on a number of modeling and tuning parameters. To overcome these limitations, we propose the first framework for Force Controlled Printing for material extrusion additive manufacturing. We utilize a custom-built extruder to measure the extrusion force in real time, and use this quantity as feedback to continuously control the material flow in closed-loop. We demonstrate the existence of a strong correlation between extrusion force and line width, which we exploit to deposit lines of desired width in a width range of 33 % up to 233 % of the nozzle diameter. We also show how Force Controlled Printing outperforms conventional feed-forward extrusion in print quality and disturbance rejection, while requiring little tuning and automatically adapting to changes in the hardware settings. With no adaptation, Force Controlled Printing can deposit lines of desired width under severe disturbances in bed leveling, such as at layer heights ranging between 20 % and 200 % of the nominal height.
Submitted 24 March, 2024;
originally announced March 2024.
-
Data-Enabled Predictive Iterative Control
Authors:
Kai Zhang,
Riccardo Zuliani,
Efe C. Balta,
John Lygeros
Abstract:
This work introduces the Data-Enabled Predictive iteRative Control (DeePRC) algorithm, a direct data-driven approach for iterative LTI systems. The DeePRC learns from previous iterations to improve its performance and achieves the optimal cost. By utilizing a tube-based variation of the DeePRC scheme, we propose a two-stage approach that enables safe active exploration using a left-kernel-based input disturbance design. This method generates informative trajectories to enrich the historical data, which extends the maximum achievable prediction horizon and leads to faster iteration convergence. In addition, we present an end-to-end formulation of the two-stage approach, integrating the disturbance design procedure into the planning phase. We showcase the effectiveness of the proposed algorithms on a numerical experiment.
Submitted 30 May, 2024; v1 submitted 18 March, 2024;
originally announced March 2024.
-
Control Strategies for Recommendation Systems in Social Networks
Authors:
Ben Sprenger,
Giulia De Pasquale,
Raffaele Soloperto,
John Lygeros,
Florian Dörfler
Abstract:
A closed-loop control model to analyze the impact of recommendation systems on opinion dynamics within social networks is introduced. The core contribution is the development and formalization of model-free and model-based approaches to recommendation system design, integrating the dynamics of social interactions within networks via an extension of the Friedkin-Johnsen (FJ) model. Comparative analysis and numerical simulations demonstrate the effectiveness of the proposed control strategies in maximizing user engagement and their potential for influencing opinion formation processes.
Submitted 10 March, 2024;
originally announced March 2024.
-
Online Identification of Stochastic Continuous-Time Wiener Models Using Sampled Data
Authors:
Mohamed Abdalmoaty,
Efe C. Balta,
John Lygeros,
Roy S. Smith
Abstract:
It is well known that ignoring the presence of stochastic disturbances in the identification of stochastic Wiener models leads to asymptotically biased estimators. On the other hand, optimal statistical identification, via likelihood-based methods, is sensitive to the assumptions on the data distribution and is usually based on relatively complex sequential Monte Carlo algorithms. We develop a simple recursive online estimation algorithm based on an output-error predictor, for the identification of continuous-time stochastic parametric Wiener models through stochastic approximation. The method is applicable to generic model parameterizations and, as demonstrated in the numerical simulation examples, it is robust with respect to the assumptions on the spectrum of the disturbance process.
Submitted 9 March, 2024;
originally announced March 2024.
-
Closed-loop Performance Optimization of Model Predictive Control with Robustness Guarantees
Authors:
Riccardo Zuliani,
Efe C. Balta,
John Lygeros
Abstract:
Model mismatch and process noise are two frequently occurring phenomena that can drastically affect the performance of model predictive control (MPC) in practical applications. We propose a principled way to tune the cost function and the constraints of linear MPC schemes to achieve good performance and robust constraint satisfaction on uncertain nonlinear dynamics with additive noise. The tuning is performed using a novel MPC tuning algorithm based on backpropagation developed in our earlier work. Using the scenario approach, we provide probabilistic bounds on the likelihood of closed-loop constraint violation over a finite horizon. We showcase the effectiveness of the proposed method on linear and nonlinear simulation examples.
Submitted 7 March, 2024;
originally announced March 2024.
-
Joint Chance Constrained Optimal Control via Linear Programming
Authors:
Niklas Schmid,
Marta Fochesato,
Tobias Sutter,
John Lygeros
Abstract:
We establish a linear programming formulation for the solution of joint chance constrained optimal control problems over finite time horizons. The joint chance constraint may represent an invariance, reachability or reach-avoid specification that the trajectory must satisfy with a predefined probability. For finite state and action spaces, the solution is exact and our method computationally superior to approaches in the literature. For continuous state or action spaces, our linear programming formulation enables basis function approximations.
Submitted 18 May, 2024; v1 submitted 29 February, 2024;
originally announced February 2024.
-
Predictive Linear Online Tracking for Unknown Targets
Authors:
Anastasios Tsiamis,
Aren Karapetyan,
Yueshan Li,
Efe C. Balta,
John Lygeros
Abstract:
In this paper, we study the problem of online tracking in linear control systems, where the objective is to follow a moving target. Unlike classical tracking control, the target is unknown, non-stationary, and its state is revealed sequentially, thus fitting the framework of online non-stochastic control. We consider the case of quadratic costs and propose a new algorithm, called predictive linear online tracking (PLOT). The algorithm uses recursive least squares with exponential forgetting to learn a time-varying dynamic model of the target. The learned model is used in the optimal policy under the framework of receding horizon control. We show that the dynamic regret of PLOT scales with $\mathcal{O}(\sqrt{TV_T})$, where $V_T$ is the total variation of the target dynamics and $T$ is the time horizon. Unlike prior work, our theoretical results hold for non-stationary targets. We implement PLOT on a real quadrotor and provide open-source software, thus showcasing one of the first successful applications of online control methods on real hardware.
Submitted 13 June, 2024; v1 submitted 15 February, 2024;
originally announced February 2024.
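The abstract above relies on recursive least squares with exponential forgetting to learn the target dynamics. The following is a minimal sketch of the textbook update only, not the PLOT implementation: the scalar affine target model `r[t+1] = a*r[t] + b`, the forgetting factor `lam = 0.95`, the initial covariance, and the function name `rls_step` are assumptions made for illustration.

```python
def rls_step(theta, P, phi, y, lam=0.95):
    """One recursive-least-squares update with forgetting factor lam.

    theta: current parameter estimate (list of floats)
    P:     current symmetric covariance-like matrix (list of lists)
    phi:   regressor vector, y: measured output (scalar)
    """
    n = len(theta)
    P_phi = [sum(P[i][j] * phi[j] for j in range(n)) for i in range(n)]
    denom = lam + sum(phi[i] * P_phi[i] for i in range(n))
    gain = [v / denom for v in P_phi]
    # Prediction error of the current estimate on the new sample.
    err = y - sum(phi[i] * theta[i] for i in range(n))
    theta_new = [theta[i] + gain[i] * err for i in range(n)]
    # P is symmetric, so phi^T P equals (P phi)^T.
    P_new = [[(P[i][j] - gain[i] * P_phi[j]) / lam for j in range(n)]
             for i in range(n)]
    return theta_new, P_new


# Assumed example: learn a and b of r[t+1] = a*r[t] + b from noise-free data.
a_true, b_true = 0.9, 1.0
theta = [0.0, 0.0]
P = [[1000.0, 0.0], [0.0, 1000.0]]  # large initial P acts as a weak prior
r = 0.0
for _ in range(30):
    r_next = a_true * r + b_true
    theta, P = rls_step(theta, P, [r, 1.0], r_next)
    r = r_next

print(theta)
```

Dividing `P` by `lam` at every step discounts old samples geometrically, which is what lets the estimate follow slowly varying parameters; the price is that `P` can grow when the regressors stop being informative, so practical implementations often bound it.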
-
Towards a Systems Theory of Algorithms
Authors:
Florian Dörfler,
Zhiyu He,
Giuseppe Belgioioso,
Saverio Bolognani,
John Lygeros,
Michael Muehlebach
Abstract:
Traditionally, numerical algorithms are seen as isolated pieces of code confined to an {\em in silico} existence. However, this perspective is not appropriate for many modern computational approaches in control, learning, or optimization, wherein {\em in vivo} algorithms interact with their environment. Examples of such {\em open algorithms} include various real-time optimization-based control strategies, reinforcement learning, decision-making architectures, online optimization, and many more. Further, even {\em closed} algorithms in learning or optimization are increasingly abstracted in block diagrams with interacting dynamic modules and pipelines. In this opinion paper, we state our vision on a to-be-cultivated {\em systems theory of algorithms} and argue in favor of viewing algorithms as open dynamical systems interacting with other algorithms, physical systems, humans, or databases. Remarkably, the manifold tools developed under the umbrella of systems theory are well suited for addressing a range of challenges in the algorithmic domain. We survey various instances where the principles of algorithmic systems theory are being developed and outline pertinent modeling, analysis, and design challenges.
Submitted 30 April, 2024; v1 submitted 25 January, 2024;
originally announced January 2024.
-
BP-MPC: Optimizing the Closed-Loop Performance of MPC using BackPropagation
Authors:
Riccardo Zuliani,
Efe C. Balta,
John Lygeros
Abstract:
Model predictive control (MPC) is pervasive in research and industry. However, designing the cost function and the constraints of the MPC to maximize closed-loop performance remains an open problem. To achieve optimal tuning, we propose a backpropagation scheme that solves a policy optimization problem with nonlinear system dynamics and MPC policies. We enforce the system dynamics using linearization and allow the MPC problem to contain elements that depend on the current system state and on past MPC solutions. Moreover, we propose a simple extension that can deal with losses of feasibility. Our approach, unlike other methods in the literature, enjoys convergence guarantees.
Submitted 15 March, 2024; v1 submitted 24 December, 2023;
originally announced December 2023.
-
Convergence guarantees for adaptive model predictive control with kinky inference
Authors:
Riccardo Zuliani,
Raffaele Soloperto,
John Lygeros
Abstract:
We analyze the convergence properties of a robust adaptive model predictive control algorithm used to control an unknown nonlinear system. We show that by employing a standard quadratic stabilizing cost function, and by recursively updating the nominal model through kinky inference, the resulting controller ensures convergence of the true system to the origin, despite the presence of model uncertainty. We illustrate our theoretical findings through a numerical simulation.
Submitted 30 May, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Computing Optimal Joint Chance Constrained Control Policies
Authors:
Niklas Schmid,
Marta Fochesato,
Sarah H. Q. Li,
Tobias Sutter,
John Lygeros
Abstract:
We consider the problem of optimally controlling stochastic, Markovian systems subject to joint chance constraints over a finite-time horizon. For such problems, standard Dynamic Programming is inapplicable due to the time correlation of the joint chance constraints, which calls for non-Markovian, and possibly stochastic, policies. Hence, despite the popularity of this problem, solution approaches capable of providing provably-optimal and easy-to-compute policies are still missing. We fill this gap by introducing an augmented binary state to the system dynamics, allowing us to characterize the optimal policies and propose a Dynamic Programming based solution method. Our analysis provides a deep insight into the impact of joint chance constraints on the optimal control policies.
Submitted 16 December, 2023;
originally announced December 2023.
-
Distributionally Robust Infinite-horizon Control: from a pool of samples to the design of dependable controllers
Authors:
Jean-Sébastien Brouillon,
Andrea Martin,
John Lygeros,
Florian Dörfler,
Giancarlo Ferrari Trecate
Abstract:
We study control of constrained linear systems when faced with only partial statistical information about the disturbance processes affecting the system dynamics and the sensor measurements. Specifically, given a finite collection of disturbance realizations, we consider the problem of designing a stabilizing control policy with provable safety and performance guarantees in face of the inevitable mismatch between the true and the empirical distributions. We capture this discrepancy using Wasserstein ambiguity sets, and we formulate a distributionally robust (DR) optimal control problem, which provides guarantees on the expected cost, safety, and stability of the system. To solve this problem, we first present new results for DR optimization of quadratic objectives using convex programming, showing that strong duality holds under mild conditions. Then, by combining our results with the system level parametrization (SLP) of linear feedback policies, we show that the design problem can be reduced to a semidefinite optimization problem (SDP).
Submitted 12 December, 2023;
originally announced December 2023.
-
Closed-Loop Finite-Time Analysis of Suboptimal Online Control
Authors:
Aren Karapetyan,
Efe C. Balta,
Andrea Iannelli,
John Lygeros
Abstract:
Suboptimal methods in optimal control arise due to a limited computational budget, unknown system dynamics, or a short prediction window among other reasons. Although these methods are ubiquitous, their transient performance remains relatively unstudied. We consider the control of discrete-time, nonlinear time-varying dynamical systems and establish sufficient conditions to analyze the finite-time closed-loop performance of such methods in terms of the additional cost incurred due to suboptimality. Finite-time guarantees allow the control design to distribute a limited computational budget over a time horizon and estimate the on-the-go loss in performance due to sub-optimality. We study exponential incremental input-to-state stabilizing policies, and show that for nonlinear systems, under some mild conditions, this property is directly implied by exponential stability without further assumptions on global smoothness. The analysis is showcased on a suboptimal model predictive control use case.
Submitted 9 December, 2023;
originally announced December 2023.
-
Data-Driven Robust Reinforcement Learning Control of Uncertain Nonlinear Systems: Towards a Fully-Automated, Insulin-Based Artificial Pancreas
Authors:
Alexandros Tanzanakis,
John Lygeros
Abstract:
In this paper, a novel robust tracking control scheme for a general class of discrete-time nonlinear systems affected by unknown bounded uncertainty is presented. By solving a parameterized optimal tracking control problem subject to the unknown nominal system and a suitable cost function, the resulting optimal tracking control policy can ensure closed-loop stability by achieving a sufficiently small tracking error for the original uncertain nonlinear system. The computation of the optimal tracking controller is accomplished through the derivation of a novel Q-function-based $λ$-Policy Iteration algorithm. The proposed algorithm not only enjoys rigorous theoretical guarantees, but also avoids technical weaknesses of conventional reinforcement learning methods. By employing a data-driven, critic-only least squares implementation, the performance of the proposed algorithm is evaluated on the problem of fully-automated, insulin-based, closed-loop glucose control for patients diagnosed with Type 1 and Type 2 Diabetes Mellitus. The U.S. FDA-accepted DMMS.R simulator from the Epsilon Group is used to conduct a comprehensive in silico clinical campaign on a rich set of virtual subjects under completely unannounced meal and exercise settings. Simulation results underline the superior glycaemic behavior achieved by the derived approach, as well as its overall maturity for the design of highly-effective, closed-loop drug delivery systems for personalized medicine.
Submitted 7 December, 2023;
originally announced December 2023.
-
Layer-to-Layer Melt Pool Control in Laser Power Bed Fusion
Authors:
Dominic Liao-McPherson,
Efe C. Balta,
Mohamadreza Afrasiabi,
Alisa Rupenyan,
Markus Bambach,
John Lygeros
Abstract:
Additive manufacturing processes are flexible and efficient technologies for producing complex geometries. However, ensuring reliability and repeatability is challenging due to the complex physics and various sources of uncertainty in the process. In this work, we investigate closed-loop control of the melt pool dimensions in a laser powder bed fusion (LPBF) process. We propose a trajectory optimization-based layer-to-layer controller that adjusts the laser power input to the next layer to track a desired melt pool depth, and we validate our controller by placing it in closed loop with a high-fidelity multi-layer smoothed particle hydrodynamics simulator of a 2D LPBF process. Detailed numerical case studies demonstrate successful regulation of the melt pool depth on brick and overhang geometries and provide first-of-its-kind results on the effectiveness of layer-to-layer input optimization for the LPBF process as well as detailed insight into the physics of the controlled process. Computational complexity and process performance results illustrate the method's effectiveness and provide an outlook for its implementation on real systems.
Submitted 16 November, 2023;
originally announced November 2023.
-
Urban traffic congestion control: a DeePC change
Authors:
Alessio Rimoldi,
Carlo Cenedese,
Alberto Padoan,
Florian Dörfler,
John Lygeros
Abstract:
Urban traffic congestion remains a pressing challenge in our rapidly expanding cities, despite the abundance of available data and the efforts of policymakers. By leveraging behavioral system theory and data-driven control, this paper exploits the DeePC algorithm in the context of urban traffic control performed via dynamic traffic lights. To validate our approach, we consider a high-fidelity case study using the state-of-the-art simulation software package Simulation of Urban MObility (SUMO). Preliminary results indicate that DeePC outperforms existing approaches across various key metrics, including travel time and CO$_2$ emissions, demonstrating its potential for effective traffic management.
Submitted 16 November, 2023;
originally announced November 2023.
-
Interconnection of (Q,S,R)-Dissipative Systems in Discrete Time
Authors:
Andrea Martinelli,
Ahmed Aboudonia,
John Lygeros
Abstract:
Discrete-time systems cannot be passive unless there is a direct feedthrough from the input to the output. For passivity-based control to be exploited nevertheless, some authors introduce virtual outputs, while others rely on continuous-time passivity and then apply discretization techniques that preserve passivity in discrete time. Here we argue that quadratic supply rates incorporate and extend the effect of virtual outputs, allowing one to exploit dissipativity properties directly in discrete time. We derive decentralized (Q,S,R)-dissipativity conditions for a set of nonlinear systems interconnected with arbitrary topology, so that the overall network is guaranteed to be stable. For linear systems, we develop dissipative control conditions that are linear in the supply rate matrices. To demonstrate the validity of our methods, we provide numerical examples in the context of islanded microgrids.
Submitted 24 June, 2024; v1 submitted 14 November, 2023;
originally announced November 2023.
-
Multi-Step Optimal Tracking Control of Unknown Nonzero-Sum Games based on Least Squares and Linear Programming: An Application to a Fully-Automated, Dual-Hormone Artificial Pancreas
Authors:
Alexandros Tanzanakis,
John Lygeros
Abstract:
We consider the problem of optimal tracking control of unknown discrete-time nonlinear nonzero-sum games. The related state-of-the-art literature is mostly focused on Policy Iteration algorithms and multiple neural network approximation, which may lead to practical implementation challenges and high computational burden. To overcome these problems, we propose a novel Q-function-based multi-step Value Iteration algorithm, which provides the potential to accelerate convergence speed and improve the quality of solutions, with an easy-to-realize initialization condition. A critic-only least squares implementation approach is then employed, which alleviates the computational complexity of commonly used multiple neural network-based methods. Afterwards, by introducing the coupled Bellman operator, a novel linear programming approach is derived, based on which Nash equilibria can be approximately computed by solving a set of tractable finite-dimensional optimization problems. We evaluate the tracking control capabilities of the proposed algorithms on the problem of fully-automated dual-hormone (i.e., insulin and glucagon) glucose control in Type 1 Diabetes Mellitus. The U.S. FDA-accepted DMMS.R simulator from the Epsilon Group is used to conduct extensive in-silico clinical studies on virtual patients under a variety of completely unannounced meal and exercise scenarios. Simulation results demonstrate the high reliability and exceptional performance of the proposed multi-step algorithmic framework for critical complex systems.
Submitted 6 November, 2023;
originally announced November 2023.
-
Data-driven optimal control via linear programming: boundedness guarantees
Authors:
Lucia Falconi,
Andrea Martinelli,
John Lygeros
Abstract:
The linear programming (LP) approach is, together with value iteration and policy iteration, one of the three fundamental methods to solve optimal control problems in a dynamic programming setting. Despite its simple formulation, versatility, and predisposition to be employed in model-free settings, the LP approach has not enjoyed the same popularity as the other methods. The reason is the often poor scalability of the exact LP approach and the difficulty of obtaining bounded solutions with a reasonable number of constraints. We mitigate these issues here, by investigating fundamental geometric features of the LP and developing sufficient conditions to guarantee finite solutions with minimal constraints. In the model-free context, we show that boundedness can be guaranteed by a suitable choice of dataset and objective function.
Submitted 30 October, 2023;
originally announced October 2023.
-
Experimental Validation for Distributed Control of Energy Hubs
Authors:
Varsha Behrunani,
Philipp Heer,
John Lygeros
Abstract:
As future energy systems become more decentralised due to the integration of renewable energy resources and storage technologies, several autonomous energy management and peer-to-peer trading mechanisms have been recently proposed for the operation of energy hub networks based on optimization and game theory. However, most of these strategies have been tested either only in simulated environments or on small prosumer units as opposed to larger energy hubs. This simulation-reality gap has hindered large-scale implementation and practical application of these methods. In this paper, we aim to experimentally validate the performance of a novel multi-horizon distributed model predictive controller for an energy hub network by implementing the controller on a complete network of hubs comprising a real energy hub interfaced with multiple virtual hubs. The experiments are done using two different network topologies and the controller shows promising results in both setups.
Submitted 27 October, 2023;
originally announced October 2023.
-
Efficient safe learning for controller tuning with experimental validation
Authors:
Marta Zagorowska,
Christopher König,
Hanlin Yu,
Efe C. Balta,
Alisa Rupenyan,
John Lygeros
Abstract:
Optimization-based controller tuning is challenging because it requires formulating optimization problems explicitly as functions of controller parameters. Safe learning algorithms overcome the challenge by creating surrogate models from measured data. To ensure safety, such data-driven algorithms often rely on exhaustive grid search, which is computationally inefficient. In this paper, we propose a novel approach to safe learning by formulating a series of optimization problems instead of a grid search. We also develop a method for initializing the optimization problems to guarantee feasibility while using numerical solvers. The performance of the new method is first validated in a simulated precision motion system, demonstrating improved computational efficiency, and illustrating the role of exploiting numerical solvers to reach the desired precision. Experimental validation on an industrial-grade precision motion system confirms that the proposed algorithm achieves 30% better tracking at sub-micrometer precision than a state-of-the-art safe learning algorithm, improves the default auto-tuning solution, and reduces the computational cost seven times compared to learning algorithms based on exhaustive search.
Submitted 26 October, 2023;
originally announced October 2023.
-
Data-driven representations of conical, convex, and affine behaviors
Authors:
Alberto Padoan,
Florian Dörfler,
John Lygeros
Abstract:
The paper studies conical, convex, and affine models in the framework of behavioral systems theory. We investigate basic properties of such behaviors and address the problem of constructing models from measured data. We prove that closed, shift-invariant, conical, convex, and affine models have the intersection property, thereby enabling the definition of most powerful unfalsified models based on infinite-horizon measurements. We then provide necessary and sufficient conditions for representing conical, convex, and affine finite-horizon behaviors using raw data matrices, expressing persistence of excitation requirements in terms of non-negative rank conditions. The applicability of our results is demonstrated by a numerical example arising in population ecology.
Submitted 23 October, 2023;
originally announced October 2023.
-
Routing and charging game in ride-hailing service with electric vehicles
Authors:
Kenan Zhang,
John Lygeros
Abstract:
This paper studies the routing and charging behaviors of electric vehicles in a competitive ride-hailing market. When the vehicles are idle, they can choose whether to continue cruising to search for passengers, or move to a charging station to recharge. The behaviors of individual vehicles are then modeled by a Markov decision process (MDP). The state transitions in the MDP model, however, depend on the aggregate vehicle flows both in service zones and at charging stations. Accordingly, the value function of each vehicle is determined by the collective behaviors of all vehicles. Under the assumption of a large population, we formulate the collective routing and charging behaviors as a mean-field Markov game. We characterize the equilibrium of such a game, prove its existence, and numerically show that the competition among vehicles leads to "inefficient congestion" both in service zones and at charging stations.
Submitted 10 September, 2023;
originally announced September 2023.
-
Sequential Quadratic Programming-based Iterative Learning Control for Nonlinear Systems
Authors:
Samuel Balula,
Efe C. Balta,
Dominic Liao-McPherson,
Alisa Rupenyan,
John Lygeros
Abstract:
Learning-based control methods for industrial processes leverage the repetitive nature of the underlying process to learn optimal inputs for the system. While many works focus on linear systems, real-world problems involve nonlinear dynamics. In this work, we propose an algorithm for the nonlinear iterative learning control problem based on sequential quadratic programming, a well-studied method for nonconvex optimization. We repeatedly solve quadratic subproblems built using approximate nonlinear models and process measurements, to find an optimal input for the original system. We demonstrate our method in a trajectory optimization problem for a precision motion system. We present simulations to illustrate the performance of the proposed method for linear and nonlinear dynamics models.
Submitted 24 July, 2023;
originally announced July 2023.
-
Degradation-aware data-enabled predictive control of energy hubs
Authors:
Varsha Behrunani,
Marta Zagorowska,
Mathias Hudoba de Badyn,
Francesco Ricca,
Philipp Heer,
John Lygeros
Abstract:
Mitigating energy use in buildings and satisfying comfort requirements are the main objectives of efficient building control systems. Augmenting building energy systems with batteries can improve the energy use of a building, while posing the challenge of considering battery degradation during control operation. We demonstrate the performance of a data-enabled predictive control (DeePC) approach applied to a single multi-zone building and an energy hub comprising an electric heat pump and a battery. In a comparison with a standard rule-based controller, results demonstrate that the performance of DeePC is superior in terms of satisfaction of comfort constraints without increasing grid power consumption. Moreover, DeePC achieved a two-fold decrease in battery degradation over one year, as compared to a rule-based controller.
Submitted 4 July, 2023;
originally announced July 2023.
-
On the Guarantees of Minimizing Regret in Receding Horizon
Authors:
Andrea Martin,
Luca Furieri,
Florian Dörfler,
John Lygeros,
Giancarlo Ferrari-Trecate
Abstract:
Towards bridging classical optimal control and online learning, regret minimization has recently been proposed as a control design criterion. This competitive paradigm penalizes the loss relative to the optimal control actions chosen by a clairvoyant policy, and allows tracking the optimal performance in hindsight no matter how disturbances are generated. In this paper, we propose the first receding horizon scheme based on the repeated computation of finite horizon regret-optimal policies, and we establish stability and safety guarantees for the resulting closed-loop system. Our derivations combine novel monotonicity properties of clairvoyant policies with suitable terminal ingredients. We prove that our scheme is recursively feasible, stabilizing, and that it achieves bounded regret relative to the infinite horizon clairvoyant policy. Last, we show that the policy optimization problem can be solved efficiently through convex-concave programming. Our numerical experiments show that minimizing regret can outperform standard receding horizon approaches when the disturbances poorly fit classical design assumptions - even when the finite horizon planning is recomputed less frequently.
Submitted 26 June, 2023;
originally announced June 2023.
-
On the Finite-Time Behavior of Suboptimal Linear Model Predictive Control
Authors:
Aren Karapetyan,
Efe C. Balta,
Andrea Iannelli,
John Lygeros
Abstract:
Inexact methods for model predictive control (MPC), such as real-time iterative schemes or time-distributed optimization, alleviate the computational burden of exact MPC by providing suboptimal solutions. While the asymptotic stability of such algorithms is well studied, their finite-time performance has not received much attention. In this work, we quantify the performance of suboptimal linear model predictive control in terms of the additional closed-loop cost incurred due to performing only a finite number of optimization iterations. Leveraging this novel analysis framework, we propose a novel suboptimal MPC algorithm with a diminishing horizon length and finite-time closed-loop performance guarantees. This analysis allows the designer to plan a limited computational power budget distribution to achieve a desired performance level. We provide numerical examples to illustrate the algorithm's transient behavior and computational complexity.
Submitted 20 November, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Recursive Dynamic State Estimation for Power Systems with an Incomplete Nonlinear DAE Model
Authors:
Milos Katanic,
John Lygeros,
Gabriela Hug
Abstract:
Power systems are highly-complex, large-scale engineering systems subject to many uncertainties, which makes accurate mathematical modeling challenging. This paper proposes a novel, centralized dynamic state estimator for power systems that lack models of some components. Including the available dynamic evolution equations, algebraic network equations, and phasor measurements, we apply the least squares criterion to estimate all dynamic and algebraic states recursively. This approach results in an algorithm that generalizes the iterated extended Kalman filter and does not require static network observability. We further derive a graph theoretic condition to guarantee estimability for the proposed approach. A numerical study evaluates the performance under short circuits in the network and load changes for three different discretization schemes. The results show superior tracking performance compared to robust two-stage procedures within computational times that are feasible for real-time application.
Submitted 17 May, 2023;
originally announced May 2023.
-
Regret Optimal Control for Uncertain Stochastic Systems
Authors:
Andrea Martin,
Luca Furieri,
Florian Dörfler,
John Lygeros,
Giancarlo Ferrari-Trecate
Abstract:
We consider control of uncertain linear time-varying stochastic systems from the perspective of regret minimization. Specifically, we focus on the problem of designing a feedback controller that minimizes the loss relative to a clairvoyant optimal policy that has foreknowledge of both the system dynamics and the exogenous disturbances. In this competitive framework, establishing robustness guarantees proves challenging as, differently from the case where the model is known, the clairvoyant optimal policy is not only inapplicable, but also impossible to compute without knowledge of the system parameters. To address this challenge, we embrace a scenario optimization approach, and we propose minimizing regret robustly over a finite set of randomly sampled system parameters. We prove that this policy optimization problem can be solved through semidefinite programming, and that the corresponding solution retains strong probabilistic out-of-sample regret guarantees in face of the uncertain dynamics. Our method naturally extends to include satisfaction of safety constraints with high probability. We validate our theoretical results and showcase the potential of our approach by means of numerical simulations.
Submitted 19 March, 2024; v1 submitted 28 April, 2023;
originally announced April 2023.
-
Distributed Multi-Horizon Model Predictive Control for Network of Energy Hubs
Authors:
Varsha Behrunani,
Hanmin Cai,
Philipp Heer,
Roy S. Smith,
John Lygeros
Abstract:
The increasing penetration of renewable energy resources has transformed the energy system from a traditional hierarchical energy delivery paradigm to a distributed structure. Such development is accompanied by continuous liberalization in the energy sector, giving rise to possible energy trading among networked local energy hubs. Joint operation of such hubs can improve energy efficiency and support the integration of renewable energy resources. Acknowledging peer-to-peer trading between hubs, their optimal operation within the network can maximize consumption of locally produced energy. However, for such complex systems involving multiple stakeholders, both computational tractability and privacy concerns need to be accounted for. We investigate both decentralized and centralized model predictive control (MPC) approaches for a network of energy hubs. While the centralized control strategy offers superior performance to the decentralized method, its implementation is computationally prohibitive and raises privacy concerns, as the information of each hub has to be shared extensively. On the other hand, a classical decentralized control approach can ease the implementation at the expense of sub-optimal performance of the overall system. In this work, a distributed scheme based on a consensus alternating direction method of multipliers (ADMM) algorithm is proposed. It combines the performance of the centralized approach with the privacy preservation of the decentralized approach. A novel multi-horizon MPC framework is also introduced to increase the prediction horizon without compromising the time discretization or making the problem computationally intractable. A benchmark three-hub network is used to compare the performance of the mentioned methods. The results show superior performance in terms of total cost, computational time, and robustness to demand and price variations.
Submitted 27 April, 2023;
originally announced April 2023.
-
Stochastic MPC for energy hubs using data driven demand forecasting
Authors:
Varsha Behrunani,
Francesco Micheli,
Jonas Mehr,
Philipp Heer,
John Lygeros
Abstract:
Energy hubs convert and distribute energy resources by combining different energy inputs through multiple conversion and storage components. The optimal operation of the energy hub exploits its flexibility to increase the energy efficiency and reduce the operational costs. However, uncertainties in the demand present challenges to energy hub optimization. In this paper, we propose a stochastic MPC controller to minimize energy costs using chance constraints for the uncertain electricity and thermal demands. Historical data is used to build a demand prediction model based on Gaussian processes to generate a forecast of the future electricity and heat demands. The stochastic optimization problem is solved via the Scenario Approach by sampling multi-step demand trajectories from the derived prediction model. The performance of the proposed predictor and of the stochastic controller is verified on a simulated energy hub model and demand data from a real building.
Submitted 24 July, 2023; v1 submitted 24 April, 2023;
originally announced April 2023.
-
Generalized uncertain Nash games: Reformulation and robust equilibrium seeking -- Extended version
Authors:
Marta Fochesato,
Filippo Fabiani,
John Lygeros
Abstract:
We consider generalized Nash equilibrium problems (GNEPs) with linear coupling constraints affected by both local (i.e., agent-wise) and global (i.e., shared resources) disturbances taking values in polyhedral uncertainty sets. By making use of traditional tools borrowed from robust optimization, for this class of problems we derive a tractable, finite-dimensional reformulation leading to a deterministic "extended game", and we show that the latter still amounts to a GNEP featuring generalized Nash equilibria "in the worst-case". We then design a fully-distributed, accelerated algorithm based on monotone operator theory, which enjoys convergence towards a Nash equilibrium of the original, uncertain game under weak structural assumptions. Finally, we illustrate the effectiveness of the proposed distributed scheme through numerical simulations.
Submitted 6 April, 2023;
originally announced April 2023.
-
Online Linear Quadratic Tracking with Regret Guarantees
Authors:
Aren Karapetyan,
Diego Bolliger,
Anastasios Tsiamis,
Efe C. Balta,
John Lygeros
Abstract:
Online learning algorithms for dynamical systems provide finite time guarantees for control in the presence of sequentially revealed cost functions. We pose the classical linear quadratic tracking problem in the framework of online optimization where the time-varying reference state is unknown a priori and is revealed after the applied control input. We show the equivalence of this problem to the control of linear systems subject to adversarial disturbances and propose a novel online gradient descent based algorithm to achieve efficient tracking in finite time. We provide a dynamic regret upper bound scaling linearly with the path length of the reference trajectory and a numerical example to corroborate the theoretical guarantees.
Submitted 10 January, 2024; v1 submitted 17 March, 2023;
originally announced March 2023.
-
Designing Fairness in Autonomous Peer-to-peer Energy Trading
Authors:
Varsha Behrunani,
Andrew Irvine,
Giuseppe Belgioioso,
Philipp Heer,
John Lygeros,
Florian Dörfler
Abstract:
Several autonomous energy management and peer-to-peer trading mechanisms for future energy markets have been recently proposed based on optimization and game theory. In this paper, we study the impact of trading prices on the outcome of these market designs for energy-hub networks. We prove that, for a generic choice of trading prices, autonomous peer-to-peer trading is always network-wide beneficial but not necessarily individually beneficial for each hub. Therefore, we leverage hierarchical game theory to formalize the problem of designing locally-beneficial and network-wide fair peer-to-peer trading prices. Then, we propose a scalable and privacy-preserving price-mediation algorithm that provably converges to a profile of such prices. Numerical simulations on a 3-hub network show that the proposed algorithm can indeed incentivize active participation of energy hubs in autonomous peer-to-peer trading schemes.
Submitted 9 February, 2023;
originally announced February 2023.
-
Stress Flow Guided Non-Planar Print Trajectory Optimization for Additive Manufacturing of Anisotropic Polymers
Authors:
Xavier Guidetti,
Efe C. Balta,
Yannick Nagel,
Hang Yin,
Alisa Rupenyan,
John Lygeros
Abstract:
When manufacturing parts using material extrusion additive manufacturing and anisotropic polymers, the mechanical properties of a manufactured component are strongly dependent on the print trajectory orientation. We conduct non-planar slicing and optimize the print trajectories to maximize the alignment between the material deposition direction and the stress flow induced by a predefined load case. The trajectory optimization framework considers manufacturability constraints in the form of uniform layer height and line spacing. We demonstrate the method by manufacturing a load-bearing mechanical bracket using a 5-axis 3D printer and a liquid crystal polymer material. The failure strength and stiffness of the optimized bracket are improved by factors of 44 and 6, respectively, when compared with conventional printing.
Submitted 12 June, 2023; v1 submitted 12 January, 2023;
originally announced January 2023.
-
Optimal service station design for traffic mitigation via genetic algorithm and neural network
Authors:
Carlo Cenedese,
Michele Cucuzzella,
Adriano Cotta Ramusino,
Davide Spalenza,
John Lygeros,
Antonella Ferrara
Abstract:
This paper analyzes how the presence of service stations on highways affects traffic congestion. We focus on the problem of optimally designing a service station to achieve beneficial effects in terms of total traffic congestion and peak traffic reduction. Microsimulators cannot be used for this task due to their computational inefficiency. We propose a genetic algorithm based on the recently proposed CTMs, which efficiently describe the dynamics of a service station. Then, we leverage the algorithm to train a neural network capable of solving the same problem without implementing the CTMs. Finally, we examine two case studies to validate the capabilities and performance of our algorithms. In these simulations, we use real data extracted from Dutch highways.
Submitted 18 November, 2022;
originally announced November 2022.
-
Drone-based Volume Estimation in Indoor Environments
Authors:
Samuel Balula,
Dominic Liao-McPherson,
Stefan Stevšić,
Alisa Rupenyan,
John Lygeros
Abstract:
Volume estimation in large indoor spaces is an important challenge in robotic inspection of industrial warehouses. We propose an approach for volume estimation for autonomous systems using visual features for indoor localization and surface reconstruction from 2D-LiDAR measurements. A Gaussian Process-based model incorporates information collected from measurements given statistical prior information about the terrain, from which the volume estimate is computed. Our algorithm finds feasible trajectories which minimize the uncertainty of the volume estimate. We show results in simulation for the surface reconstruction and volume estimate of topographic data.
Submitted 15 November, 2022;
originally announced November 2022.
-
Probabilistic Reachability and Invariance Computation of Stochastic Systems using Linear Programming
Authors:
Niklas Schmid,
John Lygeros
Abstract:
We consider the safety evaluation of discrete-time stochastic systems over a finite horizon. To this end, we discuss and link probabilistic invariance with reachability as well as reach-avoid problems. We show how to efficiently compute these quantities using dynamic and linear programming.
Submitted 14 April, 2023; v1 submitted 14 November, 2022;
originally announced November 2022.
-
Implications of Regret on Stability of Linear Dynamical Systems
Authors:
Aren Karapetyan,
Anastasios Tsiamis,
Efe C. Balta,
Andrea Iannelli,
John Lygeros
Abstract:
The setting of an agent making decisions under uncertainty and under dynamic constraints is common for the fields of optimal control, reinforcement learning, and recently also for online learning. In the online learning setting, the quality of an agent's decision is often quantified by the concept of regret, comparing the performance of the chosen decisions to the best possible ones in hindsight. While regret is a useful performance measure, when dynamical systems are concerned, it is important to also assess the stability of the closed-loop system for a chosen policy. In this work, we show that for linear state feedback policies and linear systems subject to adversarial disturbances, linear regret implies asymptotic stability in both time-varying and time-invariant settings. Conversely, we also show that bounded input bounded state stability and summability of the state transition matrices imply linear regret.
Submitted 15 April, 2023; v1 submitted 14 November, 2022;
originally announced November 2022.
-
Follow the Clairvoyant: an Imitation Learning Approach to Optimal Control
Authors:
Andrea Martin,
Luca Furieri,
Florian Dörfler,
John Lygeros,
Giancarlo Ferrari-Trecate
Abstract:
We consider control of dynamical systems through the lens of competitive analysis. Most prior work in this area focuses on minimizing regret, that is, the loss relative to an ideal clairvoyant policy that has noncausal access to past, present, and future disturbances. Motivated by the observation that the optimal cost only provides coarse information about the ideal closed-loop behavior, we instead propose directly minimizing the tracking error relative to the optimal trajectories in hindsight, i.e., imitating the clairvoyant policy. By embracing a system level perspective, we present an efficient optimization-based approach for computing follow-the-clairvoyant (FTC) safe controllers. We prove that these attain minimal regret if no constraints are imposed on the noncausal benchmark. In addition, we present numerical experiments to show that our policy retains the hallmark of competitive algorithms of interpolating between classical $\mathcal{H}_2$ and $\mathcal{H}_\infty$ control laws, while consistently outperforming regret minimization methods in constrained scenarios thanks to the superior ability to chase the clairvoyant.
Submitted 14 November, 2022;
originally announced November 2022.
-
Stability and Robustness of Distributed Suboptimal Model Predictive Control
Authors:
Giuseppe Belgioioso,
Dominic Liao-McPherson,
Mathias Hudoba de Badyn,
Nicolas Pelzmann,
John Lygeros,
Florian Dörfler
Abstract:
In distributed model predictive control (MPC), the control input at each sampling time is computed by solving a large-scale optimal control problem (OCP) over a finite horizon using distributed algorithms. Typically, such algorithms require several (virtually, infinite) communication rounds between the subsystems to converge, which is a major drawback both computationally and from an energetic perspective (for wireless systems). Motivated by these challenges, we propose a suboptimal distributed MPC scheme in which the total communication burden is distributed also in time, by maintaining a running solution estimate for the large-scale OCP and updating it at each sampling time. We demonstrate that, under some regularity conditions, the resulting suboptimal MPC control law recovers the qualitative robust stability properties of optimal MPC, if the communication budget at each sampling time is large enough.
Submitted 27 March, 2023; v1 submitted 14 November, 2022;
originally announced November 2022.