-
An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems
Authors:
Mohammad Alsalti,
Victor G. Lopez,
Matthias A. Müller
Abstract:
In this paper, we present a Q-learning algorithm to solve the optimal output regulation problem for discrete-time LTI systems. This off-policy algorithm only relies on using persistently exciting input-output data, measured offline. No model knowledge or state measurements are needed and the obtained optimal policy only uses past input-output information. Moreover, our formulation of the proposed…
▽ More
In this paper, we present a Q-learning algorithm to solve the optimal output regulation problem for discrete-time LTI systems. This off-policy algorithm only relies on using persistently exciting input-output data, measured offline. No model knowledge or state measurements are needed and the obtained optimal policy only uses past input-output information. Moreover, our formulation of the proposed algorithm renders it computationally efficient. We provide conditions that guarantee the convergence of the algorithm to the optimal solution. Finally, the performance of our method is compared to existing algorithms in the literature.
△ Less
Submitted 6 December, 2023;
originally announced December 2023.
-
Notes on data-driven output-feedback control of linear MIMO systems
Authors:
Mohammad Alsalti,
Victor G. Lopez,
Matthias A. Müller
Abstract:
Recent works have approached the data-driven design of output-feedback controllers for discrete-time LTI systems by constructing non-minimal state vectors composed of past inputs and outputs. Depending on the system's complexity (order, lag and number of inputs), it was observed in several works that such an approach presents certain limitations, but no methods were proposed to overcome them. In t…
▽ More
Recent works have approached the data-driven design of output-feedback controllers for discrete-time LTI systems by constructing non-minimal state vectors composed of past inputs and outputs. Depending on the system's complexity (order, lag and number of inputs), it was observed in several works that such an approach presents certain limitations, but no methods were proposed to overcome them. In this note, we clarify these limitations and solve them by proposing the construction of (alternative) non-minimal state vectors that facilitate output-feedback control of MIMO discrete-time LTI systems.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
Sample- and computationally efficient data-driven predictive control
Authors:
Mohammad Alsalti,
Manuel Barkey,
Victor G. Lopez,
Matthias A. Müller
Abstract:
Recently proposed data-driven predictive control schemes for LTI systems use non-parametric representations based on the image of a Hankel matrix of previously collected, persistently exciting, input-output data. Persistence of excitation necessitates that the data is sufficiently long and, hence, the computational complexity of the corresponding finite-horizon optimal control problem increases. I…
▽ More
Recently proposed data-driven predictive control schemes for LTI systems use non-parametric representations based on the image of a Hankel matrix of previously collected, persistently exciting, input-output data. Persistence of excitation necessitates that the data is sufficiently long and, hence, the computational complexity of the corresponding finite-horizon optimal control problem increases. In this paper, we propose an efficient data-driven predictive control (eDDPC) scheme which is both more sample efficient (requires less offline data) and computationally efficient (uses less decision variables) compared to existing schemes. This is done by leveraging an alternative data-based representation of the trajectories of LTI systems. We analytically and numerically compare the performance of this scheme to existing ones from the literature.
△ Less
Submitted 6 March, 2024; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Data-based system representations from irregularly measured data
Authors:
Mohammad Alsalti,
Ivan Markovsky,
Victor G. Lopez,
Matthias A. Müller
Abstract:
Non-parametric representations of dynamical systems based on the image of a Hankel matrix of data are extensively used for data-driven control. However, if samples of data are missing, obtaining such representations becomes a difficult task. By exploiting the kernel structure of Hankel matrices of irregularly measured data generated by a linear time-invariant system, we provide computational metho…
▽ More
Non-parametric representations of dynamical systems based on the image of a Hankel matrix of data are extensively used for data-driven control. However, if samples of data are missing, obtaining such representations becomes a difficult task. By exploiting the kernel structure of Hankel matrices of irregularly measured data generated by a linear time-invariant system, we provide computational methods for which any complete finite-length behavior of the system can be obtained. For the special case of periodically missing outputs, we provide conditions on the input such that the former result is guaranteed. In the presence of noise in the data, our method returns an approximate finite-length behavior of the system. We illustrate our result with several examples, including its use for approximate data completion in real-world applications and compare it to alternative methods.
△ Less
Submitted 24 February, 2024; v1 submitted 21 July, 2023;
originally announced July 2023.
-
On the design of persistently exciting inputs for data-driven control of linear and nonlinear systems
Authors:
Mohammad Alsalti,
Victor G. Lopez,
Matthias A. Müller
Abstract:
In the context of data-driven control, persistence of excitation (PE) of an input sequence is defined in terms of a rank condition on the Hankel matrix of the input data. For nonlinear systems, recent results employed rank conditions involving collected input and state/output data, for which no guidelines are available on how to satisfy them a priori. In this paper, we first show that a set of dis…
▽ More
In the context of data-driven control, persistence of excitation (PE) of an input sequence is defined in terms of a rank condition on the Hankel matrix of the input data. For nonlinear systems, recent results employed rank conditions involving collected input and state/output data, for which no guidelines are available on how to satisfy them a priori. In this paper, we first show that a set of discrete impulses is guaranteed to be persistently exciting for any controllable LTI system. Based on this result, for certain classes of nonlinear systems, we guarantee persistence of excitation of sequences of basis functions a priori, by design of the physical input only.
△ Less
Submitted 19 June, 2023; v1 submitted 15 March, 2023;
originally announced March 2023.
-
Data-driven Nonlinear Predictive Control for Feedback Linearizable Systems
Authors:
Mohammad Alsalti,
Victor G. Lopez,
Julian Berberich,
Frank Allgöwer,
Matthias A. Müller
Abstract:
We present a data-driven nonlinear predictive control approach for the class of discrete-time multi-input multi-output feedback linearizable nonlinear systems. The scheme uses a non-parametric predictive model based only on input and noisy output data along with a set of basis functions that approximate the unknown nonlinearities. Despite the noisy output data as well as the mismatch caused by the…
▽ More
We present a data-driven nonlinear predictive control approach for the class of discrete-time multi-input multi-output feedback linearizable nonlinear systems. The scheme uses a non-parametric predictive model based only on input and noisy output data along with a set of basis functions that approximate the unknown nonlinearities. Despite the noisy output data as well as the mismatch caused by the use of basis functions, we show that the proposed multistep robust data-driven nonlinear predictive control scheme is recursively feasible and renders the closed-loop system practically exponentially stable. We illustrate our results on a model of a fully-actuated double inverted pendulum.
△ Less
Submitted 27 March, 2023; v1 submitted 11 November, 2022;
originally announced November 2022.
-
Practical exponential stability of a robust data-driven nonlinear predictive control scheme
Authors:
Mohammad Alsalti,
Victor G. Lopez,
Julian Berberich,
Frank Allgöwer,
Matthias A. Müller
Abstract:
We provide theoretical guarantees for recursive feasibility and practical exponential stability of the closed-loop system of a feedback linearizable nonlinear system when controlled by a robust data-driven nonlinear predictive control scheme. This technical report serves as a supplementary material to our recent paper "Data-driven Nonlinear Predictive Control for Feedback Linearizable Systems". Th…
▽ More
We provide theoretical guarantees for recursive feasibility and practical exponential stability of the closed-loop system of a feedback linearizable nonlinear system when controlled by a robust data-driven nonlinear predictive control scheme. This technical report serves as a supplementary material to our recent paper "Data-driven Nonlinear Predictive Control for Feedback Linearizable Systems". The arguments shown in this report follow similar steps to those for the LTI case, since feedback linearizable systems are linear in transformed coordinates. However, the proof was suitably adapted to match the nonlinear setting under consideration, and the differences are emphasized throughout the proof.
△ Less
Submitted 27 March, 2023; v1 submitted 3 April, 2022;
originally announced April 2022.
-
Data-based Control of Feedback Linearizable Systems
Authors:
Mohammad Alsalti,
Victor G. Lopez,
Julian Berberich,
Frank Allgöwer,
Matthias A. Müller
Abstract:
We present an extension of Willems' Fundamental Lemma to the class of multi-input multi-output discrete-time feedback linearizable nonlinear systems, thus providing a data-based representation of their input-output trajectories. Two sources of uncertainty are considered. First, the unknown linearizing input is inexactly approximated by a set of basis functions. Second, the measured output data is…
▽ More
We present an extension of Willems' Fundamental Lemma to the class of multi-input multi-output discrete-time feedback linearizable nonlinear systems, thus providing a data-based representation of their input-output trajectories. Two sources of uncertainty are considered. First, the unknown linearizing input is inexactly approximated by a set of basis functions. Second, the measured output data is contaminated by additive noise. Further, we propose an approach to approximate the solution of the data-based simulation and output matching problems, and show that the difference from the true solution is bounded. Finally, the results are illustrated on an example of a fully-actuated double inverted pendulum.
△ Less
Submitted 16 March, 2023; v1 submitted 3 April, 2022;
originally announced April 2022.
-
Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems
Authors:
Victor G. Lopez,
Mohammad Alsalti,
Matthias A. Müller
Abstract:
This paper introduces and analyzes an improved Q-learning algorithm for discrete-time linear time-invariant systems. The proposed method does not require any knowledge of the system dynamics, and it enjoys significant efficiency advantages over other data-based optimal control methods in the literature. This algorithm can be fully executed off-line, as it does not require to apply the current esti…
▽ More
This paper introduces and analyzes an improved Q-learning algorithm for discrete-time linear time-invariant systems. The proposed method does not require any knowledge of the system dynamics, and it enjoys significant efficiency advantages over other data-based optimal control methods in the literature. This algorithm can be fully executed off-line, as it does not require to apply the current estimate of the optimal input to the system as in on-policy algorithms. It is shown that a persistently exciting input, defined from an easily tested matrix rank condition, guarantees the convergence of the algorithm. A data-based method is proposed to design the initial stabilizing feedback gain that the algorithm requires. Robustness of the algorithm in the presence of noisy measurements is analyzed. We compare the proposed algorithm in simulation to different direct and indirect data-based control design methods.
△ Less
Submitted 31 March, 2023; v1 submitted 17 May, 2021;
originally announced May 2021.
-
Data-Based System Analysis and Control of Flat Nonlinear Systems
Authors:
Mohammad Alsalti,
Julian Berberich,
Victor G. Lopez,
Frank Allgöwer,
Matthias A. Müller
Abstract:
Willems et al. showed that all input-output trajectories of a discrete-time linear time-invariant system can be obtained using linear combinations of time shifts of a single, persistently exciting, input-output trajectory of that system. In this paper, we extend this result to the class of discrete-time single-input single-output flat nonlinear systems. We propose a data-based parametrization of a…
▽ More
Willems et al. showed that all input-output trajectories of a discrete-time linear time-invariant system can be obtained using linear combinations of time shifts of a single, persistently exciting, input-output trajectory of that system. In this paper, we extend this result to the class of discrete-time single-input single-output flat nonlinear systems. We propose a data-based parametrization of all trajectories using only input-output data. Further, we use this parametrization to solve the data-based simulation and output-matching control problems for the unknown system without explicitly identifying a model. Finally, we illustrate the main results with numerical examples.
△ Less
Submitted 30 September, 2021; v1 submitted 4 March, 2021;
originally announced March 2021.