Search | arXiv e-print repository

An efficient data-based off-policy Q-learning algorithm for optimal output feedback control of linear systems

Authors: Mohammad Alsalti, Victor G. Lopez, Matthias A. Müller

Abstract: In this paper, we present a Q-learning algorithm to solve the optimal output regulation problem for discrete-time LTI systems. This off-policy algorithm only relies on using persistently exciting input-output data, measured offline. No model knowledge or state measurements are needed and the obtained optimal policy only uses past input-output information. Moreover, our formulation of the proposed algorithm renders it computationally efficient. We provide conditions that guarantee the convergence of the algorithm to the optimal solution. Finally, the performance of our method is compared to existing algorithms in the literature.

Submitted 6 December, 2023; originally announced December 2023.

arXiv:2311.17484

Notes on data-driven output-feedback control of linear MIMO systems

Authors: Mohammad Alsalti, Victor G. Lopez, Matthias A. Müller

Abstract: Recent works have approached the data-driven design of output-feedback controllers for discrete-time LTI systems by constructing non-minimal state vectors composed of past inputs and outputs. Depending on the system's complexity (order, lag and number of inputs), it was observed in several works that such an approach presents certain limitations, but no methods were proposed to overcome them. In this note, we clarify these limitations and solve them by proposing the construction of (alternative) non-minimal state vectors that facilitate output-feedback control of MIMO discrete-time LTI systems.

Submitted 29 November, 2023; originally announced November 2023.

arXiv:2309.11238

Sample- and computationally efficient data-driven predictive control

Authors: Mohammad Alsalti, Manuel Barkey, Victor G. Lopez, Matthias A. Müller

Abstract: Recently proposed data-driven predictive control schemes for LTI systems use non-parametric representations based on the image of a Hankel matrix of previously collected, persistently exciting, input-output data. Persistence of excitation necessitates that the data is sufficiently long and, hence, the computational complexity of the corresponding finite-horizon optimal control problem increases. In this paper, we propose an efficient data-driven predictive control (eDDPC) scheme which is both more sample efficient (requires less offline data) and computationally efficient (uses less decision variables) compared to existing schemes. This is done by leveraging an alternative data-based representation of the trajectories of LTI systems. We analytically and numerically compare the performance of this scheme to existing ones from the literature.

Submitted 6 March, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

Comments: accepted for presentation at the 22nd European Control Conference (ECC) in Stockholm, Sweden

arXiv:2307.11589

Data-based system representations from irregularly measured data

Authors: Mohammad Alsalti, Ivan Markovsky, Victor G. Lopez, Matthias A. Müller

Abstract: Non-parametric representations of dynamical systems based on the image of a Hankel matrix of data are extensively used for data-driven control. However, if samples of data are missing, obtaining such representations becomes a difficult task. By exploiting the kernel structure of Hankel matrices of irregularly measured data generated by a linear time-invariant system, we provide computational methods for which any complete finite-length behavior of the system can be obtained. For the special case of periodically missing outputs, we provide conditions on the input such that the former result is guaranteed. In the presence of noise in the data, our method returns an approximate finite-length behavior of the system. We illustrate our result with several examples, including its use for approximate data completion in real-world applications and compare it to alternative methods.

Submitted 24 February, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

Comments: submitted to IEEE Transactions on Automatic Control

arXiv:2303.08707

doi 10.1109/LCSYS.2023.3287133

On the design of persistently exciting inputs for data-driven control of linear and nonlinear systems

Authors: Mohammad Alsalti, Victor G. Lopez, Matthias A. Müller

Abstract: In the context of data-driven control, persistence of excitation (PE) of an input sequence is defined in terms of a rank condition on the Hankel matrix of the input data. For nonlinear systems, recent results employed rank conditions involving collected input and state/output data, for which no guidelines are available on how to satisfy them a priori. In this paper, we first show that a set of discrete impulses is guaranteed to be persistently exciting for any controllable LTI system. Based on this result, for certain classes of nonlinear systems, we guarantee persistence of excitation of sequences of basis functions a priori, by design of the physical input only.

Submitted 19 June, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

Journal ref: IEEE Control Systems Letters, 2023
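
The rank condition mentioned in the abstract above can be illustrated with a minimal sketch. It assumes the common definition in which an input sequence is persistently exciting of a given order when its Hankel matrix of that depth has full row rank. The helper names `hankel` and `is_persistently_exciting` are hypothetical and are not taken from the paper, and the single-impulse input only exercises the rank test; it is not the paper's impulse construction.

```python
import numpy as np


def hankel(w, depth):
    """Block-Hankel matrix with `depth` block rows built from a signal w.

    w has shape (T,) or (T, m); the result has shape (m * depth, T - depth + 1).
    """
    w = np.asarray(w, dtype=float)
    if w.ndim == 1:
        w = w[:, None]
    T, _ = w.shape
    cols = T - depth + 1
    if cols < 1:
        raise ValueError("signal too short for the requested depth")
    # Column j stacks the samples w[j], w[j+1], ..., w[j+depth-1].
    return np.vstack([w[i:i + cols].T for i in range(depth)])


def is_persistently_exciting(u, order):
    """Assumed PE test: the depth-`order` Hankel matrix of u has full row rank."""
    H = hankel(u, order)
    return bool(np.linalg.matrix_rank(H) == H.shape[0])


# Illustrative scalar input: a single unit impulse in a length-10 sequence.
u = np.zeros(10)
u[3] = 1.0
print(is_persistently_exciting(u, 4))  # True
print(is_persistently_exciting(u, 5))  # False

# A constant input is not persistently exciting of order 2.
print(is_persistently_exciting(np.ones(20), 2))  # False
```

The check is a plain numerical rank test, so for noisy or badly scaled data the tolerance used by `np.linalg.matrix_rank` may need to be set explicitly.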

arXiv:2211.06339

Data-driven Nonlinear Predictive Control for Feedback Linearizable Systems

Authors: Mohammad Alsalti, Victor G. Lopez, Julian Berberich, Frank Allgöwer, Matthias A. Müller

Abstract: We present a data-driven nonlinear predictive control approach for the class of discrete-time multi-input multi-output feedback linearizable nonlinear systems. The scheme uses a non-parametric predictive model based only on input and noisy output data along with a set of basis functions that approximate the unknown nonlinearities. Despite the noisy output data as well as the mismatch caused by the use of basis functions, we show that the proposed multistep robust data-driven nonlinear predictive control scheme is recursively feasible and renders the closed-loop system practically exponentially stable. We illustrate our results on a model of a fully-actuated double inverted pendulum.

Submitted 27 March, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

Comments: accepted to IFAC World Congress 2023. arXiv admin note: substantial text overlap with arXiv:2204.01148

arXiv:2204.01150

Practical exponential stability of a robust data-driven nonlinear predictive control scheme

Authors: Mohammad Alsalti, Victor G. Lopez, Julian Berberich, Frank Allgöwer, Matthias A. Müller

Abstract: We provide theoretical guarantees for recursive feasibility and practical exponential stability of the closed-loop system of a feedback linearizable nonlinear system when controlled by a robust data-driven nonlinear predictive control scheme. This technical report serves as a supplementary material to our recent paper "Data-driven Nonlinear Predictive Control for Feedback Linearizable Systems". The arguments shown in this report follow similar steps to those for the LTI case, since feedback linearizable systems are linear in transformed coordinates. However, the proof was suitably adapted to match the nonlinear setting under consideration, and the differences are emphasized throughout the proof.

Submitted 27 March, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

Comments: This technical report serves as a supplementary material to our recent paper "Data-driven Nonlinear Predictive Control for Feedback Linearizable Systems"

arXiv:2204.01148

doi 10.1109/TAC.2023.3249289

Data-based Control of Feedback Linearizable Systems

Authors: Mohammad Alsalti, Victor G. Lopez, Julian Berberich, Frank Allgöwer, Matthias A. Müller

Abstract: We present an extension of Willems' Fundamental Lemma to the class of multi-input multi-output discrete-time feedback linearizable nonlinear systems, thus providing a data-based representation of their input-output trajectories. Two sources of uncertainty are considered. First, the unknown linearizing input is inexactly approximated by a set of basis functions. Second, the measured output data is contaminated by additive noise. Further, we propose an approach to approximate the solution of the data-based simulation and output matching problems, and show that the difference from the true solution is bounded. Finally, the results are illustrated on an example of a fully-actuated double inverted pendulum.

Submitted 16 March, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

Journal ref: IEEE Transactions on Automatic Control, 2023

arXiv:2105.07761

doi 10.1109/TAC.2023.3235967

Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems

Authors: Victor G. Lopez, Mohammad Alsalti, Matthias A. Müller

Abstract: This paper introduces and analyzes an improved Q-learning algorithm for discrete-time linear time-invariant systems. The proposed method does not require any knowledge of the system dynamics, and it enjoys significant efficiency advantages over other data-based optimal control methods in the literature. This algorithm can be fully executed off-line, as it does not require to apply the current estimate of the optimal input to the system as in on-policy algorithms. It is shown that a persistently exciting input, defined from an easily tested matrix rank condition, guarantees the convergence of the algorithm. A data-based method is proposed to design the initial stabilizing feedback gain that the algorithm requires. Robustness of the algorithm in the presence of noisy measurements is analyzed. We compare the proposed algorithm in simulation to different direct and indirect data-based control design methods.

Submitted 31 March, 2023; v1 submitted 17 May, 2021; originally announced May 2021.

Comments: 12 pages, journal article

Journal ref: IEEE Transactions on Automatic Control, pp. 1-12, 2023

arXiv:2103.02892

Data-Based System Analysis and Control of Flat Nonlinear Systems

Authors: Mohammad Alsalti, Julian Berberich, Victor G. Lopez, Frank Allgöwer, Matthias A. Müller

Abstract: Willems et al. showed that all input-output trajectories of a discrete-time linear time-invariant system can be obtained using linear combinations of time shifts of a single, persistently exciting, input-output trajectory of that system. In this paper, we extend this result to the class of discrete-time single-input single-output flat nonlinear systems. We propose a data-based parametrization of all trajectories using only input-output data. Further, we use this parametrization to solve the data-based simulation and output-matching control problems for the unknown system without explicitly identifying a model. Finally, we illustrate the main results with numerical examples.

Submitted 30 September, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

Comments: 6 pages, accepted for publication in Proc. of the 2021 IEEE Conference on Decision and Control (CDC)
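
The linear result by Willems et al. that the abstract above starts from can be sketched numerically. This is a minimal illustration for the LTI case only, not the paper's nonlinear extension: the second-order system, the data length, and all names (`simulate`, `hankel`, `image_residual`) are assumptions chosen for the example. It checks that a fresh trajectory of the system is a linear combination of time shifts of one recorded trajectory, while a vector that is not a trajectory is not.

```python
import numpy as np


def simulate(A, B, C, x0, u):
    """Output of x+ = A x + B u, y = C x for a scalar input sequence u."""
    x = np.asarray(x0, dtype=float)
    ys = []
    for uk in u:
        ys.append(C @ x)
        x = A @ x + B * uk
    return np.array(ys)


def hankel(w, depth):
    """Hankel matrix with `depth` rows from a scalar signal w."""
    w = np.asarray(w, dtype=float)
    cols = len(w) - depth + 1
    return np.array([w[i:i + cols] for i in range(depth)])


def image_residual(H, w):
    """Distance from w to the column space of H, via least squares."""
    g, *_ = np.linalg.lstsq(H, w, rcond=None)
    return float(np.linalg.norm(H @ g - w))


# Hypothetical controllable and observable system of order n = 2.
A = np.array([[0.9, 0.2], [0.0, 0.7]])
B = np.array([0.0, 1.0])
C = np.array([1.0, 0.0])

rng = np.random.default_rng(0)
T, L = 40, 5  # recorded data length and length of trajectories to represent

# One recorded trajectory driven by a random (generically exciting) input.
u_d = rng.standard_normal(T)
y_d = simulate(A, B, C, [0.0, 0.0], u_d)
H = np.vstack([hankel(u_d, L), hankel(y_d, L)])

# A new trajectory with a different initial state and input.
u_new = rng.standard_normal(L)
y_new = simulate(A, B, C, [1.0, -2.0], u_new)
w_valid = np.concatenate([u_new, y_new])

# Corrupting the last output sample gives a vector that is not a trajectory.
w_invalid = w_valid.copy()
w_invalid[-1] += 1.0

res_valid = image_residual(H, w_valid)
res_invalid = image_residual(H, w_invalid)
print(res_valid, res_invalid)
```

The first residual should be at the level of floating-point round-off and the second clearly nonzero, which is the sense in which the Hankel matrix of data acts as a non-parametric model of the system.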

Showing 1–10 of 10 results for author: Alsalti, M