-
Towards non-stochastic targeted exploration
Authors:
Janani Venkatasubramanian,
Johannes Köhler,
Mark Cannon,
Frank Allgöwer
Abstract:
We present a novel targeted exploration strategy for linear time-invariant systems without stochastic assumptions on the noise, i.e., without requiring independence or zero mean, allowing for deterministic model misspecifications. This work utilizes classical data-dependent uncertainty bounds on the least-squares parameter estimates in the presence of energy-bounded noise. We provide a sufficient condition on the exploration data that ensures a desired error bound on the estimated parameter. Using common approximations, we derive a semidefinite program to compute the optimal sinusoidal input excitation. Finally, we highlight the differences and commonalities between the developed non-stochastic targeted exploration strategy and conventional exploration strategies based on classical identification bounds through a numerical example.
Submitted 10 December, 2023;
originally announced December 2023.
-
Sequential learning and control: Targeted exploration for robust performance
Authors:
Janani Venkatasubramanian,
Johannes Köhler,
Julian Berberich,
Frank Allgöwer
Abstract:
We present a novel dual control strategy for uncertain linear systems based on targeted harmonic exploration and gain-scheduling with performance and excitation guarantees. In the proposed sequential approach, robust control is implemented after exploration with the main feature that the exploration is optimized with respect to the robust control performance. Specifically, we leverage recent results on finite excitation using spectral lines to determine a high probability lower bound on the resultant finite excitation of the exploration data. This provides an a priori upper bound on the remaining model uncertainty after exploration, which can further be leveraged in a gain-scheduling controller design that guarantees robust performance. This leads to a semidefinite program-based design which computes an exploration strategy with finite excitation bounds and minimal energy, and a gain-scheduled controller with probabilistic performance bounds that can be implemented after exploration. The effectiveness of our approach and its benefits over common random exploration strategies are demonstrated with an example of a system which is 'hard to learn'.
Submitted 12 October, 2023; v1 submitted 19 January, 2023;
originally announced January 2023.
-
Robust Dual Control based on Gain Scheduling
Authors:
Janani Venkatasubramanian,
Johannes Köhler,
Julian Berberich,
Frank Allgöwer
Abstract:
We present a novel strategy for robust dual control of linear time-invariant systems based on gain scheduling with performance guarantees. This work relies on prior results of determining uncertainty bounds of system parameters estimated through exploration. Existing approaches are unable to account for changes of the mean of system parameters in the exploration phase and thus to accurately capture the dual effect. We address this limitation by selecting the future (uncertain) mean as a scheduling variable in the control design. The result is a semi-definite program-based design that computes a suitable exploration strategy and a robust gain-scheduled controller with probabilistic quadratic performance bounds after the exploration phase.
Submitted 13 May, 2021; v1 submitted 9 April, 2020;
originally announced April 2020.