Showing 1–10 of 10 results for author: Forbes, M G

Searching in archive eess.
  1. arXiv:2404.15512

    eess.SY

    Deep Hankel matrices with random elements

    Authors: Nathan P. Lawrence, Philip D. Loewen, Shuyuan Wang, Michael G. Forbes, R. Bhushan Gopaluni

    Abstract: Willems' fundamental lemma enables a trajectory-based characterization of linear systems through data-based Hankel matrices. However, in the presence of measurement noise, we ask: Is this noisy Hankel-based model expressive enough to re-identify itself? In other words, we study the output prediction accuracy from recursively applying the same persistently exciting input sequence to the model. We f…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: L4DC 2024

  2. arXiv:2310.14098

    cs.LG cs.AI eess.SY math.OC

    Stabilizing reinforcement learning control: A modular framework for optimizing over all stable behavior

    Authors: Nathan P. Lawrence, Philip D. Loewen, Shuyuan Wang, Michael G. Forbes, R. Bhushan Gopaluni

    Abstract: We propose a framework for the design of feedback controllers that combines the optimization-driven and model-free advantages of deep reinforcement learning with the stability guarantees provided by using the Youla-Kucera parameterization to define the search domain. Recent advances in behavioral systems allow us to construct a data-driven internal model; this enables an alternative realization of…

    Submitted 21 March, 2024; v1 submitted 21 October, 2023; originally announced October 2023.

    Comments: Postprint; 31 pages. arXiv admin note: text overlap with arXiv:2304.03422

    Journal ref: Automatica 2024

  3. Reinforcement Learning with Partial Parametric Model Knowledge

    Authors: Shuyuan Wang, Philip D. Loewen, Nathan P. Lawrence, Michael G. Forbes, R. Bhushan Gopaluni

    Abstract: We adapt reinforcement learning (RL) methods for continuous control to bridge the gap between complete ignorance and perfect knowledge of the environment. Our method, Partial Knowledge Least Squares Policy Iteration (PLSPI), takes inspiration from both model-free RL and model-based control. It uses incomplete information from a partial model and retains RL's data-driven adaption towards optimal pe…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: IFAC World Congress 2023

    Journal ref: IFAC-PapersOnLine 2023

  4. A modular framework for stabilizing deep reinforcement learning control

    Authors: Nathan P. Lawrence, Philip D. Loewen, Shuyuan Wang, Michael G. Forbes, R. Bhushan Gopaluni

    Abstract: We propose a framework for the design of feedback controllers that combines the optimization-driven and model-free advantages of deep reinforcement learning with the stability guarantees provided by using the Youla-Kucera parameterization to define the search domain. Recent advances in behavioral systems allow us to construct a data-driven internal model; this enables an alternative realization of…

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: IFAC World Congress 2023

    Journal ref: IFAC-PapersOnLine 2023

  5. arXiv:2209.09301

    cs.LG cs.AI eess.SY

    Meta-Reinforcement Learning for Adaptive Control of Second Order Systems

    Authors: Daniel G. McClement, Nathan P. Lawrence, Michael G. Forbes, Philip D. Loewen, Johan U. Backström, R. Bhushan Gopaluni

    Abstract: Meta-learning is a branch of machine learning which aims to synthesize data from a distribution of related tasks to efficiently solve new ones. In process control, many systems have similar and well-understood dynamics, which suggests it is feasible to create a generalizable controller through meta-learning. In this work, we formulate a meta reinforcement learning (meta-RL) control strategy that t…

    Submitted 19 September, 2022; originally announced September 2022.

    Comments: AdCONIP 2022. arXiv admin note: substantial text overlap with arXiv:2203.09661

  6. Meta-Reinforcement Learning for the Tuning of PI Controllers: An Offline Approach

    Authors: Daniel G. McClement, Nathan P. Lawrence, Johan U. Backstrom, Philip D. Loewen, Michael G. Forbes, R. Bhushan Gopaluni

    Abstract: Meta-learning is a branch of machine learning which trains neural network models to synthesize a wide variety of data in order to rapidly solve new problems. In process control, many systems have similar and well-understood dynamics, which suggests it is feasible to create a generalizable controller through meta-learning. In this work, we formulate a meta reinforcement learning (meta-RL) control s…

    Submitted 19 September, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: 23 pages; postprint

    Journal ref: Journal of Process Control 2022

  7. Deep Reinforcement Learning with Shallow Controllers: An Experimental Application to PID Tuning

    Authors: Nathan P. Lawrence, Michael G. Forbes, Philip D. Loewen, Daniel G. McClement, Johan U. Backstrom, R. Bhushan Gopaluni

    Abstract: Deep reinforcement learning (RL) is an optimization-driven framework for producing control strategies for general dynamical systems without explicit reliance on process models. Good results have been reported in simulation. Here we demonstrate the challenges in implementing a state of the art deep RL algorithm on a real physical system. Aspects include the interplay between software and existing h…

    Submitted 13 November, 2021; originally announced November 2021.

    Comments: 37 pages; pre-print

    Journal ref: Control Engineering Practice 2022

  8. arXiv:2103.14722

    cs.LG eess.SY math.OC

    Almost Surely Stable Deep Dynamics

    Authors: Nathan P. Lawrence, Philip D. Loewen, Michael G. Forbes, Johan U. Backström, R. Bhushan Gopaluni

    Abstract: We introduce a method for learning provably stable deep neural network based dynamic models from observed data. Specifically, we consider discrete-time stochastic dynamic models, as they are of particular interest in practical applications such as estimation and control. However, these aspects exacerbate the challenge of guaranteeing stability. Our method works by embedding a Lyapunov neural netwo…

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: NeurIPS 2020; Spotlight Paper

    Journal ref: Advances in Neural Information Processing Systems, volume 33, pages 18942–18953, 2020

  9. arXiv:2005.04539

    math.OC cs.LG eess.SY

    Optimal PID and Antiwindup Control Design as a Reinforcement Learning Problem

    Authors: Nathan P. Lawrence, Gregory E. Stewart, Philip D. Loewen, Michael G. Forbes, Johan U. Backstrom, R. Bhushan Gopaluni

    Abstract: Deep reinforcement learning (DRL) has seen several successful applications to process control. Common methods rely on a deep neural network structure to model the controller or process. With increasingly complicated control structures, the closed-loop stability of such methods becomes less clear. In this work, we focus on the interpretability of DRL control methods. In particular, we view linear f…

    Submitted 9 May, 2020; originally announced May 2020.

    Comments: IFAC World Congress 2020

  10. arXiv:2005.04537

    math.OC cs.LG eess.SY

    Reinforcement Learning based Design of Linear Fixed Structure Controllers

    Authors: Nathan P. Lawrence, Gregory E. Stewart, Philip D. Loewen, Michael G. Forbes, Johan U. Backstrom, R. Bhushan Gopaluni

    Abstract: Reinforcement learning has been successfully applied to the problem of tuning PID controllers in several applications. The existing methods often utilize function approximation, such as neural networks, to update the controller parameters at each time-step of the underlying process. In this work, we present a simple finite-difference approach, based on random search, to tuning linear fixed-structu…

    Submitted 9 May, 2020; originally announced May 2020.

    Comments: IFAC World Congress 2020