-
Data-Driven Model Reduction and Nonlinear Model Predictive Control of an Air Separation Unit by Applied Koopman Theory
Authors:
Jan C. Schulze,
Danimir T. Doncevic,
Nils Erwes,
Alexander Mitsos
Abstract:
Achieving real-time capability is an essential prerequisite for the industrial implementation of nonlinear model predictive control (NMPC). Data-driven model reduction offers a way to obtain low-order control models from complex digital twins. In particular, data-driven approaches require little expert knowledge of the particular process and its model, and provide reduced models of a well-defined…
▽ More
Achieving real-time capability is an essential prerequisite for the industrial implementation of nonlinear model predictive control (NMPC). Data-driven model reduction offers a way to obtain low-order control models from complex digital twins. In particular, data-driven approaches require little expert knowledge of the particular process and its model, and provide reduced models of a well-defined generic structure. Herein, we apply our recently proposed data-driven reduction strategy based on Koopman theory [Schulze et al. (2022), Comput. Chem. Eng.] to generate a low-order control model of an air separation unit (ASU). The reduced Koopman model combines autoencoders and linear latent dynamics and is constructed using machine learning. Further, we present an NMPC implementation that uses derivative computation tailored to the fixed block structure of reduced Koopman models. Our reduction approach with tailored NMPC implementation enables real-time NMPC of an ASU at an average CPU time decrease by 98 %.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
A Recursively Recurrent Neural Network (R2N2) Architecture for Learning Iterative Algorithms
Authors:
Danimir T. Doncevic,
Alexander Mitsos,
Yue Guo,
Qianxiao Li,
Felix Dietrich,
Manuel Dahmen,
Ioannis G. Kevrekidis
Abstract:
Meta-learning of numerical algorithms for a given task consists of the data-driven identification and adaptation of an algorithmic structure and the associated hyperparameters. To limit the complexity of the meta-learning problem, neural architectures with a certain inductive bias towards favorable algorithmic structures can, and should, be used. We generalize our previously introduced Runge-Kutta…
▽ More
Meta-learning of numerical algorithms for a given task consists of the data-driven identification and adaptation of an algorithmic structure and the associated hyperparameters. To limit the complexity of the meta-learning problem, neural architectures with a certain inductive bias towards favorable algorithmic structures can, and should, be used. We generalize our previously introduced Runge-Kutta neural network to a recursively recurrent neural network (R2N2) superstructure for the design of customized iterative algorithms. In contrast to off-the-shelf deep learning approaches, it features a distinct division into modules for generation of information and for the subsequent assembly of this information towards a solution. Local information in the form of a subspace is generated by subordinate, inner, iterations of recurrent function evaluations starting at the current outer iterate. The update to the next outer iterate is computed as a linear combination of these evaluations, reducing the residual in this space, and constitutes the output of the network. We demonstrate that regular training of the weight parameters inside the proposed superstructure on input/output data of various computational problem classes yields iterations similar to Krylov solvers for linear equation systems, Newton-Krylov solvers for nonlinear equation systems, and Runge-Kutta integrators for ordinary differential equations. Due to its modularity, the superstructure can be readily extended with functionalities needed to represent more general classes of iterative algorithms traditionally based on Taylor series expansions.
△ Less
Submitted 6 July, 2023; v1 submitted 22 November, 2022;
originally announced November 2022.
-
Hearts Gym: Learning Reinforcement Learning as a Team Event
Authors:
Jan Ebert,
Danimir T. Doncevic,
Ramona Kloß,
Stefan Kesselheim
Abstract:
Amidst the COVID-19 pandemic, the authors of this paper organized a Reinforcement Learning (RL) course for a graduate school in the field of data science. We describe the strategy and materials for creating an exciting learning experience despite the ubiquitous Zoom fatigue and evaluate the course qualitatively. The key organizational features are a focus on a competitive hands-on setting in teams…
▽ More
Amidst the COVID-19 pandemic, the authors of this paper organized a Reinforcement Learning (RL) course for a graduate school in the field of data science. We describe the strategy and materials for creating an exciting learning experience despite the ubiquitous Zoom fatigue and evaluate the course qualitatively. The key organizational features are a focus on a competitive hands-on setting in teams, supported by a minimum of lectures providing the essential background on RL. The practical part of the course revolved around Hearts Gym, an RL environment for the card game Hearts that we developed as an entry-level tutorial to RL. Participants were tasked with training agents to explore reward sha** and other RL hyperparameters. For a final evaluation, the agents of the participants competed against each other.
△ Less
Submitted 7 September, 2022;
originally announced September 2022.
-
Identification of MIMO Wiener-type Koopman Models for Data-Driven Model Reduction using Deep Learning
Authors:
Jan C. Schulze,
Danimir T. Doncevic,
Alexander Mitsos
Abstract:
We use Koopman theory to develop a data-driven nonlinear model reduction and identification strategy for multiple-input multiple-output (MIMO) input-affine dynamical systems. While the present literature has focused on linear and bilinear Koopman models, we derive and use a Wiener-type Koopman formulation. We discuss that the Wiener structure is particularly suitable for model reduction, and can b…
▽ More
We use Koopman theory to develop a data-driven nonlinear model reduction and identification strategy for multiple-input multiple-output (MIMO) input-affine dynamical systems. While the present literature has focused on linear and bilinear Koopman models, we derive and use a Wiener-type Koopman formulation. We discuss that the Wiener structure is particularly suitable for model reduction, and can be naturally derived from Koopman theory. Moreover, the Wiener block-structure unifies the mathematical simplicity of linear dynamical blocks and the accuracy of bilinear dynamics. We present a Koopman deep-learning strategy combining autoencoders and linear dynamics that generates low-order surrogate models of MIMO Wiener type. In three case studies, we apply our framework for identification and reduction of a system with input multiplicity, a chemical reactor and a high-purity distillation column. We compare the prediction performance of the identified Wiener models to linear and bilinear Koopman models. We observe the highest accuracy and strongest model reduction capabilities of low-order Wiener-type Koopman models, making them promising for control.
△ Less
Submitted 4 April, 2022; v1 submitted 29 January, 2022;
originally announced January 2022.
-
Personalized Algorithm Generation: A Case Study in Learning ODE Integrators
Authors:
Yue Guo,
Felix Dietrich,
Tom Bertalan,
Danimir T. Doncevic,
Manuel Dahmen,
Ioannis G. Kevrekidis,
Qianxiao Li
Abstract:
We study the learning of numerical algorithms for scientific computing, which combines mathematically driven, handcrafted design of general algorithm structure with a data-driven adaptation to specific classes of tasks. This represents a departure from the classical approaches in numerical analysis, which typically do not feature such learning-based adaptations. As a case study, we develop a machi…
▽ More
We study the learning of numerical algorithms for scientific computing, which combines mathematically driven, handcrafted design of general algorithm structure with a data-driven adaptation to specific classes of tasks. This represents a departure from the classical approaches in numerical analysis, which typically do not feature such learning-based adaptations. As a case study, we develop a machine learning approach that automatically learns effective solvers for initial value problems in the form of ordinary differential equations (ODEs), based on the Runge-Kutta (RK) integrator architecture. We show that we can learn high-order integrators for targeted families of differential equations without the need for computing integrator coefficients by hand. Moreover, we demonstrate that in certain cases we can obtain superior performance to classical RK methods. This can be attributed to certain properties of the ODE families being identified and exploited by the approach. Overall, this work demonstrates an effective learning-based approach to the design of algorithms for the numerical solution of differential equations. This can be readily extended to other numerical tasks.
△ Less
Submitted 9 July, 2022; v1 submitted 4 May, 2021;
originally announced May 2021.