Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive Control

Authors: Minjun Sung, Sambhu H. Karumanchi, Aditya Gahlawat, Naira Hovakimyan

Abstract: We introduce $\mathcal{L}_1$-MBRL, a control-theoretic augmentation scheme for Model-Based Reinforcement Learning (MBRL) algorithms. Unlike model-free approaches, MBRL algorithms learn a model of the transition function using data and use it to design a control input. Our approach generates a series of approximate control-affine models of the learned transition function according to the proposed switching law. Using the approximate model, control input produced by the underlying MBRL is perturbed by the $\mathcal{L}_1$ adaptive control, which is designed to enhance the robustness of the system against uncertainties. Importantly, this approach is agnostic to the choice of MBRL algorithm, enabling the use of the scheme with various MBRL algorithms. MBRL algorithms with $\mathcal{L}_1$ augmentation exhibit enhanced performance and sample efficiency across multiple MuJoCo environments, outperforming the original MBRL algorithms, both with and without system noise.

Submitted 21 March, 2024; originally announced March 2024.

arXiv:2302.07208 [pdf, other]

$\mathcal{L}_1$Quad: $\mathcal{L}_1$ Adaptive Augmentation of Geometric Control for Agile Quadrotors with Performance Guarantees

Authors: Zhuohuan Wu, Sheng Cheng, Pan Zhao, Aditya Gahlawat, Kasey A. Ackerman, Arun Lakshmanan, Chengyu Yang, Jiahao Yu, Naira Hovakimyan

Abstract: Quadrotors that can operate safely in the presence of imperfect model knowledge and external disturbances are crucial in safety-critical applications. We present L1Quad, a control architecture for quadrotors based on the L1 adaptive control. L1Quad enables safe tubes centered around a desired trajectory that the quadrotor is always guaranteed to remain inside. Our design applies to both the rotational and the translational dynamics of the quadrotor. We lump various types of uncertainties and disturbances as unknown nonlinear (time- and state-dependent) forces and moments. Without assuming or enforcing parametric structures, L1Quad can accurately estimate and compensate for these unknown forces and moments. Extensive experimental results demonstrate that L1Quad is able to significantly outperform baseline controllers under a variety of uncertainties with consistently small tracking errors.

Submitted 14 February, 2023; originally announced February 2023.

Comments: The first two authors contributed equally to this work

arXiv:2112.08222 [pdf, other]

Guaranteed Nonlinear Tracking in the Presence of DNN-Learned Dynamics With Contraction Metrics and Disturbance Estimation

Authors: Pan Zhao, Ziyao Guo, Aditya Gahlawat, Hyungsoo Kang, Naira Hovakimyan

Abstract: This paper presents an approach to trajectory-centric learning control based on contraction metrics and disturbance estimation for nonlinear systems subject to matched uncertainties. The approach uses deep neural networks to learn uncertain dynamics while still providing guarantees of transient tracking performance throughout the learning phase. Within the proposed approach, a disturbance estimation law is adopted to estimate the pointwise value of the uncertainty, with pre-computable estimation error bounds (EEBs). The learned dynamics, the estimated disturbances, and the EEBs are then incorporated in a robust Riemann energy condition to compute the control law that guarantees exponential convergence of actual trajectories to desired ones throughout the learning phase, even when the learned model is poor. On the other hand, with improved accuracy, the learned model can help improve the robustness of the tracking controller, e.g., against input delays, and can be incorporated to plan better trajectories with improved performance, e.g., lower energy consumption and shorter travel time. The proposed framework is validated on a planar quadrotor example.

Submitted 12 October, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

Comments: Shorter version submitted to ACC 2023

arXiv:2109.09909 [pdf, other]

Generalization of Safe Optimal Control Actions on Networked Multi-Agent Systems

Authors: Lin Song, Neng Wan, Aditya Gahlawat, Chuyuan Tao, Naira Hovakimyan, Evangelos A. Theodorou

Abstract: We propose a unified framework to fast generate a safe optimal control action for a new task from existing controllers on Multi-Agent Systems (MASs). The control action composition is achieved by taking a weighted mixture of the existing controllers according to the contribution of each component task. Instead of sophisticatedly tuning the cost parameters and other hyper-parameters for safe and reliable behavior in the optimal control framework, the safety of each single task solution is guaranteed using the control barrier functions (CBFs) for high-degree stochastic systems, which constrains the system state within a known safe operation region where it originates from. Linearity of CBF constraints in control enables the control action composition. The discussed framework can immediately provide reliable solutions to new tasks by taking a weighted mixture of solved component-task actions and filtering on some CBF constraints, instead of performing an extensive sampling to achieve a new controller. Our results are verified and demonstrated on both a single UAV and two cooperative UAV teams in an environment with obstacles.

Submitted 20 September, 2021; originally announced September 2021.

Comments: 10 pages, 9 figures

arXiv:2109.06998 [pdf, other]

doi 10.1109/ICRA46639.2022.9811946

$\mathcal{L}_1$ Adaptive Augmentation for Geometric Tracking Control of Quadrotors

Authors: Zhuohuan Wu, Sheng Cheng, Kasey A. Ackerman, Aditya Gahlawat, Arun Lakshmanan, Pan Zhao, Naira Hovakimyan

Abstract: This paper introduces an $\mathcal{L}_1$ adaptive control augmentation for geometric tracking control of quadrotors. In the proposed design, the $\mathcal{L}_1$ augmentation handles nonlinear (time- and state-dependent) uncertainties in the quadrotor dynamics without assuming or enforcing parametric structures, while the baseline geometric controller achieves stabilization of the known nonlinear model of the system dynamics. The $\mathcal{L}_1$ augmentation applies to both the rotational and the translational dynamics. Experimental results demonstrate that the augmented geometric controller shows consistent and (on average five times) smaller trajectory tracking errors compared with the geometric controller alone when tested for different trajectories and under various types of uncertainties/disturbances.

Submitted 2 March, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

Comments: accepted by ICRA 2022

arXiv:2109.04453 [pdf, other]

doi 10.1109/LRA.2022.3153712

Tube-Certified Trajectory Tracking for Nonlinear Systems With Robust Control Contraction Metrics

Authors: Pan Zhao, Arun Lakshmanan, Kasey Ackerman, Aditya Gahlawat, Marco Pavone, Naira Hovakimyan

Abstract: This paper presents an approach towards guaranteed trajectory tracking for nonlinear control-affine systems subject to external disturbances based on robust control contraction metrics (CCM) that aims to minimize the $\mathcal L_\infty$ gain from the disturbances to nominal-actual trajectory deviations. The guarantee is in the form of invariant tubes, computed offline and valid for any nominal trajectories, in which the actual states and inputs of the system are guaranteed to stay despite disturbances. Under mild assumptions, we prove that the proposed robust CCM (RCCM) approach yields tighter tubes than an existing approach based on CCM and input-to-state stability analysis. We show how the RCCM-based tracking controller together with tubes can be incorporated into a feedback motion planning framework to plan safe trajectories for robotic systems. Simulation results illustrate the effectiveness of the proposed method and empirically demonstrate reduced conservatism compared to the CCM-based approach.

Submitted 6 July, 2023; v1 submitted 9 September, 2021; originally announced September 2021.

Comments: Extended version of a paper published in IEEE Robotics and Automation Letters (2022). 13 pages, 6 figures

arXiv:2103.07519 [pdf, other]

Safe Sampling-Based Air-Ground Rendezvous Algorithm for Complex Urban Environments

Authors: Gabriel Barsi Haberfeld, Aditya Gahlawat, Naira Hovakimyan

Abstract: Demand for fast and economical parcel deliveries in urban environments has risen considerably in recent years. A framework envisions efficient last-mile delivery in urban environments by leveraging a network of ride-sharing vehicles, where Unmanned Aerial Systems (UASs) drop packages on said vehicles, which then cover the majority of the distance before final aerial delivery. Notably, we consider the problem of planning a rendezvous path for the UAS to reach a human driver, who may choose between N possible paths and has uncertain behavior, while meeting strict safety constraints. The long planning horizon and safety constraints require robust heuristics that combine learning and optimal control using Gaussian Process Regression, sampling-based optimization, and Model Predictive Control. The resulting algorithm is computationally efficient and shown to be effective in a variety of qualitative scenarios.

Submitted 12 March, 2021; originally announced March 2021.

Comments: 10 pages, 12 figures. arXiv admin note: text overlap with arXiv:2002.05749

arXiv:2102.09104 [pdf, other]

Distributed Algorithms for Linearly-Solvable Optimal Control in Networked Multi-Agent Systems

Authors: Neng Wan, Aditya Gahlawat, Naira Hovakimyan, Evangelos A. Theodorou, Petros G. Voulgaris

Abstract: Distributed algorithms for both discrete-time and continuous-time linearly solvable optimal control (LSOC) problems of networked multi-agent systems (MASs) are investigated in this paper. A distributed framework is proposed to partition the optimal control problem of a networked MAS into several local optimal control problems in factorial subsystems, such that each (central) agent behaves optimally to minimize the joint cost function of a subsystem that comprises a central agent and its neighboring agents, and the local control actions (policies) only rely on the knowledge of local observations. Under this framework, we not only preserve the correlations between neighboring agents, but moderate the communication and computational complexities by decentralizing the sampling and computational processes over the network. For discrete-time systems modeled by Markov decision processes, the joint Bellman equation of each subsystem is transformed into a system of linear equations and solved using parallel programming. For continuous-time systems modeled by Itô diffusion processes, the joint optimality equation of each subsystem is converted into a linear partial differential equation, whose solution is approximated by a path integral formulation and a sample-efficient relative entropy policy search algorithm, respectively. The learned control policies are generalized to solve the unlearned tasks by resorting to the compositionality principle, and illustrative examples of cooperative UAV teams are provided to verify the effectiveness and advantages of these algorithms.

Submitted 17 February, 2021; originally announced February 2021.

arXiv:2009.14775 [pdf, other]

Cooperative Path Integral Control for Stochastic Multi-Agent Systems

Authors: Neng Wan, Aditya Gahlawat, Naira Hovakimyan, Evangelos A. Theodorou, Petros G. Voulgaris

Abstract: A distributed stochastic optimal control solution is presented for cooperative multi-agent systems. The network of agents is partitioned into multiple factorial subsystems, each of which consists of a central agent and neighboring agents. Local control actions that rely only on agents' local observations are designed to optimize the joint cost functions of subsystems. When solving for the local control actions, the joint optimality equation for each subsystem is cast as a linear partial differential equation and solved using the Feynman-Kac formula. The solution and the optimal control action are then formulated as path integrals and approximated by a Monte-Carlo method. Numerical verification is provided through a simulation example consisting of a team of cooperative UAVs.

Submitted 20 March, 2021; v1 submitted 30 September, 2020; originally announced September 2020.

Comments: To appear in American Control Conference 2021, New Orleans, LA, USA

arXiv:2009.13609 [pdf, other]

Compositionality of Linearly Solvable Optimal Control in Networked Multi-Agent Systems

Authors: Lin Song, Neng Wan, Aditya Gahlawat, Naira Hovakimyan, Evangelos A. Theodorou

Abstract: In this paper, we discuss the methodology of generalizing the optimal control law from learned component tasks to unlearned composite tasks on Multi-Agent Systems (MASs), by using the linearity composition principle of linearly solvable optimal control (LSOC) problems. The proposed approach achieves both the compositionality and optimality of control actions simultaneously within the cooperative MAS framework in both discrete- and continuous-time in a sample-efficient manner, which reduces the burden of re-computation of the optimal control solutions for the new task on the MASs. We investigate the application of the proposed approach on the MAS with coordination between agents. The experiments show feasible results in investigated scenarios, including both discrete and continuous dynamical systems for task generalization without resampling.

Submitted 22 March, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

Comments: Accepted to the 2021 American Control Conference (ACC)

arXiv:2009.03864 [pdf, other]

Contraction $\mathcal{L}_1$-Adaptive Control using Gaussian Processes

Authors: Aditya Gahlawat, Arun Lakshmanan, Lin Song, Andrew Patterson, Zhuohuan Wu, Naira Hovakimyan, Evangelos Theodorou

Abstract: We present $\mathcal{CL}_1$-$\mathcal{GP}$, a control framework that enables safe simultaneous learning and control for systems subject to uncertainties. The two main constituents are contraction theory-based $\mathcal{L}_1$ ($\mathcal{CL}_1$) control and Bayesian learning in the form of Gaussian process (GP) regression. The $\mathcal{CL}_1$ controller ensures that control objectives are met while providing safety certificates. Furthermore, $\mathcal{CL}_1$-$\mathcal{GP}$ incorporates any available data into a GP model of uncertainties, which improves performance and enables the motion planner to achieve optimality safely. This way, the safe operation of the system is always guaranteed, even during the learning transients. We provide a few illustrative examples for the safe learning and control of planar quadrotor systems in a variety of environments.

Submitted 30 November, 2021; v1 submitted 8 September, 2020; originally announced September 2020.

Comments: Submitted to Learning for Dynamics and Control (L4DC) Conference, 2021

arXiv:2004.14594 [pdf, ps, other]

$\mathcal{L}_1$-$\mathcal{GP}$: $\mathcal{L}_1$ Adaptive Control with Bayesian Learning

Authors: Aditya Gahlawat, Pan Zhao, Andrew Patterson, Naira Hovakimyan, Evangelos A. Theodorou

Abstract: We present $\mathcal{L}_1$-$\mathcal{GP}$, an architecture based on $\mathcal{L}_1$ adaptive control and Gaussian Process Regression (GPR) for safe simultaneous control and learning. On one hand, the $\mathcal{L}_1$ adaptive control provides stability and transient performance guarantees, which allows for GPR to efficiently and safely learn the uncertain dynamics. On the other hand, the learned dynamics can be conveniently incorporated into the $\mathcal{L}_1$ control architecture without sacrificing robustness and tracking performance. Subsequently, the learned dynamics can lead to less conservative designs for performance/robustness tradeoff. We illustrate the efficacy of the proposed architecture via numerical simulations.

Submitted 30 April, 2020; originally announced April 2020.

arXiv:2004.01142 [pdf, other]

Safe Feedback Motion Planning: A Contraction Theory and $\mathcal{L}_1$-Adaptive Control Based Approach

Authors: Arun Lakshmanan, Aditya Gahlawat, Naira Hovakimyan

Abstract: Autonomous robots that are capable of operating safely in the presence of imperfect model knowledge or external disturbances are vital in safety-critical applications. In this paper, we present a planner-agnostic framework to design and certify safe tubes around desired trajectories that the robot is always guaranteed to remain inside of. By leveraging recent results in contraction analysis and $\mathcal{L}_1$-adaptive control we synthesize an architecture that induces safe tubes for nonlinear systems with state and time-varying uncertainties. We demonstrate with a few illustrative examples how contraction theory-based $\mathcal{L}_1$-adaptive control can be used in conjunction with traditional motion planning algorithms to obtain provably safe trajectories.

Submitted 25 May, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

Comments: Submitted to the Conference on Decision and Control (CDC) 2020

arXiv:2002.05749 [pdf, other]

Risk Sensitive Rendezvous Algorithm for Heterogeneous Agents in Urban Environments

Authors: Gabriel Barsi Haberfeld, Aditya Gahlawat, Naira Hovakimyan

Abstract: Demand for fast and inexpensive parcel deliveries in urban environments has risen considerably in recent years. A framework is envisioned to enforce efficient last mile delivery in urban environments by leveraging a network of ride-sharing vehicles, where Unmanned Aerial Systems (UASs) drop packages on said vehicles which then cover the majority of the distance to finally be picked up by another UAS for delivery. This approach presents many engineering challenges, including the safe rendezvous of both agents: the UAS and the human-operated ground vehicle. In this paper, we introduce a framework to minimize the risk of failure, while allowing for optimal usage of the controlled agent. We formulate a compact fast planner to drive a UAS to a passive ground vehicle with inexact behavior, while providing intuitive and meaningful procedures to guarantee safety with minimal sacrifice of optimality. The resulting algorithm is shown to be fast and implementable in real-time via numerical tests.

Submitted 17 February, 2021; v1 submitted 13 February, 2020; originally announced February 2020.

Comments: Full version of the same-titled paper accepted to ACC 2021

arXiv:2002.01965 [pdf, other]

Learning Probabilistic Intersection Traffic Models for Trajectory Prediction

Authors: Andrew Patterson, Aditya Gahlawat, Naira Hovakimyan

Abstract: Autonomous agents must be able to safely interact with other vehicles to integrate into urban environments. The safety of these agents is dependent on their ability to predict collisions with other vehicles' future trajectories for replanning and collision avoidance. The information needed to predict collisions can be learned from previously observed vehicle trajectories in a specific environment, generating a traffic model. The learned traffic model can then be incorporated as prior knowledge into any trajectory estimation method being used in this environment. This work presents a Gaussian process based probabilistic traffic model that is used to quantify vehicle behaviors in an intersection. The Gaussian process model provides estimates for the average vehicle trajectory, while also capturing the variance between the different paths a vehicle may take in the intersection. The method is demonstrated on a set of time-series position trajectories. These trajectories are reconstructed by removing object recognition errors and missed frames that may occur due to data source processing. To create the intersection traffic model, the reconstructed trajectories are clustered based on their source and destination lanes. For each cluster, a Gaussian process model is created to capture the average behavior and the variance of the cluster. To show the applicability of the Gaussian model, the test trajectories are classified with only partial observations. Performance is quantified by the number of observations required to correctly classify the vehicle trajectory. Both the intersection traffic modeling computations and the classification procedure are timed. These times are presented as results and demonstrate that the model can be constructed in a reasonable amount of time and the classification procedure can be used for online applications.

Submitted 5 February, 2020; originally announced February 2020.

arXiv:1812.05256 [pdf, other]

Learning to Communicate: A Machine Learning Framework for Heterogeneous Multi-Agent Robotic Systems

Authors: Hyung-Jin Yoon, Huaiyu Chen, Kehan Long, Heling Zhang, Aditya Gahlawat, Donghwan Lee, Naira Hovakimyan

Abstract: We present a machine learning framework for multi-agent systems to learn both the optimal policy for maximizing the rewards and the encoding of the high dimensional visual observation. The encoding is useful for sharing local visual observations with other agents under communication resource constraints. The actor-encoder encodes the raw images and chooses an action based on local observations and messages sent by the other agents. The machine learning agent generates not only an actuator command to the physical device, but also a communication message to the other agents. We formulate a reinforcement learning problem, which extends the action space to consider the communication action as well. The feasibility of the reinforcement learning framework is demonstrated using a 3D simulation environment with two collaborating agents. The environment provides realistic visual observations to be used and shared between the two agents.

Submitted 12 December, 2018; originally announced December 2018.

Comments: AIAA SciTech 2019

arXiv:1703.06371 [pdf, ps, other]

A Semi-Definite Programming Approach to Stability Analysis of Linear Partial Differential Equations

Authors: Aditya Gahlawat, Giorgio Valmorbida

Abstract: We consider the stability analysis of a large class of linear 1-D PDEs with polynomial data. This class of PDEs contains, as examples, parabolic and hyperbolic PDEs, PDEs with boundary feedback and systems of in-domain/boundary coupled PDEs. Our approach is Lyapunov based which allows us to reduce the stability problem to the verification of integral inequalities on the subspaces of Hilbert spaces. Then, using fundamental theorem of calculus and Green's theorem, we construct a polynomial problem to verify the integral inequalities. Constraining the solution of the polynomial problem to belong to the set of sum-of-squares polynomials subject to affine constraints allows us to use semi-definite programming to algorithmically construct Lyapunov certificates of stability for the systems under consideration. We also provide numerical results of the application of the proposed method on different types of PDEs.

Submitted 16 September, 2017; v1 submitted 18 March, 2017; originally announced March 2017.

arXiv:1507.05888 [pdf, ps, other]

A Convex Sum-of-Squares Approach to Analysis, State Feedback and Output Feedback Control of Parabolic PDEs

Authors: Aditya Gahlawat, Matthew M. Peet

Abstract: We present an optimization-based framework for analysis and control of linear parabolic partial differential equations (PDEs) with spatially varying coefficients without discretization or numerical approximation. For controller synthesis, we consider both full-state feedback and point observation (output feedback). The input occurs at the boundary (point actuation). We use positive matrices to parameterize positive Lyapunov functions and polynomials to parameterize controller and observer gains. We use duality and an invertible state-variable transformation to convexify the controller synthesis problem. Finally, we combine our synthesis condition with the Luenberger observer framework to express the output feedback controller synthesis problem as a set of LMI/SDP constraints. We perform an extensive set of numerical experiments to demonstrate accuracy of the conditions and to prove necessity of the Lyapunov structures chosen. We provide numerical and analytical comparisons with alternative approaches to control including Sturm Liouville theory and backstepping. Finally we use numerical tests to show that the method retains its accuracy for alternative boundary conditions.

Submitted 2 September, 2016; v1 submitted 21 July, 2015; originally announced July 2015.

Comments: arXiv admin note: text overlap with arXiv:1408.5206

arXiv:1503.06982 [pdf, ps, other]

Output Feedback Control of Inhomogeneous Parabolic PDEs with Point Actuation and Point Measurement using SOS and Semi-Separable Kernels

Authors: Aditya Gahlawat, Matthew M. Peet

Abstract: In this paper we use SOS and SDP to design output feedback controllers for a class of one-dimensional parabolic partial differential equations with point measurements and point actuation. Our approach is based on the use of SOS to search for positive quadratic Lyapunov functions, controllers and observers. These Lyapunov functions, controllers and observers are parameterized by linear operators which are defined by SOS polynomials. The main result of the paper is the development of an improved class of observer-based controllers and evidence which indicates that when the system is controllable and observable, these methods will find an observer-based controller for sufficiently high polynomial degree (similar to well-known results from backstepping).

Submitted 2 April, 2015; v1 submitted 24 March, 2015; originally announced March 2015.

arXiv:1408.5206 [pdf, ps, other]

A Convex Approach to Output Feedback Control of Parabolic PDEs Using Sum-of-Squares

Authors: Aditya Gahlawat, Matthew M. Peet

Abstract: In this paper we use optimization-based methods to design output-feedback controllers for a class of one-dimensional parabolic partial differential equations. The output may be distributed or point-measurements. The input may be distributed or boundary actuation. We use Lyapunov operators, duality, and the Luenberger observer framework to reformulate the synthesis problem as a convex optimization problem expressed as a set of Linear-Operator-Inequalities (LOIs). We then show how feasibility of these LOIs may be tested using Semidefinite Programming (SDP) and the Sum-of-Squares methodology.

Submitted 22 August, 2014; originally announced August 2014.

Showing 1–20 of 20 results for author: Gahlawat, A