-
A Sequential Quadratic Programming Approach to the Solution of Open-Loop Generalized Nash Equilibria for Autonomous Racing
Authors:
Edward L. Zhu,
Francesco Borrelli
Abstract:
Dynamic games can be an effective approach for modeling interactive behavior between multiple competitive agents in autonomous racing and they provide a theoretical framework for simultaneous prediction and control in such scenarios. In this work, we propose DG-SQP, a numerical method for the solution of local generalized Nash equilibria (GNE) for open-loop general-sum dynamic games for agents wit…
▽ More
Dynamic games can be an effective approach for modeling interactive behavior between multiple competitive agents in autonomous racing and they provide a theoretical framework for simultaneous prediction and control in such scenarios. In this work, we propose DG-SQP, a numerical method for the solution of local generalized Nash equilibria (GNE) for open-loop general-sum dynamic games for agents with nonlinear dynamics and constraints. In particular, we formulate a sequential quadratic programming (SQP) approach which requires only the solution of a single convex quadratic program at each iteration. The three key elements of the method are a non-monotonic line search for solving the associated KKT equations, a merit function to handle zero sum costs, and a decaying regularization scheme for SQP step selection. We show that our method achieves linear convergence in the neighborhood of local GNE and demonstrate the effectiveness of the approach in the context of head-to-head car racing, where we show significant improvement in solver success rate when comparing against the state-of-the-art PATH solver for dynamic games. An implementation of our solver can be found at https://github.com/zhu-edward/DGSQP.
△ Less
Submitted 29 March, 2024;
originally announced April 2024.
-
Learning Model Predictive Control with Error Dynamics Regression for Autonomous Racing
Authors:
Haoru Xue,
Edward L. Zhu,
John M. Dolan,
Francesco Borrelli
Abstract:
This work presents a novel Learning Model Predictive Control (LMPC) strategy for autonomous racing at the handling limit that can iteratively explore and learn unknown dynamics in high-speed operational domains. We start from existing LMPC formulations and modify the system dynamics learning method. In particular, our approach uses a nominal, global, nonlinear, physics-based model with a local, li…
▽ More
This work presents a novel Learning Model Predictive Control (LMPC) strategy for autonomous racing at the handling limit that can iteratively explore and learn unknown dynamics in high-speed operational domains. We start from existing LMPC formulations and modify the system dynamics learning method. In particular, our approach uses a nominal, global, nonlinear, physics-based model with a local, linear, data-driven learning of the error dynamics. We conducted experiments in simulation and on 1/10th scale hardware, and deployed the proposed LMPC on a full-scale autonomous race car used in the Indy Autonomous Challenge (IAC) with closed loop experiments at the Putnam Park Road Course in Indiana, USA. The results show that the proposed control policy exhibits improved robustness to parameter tuning and data scarcity. Incremental and safety-aware exploration toward the limit of handling and iterative learning of the vehicle dynamics in high-speed domains is observed both in simulations and experiments.
△ Less
Submitted 7 March, 2024; v1 submitted 19 September, 2023;
originally announced September 2023.
-
A Gaussian Process Model for Opponent Prediction in Autonomous Racing
Authors:
Edward L. Zhu,
Finn Lukas Busch,
Jake Johnson,
Francesco Borrelli
Abstract:
In head-to-head racing, an accurate model of interactive behavior of the opposing target vehicle (TV) is required to perform tightly constrained, but highly rewarding maneuvers such as overtaking. However, such information is not typically made available in competitive scenarios, we therefore propose to construct a prediction and uncertainty model given data of the TV from previous races. In parti…
▽ More
In head-to-head racing, an accurate model of interactive behavior of the opposing target vehicle (TV) is required to perform tightly constrained, but highly rewarding maneuvers such as overtaking. However, such information is not typically made available in competitive scenarios, we therefore propose to construct a prediction and uncertainty model given data of the TV from previous races. In particular, a one-step Gaussian process (GP) model is trained on closed-loop interaction data to learn the behavior of a TV driven by an unknown policy. Predictions of the nominal trajectory and associated uncertainty are rolled out via a sampling-based approach and are used in a model predictive control (MPC) policy for the ego vehicle in order to intelligently trade-off between safety and performance when attempting overtaking maneuvers against a TV. We demonstrate the GP-based predictor in closed loop with the MPC policy in simulation races and compare its performance against several predictors from literature. In a Monte Carlo study, we observe that the GP-based predictor achieves similar win rates while maintaining safety in up to 3x more races. We finally demonstrate the prediction and control framework in real-time in a experimental study on a 1/10th scale racecar platform operating at speeds of around 2.8 m/s, and show a significant level of improvement when using the GP-based predictor over a baseline MPC predictor. Videos of the hardware experiments can be found at https://youtu.be/KMSs4ofDfIs.
△ Less
Submitted 1 March, 2023; v1 submitted 26 April, 2022;
originally announced April 2022.
-
A Sequential Quadratic Programming Approach to the Solution of Open-Loop Generalized Nash Equilibria
Authors:
Edward L. Zhu,
Francesco Borrelli
Abstract:
Dynamic games can be an effective approach to modeling interactive behavior between multiple non-cooperative agents and they provide a theoretical framework for simultaneous prediction and control in such scenarios. In this work, we propose a numerical method for the solution of local generalized Nash equilibria (GNE) for the class of open-loop general-sum dynamic games for agents with nonlinear d…
▽ More
Dynamic games can be an effective approach to modeling interactive behavior between multiple non-cooperative agents and they provide a theoretical framework for simultaneous prediction and control in such scenarios. In this work, we propose a numerical method for the solution of local generalized Nash equilibria (GNE) for the class of open-loop general-sum dynamic games for agents with nonlinear dynamics and constraints. In particular, we formulate a sequential quadratic programming (SQP) approach which requires only the solution of a single convex quadratic program at each iteration. Central to the robustness of our approach is a non-monotonic line search method and a novel merit function for SQP step acceptance. We show that our method achieves linear convergence in the neighborhood of local GNE and we derive an update rule for the merit function which helps to improve convergence from a larger set of initial conditions. We demonstrate the effectiveness of the algorithm in the context of car racing, where we show up to 32\% improvement of success rate when comparing against a state-of-the-art solution approach for dynamic games. \url{https://github.com/zhu-edward/DGSQP}.
△ Less
Submitted 15 September, 2022; v1 submitted 30 March, 2022;
originally announced March 2022.
-
Collision Avoidance in Tightly-Constrained Environments without Coordination: a Hierarchical Control Approach
Authors:
Xu Shen,
Edward L. Zhu,
Yvonne R. Stürz,
Francesco Borrelli
Abstract:
We present a hierarchical control approach for maneuvering an autonomous vehicle (AV) in tightly-constrained environments where other moving AVs and/or human driven vehicles are present. A two-level hierarchy is proposed: a high-level data-driven strategy predictor and a lower-level model-based feedback controller. The strategy predictor maps an encoding of a dynamic environment to a set of high-l…
▽ More
We present a hierarchical control approach for maneuvering an autonomous vehicle (AV) in tightly-constrained environments where other moving AVs and/or human driven vehicles are present. A two-level hierarchy is proposed: a high-level data-driven strategy predictor and a lower-level model-based feedback controller. The strategy predictor maps an encoding of a dynamic environment to a set of high-level strategies via a neural network. Depending on the selected strategy, a set of time-varying hyperplanes in the AV's position space is generated online and the corresponding halfspace constraints are included in a lower-level model-based receding horizon controller. These strategy-dependent constraints drive the vehicle towards areas where it is likely to remain feasible. Moreover, the predicted strategy also informs switching between a discrete set of policies, which allows for more conservative behavior when prediction confidence is low. We demonstrate the effectiveness of the proposed data-driven hierarchical control framework in a two-car collision avoidance scenario through simulations and experiments on a 1/10 scale autonomous car platform where the strategy-guided approach outperforms a model predictive control baseline in both cases.
△ Less
Submitted 18 March, 2021; v1 submitted 31 October, 2020;
originally announced November 2020.
-
Distributed Learning Model Predictive Control for Linear Systems
Authors:
Yvonne R. Stürz,
Edward L. Zhu,
Ugo Rosolia,
Karl H. Johansson,
Francesco Borrelli
Abstract:
This paper presents a distributed learning model predictive control (DLMPC) scheme for distributed linear time invariant systems with coupled dynamics and state constraints. The proposed solution method is based on an online distributed optimization scheme with nearest-neighbor communication. If the control task is iterative and data from previous feasible iterations are available, local data are…
▽ More
This paper presents a distributed learning model predictive control (DLMPC) scheme for distributed linear time invariant systems with coupled dynamics and state constraints. The proposed solution method is based on an online distributed optimization scheme with nearest-neighbor communication. If the control task is iterative and data from previous feasible iterations are available, local data are exploited by the subsystems in order to construct the local terminal set and terminal cost, which guarantee recursive feasibility and asymptotic stability, as well as performance improvement over iterations. In case a first feasible trajectory is difficult to obtain, or the task is non-iterative, we further propose an algorithm that efficiently explores the state-space and generates the data required for the construction of the terminal cost and terminal constraint in the MPC problem in a safe and distributed way. In contrast to other distributed MPC schemes which use structured positive invariant sets, the proposed approach involves a control invariant set as the terminal set, on which we do not impose any distributed structure. The proposed iterative scheme converges to the global optimal solution of the underlying infinite horizon optimal control problem under mild conditions. Numerical experiments demonstrate the effectiveness of the proposed DLMPC scheme.
△ Less
Submitted 23 June, 2020;
originally announced June 2020.
-
Trajectory Optimization for Nonlinear Multi-Agent Systems using Decentralized Learning Model Predictive Control
Authors:
Edward L. Zhu,
Yvonne R. Stürz,
Ugo Rosolia,
Francesco Borrelli
Abstract:
We present a decentralized minimum-time trajectory optimization scheme based on learning model predictive control for multi-agent systems with nonlinear decoupled dynamics and coupled state constraints. By performing the same task iteratively, data from previous task executions is used to construct and improve local time-varying safe sets and an approximate value function. These are used in a deco…
▽ More
We present a decentralized minimum-time trajectory optimization scheme based on learning model predictive control for multi-agent systems with nonlinear decoupled dynamics and coupled state constraints. By performing the same task iteratively, data from previous task executions is used to construct and improve local time-varying safe sets and an approximate value function. These are used in a decoupled MPC problem as terminal sets and terminal cost functions. Our framework results in a decentralized controller, which requires no communication between agents over each iteration of task execution, and guarantees persistent feasibility, finite-time closed-loop convergence, and non-decreasing performance of the global system over task iterations. Numerical experiments of a multi-vehicle collision avoidance scenario demonstrate the effectiveness of the proposed scheme.
△ Less
Submitted 18 December, 2020; v1 submitted 2 April, 2020;
originally announced April 2020.
-
Model-Predictive Control with Inverse Statics Optimization for Tensegrity Spine Robots
Authors:
Andrew P. Sabelhaus,
Hua**g Zhao,
Edward L. Zhu,
Adrian K. Agogino,
Alice M. Agogino
Abstract:
Robots with flexible spines based on tensegrity structures have potential advantages over traditional designs with rigid torsos. However, these robots can be difficult to control due to their high-dimensional nonlinear dynamics and actuator constraints. This work presents two controllers for tensegrity spine robots, using model-predictive control (MPC) and inverse statics optimization. The control…
▽ More
Robots with flexible spines based on tensegrity structures have potential advantages over traditional designs with rigid torsos. However, these robots can be difficult to control due to their high-dimensional nonlinear dynamics and actuator constraints. This work presents two controllers for tensegrity spine robots, using model-predictive control (MPC) and inverse statics optimization. The controllers introduce two different approaches to making the control problem computationally tractable. The first utilizes smoothing terms in the MPC problem. The second uses a new inverse statics optimization algorithm, which gives the first feasible solutions to the problem for certain tensegrity robots, to generate reference input trajectories in combination with MPC. Tracking the inverse statics reference input trajectory significantly reduces the number of tuning parameters. The controllers are validated against simulations of two-dimensional and three-dimensional tensegrity spines. Both approaches show noise insensitivity and low tracking error, and can be used for different control goals. The results here demonstrate the first closed-loop control of such structures.
△ Less
Submitted 15 October, 2019; v1 submitted 22 June, 2018;
originally announced June 2018.
-
Inclined Surface Locomotion Strategies for Spherical Tensegrity Robots
Authors:
Lee-Huang Chen,
Brian Cera,
Edward L. Zhu,
Riley Edmunds,
Franklin Rice,
Antonia Bronars,
Ellande Tang,
Saunon R. Malekshahi,
Osvaldo Romero,
Adrian K. Agogino,
Alice M. Agogino
Abstract:
This paper presents a new teleoperated spherical tensegrity robot capable of performing locomotion on steep inclined surfaces. With a novel control scheme centered around the simultaneous actuation of multiple cables, the robot demonstrates robust climbing on inclined surfaces in hardware experiments and speeds significantly faster than previous spherical tensegrity models. This robot is an improv…
▽ More
This paper presents a new teleoperated spherical tensegrity robot capable of performing locomotion on steep inclined surfaces. With a novel control scheme centered around the simultaneous actuation of multiple cables, the robot demonstrates robust climbing on inclined surfaces in hardware experiments and speeds significantly faster than previous spherical tensegrity models. This robot is an improvement over other iterations in the TT-series and the first tensegrity to achieve reliable locomotion on inclined surfaces of up to 24\degree. We analyze locomotion in simulation and hardware under single and multi-cable actuation, and introduce two novel multi-cable actuation policies, suited for steep incline climbing and speed, respectively. We propose compelling justifications for the increased dynamic ability of the robot and motivate development of optimization algorithms able to take advantage of the robot's increased control authority.
△ Less
Submitted 27 August, 2017;
originally announced August 2017.