Search | arXiv e-print repository

Energy-efficient predictive control for connected, automated driving under localization uncertainty

Authors: Eunhyek Joa, Eric Yongkeun Choi, Francesco Borrelli

Abstract: This paper presents a data-driven Model Predictive Control (MPC) for energy-efficient urban road driving for connected, automated vehicles. The proposed MPC aims to minimize total energy consumption by controlling the vehicle's longitudinal motion on roads with traffic lights and preceding vehicles. Its terminal cost function and terminal constraints are learned from data, which consists of the cl… ▽ More This paper presents a data-driven Model Predictive Control (MPC) for energy-efficient urban road driving for connected, automated vehicles. The proposed MPC aims to minimize total energy consumption by controlling the vehicle's longitudinal motion on roads with traffic lights and preceding vehicles. Its terminal cost function and terminal constraints are learned from data, which consists of the closed-loop state and input trajectories. The terminal cost function represents the remaining energy-to-spend starting from a given terminal state. The terminal constraints are designed to ensure that the controlled vehicle timely crosses the upcoming traffic light, adheres to traffic laws, and accounts for the preceding vehicles. We validate the effectiveness of our method through both simulations and real-world vehicle experiments, demonstrating $\textbf{19\%}$ improvement in average energy consumption compared to conventional approaches that involve solving a long-horizon optimal control problem for speed planning and employing a separate controller for speed tracking. △ Less

Submitted 22 May, 2024; originally announced May 2024.

Comments: Submitted to IEEE Transactions of Intelligent Vehicles. arXiv admin note: text overlap with arXiv:2402.01059

arXiv:2405.11698 [pdf, other]

Multi-Objective Learning Model Predictive Control

Authors: Siddharth H. Nair, Charlott Vallon, Francesco Borrelli

Abstract: Multi-Objective Learning Model Predictive Control is a novel data-driven control scheme which improves a system's closed-loop performance with respect to several control objectives over iterations of a repeated task. At each task iteration, collected system data is used to construct terminal components of a Model Predictive Controller. The formulation presented in this paper ensures that closed-lo… ▽ More Multi-Objective Learning Model Predictive Control is a novel data-driven control scheme which improves a system's closed-loop performance with respect to several control objectives over iterations of a repeated task. At each task iteration, collected system data is used to construct terminal components of a Model Predictive Controller. The formulation presented in this paper ensures that closed-loop control performance improves between successive iterations with respect to each objective. We provide proofs of recursive feasibility and performance improvement, and show that the converged policy is Pareto optimal. Simulation results demonstrate the applicability of the proposed approach. △ Less

Submitted 19 May, 2024; originally announced May 2024.

arXiv:2403.14545 [pdf, other]

Learning Hierarchical Control For Multi-Agent Capacity-Constrained Systems

Authors: Charlott Vallon, Alessandro Pinto, Bartolomeo Stellato, Francesco Borrelli

Abstract: This paper introduces a novel data-driven hierarchical control scheme for managing a fleet of nonlinear, capacity-constrained autonomous agents in an iterative environment. We propose a control framework consisting of a high-level dynamic task assignment and routing layer and low-level motion planning and tracking layer. Each layer of the control hierarchy uses a data-driven Model Predictive Contr… ▽ More This paper introduces a novel data-driven hierarchical control scheme for managing a fleet of nonlinear, capacity-constrained autonomous agents in an iterative environment. We propose a control framework consisting of a high-level dynamic task assignment and routing layer and low-level motion planning and tracking layer. Each layer of the control hierarchy uses a data-driven Model Predictive Control (MPC) policy, maintaining bounded computational complexity at each calculation of a new task assignment or actuation input. We utilize collected data to iteratively refine estimates of agent capacity usage, and update MPC policy parameters accordingly. Our approach leverages tools from iterative learning control to integrate learning at both levels of the hierarchy, and coordinates learning between levels in order to maintain closed-loop feasibility and performance improvement of the connected architecture. △ Less

Submitted 10 April, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

Comments: 16 pages, 4 figures

arXiv:2403.14536 [pdf, other]

Learning Hierarchical Control Systems for Autonomous Systems with Energy Constraints

Authors: Charlott Vallon, Mark Pustilnik, Alessandro Pinto, Francesco Borrelli

Abstract: This paper focuses on the design of hierarchical control architectures for autonomous systems with energy constraints. We focus on systems where energy storage limitations and slow recharge rates drastically affect the way the autonomous systems are operated. Using examples from space robotics and public transportation, we motivate the need for formally designed learning hierarchical control syste… ▽ More This paper focuses on the design of hierarchical control architectures for autonomous systems with energy constraints. We focus on systems where energy storage limitations and slow recharge rates drastically affect the way the autonomous systems are operated. Using examples from space robotics and public transportation, we motivate the need for formally designed learning hierarchical control systems. We propose a learning control architecture which incorporates learning mechanisms at various levels of the control hierarchy to improve performance and resource utilization. The proposed hierarchical control scheme relies on high-level energy-aware task planning and assignment, complemented by a low-level predictive control mechanism responsible for the autonomous execution of tasks, including motion control and energy management. Simulation examples show the benefits and the limitations of the proposed architecture when learning is used to obtain a more energy-efficient task allocation. △ Less

Submitted 21 March, 2024; originally announced March 2024.

arXiv:2403.13906 [pdf, other]

Clustering Heuristics for Robust Energy Capacitated Vehicle Routing Problem (ECVRP)

Authors: Mark Pustilnik, Francesco Borrelli

Abstract: The paper presents an approach to solving the Robust Energy Capacitated Vehicle Routing Problem (RECVRP), focusing on electric vehicles and their limited battery capacity. A finite number of customers, each with their own demand, have to be serviced by an electric vehicle fleet while ensuring that none of the vehicles run out of energy. The time and energy it takes to travel between any two points… ▽ More The paper presents an approach to solving the Robust Energy Capacitated Vehicle Routing Problem (RECVRP), focusing on electric vehicles and their limited battery capacity. A finite number of customers, each with their own demand, have to be serviced by an electric vehicle fleet while ensuring that none of the vehicles run out of energy. The time and energy it takes to travel between any two points is modeled as a random variable with known distribution. We propose a Mixed Integer Program (MIP) for computing an exact solution and introduce clustering heuristics to enhance the solution speed. This enables efficient re-planning of routes in dynamic scenarios. The methodology transforms the RECVRP into smaller problems, yielding good quality solutions quickly compared to existing methods. We demonstrate the effectiveness of this approach using a well-known benchmark problem set as well as a set of randomly generated problems. △ Less

Submitted 20 March, 2024; originally announced March 2024.

arXiv:2402.14183 [pdf, other]

Parking of Connected Automated Vehicles: Vehicle Control, Parking Assignment, and Multi-agent Simulation

Authors: Xu Shen, Yongkeun Choi, Alex Wong, Francesco Borrelli, Scott Moura, Soomin Woo

Abstract: This paper introduces a novel approach to optimize the parking efficiency for fleets of Connected and Automated Vehicles (CAVs). We present a novel multi-vehicle parking simulator, equipped with hierarchical path planning and collision avoidance capabilities for individual CAVs. The simulator is designed to capture the key decision-making processes in parking, from low-level vehicle control to hig… ▽ More This paper introduces a novel approach to optimize the parking efficiency for fleets of Connected and Automated Vehicles (CAVs). We present a novel multi-vehicle parking simulator, equipped with hierarchical path planning and collision avoidance capabilities for individual CAVs. The simulator is designed to capture the key decision-making processes in parking, from low-level vehicle control to high-level parking assignment, and it enables the effective assessment of parking strategies for large fleets of ground vehicles. We formulate and compare different strategic parking spot assignments to minimize a collective cost. While the proposed framework is designed to optimize various objective functions, we choose the total parking time for the experiment, as it is closely related to the reduction of vehicles' energy consumption and greenhouse gas emissions. We validate the effectiveness of the proposed strategies through empirical evaluation against a dataset of real-world parking lot dynamics, realizing a substantial reduction in parking time by up to 43.8%. This improvement is attributed to the synergistic benefits of driving automation, the utilization of shared infrastructure state data, the exclusion of pedestrian traffic, and the real-time computation of optimal parking spot allocation. △ Less

Submitted 21 February, 2024; originally announced February 2024.

arXiv:2402.01116 [pdf, other]

Scalable Multi-modal Model Predictive Control via Duality-based Interaction Predictions

Authors: Hansung Kim, Siddharth H. Nair, Francesco Borrelli

Abstract: We propose a hierarchical architecture designed for scalable real-time Model Predictive Control (MPC) in complex, multi-modal traffic scenarios. This architecture comprises two key components: 1) RAID-Net, a novel attention-based Recurrent Neural Network that predicts relevant interactions along the MPC prediction horizon between the autonomous vehicle and the surrounding vehicles using Lagrangian… ▽ More We propose a hierarchical architecture designed for scalable real-time Model Predictive Control (MPC) in complex, multi-modal traffic scenarios. This architecture comprises two key components: 1) RAID-Net, a novel attention-based Recurrent Neural Network that predicts relevant interactions along the MPC prediction horizon between the autonomous vehicle and the surrounding vehicles using Lagrangian duality, and 2) a reduced Stochastic MPC problem that eliminates irrelevant collision avoidance constraints, enhancing computational efficiency. Our approach is demonstrated in a simulated traffic intersection with interactive surrounding vehicles, showcasing a 12x speed-up in solving the motion planning problem. A video demonstrating the proposed architecture in multiple complex traffic scenarios can be found here: https://youtu.be/-pRiOnPb9_c. GitHub: https://github.com/MPC-Berkeley/hmpc_raidnet △ Less

Submitted 2 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: Accepted at IEEE Intelligent Vehicles Symposium 2024

arXiv:2402.01059 [pdf, other]

Eco-driving under localization uncertainty for connected vehicles on Urban roads: Data-driven approach and Experiment verification

Authors: Eunhyek Joa, Eric Yongkeun Choi, Francesco Borrelli

Abstract: This paper addresses the eco-driving problem for connected vehicles on urban roads, considering localization uncertainty. Eco-driving is defined as longitudinal speed planning and control on roads with the presence of a sequence of traffic lights. We solve the problem by using a data-driven model predictive control (MPC) strategy. This approach involves learning a cost-to-go function and constrain… ▽ More This paper addresses the eco-driving problem for connected vehicles on urban roads, considering localization uncertainty. Eco-driving is defined as longitudinal speed planning and control on roads with the presence of a sequence of traffic lights. We solve the problem by using a data-driven model predictive control (MPC) strategy. This approach involves learning a cost-to-go function and constraints from state-input data. The cost-to-go function represents the remaining energy-to-spend from the given state, and the constraints ensure that the controlled vehicle passes the upcoming traffic light timely while obeying traffic laws. The resulting convex optimization problem has a short horizon and is amenable for real-time implementations. We demonstrate the effectiveness of our approach through real-world vehicle experiments. Our method demonstrates $12\%$ improvement in energy efficiency compared to the traditional approaches, which plan longitudinal speed by solving a long-horizon optimal control problem and track the planned speed using another controller, as evidenced by vehicle experiments. △ Less

Submitted 4 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: Accepted for IEEE IV 2024

arXiv:2401.12556 [pdf, other]

Approximate solution of stochastic infinite horizon optimal control problems for constrained linear uncertain systems

Authors: Eunhyek Joa, Francesco Borrelli

Abstract: We propose a Model Predictive Control (MPC) with a single-step prediction horizon to solve infinite horizon optimal control problems with the expected sum of convex stage costs for constrained linear uncertain systems. The proposed method relies on two techniques. First, we estimate the expected values of the convex costs using a computationally tractable approximation, achieved by sampling across… ▽ More We propose a Model Predictive Control (MPC) with a single-step prediction horizon to solve infinite horizon optimal control problems with the expected sum of convex stage costs for constrained linear uncertain systems. The proposed method relies on two techniques. First, we estimate the expected values of the convex costs using a computationally tractable approximation, achieved by sampling across the space of disturbances. Second, we implement a data-driven approach to approximate the optimal value function and its corresponding domain, through systematic exploration of the system's state space. These estimates are subsequently used as the terminal cost and terminal set within the proposed MPC. We prove recursive feasibility, robust constraint satisfaction, and convergence in probability to the target set. Furthermore, we prove that the estimated value function converges to the optimal value function in a local region. The effectiveness of the proposed MPC is illustrated with detailed numerical simulations and comparisons with a value iteration method and a Learning MPC that minimizes a certainty equivalent cost. △ Less

Submitted 23 January, 2024; originally announced January 2024.

Comments: Submitted to the IEEE Transactions on Automatic Control

arXiv:2310.20561 [pdf, other]

Predictive Control for Autonomous Driving with Uncertain, Multi-modal Predictions

Authors: Siddharth H. Nair, Hotae Lee, Eunhyek Joa, Yan Wang, H. Eric Tseng, Francesco Borrelli

Abstract: We propose a Stochastic MPC (SMPC) formulation for path planning with autonomous vehicles in scenarios involving multiple agents with multi-modal predictions. The multi-modal predictions capture the uncertainty of urban driving in distinct modes/maneuvers (e.g., yield, keep speed) and driving trajectories (e.g., speed, turning radius), which are incorporated for multi-modal collision avoidance cha… ▽ More We propose a Stochastic MPC (SMPC) formulation for path planning with autonomous vehicles in scenarios involving multiple agents with multi-modal predictions. The multi-modal predictions capture the uncertainty of urban driving in distinct modes/maneuvers (e.g., yield, keep speed) and driving trajectories (e.g., speed, turning radius), which are incorporated for multi-modal collision avoidance chance constraints for path planning. In the presence of multi-modal uncertainties, it is challenging to reliably compute feasible path planning solutions at real-time frequencies ($\geq$ 10 Hz). Our main technological contribution is a convex SMPC formulation that simultaneously (1) optimizes over parameterized feedback policies and (2) allocates risk levels for each mode of the prediction. The use of feedback policies and risk allocation enhances the feasibility and performance of the SMPC formulation against multi-modal predictions with large uncertainty. We evaluate our approach via simulations and road experiments with a full-scale vehicle interacting in closed-loop with virtual vehicles. We consider distinct, multi-modal driving scenarios: 1) Negotiating a traffic light and a fast, tailgating agent, 2) Executing an unprotected left turn at a traffic intersection, and 3) Changing lanes in the presence of multiple agents. For all of these scenarios, our approach reliably computes multi-modal solutions to the path-planning problem at real-time frequencies. △ Less

Submitted 31 October, 2023; originally announced October 2023.

Comments: The first three authors contributed equally

arXiv:2309.10716 [pdf, other]

Learning Model Predictive Control with Error Dynamics Regression for Autonomous Racing

Authors: Haoru Xue, Edward L. Zhu, John M. Dolan, Francesco Borrelli

Abstract: This work presents a novel Learning Model Predictive Control (LMPC) strategy for autonomous racing at the handling limit that can iteratively explore and learn unknown dynamics in high-speed operational domains. We start from existing LMPC formulations and modify the system dynamics learning method. In particular, our approach uses a nominal, global, nonlinear, physics-based model with a local, li… ▽ More This work presents a novel Learning Model Predictive Control (LMPC) strategy for autonomous racing at the handling limit that can iteratively explore and learn unknown dynamics in high-speed operational domains. We start from existing LMPC formulations and modify the system dynamics learning method. In particular, our approach uses a nominal, global, nonlinear, physics-based model with a local, linear, data-driven learning of the error dynamics. We conducted experiments in simulation and on 1/10th scale hardware, and deployed the proposed LMPC on a full-scale autonomous race car used in the Indy Autonomous Challenge (IAC) with closed loop experiments at the Putnam Park Road Course in Indiana, USA. The results show that the proposed control policy exhibits improved robustness to parameter tuning and data scarcity. Incremental and safety-aware exploration toward the limit of handling and iterative learning of the vehicle dynamics in high-speed domains is observed both in simulations and experiments. △ Less

Submitted 7 March, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

Comments: Accepted by ICRA 2024

arXiv:2309.07262 [pdf, other]

Euclidean and non-Euclidean Trajectory Optimization Approaches for Quadrotor Racing

Authors: Thomas Fork, Francesco Borrelli

Abstract: We present two approaches to compute raceline trajectories for quadrotors by solving an optimal control problem. The approaches involve expressing quadrotor pose in either a Euclidean or non-Euclidean frame of reference and are both based on collocation. The compute times of both approaches are over 100x faster than published methods. Additionally, both approaches compute trajectories with faster… ▽ More We present two approaches to compute raceline trajectories for quadrotors by solving an optimal control problem. The approaches involve expressing quadrotor pose in either a Euclidean or non-Euclidean frame of reference and are both based on collocation. The compute times of both approaches are over 100x faster than published methods. Additionally, both approaches compute trajectories with faster lap time and show improved numerical convergence. In the last part of the paper we devise a novel method to compute racelines in dense obstacle fields using the non-Euclidean approach. △ Less

Submitted 13 September, 2023; originally announced September 2023.

arXiv:2305.14732 [pdf, other]

Increasing Electric Vehicles Utilization in Transit Fleets using Learning, Predictions, Optimization, and Automation

Authors: Jacopo Guanetti, Yeojun Kim, Xu Shen, Joel Donham, Santosh Alexander, Bruce Wootton, Francesco Borrelli

Abstract: This work presents a novel hierarchical approach to increase Battery Electric Buses (BEBs) utilization in transit fleets. The proposed approach relies on three key components. A learning-based BEB digital twin cloud platform is used to accurately predict BEB charge consumption on a per vehicle, per driver, and per route basis, and accurately predict the time-to-charge BEB batteries to any level. T… ▽ More This work presents a novel hierarchical approach to increase Battery Electric Buses (BEBs) utilization in transit fleets. The proposed approach relies on three key components. A learning-based BEB digital twin cloud platform is used to accurately predict BEB charge consumption on a per vehicle, per driver, and per route basis, and accurately predict the time-to-charge BEB batteries to any level. These predictions are then used by a Predictive Block Assignment module to maximize the BEB fleet utilization. This module computes the optimal BEB daily assignment and charge management strategy. A Depot Parking and Charging Queue Management module is used to autonomously park and charge the vehicles based on their charging demands. The paper discusses the technical approach and benefits of each level in architecture and concludes with a realistic simulations study. The study shows that if our approach is employed, BEB fleet utilization can increase by 50% compared to state-of-the-art methods. △ Less

Submitted 24 May, 2023; originally announced May 2023.

Comments: Accepted at the 35th IEEE Intelligent Vehicles Symposium (IV 2023)

arXiv:2304.08576 [pdf, other]

doi 10.1109/IV55152.2023.10186574

Energy-Efficient Lane Changes Planning and Control for Connected Autonomous Vehicles on Urban Roads

Authors: Eunhyek Joa, Hotae Lee, Eric Yongkeun Choi, Francesco Borrelli

Abstract: This paper presents a novel energy-efficient motion planning algorithm for Connected Autonomous Vehicles (CAVs) on urban roads. The approach consists of two components: a decision-making algorithm and an optimization-based trajectory planner. The decision-making algorithm leverages Signal Phase and Timing (SPaT) information from connected traffic lights to select a lane with the aim of reducing en… ▽ More This paper presents a novel energy-efficient motion planning algorithm for Connected Autonomous Vehicles (CAVs) on urban roads. The approach consists of two components: a decision-making algorithm and an optimization-based trajectory planner. The decision-making algorithm leverages Signal Phase and Timing (SPaT) information from connected traffic lights to select a lane with the aim of reducing energy consumption. The algorithm is based on a heuristic rule which is learned from human driving data. The optimization-based trajectory planner generates a safe, smooth, and energy-efficient trajectory toward the selected lane. The proposed strategy is experimentally evaluated in a Vehicle-in-the-Loop (VIL) setting, where a real test vehicle receives SPaT information from both actual and virtual traffic lights and autonomously drives on a testing site, while the surrounding vehicles are simulated. The results demonstrate that the use of SPaT information in autonomous driving leads to improved energy efficiency, with the proposed strategy saving 37.1% energy consumption compared to a lane-kee** algorithm. △ Less

Submitted 27 November, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

Comments: IEEE Intelligent Vehicle Symposium, Anchorage, Alaska, June 4-7, 2023

Journal ref: 2023 IEEE Intelligent Vehicles Symposium (IV). 2023

arXiv:2303.12152 [pdf, other]

Learning for Online Mixed-Integer Model Predictive Control with Parametric Optimality Certificates

Authors: Luigi Russo, Siddharth H. Nair, Luigi Glielmo, Francesco Borrelli

Abstract: We propose a supervised learning framework for computing solutions of multi-parametric Mixed Integer Linear Programs (MILPs) that arise in Model Predictive Control. Our approach also quantifies sub-optimality for the computed solutions. Inspired by Branch-and-Bound techniques, the key idea is to train a Neural Network/Random Forest, which for a given parameter, predicts a strategy consisting of (1… ▽ More We propose a supervised learning framework for computing solutions of multi-parametric Mixed Integer Linear Programs (MILPs) that arise in Model Predictive Control. Our approach also quantifies sub-optimality for the computed solutions. Inspired by Branch-and-Bound techniques, the key idea is to train a Neural Network/Random Forest, which for a given parameter, predicts a strategy consisting of (1) a set of Linear Programs (LPs) such that their feasible sets form a partition of the feasible set of the MILP and (2) a candidate integer solution. For control computation and sub-optimality quantification, we solve a set of LPs online in parallel. We demonstrate our approach for a motion planning example and compare against various commercial and open-source mixed-integer programming solvers. △ Less

Submitted 22 March, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

Comments: First two authors contributed equally

arXiv:2303.12127 [pdf, other]

Robust Output-Lifted Learning Model Predictive Control

Authors: Siddharth H. Nair, Francesco Borrelli

Abstract: We propose an iterative approach for designing Robust Learning Model Predictive Control (LMPC) policies for a class of nonlinear systems with additive, unmodelled dynamics. The nominal dynamics are assumed to be difference flat, i.e., the state and input can be reconstructed using flat output sequences. For the considered class of systems, we synthesize Robust MPC policies and show how to use hist… ▽ More We propose an iterative approach for designing Robust Learning Model Predictive Control (LMPC) policies for a class of nonlinear systems with additive, unmodelled dynamics. The nominal dynamics are assumed to be difference flat, i.e., the state and input can be reconstructed using flat output sequences. For the considered class of systems, we synthesize Robust MPC policies and show how to use historical trajectory data collected during iterative tasks to 1) obtain bounds on the unmodelled dynamics and 2) construct a convex value function approximation along with a convex safe set in the space of output sequences for designing terminal components in the Robust MPC design. We show that the proposed strategy guarantees robust constraint satisfaction, asymptotic convergence to a desired subset of the state space, and non-decreasing closed-loop performance at each policy update. Finally, simulation results demonstrate the effectiveness of the proposed strategy on a minimum time control problem using a constrained nonlinear and uncertain vehicle model. △ Less

Submitted 21 March, 2023; originally announced March 2023.

arXiv:2302.10498 [pdf, ps, other]

doi 10.23919/ACC55779.2023.10155959

Output Feedback Stochastic MPC with Hard Input Constraints

Authors: Eunhyek Joa, Monimoy Bujarbaruah, Francesco Borrelli

Abstract: We present an output feedback stochastic model predictive controller (SMPC) for constrained linear time-invariant systems. The system is perturbed by additive Gaussian disturbances on state and additive Gaussian measurement noise on output. A Kalman filter is used for state estimation and an SMPC is designed to satisfy chance constraints on states and hard constraints on actuator inputs. The propo… ▽ More We present an output feedback stochastic model predictive controller (SMPC) for constrained linear time-invariant systems. The system is perturbed by additive Gaussian disturbances on state and additive Gaussian measurement noise on output. A Kalman filter is used for state estimation and an SMPC is designed to satisfy chance constraints on states and hard constraints on actuator inputs. The proposed SMPC constructs bounded sets for the state evolution and a tube-based constraint tightening strategy where the tightened constraints are time-invariant. We prove that the proposed SMPC can guarantee an infeasibility rate below a user-specified tolerance. We numerically compare our method with a classical output feedback SMPC with simulation results which highlight the efficacy of the proposed algorithm. △ Less

Submitted 27 November, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

Comments: IEEE American Control Conference (ACC) 2023, May 31 - June 2, San Diego, CA, USA

Journal ref: 2023 American Control Conference (ACC) (pp. 2034-2039). IEEE

arXiv:2302.01586 [pdf, other]

Reinforcement Learning and Distributed Model Predictive Control for Conflict Resolution in Highly Constrained Spaces

Authors: Xu Shen, Francesco Borrelli

Abstract: This work presents a distributed algorithm for resolving cooperative multi-vehicle conflicts in highly constrained spaces. By formulating the conflict resolution problem as a Multi-Agent Reinforcement Learning (RL) problem, we can train a policy offline to drive the vehicles towards their destinations safely and efficiently in a simplified discrete environment. During the online execution, each ve… ▽ More This work presents a distributed algorithm for resolving cooperative multi-vehicle conflicts in highly constrained spaces. By formulating the conflict resolution problem as a Multi-Agent Reinforcement Learning (RL) problem, we can train a policy offline to drive the vehicles towards their destinations safely and efficiently in a simplified discrete environment. During the online execution, each vehicle first simulates the interaction among vehicles with the trained policy to obtain its strategy, which is used to guide the computation of a reference trajectory. A distributed Model Predictive Controller (MPC) is then proposed to track the reference while avoiding collisions. The preliminary results show that the combination of RL and distributed MPC has the potential to guide vehicles to resolve conflicts safely and smoothly while being less computationally demanding than the centralized approach. △ Less

Submitted 3 February, 2023; originally announced February 2023.

arXiv:2301.04316 [pdf, other]

Facilitating Cooperative and Distributed Multi-Vehicle Lane Change Maneuvers

Authors: Hansung Kim, Francesco Borrelli

Abstract: A distributed coordination method for solving multi-vehicle lane changes for connected autonomous vehicles (CAVs) is presented. Existing approaches to multi-vehicle lane changes are passive and opportunistic as they are implemented only when the environment allows it. The novel approach of this paper relies on the role of a facilitator assigned to a CAV. The facilitator interacts with and modifies… ▽ More A distributed coordination method for solving multi-vehicle lane changes for connected autonomous vehicles (CAVs) is presented. Existing approaches to multi-vehicle lane changes are passive and opportunistic as they are implemented only when the environment allows it. The novel approach of this paper relies on the role of a facilitator assigned to a CAV. The facilitator interacts with and modifies the environment to enable lane changes of other CAVs. Distributed MPC path planners and a distributed coordination algorithm are used to control the facilitator and other CAVs in a proactive and cooperative way. We demonstrate the effectiveness of the proposed approach through numerical simulations. In particular, we show enhanced feasibility of a multi-CAV lane change in comparison to the simultaneous multi-CAV lane change approach in various traffic conditions generated by using a data-set from real-traffic scenarios. △ Less

Submitted 3 March, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

Comments: 8 pages, 4 figures. Accepted at IFAC World Congress 2023

arXiv:2211.16633 [pdf, other]

Collaborative learning model predictive control for repetitive tasks

Authors: Paula Chanfreut, José María Maestre, Eduardo F. Camacho, Francesco Borrelli

Abstract: This paper presents a cloud-based learning model predictive controller that integrates three interacting components: a set of agents, which must learn to perform a finite set of tasks with the minimum possible local cost; a coordinator, which assigns the tasks to the agents; and the cloud, which stores data to facilitate the agents' learning. The tasks consist in traveling repeatedly between a set… ▽ More This paper presents a cloud-based learning model predictive controller that integrates three interacting components: a set of agents, which must learn to perform a finite set of tasks with the minimum possible local cost; a coordinator, which assigns the tasks to the agents; and the cloud, which stores data to facilitate the agents' learning. The tasks consist in traveling repeatedly between a set of target states while satisfying input and state constraints. In turn, the state constraints may change in time for each of the possible tasks. To deal with it, different modes of operation, which establish different restrictions, are defined. The agents' inputs are found by solving local model predictive control (MPC) problems where the terminal set and cost are defined from previous trajectories. The data collected by each agent is uploaded to the cloud and made accessible to all their peers. Likewise, similarity between tasks is exploited to accelerate the learning process. The applicability of the proposed approach is illustrated by simulation results. △ Less

Submitted 29 November, 2022; originally announced November 2022.

Comments: Conference on Decision and Control 2022

arXiv:2211.01487 [pdf, other]

Multi-vehicle Conflict Resolution in Highly Constrained Spaces by Merging Optimal Control and Reinforcement Learning

Authors: Xu Shen, Francesco Borrelli

Abstract: We present a novel method to address the problem of multi-vehicle conflict resolution in highly constrained spaces. An optimal control problem is formulated to incorporate nonlinear, non-holonomic vehicle dynamics and exact collision avoidance constraints. A solution to the problem can be obtained by first learning configuration strategies with reinforcement learning (RL) in a simplified discrete… ▽ More We present a novel method to address the problem of multi-vehicle conflict resolution in highly constrained spaces. An optimal control problem is formulated to incorporate nonlinear, non-holonomic vehicle dynamics and exact collision avoidance constraints. A solution to the problem can be obtained by first learning configuration strategies with reinforcement learning (RL) in a simplified discrete environment, and then using these strategies to shape the constraint space of the original problem. Simulation results show that our method can explore efficient actions to resolve conflicts in confined space and generate dexterous maneuvers that are both collision-free and kinematically feasible. △ Less

Submitted 10 November, 2022; v1 submitted 2 November, 2022; originally announced November 2022.

arXiv:2209.10108 [pdf, other]

Stochastic MPC with Realization-Adaptive Constraint Tightening

Authors: Hotae Lee, Monimoy Bujarbaruah, Francesco Borrelli

Abstract: This paper presents a stochastic model predictive controller (SMPC) for linear time-invariant systems in the presence of additive disturbances. The distribution of the disturbance is unknown and is assumed to have a bounded support. A sample-based strategy is used to compute sets of disturbance sequences necessary for robustifying the state chance constraints. These sets are constructed offline us… ▽ More This paper presents a stochastic model predictive controller (SMPC) for linear time-invariant systems in the presence of additive disturbances. The distribution of the disturbance is unknown and is assumed to have a bounded support. A sample-based strategy is used to compute sets of disturbance sequences necessary for robustifying the state chance constraints. These sets are constructed offline using samples of the disturbance extracted from its support. For online MPC implementation, we propose a novel reformulation strategy of the chance constraints, where the constraint tightening is computed by adjusting the offline computed sets based on the previously realized disturbances along the trajectory. The proposed MPC is recursive feasible and can lower conservatism over existing SMPC approaches at the cost of higher offline computational time. Numerical simulations demonstrate the effectiveness of the proposed approach. △ Less

Submitted 29 September, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

Comments: Submitted to ACC 2023

arXiv:2208.03525 [pdf, other]

Stochastic MPC with Dual Control for Autonomous Driving with Multi-Modal Interaction-Aware Predictions

Authors: Siddharth H. Nair, Vijay Govindarajan, Theresa Lin, Yan Wang, Eric H. Tseng, Francesco Borrelli

Abstract: We propose a Stochastic MPC (SMPC) approach for autonomous driving which incorporates multi-modal, interaction-aware predictions of surrounding vehicles. For each mode, vehicle motion predictions are obtained by a control model described using a basis of fixed features with unknown weights. The proposed SMPC formulation finds optimal controls which serves two purposes: 1) reducing conservatism of… ▽ More We propose a Stochastic MPC (SMPC) approach for autonomous driving which incorporates multi-modal, interaction-aware predictions of surrounding vehicles. For each mode, vehicle motion predictions are obtained by a control model described using a basis of fixed features with unknown weights. The proposed SMPC formulation finds optimal controls which serves two purposes: 1) reducing conservatism of the SMPC by optimizing over parameterized control laws and 2) prediction and estimation of feature weights used in interaction-aware modeling using Kalman filtering. The proposed approach is demonstrated on a longitudinal control example, with uncertainties in predictions of the autonomous and surrounding vehicles. △ Less

Submitted 6 August, 2022; originally announced August 2022.

Comments: Accepted to AVEC'22

arXiv:2207.05896 [pdf, other]

Safe Human-Robot Collaborative Transportation via Trust-Driven Role Adaptation

Authors: Tony Zheng, Monimoy Bujarbaruah, Yvonne R. Stürz, Francesco Borrelli

Abstract: We study a human-robot collaborative transportation task in presence of obstacles. The task for each agent is to carry a rigid object to a common target position, while safely avoiding obstacles and satisfying the compliance and actuation constraints of the other agent. Human and robot do not share the local view of the environment. The human policy either assists the robot when they deem the robo… ▽ More We study a human-robot collaborative transportation task in presence of obstacles. The task for each agent is to carry a rigid object to a common target position, while safely avoiding obstacles and satisfying the compliance and actuation constraints of the other agent. Human and robot do not share the local view of the environment. The human policy either assists the robot when they deem the robot actions safe based on their perception of the environment, or actively leads the task. Using estimated human inputs, the robot plans a trajectory for the transported object by solving a constrained finite time optimal control problem. Sensors on the robot measure the inputs applied by the human. The robot then appropriately applies a weighted combination of the human's applied and its own planned inputs, where the weights are chosen based on the robot's trust value on its estimates of the human's inputs. This allows for a dynamic leader-follower role adaptation of the robot throughout the task. Furthermore, under a low value of trust, if the robot approaches any obstacle potentially unknown to the human, it triggers a safe stop** policy, maintaining safety of the system and signaling a required change in the human's intent. With experimental results, we demonstrate the efficacy of the proposed approach. △ Less

Submitted 12 July, 2022; originally announced July 2022.

arXiv:2204.12533 [pdf, other]

A Gaussian Process Model for Opponent Prediction in Autonomous Racing

Authors: Edward L. Zhu, Finn Lukas Busch, Jake Johnson, Francesco Borrelli

Abstract: In head-to-head racing, an accurate model of interactive behavior of the opposing target vehicle (TV) is required to perform tightly constrained, but highly rewarding maneuvers such as overtaking. However, such information is not typically made available in competitive scenarios, we therefore propose to construct a prediction and uncertainty model given data of the TV from previous races. In parti… ▽ More In head-to-head racing, an accurate model of interactive behavior of the opposing target vehicle (TV) is required to perform tightly constrained, but highly rewarding maneuvers such as overtaking. However, such information is not typically made available in competitive scenarios, we therefore propose to construct a prediction and uncertainty model given data of the TV from previous races. In particular, a one-step Gaussian process (GP) model is trained on closed-loop interaction data to learn the behavior of a TV driven by an unknown policy. Predictions of the nominal trajectory and associated uncertainty are rolled out via a sampling-based approach and are used in a model predictive control (MPC) policy for the ego vehicle in order to intelligently trade-off between safety and performance when attempting overtaking maneuvers against a TV. We demonstrate the GP-based predictor in closed loop with the MPC policy in simulation races and compare its performance against several predictors from literature. In a Monte Carlo study, we observe that the GP-based predictor achieves similar win rates while maintaining safety in up to 3x more races. We finally demonstrate the prediction and control framework in real-time in a experimental study on a 1/10th scale racecar platform operating at speeds of around 2.8 m/s, and show a significant level of improvement when using the GP-based predictor over a baseline MPC predictor. Videos of the hardware experiments can be found at https://youtu.be/KMSs4ofDfIs. △ Less

Submitted 1 March, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

Comments: Submitted to IROS 2023

arXiv:2204.10777 [pdf, other]

doi 10.1109/ITSC55140.2022.9922162

ParkPredict+: Multimodal Intent and Motion Prediction for Vehicles in Parking Lots with CNN and Transformer

Authors: Xu Shen, Matthew Lacayo, Nidhir Guggilla, Francesco Borrelli

Abstract: The problem of multimodal intent and trajectory prediction for human-driven vehicles in parking lots is addressed in this paper. Using models designed with CNN and Transformer networks, we extract temporal-spatial and contextual information from trajectory history and local bird's eye view (BEV) semantic images, and generate predictions about intent distribution and future trajectory sequences. Ou… ▽ More The problem of multimodal intent and trajectory prediction for human-driven vehicles in parking lots is addressed in this paper. Using models designed with CNN and Transformer networks, we extract temporal-spatial and contextual information from trajectory history and local bird's eye view (BEV) semantic images, and generate predictions about intent distribution and future trajectory sequences. Our methods outperform existing models in accuracy, while allowing an arbitrary number of modes, encoding complex multi-agent scenarios, and adapting to different parking maps. To train and evaluate our method, we present the first public 4K video dataset of human driving in parking lots with accurate annotation, high frame rate, and rich traffic scenarios. △ Less

Submitted 10 January, 2023; v1 submitted 16 April, 2022; originally announced April 2022.

Comments: Published at IEEE ITSC 2022

arXiv:2204.10446 [pdf, other]

Overtaking Maneuvers on a Nonplanar Racetrack

Authors: Thomas Fork, H. Eric Tseng, Francesco Borrelli

Abstract: We leverage game theory and a new vehicle modeling approach to compute overtaking maneuvers for racecars on a nonplanar surface. We solve for equilibria between noncooperative racing agents and demonstrate that by leveraging the novel nonplanar vehicle dynamics, overtaking can be achieved in situations where simpler models can do not provide a winning strategy. We leverage game theory and a new vehicle modeling approach to compute overtaking maneuvers for racecars on a nonplanar surface. We solve for equilibria between noncooperative racing agents and demonstrate that by leveraging the novel nonplanar vehicle dynamics, overtaking can be achieved in situations where simpler models can do not provide a winning strategy. △ Less

Submitted 21 May, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

arXiv:2204.09720 [pdf, other]

Vehicle Models and Optimal Control on a Nonplanar Surface

Authors: Thomas Fork, H. Eric Tseng, Francesco Borrelli

Abstract: We present a 10 DoF dynamic vehicle model for model-based control on nonplanar road surfaces. A parametric surface is used to describe the road surface, allowing the surface parameterization to describe the pose of the vehicle. We use the proposed approach to compute minimum-time vehicle trajectories on nonplanar surfaces and compare planar and nonplanar models. We present a 10 DoF dynamic vehicle model for model-based control on nonplanar road surfaces. A parametric surface is used to describe the road surface, allowing the surface parameterization to describe the pose of the vehicle. We use the proposed approach to compute minimum-time vehicle trajectories on nonplanar surfaces and compare planar and nonplanar models. △ Less

Submitted 21 May, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

arXiv:2203.16478 [pdf, other]

A Sequential Quadratic Programming Approach to the Solution of Open-Loop Generalized Nash Equilibria

Authors: Edward L. Zhu, Francesco Borrelli

Abstract: Dynamic games can be an effective approach to modeling interactive behavior between multiple non-cooperative agents and they provide a theoretical framework for simultaneous prediction and control in such scenarios. In this work, we propose a numerical method for the solution of local generalized Nash equilibria (GNE) for the class of open-loop general-sum dynamic games for agents with nonlinear d… ▽ More Dynamic games can be an effective approach to modeling interactive behavior between multiple non-cooperative agents and they provide a theoretical framework for simultaneous prediction and control in such scenarios. In this work, we propose a numerical method for the solution of local generalized Nash equilibria (GNE) for the class of open-loop general-sum dynamic games for agents with nonlinear dynamics and constraints. In particular, we formulate a sequential quadratic programming (SQP) approach which requires only the solution of a single convex quadratic program at each iteration. Central to the robustness of our approach is a non-monotonic line search method and a novel merit function for SQP step acceptance. We show that our method achieves linear convergence in the neighborhood of local GNE and we derive an update rule for the merit function which helps to improve convergence from a larger set of initial conditions. We demonstrate the effectiveness of the algorithm in the context of car racing, where we show up to 32\% improvement of success rate when comparing against a state-of-the-art solution approach for dynamic games. \url{https://github.com/zhu-edward/DGSQP}. △ Less

Submitted 15 September, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

arXiv:2109.09792 [pdf, other]

Stochastic MPC with Multi-modal Predictions for Traffic Intersections

Authors: Siddharth H. Nair, Vijay Govindarajan, Theresa Lin, Chris Meissen, H. Eric Tseng, Francesco Borrelli

Abstract: We propose a Stochastic MPC (SMPC) formulation for autonomous driving at traffic intersections which incorporates multi-modal predictions of surrounding vehicles for collision avoidance constraints. The multi-modal predictions are obtained with Gaussian Mixture Models (GMM) and constraints are formulated as chance-constraints. Our main theoretical contribution is a SMPC formulation that optimizes… ▽ More We propose a Stochastic MPC (SMPC) formulation for autonomous driving at traffic intersections which incorporates multi-modal predictions of surrounding vehicles for collision avoidance constraints. The multi-modal predictions are obtained with Gaussian Mixture Models (GMM) and constraints are formulated as chance-constraints. Our main theoretical contribution is a SMPC formulation that optimizes over a novel feedback policy class designed to exploit additional structure in the GMM predictions, and that is amenable to convex programming. The use of feedback policies for prediction is motivated by the need for reduced conservatism in handling multi-modal predictions of the surrounding vehicles, especially prevalent in traffic intersection scenarios. We evaluate our algorithm along axes of mobility, comfort, conservatism and computational efficiency at a simulated intersection in CARLA. Our simulations use a kinematic bicycle model and multimodal predictions trained on a subset of the Lyft Level 5 prediction dataset. To demonstrate the impact of optimizing over feedback policies, we compare our algorithm with two SMPC baselines that handle multi-modal collision avoidance chance constraints by optimizing over open-loop sequences. △ Less

Submitted 25 February, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

Comments: Extended version of ITSC 2022 submission

arXiv:2109.06296 [pdf, other]

Monocular Camera Localization for Automated Vehicles Using Image Retrieval

Authors: Eunhyek Joa, Yibo Sun, Francesco Borrelli

Abstract: We address the problem of finding the current position and heading angle of an autonomous vehicle in real-time using a single camera. Compared to methods which require LiDARs and high definition (HD) 3D maps in real-time, the proposed approach is easily scalable and computationally efficient, at the price of lower precision. The new method combines and adapts existing algorithms in three differe… ▽ More We address the problem of finding the current position and heading angle of an autonomous vehicle in real-time using a single camera. Compared to methods which require LiDARs and high definition (HD) 3D maps in real-time, the proposed approach is easily scalable and computationally efficient, at the price of lower precision. The new method combines and adapts existing algorithms in three different fields: image retrieval, map** database, and particle filtering. The result is a simple, real-time localization method using an image retrieval method whose performance is comparable to other monocular camera localization methods which use a map built with LiDARs. We evaluate the proposed method using the KITTI odometry dataset and via closed-loop experiments with an indoor 1:10 autonomous vehicle. The tests demonstrate real-time capability and a 10cm level accuracy. Also, experimental results of the closed-loop indoor tests show the presence of a positive feedback loop between the localization error and the control error. Such phenomena is analysed in details at the end of the article. △ Less

Submitted 30 November, 2023; v1 submitted 13 September, 2021; originally announced September 2021.

arXiv:2108.07911 [pdf, other]

Compact Cooperative Adaptive Cruise Control for Energy Saving: Air Drag Modelling and Simulation

Authors: Yeojun Kim, Jacopo Guanetti, Francesco Borrelli

Abstract: This paper studies the value of communicated motion predictions in the longitudinal control of connected automated vehicles (CAVs). We focus on a safe cooperative adaptive cruise control (CACC) design and analyze the value of vehicle-to-vehicle (V2V) communication in the presence of uncertain front vehicle acceleration. The interest in CACC is motivated by the potential improvement in energy consu… ▽ More This paper studies the value of communicated motion predictions in the longitudinal control of connected automated vehicles (CAVs). We focus on a safe cooperative adaptive cruise control (CACC) design and analyze the value of vehicle-to-vehicle (V2V) communication in the presence of uncertain front vehicle acceleration. The interest in CACC is motivated by the potential improvement in energy consumption and road throughput. In order to quantify this potential, we characterize experimentally the relationship between inter-vehicular gap, vehicle speed, and (reduction of) energy consumption for a compact plug-in hybrid electric vehicle. The resulting model is leveraged to show efficacy of our control design, which pursues small inter-vehicle gaps between consecutive CAVs and, therefore, improved energy efficiency. Our proposed control design is based on a robust model predictive control framework to systematically account for the system uncertainties. We present a set of thorough simulations aimed at quantifying energy efficiency improvement when vehicle states and predictions exchanged via V2V communication are used in the control law. △ Less

Submitted 17 August, 2021; originally announced August 2021.

Comments: Full (extended) version of the accepted paper at IEEE Transactions on Vehicular Technology

arXiv:2105.06005 [pdf, ps, other]

Data-Driven Strategies for Hierarchical Predictive Control in Unknown Environments

Authors: Charlott Vallon, Francesco Borrelli

Abstract: This article proposes a hierarchical learning architecture for safe data-driven control in unknown environments. We consider a constrained nonlinear dynamical system and assume the availability of state-input trajectories solving control tasks in different environments. In addition to task-invariant system state and input constraints, a parameterized environment model generates task-specific state… ▽ More This article proposes a hierarchical learning architecture for safe data-driven control in unknown environments. We consider a constrained nonlinear dynamical system and assume the availability of state-input trajectories solving control tasks in different environments. In addition to task-invariant system state and input constraints, a parameterized environment model generates task-specific state constraints, which are satisfied by the stored trajectories. Our goal is to use these trajectories to find a safe and high-performing policy for a new task in a new, unknown environment. We propose using the stored data to learn generalizable control strategies. At each time step, based on a local forecast of the new task environment, the learned strategy consists of a target region in the state space and input constraints to guide the system evolution to the target region. These target regions are used as terminal sets by a low-level model predictive controller. We show how to i) design the target sets from past data and then ii) incorporate them into a model predictive control scheme with shifting horizon that ensures safety of the closed-loop system when performing the new task. We prove the feasibility of the resulting control policy, and apply the proposed method to robotic path planning, racing, and computer game applications. △ Less

Submitted 14 July, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

Comments: arXiv admin note: substantial text overlap with arXiv:2005.05948

arXiv:2104.08427 [pdf, other]

Models and Predictive Control for Nonplanar Vehicle Navigation

Authors: Thomas Fork, H. Eric Tseng, Francesco Borrelli

Abstract: We present a simplified model of a vehicle driving on a nonplanar road. A parametric surface is used to describe the nonplanar road which can describe any combination of curvature, bank and slope. We show that the proposed modeling approach generalizes planar vehicle models that reference a centerline, such as the Frenet model. We use the proposed approach for vehicle path planning and following… ▽ More We present a simplified model of a vehicle driving on a nonplanar road. A parametric surface is used to describe the nonplanar road which can describe any combination of curvature, bank and slope. We show that the proposed modeling approach generalizes planar vehicle models that reference a centerline, such as the Frenet model. We use the proposed approach for vehicle path planning and following using model predictive control. We also model and control vehicle contact with the road surface. We demonstrate that the proposed controller improves speed and lane following on complex roads compared to planar vehicle controllers, and mitigates loss of control on complex road surfaces including off-camber turns. △ Less

Submitted 19 April, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

arXiv:2103.12351 [pdf, other]

A Simple Robust MPC for Linear Systems with Parametric and Additive Uncertainty

Authors: Monimoy Bujarbaruah, Ugo Rosolia, Yvonne R. Stürz, Francesco Borrelli

Abstract: We propose a simple and computationally efficient approach for designing a robust Model Predictive Controller (MPC) for constrained uncertain linear systems. The uncertainty is modeled as an additive disturbance and an additive error on the system dynamics matrices. Set based bounds for each component of the model uncertainty are assumed to be known. We separate the constraint tightening strategy… ▽ More We propose a simple and computationally efficient approach for designing a robust Model Predictive Controller (MPC) for constrained uncertain linear systems. The uncertainty is modeled as an additive disturbance and an additive error on the system dynamics matrices. Set based bounds for each component of the model uncertainty are assumed to be known. We separate the constraint tightening strategy into two parts, depending on the length of the MPC horizon. For a horizon length of one, the robust MPC problem is solved exactly, whereas for other horizon lengths, the model uncertainty is over-approximated with a net-additive component. The resulting MPC controller guarantees robust satisfaction of state and input constraints in closed-loop with the uncertain system. With appropriately designed terminal components and an adaptive horizon strategy, we prove the controller's recursive feasibility and stability of the origin. With numerical simulations, we demonstrate that our proposed approach gains up to 15x online computation speedup over a tube MPC strategy, while stabilizing about 98$\%$ of the latter's region of attraction. △ Less

Submitted 23 March, 2021; originally announced March 2021.

Comments: Final version for IEEE American Control Conference (ACC), May 2021. arXiv admin note: text overlap with arXiv:2007.00930

arXiv:2103.04460 [pdf, other]

Learning Environment Constraints in Collaborative Robotics: A Decentralized Leader-Follower Approach

Authors: Monimoy Bujarbaruah, Yvonne R. Stürz, Conrad Holda, Karl H. Johansson, Francesco Borrelli

Abstract: In this paper, we propose a leader-follower hierarchical strategy for two robots collaboratively transporting an object in a partially known environment with obstacles. Both robots sense the local surrounding environment and react to obstacles in their proximity. We consider no explicit communication, so the local environment information and the control actions are not shared between the robots. A… ▽ More In this paper, we propose a leader-follower hierarchical strategy for two robots collaboratively transporting an object in a partially known environment with obstacles. Both robots sense the local surrounding environment and react to obstacles in their proximity. We consider no explicit communication, so the local environment information and the control actions are not shared between the robots. At any given time step, the leader solves a model predictive control (MPC) problem with its known set of obstacles and plans a feasible trajectory to complete the task. The follower estimates the inputs of the leader and uses a policy to assist the leader while reacting to obstacles in its proximity. The leader infers obstacles in the follower's vicinity by using the difference between the predicted and the real-time estimated follower control action. A method to switch the leader-follower roles is used to improve the control performance in tight environments. The efficacy of our approach is demonstrated with detailed comparisons to two alternative strategies, where it achieves the highest success rate, while completing the task fastest. See the link www.dropbox.com/s/hexadigqkvspaeh/IROS_Video.mp4?dl=0 for a descriptive video of the algorithm. △ Less

Submitted 24 July, 2021; v1 submitted 7 March, 2021; originally announced March 2021.

Comments: Full version of our IROS 2021 paper

arXiv:2011.10668 [pdf, other]

Learning How to Solve Bubble Ball

Authors: Hotae Lee, Monimoy Bujarbaruah, Francesco Borrelli

Abstract: "Bubble Ball" is a game built on a 2D physics engine, where a finite set of objects can modify the motion of a bubble-like ball. The objective is to choose the set and the initial configuration of the objects, in order to get the ball to reach a target flag. The presence of obstacles, friction, contact forces and combinatorial object choices make the game hard to solve. In this paper, we propose a… ▽ More "Bubble Ball" is a game built on a 2D physics engine, where a finite set of objects can modify the motion of a bubble-like ball. The objective is to choose the set and the initial configuration of the objects, in order to get the ball to reach a target flag. The presence of obstacles, friction, contact forces and combinatorial object choices make the game hard to solve. In this paper, we propose a hierarchical predictive framework which solves Bubble Ball. Geometric, kinematic and dynamic models are used at different levels of the hierarchy. At each level of the game, data collected during failed iterations are used to update models at all hierarchical level and converge to a feasible solution to the game. The proposed approach successfully solves a large set of Bubble Ball levels within reasonable number of trials. This proposed framework can also be used to solve other physics-based games, especially with limited training data from human demonstrations. △ Less

Submitted 28 April, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

Comments: Accepted to L4DC 2021

arXiv:2011.00413 [pdf, other]

Collision Avoidance in Tightly-Constrained Environments without Coordination: a Hierarchical Control Approach

Authors: Xu Shen, Edward L. Zhu, Yvonne R. Stürz, Francesco Borrelli

Abstract: We present a hierarchical control approach for maneuvering an autonomous vehicle (AV) in tightly-constrained environments where other moving AVs and/or human driven vehicles are present. A two-level hierarchy is proposed: a high-level data-driven strategy predictor and a lower-level model-based feedback controller. The strategy predictor maps an encoding of a dynamic environment to a set of high-l… ▽ More We present a hierarchical control approach for maneuvering an autonomous vehicle (AV) in tightly-constrained environments where other moving AVs and/or human driven vehicles are present. A two-level hierarchy is proposed: a high-level data-driven strategy predictor and a lower-level model-based feedback controller. The strategy predictor maps an encoding of a dynamic environment to a set of high-level strategies via a neural network. Depending on the selected strategy, a set of time-varying hyperplanes in the AV's position space is generated online and the corresponding halfspace constraints are included in a lower-level model-based receding horizon controller. These strategy-dependent constraints drive the vehicle towards areas where it is likely to remain feasible. Moreover, the predicted strategy also informs switching between a discrete set of policies, which allows for more conservative behavior when prediction confidence is low. We demonstrate the effectiveness of the proposed data-driven hierarchical control framework in a two-car collision avoidance scenario through simulations and experiments on a 1/10 scale autonomous car platform where the strategy-guided approach outperforms a model predictive control baseline in both cases. △ Less

Submitted 18 March, 2021; v1 submitted 31 October, 2020; originally announced November 2020.

Comments: 7 pages, 7 figures, accepted at ICRA 2021

arXiv:2007.09562 [pdf, other]

Learning to Play Cup-and-Ball with Noisy Camera Observations

Authors: Monimoy Bujarbaruah, Tony Zheng, Akhil Shetty, Martin Sehr, Francesco Borrelli

Abstract: Playing the cup-and-ball game is an intriguing task for robotics research since it abstracts important problem characteristics including system nonlinearity, contact forces and precise positioning as terminal goal. In this paper, we present a learning model based control strategy for the cup-and-ball game, where a Universal Robots UR5e manipulator arm learns to catch a ball in one of the cups on a… ▽ More Playing the cup-and-ball game is an intriguing task for robotics research since it abstracts important problem characteristics including system nonlinearity, contact forces and precise positioning as terminal goal. In this paper, we present a learning model based control strategy for the cup-and-ball game, where a Universal Robots UR5e manipulator arm learns to catch a ball in one of the cups on a Kendama. Our control problem is divided into two sub-tasks, namely $(i)$ swinging the ball up in a constrained motion, and $(ii)$ catching the free-falling ball. The swing-up trajectory is computed offline, and applied in open-loop to the arm. Subsequently, a convex optimization problem is solved online during the ball's free-fall to control the manipulator and catch the ball. The controller utilizes noisy position feedback of the ball from an Intel RealSense D435 depth camera. We propose a novel iterative framework, where data is used to learn the support of the camera noise distribution iteratively in order to update the control policy. The probability of a catch with a fixed policy is computed empirically with a user specified number of roll-outs. Our design guarantees that probability of the catch increases in the limit, as the learned support nears the true support of the camera noise distribution. High-fidelity Mujoco simulations and preliminary experimental results support our theoretical analysis. △ Less

Submitted 18 July, 2020; originally announced July 2020.

Comments: Final paper for IEEE CASE, Hong Kong, August, 2020. First three authors contributed equally

arXiv:2007.00930 [pdf, other]

Robust MPC for Linear Systems with Parametric and Additive Uncertainty: A Novel Constraint Tightening Approach

Authors: Monimoy Bujarbaruah, Ugo Rosolia, Yvonne R Stürz, Xiao**g Zhang, Francesco Borrelli

Abstract: We propose a novel approach to design a robust Model Predictive Controller (MPC) for constrained uncertain linear systems. The uncertain system is modeled as linear parameter varying with additive disturbance. Set bounds for the system matrices and the additive uncertainty are assumed to be known. We formulate a novel optimization-based constraint tightening strategy around a predicted nominal tra… ▽ More We propose a novel approach to design a robust Model Predictive Controller (MPC) for constrained uncertain linear systems. The uncertain system is modeled as linear parameter varying with additive disturbance. Set bounds for the system matrices and the additive uncertainty are assumed to be known. We formulate a novel optimization-based constraint tightening strategy around a predicted nominal trajectory which utilizes these bounds. With an appropriately designed terminal cost function and constraint set, we prove robust satisfaction of the imposed constraints by the resulting MPC in closed-loop with the uncertain system, and Input to State Stability of the origin. We highlight the efficacy of our proposed approach via a numerical example. △ Less

Submitted 10 August, 2022; v1 submitted 2 July, 2020; originally announced July 2020.

Comments: Dimension of matrices in the MPC problem fixed. Shortened and slightly altered version of this draft published as a full paper in Automatica

arXiv:2006.13406 [pdf, ps, other]

Distributed Learning Model Predictive Control for Linear Systems

Authors: Yvonne R. Stürz, Edward L. Zhu, Ugo Rosolia, Karl H. Johansson, Francesco Borrelli

Abstract: This paper presents a distributed learning model predictive control (DLMPC) scheme for distributed linear time invariant systems with coupled dynamics and state constraints. The proposed solution method is based on an online distributed optimization scheme with nearest-neighbor communication. If the control task is iterative and data from previous feasible iterations are available, local data are… ▽ More This paper presents a distributed learning model predictive control (DLMPC) scheme for distributed linear time invariant systems with coupled dynamics and state constraints. The proposed solution method is based on an online distributed optimization scheme with nearest-neighbor communication. If the control task is iterative and data from previous feasible iterations are available, local data are exploited by the subsystems in order to construct the local terminal set and terminal cost, which guarantee recursive feasibility and asymptotic stability, as well as performance improvement over iterations. In case a first feasible trajectory is difficult to obtain, or the task is non-iterative, we further propose an algorithm that efficiently explores the state-space and generates the data required for the construction of the terminal cost and terminal constraint in the MPC problem in a safe and distributed way. In contrast to other distributed MPC schemes which use structured positive invariant sets, the proposed approach involves a control invariant set as the terminal set, on which we do not impose any distributed structure. The proposed iterative scheme converges to the global optimal solution of the underlying infinite horizon optimal control problem under mild conditions. Numerical experiments demonstrate the effectiveness of the proposed DLMPC scheme. △ Less

Submitted 23 June, 2020; originally announced June 2020.

arXiv:2006.10272 [pdf, other]

Improving Urban Traffic Throughput with Vehicle Platooning: Theory and Experiments

Authors: Stanley W. Smith, Yeojun Kim, Jacopo Guanetti, Ruolin Li, Roya Firoozi, Bruce Wootton, Alexander A. Kurzhanskiy, Francesco Borrelli, Roberto Horowitz, Murat Arcak

Abstract: In this paper we present a model-predictive control (MPC) based approach for vehicle platooning in an urban traffic setting. Our primary goal is to demonstrate that vehicle platooning has the potential to significantly increase throughput at intersections, which can create bottlenecks in the traffic flow. To do so, our approach relies on vehicle connectivity: vehicle-to-vehicle (V2V) and vehicle-t… ▽ More In this paper we present a model-predictive control (MPC) based approach for vehicle platooning in an urban traffic setting. Our primary goal is to demonstrate that vehicle platooning has the potential to significantly increase throughput at intersections, which can create bottlenecks in the traffic flow. To do so, our approach relies on vehicle connectivity: vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I) communication. In particular, we introduce a customized V2V message set which features a velocity forecast, i.e. a prediction on the future velocity trajectory, which enables platooning vehicles to accurately maintain short following distances, thereby increasing throughput. Furthermore, V2I communication allows platoons to react immediately to changes in the state of nearby traffic lights, e.g. when the traffic phase becomes green, enabling additional gains in traffic efficiency. We present our design of the vehicle platooning system, and then evaluate performance by estimating the potential gains in terms of throughput using our results from simulation, as well as experiments conducted with real test vehicles on a closed track. Lastly, we briefly overview our demonstration of vehicle platooning on public roadways in Arcadia, CA. △ Less

Submitted 27 July, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

Comments: 14 pages, 11 figures, 4 tables

arXiv:2006.05054 [pdf, other]

Learning to Satisfy Unknown Constraints in Iterative MPC

Authors: Monimoy Bujarbaruah, Charlott Vallon, Francesco Borrelli

Abstract: We propose a control design method for linear time-invariant systems that iteratively learns to satisfy unknown polyhedral state constraints. At each iteration of a repetitive task, the method constructs an estimate of the unknown environment constraints using collected closed-loop trajectory data. This estimated constraint set is improved iteratively upon collection of additional data. An MPC con… ▽ More We propose a control design method for linear time-invariant systems that iteratively learns to satisfy unknown polyhedral state constraints. At each iteration of a repetitive task, the method constructs an estimate of the unknown environment constraints using collected closed-loop trajectory data. This estimated constraint set is improved iteratively upon collection of additional data. An MPC controller is then designed to robustly satisfy the estimated constraint set. This paper presents the details of the proposed approach, and provides robust and probabilistic guarantees of constraint satisfaction as a function of the number of executed task iterations. We demonstrate the safety of the proposed framework and explore the safety vs. performance trade-off in a detailed numerical example. △ Less

Submitted 10 June, 2023; v1 submitted 9 June, 2020; originally announced June 2020.

Comments: Long version of the published paper for IEEE-CDC 2020. First two authors contributed equally. Added some very relevant citations that were missing

arXiv:2006.03704 [pdf, other]

Data-driven Energy Management Strategy for Plug-in Hybrid Electric Vehicles with Real-World Trip Information

Authors: Yongkeun Choi, Jacopo Guanetti, Scott Moura, Francesco Borrelli

Abstract: This paper presents a data-driven supervisory energy management strategy (EMS) for plug-in hybrid electric vehicles which leverages Vehicle-to-Cloud connectivity to increase energy efficiency by learning control policies from completed trips. The proposed EMS consists of two layers, a cloud layer and an on-board layer. The cloud layer has two main tasks: the first task is to learn EMS policy param… ▽ More This paper presents a data-driven supervisory energy management strategy (EMS) for plug-in hybrid electric vehicles which leverages Vehicle-to-Cloud connectivity to increase energy efficiency by learning control policies from completed trips. The proposed EMS consists of two layers, a cloud layer and an on-board layer. The cloud layer has two main tasks: the first task is to learn EMS policy parameters from historical trip data, and the second task is to provide the policy parameters along a certain route requested from the vehicle. The on-board layer receives the learned policy parameters from the cloud layer and computes a real-time solution to the powertrain energy management problem, using a model predictive control scheme. The proposed EMS is evaluated on more than 3000 miles (48 independent driving cycles) of real-world trip data, collected along three commuting routes in California. For the routes, the proposed algorithm shows 3.3%, 7.3%, and 6.5% improvement in average MPGe when compared to a baseline EMS. △ Less

Submitted 5 June, 2020; originally announced June 2020.

Comments: To appear at the 21st International Federation of Automatic Control (IFAC2020)

arXiv:2005.05948 [pdf, other]

Data-Driven Hierarchical Predictive Learning in Unknown Environments

Authors: Charlott Vallon, Francesco Borrelli

Abstract: We propose a hierarchical learning architecture for predictive control in unknown environments. We consider a constrained nonlinear dynamical system and assume the availability of state-input trajectories solving control tasks in different environments. A parameterized environment model generates state constraints specific to each task, which are satisfied by the stored trajectories. Our goal is t… ▽ More We propose a hierarchical learning architecture for predictive control in unknown environments. We consider a constrained nonlinear dynamical system and assume the availability of state-input trajectories solving control tasks in different environments. A parameterized environment model generates state constraints specific to each task, which are satisfied by the stored trajectories. Our goal is to find a feasible trajectory for a new task in an unknown environment. From stored data, we learn strategies in the form of target sets in a reduced-order state space. These strategies are applied to the new task in real-time using a local forecast of the new environment, and the resulting output is used as a terminal region by a low-level receding horizon controller. We show how to i) design the target sets from past data and then ii) incorporate them into a model predictive control scheme with shifting horizon that ensures safety of the closed-loop system when performing the new task. We prove the feasibility of the resulting control policy, and verify the proposed method in a robotic path planning application. △ Less

Submitted 14 July, 2020; v1 submitted 12 May, 2020; originally announced May 2020.

arXiv:2005.01673 [pdf, other]

Task Decomposition for MPC: A Computationally Efficient Approach for Linear Time-Varying Systems

Authors: Charlott Vallon, Francesco Borrelli

Abstract: A Task Decomposition method for iterative learning Model Predictive Control (TDMPC) for linear time-varying systems is presented. We consider the availability of state-input trajectories which solve an original task T1, and design a feasible MPC policy for a new task, T2, using stored data from T1. Our approach applies to tasks T2 which are composed of subtasks contained in T1. In this paper we fo… ▽ More A Task Decomposition method for iterative learning Model Predictive Control (TDMPC) for linear time-varying systems is presented. We consider the availability of state-input trajectories which solve an original task T1, and design a feasible MPC policy for a new task, T2, using stored data from T1. Our approach applies to tasks T2 which are composed of subtasks contained in T1. In this paper we formally define the task decomposition problem, and provide a feasibility proof for the resulting policy. The proposed algorithm reduces the computational burden for linear time-varying systems with piecewise convex constraints. Simulation results demonstrate the improved efficiency of the proposed method on a robotic path-planning task. △ Less

Submitted 4 May, 2020; originally announced May 2020.

arXiv:2004.10293 [pdf, other]

ParkPredict: Motion and Intent Prediction of Vehicles in Parking Lots

Authors: Xu Shen, Ivo Batkovic, Vijay Govindarajan, Paolo Falcone, Trevor Darrell, Francesco Borrelli

Abstract: We investigate the problem of predicting driver behavior in parking lots, an environment which is less structured than typical road networks and features complex, interactive maneuvers in a compact space. Using the CARLA simulator, we develop a parking lot environment and collect a dataset of human parking maneuvers. We then study the impact of model complexity and feature information by comparing… ▽ More We investigate the problem of predicting driver behavior in parking lots, an environment which is less structured than typical road networks and features complex, interactive maneuvers in a compact space. Using the CARLA simulator, we develop a parking lot environment and collect a dataset of human parking maneuvers. We then study the impact of model complexity and feature information by comparing a multi-modal Long Short-Term Memory (LSTM) prediction model and a Convolution Neural Network LSTM (CNN-LSTM) to a physics-based Extended Kalman Filter (EKF) baseline. Our results show that 1) intent can be estimated well (roughly 85% top-1 accuracy and nearly 100% top-3 accuracy with the LSTM and CNN-LSTM model); 2) knowledge of the human driver's intended parking spot has a major impact on predicting parking trajectory; and 3) the semantic representation of the environment improves long term predictions. △ Less

Submitted 21 April, 2020; originally announced April 2020.

Comments: * Indicates equal contribution. Accepted at IEEE Intelligent Vehicles Symposium (IV) 2020

arXiv:2004.05173 [pdf, other]

Output-Lifted Learning Model Predictive Control

Authors: Siddharth H. Nair, Ugo Rosolia, Francesco Borrelli

Abstract: We propose a computationally efficient Learning Model Predictive Control (LMPC) scheme for constrained optimal control of a class of nonlinear systems where the state and input can be reconstructed using lifted outputs. For the considered class of systems, we show how to use historical trajectory data collected during iterative tasks to construct a convex value function approximation along with a… ▽ More We propose a computationally efficient Learning Model Predictive Control (LMPC) scheme for constrained optimal control of a class of nonlinear systems where the state and input can be reconstructed using lifted outputs. For the considered class of systems, we show how to use historical trajectory data collected during iterative tasks to construct a convex value function approximation along with a convex safe set in a lifted space of virtual outputs. These constructions are iteratively updated with historical data and used to synthesize predictive control policies. We show that the proposed strategy guarantees recursive constraint satisfaction, asymptotic stability and non-decreasing closed-loop performance at each policy update. Finally, simulation results demonstrate the effectiveness of the proposed strategy on a piecewise affine (PWA) system, kinematic unicycle and bilinear DC motor. △ Less

Submitted 15 January, 2021; v1 submitted 10 April, 2020; originally announced April 2020.

arXiv:2004.03041 [pdf, other]

Control of Unknown Nonlinear Systems with Linear Time-Varying MPC

Authors: Dimitris Papadimitriou, Ugo Rosolia, Francesco Borrelli

Abstract: We present a Model Predictive Control (MPC) strategy for unknown input-affine nonlinear dynamical systems. A non-parametric method is used to estimate the nonlinear dynamics from observed data. The estimated nonlinear dynamics are then linearized over time varying regions of the state space to construct an Affine Time Varying (ATV) model. Error bounds arising from the estimation and linearization… ▽ More We present a Model Predictive Control (MPC) strategy for unknown input-affine nonlinear dynamical systems. A non-parametric method is used to estimate the nonlinear dynamics from observed data. The estimated nonlinear dynamics are then linearized over time varying regions of the state space to construct an Affine Time Varying (ATV) model. Error bounds arising from the estimation and linearization procedure are computed by using sampling techniques. The ATV model and the uncertainty sets are used to design a robust Model Predictive Control (MPC) problem which guarantees safety for the unknown system with high probability. A simple nonlinear example demonstrates the effectiveness of the approach where commonly used linearization methods fail. △ Less

Submitted 8 October, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

Comments: Fixed double citation in section 4, typos and combined two propositions into one

arXiv:2004.01298 [pdf, other]

Trajectory Optimization for Nonlinear Multi-Agent Systems using Decentralized Learning Model Predictive Control

Authors: Edward L. Zhu, Yvonne R. Stürz, Ugo Rosolia, Francesco Borrelli

Abstract: We present a decentralized minimum-time trajectory optimization scheme based on learning model predictive control for multi-agent systems with nonlinear decoupled dynamics and coupled state constraints. By performing the same task iteratively, data from previous task executions is used to construct and improve local time-varying safe sets and an approximate value function. These are used in a deco… ▽ More We present a decentralized minimum-time trajectory optimization scheme based on learning model predictive control for multi-agent systems with nonlinear decoupled dynamics and coupled state constraints. By performing the same task iteratively, data from previous task executions is used to construct and improve local time-varying safe sets and an approximate value function. These are used in a decoupled MPC problem as terminal sets and terminal cost functions. Our framework results in a decentralized controller, which requires no communication between agents over each iteration of task execution, and guarantees persistent feasibility, finite-time closed-loop convergence, and non-decreasing performance of the global system over task iterations. Numerical experiments of a multi-vehicle collision avoidance scenario demonstrate the effectiveness of the proposed scheme. △ Less

Submitted 18 December, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

Comments: 8 pages, 2 figures, accepted at Conference on Decision and Control 2020

Showing 1–50 of 75 results for author: Borrelli, F