-
A Closed-Form Control for Safety Under Input Constraints Using a Composition of Control Barrier Functions
Authors:
Pedram Rabiee,
Jesse B. Hoagg
Abstract:
We present a new closed-form optimal control that satisfies both safety constraints (i.e., state constraints) and input constraints (e.g., actuator limits) using a composition of multiple control barrier functions (CBFs). This main result is obtained through the combination of several new ideas. First, we present a method for constructing a single CBF from multiple CBFs, which can have different r…
▽ More
We present a new closed-form optimal control that satisfies both safety constraints (i.e., state constraints) and input constraints (e.g., actuator limits) using a composition of multiple control barrier functions (CBFs). This main result is obtained through the combination of several new ideas. First, we present a method for constructing a single CBF from multiple CBFs, which can have different relative degrees. The construction relies on a log-sum-exponential soft-minimum function and yields a CBF whose zero-superlevel set is a subset of the intersection of the zero-superlevel sets of all the CBFs used in the composition. Next, we use the composite soft-minimum CBF to construct a closed-form control that is optimal with respect to a quadratic cost subject to the safety constraints. Finally, we extend the approach and develop a closed-form optimal control that not only guarantees safety but also respects input constraints. The key elements in develo** this novel closed-form control include: the introduction of the control dynamics, which allow the input constraints to be transformed into constraints on the state of the closed-loop system, and the use of the composite soft-minimum CBF to compose multiple safety and input CBFs, which have different relative degrees, into a single CBF. We also demonstrate these new control approaches on a nonholonomic ground robot example.
△ Less
Submitted 30 March, 2024;
originally announced June 2024.
-
Implementation of Linear Parameter Varying System to Investigate the Impact of Varying Flow Rate on the Lithium-ion Batteries Thermal Management System Performance
Authors:
Pedram Rabiee,
Mohammad Hassan Saidi
Abstract:
Battery thermal management system is an indispensable part of the electric vehicles working with Lithium-ion batteries. Accordingly, lithium-ion batteries modeling, battery heat generation, and thermal management are the main focus of researchers and car manufacturers. To fulfill the need of manufacturers in the design process, a faster model than time-consuming Computational Fluid Dynamics models…
▽ More
Battery thermal management system is an indispensable part of the electric vehicles working with Lithium-ion batteries. Accordingly, lithium-ion batteries modeling, battery heat generation, and thermal management are the main focus of researchers and car manufacturers. To fulfill the need of manufacturers in the design process, a faster model than time-consuming Computational Fluid Dynamics models (CFD) is required. Reduced Order Models (ROM) address this requirement to maintain the accuracy of CFD models while could be compiled faster. Linear Time Invariant (LTI) reduced order model has been used in the literature; however, due to the limitation of LTI system, considering the constant flow rate for the cooling fluid, a Linear Parameter Varying system with three scheduling parameters was developed in this study. It is shown that LPV system results could fit accurately to CFD results in conditions that LTI system cannot maintain accuracy. Moreover, it is shown that applying varying water flow rates could result in a smoother temperature profile.
△ Less
Submitted 2 March, 2024;
originally announced March 2024.
-
Safe Exploration in Reinforcement Learning: Training Backup Control Barrier Functions with Zero Training Time Safety Violations
Authors:
Pedram Rabiee,
Amirsaeid Safari
Abstract:
Safe reinforcement learning (RL) aims to satisfy safety constraints during training. However, guaranteeing safety during training remained a challenging problem. This paper presents a novel framework that integrates Backup Control Barrier Functions (BCBFs) with reinforcement learning (RL) to enable safe exploration called RLBUS: Reinforcement Learning Backup Shield. BCBFs incorporate backup contro…
▽ More
Safe reinforcement learning (RL) aims to satisfy safety constraints during training. However, guaranteeing safety during training remained a challenging problem. This paper presents a novel framework that integrates Backup Control Barrier Functions (BCBFs) with reinforcement learning (RL) to enable safe exploration called RLBUS: Reinforcement Learning Backup Shield. BCBFs incorporate backup controllers that predict a system's finite-time response, facilitating online optimization of a control policy that maintains the forward invariance of a safe subset, while satisfying actuator constraints. Building on the soft-minimum/soft-maximum CBF method from prior work, which ensures feasibility and continuity of the BCBF with multiple backup controllers, this paper proposes integrating these BCBFs with RL. This framework leverages RL to learn a better backup policy to enlarge the forward invariant set, while guaranteeing safety during training. By combining backup controllers and RL, the approach provides safety and feasibility guarantees during training and enables safe online exploration with zero training-time safety violations. The method is demonstrated on an inverted pendulum example, where expanding the forward invariant set through RL allows the pendulum to safely explore larger regions of state space.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Composition of Control Barrier Functions With Differing Relative Degrees for Safety Under Input Constraints
Authors:
Pedram Rabiee,
Jesse B. Hoagg
Abstract:
This paper presents a new approach for guaranteed safety subject to input constraints (e.g., actuator limits) using a composition of multiple control barrier functions (CBFs). First, we present a method for constructing a single CBF from multiple CBFs, which can have different relative degrees. This construction relies on a soft minimum function and yields a CBF whose $0$-superlevel set is a subse…
▽ More
This paper presents a new approach for guaranteed safety subject to input constraints (e.g., actuator limits) using a composition of multiple control barrier functions (CBFs). First, we present a method for constructing a single CBF from multiple CBFs, which can have different relative degrees. This construction relies on a soft minimum function and yields a CBF whose $0$-superlevel set is a subset of the union of the $0$-superlevel sets of all the CBFs used in the construction. Next, we extend the approach to systems with input constraints. Specifically, we introduce control dynamics that allow us to express the input constraints as CBFs in the closed-loop state (i.e., the state of the system and the controller). The CBFs constructed from input constraints do not have the same relative degree as the safety constraints. Thus, the composite soft-minimum CBF construction is used to combine the input-constraint CBFs with the safety-constraint CBFs. Finally, we present a feasible real-time-optimization control that guarantees that the state remains in the $0$-superlevel set of the composite soft-minimum CBF. We demonstrate these approaches on a nonholonomic ground robot example.
△ Less
Submitted 15 March, 2024; v1 submitted 30 September, 2023;
originally announced October 2023.
-
The Impact of Reference-Command Preview on Human-in-the-Loop Control Behavior
Authors:
Pedram Rabiee,
S. Alireza Seyyed Mousavi,
Amelia J. S. Sheffler,
Erik Hellström,
Mrdjan Jankovic,
Mario A. Santillo,
T. M. Seigler,
Jesse B. Hoagg
Abstract:
This article presents results from an experiment in which 44 human subjects interact with a dynamic system to perform 40 trials of a command-following task. The reference command is unpredictable and different on each trial, but all subjects have the same sequence of reference commands for the 40 trials. The subjects are divided into 4 groups of 11 subjects. One group performs the command-followin…
▽ More
This article presents results from an experiment in which 44 human subjects interact with a dynamic system to perform 40 trials of a command-following task. The reference command is unpredictable and different on each trial, but all subjects have the same sequence of reference commands for the 40 trials. The subjects are divided into 4 groups of 11 subjects. One group performs the command-following task without preview of the reference command, and the other 3 groups are given preview of the reference command for different time lengths into the future (0.5 s, 1 s, 1.5 s). A subsystem identification algorithm is used to obtain best-fit models of each subject's control behavior on each trial. The time- and frequency-domain performance, as well as the identified models of the control behavior for the 4 groups are examined to investigate the effects of reference-command preview. The results suggest that preview tends to improve performance by allowing the subjects to compensate for sensory time delay and approximate the inverse dynamics in feedforward. However, too much preview may decrease performance by degrading the ability to use the correct phase lead in feedforward.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
-
Soft-Minimum and Soft-Maximum Barrier Functions for Safety with Actuation Constraints
Authors:
Pedram Rabiee,
Jesse B. Hoagg
Abstract:
This paper presents two new control approaches for guaranteed safety (remaining in a safe set) subject to actuator constraints (the control is in a convex polytope). The control signals are computed using real-time optimization, including linear and quadratic programs subject to affine constraints, which are shown to be feasible. The first control method relies on a soft-minimum barrier function t…
▽ More
This paper presents two new control approaches for guaranteed safety (remaining in a safe set) subject to actuator constraints (the control is in a convex polytope). The control signals are computed using real-time optimization, including linear and quadratic programs subject to affine constraints, which are shown to be feasible. The first control method relies on a soft-minimum barrier function that is constructed using a finite-time-horizon prediction of the system trajectories under a known backup control. The main result shows that the control is continuous and satisfies the actuator constraints, and a subset of the safe set is forward invariant under the control. Next, we extend this method to allow from multiple backup controls. This second approach relies on a combined soft-maximum/soft-minimum barrier function, and it has properties similar to the first. We demonstrate these controls on numerical simulations of an inverted pendulum and a nonholonomic ground robot.
△ Less
Submitted 18 February, 2024; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Soft-Minimum Barrier Functions for Safety-Critical Control Subject to Actuation Constraints
Authors:
Pedram Rabiee,
Jesse B. Hoagg
Abstract:
This paper presents a new control approach for guaranteed safety (remaining in a safe set) subject to actuator constraints (the control is in a convex polytope). The control signals are computed using real-time optimization, including linear and quadratic programs subject to affine constraints, which are shown to be feasible. The control method relies on a new soft-minimum barrier function that is…
▽ More
This paper presents a new control approach for guaranteed safety (remaining in a safe set) subject to actuator constraints (the control is in a convex polytope). The control signals are computed using real-time optimization, including linear and quadratic programs subject to affine constraints, which are shown to be feasible. The control method relies on a new soft-minimum barrier function that is constructed using a finite-time-horizon prediction of the system trajectories under a known backup control. The main result shows that: (i) the control is continuous and satisfies the actuator constraints, and (ii) a subset of the safe set is forward invariant under the control. We also demonstrate this control on numerical simulations of an inverted pendulum and a double-integrator ground robot.
△ Less
Submitted 30 August, 2023; v1 submitted 2 April, 2023;
originally announced April 2023.