Skip to main content

Showing 1–50 of 98 results for author: Tomlin, C J

.
  1. arXiv:2402.05279  [pdf, other

    cs.LG

    Safety Filters for Black-Box Dynamical Systems by Learning Discriminating Hyperplanes

    Authors: Will Lavanakul, Jason J. Choi, Koushil Sreenath, Claire J. Tomlin

    Abstract: Learning-based approaches are emerging as an effective approach for safety filters for black-box dynamical systems. Existing methods have relied on certificate functions like Control Barrier Functions (CBFs) and Hamilton-Jacobi (HJ) reachability value functions. The primary motivation for our work is the recognition that ultimately, enforcing the safety constraint as a control input constraint at… ▽ More

    Submitted 21 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: * Indicate co-first authors. This is an extended version of the paper presented at L4DC 2024

  2. arXiv:2311.13824  [pdf, other

    cs.RO eess.SY

    Constraint-Guided Online Data Selection for Scalable Data-Driven Safety Filters in Uncertain Robotic Systems

    Authors: Jason J. Choi, Fernando Castañeda, Wonsuhk Jung, Bike Zhang, Claire J. Tomlin, Koushil Sreenath

    Abstract: As the use of autonomous robotic systems expands in tasks that are complex and challenging to model, the demand for robust data-driven control methods that can certify safety and stability in uncertain conditions is increasing. However, the practical implementation of these methods often faces scalability issues due to the growing amount of data points with system complexity, and a significant rel… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: The first three authors contributed equally to the work. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  3. arXiv:2310.17180  [pdf, other

    eess.SY

    A Forward Reachability Perspective on Robust Control Invariance and Discount Factors in Reachability Analysis

    Authors: Jason J. Choi, Donggun Lee, Boyang Li, Jonathan P. How, Koushil Sreenath, Sylvia L. Herbert, Claire J. Tomlin

    Abstract: Control invariant sets are crucial for various methods that aim to design safe control policies for systems whose state constraints must be satisfied over an indefinite time horizon. In this article, we explore the connections among reachability, control invariance, and Control Barrier Functions (CBFs) by examining the forward reachability problem associated with control invariant sets. We present… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: The first two authors contributed equally to this work

  4. arXiv:2309.06655  [pdf, other

    cs.RO cs.LG

    Out of Distribution Detection via Domain-Informed Gaussian Process State Space Models

    Authors: Alonso Marco, Elias Morley, Claire J. Tomlin

    Abstract: In order for robots to safely navigate in unseen scenarios using learning-based methods, it is important to accurately detect out-of-training-distribution (OoD) situations online. Recently, Gaussian process state-space models (GPSSMs) have proven useful to discriminate unexpected observations by comparing them against probabilistic predictions. However, the capability for the model to correctly di… ▽ More

    Submitted 15 September, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 7 pages, 4 figures

  5. arXiv:2307.01927  [pdf, other

    eess.SY

    Safe Connectivity Maintenance of Underactuated Multi-Agent Networks in Dynamic Oceanic Environments

    Authors: Nicolas Hoischen, Marius Wiggert, Claire J. Tomlin

    Abstract: Autonomous multi-agent systems are increasingly being deployed in environments where winds and ocean currents have a significant influence. Recent work has developed control policies for single agents that leverage flows to achieve their objectives in dynamic environments. However, in multi-agent systems, these flows can cause agents to collide or drift apart and lose direct inter-agent communicat… ▽ More

    Submitted 20 June, 2024; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 8 pages, Published at European Control Conference 2024 (ECC 2024) Nicolas Hoischen and Marius Wiggert contributed equally to this work

  6. arXiv:2307.01917  [pdf, other

    eess.SY cs.AI cs.RO

    Stranding Risk for Underactuated Vessels in Complex Ocean Currents: Analysis and Controllers

    Authors: Andreas Doering, Marius Wiggert, Hanna Krasowski, Manan Doshi, Pierre F. J. Lermusiaux, Claire J. Tomlin

    Abstract: Low-propulsion vessels can take advantage of powerful ocean currents to navigate towards a destination. Recent results demonstrated that vessels can reach their destination with high probability despite forecast errors. However, these results do not consider the critical aspect of safety of such vessels: because of their low propulsion which is much smaller than the magnitude of currents, they mig… ▽ More

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: 6 pages, 3 figures, submitted to 2023 IEEE 62th Annual Conference on Decision and Control (CDC) Andreas Doering and Marius Wiggert contributed equally to this work

  7. arXiv:2307.01916  [pdf, other

    eess.SY cs.AI cs.RO

    Maximizing Seaweed Growth on Autonomous Farms: A Dynamic Programming Approach for Underactuated Systems Navigating on Uncertain Ocean Currents

    Authors: Matthias Killer, Marius Wiggert, Hanna Krasowski, Manan Doshi, Pierre F. J. Lermusiaux, Claire J. Tomlin

    Abstract: Seaweed biomass offers significant potential for climate mitigation, but large-scale, autonomous open-ocean farms are required to fully exploit it. Such farms typically have low propulsion and are heavily influenced by ocean currents. We want to design a controller that maximizes seaweed growth over months by taking advantage of the non-linear time-varying ocean currents for reaching high-growth r… ▽ More

    Submitted 29 August, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: 8 pages, submitted to 2023 IEEE 62th Annual Conference on Decision and Control (CDC) Matthias Killer and Marius Wiggert contributed equally to this work

  8. arXiv:2302.01999  [pdf, other

    cs.RO

    Online and Offline Learning of Player Objectives from Partial Observations in Dynamic Games

    Authors: Lasse Peters, Vicenç Rubies-Royo, Claire J. Tomlin, Laura Ferranti, Javier Alonso-Mora, Cyrill Stachniss, David Fridovich-Keil

    Abstract: Robots deployed to the real world must be able to interact with other agents in their environment. Dynamic game theory provides a powerful mathematical framework for modeling scenarios in which agents have individual objectives and interactions evolve over time. However, a key limitation of such techniques is that they require a-priori knowledge of all players' objectives. In this work, we address… ▽ More

    Submitted 14 May, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2106.03611

  9. arXiv:2210.05015  [pdf, other

    cs.AI cs.RO eess.SY stat.ML

    Optimality Guarantees for Particle Belief Approximation of POMDPs

    Authors: Michael H. Lim, Tyler J. Becker, Mykel J. Kochenderfer, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are continuous or hybrid, which is often the case for physical systems. While recent online sampling-based POMDP algorithms that plan with observation likelihood w… ▽ More

    Submitted 19 October, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Journal ref: Journal of Artificial Intelligence Research, 77, 1591-1636 (2023)

  10. arXiv:2208.10733  [pdf, other

    eess.SY cs.LG math.OC

    Recursively Feasible Probabilistic Safe Online Learning with Control Barrier Functions

    Authors: Fernando Castañeda, Jason J. Choi, Wonsuhk Jung, Bike Zhang, Claire J. Tomlin, Koushil Sreenath

    Abstract: Learning-based control schemes have recently shown great efficacy performing complex tasks for a wide variety of applications. However, in order to deploy them in real systems, it is of vital importance to guarantee that the system will remain safe during online training and execution. Among the currently most popular methods to tackle this challenge, Control Barrier Functions (CBFs) serve as math… ▽ More

    Submitted 26 September, 2023; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Journal article. Includes the results of the 2021 CDC paper titled "Pointwise feasibility of gaussian process-based safety-critical control under model uncertainty" and proposes a recursively feasible safe online learning algorithm as new contribution

  11. arXiv:2204.07539  [pdf, other

    eess.SY

    Stability and Robustness of a Hybrid Control Law for the Half-bridge Inverter

    Authors: Gabriel E. Colón-Reyes, Kaylene C. Stocking, Duncan S. Callaway, Claire J. Tomlin

    Abstract: Hybrid systems combine both discrete and continuous state dynamics. Power electronic inverters are inherently hybrid systems: they are controlled via discrete-valued switching inputs which determine the evolution of the continuous-valued current and voltage state dynamics. Hybrid systems analysis could prove increasingly useful as large numbers of renewable energy sources are incorporated to the… ▽ More

    Submitted 18 May, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

  12. arXiv:2204.01986  [pdf, other

    eess.SY math.OC

    On the Computational Consequences of Cost Function Design in Nonlinear Optimal Control

    Authors: Tyler Westenbroek, Anand Siththaranjan, Mohsin Sarwari, Claire J. Tomlin, Shankar S. Sastry

    Abstract: Optimal control is an essential tool for stabilizing complex nonlinear systems. However, despite the extensive impacts of methods such as receding horizon control, dynamic programming and reinforcement learning, the design of cost functions for a particular system often remains a heuristic-driven process of trial and error. In this paper we seek to gain insights into how the choice of cost functio… ▽ More

    Submitted 17 November, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

  13. arXiv:2203.12303  [pdf, other

    eess.SY math.DS

    Koopman-Based Neural Lyapunov Functions for General Attractors

    Authors: Shankar A. Deka, Alonso M. Valle, Claire J. Tomlin

    Abstract: Koopman spectral theory has grown in the past decade as a powerful tool for dynamical systems analysis and control. In this paper, we show how recent data-driven techniques for estimating Koopman-Invariant subspaces with neural networks can be leveraged to extract Lyapunov certificates for the underlying system. In our work, we specifically focus on systems with a limit-cycle, beyond just an isola… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Submitted to CDC 2022

  14. arXiv:2203.10142  [pdf, other

    eess.SY cs.AI cs.LG math.OC

    Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning

    Authors: **gqi Li, Donggun Lee, Somayeh Sojoudi, Claire J. Tomlin

    Abstract: In this paper, we consider the infinite-horizon reach-avoid zero-sum game problem, where the goal is to find a set in the state space, referred to as the reach-avoid set, such that the system starting at a state therein could be controlled to reach a given target set without violating constraints under the worst-case disturbance. We address this problem by designing a new value function with a con… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

  15. arXiv:2201.08538  [pdf, other

    cs.RO eess.SY

    Computation of Regions of Attraction for Hybrid Limit Cycles Using Reachability: An Application to Walking Robots

    Authors: Jason J. Choi, Ayush Agrawal, Koushil Sreenath, Claire J. Tomlin, Somil Bansal

    Abstract: Contact-rich robotic systems, such as legged robots and manipulators, are often represented as hybrid systems. However, the stability analysis and region-of-attraction computation for these systems are often challenging because of the discontinuous state changes upon contact (also referred to as state resets). In this work, we cast the computation of region-ofattraction as a Hamilton-Jacobi (HJ) r… ▽ More

    Submitted 8 February, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Accepted to IEEE RA-L & ICRA, 2022

  16. arXiv:2112.12288  [pdf, other

    cs.LG cs.RO eess.SY

    Safety and Liveness Guarantees through Reach-Avoid Reinforcement Learning

    Authors: Kai-Chieh Hsu, Vicenç Rubies-Royo, Claire J. Tomlin, Jaime F. Fisac

    Abstract: Reach-avoid optimal control problems, in which the system must reach certain goal conditions while staying clear of unacceptable failure modes, are central to safety and liveness assurance for autonomous robotic systems, but their exact solutions are intractable for complex dynamics and environments. Recent successes in reinforcement learning methods to approximately solve optimal control problems… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: Accepted in Robotics: Science and Systems (RSS), 2021

  17. arXiv:2112.09456  [pdf, other

    cs.AI cs.LG cs.RO eess.SY

    Compositional Learning-based Planning for Vision POMDPs

    Authors: Sampada Deglurkar, Michael H. Lim, Johnathan Tucker, Zachary N. Sunberg, Aleksandra Faust, Claire J. Tomlin

    Abstract: The Partially Observable Markov Decision Process (POMDP) is a powerful framework for capturing decision-making problems that involve state and transition uncertainty. However, most current POMDP planners cannot effectively handle high-dimensional image observations prevalent in real world applications, and often require lengthy online training that requires interaction with the environment. In thi… ▽ More

    Submitted 2 December, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  18. arXiv:2109.10450  [pdf, other

    eess.SY cs.RO math.DS

    Towards cyber-physical systems robust to communication delays: A differential game approach

    Authors: Shankar A. Deka, Donggun Lee, Claire J. Tomlin

    Abstract: Collaboration between interconnected cyber-physical systems is becoming increasingly pervasive. Time-delays in communication channels between such systems are known to induce catastrophic failure modes, like high frequency oscillations in robotic manipulators in bilateral teleoperation or string instability in platoons of autonomous vehicles. This paper considers nonlinear time-delay systems repre… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 7 pages, 5 figures, Submitted to IEEE Control Systems Letters

    MSC Class: 34K35; 49L12; 93D21

  19. arXiv:2109.04874  [pdf, other

    cs.RO eess.SY

    Discretizing Dynamics for Maximum Likelihood Constraint Inference

    Authors: Kaylene C. Stocking, David L. McPherson, Robert P. Matthew, Claire J. Tomlin

    Abstract: Maximum likelihood constraint inference is a powerful technique for identifying unmodeled constraints that affect the behavior of a demonstrator acting under a known objective function. However, it was originally formulated only for discrete state-action spaces. Continuous dynamics are more useful for modeling many real-world systems of interest, including the movements of humans and robots. We pr… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: 10 pages, 7 figures

  20. arXiv:2109.00140  [pdf, other

    math.OC eess.SY

    Lax Formulae for Efficiently Solving Two Classes of State-Constrained Optimal Control Problems

    Authors: Donggun Lee, Claire J. Tomlin

    Abstract: This paper presents Lax formulae for solving the following optimal control problems: minimize the maximum (or the minimum) cost over a time horizon, while satisfying a state constraint. We present a viscosity theory, and by applying the theory to the Hamilton-Jacobi (HJ) equations, these Lax formulae are derived. A numerical algorithm for the Lax formulae is presented: under certain conditions, th… ▽ More

    Submitted 31 August, 2021; originally announced September 2021.

  21. arXiv:2106.15006  [pdf, other

    math.OC eess.SY

    Hamilton-Jacobi Equations for Two Classes of State-Constrained Zero-Sum Games

    Authors: Donggun Lee, Claire J. Tomlin

    Abstract: This paper presents Hamilton-Jacobi (HJ) formulations for two classes of two-player zero-sum games: one with a maximum cost value over time, and one with a minimum cost value over time. In the zero-sum game setting, player A minimizes the given cost while satisfying state constraints, and player B wants to prevent player A's success. For each class of problems, this paper presents two HJ equations… ▽ More

    Submitted 28 June, 2021; originally announced June 2021.

  22. arXiv:2106.13440  [pdf, other

    eess.SY math.OC

    A Computationally Efficient Hamilton-Jacobi-based Formula for State-Constrained Optimal Control Problems

    Authors: Donggun Lee, Claire J. Tomlin

    Abstract: This paper investigates a Hamilton-Jacobi (HJ) analysis to solve finite-horizon optimal control problems for high-dimensional systems. Although grid-based methods, such as the level-set method [1], numerically solve a general class of HJ partial differential equations, the computational complexity is exponential in the dimension of the continuous state. To manage this computational complexity, met… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

  23. arXiv:2106.07108  [pdf, other

    eess.SY cs.LG math.OC

    Pointwise Feasibility of Gaussian Process-based Safety-Critical Control under Model Uncertainty

    Authors: Fernando Castañeda, Jason J. Choi, Bike Zhang, Claire J. Tomlin, Koushil Sreenath

    Abstract: Control Barrier Functions (CBFs) and Control Lyapunov Functions (CLFs) are popular tools for enforcing safety and stability of a controlled system, respectively. They are commonly utilized to build constraints that can be incorporated in a min-norm quadratic program (CBF-CLF-QP) which solves for a safety-critical control input. However, since these constraints rely on a model of the system, when t… ▽ More

    Submitted 1 October, 2021; v1 submitted 13 June, 2021; originally announced June 2021.

    Comments: The first two authors contributed equally. Accepted for publication in IEEE 60th Conference on Decision and Control (CDC 2021)

  24. arXiv:2106.03611  [pdf, other

    cs.RO cs.MA

    Inferring Objectives in Continuous Dynamic Games from Noise-Corrupted Partial State Observations

    Authors: Lasse Peters, David Fridovich-Keil, Vicenç Rubies-Royo, Claire J. Tomlin, Cyrill Stachniss

    Abstract: Robots and autonomous systems must interact with one another and their environment to provide high-quality services to their users. Dynamic game theory provides an expressive theoretical framework for modeling scenarios involving multiple agents with differing objectives interacting over time. A core challenge when formulating a dynamic game is designing objectives for each agent that capture desi… ▽ More

    Submitted 7 August, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: Submitted to RSS2021

  25. arXiv:2104.02808  [pdf, other

    eess.SY

    Robust Control Barrier-Value Functions for Safety-Critical Control

    Authors: Jason J. Choi, Donggun Lee, Koushil Sreenath, Claire J. Tomlin, Sylvia L. Herbert

    Abstract: This paper works towards unifying two popular approaches in the safety control community: Hamilton-Jacobi (HJ) reachability and Control Barrier Functions (CBFs). HJ Reachability has methods for direct construction of value functions that provide safety guarantees and safe controllers, however the online implementation can be overly conservative and/or rely on chattering bang-bang control. The CBF… ▽ More

    Submitted 25 October, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: IEEE CDC 2021

  26. arXiv:2103.05746  [pdf, other

    cs.RO cs.AI cs.HC eess.SY

    Analyzing Human Models that Adapt Online

    Authors: Andrea Bajcsy, Anand Siththaranjan, Claire J. Tomlin, Anca D. Dragan

    Abstract: Predictive human models often need to adapt their parameters online from human data. This raises previously ignored safety-related questions for robots relying on these models such as what the model could learn online and how quickly could it learn it. For instance, when will the robot have a confident estimate in a nearby human's goal? Or, what parameter initializations guarantee that the robot c… ▽ More

    Submitted 30 September, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: ICRA 2021

  27. FaSTrack: a Modular Framework for Real-Time Motion Planning and Guaranteed Safe Tracking

    Authors: Mo Chen, Sylvia L. Herbert, Haimin Hu, Ye Pu, Jaime F. Fisac, Somil Bansal, SooJean Han, Claire J. Tomlin

    Abstract: Real-time, guaranteed safe trajectory planning is vital for navigation in unknown environments. However, real-time navigation algorithms typically sacrifice robustness for computation speed. Alternatively, provably safe trajectory planning tends to be too computationally intensive for real-time replanning. We propose FaSTrack, Fast and Safe Tracking, a framework that achieves both real-time replan… ▽ More

    Submitted 13 March, 2021; v1 submitted 13 February, 2021; originally announced February 2021.

    Comments: Published in the IEEE Transactions on Automatic Control

  28. arXiv:2101.12086  [pdf, other

    eess.SY

    Risk-sensitive safety analysis using Conditional Value-at-Risk

    Authors: Margaret P. Chapman, Riccardo Bonalli, Kevin M. Smith, Insoon Yang, Marco Pavone, Claire J. Tomlin

    Abstract: This paper develops a safety analysis method for stochastic systems that is sensitive to the possibility and severity of rare harmful outcomes. We define risk-sensitive safe sets as sub-level sets of the solution to a non-standard optimal control problem, where a random maximum cost is assessed via Conditional Value-at-Risk (CVaR). The objective function represents the maximum extent of constraint… ▽ More

    Submitted 25 June, 2022; v1 submitted 28 January, 2021; originally announced January 2021.

    Journal ref: IEEE Transactions on Automatic Control, 2022

  29. arXiv:2101.05916  [pdf, other

    cs.RO cs.LG eess.SY

    Scalable Learning of Safety Guarantees for Autonomous Systems using Hamilton-Jacobi Reachability

    Authors: Sylvia Herbert, Jason J. Choi, Suvansh Sanjeev, Marsalis Gibson, Koushil Sreenath, Claire J. Tomlin

    Abstract: Autonomous systems like aircraft and assistive robots often operate in scenarios where guaranteeing safety is critical. Methods like Hamilton-Jacobi reachability can provide guaranteed safe sets and controllers for such systems. However, often these same scenarios have unknown or uncertain environments, system dynamics, or predictions of other agents. As the system is operating, it may learn new k… ▽ More

    Submitted 2 April, 2021; v1 submitted 14 January, 2021; originally announced January 2021.

    Comments: The first two authors are co-first authors. ICRA 2021

  30. arXiv:2012.10140  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Voronoi Progressive Widening: Efficient Online Solvers for Continuous State, Action, and Observation POMDPs

    Authors: Michael H. Lim, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: This paper introduces Voronoi Progressive Widening (VPW), a generalization of Voronoi optimistic optimization (VOO) and action progressive widening to partially observable Markov decision processes (POMDPs). Tree search algorithms can use VPW to effectively handle continuous or hybrid action spaces by efficiently balancing local and global action searching. This paper proposes two VPW-based algori… ▽ More

    Submitted 1 April, 2021; v1 submitted 18 December, 2020; originally announced December 2020.

  31. arXiv:2011.07183  [pdf, other

    eess.SY cs.LG math.OC

    Gaussian Process-based Min-norm Stabilizing Controller for Control-Affine Systems with Uncertain Input Effects and Dynamics

    Authors: Fernando Castañeda, Jason J. Choi, Bike Zhang, Claire J. Tomlin, Koushil Sreenath

    Abstract: This paper presents a method to design a min-norm Control Lyapunov Function (CLF)-based stabilizing controller for a control-affine system with uncertain dynamics using Gaussian Process (GP) regression. In order to estimate both state and input-dependent model uncertainty, we propose a novel compound kernel that captures the control-affine nature of the problem. Furthermore, by the use of GP Upper… ▽ More

    Submitted 23 March, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: The first two authors contributed equally. To appear at the 2021 American Control Conference (ACC)

  32. arXiv:2011.04815  [pdf, other

    cs.RO eess.SY

    Encoding Defensive Driving as a Dynamic Nash Game

    Authors: Chih-Yuan Chiu, David Fridovich-Keil, Claire J. Tomlin

    Abstract: Robots deployed in real-world environments should operate safely in a robust manner. In scenarios where an "ego" agent navigates in an environment with multiple other "non-ego" agents, two modes of safety are commonly proposed -- adversarial robustness and probabilistic constraint satisfaction. However, while the former is generally computationally intractable and leads to overconservative solutio… ▽ More

    Submitted 30 March, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: Accepted to ICRA 2021

  33. arXiv:2011.00601  [pdf, other

    eess.SY cs.GT cs.RO

    Approximate Solutions to a Class of Reachability Games

    Authors: David Fridovich-Keil, Claire J. Tomlin

    Abstract: In this paper, we present a method for finding approximate Nash equilibria in a broad class of reachability games. These games are often used to formulate both collision avoidance and goal satisfaction. Our method is computationally efficient, running in real-time for scenarios involving multiple players and more than ten state dimensions. The proposed approach forms a family of increasingly exact… ▽ More

    Submitted 20 March, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: Conference paper at ICRA 2021

  34. arXiv:2009.02874  [pdf

    cs.LG eess.SY math.DS stat.ML

    Dynamically Computing Adversarial Perturbations for Recurrent Neural Networks

    Authors: Shankar A. Deka, Dušan M. Stipanović, Claire J. Tomlin

    Abstract: Convolutional and recurrent neural networks have been widely employed to achieve state-of-the-art performance on classification tasks. However, it has also been noted that these networks can be manipulated adversarially with relative ease, by carefully crafted additive perturbations to the input. Though several experimentally established prior works exist on crafting and defending against attacks,… ▽ More

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: Submitted to IEEE Transactions on Neural Networks and Learning Systems

    MSC Class: 68T07; 93B52; 93C10; 49N90 ACM Class: I.2.8

  35. arXiv:2004.07584  [pdf, other

    eess.SY cs.LG cs.RO

    Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions

    Authors: Jason Choi, Fernando Castañeda, Claire J. Tomlin, Koushil Sreenath

    Abstract: In this paper, the issue of model uncertainty in safety-critical control is addressed with a data-driven approach. For this purpose, we utilize the structure of an input-ouput linearization controller based on a nominal model along with a Control Barrier Function and Control Lyapunov Function based Quadratic Program (CBF-CLF-QP). Specifically, we propose a novel reinforcement learning framework wh… ▽ More

    Submitted 4 June, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: The first two authors contributed equally to this work

  36. arXiv:2004.07276  [pdf, other

    eess.SY cs.LG cs.RO

    Improving Input-Output Linearizing Controllers for Bipedal Robots via Reinforcement Learning

    Authors: Fernando Castañeda, Mathias Wulfman, Ayush Agrawal, Tyler Westenbroek, Claire J. Tomlin, S. Shankar Sastry, Koushil Sreenath

    Abstract: The main drawbacks of input-output linearizing controllers are the need for precise dynamics models and not being able to account for input constraints. Model uncertainty is common in almost every robotic application and input saturation is present in every real world system. In this paper, we address both challenges for the specific case of bipedal robot control by the use of reinforcement learni… ▽ More

    Submitted 2 May, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: Final version appearing in Learning for Dynamics and Control (L4DC) 2020 Conference

  37. arXiv:2004.02766  [pdf, other

    cs.LG math.DS math.OC stat.ML

    Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

    Authors: Tyler Westenbroek, Eric Mazumdar, David Fridovich-Keil, Valmik Prabhu, Claire J. Tomlin, S. Shankar Sastry

    Abstract: This paper proposes a framework for adaptively learning a feedback linearization-based tracking controller for an unknown system using discrete-time model-free policy-gradient parameter update rules. The primary advantage of the scheme over standard model-reference adaptive control techniques is that it does not require the learned inverse model to be invertible at all instances of time. This enab… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

  38. arXiv:2002.04354  [pdf, other

    cs.RO eess.SY

    Inference-Based Strategy Alignment for General-Sum Differential Games

    Authors: Lasse Peters, David Fridovich-Keil, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: In many settings where multiple agents interact, the optimal choices for each agent depend heavily on the choices of the others. These coupled interactions are well-described by a general-sum differential game, in which players have differing objectives, the state evolves in continuous time, and optimal play may be characterized by one of many equilibrium concepts, e.g., a Nash equilibrium. Often,… ▽ More

    Submitted 6 May, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

  39. arXiv:1910.13369  [pdf, other

    cs.RO cs.LG eess.SY

    A Hamilton-Jacobi Reachability-Based Framework for Predicting and Analyzing Human Motion for Safe Planning

    Authors: Somil Bansal, Andrea Bajcsy, Ellis Ratner, Anca D. Dragan, Claire J. Tomlin

    Abstract: Real-world autonomous systems often employ probabilistic predictive models of human behavior during planning to reason about their future motion. Since accurately modeling human behavior a priori is challenging, such models are often parameterized, enabling the robot to adapt predictions based on observations by maintaining a distribution over the model parameters. Although this enables data and p… ▽ More

    Submitted 5 April, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

  40. arXiv:1910.13272  [pdf, other

    math.OC cs.AI cs.LG eess.SY

    Feedback Linearization for Unknown Systems via Reinforcement Learning

    Authors: Tyler Westenbroek, David Fridovich-Keil, Eric Mazumdar, Shreyas Arora, Valmik Prabhu, S. Shankar Sastry, Claire J. Tomlin

    Abstract: We present a novel approach to control design for nonlinear systems which leverages model-free policy optimization techniques to learn a linearizing controller for a physical plant with unknown dynamics. Feedback linearization is a technique from nonlinear control which renders the input-output dynamics of a nonlinear plant \emph{linear} under application of an appropriate feedback controller. Onc… ▽ More

    Submitted 21 April, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

  41. arXiv:1910.04332  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Sparse tree search optimality guarantees in POMDPs with continuous observation spaces

    Authors: Michael H. Lim, Claire J. Tomlin, Zachary N. Sunberg

    Abstract: Partially observable Markov decision processes (POMDPs) with continuous state and observation spaces have powerful flexibility for representing real-world decision and control problems but are notoriously difficult to solve. Recent online sampling-based algorithms that use observation likelihood weighting have shown unprecedented effectiveness in domains with continuous observation spaces. However… ▽ More

    Submitted 5 June, 2023; v1 submitted 9 October, 2019; originally announced October 2019.

  42. arXiv:1910.00681  [pdf, other

    eess.SY cs.GT cs.MA cs.RO

    An Iterative Quadratic Method for General-Sum Differential Games with Feedback Linearizable Dynamics

    Authors: David Fridovich-Keil, Vicenc Rubies-Royo, Claire J. Tomlin

    Abstract: Iterative linear-quadratic (ILQ) methods are widely used in the nonlinear optimal control community. Recent work has applied similar methodology in the setting of multiplayer general-sum differential games. Here, ILQ methods are capable of finding local equilibria in interactive motion planning problems in real-time. As in most iterative procedures, however, this approach can be sensitive to initi… ▽ More

    Submitted 19 March, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: 7 pages, 5 figures, accepted to IEEE International Conference on Robotics and Automation (2020)

  43. arXiv:1909.09703   

    eess.SY

    Risk-sensitive safety specifications for stochastic systems using Conditional Value-at-Risk

    Authors: Margaret P. Chapman, Jonathan P. Lacotte, Kevin M. Smith, Insoon Yang, Yuxi Han, Marco Pavone, Claire J. Tomlin

    Abstract: This paper proposes a safety analysis method that facilitates a tunable balance between the worst-case and risk-neutral perspectives. First, we define a risk-sensitive safe set to specify the degree of safety attained by a stochastic system. This set is defined as a sublevel set of the solution to an optimal control problem that is expressed using the Conditional Value-at-Risk (CVaR) measure. This… ▽ More

    Submitted 27 July, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: There are technical issues with some of the proofs (e.g., interchangeability between an inf and an integral, which follows from a technical issue with one of the papers we referenced, and topological arguments), which we are working on correcting using different techniques

  44. arXiv:1909.05699  [pdf, other

    eess.SY cs.LG

    Closed-loop Model Selection for Kernel-based Models using Bayesian Optimization

    Authors: Thomas Beckers, Somil Bansal, Claire J. Tomlin, Sandra Hirche

    Abstract: Kernel-based nonparametric models have become very attractive for model-based control approaches for nonlinear systems. However, the selection of the kernel and its hyperparameters strongly influences the quality of the learned model. Classically, these hyperparameters are optimized to minimize the prediction error of the model but this process totally neglects its later usage in the control loop.… ▽ More

    Submitted 12 September, 2019; originally announced September 2019.

    Journal ref: IEEE Conference on Decision and Control 2019

  45. arXiv:1909.04694  [pdf, other

    eess.SY cs.RO

    Efficient Iterative Linear-Quadratic Approximations for Nonlinear Multi-Player General-Sum Differential Games

    Authors: David Fridovich-Keil, Ellis Ratner, Lasse Peters, Anca D. Dragan, Claire J. Tomlin

    Abstract: Many problems in robotics involve multiple decision making agents. To operate efficiently in such settings, a robot must reason about the impact of its decisions on the behavior of other agents. Differential games offer an expressive theoretical framework for formulating these types of multi-agent problems. Unfortunately, most numerical solution techniques scale poorly with state dimension and are… ▽ More

    Submitted 18 March, 2020; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: 8 pages, 4 figures, accepted to the IEEE International Conference on Robotics and Automation

  46. arXiv:1905.00532  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    An Efficient Reachability-Based Framework for Provably Safe Autonomous Navigation in Unknown Environments

    Authors: Andrea Bajcsy, Somil Bansal, Eli Bronstein, Varun Tolani, Claire J. Tomlin

    Abstract: Real-world autonomous vehicles often operate in a priori unknown environments. Since most of these systems are safety-critical, it is important to ensure they operate safely in the face of environment uncertainty, such as unseen obstacles. Current safety analysis tools enable autonomous systems to reason about safety given full information about the state of the environment a priori. However, thes… ▽ More

    Submitted 1 May, 2019; originally announced May 2019.

  47. arXiv:1903.07715  [pdf, other

    eess.SY

    Reachability-Based Safety Guarantees using Efficient Initializations

    Authors: Sylvia L. Herbert, Shromona Ghosh, Somil Bansal, Claire J. Tomlin

    Abstract: Hamilton-Jacobi-Isaacs (HJI) reachability analysis is a powerful tool for analyzing the safety of autonomous systems. This analysis is computationally intensive and typically performed offline. Online, however, the autonomous system may experience changes in system dynamics, external disturbances, and/or the surrounding environment, requiring updated safety guarantees. Rather than restarting the s… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: Submitted to the 2019 IEEE Conference on Decision and Control, 8 pages, 3 figures

  48. arXiv:1902.11277  [pdf, other

    eess.SY

    A Risk-Sensitive Finite-Time Reachability Approach for Safety of Stochastic Dynamic Systems

    Authors: Margaret P. Chapman, Jonathan Lacotte, Aviv Tamar, Donggun Lee, Kevin M. Smith, Victoria Cheng, Jaime F. Fisac, Susmit Jha, Marco Pavone, Claire J. Tomlin

    Abstract: A classic reachability problem for safety of dynamic systems is to compute the set of initial states from which the state trajectory is guaranteed to stay inside a given constraint set over a given time horizon. In this paper, we leverage existing theory of reachability analysis and risk measures to devise a risk-sensitive reachability approach for safety of stochastic dynamic systems under non-ad… ▽ More

    Submitted 30 April, 2019; v1 submitted 28 February, 2019; originally announced February 2019.

  49. A New Simulation Metric to Determine Safe Environments and Controllers for Systems with Unknown Dynamics

    Authors: Shromona Ghosh, Somil Bansal, Alberto Sangiovanni-Vincentelli, Sanjit A. Seshia, Claire J. Tomlin

    Abstract: We consider the problem of extracting safe environments and controllers for reach-avoid objectives for systems with known state and control spaces, but unknown dynamics. In a given environment, a common approach is to synthesize a controller from an abstraction or a model of the system (potentially learned from data). However, in many situations, the relationship between the dynamics of the model… ▽ More

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: 22nd ACM International Conference on Hybrid Systems: Computation and Control (2019)

  50. arXiv:1811.07834  [pdf, other

    cs.RO eess.SY

    Safely Probabilistically Complete Real-Time Planning and Exploration in Unknown Environments

    Authors: David Fridovich-Keil, Jaime F. Fisac, Claire J. Tomlin

    Abstract: We present a new framework for motion planning that wraps around existing kinodynamic planners and guarantees recursive feasibility when operating in a priori unknown, static environments. Our approach makes strong guarantees about overall safety and collision avoidance by utilizing a robust controller derived from reachability analysis. We ensure that motion plans never exit the safe backward rea… ▽ More

    Submitted 6 March, 2019; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: 7 pages, accepted to ICRA 2019