Skip to main content

Showing 1–15 of 15 results for author: Westenbroek, T

.
  1. arXiv:2307.08168  [pdf, other

    cs.LG cs.RO

    Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models

    Authors: Tyler Westenbroek, Jacob Levy, David Fridovich-Keil

    Abstract: We focus on develo** efficient and reliable policy optimization strategies for robot learning with real-world data. In recent years, policy gradient methods have emerged as a promising paradigm for training control policies in simulation. However, these approaches often remain too data inefficient or unreliable to train on real robotic hardware. In this paper we introduce a novel policy gradient… ▽ More

    Submitted 6 November, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

  2. arXiv:2305.09619  [pdf, other

    cs.LG math.OC stat.ML

    The Power of Learned Locally Linear Models for Nonlinear Policy Optimization

    Authors: Daniel Pfrommer, Max Simchowitz, Tyler Westenbroek, Nikolai Matni, Stephen Tu

    Abstract: A common pipeline in learning-based control is to iteratively estimate a model of system dynamics, and apply a trajectory optimization algorithm - e.g.~$\mathtt{iLQR}$ - on the learned model to minimize a target cost. This paper conducts a rigorous analysis of a simplified variant of this strategy for general nonlinear systems. We analyze an algorithm which iterates between estimating local linear… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  3. arXiv:2208.06721  [pdf, other

    cs.RO eess.SY

    Lyapunov Design for Robust and Efficient Robotic Reinforcement Learning

    Authors: Tyler Westenbroek, Fernando Castaneda, Ayush Agrawal, Shankar Sastry, Koushil Sreenath

    Abstract: Recent advances in the reinforcement learning (RL) literature have enabled roboticists to automatically train complex policies in simulated environments. However, due to the poor sample complexity of these methods, solving RL problems using real-world data remains a challenging problem. This paper introduces a novel cost-sha** method which aims to reduce the number of samples needed to learn a s… ▽ More

    Submitted 17 November, 2022; v1 submitted 13 August, 2022; originally announced August 2022.

  4. arXiv:2204.01986  [pdf, other

    eess.SY math.OC

    On the Computational Consequences of Cost Function Design in Nonlinear Optimal Control

    Authors: Tyler Westenbroek, Anand Siththaranjan, Mohsin Sarwari, Claire J. Tomlin, Shankar S. Sastry

    Abstract: Optimal control is an essential tool for stabilizing complex nonlinear systems. However, despite the extensive impacts of methods such as receding horizon control, dynamic programming and reinforcement learning, the design of cost functions for a particular system often remains a heuristic-driven process of trial and error. In this paper we seek to gain insights into how the choice of cost functio… ▽ More

    Submitted 17 November, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

  5. arXiv:2103.15010  [pdf, other

    math.OC cs.LG eess.SY

    On the Stability of Nonlinear Receding Horizon Control: A Geometric Perspective

    Authors: Tyler Westenbroek, Max Simchowitz, Michael I. Jordan, S. Shankar Sastry

    Abstract: %!TEX root = LCSS_main_max.tex The widespread adoption of nonlinear Receding Horizon Control (RHC) strategies by industry has led to more than 30 years of intense research efforts to provide stability guarantees for these methods. However, current theoretical guarantees require that each (generally nonconvex) planning problem can be solved to (approximate) global optimality, which is an unrealis… ▽ More

    Submitted 25 January, 2024; v1 submitted 27 March, 2021; originally announced March 2021.

  6. arXiv:2004.10331  [pdf, other

    math.OC eess.SY

    Learning Min-norm Stabilizing Control Laws for Systems with Unknown Dynamics

    Authors: Tyler Westenbroek, Fernando Castaneda, Ayush Agrawal, S. Shankar Sastry, Koushil Sreenath

    Abstract: This paper introduces a framework for learning a minimum-norm stabilizing controller for a system with unknown dynamics using model-free policy optimization methods. The approach begins by first designing a Control Lyapunov Function (CLF) for a (possibly inaccurate) dynamics model for the system, along with a function which specifies a minimum acceptable rate of energy dissipation for the CLF at d… ▽ More

    Submitted 1 October, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

  7. arXiv:2004.07276  [pdf, other

    eess.SY cs.LG cs.RO

    Improving Input-Output Linearizing Controllers for Bipedal Robots via Reinforcement Learning

    Authors: Fernando CastaƱeda, Mathias Wulfman, Ayush Agrawal, Tyler Westenbroek, Claire J. Tomlin, S. Shankar Sastry, Koushil Sreenath

    Abstract: The main drawbacks of input-output linearizing controllers are the need for precise dynamics models and not being able to account for input constraints. Model uncertainty is common in almost every robotic application and input saturation is present in every real world system. In this paper, we address both challenges for the specific case of bipedal robot control by the use of reinforcement learni… ▽ More

    Submitted 2 May, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: Final version appearing in Learning for Dynamics and Control (L4DC) 2020 Conference

  8. arXiv:2004.02766  [pdf, other

    cs.LG math.DS math.OC stat.ML

    Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

    Authors: Tyler Westenbroek, Eric Mazumdar, David Fridovich-Keil, Valmik Prabhu, Claire J. Tomlin, S. Shankar Sastry

    Abstract: This paper proposes a framework for adaptively learning a feedback linearization-based tracking controller for an unknown system using discrete-time model-free policy-gradient parameter update rules. The primary advantage of the scheme over standard model-reference adaptive control techniques is that it does not require the learned inverse model to be invertible at all instances of time. This enab… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

  9. arXiv:1910.13272  [pdf, other

    math.OC cs.AI cs.LG eess.SY

    Feedback Linearization for Unknown Systems via Reinforcement Learning

    Authors: Tyler Westenbroek, David Fridovich-Keil, Eric Mazumdar, Shreyas Arora, Valmik Prabhu, S. Shankar Sastry, Claire J. Tomlin

    Abstract: We present a novel approach to control design for nonlinear systems which leverages model-free policy optimization techniques to learn a linearizing controller for a physical plant with unknown dynamics. Feedback linearization is a technique from nonlinear control which renders the input-output dynamics of a nonlinear plant \emph{linear} under application of an appropriate feedback controller. Onc… ▽ More

    Submitted 21 April, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

  10. arXiv:1904.12768  [pdf, other

    cs.GT stat.ML

    Competitive Statistical Estimation with Strategic Data Sources

    Authors: Tyler Westenbroek, Roy Dong, Lillian J. Ratliff, S. Shankar Sastry

    Abstract: In recent years, data has played an increasingly important role in the economy as a good in its own right. In many settings, data aggregators cannot directly verify the quality of the data they purchase, nor the effort exerted by data sources when creating the data. Recent work has explored mechanisms to ensure that the data sources share high quality data with a single data aggregator, addressing… ▽ More

    Submitted 29 April, 2019; originally announced April 2019.

    Comments: accepted in the IEEE Transactions on Automatic Control

  11. arXiv:1903.11781  [pdf, other

    math.DS

    Technical Report: Optimal Control of Piecwise-smooth Control Systems via Singular Perturbations

    Authors: Tyler Westenbroek, Xiaobin Xiong, Aaron D Ames, S Shankar Sastry

    Abstract: This paper investigates optimal control problems formulated over a class of piecewise-smooth vector fields. Instead of optimizing over the discontinuous system directly, we instead formulate optimal control problems over a family of regularizations which are obtained by "smoothing out" the discontinuity in the original system. It is shown that the smooth problems can be used to obtain accurate der… ▽ More

    Submitted 31 March, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

  12. arXiv:1803.08092  [pdf, other

    math.DS

    A New Solution Concept and Family of Relaxations for Hybrid Dynamical Systems

    Authors: Tyler Westenbroek, Humberto Gonzalez, S. Shankar Sastry

    Abstract: We introduce a holistic framework for the analysis, approximation and control of the trajectories of hybrid dynamical systems which display event-triggered discrete jumps in the continuous state. We begin by demonstrating how to explicitly represent the dynamics of this class of systems using a single piecewise-smooth vector field defined on a manifold, and then employ Filippov's solution concept… ▽ More

    Submitted 14 December, 2018; v1 submitted 21 March, 2018; originally announced March 2018.

    Comments: Final Version Appearing in CDC 2018

  13. arXiv:1710.08483  [pdf, other

    math.DS

    On the Relaxation of Hybrid Dynamical Systems

    Authors: Tyler Westenbroek, S. Shankar Sastry, Humberto Gonzalez

    Abstract: Hybrid dynamical systems have proven to be a powerful modeling abstraction, yet fundamental questions regarding the dynamical properties of these systems remain. In this paper, we develop a novel class of relaxations which we use to recover a number of classic systems theoretic properties for hybrid systems, such as existence and uniqueness of trajectories, even past the point of Zeno. Our relaxat… ▽ More

    Submitted 23 October, 2017; originally announced October 2017.

  14. arXiv:1704.01195  [pdf, other

    cs.GT

    Statistical Estimation with Strategic Data Sources in Competitive Settings

    Authors: Tyler Westenbroek, Roy Dong, Lillian J. Ratliff, S. Shankar Sastry

    Abstract: In this paper, we introduce a preliminary model for interactions in the data market. Recent research has shown ways in which a data aggregator can design mechanisms for users to ensure the quality of data, even in situations where the users are effort-averse (i.e. prefer to submit lower-quality estimates) and the data aggregator cannot observe the effort exerted by the users (i.e. the contract suf… ▽ More

    Submitted 4 April, 2017; originally announced April 2017.

  15. arXiv:1510.09127  [pdf, other

    math.OC

    Optimal Control of Hybrid Systems Using a Feedback Relaxed Control Formulation

    Authors: Tyler Westenbroek, Humberto Gonzalez

    Abstract: We present a numerically tractable formulation for computing the optimal control of the class of hybrid dynamical systems whose trajectories are continuous. Our formulation, an extension of existing relaxed-control techniques for switched dynamical systems, incorporates the domain information of each discrete mode as part of the constraints in the optimization problem. Moreover, our numerical resu… ▽ More

    Submitted 25 May, 2016; v1 submitted 30 October, 2015; originally announced October 2015.