Skip to main content

Showing 1–3 of 3 results for author: Westenbroek, T

Searching in archive stat. Search in all archives.
.
  1. arXiv:2305.09619  [pdf, other

    cs.LG math.OC stat.ML

    The Power of Learned Locally Linear Models for Nonlinear Policy Optimization

    Authors: Daniel Pfrommer, Max Simchowitz, Tyler Westenbroek, Nikolai Matni, Stephen Tu

    Abstract: A common pipeline in learning-based control is to iteratively estimate a model of system dynamics, and apply a trajectory optimization algorithm - e.g.~$\mathtt{iLQR}$ - on the learned model to minimize a target cost. This paper conducts a rigorous analysis of a simplified variant of this strategy for general nonlinear systems. We analyze an algorithm which iterates between estimating local linear… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  2. arXiv:2004.02766  [pdf, other

    cs.LG math.DS math.OC stat.ML

    Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

    Authors: Tyler Westenbroek, Eric Mazumdar, David Fridovich-Keil, Valmik Prabhu, Claire J. Tomlin, S. Shankar Sastry

    Abstract: This paper proposes a framework for adaptively learning a feedback linearization-based tracking controller for an unknown system using discrete-time model-free policy-gradient parameter update rules. The primary advantage of the scheme over standard model-reference adaptive control techniques is that it does not require the learned inverse model to be invertible at all instances of time. This enab… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

  3. arXiv:1904.12768  [pdf, other

    cs.GT stat.ML

    Competitive Statistical Estimation with Strategic Data Sources

    Authors: Tyler Westenbroek, Roy Dong, Lillian J. Ratliff, S. Shankar Sastry

    Abstract: In recent years, data has played an increasingly important role in the economy as a good in its own right. In many settings, data aggregators cannot directly verify the quality of the data they purchase, nor the effort exerted by data sources when creating the data. Recent work has explored mechanisms to ensure that the data sources share high quality data with a single data aggregator, addressing… ▽ More

    Submitted 29 April, 2019; originally announced April 2019.

    Comments: accepted in the IEEE Transactions on Automatic Control