Showing 1–7 of 7 results for author: Stachowicz, K

  1. arXiv:2405.04714  [pdf, other]

    cs.RO cs.AI cs.LG

    RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes

    Authors: Kyle Stachowicz, Sergey Levine

    Abstract: Reinforcement learning provides an appealing framework for robotic control due to its ability to learn expressive policies purely through real-world interaction. However, this requires addressing real-world constraints and avoiding catastrophic failures during training, which might severely impede both learning progress and the performance of the final policy. In many robotics settings, this amoun…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: In review, RSS 2024

  2. arXiv:2403.00991  [pdf, other]

    cs.RO cs.CV cs.LG

    SELFI: Autonomous Self-Improvement with Reinforcement Learning for Social Navigation

    Authors: Noriaki Hirose, Dhruv Shah, Kyle Stachowicz, Ajay Sridhar, Sergey Levine

    Abstract: Autonomous self-improving robots that interact and improve with experience are key to the real-world deployment of robotic systems. In this paper, we propose an online learning method, SELFI, that leverages online robot experience to rapidly fine-tune pre-trained control policies efficiently. SELFI applies online model-free reinforcement learning on top of offline model-based learning to bring out…

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 11 pages, 13 figures, 2 tables

  3. arXiv:2306.14846  [pdf, other]

    cs.RO cs.CV cs.LG

    ViNT: A Foundation Model for Visual Navigation

    Authors: Dhruv Shah, Ajay Sridhar, Nitish Dashora, Kyle Stachowicz, Kevin Black, Noriaki Hirose, Sergey Levine

    Abstract: General-purpose pre-trained models ("foundation models") have enabled practitioners to produce generalizable solutions for individual machine learning problems with datasets that are significantly smaller than those required for learning from scratch. Such models are typically trained on large and diverse datasets with weak supervision, consuming much more training data than is available for any i…

    Submitted 24 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted for oral presentation at CoRL 2023

  4. arXiv:2304.09831  [pdf, other]

    cs.RO cs.AI cs.LG

    FastRLAP: A System for Learning High-Speed Driving via Deep RL and Autonomous Practicing

    Authors: Kyle Stachowicz, Dhruv Shah, Arjun Bhorkar, Ilya Kostrikov, Sergey Levine

    Abstract: We present a system that enables an autonomous small-scale RC car to drive aggressively from visual observations using reinforcement learning (RL). Our system, FastRLAP (faster lap), trains autonomously in the real world, without human interventions, and without requiring any simulation or expert demonstrations. Our system integrates a number of important components to make this possible: we initi…

    Submitted 19 April, 2023; originally announced April 2023.

  5. arXiv:2201.12925  [pdf, other]

    math.OC cs.RO

    Multimodal Maximum Entropy Dynamic Games

    Authors: Oswin So, Kyle Stachowicz, Evangelos A. Theodorou

    Abstract: Environments with multi-agent interactions often result in a rich set of modalities of behavior between agents due to the inherent suboptimality of decision making processes when agents settle for satisfactory decisions. However, existing algorithms for solving these dynamic games are strictly unimodal and fail to capture the intricate multimodal behaviors of the agents. In this paper, we propose MME…

    Submitted 2 February, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: Under review for RSS 2022. Supplementary Video: https://youtu.be/7molN_Q38dk

  6. arXiv:2111.09207  [pdf, other]

    cs.RO eess.SY

    Optimal-Horizon Model-Predictive Control with Differential Dynamic Programming

    Authors: Kyle Stachowicz, Evangelos A. Theodorou

    Abstract: We present an algorithm, based on the Differential Dynamic Programming framework, to handle trajectory optimization problems in which the horizon is determined online rather than fixed a priori. This algorithm exhibits exact one-step convergence for linear, quadratic, time-invariant problems and is fast enough for real-time nonlinear model-predictive control. We show derivations for the nonlinear…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Submitted to ICRA 2022

  7. Safety Embedded Differential Dynamic Programming Using Discrete Barrier States

    Authors: Hassan Almubarak, Kyle Stachowicz, Nader Sadegh, Evangelos A. Theodorou

    Abstract: Certified safe control is a growing challenge in robotics, especially when performance and safety objectives must be concurrently achieved. In this work, we extend the barrier state (BaS) concept, recently proposed for safe stabilization of continuous time systems, to safety embedded trajectory optimization for discrete time systems using discrete barrier states (DBaS). The constructed DBaS is emb…

    Submitted 2 February, 2022; v1 submitted 30 May, 2021; originally announced May 2021.

    Comments: Added extensive quantitative comparisons and analysis in the implementation examples, and revised discussions and illustrations

    Journal ref: IEEE Robotics and Automation Letters, vol. 7, no. 2, April 2022