Skip to main content

Showing 1–22 of 22 results for author: McAllister, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03995  [pdf, other

    eess.SY cs.AI

    AC4MPC: Actor-Critic Reinforcement Learning for Nonlinear Model Predictive Control

    Authors: Rudolf Reiter, Andrea Ghezzi, Katrin Baumgärtner, Jasper Hoffmann, Robert D. McAllister, Moritz Diehl

    Abstract: \Ac{MPC} and \ac{RL} are two powerful control strategies with, arguably, complementary advantages. In this work, we show how actor-critic \ac{RL} techniques can be leveraged to improve the performance of \ac{MPC}. The \ac{RL} critic is used as an approximation of the optimal value function, and an actor roll-out provides an initial guess for primal variables of the \ac{MPC}. A parallel control arc… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2401.18075  [pdf, other

    cs.CV

    CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting

    Authors: Jiezhi Yang, Khushi Desai, Charles Packer, Harshil Bhatia, Nicholas Rhinehart, Rowan McAllister, Joseph Gonzalez

    Abstract: We propose CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting, a method for predicting future 3D scenes given past observations, such as 2D ego-centric images. Our method maps an image to a distribution over plausible 3D latent scene configurations using a probabilistic encoder, and predicts the evolution of the hypothesized scenes through time. Our latent scene representation… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  3. arXiv:2312.17168  [pdf, other

    cs.LG cs.AI

    Can Active Sampling Reduce Causal Confusion in Offline Reinforcement Learning?

    Authors: Gunshi Gupta, Tim G. J. Rudner, Rowan Thomas McAllister, Adrien Gaidon, Yarin Gal

    Abstract: Causal confusion is a phenomenon where an agent learns a policy that reflects imperfect spurious correlations in the data. Such a policy may falsely appear to be optimal during training if most of the training data contain such spurious correlations. This phenomenon is particularly pronounced in domains such as robotics, with potentially large gaps between the open- and closed-loop performance of… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: Published in Proceedings of the 2nd Conference on Causal Learning and Reasoning (CLeaR 2021)

  4. arXiv:2310.08710  [pdf, other

    cs.RO cs.LG

    Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research

    Authors: Cole Gulino, Justin Fu, Wenjie Luo, George Tucker, Eli Bronstein, Yiren Lu, Jean Harb, Xinlei Pan, Yan Wang, Xiangyu Chen, John D. Co-Reyes, Rishabh Agarwal, Rebecca Roelofs, Yao Lu, Nico Montali, Paul Mougin, Zoey Yang, Brandyn White, Aleksandra Faust, Rowan McAllister, Dragomir Anguelov, Benjamin Sapp

    Abstract: Simulation is an essential tool to develop and benchmark autonomous vehicle planning software in a safe and cost-effective manner. However, realistic simulation requires accurate modeling of nuanced and complex multi-agent interactive behaviors. To address these challenges, we introduce Waymax, a new data-driven simulator for autonomous driving in multi-agent scenes, designed for large-scale simul… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  5. arXiv:2310.03333  [pdf, other

    cs.CV

    Real-time Multi-modal Object Detection and Tracking on Edge for Regulatory Compliance Monitoring

    Authors: Jia Syuen Lim, Ziwei Wang, Jiajun Liu, Abdelwahed Khamis, Reza Arablouei, Robert Barlow, Ryan McAllister

    Abstract: Regulatory compliance auditing across diverse industrial domains requires heightened quality assurance and traceability. Present manual and intermittent approaches to such auditing yield significant challenges, potentially leading to oversights in the monitoring process. To address these issues, we introduce a real-time, multi-modal sensing system employing 3D time-of-flight and RGB cameras, coupl… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  6. arXiv:2301.12012  [pdf, other

    cs.RO cs.LG eess.SY

    In-Distribution Barrier Functions: Self-Supervised Policy Filters that Avoid Out-of-Distribution States

    Authors: Fernando Castañeda, Haruki Nishimura, Rowan McAllister, Koushil Sreenath, Adrien Gaidon

    Abstract: Learning-based control approaches have shown great promise in performing complex tasks directly from high-dimensional perception data for real robotic systems. Nonetheless, the learned controllers can behave unexpectedly if the trajectories of the system divert from the training data distribution, which can compromise safety. In this work, we propose a control filter that wraps any reference polic… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

  7. arXiv:2210.01368  [pdf, other

    cs.LG cs.RO

    RAP: Risk-Aware Prediction for Robust Planning

    Authors: Haruki Nishimura, Jean Mercat, Blake Wulfe, Rowan McAllister, Adrien Gaidon

    Abstract: Robust planning in interactive scenarios requires predicting the uncertain future to make risk-aware decisions. Unfortunately, due to long-tail safety-critical events, the risk is often under-estimated by finite-sampling approximations of probabilistic motion forecasts. This can lead to overconfident and unsafe robot behavior, even with robust planners. Instead of assuming full prediction coverage… ▽ More

    Submitted 11 January, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 22 pages, 14 figures, 3 tables. First two authors contributed equally. Conference on Robot Learning (CoRL) 2022 (oral)

  8. arXiv:2205.07395  [pdf, other

    cs.CY

    Sociotechnical Specification for the Broader Impacts of Autonomous Vehicles

    Authors: Thomas Krendl Gilbert, Aaron J. Snoswell, Michael Dennis, Rowan McAllister, Cathy Wu

    Abstract: Autonomous Vehicles (AVs) will have a transformative impact on society. Beyond the local safety and efficiency of individual vehicles, these effects will also change how people interact with the entire transportation system. This will generate a diverse range of large and foreseeable effects on social outcomes, as well as how those outcomes are distributed. However, the ability to control both the… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: Paper accepted for presentation at ICRA 2022 workshop "Fresh Perspectives on the Future of Autonomous Driving"

  9. arXiv:2204.13319  [pdf, other

    cs.LG cs.RO

    Control-Aware Prediction Objectives for Autonomous Driving

    Authors: Rowan McAllister, Blake Wulfe, Jean Mercat, Logan Ellis, Sergey Levine, Adrien Gaidon

    Abstract: Autonomous vehicle software is typically structured as a modular pipeline of individual components (e.g., perception, prediction, and planning) to help separate concerns into interpretable sub-tasks. Even when end-to-end training is possible, each module has its own set of objectives used for safety assurance, sample efficiency, regularization, or interpretability. However, intermediate objectives… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: Accepted at IEEE International Conference on Robotics and Automation (ICRA) 2022

  10. arXiv:2201.10081  [pdf, ps, other

    cs.LG cs.AI

    Dynamics-Aware Comparison of Learned Reward Functions

    Authors: Blake Wulfe, Ashwin Balakrishna, Logan Ellis, Jean Mercat, Rowan McAllister, Adrien Gaidon

    Abstract: The ability to learn reward functions plays an important role in enabling the deployment of intelligent agents in the real world. However, comparing reward functions, for example as a means of evaluating reward learning methods, presents a challenge. Reward functions are typically compared by considering the behavior of optimized policies, but this approach conflates deficiencies in the reward fun… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

  11. arXiv:2104.12446  [pdf, other

    cs.CV cs.LG cs.RO

    Heterogeneous-Agent Trajectory Forecasting Incorporating Class Uncertainty

    Authors: Boris Ivanovic, Kuan-Hui Lee, Pavel Tokmakov, Blake Wulfe, Rowan McAllister, Adrien Gaidon, Marco Pavone

    Abstract: Reasoning about the future behavior of other agents is critical to safe robot navigation. The multiplicity of plausible futures is further amplified by the uncertainty inherent to agent state estimation from data, including positions, velocities, and semantic class. Forecasting methods, however, typically neglect class uncertainty, conditioning instead only on the agent's most likely class, even t… ▽ More

    Submitted 2 March, 2022; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: 15 pages, 15 figures, 6 tables

  12. arXiv:2104.10558  [pdf, other

    cs.RO cs.CV cs.LG

    Contingencies from Observations: Tractable Contingency Planning with Learned Behavior Models

    Authors: Nicholas Rhinehart, Jeff He, Charles Packer, Matthew A. Wright, Rowan McAllister, Joseph E. Gonzalez, Sergey Levine

    Abstract: Humans have a remarkable ability to make decisions by accurately reasoning about future events, including the future behaviors and states of mind of other agents. Consider driving a car through a busy intersection: it is necessary to reason about the physics of the vehicle, the intentions of other drivers, and their beliefs about your own intentions. If you signal a turn, another driver might yiel… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: To be published at ICRA 2021. Project page: https://sites.google.com/view/contingency-planning

  13. arXiv:2104.10190  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Outcome-Driven Reinforcement Learning via Variational Inference

    Authors: Tim G. J. Rudner, Vitchyr H. Pong, Rowan McAllister, Yarin Gal, Sergey Levine

    Abstract: While reinforcement learning algorithms provide automated acquisition of optimal policies, practical application of such methods requires a number of design decisions, such as manually designing reward functions that not only define the task, but also provide sufficient sha** to accomplish it. In this paper, we view reinforcement learning as inferring policies that achieve desired outcomes, rath… ▽ More

    Submitted 28 December, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: Published in Advances in Neural Information Processing Systems 34 (NeurIPS 2021)

  14. arXiv:2006.14911  [pdf, other

    cs.LG cs.RO stat.ML

    Can Autonomous Vehicles Identify, Recover From, and Adapt to Distribution Shifts?

    Authors: Angelos Filos, Panagiotis Tigas, Rowan McAllister, Nicholas Rhinehart, Sergey Levine, Yarin Gal

    Abstract: Out-of-training-distribution (OOD) scenarios are a common challenge of learning agents at deployment, typically leading to arbitrary deductions and poorly-informed decisions. In principle, detection of and adaptation to OOD scenes can mitigate their adverse effects. In this paper, we highlight the limitations of current approaches to novel driving scenes and propose an epistemic uncertainty-aware… ▽ More

    Submitted 2 September, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: The first two authors contributed equally. Accepted at ICML 2020. Supplementary videos and code available at: https://sites.google.com/view/av-detect-recover-adapt

  15. arXiv:2006.10742  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Invariant Representations for Reinforcement Learning without Reconstruction

    Authors: Amy Zhang, Rowan McAllister, Roberto Calandra, Yarin Gal, Sergey Levine

    Abstract: We study how representation learning can accelerate reinforcement learning from rich observations, such as images, without relying either on domain knowledge or pixel-reconstruction. Our goal is to learn representations that both provide for effective downstream control and invariance to task-irrelevant details. Bisimulation metrics quantify behavioral similarity between states in continuous MDPs,… ▽ More

    Submitted 6 April, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Accepted as an oral at ICLR 2021

  16. arXiv:2004.11345  [pdf, other

    cs.RO cs.AI cs.LG

    Model-Based Meta-Reinforcement Learning for Flight with Suspended Payloads

    Authors: Suneel Belkhale, Rachel Li, Gregory Kahn, Rowan McAllister, Roberto Calandra, Sergey Levine

    Abstract: Transporting suspended payloads is challenging for autonomous aerial vehicles because the payload can cause significant and unpredictable changes to the robot's dynamics. These changes can lead to suboptimal flight performance or even catastrophic failure. Although adaptive control and learning-based methods can in principle adapt to changes in these hybrid robot-payload systems, rapid mid-flight… ▽ More

    Submitted 2 February, 2021; v1 submitted 23 April, 2020; originally announced April 2020.

    Journal ref: IEEE Robotics and Automation Letters 2021

  17. arXiv:1905.13402  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Safety Augmented Value Estimation from Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic Tasks

    Authors: Brijen Thananjeyan, Ashwin Balakrishna, Ugo Rosolia, Felix Li, Rowan McAllister, Joseph E. Gonzalez, Sergey Levine, Francesco Borrelli, Ken Goldberg

    Abstract: Reinforcement learning (RL) for robotics is challenging due to the difficulty in hand-engineering a dense cost function, which can lead to unintended behavior, and dynamical uncertainty, which makes exploration and constraint satisfaction challenging. We address these issues with a new model-based reinforcement learning algorithm, Safety Augmented Value Estimation from Demonstrations (SAVED), whic… ▽ More

    Submitted 15 May, 2020; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: Robotics and Automation Letters and International Conference on Robotics and Automation 2020. First two authors contributed equally

    Journal ref: Robotics and Automation Letters 2020

  18. arXiv:1905.01296  [pdf, other

    cs.CV cs.AI cs.LG cs.RO stat.ML

    PRECOG: PREdiction Conditioned On Goals in Visual Multi-Agent Settings

    Authors: Nicholas Rhinehart, Rowan McAllister, Kris Kitani, Sergey Levine

    Abstract: For autonomous vehicles (AVs) to behave appropriately on roads populated by human-driven vehicles, they must be able to reason about the uncertain intentions and decisions of other drivers from rich perceptual information. Towards these capabilities, we present a probabilistic forecasting model of future interactions between a variable number of agents. We perform both standard forecasting and the… ▽ More

    Submitted 30 September, 2019; v1 submitted 3 May, 2019; originally announced May 2019.

    Comments: To appear at the IEEE International Conference on Computer Vision (ICCV 2019). Website: https://sites.google.com/view/precog

  19. arXiv:1812.10687  [pdf, other

    cs.LG stat.ML

    Robustness to Out-of-Distribution Inputs via Task-Aware Generative Uncertainty

    Authors: Rowan McAllister, Gregory Kahn, Jeff Clune, Sergey Levine

    Abstract: Deep learning provides a powerful tool for machine perception when the observations resemble the training data. However, real-world robotic systems must react intelligently to their observations even in unexpected circumstances. This requires a system to reason about its own uncertainty given unfamiliar, out-of-distribution observations. Approximate Bayesian approaches are commonly used to estimat… ▽ More

    Submitted 27 December, 2018; originally announced December 2018.

  20. arXiv:1810.06544  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Deep Imitative Models for Flexible Inference, Planning, and Control

    Authors: Nicholas Rhinehart, Rowan McAllister, Sergey Levine

    Abstract: Imitation Learning (IL) is an appealing approach to learn desirable autonomous behavior. However, directing IL to achieve arbitrary goals is difficult. In contrast, planning-based algorithms use dynamics models and reward functions to achieve goals. Yet, reward functions that evoke desirable behavior are often difficult to specify. In this paper, we propose Imitative Models to combine the benefits… ▽ More

    Submitted 30 September, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

  21. arXiv:1805.12114  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models

    Authors: Kurtland Chua, Roberto Calandra, Rowan McAllister, Sergey Levine

    Abstract: Model-based reinforcement learning (RL) algorithms can attain excellent sample efficiency, but often lag behind the best model-free algorithms in terms of asymptotic performance. This is especially true with high-capacity parametric function approximators, such as deep networks. In this paper, we study how to bridge this gap, by employing uncertainty-aware dynamics models. We propose a new algorit… ▽ More

    Submitted 2 November, 2018; v1 submitted 30 May, 2018; originally announced May 2018.

    Comments: NIPS 2018, video and code available at https://sites.google.com/view/drl-in-a-handful-of-trials/

  22. arXiv:1602.02523  [pdf, ps, other

    stat.ML cs.LG eess.SY

    Data-Efficient Reinforcement Learning in Continuous-State POMDPs

    Authors: Rowan McAllister, Carl Edward Rasmussen

    Abstract: We present a data-efficient reinforcement learning algorithm resistant to observation noise. Our method extends the highly data-efficient PILCO algorithm (Deisenroth & Rasmussen, 2011) into partially observed Markov decision processes (POMDPs) by considering the filtering process during policy evaluation. PILCO conducts policy search, evaluating each policy by first predicting an analytic distribu… ▽ More

    Submitted 8 February, 2016; originally announced February 2016.