Showing 1–7 of 7 results for author: Villaflor, A

  1. arXiv:2403.07232  [pdf, other]

    cs.RO cs.LG

    Tractable Joint Prediction and Planning over Discrete Behavior Modes for Urban Driving

    Authors: Adam Villaflor, Brian Yang, Huangyuan Su, Katerina Fragkiadaki, John Dolan, Jeff Schneider

    Abstract: Significant progress has been made in training multimodal trajectory forecasting models for autonomous driving. However, effectively integrating these models with downstream planners and model-based control approaches is still an open problem. Although these models have conventionally been evaluated for open-loop prediction, we show that they can be used to parameterize autoregressive closed-loop…

    Submitted 11 March, 2024; originally announced March 2024.

  2. arXiv:2207.10295  [pdf, other]

    cs.LG cs.AI cs.RO

    Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning

    Authors: Adam Villaflor, Zhe Huang, Swapnil Pande, John Dolan, Jeff Schneider

    Abstract: Impressive results in natural language processing (NLP) based on the Transformer neural network architecture have inspired researchers to explore viewing offline reinforcement learning (RL) as a generic sequence modeling problem. Recent works based on this paradigm have achieved state-of-the-art results in several of the mostly deterministic offline Atari and D4RL benchmarks. However, because thes…

    Submitted 21 July, 2022; originally announced July 2022.

  3. arXiv:2204.12026  [pdf, other]

    cs.LG

    BATS: Best Action Trajectory Stitching

    Authors: Ian Char, Viraj Mehta, Adam Villaflor, John M. Dolan, Jeff Schneider

    Abstract: The problem of offline reinforcement learning focuses on learning a good policy from a log of environment interactions. Past efforts for developing algorithms in this area have revolved around introducing constraints to online reinforcement learning algorithms to ensure the actions of the learned policy are constrained to the logged data. In this work, we explore an alternative approach by plannin…

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted to NeurIPS Offline RL Workshop 2021

  4. arXiv:2103.12070  [pdf, ps, other]

    cs.LG cs.AI cs.MA cs.RO

    Learning to Robustly Negotiate Bi-Directional Lane Usage in High-Conflict Driving Scenarios

    Authors: Christoph Killing, Adam Villaflor, John M. Dolan

    Abstract: Recently, autonomous driving has made substantial progress in addressing the most common traffic scenarios like intersection navigation and lane changing. However, most of these successes have been limited to scenarios with well-defined traffic rules and require minimal negotiation with other vehicles. In this paper, we introduce a previously unconsidered, yet everyday, high-conflict driving scena…

    Submitted 22 March, 2021; originally announced March 2021.

    Comments: 7 pages, 7 figures

  5. arXiv:1810.07167  [pdf, other]

    cs.RO cs.AI cs.LG

    Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation

    Authors: Gregory Kahn, Adam Villaflor, Pieter Abbeel, Sergey Levine

    Abstract: A general-purpose intelligent robot must be able to learn autonomously and be able to accomplish multiple tasks in order to be deployed in the real world. However, standard reinforcement learning approaches learn separate task-specific policies and assume the reward function for each task is known a priori. We propose a framework that learns event cues from off-policy data, and can flexibly combin…

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: Accepted to the Conference on Robot Learning (CoRL) 2018. Video at https://youtu.be/lOLT7zifEkg

  6. arXiv:1709.10489  [pdf, other]

    cs.LG cs.AI cs.RO

    Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

    Authors: Gregory Kahn, Adam Villaflor, Bosen Ding, Pieter Abbeel, Sergey Levine

    Abstract: Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn fro…

    Submitted 17 May, 2018; v1 submitted 29 September, 2017; originally announced September 2017.

    Comments: ICRA 2018

  7. arXiv:1702.01182  [pdf, other]

    cs.LG cs.RO

    Uncertainty-Aware Reinforcement Learning for Collision Avoidance

    Authors: Gregory Kahn, Adam Villaflor, Vitchyr Pong, Pieter Abbeel, Sergey Levine

    Abstract: Reinforcement learning can enable complex, adaptive behavior to be learned automatically for autonomous robotic platforms. However, practical deployment of reinforcement learning methods must contend with the fact that the training process itself can be unsafe for the robot. In this paper, we consider the specific case of a mobile robot learning to navigate an a priori unknown environment while av…

    Submitted 3 February, 2017; originally announced February 2017.