
Showing 1–25 of 25 results for author: Rhinehart, N

  1. arXiv:2401.18075

    cs.CV

    CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting

    Authors: Jiezhi Yang, Khushi Desai, Charles Packer, Harshil Bhatia, Nicholas Rhinehart, Rowan McAllister, Joseph Gonzalez

    Abstract: We propose CARFF: Conditional Auto-encoded Radiance Field for 3D Scene Forecasting, a method for predicting future 3D scenes given past observations, such as 2D ego-centric images. Our method maps an image to a distribution over plausible 3D latent scene configurations using a probabilistic encoder, and predicts the evolution of the hypothesized scenes through time. Our latent scene representation…

    Submitted 31 January, 2024; originally announced January 2024.

  2. arXiv:2305.12032

    cs.CV cs.LG cs.MA cs.RO

    The Waymo Open Sim Agents Challenge

    Authors: Nico Montali, John Lambert, Paul Mougin, Alex Kuefler, Nick Rhinehart, Michelle Li, Cole Gulino, Tristan Emrich, Zoey Yang, Shimon Whiteson, Brandyn White, Dragomir Anguelov

    Abstract: Simulation with realistic, interactive agents represents a key task for autonomous vehicle software development. In this work, we introduce the Waymo Open Sim Agents Challenge (WOSAC). WOSAC is the first public challenge to tackle this task and propose corresponding metrics. The goal of the challenge is to stimulate the design of realistic simulators that can be used to evaluate and train a behavi…

    Submitted 11 December, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted to NeurIPS 2023, Track on Datasets and Benchmarks. Public leaderboard available at https://waymo.com/open/challenges/2023/sim-agents/

  3. arXiv:2212.08244

    cs.RO cs.CV cs.LG

    Offline Reinforcement Learning for Visual Navigation

    Authors: Dhruv Shah, Arjun Bhorkar, Hrish Leen, Ilya Kostrikov, Nick Rhinehart, Sergey Levine

    Abstract: Reinforcement learning can enable robots to navigate to distant goals while optimizing user-specified reward functions, including preferences for following lanes, staying on paved paths, or avoiding freshly mowed grass. However, online learning from trial-and-error for real-world robots is logistically challenging, and methods that instead can utilize existing datasets of robotic navigation data c…

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: Project page https://sites.google.com/view/revind/home

  4. arXiv:2112.03899

    cs.LG cs.AI

    Information is Power: Intrinsic Control via Information Capture

    Authors: Nicholas Rhinehart, Jenny Wang, Glen Berseth, John D. Co-Reyes, Danijar Hafner, Chelsea Finn, Sergey Levine

    Abstract: Humans and animals explore their environment and acquire useful skills even in the absence of clear goals, exhibiting intrinsic motivation. The study of intrinsic motivation in artificial agents is concerned with the following question: what is a good general-purpose objective for an agent? We study this question in dynamic partially-observed environments, and argue that a compact and general lear…

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: NeurIPS 2021

  5. Hybrid Imitative Planning with Geometric and Predictive Costs in Off-road Environments

    Authors: Nitish Dashora, Daniel Shin, Dhruv Shah, Henry Leopold, David Fan, Ali Agha-Mohammadi, Nicholas Rhinehart, Sergey Levine

    Abstract: Geometric methods for solving open-world off-road navigation tasks, by learning occupancy and metric maps, provide good generalization but can be brittle in outdoor environments that violate their assumptions (e.g., tall grass). Learning-based methods can directly learn collision-free behavior from raw observations, but are difficult to integrate with standard geometry-based pipelines. This create…

    Submitted 21 November, 2021; originally announced November 2021.

  6. arXiv:2107.07394

    cs.LG cs.AI

    Explore and Control with Adversarial Surprise

    Authors: Arnaud Fickinger, Natasha Jaques, Samyak Parajuli, Michael Chang, Nicholas Rhinehart, Glen Berseth, Stuart Russell, Sergey Levine

    Abstract: Unsupervised reinforcement learning (RL) studies how to leverage environment statistics to learn useful behaviors without the cost of reward engineering. However, a central challenge in unsupervised RL is to extract behaviors that meaningfully affect the world and cover the range of possible outcomes, without getting distracted by inherently unpredictable, uncontrollable, and stochastic elements i…

    Submitted 28 December, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

  7. arXiv:2104.10558

    cs.RO cs.CV cs.LG

    Contingencies from Observations: Tractable Contingency Planning with Learned Behavior Models

    Authors: Nicholas Rhinehart, Jeff He, Charles Packer, Matthew A. Wright, Rowan McAllister, Joseph E. Gonzalez, Sergey Levine

    Abstract: Humans have a remarkable ability to make decisions by accurately reasoning about future events, including the future behaviors and states of mind of other agents. Consider driving a car through a busy intersection: it is necessary to reason about the physics of the vehicle, the intentions of other drivers, and their beliefs about your own intentions. If you signal a turn, another driver might yiel…

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: To be published at ICRA 2021. Project page: https://sites.google.com/view/contingency-planning

  8. arXiv:2104.05859

    cs.RO cs.AI cs.LG

    Rapid Exploration for Open-World Navigation with Latent Goal Models

    Authors: Dhruv Shah, Benjamin Eysenbach, Gregory Kahn, Nicholas Rhinehart, Sergey Levine

    Abstract: We describe a robotic learning system for autonomous exploration and navigation in diverse, open-world environments. At the core of our method is a learned latent variable model of distances and actions, along with a non-parametric topological memory of images. We use an information bottleneck to regularize the learned policy, giving us (i) a compact visual representation of goals, (ii) improved g…

    Submitted 11 October, 2023; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Presented at 5th Annual Conference on Robot Learning (CoRL 2021), London, UK as an Oral Talk. Project page and dataset release at https://sites.google.com/view/recon-robot

  9. ViNG: Learning Open-World Navigation with Visual Goals

    Authors: Dhruv Shah, Benjamin Eysenbach, Gregory Kahn, Nicholas Rhinehart, Sergey Levine

    Abstract: We propose a learning-based navigation system for reaching visually indicated goals and demonstrate this system on a real mobile robot platform. Learning provides an appealing alternative to conventional methods for robotic navigation: instead of reasoning about environments in terms of geometry and maps, learning can enable a robot to learn about navigational affordances, understand what types of…

    Submitted 26 March, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: Presented at International Conference on Robotics and Automation (ICRA) 2021

  10. arXiv:2011.10024

    cs.LG cs.RO

    Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

    Authors: Avi Singh, Huihan Liu, Gaoyue Zhou, Albert Yu, Nicholas Rhinehart, Sergey Levine

    Abstract: Reinforcement learning provides a general framework for flexible decision making and control, but requires extensive data collection for each new task that an agent needs to learn. In other machine learning fields, such as natural language processing or computer vision, pre-training on large, previously collected datasets to bootstrap learning for new tasks has emerged as a powerful paradigm to re…

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: First two authors contributed equally. Project website: https://sites.google.com/view/parrot-rl

  11. arXiv:2010.14497

    cs.LG cs.AI cs.RO stat.ML

    Conservative Safety Critics for Exploration

    Authors: Homanga Bharadhwaj, Aviral Kumar, Nicholas Rhinehart, Sergey Levine, Florian Shkurti, Animesh Garg

    Abstract: Safe exploration presents a major challenge in reinforcement learning (RL): when active data collection requires deploying partially trained policies, we must ensure that these policies avoid catastrophically unsafe regions, while still enabling trial and error learning. In this paper, we target the problem of safe exploration in RL by learning a conservative safety estimate of environment states…

    Submitted 26 April, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: Published as a conference paper in ICLR 2021

  12. arXiv:2006.14911

    cs.LG cs.RO stat.ML

    Can Autonomous Vehicles Identify, Recover From, and Adapt to Distribution Shifts?

    Authors: Angelos Filos, Panagiotis Tigas, Rowan McAllister, Nicholas Rhinehart, Sergey Levine, Yarin Gal

    Abstract: Out-of-training-distribution (OOD) scenarios are a common challenge of learning agents at deployment, typically leading to arbitrary deductions and poorly-informed decisions. In principle, detection of and adaptation to OOD scenes can mitigate their adverse effects. In this paper, we highlight the limitations of current approaches to novel driving scenes and propose an epistemic uncertainty-aware…

    Submitted 2 September, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: The first two authors contributed equally. Accepted at ICML 2020. Supplementary videos and code available at: https://sites.google.com/view/av-detect-recover-adapt

  13. arXiv:2003.08376

    cs.CV cs.AI cs.LG cs.MA cs.RO

    Inverting the Pose Forecasting Pipeline with SPF2: Sequential Pointcloud Forecasting for Sequential Pose Forecasting

    Authors: Xinshuo Weng, Jianren Wang, Sergey Levine, Kris Kitani, Nicholas Rhinehart

    Abstract: Many autonomous systems forecast aspects of the future in order to aid decision-making. For example, self-driving vehicles and robotic manipulation systems often forecast future object poses by first detecting and tracking objects. However, this detect-then-forecast pipeline is expensive to scale, as pose forecasting algorithms typically require labeled sequences of object poses, which are costly…

    Submitted 6 November, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: Published in Conference on Robot Learning (CoRL), 2020. Project webpage: http://www.xinshuoweng.com/projects/SPF2/

  14. arXiv:1912.05510

    cs.LG cs.AI stat.ML

    SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments

    Authors: Glen Berseth, Daniel Geng, Coline Devin, Nicholas Rhinehart, Chelsea Finn, Dinesh Jayaraman, Sergey Levine

    Abstract: Every living organism struggles against disruptive environmental forces to carve out and maintain an orderly niche. We propose that such a struggle to achieve and preserve order might offer a principle for the emergence of useful behaviors in artificial agents. We formalize this idea into an unsupervised reinforcement learning method called surprise minimizing reinforcement learning (SMiRL). SMiRL…

    Submitted 7 February, 2021; v1 submitted 11 December, 2019; originally announced December 2019.

    Comments: ICLR 2021

    ACM Class: G.3

  15. arXiv:1905.01296

    cs.CV cs.AI cs.LG cs.RO stat.ML

    PRECOG: PREdiction Conditioned On Goals in Visual Multi-Agent Settings

    Authors: Nicholas Rhinehart, Rowan McAllister, Kris Kitani, Sergey Levine

    Abstract: For autonomous vehicles (AVs) to behave appropriately on roads populated by human-driven vehicles, they must be able to reason about the uncertain intentions and decisions of other drivers from rich perceptual information. Towards these capabilities, we present a probabilistic forecasting model of future interactions between a variable number of agents. We perform both standard forecasting and the…

    Submitted 30 September, 2019; v1 submitted 3 May, 2019; originally announced May 2019.

    Comments: To appear at the IEEE International Conference on Computer Vision (ICCV 2019). Website: https://sites.google.com/view/precog

  16. arXiv:1904.06250

    cs.CV cs.AI cs.LG

    Generative Hybrid Representations for Activity Forecasting with No-Regret Learning

    Authors: Jiaqi Guan, Ye Yuan, Kris M. Kitani, Nicholas Rhinehart

    Abstract: Automatically reasoning about future human behaviors is a difficult problem but has significant practical applications to assistive systems. Part of this difficulty stems from learning systems' inability to represent all kinds of behaviors. Some behaviors, such as motion, are best described with continuous representations, whereas others, such as picking up a cup, are best described with discrete…

    Submitted 3 April, 2020; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: Oral presentation at CVPR 2020

  17. arXiv:1810.06544

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Deep Imitative Models for Flexible Inference, Planning, and Control

    Authors: Nicholas Rhinehart, Rowan McAllister, Sergey Levine

    Abstract: Imitation Learning (IL) is an appealing approach to learn desirable autonomous behavior. However, directing IL to achieve arbitrary goals is difficult. In contrast, planning-based algorithms use dynamics models and reward functions to achieve goals. Yet, reward functions that evoke desirable behavior are often difficult to specify. In this paper, we propose Imitative Models to combine the benefits…

    Submitted 30 September, 2019; v1 submitted 15 October, 2018; originally announced October 2018.

  18. arXiv:1810.01266

    cs.LG cs.AI stat.ML

    Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

    Authors: Arjun Sharma, Mohit Sharma, Nicholas Rhinehart, Kris M. Kitani

    Abstract: The use of imitation learning to learn a single policy for a complex task that has multiple modes or hierarchical structure can be challenging. In fact, previous work has shown that when the modes are known, learning separate policies for each mode or sub-task can greatly improve the performance of imitation learning. In this work, we discover the interaction between sub-tasks from their resulting…

    Submitted 11 March, 2019; v1 submitted 29 September, 2018; originally announced October 2018.

    Comments: Accepted as conference paper at ICLR'19

  19. arXiv:1806.08479

    cs.HC cs.AI

    Human-Interactive Subgoal Supervision for Efficient Inverse Reinforcement Learning

    Authors: Xinlei Pan, Eshed Ohn-Bar, Nicholas Rhinehart, Yan Xu, Yilin Shen, Kris M. Kitani

    Abstract: Humans are able to understand and perform complex tasks by strategically structuring the tasks into incremental steps or subgoals. For a robot attempting to learn to perform a sequential task with critical subgoal states, such states can provide a natural opportunity for interaction with a human expert. This paper analyzes the benefit of incorporating a notion of subgoals into Inverse Reinforcemen…

    Submitted 21 June, 2018; originally announced June 2018.

  20. arXiv:1806.07822

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Learning Neural Parsers with Deterministic Differentiable Imitation Learning

    Authors: Tanmay Shankar, Nicholas Rhinehart, Katharina Muelling, Kris M. Kitani

    Abstract: We explore the problem of learning to decompose spatial tasks into segments, as exemplified by the problem of a painting robot covering a large object. Inspired by the ability of classical decision tree algorithms to construct structured partitions of their input spaces, we formulate the problem of decomposing objects into segments as a parsing approach. We make the insight that the derivation of…

    Submitted 19 September, 2018; v1 submitted 20 June, 2018; originally announced June 2018.

    Comments: Accepted to Conference on Robot Learning, CoRL 2018

  21. arXiv:1709.08520

    stat.ML cs.LG

    Predictive-State Decoders: Encoding the Future into Recurrent Networks

    Authors: Arun Venkatraman, Nicholas Rhinehart, Wen Sun, Lerrel Pinto, Martial Hebert, Byron Boots, Kris M. Kitani, J. Andrew Bagnell

    Abstract: Recurrent neural networks (RNNs) are a vital modeling technique that rely on internal states learned indirectly by optimization of a supervised, unsupervised, or reinforcement training loss. RNNs are used to model dynamic processes that are characterized by underlying latent states whose form is often unknown, precluding its analytic representation inside an RNN. In the Predictive-State Representa…

    Submitted 25 September, 2017; originally announced September 2017.

    Comments: NIPS 2017

  22. arXiv:1709.06030

    cs.LG stat.ML

    N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning

    Authors: Anubhav Ashok, Nicholas Rhinehart, Fares Beainy, Kris M. Kitani

    Abstract: While bigger and deeper neural network architectures continue to advance the state-of-the-art for many computer vision tasks, real-world adoption of these networks is impeded by hardware and speed constraints. Conventional model compression methods attempt to address this problem by modifying the architecture manually or using pre-defined heuristics. Since the space of all reduced architectures is…

    Submitted 17 December, 2017; v1 submitted 18 September, 2017; originally announced September 2017.

  23. arXiv:1612.07796

    cs.CV

    First-Person Activity Forecasting with Online Inverse Reinforcement Learning

    Authors: Nicholas Rhinehart, Kris M. Kitani

    Abstract: We address the problem of incrementally modeling and forecasting long-term goals of a first-person camera wearer: what the user will do, where they will go, and what goal they seek. In contrast to prior work in trajectory forecasting, our algorithm, DARKO, goes further to reason about semantic states (will I pick up an object?), and future goal states that are far in terms of both space and time.…

    Submitted 6 August, 2017; v1 submitted 22 December, 2016; originally announced December 2016.

    Comments: To appear at ICCV 2017 (Oral)

  24. arXiv:1605.01679

    cs.CV

    Learning Action Maps of Large Environments via First-Person Vision

    Authors: Nicholas Rhinehart, Kris M. Kitani

    Abstract: When people observe and interact with physical spaces, they are able to associate functionality to regions in the environment. Our goal is to automate dense functional understanding of large spaces by leveraging sparse activity demonstrations recorded from an ego-centric viewpoint. The method we describe enables functionality estimation in large scenes where people have behaved, as well as novel s…

    Submitted 5 May, 2016; originally announced May 2016.

    Comments: To appear at CVPR 2016

  25. arXiv:1410.7376

    cs.CV

    Visual Chunking: A List Prediction Framework for Region-Based Object Detection

    Authors: Nicholas Rhinehart, Jiaji Zhou, Martial Hebert, J. Andrew Bagnell

    Abstract: We consider detecting objects in an image by iteratively selecting from a set of arbitrarily shaped candidate regions. Our generic approach, which we term visual chunking, reasons about the locations of multiple object instances in an image while expressively describing object boundaries. We design an optimization criterion for measuring the performance of a list of such detections as a natural ex…

    Submitted 16 March, 2015; v1 submitted 27 October, 2014; originally announced October 2014.

    Comments: To appear at ICRA 2015