Skip to main content

Showing 1–21 of 21 results for author: Karkus, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.12095  [pdf, other

    cs.CV cs.AI cs.RO

    DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features

    Authors: Letian Wang, Seung Wook Kim, Jiawei Yang, Cunjun Yu, Boris Ivanovic, Steven L. Waslander, Yue Wang, Sanja Fidler, Marco Pavone, Peter Karkus

    Abstract: We propose DistillNeRF, a self-supervised learning framework addressing the challenge of understanding 3D environments from limited 2D observations in autonomous driving. Our method is a generalizable feedforward model that predicts a rich neural scene representation from sparse, single-frame multi-view camera inputs, and is trained self-supervised with differentiable rendering to reconstruct RGB,… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2310.18301  [pdf, other

    cs.RO cs.AI eess.SY

    Interactive Joint Planning for Autonomous Vehicles

    Authors: Yuxiao Chen, Sushant Veer, Peter Karkus, Marco Pavone

    Abstract: In highly interactive driving scenarios, the actions of one agent greatly influences those of its neighbors. Planning safe motions for autonomous vehicles in such interactive environments, therefore, requires reasoning about the impact of the ego's intended motion plan on nearby agents' behavior. Deep-learning-based models have recently achieved great success in trajectory prediction and many mode… ▽ More

    Submitted 22 November, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

  3. arXiv:2310.05885  [pdf, other

    cs.RO

    DTPP: Differentiable Joint Conditional Prediction and Cost Evaluation for Tree Policy Planning in Autonomous Driving

    Authors: Zhiyu Huang, Peter Karkus, Boris Ivanovic, Yuxiao Chen, Marco Pavone, Chen Lv

    Abstract: Motion prediction and cost evaluation are vital components in the decision-making system of autonomous vehicles. However, existing methods often ignore the importance of cost learning and treat them as separate modules. In this study, we employ a tree-structured policy planner and propose a differentiable joint training framework for both ego-conditioned prediction and cost models, resulting in a… ▽ More

    Submitted 23 February, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 2024 IEEE International Conference on Robotics and Automation

  4. arXiv:2304.00673  [pdf, other

    cs.CV

    Partial-View Object View Synthesis via Filtered Inversion

    Authors: Fan-Yun Sun, Jonathan Tremblay, Valts Blukis, Kevin Lin, Danfei Xu, Boris Ivanovic, Peter Karkus, Stan Birchfield, Dieter Fox, Ruohan Zhang, Yunzhu Li, Jiajun Wu, Marco Pavone, Nick Haber

    Abstract: We propose Filtering Inversion (FINV), a learning framework and optimization process that predicts a renderable 3D object representation from one or few partial views. FINV addresses the challenge of synthesizing novel views of objects from partial observations, spanning cases where the object is not entirely in view, is partially occluded, or is only observed from similar views. To achieve this,… ▽ More

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: project website: http://cs.stanford.edu/~sunfanyun/finv

  5. arXiv:2301.11902  [pdf, other

    cs.RO eess.SY

    Tree-structured Policy Planning with Learned Behavior Models

    Authors: Yuxiao Chen, Peter Karkus, Boris Ivanovic, Xinshuo Weng, Marco Pavone

    Abstract: Autonomous vehicles (AVs) need to reason about the multimodal behavior of neighboring agents while planning their own motion. Many existing trajectory planners seek a single trajectory that performs well under \emph{all} plausible futures simultaneously, ignoring bi-directional interactions and thus leading to overly conservative plans. Policy planning, whereby the ego agent plans a policy that re… ▽ More

    Submitted 26 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  6. arXiv:2212.06437  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    DiffStack: A Differentiable and Modular Control Stack for Autonomous Vehicles

    Authors: Peter Karkus, Boris Ivanovic, Shie Mannor, Marco Pavone

    Abstract: Autonomous vehicle (AV) stacks are typically built in a modular fashion, with explicit components performing detection, tracking, prediction, planning, control, etc. While modularity improves reusability, interpretability, and generalizability, it also suffers from compounding errors, information bottlenecks, and integration challenges. To overcome these challenges, a prominent approach is to conv… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: CoRL 2022 camera ready

  7. arXiv:2212.03323  [pdf, other

    cs.RO eess.SY

    Receding Horizon Planning with Rule Hierarchies for Autonomous Vehicles

    Authors: Sushant Veer, Karen Leung, Ryan Cosner, Yuxiao Chen, Peter Karkus, Marco Pavone

    Abstract: Autonomous vehicles must often contend with conflicting planning requirements, e.g., safety and comfort could be at odds with each other if avoiding a collision calls for slamming the brakes. To resolve such conflicts, assigning importance ranking to rules (i.e., imposing a rule hierarchy) has been proposed, which, in turn, induces rankings on trajectories based on the importance of the rules they… ▽ More

    Submitted 12 December, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

  8. arXiv:2211.04878  [pdf, other

    cs.LG cs.AI

    Foundation Models for Semantic Novelty in Reinforcement Learning

    Authors: Tarun Gupta, Peter Karkus, Tong Che, Danfei Xu, Marco Pavone

    Abstract: Effectively exploring the environment is a key challenge in reinforcement learning (RL). We address this challenge by defining a novel intrinsic reward based on a foundation model, such as contrastive language image pretraining (CLIP), which can encode a wealth of domain-independent semantic visual-language knowledge about the world. Specifically, our intrinsic reward is defined based on pre-train… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: Foundation Models for Decision Making Workshop at Neural Information Processing Systems, 2022

  9. arXiv:2210.14584  [pdf, other

    cs.LG cs.RO

    Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models

    Authors: Filippos Christianos, Peter Karkus, Boris Ivanovic, Stefano V. Albrecht, Marco Pavone

    Abstract: Reasoning with occluded traffic agents is a significant open challenge for planning for autonomous vehicles. Recent deep learning models have shown impressive results for predicting occluded agents based on the behaviour of nearby visible agents; however, as we show in experiments, these models are difficult to integrate into downstream planning. To this end, we propose Bi-level Variational Occlus… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 7 pages, 6 figures

  10. arXiv:2105.07593  [pdf, other

    cs.CV cs.AI cs.LG cs.RO stat.ML

    Differentiable SLAM-net: Learning Particle SLAM for Visual Navigation

    Authors: Peter Karkus, Shaojun Cai, David Hsu

    Abstract: Simultaneous localization and map** (SLAM) remains challenging for a number of downstream applications, such as visual robot navigation, because of rapid turns, featureless walls, and poor camera quality. We introduce the Differentiable SLAM Network (SLAM-net) along with a navigation architecture to enable planar robot navigation in previously unseen indoor environments. SLAM-net encodes a parti… ▽ More

    Submitted 19 May, 2021; v1 submitted 16 May, 2021; originally announced May 2021.

    Comments: CVPR 2021, extended results

  11. arXiv:2010.01298  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban

    Authors: Peter Karkus, Mehdi Mirza, Arthur Guez, Andrew Jaegle, Timothy Lillicrap, Lars Buesing, Nicolas Heess, Theophane Weber

    Abstract: Intelligent robots need to achieve abstract objectives using concrete, spatiotemporally complex sensory information and motor control. Tabula rasa deep reinforcement learning (RL) has tackled demanding tasks in terms of either visual, abstract, or physical reasoning, but solving these jointly remains a formidable challenge. One recent, unsolved benchmark task that integrates these challenges is Mu… ▽ More

    Submitted 3 October, 2020; originally announced October 2020.

  12. arXiv:2009.05524  [pdf, other

    cs.AI cs.LG

    Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

    Authors: Mehdi Mirza, Andrew Jaegle, Jonathan J. Hunt, Arthur Guez, Saran Tunyasuvunakool, Alistair Muldal, Théophane Weber, Peter Karkus, Sébastien Racanière, Lars Buesing, Timothy Lillicrap, Nicolas Heess

    Abstract: Recent work in deep reinforcement learning (RL) has produced algorithms capable of mastering challenging games such as Go, chess, or shogi. In these works the RL agent directly observes the natural state of the game and controls that state directly with its actions. However, when humans play such games, they do not just reason about the moves but also interact with their physical environment. They… ▽ More

    Submitted 29 October, 2020; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: 17 pages + appendix. Updated text and references

  13. arXiv:2005.09530  [pdf, other

    cs.CV cs.LG cs.RO

    Differentiable Map** Networks: Learning Structured Map Representations for Sparse Visual Localization

    Authors: Peter Karkus, Anelia Angelova, Vincent Vanhoucke, Rico Jonschkowski

    Abstract: Map** and localization, preferably from a small number of observations, are fundamental tasks in robotics. We address these tasks by combining spatial structure (differentiable map**) and end-to-end learning in a novel neural network architecture: the Differentiable Map** Network (DMN). The DMN constructs a spatially structured view-embedding map and uses it for subsequent visual localizatio… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

    Comments: ICRA 2020

  14. arXiv:2002.09884  [pdf, other

    cs.LG cs.AI stat.ML

    Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations

    Authors: Xiao Ma, Peter Karkus, David Hsu, Wee Sun Lee, Nan Ye

    Abstract: Deep reinforcement learning is successful in decision making for sophisticated games, such as Atari, Go, etc. However, real-world decision making often requires reasoning with partial information extracted from complex visual observations. This paper presents Discriminative Particle Filter Reinforcement Learning (DPFRL), a new reinforcement learning framework for complex partial observations. DPFR… ▽ More

    Submitted 23 February, 2020; originally announced February 2020.

    Comments: Accepted to ICLR 2020

  15. arXiv:1905.12885  [pdf, other

    cs.LG stat.ML

    Particle Filter Recurrent Neural Networks

    Authors: Xiao Ma, Peter Karkus, David Hsu, Wee Sun Lee

    Abstract: Recurrent neural networks (RNNs) have been extraordinarily successful for prediction with sequential data. To tackle highly variable and noisy real-world data, we introduce Particle Filter Recurrent Neural Networks (PF-RNNs), a new RNN family that explicitly models uncertainty in its internal structure: while an RNN relies on a long, deterministic latent state vector, a PF-RNN maintains a latent s… ▽ More

    Submitted 1 December, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: Accepted to AAAI 2020

  16. arXiv:1905.11602  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Differentiable Algorithm Networks for Composable Robot Learning

    Authors: Peter Karkus, Xiao Ma, David Hsu, Leslie Pack Kaelbling, Wee Sun Lee, Tomas Lozano-Perez

    Abstract: This paper introduces the Differentiable Algorithm Network (DAN), a composable architecture for robot learning systems. A DAN is composed of neural network modules, each encoding a differentiable robot algorithm and an associated model; and it is trained end-to-end from data. DAN combines the strengths of model-driven modular system design and data-driven end-to-end learning. The algorithms and mo… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: RSS 2019 camera ready. Video is available at https://youtu.be/4jcYlTSJF4Y

  17. arXiv:1904.11761  [pdf, other

    cs.LG cs.AI stat.ML

    Factored Contextual Policy Search with Bayesian Optimization

    Authors: Robert Pinsler, Peter Karkus, Andras Kupcsik, David Hsu, Wee Sun Lee

    Abstract: Scarce data is a major challenge to scaling robot learning to truly complex tasks, as we need to generalize locally learned policies over different task contexts. Contextual policy search offers data-efficient learning and generalization by explicitly conditioning the policy on a parametric context space. In this paper, we further structure the contextual policy representation. We propose to facto… ▽ More

    Submitted 26 April, 2019; originally announced April 2019.

    Comments: To appear in ICRA 2019

  18. arXiv:1807.06696  [pdf, other

    cs.RO cs.AI cs.LG

    Integrating Algorithmic Planning and Deep Learning for Partially Observable Navigation

    Authors: Peter Karkus, David Hsu, Wee Sun Lee

    Abstract: We propose to take a novel approach to robot system design where each building block of a larger system is represented as a differentiable program, i.e. a deep neural network. This representation allows for integrating algorithmic planning and deep learning in a principled manner, and thus combine the benefits of model-free and model-based methods. We apply the proposed approach to a challenging p… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: MLPC workshop, ICRA 2018

  19. arXiv:1805.08975  [pdf, other

    cs.RO cs.AI cs.CV cs.LG stat.ML

    Particle Filter Networks with Application to Visual Localization

    Authors: Peter Karkus, David Hsu, Wee Sun Lee

    Abstract: Particle filtering is a powerful approach to sequential state estimation and finds application in many domains, including robot localization, object tracking, etc. To apply particle filtering in practice, a critical challenge is to construct probabilistic system models, especially for systems with complex dynamics or rich sensory inputs such as camera images. This paper introduces the Particle Fil… ▽ More

    Submitted 25 October, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: CoRL 2018 camera ready

  20. arXiv:1703.06692  [pdf, other

    cs.AI cs.LG cs.NE stat.ML

    QMDP-Net: Deep Learning for Planning under Partial Observability

    Authors: Peter Karkus, David Hsu, Wee Sun Lee

    Abstract: This paper introduces the QMDP-net, a neural network architecture for planning under partial observability. The QMDP-net combines the strengths of model-free learning and model-based planning. It is a recurrent policy network, but it represents a policy for a parameterized set of tasks by connecting a model with a planning algorithm that solves the model, thus embedding the solution structure of p… ▽ More

    Submitted 2 November, 2017; v1 submitted 20 March, 2017; originally announced March 2017.

    Comments: NIPS 2017 camera-ready

  21. arXiv:1612.01746  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Factored Contextual Policy Search with Bayesian Optimization

    Authors: Peter Karkus, Andras Kupcsik, David Hsu, Wee Sun Lee

    Abstract: Scarce data is a major challenge to scaling robot learning to truly complex tasks, as we need to generalize locally learned policies over different "contexts". Bayesian optimization approaches to contextual policy search (CPS) offer data-efficient policy learning that generalize over a context space. We propose to improve data-efficiency by factoring typically considered contexts into two componen… ▽ More

    Submitted 28 May, 2019; v1 submitted 6 December, 2016; originally announced December 2016.

    Comments: BayesOpt 2016, NeurIPS Workshop. A full paper extension is available at arXiv:1904.11761