Skip to main content

Showing 1–17 of 17 results for author: Bahl, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.04549  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play

    Authors: Lili Chen, Shikhar Bahl, Deepak Pathak

    Abstract: Learning from unstructured and uncurated data has become the dominant paradigm for generative approaches in language and vision. Such unstructured and unguided behavior data, commonly known as play, is also easier to collect in robotics but much more difficult to learn from due to its inherently multimodal, noisy, and suboptimal nature. In this paper, we study this problem of learning goal-directe… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: In CoRL 2023. Website at https://play-fusion.github.io

  2. arXiv:2310.19797  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    DEFT: Dexterous Fine-Tuning for Real-World Hand Policies

    Authors: Aditya Kannan, Kenneth Shaw, Shikhar Bahl, Pragna Mannam, Deepak Pathak

    Abstract: Dexterity is often seen as a cornerstone of complex manipulation. Humans are able to perform a host of skills with their hands, from making food to operating tools. In this paper, we investigate these challenges, especially in the case of soft, deformable objects as well as complex, relatively long-horizon tasks. However, learning such behaviors from scratch can be data inefficient. To circumvent… ▽ More

    Submitted 12 December, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: In CoRL 2023. Website at https://dexterous-finetuning.github.io/

  3. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, A**kya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  4. arXiv:2309.02435  [pdf, other

    cs.LG cs.AI cs.CV cs.NE cs.RO

    Efficient RL via Disentangled Environment and Agent Representations

    Authors: Kevin Gmelin, Shikhar Bahl, Russell Mendonca, Deepak Pathak

    Abstract: Agents that are aware of the separation between themselves and their environments can leverage this understanding to form effective representations of visual input. We propose an approach for learning such structured representations for RL algorithms, using visual knowledge of the agent, such as its shape or mask, which is often inexpensive to obtain. This is incorporated into the RL objective usi… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: ICML 2023. Website at https://sear-rl.github.io/

  5. arXiv:2308.10901  [pdf, other

    cs.RO cs.AI cs.CV cs.LG cs.NE

    Structured World Models from Human Videos

    Authors: Russell Mendonca, Shikhar Bahl, Deepak Pathak

    Abstract: We tackle the problem of learning complex, general behaviors directly in the real world. We propose an approach for robots to efficiently learn manipulation skills using only a handful of real-world interaction trajectories from many different settings. Inspired by the success of learning from large-scale datasets in the fields of computer vision and natural language, our belief is that in order t… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: RSS 2023. Website at https://human-world-model.github.io

  6. arXiv:2304.08488  [pdf, other

    cs.RO cs.AI cs.CV cs.LG cs.NE

    Affordances from Human Videos as a Versatile Representation for Robotics

    Authors: Shikhar Bahl, Russell Mendonca, Lili Chen, Unnat Jain, Deepak Pathak

    Abstract: Building a robot that can understand and learn to interact by watching humans has inspired several vision problems. However, despite some successful results on static datasets, it remains unclear how current models can be used on a robot directly. In this paper, we aim to bridge this gap by leveraging videos of human interactions in an environment centric manner. Utilizing internet videos of human… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: Accepted at CVPR 2023. Website at https://robo-affordances.github.io/

  7. arXiv:2302.06604  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    ALAN: Autonomously Exploring Robotic Agents in the Real World

    Authors: Russell Mendonca, Shikhar Bahl, Deepak Pathak

    Abstract: Robotic agents that operate autonomously in the real world need to continuously explore their environment and learn from the data collected, with minimal human supervision. While it is possible to build agents that can learn in such a manner without supervision, current methods struggle to scale to the real world. Thus, we propose ALAN, an autonomously exploring robotic agent, that can perform tas… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: ICRA 2023. Website at https://robo-explorer.github.io/

  8. arXiv:2212.04498  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    VideoDex: Learning Dexterity from Internet Videos

    Authors: Kenneth Shaw, Shikhar Bahl, Deepak Pathak

    Abstract: To build general robotic agents that can operate in many environments, it is often imperative for the robot to collect experience in the real world. However, this is often not feasible due to safety, time, and hardware restrictions. We thus propose leveraging the next best thing as real-world experience: internet videos of humans using their hands. Visual priors, such as visual features, are often… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: Accepted at CoRL 2022. Website at https://video-dex.github.io

  9. arXiv:2207.09450  [pdf, other

    cs.RO cs.AI cs.CV cs.LG eess.SY

    Human-to-Robot Imitation in the Wild

    Authors: Shikhar Bahl, Abhinav Gupta, Deepak Pathak

    Abstract: We approach the problem of learning by watching humans in the wild. While traditional approaches in Imitation and Reinforcement Learning are promising for learning in the real world, they are either sample inefficient or are constrained to lab settings. Meanwhile, there has been a lot of success in processing passive, unstructured human data. We propose tackling this problem via an efficient one-s… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Published at RSS 2022. Demos at https://human2robot.github.io

  10. arXiv:2203.08098  [pdf, other

    cs.RO

    RB2: Robotic Manipulation Benchmarking with a Twist

    Authors: Sudeep Dasari, Jianren Wang, Joyce Hong, Shikhar Bahl, Yixin Lin, Austin Wang, Abitha Thankaraj, Karanbir Chahal, Berk Calli, Saurabh Gupta, David Held, Lerrel Pinto, Deepak Pathak, Vikash Kumar, Abhinav Gupta

    Abstract: Benchmarks offer a scientific way to compare algorithms using objective performance metrics. Good benchmarks have two features: (a) they should be widely useful for many research groups; (b) and they should produce reproducible findings. In robotic manipulation research, there is a trade-off between reproducibility and broad accessibility. If the benchmark is kept restrictive (fixed hardware, obje… ▽ More

    Submitted 30 October, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: accepted at the NeurIPS 2021 Datasets and Benchmarks Track

  11. arXiv:2107.05627  [pdf, other

    cs.LG cs.AI cs.CV cs.RO eess.SY

    Hierarchical Neural Dynamic Policies

    Authors: Shikhar Bahl, Abhinav Gupta, Deepak Pathak

    Abstract: We tackle the problem of generalization to unseen configurations for dynamic tasks in the real world while learning from high-dimensional image input. The family of nonlinear dynamical system-based methods have successfully demonstrated dynamic robot behaviors but have difficulty in generalizing to unseen configurations as well as learning from image inputs. Recent works approach this issue by usi… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: Accepted at RSS 2021. Videos and code at https://shikharbahl.github.io/hierarchical-ndps/

  12. arXiv:2012.02788  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Neural Dynamic Policies for End-to-End Sensorimotor Learning

    Authors: Shikhar Bahl, Mustafa Mukadam, Abhinav Gupta, Deepak Pathak

    Abstract: The current dominant paradigm in sensorimotor control, whether imitation or reinforcement learning, is to train policies directly in raw action spaces such as torque, joint angle, or end-effector position. This forces the agent to make decisions individually at each timestep in training, and hence, limits the scalability to continuous, high-dimensional, and long-horizon tasks. In contrast, researc… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: NeurIPS 2020 (Spotlight). Code and videos at https://shikharbahl.github.io/neural-dynamic-policies/

  13. arXiv:1910.11670  [pdf, other

    cs.RO cs.CV cs.LG

    Contextual Imagined Goals for Self-Supervised Robotic Learning

    Authors: Ashvin Nair, Shikhar Bahl, Alexander Khazatsky, Vitchyr Pong, Glen Berseth, Sergey Levine

    Abstract: While reinforcement learning provides an appealing formalism for learning individual skills, a general-purpose robotic system must be able to master an extensive repertoire of behaviors. Instead of learning a large collection of skills individually, can we instead enable a robot to propose and practice its own behaviors automatically, learning about the affordances and behaviors that it can perfor… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 12 pages, to be presented at Conference on Robot Learning (CoRL) 2019. Project website: https://ccrig.github.io/

  14. arXiv:1906.05841  [pdf, other

    cs.RO cs.CV cs.LG

    Deep Reinforcement Learning for Industrial Insertion Tasks with Visual Inputs and Natural Rewards

    Authors: Gerrit Schoettler, Ashvin Nair, Jianlan Luo, Shikhar Bahl, Juan Aparicio Ojea, Eugen Solowjow, Sergey Levine

    Abstract: Connector insertion and many other tasks commonly found in modern manufacturing settings involve complex contact dynamics and friction. Since it is difficult to capture related physical effects with first-order modeling, traditional control methods often result in brittle and inaccurate controllers, which have to be manually tuned. Reinforcement learning (RL) methods have been demonstrated to be c… ▽ More

    Submitted 2 August, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

  15. arXiv:1903.03698  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Skew-Fit: State-Covering Self-Supervised Reinforcement Learning

    Authors: Vitchyr H. Pong, Murtaza Dalal, Steven Lin, Ashvin Nair, Shikhar Bahl, Sergey Levine

    Abstract: Autonomous agents that must exhibit flexible and broad capabilities will need to be equipped with large repertoires of skills. Defining each skill with a manually-designed reward function limits this repertoire and imposes a manual engineering burden. Self-supervised agents that set their own goals can automate this process, but designing appropriate goal setting objectives can be difficult, and o… ▽ More

    Submitted 4 August, 2020; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: ICML 2020. 8 pages, 8 figures; 9 pages appendix (6 additional figures)

  16. arXiv:1812.03201  [pdf, other

    cs.RO cs.LG

    Residual Reinforcement Learning for Robot Control

    Authors: Tobias Johannink, Shikhar Bahl, Ashvin Nair, Jianlan Luo, Avinash Kumar, Matthias Loskyll, Juan Aparicio Ojea, Eugen Solowjow, Sergey Levine

    Abstract: Conventional feedback control methods can solve various types of robot control problems very efficiently by capturing the structure with explicit models, such as rigid body equations of motion. However, many control problems in modern manufacturing deal with contacts and friction, which are difficult to capture with first-order physical modeling. Hence, applying control design methodologies to the… ▽ More

    Submitted 18 December, 2018; v1 submitted 7 December, 2018; originally announced December 2018.

    Comments: 7 pages

  17. arXiv:1807.04742  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    Visual Reinforcement Learning with Imagined Goals

    Authors: Ashvin Nair, Vitchyr Pong, Murtaza Dalal, Shikhar Bahl, Steven Lin, Sergey Levine

    Abstract: For an autonomous agent to fulfill a wide range of user-specified goals at test time, it must be able to learn broadly applicable and general-purpose skill repertoires. Furthermore, to provide the requisite level of generality, these skills must handle raw sensory input such as images. In this paper, we propose an algorithm that acquires such general-purpose skills by combining unsupervised repres… ▽ More

    Submitted 4 December, 2018; v1 submitted 12 July, 2018; originally announced July 2018.

    Comments: 15 pages, NeurIPS 2018