Skip to main content

Showing 1–7 of 7 results for author: Shafiullah, N M M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.03181  [pdf, other

    cs.LG cs.AI cs.RO

    Behavior Generation with Latent Actions

    Authors: Seungjae Lee, Yibin Wang, Haritheja Etukuru, H. ** Kim, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

    Abstract: Generative modeling of complex behaviors from labeled datasets has been a longstanding problem in decision making. Unlike language or image generation, decision making requires modeling actions - continuous-valued vectors that are multimodal in their distribution, potentially drawn from uncurated sources, where generation errors can compound in sequential prediction. A recent class of models calle… ▽ More

    Submitted 28 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Github repo: https://github.com/jayLEE0301/vq_bet_official

  2. arXiv:2401.12202  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics

    Authors: Peiqi Liu, Yaswanth Orru, Jay Vakil, Chris Paxton, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

    Abstract: Remarkable progress has been made in recent years in the fields of vision, language, and robotics. We now have vision models capable of recognizing objects based on language queries, navigation systems that can effectively control mobile systems, and gras** models that can handle a wide range of objects. Despite these advancements, general-purpose applications of robotics still lag behind, even… ▽ More

    Submitted 29 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: Github repo: https://github.com/ok-robot/ok-robot

  3. arXiv:2311.16098  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    On Bringing Robots Home

    Authors: Nur Muhammad Mahi Shafiullah, Anant Rai, Haritheja Etukuru, Yiqian Liu, Ishan Misra, Soumith Chintala, Lerrel Pinto

    Abstract: Throughout history, we have successfully integrated various machines into our homes. Dishwashers, laundry machines, stand mixers, and robot vacuums are a few recent examples. However, these machines excel at performing only a single task effectively. The concept of a "generalist machine" in homes - a domestic assistant that can adapt and learn from our needs, all while remaining cost-effective - h… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Project website and videos are available at https://dobb-e.com, technical documentation for getting started is available at https://docs.dobb-e.com, and code is released at https://github.com/notmahi/dobb-e

  4. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, A**kya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  5. arXiv:2210.10047  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data

    Authors: Zichen Jeff Cui, Yibin Wang, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

    Abstract: While large-scale sequence modeling from offline data has led to impressive performance gains in natural language and image generation, directly translating such ideas to robotics has been challenging. One critical reason for this is that uncurated robot demonstration data, i.e. play data, collected from non-expert human demonstrators are often noisy, diverse, and distributionally multi-modal. Thi… ▽ More

    Submitted 15 December, 2022; v1 submitted 18 October, 2022; originally announced October 2022.

    Comments: Code and data available at: https://play-to-policy.github.io; (fixed metadata author name format)

  6. arXiv:2210.05663  [pdf, other

    cs.RO cs.CV

    CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

    Authors: Nur Muhammad Mahi Shafiullah, Chris Paxton, Lerrel Pinto, Soumith Chintala, Arthur Szlam

    Abstract: We propose CLIP-Fields, an implicit scene model that can be used for a variety of tasks, such as segmentation, instance identification, semantic search over space, and view localization. CLIP-Fields learns a map** from spatial locations to semantic embedding vectors. Importantly, we show that this map** can be trained with supervision coming only from web-image and web-text trained models such… ▽ More

    Submitted 22 May, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Code, video, and interactive demonstrations available at https://mahis.life/clip-fields. Accepted for publication at Robotics: Science and Systems 2023 in Daegu, Korea

  7. arXiv:2206.11251  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Behavior Transformers: Cloning $k$ modes with one stone

    Authors: Nur Muhammad Mahi Shafiullah, Zichen Jeff Cui, Ariuntuya Altanzaya, Lerrel Pinto

    Abstract: While behavior learning has made impressive progress in recent times, it lags behind computer vision and natural language processing due to its inability to leverage large, human-generated datasets. Human behaviors have wide variance, multiple modes, and human demonstrations typically do not come with reward labels. These properties limit the applicability of current methods in Offline RL and Beha… ▽ More

    Submitted 11 October, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Code and data available at https://github.com/notmahi/bet