Skip to main content

Showing 1–15 of 15 results for author: Dalal, M

.
  1. arXiv:2406.15377  [pdf

    cs.CY cs.AI cs.LG cs.NE cs.PL cs.SE

    Model Callers for Transforming Predictive and Generative AI Applications

    Authors: Mukesh Dalal

    Abstract: We introduce a novel software abstraction termed "model caller," acting as an intermediary for AI and ML model calling, advocating its transformative utility beyond existing model-serving frameworks. This abstraction offers multiple advantages: enhanced accuracy and reduced latency in model predictions, superior monitoring and observability of models, more streamlined AI system architectures, simp… ▽ More

    Submitted 17 April, 2024; originally announced June 2024.

    Comments: 18 pages, 14 figures

    MSC Class: 68T05 (Primary) 68T07; 68N19; 68T35 (Secondary) ACM Class: I.2.0; I.2.1; I.2.5; I.2.11; D.2.11; D.3.3; H.1.2; J.0

  2. arXiv:2405.01534  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks

    Authors: Murtaza Dalal, Tarun Chiruvolu, Devendra Chaplot, Ruslan Salakhutdinov

    Abstract: Large Language Models (LLMs) have been shown to be capable of performing high-level planning for long-horizon robotics tasks, yet existing methods require access to a pre-defined skill library (e.g. picking, placing, pulling, pushing, navigating). However, LLM planning does not address how to design or learn those behaviors, which remains challenging particularly in long-horizon settings. Furtherm… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Published at ICLR 2024. Website at https://mihdalal.github.io/planseqlearn/ 9 pages, 3 figures, 3 tables; 14 pages appendix (7 additional figures)

  3. arXiv:2308.00642  [pdf, ps, other

    cs.IT

    Reversible complement cyclic codes over finite chain rings

    Authors: Monika Dalal, Sucheta Dutt, Ranjeet Sehmi

    Abstract: Let k be an arbitrary element of a finite commutative chain ring R and u be a unit in R. In this work, we present necessary conditions which are sufficient as well for a cyclic code to be a (u,k) reversible complement code over R. Using these conditions, all principally generated cyclic codes over the ring Z_{2}+vZ_{2}+v^{2}Z_{2}, v^{3}=0 of length 4 have been checked to find whether they are (1,1… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  4. arXiv:2307.09156  [pdf, ps, other

    cs.IT

    Reversible cyclic codes over finite chain rings

    Authors: Monika Dalal, Sucheta Dutt, Ranjeet Sehmi

    Abstract: In this paper, necessary and sufficient conditions for the reversibility of a cyclic code of arbitrary length over a finite commutative chain ring have been derived. MDS reversible cyclic codes having length p^s over a finite chain ring with nilpotency index 2 have been characterized and a few examples of MDS reversible cyclic codes have been presented. Further, it is shown that the torsion codes… ▽ More

    Submitted 23 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

  5. arXiv:2305.16309  [pdf, other

    cs.RO cs.CV cs.LG

    Imitating Task and Motion Planning with Visuomotor Transformers

    Authors: Murtaza Dalal, Ajay Mandlekar, Caelan Garrett, Ankur Handa, Ruslan Salakhutdinov, Dieter Fox

    Abstract: Imitation learning is a powerful tool for training robot manipulation policies, allowing them to learn from expert demonstrations without manual programming or trial-and-error. However, common methods of data collection, such as human supervision, scale poorly, as they are time-consuming and labor-intensive. In contrast, Task and Motion Planning (TAMP) can autonomously generate large-scale dataset… ▽ More

    Submitted 17 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Conference on Robot Learning (CoRL) 2023. 8 pages, 5 figures, 2 tables; 11 pages appendix (10 additional figures)

  6. arXiv:2303.15819  [pdf, ps, other

    cs.IT

    MDS and MHDR cyclic codes over finite chain rings

    Authors: Monika Dalal, Sucheta Dutt, Ranjeet Sehmi

    Abstract: In this work, a unique set of generators for a cyclic code over a finite chain ring has been established. The minimal spanning set and rank of the code have also been determined. Further, sufficient as well as necessary conditions for a cyclic code to be an MDS code and for a cyclic code to be an MHDR code have been obtained. Some examples of optimal cyclic codes have also been presented.

    Submitted 28 March, 2023; originally announced March 2023.

  7. arXiv:2112.01001  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    SEAL: Self-supervised Embodied Active Learning using Exploration and 3D Consistency

    Authors: Devendra Singh Chaplot, Murtaza Dalal, Saurabh Gupta, Jitendra Malik, Ruslan Salakhutdinov

    Abstract: In this paper, we explore how we can build upon the data and models of Internet images and use them to adapt to robot vision without requiring any extra labels. We present a framework called Self-supervised Embodied Active Learning (SEAL). It utilizes perception models trained on internet images to learn an active exploration policy. The observations gathered by this exploration policy are labelle… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: Published at NeurIPS 2021. See project webpage at https://devendrachaplot.github.io/projects/seal

  8. arXiv:2110.15360  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives

    Authors: Murtaza Dalal, Deepak Pathak, Ruslan Salakhutdinov

    Abstract: Despite the potential of reinforcement learning (RL) for building general-purpose robotic systems, training RL agents to solve robotics tasks still remains challenging due to the difficulty of exploration in purely continuous action spaces. Addressing this problem is an active area of research with the majority of focus on improving RL methods via better optimization or more efficient exploration.… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

    Comments: Published at NeurIPS 2021. Website at https://mihdalal.github.io/raps/

  9. arXiv:2006.09359  [pdf, other

    cs.LG cs.RO stat.ML

    AWAC: Accelerating Online Reinforcement Learning with Offline Datasets

    Authors: Ashvin Nair, Abhishek Gupta, Murtaza Dalal, Sergey Levine

    Abstract: Reinforcement learning (RL) provides an appealing formalism for learning control policies from experience. However, the classic active formulation of RL necessitates a lengthy active exploration process for each behavior, making it difficult to apply in real-world settings such as robotic control. If we can instead allow RL algorithms to effectively use previously collected data to aid the online… ▽ More

    Submitted 24 April, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 17 pages. Website: https://awacrl.github.io/

  10. arXiv:2003.02636  [pdf, other

    cs.RO cs.LG stat.ML

    Scalable Multi-Task Imitation Learning with Autonomous Improvement

    Authors: Avi Singh, Eric Jang, Alexander Irpan, Daniel Kappler, Murtaza Dalal, Sergey Levine, Mohi Khansari, Chelsea Finn

    Abstract: While robot learning has demonstrated promising results for enabling robots to automatically acquire new skills, a critical challenge in deploying learning-based systems is scale: acquiring enough data for the robot to effectively generalize broadly. Imitation learning, in particular, has remained a stable and powerful approach for robot learning, but critically relies on expert operators for data… ▽ More

    Submitted 25 February, 2020; originally announced March 2020.

    Comments: Accepted to ICRA 2020. Supplementary material at https://sites.google.com/view/scalable-mili

  11. arXiv:1910.07737  [pdf, other

    cs.LG stat.ML

    Autoregressive Models: What Are They Good For?

    Authors: Murtaza Dalal, Alexander C. Li, Rohan Taori

    Abstract: Autoregressive (AR) models have become a popular tool for unsupervised learning, achieving state-of-the-art log likelihood estimates. We investigate the use of AR models as density estimators in two settings -- as a learning signal for image translation, and as an outlier detector -- and find that these density estimates are much less reliable than previously thought. We examine the underlying opt… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: Accepted for the Information Theory and Machine Learning workshop at NeurIPS 2019

  12. arXiv:1903.03698  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Skew-Fit: State-Covering Self-Supervised Reinforcement Learning

    Authors: Vitchyr H. Pong, Murtaza Dalal, Steven Lin, Ashvin Nair, Shikhar Bahl, Sergey Levine

    Abstract: Autonomous agents that must exhibit flexible and broad capabilities will need to be equipped with large repertoires of skills. Defining each skill with a manually-designed reward function limits this repertoire and imposes a manual engineering burden. Self-supervised agents that set their own goals can automate this process, but designing appropriate goal setting objectives can be difficult, and o… ▽ More

    Submitted 4 August, 2020; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: ICML 2020. 8 pages, 8 figures; 9 pages appendix (6 additional figures)

  13. arXiv:1807.04742  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    Visual Reinforcement Learning with Imagined Goals

    Authors: Ashvin Nair, Vitchyr Pong, Murtaza Dalal, Shikhar Bahl, Steven Lin, Sergey Levine

    Abstract: For an autonomous agent to fulfill a wide range of user-specified goals at test time, it must be able to learn broadly applicable and general-purpose skill repertoires. Furthermore, to provide the requisite level of generality, these skills must handle raw sensory input such as images. In this paper, we propose an algorithm that acquires such general-purpose skills by combining unsupervised repres… ▽ More

    Submitted 4 December, 2018; v1 submitted 12 July, 2018; originally announced July 2018.

    Comments: 15 pages, NeurIPS 2018

  14. arXiv:1803.06773  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Composable Deep Reinforcement Learning for Robotic Manipulation

    Authors: Tuomas Haarnoja, Vitchyr Pong, Aurick Zhou, Murtaza Dalal, Pieter Abbeel, Sergey Levine

    Abstract: Model-free deep reinforcement learning has been shown to exhibit good performance in domains ranging from video games to simulated robotic manipulation and locomotion. However, model-free methods are known to perform poorly when the interaction time with the environment is limited, as is the case for most real-world robotic tasks. In this paper, we study how maximum entropy policies trained using… ▽ More

    Submitted 18 March, 2018; originally announced March 2018.

    Comments: Videos: https://sites.google.com/view/composing-real-world-policies/

  15. arXiv:1802.09081  [pdf, other

    cs.LG

    Temporal Difference Models: Model-Free Deep RL for Model-Based Control

    Authors: Vitchyr Pong, Shixiang Gu, Murtaza Dalal, Sergey Levine

    Abstract: Model-free reinforcement learning (RL) is a powerful, general tool for learning complex behaviors. However, its sample efficiency is often impractically large for solving challenging real-world problems, even with off-policy algorithms such as Q-learning. A limiting factor in classic model-free RL is that the learning signal consists only of scalar rewards, ignoring much of the rich information co… ▽ More

    Submitted 24 February, 2020; v1 submitted 25 February, 2018; originally announced February 2018.

    Comments: Appeared in ICLR 2018; typos corrected