Skip to main content

Showing 1–10 of 10 results for author: Mishra, U A

.
  1. arXiv:2401.03360  [pdf, other

    cs.RO

    Generative Skill Chaining: Long-Horizon Skill Planning with Diffusion Models

    Authors: Utkarsh A. Mishra, Shangjie Xue, Yongxin Chen, Danfei Xu

    Abstract: Long-horizon tasks, usually characterized by complex subtask dependencies, present a significant challenge in manipulation planning. Skill chaining is a practical approach to solving unseen tasks by combining learned skill priors. However, such methods are myopic if sequenced greedily and face scalability issues with search-based planning strategy. To address these challenges, we introduce Generat… ▽ More

    Submitted 13 October, 2023; originally announced January 2024.

    Comments: Accepted at CoRL 2023: https://openreview.net/forum?id=HtJE9ly5dT

  2. arXiv:2303.12700  [pdf, other

    cs.RO cs.CV

    ReorientDiff: Diffusion Model based Reorientation for Object Manipulation

    Authors: Utkarsh A. Mishra, Yongxin Chen

    Abstract: The ability to manipulate objects in a desired configurations is a fundamental requirement for robots to complete various practical applications. While certain goals can be achieved by picking and placing the objects of interest directly, object reorientation is needed for precise placement in most of the tasks. In such scenarios, the object must be reoriented and re-positioned into intermediate p… ▽ More

    Submitted 14 September, 2023; v1 submitted 27 February, 2023; originally announced March 2023.

    Comments: 7 pages, 5 figures; More details here: https://utkarshmishra04.github.io/ReorientDiff

  3. arXiv:2205.12124  [pdf, other

    cs.RO

    Memory based neural networks for end-to-end autonomous driving

    Authors: Sergio Paniego Blanco, Sakshay Mahna, Utkarsh A. Mishra, JoseMaria Canas

    Abstract: Recent works in end-to-end control for autonomous driving have investigated the use of vision-based exteroceptive perception. Inspired by such results, we propose a new end-to-end memory-based neural architecture for robot steering and throttle control. We describe and compare this architecture with previous approaches using fundamental error metrics (MAE, MSE) and several external metrics based o… ▽ More

    Submitted 27 April, 2022; originally announced May 2022.

    Comments: 6 pages, 3 figures, Code available: https://github.com/JdeRobot/BehaviorMetrics and https://www.github.com/JdeRobot/DeepLearningStudio

  4. arXiv:2112.02999  [pdf, other

    cs.RO

    Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning

    Authors: Utkarsh A. Mishra, Soumya R. Samineni, Prakhar Goel, Chandravaran Kunjeti, Himanshu Lodha, Aman Singh, Aditya Sagi, Shalabh Bhatnagar, Shishir Kolathaya

    Abstract: Recent works in Reinforcement Learning (RL) combine model-free (Mf)-RL algorithms with model-based (Mb)-RL approaches to get the best from both: asymptotic performance of Mf-RL and high sample-efficiency of Mb-RL. Inspired by these works, we propose a hierarchical framework that integrates online learning for the Mb-trajectory optimization with off-policy methods for the Mf-RL. In particular, two… ▽ More

    Submitted 4 November, 2021; originally announced December 2021.

    Comments: 8 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:2110.12239

  5. arXiv:2111.07775  [pdf, other

    cs.LG cs.AI cs.CV

    Learning Representations for Pixel-based Control: What Matters and Why?

    Authors: Manan Tomar, Utkarsh A. Mishra, Amy Zhang, Matthew E. Taylor

    Abstract: Learning representations for pixel-based control has garnered significant attention recently in reinforcement learning. A wide range of methods have been proposed to enable efficient learning, leading to sample complexities similar to those in the full state setting. However, moving beyond carefully curated pixel data sets (centered crop, appropriate lighting, clear background, etc.) remains chall… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

  6. arXiv:2109.12665  [pdf, other

    cs.RO

    Linear Policies are Sufficient to Realize Robust Bipedal Walking on Challenging Terrains

    Authors: Lokesh Krishna, Guillermo A. Castillo, Utkarsh A. Mishra, Ayonga Hereid, Shishir Kolathaya

    Abstract: In this work, we demonstrate robust walking in the bipedal robot Digit on uneven terrains by just learning a single linear policy. In particular, we propose a new control pipeline, wherein the high-level trajectory modulator shapes the end-foot ellipsoidal trajectories, and the low-level gait controller regulates the torso and ankle orientation. The foot-trajectory modulator uses a linear policy a… ▽ More

    Submitted 5 October, 2021; v1 submitted 26 September, 2021; originally announced September 2021.

    Comments: 8 pages, 10 Figures

  7. arXiv:2106.15273  [pdf, other

    cs.RO cs.AI

    Learning Control Policies for Imitating Human Gaits

    Authors: Utkarsh A. Mishra

    Abstract: The work presented in this report introduces a framework aimed towards learning to imitate human gaits. Humans exhibit movements like walking, running, and jum** in the most efficient manner, which served as the source of motivation for this project. Skeletal and Musculoskeletal human models were considered for motions in the sagittal plane, and results from both were compared exhaustively. Whil… ▽ More

    Submitted 15 May, 2021; originally announced June 2021.

    Comments: 47 pages, 17 figures, Bachelor of Technology Final Year Project Report

  8. arXiv:2104.01662  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Learning Linear Policies for Robust Bipedal Locomotion on Terrains with Varying Slopes

    Authors: Lokesh Krishna, Utkarsh A. Mishra, Guillermo A. Castillo, Ayonga Hereid, Shishir Kolathaya

    Abstract: In this paper, with a view toward deployment of light-weight control frameworks for bipedal walking robots, we realize end-foot trajectories that are shaped by a single linear feedback policy. We learn this policy via a model-free and a gradient-free learning algorithm, Augmented Random Search (ARS), in the two robot platforms Rabbit and Digit. Our contributions are two-fold: a) By using torso and… ▽ More

    Submitted 9 August, 2021; v1 submitted 4 April, 2021; originally announced April 2021.

    Comments: 6 pages, 5 figures, Accepted in 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021) in Prague, Czech Republic

  9. arXiv:2012.02301  [pdf, other

    cs.RO cs.LG

    Planning Brachistochrone Hip Trajectory for a Toe-Foot Bipedal Robot going Downstairs

    Authors: Gaurav Bhardwaj, Utkarsh A. Mishra, N. Sukavanam, R. Balasubramanian

    Abstract: A novel efficient downstairs trajectory is proposed for a 9 link biped robot model with toe-foot. Brachistochrone is the fastest descent trajectory for a particle moving only under the influence of gravity. In most situations, while climbing downstairs, human hip also follow brachistochrone trajectory for a more responsive motion. Here, an adaptive trajectory planning algorithm is developed so tha… ▽ More

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: 6 pages, 6 figures, Accepted for presentation at the RoAI 2020: International Conference on Robotics and Artificial Intelligence 2020, IIT Madras and will be published in the Proceedings by the Journal of Physics: Conference Series. arXiv admin note: substantial text overlap with arXiv:2012.01417

  10. arXiv:2012.01417  [pdf, other

    cs.RO cs.NE eess.SY

    Cycloidal Trajectory Realization on Staircase based on Neural Network Temporal Quantized Lagrange Dynamics (NNTQLD) with Ant Colony Optimization for a 9-Link Bipedal Robot

    Authors: Gaurav Bhardwaj, Utkarsh A. Mishra, N. Sukavanam, R. Balasubramanian

    Abstract: In this paper, a novel optimal technique for joint angles trajectory tracking control with energy optimization for a biped robot with toe foot is proposed. For the task of climbing stairs by a 9-link biped model, a cycloid trajectory for swing phase is proposed in such a way that the cycloid variables depend on the staircase dimensions. Zero Moment Point(ZMP) criteria is taken for satisfying stabi… ▽ More

    Submitted 21 July, 2021; v1 submitted 2 December, 2020; originally announced December 2020.