Skip to main content

Showing 1–1 of 1 results for author: Lodha, H

.
  1. arXiv:2112.02999  [pdf, other

    cs.RO

    Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning

    Authors: Utkarsh A. Mishra, Soumya R. Samineni, Prakhar Goel, Chandravaran Kunjeti, Himanshu Lodha, Aman Singh, Aditya Sagi, Shalabh Bhatnagar, Shishir Kolathaya

    Abstract: Recent works in Reinforcement Learning (RL) combine model-free (Mf)-RL algorithms with model-based (Mb)-RL approaches to get the best from both: asymptotic performance of Mf-RL and high sample-efficiency of Mb-RL. Inspired by these works, we propose a hierarchical framework that integrates online learning for the Mb-trajectory optimization with off-policy methods for the Mf-RL. In particular, two… ▽ More

    Submitted 4 November, 2021; originally announced December 2021.

    Comments: 8 pages, 4 figures. arXiv admin note: substantial text overlap with arXiv:2110.12239