Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

Rengarajan, Desik; Chaudhary, Sapana; Kim, Jaewon; Kalathil, Dileep; Shakkottai, Srinivas

Computer Science > Machine Learning

arXiv:2209.13048 (cs)

[Submitted on 26 Sep 2022]

Title:Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

Authors:Desik Rengarajan, Sapana Chaudhary, Jaewon Kim, Dileep Kalathil, Srinivas Shakkottai

View PDF

Abstract:Meta reinforcement learning (Meta-RL) is an approach wherein the experience gained from solving a variety of tasks is distilled into a meta-policy. The meta-policy, when adapted over only a small (or just a single) number of steps, is able to perform near-optimally on a new, related task. However, a major challenge to adopting this approach to solve real-world problems is that they are often associated with sparse reward functions that only indicate whether a task is completed partially or fully. We consider the situation where some data, possibly generated by a sub-optimal agent, is available for each task. We then develop a class of algorithms entitled Enhanced Meta-RL using Demonstrations (EMRLD) that exploit this information even if sub-optimal to obtain guidance during training. We show how EMRLD jointly utilizes RL and supervised learning over the offline data to generate a meta-policy that demonstrates monotone performance improvements. We also develop a warm started variant called EMRLD-WS that is particularly efficient for sub-optimal demonstration data. Finally, we show that our EMRLD algorithms significantly outperform existing approaches in a variety of sparse reward environments, including that of a mobile robot.

Comments:	Accepted to NeurIPS 2022; first two authors contributed equally
Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2209.13048 [cs.LG]
	(or arXiv:2209.13048v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2209.13048

Submission history

From: Desik Rengarajan [view email]
[v1] Mon, 26 Sep 2022 22:01:12 UTC (20,507 KB)

Computer Science > Machine Learning

Title:Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators