What Matters in Learning from Offline Human Demonstrations for Robot Manipulation

Mandlekar, Ajay; Xu, Danfei; Wong, Josiah; Nasiriany, Soroush; Wang, Chen; Kulkarni, Rohun; Fei-Fei, Li; Savarese, Silvio; Zhu, Yuke; Martín-Martín, Roberto

Computer Science > Robotics

arXiv:2108.03298v1 (cs)

[Submitted on 6 Aug 2021 (this version), latest version 25 Sep 2021 (v2)]

Title:What Matters in Learning from Offline Human Demonstrations for Robot Manipulation

Authors:Ajay Mandlekar, Danfei Xu, Josiah Wong, Soroush Nasiriany, Chen Wang, Rohun Kulkarni, Li Fei-Fei, Silvio Savarese, Yuke Zhu, Roberto Martín-Martín

View PDF

Abstract:Imitating human demonstrations is a promising approach to endow robots with various manipulation capabilities. While recent advances have been made in imitation learning and batch (offline) reinforcement learning, a lack of open-source human datasets and reproducible learning methods make assessing the state of the field difficult. In this paper, we conduct an extensive study of six offline learning algorithms for robot manipulation on five simulated and three real-world multi-stage manipulation tasks of varying complexity, and with datasets of varying quality. Our study analyzes the most critical challenges when learning from offline human data for manipulation. Based on the study, we derive a series of lessons including the sensitivity to different algorithmic design choices, the dependence on the quality of the demonstrations, and the variability based on the stop** criteria due to the different objectives in training and evaluation. We also highlight opportunities for learning from human datasets, such as the ability to learn proficient policies on challenging, multi-stage tasks beyond the scope of current reinforcement learning methods, and the ability to easily scale to natural, real-world manipulation scenarios where only raw sensory signals are available. We have open-sourced our datasets and all algorithm implementations to facilitate future research and fair comparisons in learning from human demonstration data. Codebase, datasets, trained models, and more available at this https URL

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2108.03298 [cs.RO]
	(or arXiv:2108.03298v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2108.03298

Submission history

From: Ajay Mandlekar [view email]
[v1] Fri, 6 Aug 2021 20:48:30 UTC (16,158 KB)
[v2] Sat, 25 Sep 2021 00:37:01 UTC (16,159 KB)

Computer Science > Robotics

Title:What Matters in Learning from Offline Human Demonstrations for Robot Manipulation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:What Matters in Learning from Offline Human Demonstrations for Robot Manipulation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators