Skip to main content

Showing 1–1 of 1 results for author: Sreenivas, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2106.03207  [pdf, other

    cs.LG stat.ML

    Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage

    Authors: Jonathan D. Chang, Masatoshi Uehara, Dhruv Sreenivas, Rahul Kidambi, Wen Sun

    Abstract: This paper studies offline Imitation Learning (IL) where an agent learns to imitate an expert demonstrator without additional online environment interactions. Instead, the learner is presented with a static offline dataset of state-action-next state transition triples from a potentially less proficient behavior policy. We introduce Model-based IL from Offline data (MILO): an algorithmic framework… ▽ More

    Submitted 31 January, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: 42 pages, 5 figures, 7 tables