Showing 1–1 of 1 results for author: Alumootil, V

  1. arXiv:2102.06961  [pdf, other]

    cs.LG

    PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators

    Authors: Anish Agarwal, Abdullah Alomar, Varkey Alumootil, Devavrat Shah, Dennis Shen, Zhi Xu, Cindy Yang

    Abstract: We consider offline reinforcement learning (RL) with heterogeneous agents under severe data scarcity, i.e., we only observe a single historical trajectory for every agent under an unknown, potentially sub-optimal policy. We find that the performance of state-of-the-art offline and model-based RL methods degrades significantly given such limited data availability, even for commonly perceived "solved…

    Submitted 10 November, 2021; v1 submitted 13 February, 2021; originally announced February 2021.