Showing 1–1 of 1 results for author: Jordan, D R

  1. arXiv:2210.08642

    cs.LG cs.AI cs.RO stat.ML

    Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data

    Authors: Allen Nie, Yannis Flet-Berliac, Deon R. Jordan, William Steenbergen, Emma Brunskill

    Abstract: Offline reinforcement learning (RL) can be used to improve future performance by leveraging historical data. There exist many different algorithms for offline RL, and it is well recognized that these algorithms, and their hyperparameter settings, can lead to decision policies with substantially differing performance. This prompts the need for pipelines that allow practitioners to systematically pe…

    Submitted 12 January, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: 32 pages. Published at NeurIPS 2022. Presented at RLDM 2022