Skip to main content

Showing 1–1 of 1 results for author: Schwantes, T

.
  1. arXiv:2312.11551  [pdf, other

    cs.LG cs.AI

    Probabilistic Offline Policy Ranking with Approximate Bayesian Computation

    Authors: Longchao Da, Porter Jenkins, Trevor Schwantes, Jeffrey Dotson, Hua Wei

    Abstract: In practice, it is essential to compare and rank candidate policies offline before real-world deployment for safety and reliability. Prior work seeks to solve this offline policy ranking (OPR) problem through value-based methods, such as Off-policy evaluation (OPE). However, they fail to analyze special cases performance (e.g., worst or best cases), due to the lack of holistic characterization of… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 19 pages with 7 pages main paper, 10 pages appendix. Accepted to AAAI 2024 main track

    ACM Class: I.2.6