Skip to main content

Showing 1–1 of 1 results for author: Birchler, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.01076  [pdf, other

    cs.LG cs.AI stat.ML

    Hallucinated Adversarial Control for Conservative Offline Policy Evaluation

    Authors: Jonas Rothfuss, Bhavya Sukhija, Tobias Birchler, Parnian Kassraie, Andreas Krause

    Abstract: We study the problem of conservative off-policy evaluation (COPE) where given an offline dataset of environment interactions, collected by other agents, we seek to obtain a (tight) lower bound on a policy's performance. This is crucial when deciding whether a given policy satisfies certain minimal performance/safety criteria before it can be deployed in the real world. To this end, we introduce HA… ▽ More

    Submitted 26 May, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: Conference on Uncertainty in Artificial Intelligence (UAI) 2023, first three authors contributed equally