SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

Kiyohara, Haruka; Kishimoto, Ren; Kawakami, Kosuke; Kobayashi, Ken; Nakata, Kazuhide; Saito, Yuta

arXiv:2311.18206 (cs)

[Submitted on 30 Nov 2023 (v1), last revised 11 Mar 2024 (this version, v3)]

Abstract: This paper introduces SCOPE-RL, a comprehensive open-source Python library for offline reinforcement learning (offline RL), off-policy evaluation (OPE), and off-policy selection (OPS). Unlike most existing libraries, which focus solely on either policy learning or policy evaluation, SCOPE-RL integrates these two aspects, enabling flexible, end-to-end implementations of both offline RL and OPE. SCOPE-RL puts particular emphasis on its OPE modules, offering a range of OPE estimators and robust evaluation-of-OPE protocols. This enables more in-depth and reliable OPE than other packages provide. For instance, SCOPE-RL enhances OPE by estimating the entire reward distribution under a policy rather than only its point-wise expected value. Additionally, SCOPE-RL provides a more thorough evaluation-of-OPE by presenting the risk-return tradeoff in OPE results, going beyond the accuracy-only evaluations in the existing OPE literature. SCOPE-RL is designed with user accessibility in mind. Its user-friendly APIs, comprehensive documentation, and variety of easy-to-follow examples help researchers and practitioners efficiently implement and experiment with various offline RL methods and OPE estimators, tailored to their specific problem contexts. The documentation of SCOPE-RL is available at this https URL.
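To make the distinction between a point-wise expected value and the entire reward distribution concrete, here is a minimal sketch in plain Python. It uses generic trajectory-wise importance sampling, not the SCOPE-RL API; the function names, the toy logged data, and the two-step trajectories are all hypothetical and chosen only for illustration.

```python
from typing import Sequence


def importance_weights(
    behavior_probs: Sequence[Sequence[float]],
    evaluation_probs: Sequence[Sequence[float]],
) -> list:
    """Trajectory-wise weight: product over steps of pi_e(a|s) / pi_b(a|s)."""
    weights = []
    for b_traj, e_traj in zip(behavior_probs, evaluation_probs):
        w = 1.0
        for p_b, p_e in zip(b_traj, e_traj):
            w *= p_e / p_b
        weights.append(w)
    return weights


def is_value_estimate(returns: Sequence[float], weights: Sequence[float]) -> float:
    """Point estimate of the expected return under the evaluation policy."""
    return sum(w * g for w, g in zip(weights, returns)) / len(returns)


def is_cdf_estimate(
    returns: Sequence[float], weights: Sequence[float], threshold: float
) -> float:
    """Estimate of P(return <= threshold) under the evaluation policy."""
    raw = sum(w for w, g in zip(weights, returns) if g <= threshold) / len(returns)
    # Importance-weighted CDF estimates can leave [0, 1], so clip them.
    return min(max(raw, 0.0), 1.0)


# Hypothetical logged data: four two-step trajectories collected by a uniform
# behavior policy. Each entry is the probability of the logged action.
behavior_probs = [[0.5, 0.5], [0.5, 0.5], [0.5, 0.5], [0.5, 0.5]]
evaluation_probs = [[0.8, 0.8], [0.8, 0.2], [0.2, 0.8], [0.2, 0.2]]
returns = [2.0, 1.0, 1.0, 0.0]  # observed return of each trajectory

weights = importance_weights(behavior_probs, evaluation_probs)
value = is_value_estimate(returns, weights)
cdf_at_1 = is_cdf_estimate(returns, weights, threshold=1.0)

print(round(value, 4))     # approximately 1.6
print(round(cdf_at_1, 4))  # approximately 0.36
```

The point estimate summarizes the evaluation policy with a single number, while evaluating the CDF estimate over a grid of thresholds traces out the whole return distribution, from which risk measures such as quantiles can be read off.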

Comments:	preprint, open-source software: this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.18206 [cs.LG]
	(or arXiv:2311.18206v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.18206

Submission history

From: Haruka Kiyohara
[v1] Thu, 30 Nov 2023 02:56:43 UTC (4,141 KB)
[v2] Mon, 4 Dec 2023 18:42:03 UTC (4,142 KB)
[v3] Mon, 11 Mar 2024 00:38:57 UTC (4,150 KB)
