Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels

Rajeswar, Sai; Mazzaglia, Pietro; Verbelen, Tim; Piché, Alexandre; Dhoedt, Bart; Courville, Aaron; Lacoste, Alexandre

Computer Science > Artificial Intelligence

arXiv:2209.12016 (cs)

[Submitted on 24 Sep 2022 (v1), last revised 25 May 2023 (this version, v2)]

Title:Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels

Authors:Sai Rajeswar, Pietro Mazzaglia, Tim Verbelen, Alexandre Piché, Bart Dhoedt, Aaron Courville, Alexandre Lacoste

View PDF

Abstract:Controlling artificial agents from visual sensory data is an arduous task. Reinforcement learning (RL) algorithms can succeed but require large amounts of interactions between the agent and the environment. To alleviate the issue, unsupervised RL proposes to employ self-supervised interaction and learning, for adapting faster to future tasks. Yet, as shown in the Unsupervised RL Benchmark (URLB; Laskin et al. 2021), whether current unsupervised strategies can improve generalization capabilities is still unclear, especially in visual control settings. In this work, we study the URLB and propose a new method to solve it, using unsupervised model-based RL, for pre-training the agent, and a task-aware fine-tuning strategy combined with a new proposed hybrid planner, Dyna-MPC, to adapt the agent for downstream tasks. On URLB, our method obtains 93.59% overall normalized performance, surpassing previous baselines by a staggering margin. The approach is empirically evaluated through a large-scale empirical study, which we use to validate our design choices and analyze our models. We also show robust performance on the Real-Word RL benchmark, hinting at resiliency to environment perturbations during adaptation. Project website: this https URL

Comments:	Accepted at ICML 2023 (oral)
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2209.12016 [cs.AI]
	(or arXiv:2209.12016v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2209.12016

Submission history

From: Pietro Mazzaglia [view email]
[v1] Sat, 24 Sep 2022 14:22:29 UTC (2,951 KB)
[v2] Thu, 25 May 2023 00:50:57 UTC (2,599 KB)

Computer Science > Artificial Intelligence

Title:Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators