Unsupervised Model-based Pre-training for Data-efficient Control from Pixels

Rajeswar, Sai; Mazzaglia, Pietro; Verbelen, Tim; Piché, Alexandre; Dhoedt, Bart; Courville, Aaron; Lacoste, Alexandre

Computer Science > Artificial Intelligence

arXiv:2209.12016v1 (cs)

[Submitted on 24 Sep 2022 (this version), latest version 25 May 2023 (v2)]

Title:Unsupervised Model-based Pre-training for Data-efficient Control from Pixels

Authors:Sai Rajeswar, Pietro Mazzaglia, Tim Verbelen, Alexandre Piché, Bart Dhoedt, Aaron Courville, Alexandre Lacoste

View PDF

Abstract:Controlling artificial agents from visual sensory data is an arduous task. Reinforcement learning (RL) algorithms can succeed in this but require large amounts of interactions between the agent and the environment. To alleviate the issue, unsupervised RL proposes to employ self-supervised interaction and learning, for adapting faster to future tasks. Yet, whether current unsupervised strategies improve generalization capabilities is still unclear, especially in visual control settings. In this work, we design an effective unsupervised RL strategy for data-efficient visual control. First, we show that world models pre-trained with data collected using unsupervised RL can facilitate adaptation for future tasks. Then, we analyze several design choices to adapt efficiently, effectively reusing the agents' pre-trained components, and learning and planning in imagination, with our hybrid planner, which we dub Dyna-MPC. By combining the findings of a large-scale empirical study, we establish an approach that strongly improves performance on the Unsupervised RL Benchmark, requiring 20$\times$ less data to match the performance of supervised methods. The approach also demonstrates robust performance on the Real-Word RL benchmark, hinting that the approach generalizes to noisy environments.

Comments:	Presented at DARL Workshop @ ICML 2022
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2209.12016 [cs.AI]
	(or arXiv:2209.12016v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2209.12016

Submission history

From: Pietro Mazzaglia [view email]
[v1] Sat, 24 Sep 2022 14:22:29 UTC (2,951 KB)
[v2] Thu, 25 May 2023 00:50:57 UTC (2,599 KB)

Computer Science > Artificial Intelligence

Title:Unsupervised Model-based Pre-training for Data-efficient Control from Pixels

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Unsupervised Model-based Pre-training for Data-efficient Control from Pixels

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators