A Dirichlet Process Mixture of Robust Task Models for Scalable Lifelong Reinforcement Learning

Wang, Zhi; Chen, Chunlin; Dong, Daoyi

doi:10.1109/TCYB.2022.3170485

Computer Science > Machine Learning

arXiv:2205.10787v1 (cs)

[Submitted on 22 May 2022]

Title:A Dirichlet Process Mixture of Robust Task Models for Scalable Lifelong Reinforcement Learning

Authors:Zhi Wang, Chunlin Chen, Daoyi Dong

View PDF

Abstract:While reinforcement learning (RL) algorithms are achieving state-of-the-art performance in various challenging tasks, they can easily encounter catastrophic forgetting or interference when faced with lifelong streaming information. In the paper, we propose a scalable lifelong RL method that dynamically expands the network capacity to accommodate new knowledge while preventing past memories from being perturbed. We use a Dirichlet process mixture to model the non-stationary task distribution, which captures task relatedness by estimating the likelihood of task-to-cluster assignments and clusters the task models in a latent space. We formulate the prior distribution of the mixture as a Chinese restaurant process (CRP) that instantiates new mixture components as needed. The update and expansion of the mixture are governed by the Bayesian non-parametric framework with an expectation maximization (EM) procedure, which dynamically adapts the model complexity without explicit task boundaries or heuristics. Moreover, we use the domain randomization technique to train robust prior parameters for the initialization of each task model in the mixture, thus the resulting model can better generalize and adapt to unseen tasks. With extensive experiments conducted on robot navigation and locomotion domains, we show that our method successfully facilitates scalable lifelong RL and outperforms relevant existing methods.

Comments:	Manuscript accepted by IEEE Transactions on Cybernetics, 2022, DOI: DOI: https://doi.org/10.1109/TCYB.2022.3170485
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2205.10787 [cs.LG]
	(or arXiv:2205.10787v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.10787
Related DOI:	https://doi.org/10.1109/TCYB.2022.3170485

Submission history

From: Zhi Wang [view email]
[v1] Sun, 22 May 2022 09:48:41 UTC (3,399 KB)

Computer Science > Machine Learning

Title:A Dirichlet Process Mixture of Robust Task Models for Scalable Lifelong Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Dirichlet Process Mixture of Robust Task Models for Scalable Lifelong Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators