System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games

Sur, Indranil; Daniels, Zachary; Rahman, Abrar; Faber, Kamil; Gallardo, Gianmarco J.; Hayes, Tyler L.; Taylor, Cameron E.; Gurbuz, Mustafa Burak; Smith, James; Joshi, Sahana; Japkowicz, Nathalie; Baron, Michael; Kira, Zsolt; Kanan, Christopher; Corizzo, Roberto; Divakaran, Ajay; Piacentino, Michael; Hostetler, Jesse; Raghavan, Aswin

doi:10.1145/3564121.3565236

arXiv:2212.04603 (cs)

[Submitted on 8 Dec 2022]

Abstract: As Artificial and Robotic Systems are increasingly deployed and relied upon for real-world applications, it is important that they exhibit the ability to continually learn and adapt in dynamically changing environments, becoming Lifelong Learning Machines. Continual/lifelong learning (LL) involves minimizing catastrophic forgetting of old tasks while maximizing a model's capability to learn new tasks. This paper addresses the challenging lifelong reinforcement learning (L2RL) setting. Pushing the state-of-the-art forward in L2RL and making L2RL useful for practical applications requires more than developing individual L2RL algorithms; it requires making progress at the systems level, especially research into the non-trivial problem of how to integrate multiple L2RL algorithms into a common framework. In this paper, we introduce the Lifelong Reinforcement Learning Components Framework (L2RLCF), which standardizes L2RL systems and assimilates different continual learning components (each addressing different aspects of the lifelong learning problem) into a unified system. As an instantiation of L2RLCF, we develop a standard API allowing easy integration of novel lifelong learning components. We describe a case study that demonstrates how multiple independently developed LL components can be integrated into a single realized system. We also introduce an evaluation environment to measure the effect of combining various system components. Our evaluation environment employs different LL scenarios (sequences of tasks) consisting of Starcraft-2 minigames and allows for the fair, comprehensive, and quantitative comparison of different combinations of components within a challenging common evaluation environment.

Comments:	The Second International Conference on AIML Systems, October 12-15, 2022, Bangalore, India
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2212.04603 [cs.LG]
	(or arXiv:2212.04603v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2212.04603
Related DOI:	https://doi.org/10.1145/3564121.3565236

Submission history

From: Indranil Sur
[v1] Thu, 8 Dec 2022 23:32:57 UTC (156 KB)
