Towards Task-Prioritized Policy Composition

Rietz, Finn; Schaffernicht, Erik; Stoyanov, Todor; Stork, Johannes A.

arXiv:2209.09536 (cs)

[Submitted on 20 Sep 2022]

Abstract: Combining learned policies in a prioritized, ordered manner is desirable because it allows for modular design and facilitates data reuse through knowledge transfer. In control theory, prioritized composition is realized by null-space control, where low-priority control actions are projected into the null-space of high-priority control actions. Such a method is currently unavailable for Reinforcement Learning. We propose a task-prioritized composition framework for Reinforcement Learning, which involves a novel concept: the indifference-space of Reinforcement Learning policies. Our framework has the potential to facilitate knowledge transfer and modular design while greatly increasing data efficiency and data reuse for Reinforcement Learning agents. Further, our approach can ensure high-priority constraint satisfaction, which makes it promising for learning in safety-critical domains like robotics. Unlike null-space control, our approach allows learning globally optimal policies for the compound task by online learning in the indifference-space of higher-level policies after initial compound policy construction.
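
As an illustration of the control-theory baseline the abstract refers to (not the Reinforcement Learning framework proposed in the paper), the following is a minimal sketch of classical null-space projection, assuming a velocity-level formulation with a task Jacobian. The function names, the 3-joint system, and all numeric values are hypothetical and chosen only for this example.

```python
import numpy as np


def null_space_projector(J_high):
    """Return N = I - J^+ J, which maps any joint velocity into the
    null-space of the high-priority task Jacobian J_high."""
    J_pinv = np.linalg.pinv(J_high)
    return np.eye(J_high.shape[1]) - J_pinv @ J_high


def compose_prioritized(J_high, xdot_high, qdot_low):
    """Track the high-priority task velocity, then add the low-priority
    joint velocity restricted to directions that do not affect that task."""
    J_pinv = np.linalg.pinv(J_high)
    return J_pinv @ xdot_high + null_space_projector(J_high) @ qdot_low


# Hypothetical 3-joint system with a 1-D high-priority task.
J_high = np.array([[1.0, 1.0, 0.0]])   # assumed task Jacobian
xdot_high = np.array([0.5])            # desired high-priority task velocity
qdot_low = np.array([1.0, 0.0, 2.0])   # joint velocity requested by the low-priority task

qdot = compose_prioritized(J_high, xdot_high, qdot_low)

# The high-priority task is met exactly, regardless of qdot_low.
print(J_high @ qdot)
```

Because the low-priority command only acts inside the null-space of `J_high`, changing `qdot_low` never alters the high-priority task velocity. This is the strict ordering that the abstract describes as currently unavailable for learned policies.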

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as: arXiv:2209.09536 [cs.LG] (or arXiv:2209.09536v1 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2209.09536

Submission history

From: Finn Rietz
[v1] Tue, 20 Sep 2022 08:08:04 UTC (673 KB)
