Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning

Azran, Guy; Danesh, Mohamad H.; Albrecht, Stefano V.; Keren, Sarah

Computer Science > Artificial Intelligence

arXiv:2307.05209 (cs)

[Submitted on 11 Jul 2023 (v1), last revised 21 Feb 2024 (this version, v4)]

Title:Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning

Authors:Guy Azran, Mohamad H. Danesh, Stefano V. Albrecht, Sarah Keren

View PDF HTML (experimental)

Abstract:Recent studies show that deep reinforcement learning (DRL) agents tend to overfit to the task on which they were trained and fail to adapt to minor environment changes. To expedite learning when transferring to unseen tasks, we propose a novel approach to representing the current task using reward machines (RMs), state machine abstractions that induce subtasks based on the current task's rewards and dynamics. Our method provides agents with symbolic representations of optimal transitions from their current abstract state and rewards them for achieving these transitions. These representations are shared across tasks, allowing agents to exploit knowledge of previously encountered symbols and transitions, thus enhancing transfer. Empirical results show that our representations improve sample efficiency and few-shot transfer in a variety of domains.

Comments:	Proceedings of the 38th AAAI Conference on Artificial Intelligence (AAAI), 2024
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2307.05209 [cs.AI]
	(or arXiv:2307.05209v4 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2307.05209

Submission history

From: Guy Azran [view email]
[v1] Tue, 11 Jul 2023 12:28:05 UTC (495 KB)
[v2] Wed, 20 Dec 2023 10:51:06 UTC (597 KB)
[v3] Sun, 18 Feb 2024 11:58:39 UTC (597 KB)
[v4] Wed, 21 Feb 2024 01:06:35 UTC (597 KB)

Computer Science > Artificial Intelligence

Title:Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Contextual Pre-planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators