Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking

Stolz, Roland; Krasowski, Hanna; Thumm, Jakob; Eichelbeck, Michael; Gassert, Philipp; Althoff, Matthias

Computer Science > Machine Learning

arXiv:2406.03704 (cs)

[Submitted on 6 Jun 2024]

Title:Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking

Authors:Roland Stolz, Hanna Krasowski, Jakob Thumm, Michael Eichelbeck, Philipp Gassert, Matthias Althoff

View PDF HTML (experimental)

Abstract:Continuous action spaces in reinforcement learning (RL) are commonly defined as interval sets. While intervals usually reflect the action boundaries for tasks well, they can be challenging for learning because the typically large global action space leads to frequent exploration of irrelevant actions. Yet, little task knowledge can be sufficient to identify significantly smaller state-specific sets of relevant actions. Focusing learning on these relevant actions can significantly improve training efficiency and effectiveness. In this paper, we propose to focus learning on the set of relevant actions and introduce three continuous action masking methods for exactly map** the action space to the state-dependent set of relevant actions. Thus, our methods ensure that only relevant actions are executed, enhancing the predictability of the RL agent and enabling its use in safety-critical applications. We further derive the implications of the proposed methods on the policy gradient. Using Proximal Policy Optimization (PPO), we evaluate our methods on three control tasks, where the relevant action set is computed based on the system dynamics and a relevant state set. Our experiments show that the three action masking methods achieve higher final rewards and converge faster than the baseline without action masking.

Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2406.03704 [cs.LG]
	(or arXiv:2406.03704v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.03704

Submission history

From: Hanna Krasowski [view email]
[v1] Thu, 6 Jun 2024 02:55:16 UTC (429 KB)

Computer Science > Machine Learning

Title:Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators