Provable Traffic Rule Compliance in Safe Reinforcement Learning on the Open Sea

Krasowski, Hanna; Althoff, Matthias

doi:10.1109/TIV.2024.3400597


arXiv:2402.08502 (cs)

[Submitted on 13 Feb 2024 (v1), last revised 16 May 2024 (this version, v2)]


Abstract: For safe operation, autonomous vehicles have to obey traffic rules that are set forth in legal documents formulated in natural language. Temporal logic is a suitable concept to formalize such traffic rules. Still, temporal logic rules often result in constraints that are hard to solve using optimization-based motion planners. Reinforcement learning (RL) is a promising method to find motion plans for autonomous vehicles. However, vanilla RL algorithms are based on random exploration and do not automatically comply with traffic rules. Our approach accomplishes guaranteed rule-compliance by integrating temporal logic specifications into RL. Specifically, we consider the application of vessels on the open sea, which must adhere to the Convention on the International Regulations for Preventing Collisions at Sea (COLREGS). To efficiently synthesize rule-compliant actions, we combine predicates based on set-based prediction with a statechart representing our formalized rules and their priorities. Action masking then restricts the RL agent to this set of verified rule-compliant actions. In numerical evaluations on critical maritime traffic situations, our agent always complies with the formalized legal rules and never collides while achieving a high goal-reaching rate during training and deployment. In contrast, vanilla and traffic rule-informed RL agents frequently violate traffic rules and collide even after training.
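
The action-masking step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a discrete action space, and it takes the set of verified rule-compliant actions as a given boolean mask, whereas the paper derives that set from set-based prediction and a statechart. The function names `masked_action_probabilities` and `sample_masked_action` are hypothetical, and raising an error on an empty mask is a choice made for this sketch only.

```python
import math
import random
from typing import List, Sequence


def masked_action_probabilities(
    logits: Sequence[float], mask: Sequence[bool]
) -> List[float]:
    """Softmax over the policy logits, restricted to actions with mask == True.

    `mask` is a hypothetical input standing in for the set of verified
    rule-compliant actions; how that set is computed is not shown here.
    """
    if len(logits) != len(mask):
        raise ValueError("logits and mask must have the same length")
    if not any(mask):
        # Assumption of this sketch: an empty set is treated as an error.
        raise ValueError("mask allows no action")
    # Subtract the largest allowed logit for numerical stability.
    max_allowed = max(l for l, ok in zip(logits, mask) if ok)
    weights = [
        math.exp(l - max_allowed) if ok else 0.0 for l, ok in zip(logits, mask)
    ]
    total = sum(weights)
    return [w / total for w in weights]


def sample_masked_action(
    logits: Sequence[float], mask: Sequence[bool], rng: random.Random
) -> int:
    """Sample an action index; masked-out actions have zero probability."""
    probs = masked_action_probabilities(logits, mask)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


if __name__ == "__main__":
    logits = [2.0, 0.5, 1.0, -1.0]        # made-up policy outputs
    mask = [False, True, True, False]     # made-up compliant-action set
    print(masked_action_probabilities(logits, mask))
    print(sample_masked_action(logits, mask, random.Random(0)))
```

Because masked-out actions receive exactly zero probability, the agent in this sketch can only ever select actions inside the mask, both while exploring and after training. Any guarantee therefore rests entirely on the mask itself being correct.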

Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2402.08502 [cs.LG]
	(or arXiv:2402.08502v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.08502
Related DOI:	https://doi.org/10.1109/TIV.2024.3400597

Submission history

From: Hanna Krasowski
[v1] Tue, 13 Feb 2024 14:59:19 UTC (1,275 KB)
[v2] Thu, 16 May 2024 21:14:14 UTC (1,503 KB)
