Regret-Based Defense in Adversarial Reinforcement Learning

Belaire, Roman; Varakantham, Pradeep; Nguyen, Thanh; Lo, David

Computer Science > Machine Learning

arXiv:2302.06912 (cs)

[Submitted on 14 Feb 2023 (v1), last revised 27 Mar 2024 (this version, v4)]

Title:Regret-Based Defense in Adversarial Reinforcement Learning

Authors:Roman Belaire, Pradeep Varakantham, Thanh Nguyen, David Lo

View PDF HTML (experimental)

Abstract:Deep Reinforcement Learning (DRL) policies have been shown to be vulnerable to small adversarial noise in observations. Such adversarial noise can have disastrous consequences in safety-critical environments. For instance, a self-driving car receiving adversarially perturbed sensory observations about nearby signs (e.g., a stop sign physically altered to be perceived as a speed limit sign) or objects (e.g., cars altered to be recognized as trees) can be fatal. Existing approaches for making RL algorithms robust to an observation-perturbing adversary have focused on reactive approaches that iteratively improve against adversarial examples generated at each iteration. While such approaches have been shown to provide improvements over regular RL methods, they are reactive and can fare significantly worse if certain categories of adversarial examples are not generated during training. To that end, we pursue a more proactive approach that relies on directly optimizing a well-studied robustness measure, regret instead of expected value. We provide a principled approach that minimizes maximum regret over a "neighborhood" of observations to the received "observation". Our regret criterion can be used to modify existing value- and policy-based Deep RL methods. We demonstrate that our approaches provide a significant improvement in performance across a wide variety of benchmarks against leading approaches for robust Deep RL.

Comments:	Accepted at AAMAS 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2302.06912 [cs.LG]
	(or arXiv:2302.06912v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.06912

Submission history

From: Roman Belaire [view email]
[v1] Tue, 14 Feb 2023 08:56:50 UTC (108 KB)
[v2] Wed, 15 Feb 2023 02:21:17 UTC (108 KB)
[v3] Wed, 23 Aug 2023 07:27:20 UTC (2,276 KB)
[v4] Wed, 27 Mar 2024 06:57:30 UTC (840 KB)

Computer Science > Machine Learning

Title:Regret-Based Defense in Adversarial Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Regret-Based Defense in Adversarial Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators