Sampling Attacks on Meta Reinforcement Learning: A Minimax Formulation and Complexity Analysis

Li, Tao; Lei, Haozhe; Zhu, Quanyan

Computer Science > Machine Learning

arXiv:2208.00081 (cs)

[Submitted on 29 Jul 2022 (v1), last revised 8 Mar 2023 (this version, v2)]

Title:Sampling Attacks on Meta Reinforcement Learning: A Minimax Formulation and Complexity Analysis

Authors:Tao Li, Haozhe Lei, Quanyan Zhu

View PDF

Abstract:Meta reinforcement learning (meta RL), as a combination of meta-learning ideas and reinforcement learning (RL), enables the agent to adapt to different tasks using a few samples. However, this sampling-based adaptation also makes meta RL vulnerable to adversarial attacks. By manipulating the reward feedback from sampling processes in meta RL, an attacker can mislead the agent into building wrong knowledge from training experience, which deteriorates the agent's performance when dealing with different tasks after adaptation. This paper provides a game-theoretical underpinning for understanding this type of security risk. In particular, we formally define the sampling attack model as a Stackelberg game between the attacker and the agent, which yields a minimax formulation. It leads to two online attack schemes: Intermittent Attack and Persistent Attack, which enable the attacker to learn an optimal sampling attack, defined by an $\epsilon$-first-order stationary point, within $\mathcal{O}(\epsilon^{-2})$ iterations. These attack schemes freeride the learning progress concurrently without extra interactions with the environment. By corroborating the convergence results with numerical experiments, we observe that a minor effort of the attacker can significantly deteriorate the learning performance, and the minimax approach can also help robustify the meta RL algorithms.

Comments:	updates: github repo posted
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Cite as:	arXiv:2208.00081 [cs.LG]
	(or arXiv:2208.00081v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2208.00081

Submission history

From: Tao Li [view email]
[v1] Fri, 29 Jul 2022 21:29:29 UTC (4,877 KB)
[v2] Wed, 8 Mar 2023 01:29:10 UTC (2,097 KB)

Computer Science > Machine Learning

Title:Sampling Attacks on Meta Reinforcement Learning: A Minimax Formulation and Complexity Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Sampling Attacks on Meta Reinforcement Learning: A Minimax Formulation and Complexity Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators