Skip to main content

Showing 1–1 of 1 results for author: Nöther, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.13851  [pdf, other

    cs.LG cs.AI cs.CR cs.MA

    Implicit Poisoning Attacks in Two-Agent Reinforcement Learning: Adversarial Policies for Training-Time Attacks

    Authors: Mohammad Mohammadi, Jonathan Nöther, Debmalya Mandal, Adish Singla, Goran Radanovic

    Abstract: In targeted poisoning attacks, an attacker manipulates an agent-environment interaction to force the agent into adopting a policy of interest, called target policy. Prior work has primarily focused on attacks that modify standard MDP primitives, such as rewards or transitions. In this paper, we study targeted poisoning attacks in a two-agent setting where an attacker implicitly poisons the effecti… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.