Generalizable Adversarial Attacks with Latent Variable Perturbation Modelling

Bose, Avishek Joey; Cianflone, Andre; Hamilton, William L.

Computer Science > Machine Learning

arXiv:1905.10864 (cs)

[Submitted on 26 May 2019 (v1), last revised 20 Jan 2020 (this version, v3)]

Title:Generalizable Adversarial Attacks with Latent Variable Perturbation Modelling

Authors:Avishek Joey Bose, Andre Cianflone, William L. Hamilton

View PDF

Abstract:Adversarial attacks on deep neural networks traditionally rely on a constrained optimization paradigm, where an optimization procedure is used to obtain a single adversarial perturbation for a given input example. In this work we frame the problem as learning a distribution of adversarial perturbations, enabling us to generate diverse adversarial distributions given an unperturbed input. We show that this framework is domain-agnostic in that the same framework can be employed to attack different input domains with minimal modification. Across three diverse domains---images, text, and graphs---our approach generates whitebox attacks with success rates that are competitive with or superior to existing approaches, with a new state-of-the-art achieved in the graph domain. Finally, we demonstrate that our framework can efficiently generate a diverse set of attacks for a single given input, and is even capable of attacking \textit{unseen} test instances in a zero-shot manner, exhibiting attack generalization.

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:1905.10864 [cs.LG]
	(or arXiv:1905.10864v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1905.10864

Submission history

From: Avishek Bose [view email]
[v1] Sun, 26 May 2019 19:38:15 UTC (753 KB)
[v2] Wed, 12 Jun 2019 23:30:40 UTC (754 KB)
[v3] Mon, 20 Jan 2020 23:22:52 UTC (777 KB)

Computer Science > Machine Learning

Title:Generalizable Adversarial Attacks with Latent Variable Perturbation Modelling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Generalizable Adversarial Attacks with Latent Variable Perturbation Modelling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators