Provably Robust Adversarial Examples

Dimitrov, Dimitar I.; Singh, Gagandeep; Gehr, Timon; Vechev, Martin

Computer Science > Machine Learning

arXiv:2007.12133 (cs)

[Submitted on 23 Jul 2020 (v1), last revised 17 Mar 2022 (this version, v3)]

Title:Provably Robust Adversarial Examples

Authors:Dimitar I. Dimitrov, Gagandeep Singh, Timon Gehr, Martin Vechev

View PDF

Abstract:We introduce the concept of provably robust adversarial examples for deep neural networks - connected input regions constructed from standard adversarial examples which are guaranteed to be robust to a set of real-world perturbations (such as changes in pixel intensity and geometric transformations). We present a novel method called PARADE for generating these regions in a scalable manner which works by iteratively refining the region initially obtained via sampling until a refined region is certified to be adversarial with existing state-of-the-art verifiers. At each step, a novel optimization procedure is applied to maximize the region's volume under the constraint that the convex relaxation of the network behavior with respect to the region implies a chosen bound on the certification objective. Our experimental evaluation shows the effectiveness of PARADE: it successfully finds large provably robust regions including ones containing $\approx 10^{573}$ adversarial examples for pixel intensity and $\approx 10^{599}$ for geometric perturbations. The provability enables our robust examples to be significantly more effective against state-of-the-art defenses based on randomized smoothing than the individual attacks used to construct the regions.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2007.12133 [cs.LG]
	(or arXiv:2007.12133v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2007.12133

Submission history

From: Dimitar I. Dimitrov [view email]
[v1] Thu, 23 Jul 2020 17:03:56 UTC (488 KB)
[v2] Sun, 26 Jul 2020 22:45:30 UTC (488 KB)
[v3] Thu, 17 Mar 2022 19:36:50 UTC (396 KB)

Computer Science > Machine Learning

Title:Provably Robust Adversarial Examples

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Provably Robust Adversarial Examples

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators