A Closer Look at the Adversarial Robustness of Information Bottleneck Models

Korshunova, Iryna; Stutz, David; Alemi, Alexander A.; Wiles, Olivia; Gowal, Sven

Computer Science > Machine Learning

arXiv:2107.05712 (cs)

[Submitted on 12 Jul 2021]

Title:A Closer Look at the Adversarial Robustness of Information Bottleneck Models

Authors:Iryna Korshunova, David Stutz, Alexander A. Alemi, Olivia Wiles, Sven Gowal

View PDF

Abstract:We study the adversarial robustness of information bottleneck models for classification. Previous works showed that the robustness of models trained with information bottlenecks can improve upon adversarial training. Our evaluation under a diverse range of white-box $l_{\infty}$ attacks suggests that information bottlenecks alone are not a strong defense strategy, and that previous results were likely influenced by gradient obfuscation.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2107.05712 [cs.LG]
	(or arXiv:2107.05712v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2107.05712

Submission history

From: Iryna Korshunova [view email]
[v1] Mon, 12 Jul 2021 20:05:08 UTC (3,884 KB)

Computer Science > Machine Learning

Title:A Closer Look at the Adversarial Robustness of Information Bottleneck Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Closer Look at the Adversarial Robustness of Information Bottleneck Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators