Provably Robust Detection of Out-of-distribution Data (almost) for free

Meinke, Alexander; Bitterwolf, Julian; Hein, Matthias

Computer Science > Machine Learning

arXiv:2106.04260 (cs)

[Submitted on 8 Jun 2021 (v1), last revised 18 Oct 2022 (this version, v2)]

Title:Provably Robust Detection of Out-of-distribution Data (almost) for free

Authors:Alexander Meinke, Julian Bitterwolf, Matthias Hein

View PDF

Abstract:The application of machine learning in safety-critical systems requires a reliable assessment of uncertainty. However, deep neural networks are known to produce highly overconfident predictions on out-of-distribution (OOD) data. Even if trained to be non-confident on OOD data, one can still adversarially manipulate OOD data so that the classifier again assigns high confidence to the manipulated samples. We show that two previously published defenses can be broken by better adapted attacks, highlighting the importance of robustness guarantees around OOD data. Since the existing method for this task is hard to train and significantly limits accuracy, we construct a classifier that can simultaneously achieve provably adversarially robust OOD detection and high clean accuracy. Moreover, by slightly modifying the classifier's architecture our method provably avoids the asymptotic overconfidence problem of standard neural networks. We provide code for all our experiments.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2106.04260 [cs.LG]
	(or arXiv:2106.04260v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.04260

Submission history

From: Alexander Meinke [view email]
[v1] Tue, 8 Jun 2021 11:40:49 UTC (508 KB)
[v2] Tue, 18 Oct 2022 11:40:06 UTC (1,098 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2021-06

Change to browse by:

cs
cs.AI
cs.CV

References & Citations

DBLP - CS Bibliography

listing | bibtex

Julian Bitterwolf
Matthias Hein

export BibTeX citation

Computer Science > Machine Learning

Title:Provably Robust Detection of Out-of-distribution Data (almost) for free

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Provably Robust Detection of Out-of-distribution Data (almost) for free

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators