Towards Trustworthy Predictions from Deep Neural Networks with Fast Adversarial Calibration

Tomani, Christian; Buettner, Florian

arXiv:2012.10923 (cs)

[Submitted on 20 Dec 2020 (v1), last revised 2 Mar 2021 (this version, v2)]

Abstract: To facilitate widespread acceptance of AI systems that guide decision making in real-world applications, trustworthiness of deployed models is key. That is, it is crucial for predictive models to be uncertainty-aware and to yield well-calibrated (and thus trustworthy) predictions both for in-domain samples and under domain shift. Recent efforts to account for predictive uncertainty include post-processing steps for trained neural networks, Bayesian neural networks, and alternative non-Bayesian approaches such as ensemble approaches and evidential deep learning. Here, we propose an efficient yet general modelling approach for obtaining well-calibrated, trustworthy probabilities for samples obtained after a domain shift. We introduce a new training strategy combining an entropy-encouraging loss term with an adversarial calibration loss term and demonstrate that this results in well-calibrated and technically trustworthy predictions for a wide range of domain drifts. We comprehensively evaluate previously proposed approaches on different data modalities, a large range of data sets including sequence data, network architectures and perturbation strategies. We observe that our modelling approach substantially outperforms existing state-of-the-art approaches, yielding well-calibrated predictions under domain drift.
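
The abstract's notion of a "well-calibrated" prediction is commonly quantified with the expected calibration error (ECE): predictions are grouped into confidence bins, and the gap between average confidence and observed accuracy is averaged over bins, weighted by bin size. The following is a minimal sketch of that metric, not the authors' evaluation code. The function name, the equal-width 10-bin scheme, and the toy data are assumptions made for illustration.

```python
from typing import Sequence


def expected_calibration_error(
    confidences: Sequence[float],
    correct: Sequence[bool],
    n_bins: int = 10,
) -> float:
    """Binned gap between confidence and accuracy (lower is better calibrated)."""
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    if not confidences:
        raise ValueError("need at least one prediction")

    # Per-bin running sums: [sum of confidences, number correct, count].
    bins = [[0.0, 0, 0] for _ in range(n_bins)]
    for conf, hit in zip(confidences, correct):
        # Equal-width bins over [0, 1]; a confidence of exactly 1.0 goes in the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx][0] += conf
        bins[idx][1] += int(hit)
        bins[idx][2] += 1

    total = len(confidences)
    ece = 0.0
    for conf_sum, n_correct, count in bins:
        if count == 0:
            continue
        avg_conf = conf_sum / count
        accuracy = n_correct / count
        # Weight each bin's |accuracy - confidence| gap by its share of samples.
        ece += (count / total) * abs(accuracy - avg_conf)
    return ece


# Hypothetical toy data: both models are right half the time,
# but one reports 95% confidence and the other 55%.
overconfident = expected_calibration_error([0.95] * 4, [True, True, False, False])
calibrated = expected_calibration_error([0.55] * 4, [True, True, False, False])
```

In this toy setup the overconfident model gets a much larger ECE than the model whose confidence sits close to its actual accuracy, which is the kind of gap a calibration-oriented training strategy aims to shrink, including on shifted data.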

Comments:	In Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-2021). Code available at this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2012.10923 [cs.LG]
	(or arXiv:2012.10923v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2012.10923

Submission history

From: Christian Tomani
[v1] Sun, 20 Dec 2020 13:39:29 UTC (484 KB)
[v2] Tue, 2 Mar 2021 19:27:46 UTC (1,091 KB)
