Are Transformers More Robust? Towards Exact Robustness Verification for Transformers

Liao, Brian Hsuan-Cheng; Cheng, Chih-Hong; Esen, Hasan; Knoll, Alois

Computer Science > Machine Learning

arXiv:2202.03932 (cs)

[Submitted on 8 Feb 2022 (v1), last revised 19 May 2023 (this version, v4)]

Title:Are Transformers More Robust? Towards Exact Robustness Verification for Transformers

Authors:Brian Hsuan-Cheng Liao, Chih-Hong Cheng, Hasan Esen, Alois Knoll

View PDF

Abstract:As an emerging type of Neural Networks (NNs), Transformers are used in many domains ranging from Natural Language Processing to Autonomous Driving. In this paper, we study the robustness problem of Transformers, a key characteristic as low robustness may cause safety concerns. Specifically, we focus on Sparsemax-based Transformers and reduce the finding of their maximum robustness to a Mixed Integer Quadratically Constrained Programming (MIQCP) problem. We also design two pre-processing heuristics that can be embedded in the MIQCP encoding and substantially accelerate its solving. We then conduct experiments using the application of Land Departure Warning to compare the robustness of Sparsemax-based Transformers against that of the more conventional Multi-Layer-Perceptron (MLP) NNs. To our surprise, Transformers are not necessarily more robust, leading to profound considerations in selecting appropriate NN architectures for safety-critical domain applications.

Comments:	Accepted at SafeComp 2023, 14 pages (Springer LNCS format), 3 figures, 2 tables, 2 algorithms
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2202.03932 [cs.LG]
	(or arXiv:2202.03932v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2202.03932

Submission history

From: Brian Hsuan-Cheng Liao [view email]
[v1] Tue, 8 Feb 2022 15:27:33 UTC (538 KB)
[v2] Wed, 21 Sep 2022 13:30:04 UTC (538 KB)
[v3] Thu, 23 Feb 2023 16:21:34 UTC (605 KB)
[v4] Fri, 19 May 2023 10:54:49 UTC (450 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2022-02

Change to browse by:

cs
cs.AI

References & Citations

DBLP - CS Bibliography

listing | bibtex

Chih-Hong Cheng
Maximilian Kneißl
Alois C. Knoll

export BibTeX citation

Computer Science > Machine Learning

Title:Are Transformers More Robust? Towards Exact Robustness Verification for Transformers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Are Transformers More Robust? Towards Exact Robustness Verification for Transformers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators