Enhancing Fine-Tuning Based Backdoor Defense with Sharpness-Aware Minimization

Zhu, Mingli; Wei, Shaokui; Shen, Li; Fan, Yanbo; Wu, Baoyuan

Computer Science > Artificial Intelligence

arXiv:2304.11823 (cs)

[Submitted on 24 Apr 2023]

Title:Enhancing Fine-Tuning Based Backdoor Defense with Sharpness-Aware Minimization

Authors:Mingli Zhu, Shaokui Wei, Li Shen, Yanbo Fan, Baoyuan Wu

View PDF

Abstract:Backdoor defense, which aims to detect or mitigate the effect of malicious triggers introduced by attackers, is becoming increasingly critical for machine learning security and integrity. Fine-tuning based on benign data is a natural defense to erase the backdoor effect in a backdoored model. However, recent studies show that, given limited benign data, vanilla fine-tuning has poor defense performance. In this work, we provide a deep study of fine-tuning the backdoored model from the neuron perspective and find that backdoorrelated neurons fail to escape the local minimum in the fine-tuning process. Inspired by observing that the backdoorrelated neurons often have larger norms, we propose FTSAM, a novel backdoor defense paradigm that aims to shrink the norms of backdoor-related neurons by incorporating sharpness-aware minimization with fine-tuning. We demonstrate the effectiveness of our method on several benchmark datasets and network architectures, where it achieves state-of-the-art defense performance. Overall, our work provides a promising avenue for improving the robustness of machine learning models against backdoor attacks.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2304.11823 [cs.AI]
	(or arXiv:2304.11823v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2304.11823

Submission history

From: Mingli Zhu [view email]
[v1] Mon, 24 Apr 2023 05:13:52 UTC (10,593 KB)

Computer Science > Artificial Intelligence

Title:Enhancing Fine-Tuning Based Backdoor Defense with Sharpness-Aware Minimization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Enhancing Fine-Tuning Based Backdoor Defense with Sharpness-Aware Minimization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators