Learn from the Past: A Proxy Guided Adversarial Defense Framework with Self Distillation Regularization

Liu, Yaohua; Gao, Jiaxin; Jiao, Xianghao; Liu, Zhu; Fan, Xin; Liu, Risheng

Computer Science > Machine Learning

arXiv:2310.12713 (cs)

[Submitted on 19 Oct 2023 (v1), last revised 10 Mar 2024 (this version, v2)]

Title:Learn from the Past: A Proxy Guided Adversarial Defense Framework with Self Distillation Regularization

Authors:Yaohua Liu, Jiaxin Gao, Xianghao Jiao, Zhu Liu, Xin Fan, Risheng Liu

View PDF HTML (experimental)

Abstract:Adversarial Training (AT), pivotal in fortifying the robustness of deep learning models, is extensively adopted in practical applications. However, prevailing AT methods, relying on direct iterative updates for target model's defense, frequently encounter obstacles such as unstable training and catastrophic overfitting. In this context, our work illuminates the potential of leveraging the target model's historical states as a proxy to provide effective initialization and defense prior, which results in a general proxy guided defense framework, `LAST' ({\bf L}earn from the P{\bf ast}). Specifically, LAST derives response of the proxy model as dynamically learned fast weights, which continuously corrects the update direction of the target model. Besides, we introduce a self-distillation regularized defense objective, ingeniously designed to steer the proxy model's update trajectory without resorting to external teacher models, thereby ameliorating the impact of catastrophic overfitting on performance. Extensive experiments and ablation studies showcase the framework's efficacy in markedly improving model robustness (e.g., up to 9.2\% and 20.3\% enhancement in robust accuracy on CIFAR10 and CIFAR100 datasets, respectively) and training stability. These improvements are consistently observed across various model architectures, larger datasets, perturbation sizes, and attack modalities, affirming LAST's ability to consistently refine both single-step and multi-step AT strategies. The code will be available at~\url{this https URL}.

Comments:	13 Pages
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2310.12713 [cs.LG]
	(or arXiv:2310.12713v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.12713

Submission history

From: Risheng Liu [view email]
[v1] Thu, 19 Oct 2023 13:13:41 UTC (32,095 KB)
[v2] Sun, 10 Mar 2024 16:17:08 UTC (19,122 KB)

Computer Science > Machine Learning

Title:Learn from the Past: A Proxy Guided Adversarial Defense Framework with Self Distillation Regularization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learn from the Past: A Proxy Guided Adversarial Defense Framework with Self Distillation Regularization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators