Removing Undesirable Feature Contributions Using Out-of-Distribution Data

Lee, Saehyung; Park, Changhwa; Lee, Hyungyu; Yi, Jihun; Lee, Jonghyun; Yoon, Sungroh

Computer Science > Machine Learning

arXiv:2101.06639 (cs)

[Submitted on 17 Jan 2021 (v1), last revised 21 Nov 2021 (this version, v3)]

Title:Removing Undesirable Feature Contributions Using Out-of-Distribution Data

Authors:Saehyung Lee, Changhwa Park, Hyungyu Lee, Jihun Yi, Jonghyun Lee, Sungroh Yoon

View PDF

Abstract:Several data augmentation methods deploy unlabeled-in-distribution (UID) data to bridge the gap between the training and inference of neural networks. However, these methods have clear limitations in terms of availability of UID data and dependence of algorithms on pseudo-labels. Herein, we propose a data augmentation method to improve generalization in both adversarial and standard learning by using out-of-distribution (OOD) data that are devoid of the abovementioned issues. We show how to improve generalization theoretically using OOD data in each learning scenario and complement our theoretical analysis with experiments on CIFAR-10, CIFAR-100, and a subset of ImageNet. The results indicate that undesirable features are shared even among image data that seem to have little correlation from a human point of view. We also present the advantages of the proposed method through comparison with other data augmentation methods, which can be used in the absence of UID data. Furthermore, we demonstrate that the proposed method can further improve the existing state-of-the-art adversarial training.

Comments:	Published as a conference paper at ICLR 2021
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2101.06639 [cs.LG]
	(or arXiv:2101.06639v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2101.06639

Submission history

From: Saehyung Lee [view email]
[v1] Sun, 17 Jan 2021 10:26:34 UTC (169 KB)
[v2] Wed, 3 Mar 2021 05:40:51 UTC (171 KB)
[v3] Sun, 21 Nov 2021 00:41:56 UTC (169 KB)

Computer Science > Machine Learning

Title:Removing Undesirable Feature Contributions Using Out-of-Distribution Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Removing Undesirable Feature Contributions Using Out-of-Distribution Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators