Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup

Teney, Damien; Wang, **dong; Abbasnejad, Ehsan

Computer Science > Machine Learning

arXiv:2305.16817v1 (cs)

[Submitted on 26 May 2023 (this version), latest version 2 Jun 2023 (v2)]

Title:Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup

Authors:Damien Teney, **dong Wang, Ehsan Abbasnejad

View PDF

Abstract:Mixup is a highly successful technique to improve generalization of neural networks by augmenting the training data with combinations of random pairs. Selective mixup is a family of methods that apply mixup to specific pairs, e.g. only combining examples across classes or domains. These methods have claimed remarkable improvements on benchmarks with distribution shifts, but their mechanisms and limitations remain poorly understood.
We examine an overlooked aspect of selective mixup that explains its success in a completely new light. We find that the non-random selection of pairs affects the training distribution and improve generalization by means completely unrelated to the mixing. For example in binary classification, mixup across classes implicitly resamples the data for a uniform class distribution - a classical solution to label shift. We show empirically that this implicit resampling explains much of the improvements in prior work. Theoretically, these results rely on a regression toward the mean, an accidental property that we identify in several datasets.
We have found a new equivalence between two successful methods: selective mixup and resampling. We identify limits of the former, confirm the effectiveness of the latter, and find better combinations of their respective benefits.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2305.16817 [cs.LG]
	(or arXiv:2305.16817v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.16817

Submission history

From: Damien Teney [view email]
[v1] Fri, 26 May 2023 10:56:22 UTC (7,652 KB)
[v2] Fri, 2 Jun 2023 18:21:38 UTC (7,652 KB)

Computer Science > Machine Learning

Title:Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Selective Mixup Helps with Distribution Shifts, But Not (Only) because of Mixup

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators