Discovering environments with XRM

Pezeshki, Mohammad; Bouchacourt, Diane; Ibrahim, Mark; Ballas, Nicolas; Vincent, Pascal; Lopez-Paz, David

Computer Science > Machine Learning

arXiv:2309.16748 (cs)

[Submitted on 28 Sep 2023]

Title:Discovering environments with XRM

Authors:Mohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim, Nicolas Ballas, Pascal Vincent, David Lopez-Paz

View PDF

Abstract:Successful out-of-distribution generalization requires environment annotations. Unfortunately, these are resource-intensive to obtain, and their relevance to model performance is limited by the expectations and perceptual biases of human annotators. Therefore, to enable robust AI systems across applications, we must develop algorithms to automatically discover environments inducing broad generalization. Current proposals, which divide examples based on their training error, suffer from one fundamental problem. These methods add hyper-parameters and early-stop** criteria that are impossible to tune without a validation set with human-annotated environments, the very information subject to discovery. In this paper, we propose Cross-Risk-Minimization (XRM) to address this issue. XRM trains two twin networks, each learning from one random half of the training data, while imitating confident held-out mistakes made by its sibling. XRM provides a recipe for hyper-parameter tuning, does not require early-stop**, and can discover environments for all training and validation data. Domain generalization algorithms built on top of XRM environments achieve oracle worst-group-accuracy, solving a long-standing problem in out-of-distribution generalization.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2309.16748 [cs.LG]
	(or arXiv:2309.16748v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.16748

Submission history

From: Mohammad Pezeshki [view email]
[v1] Thu, 28 Sep 2023 17:55:45 UTC (1,191 KB)

Computer Science > Machine Learning

Title:Discovering environments with XRM

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Discovering environments with XRM

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators