Towards Last-layer Retraining for Group Robustness with Fewer Annotations

LaBonte, Tyler; Muthukumar, Vidya; Kumar, Abhishek

Computer Science > Machine Learning

arXiv:2309.08534 (cs)

[Submitted on 15 Sep 2023 (v1), last revised 15 Nov 2023 (this version, v3)]

Title:Towards Last-layer Retraining for Group Robustness with Fewer Annotations

Authors:Tyler LaBonte, Vidya Muthukumar, Abhishek Kumar

View PDF

Abstract:Empirical risk minimization (ERM) of neural networks is prone to over-reliance on spurious correlations and poor generalization on minority groups. The recent deep feature reweighting (DFR) technique achieves state-of-the-art group robustness via simple last-layer retraining, but it requires held-out group and class annotations to construct a group-balanced reweighting dataset. In this work, we examine this impractical requirement and find that last-layer retraining can be surprisingly effective with no group annotations (other than for model selection) and only a handful of class annotations. We first show that last-layer retraining can greatly improve worst-group accuracy even when the reweighting dataset has only a small proportion of worst-group data. This implies a "free lunch" where holding out a subset of training data to retrain the last layer can substantially outperform ERM on the entire dataset with no additional data or annotations. To further improve group robustness, we introduce a lightweight method called selective last-layer finetuning (SELF), which constructs the reweighting dataset using misclassifications or disagreements. Our empirical and theoretical results present the first evidence that model disagreement upsamples worst-group data, enabling SELF to nearly match DFR on four well-established benchmarks across vision and language tasks with no group annotations and less than 3% of the held-out class annotations. Our code is available at this https URL.

Comments:	NeurIPS 2023
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2309.08534 [cs.LG]
	(or arXiv:2309.08534v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.08534

Submission history

From: Tyler LaBonte [view email]
[v1] Fri, 15 Sep 2023 16:52:29 UTC (127 KB)
[v2] Mon, 13 Nov 2023 18:24:16 UTC (103 KB)
[v3] Wed, 15 Nov 2023 04:18:39 UTC (104 KB)

Computer Science > Machine Learning

Title:Towards Last-layer Retraining for Group Robustness with Fewer Annotations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Last-layer Retraining for Group Robustness with Fewer Annotations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators