Generalization Bounds for Few-Shot Transfer Learning with Pretrained Classifiers

Galanti, Tomer; György, András; Hutter, Marcus

arXiv:2212.12532 (cs)

[Submitted on 23 Dec 2022 (v1), last revised 16 Jul 2023 (this version, v2)]

Abstract: We study the ability of foundation models to learn representations for classification that are transferable to new, unseen classes. Recent results in the literature show that representations learned by a single classifier over many classes are competitive on few-shot learning problems with representations learned by special-purpose algorithms designed for such problems. We offer a theoretical explanation for this behavior based on the recently discovered phenomenon of class-feature-variability collapse, that is, that during the training of deep classification networks the feature embeddings of samples belonging to the same class tend to concentrate around their class means. More specifically, we show that the few-shot error of the learned feature map on new classes (defined as the classification error of the nearest class-center classifier using centers learned from a small number of random samples from each new class) is small in case of class-feature-variability collapse, under the assumption that the classes are selected independently from a fixed distribution. This suggests that foundation models can provide feature maps that are transferable to new downstream tasks, even with very few samples; to our knowledge, this is the first performance bound for transfer-learning that is non-vacuous in the few-shot setting.
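
The few-shot error described in the abstract can be illustrated with a minimal sketch of a nearest class-center classifier. This is not the authors' code: the function names, the use of Euclidean distance, and the synthetic Gaussian "embeddings" standing in for the output of a pretrained feature map are all assumptions made for illustration.

```python
import numpy as np

def class_centers(support_x, support_y):
    """Estimate one center per class as the mean of its few support embeddings."""
    labels = np.unique(support_y)
    centers = np.stack([support_x[support_y == c].mean(axis=0) for c in labels])
    return labels, centers

def nearest_center_predict(query_x, labels, centers):
    """Assign each query embedding to the class with the closest center.

    Euclidean distance is an assumption of this sketch.
    """
    dists = np.linalg.norm(query_x[:, None, :] - centers[None, :, :], axis=-1)
    return labels[dists.argmin(axis=1)]

def few_shot_error(support_x, support_y, query_x, query_y):
    """Fraction of query samples misclassified by the nearest-center rule."""
    labels, centers = class_centers(support_x, support_y)
    preds = nearest_center_predict(query_x, labels, centers)
    return float(np.mean(preds != query_y))

def make_embeddings(rng, means, n_per_class, spread):
    """Hypothetical stand-in for a feature map: Gaussian clusters around class means."""
    dim = means.shape[1]
    x = np.concatenate(
        [m + spread * rng.standard_normal((n_per_class, dim)) for m in means]
    )
    y = np.repeat(np.arange(len(means)), n_per_class)
    return x, y

rng = np.random.default_rng(0)
means = 10.0 * np.eye(5, 16)  # 5 well-separated class means in 16 dimensions
# Small spread mimics embeddings concentrated around their class means.
support_x, support_y = make_embeddings(rng, means, n_per_class=5, spread=0.1)
query_x, query_y = make_embeddings(rng, means, n_per_class=50, spread=0.1)
error = few_shot_error(support_x, support_y, query_x, query_y)
print(f"few-shot error: {error:.3f}")
```

In this toy setup the within-class spread is tiny relative to the distance between class means, so five support samples per class are enough to place the centers accurately. Increasing `spread` toward the scale of the distance between means makes the estimated centers noisier and the error larger, which is the qualitative dependence the abstract describes.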

Comments:	arXiv admin note: substantial text overlap with arXiv:2112.15121
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2212.12532 [cs.LG]
	(or arXiv:2212.12532v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2212.12532

Submission history

From: Tomer Galanti
[v1] Fri, 23 Dec 2022 18:46:05 UTC (7,128 KB)
[v2] Sun, 16 Jul 2023 23:41:07 UTC (3,564 KB)
