Few-shot Learner Parameterization by Diffusion Time-steps

Yue, Zhongqi; Zhou, Pan; Hong, Richang; Zhang, Hanwang; Sun, Qianru

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.02649 (cs)

[Submitted on 5 Mar 2024 (v1), last revised 27 Mar 2024 (this version, v2)]

Title:Few-shot Learner Parameterization by Diffusion Time-steps

Authors:Zhongqi Yue, Pan Zhou, Richang Hong, Hanwang Zhang, Qianru Sun

View PDF HTML (experimental)

Abstract:Even when using large multi-modal foundation models, few-shot learning is still challenging -- if there is no proper inductive bias, it is nearly impossible to keep the nuanced class attributes while removing the visually prominent attributes that spuriously correlate with class labels. To this end, we find an inductive bias that the time-steps of a Diffusion Model (DM) can isolate the nuanced class attributes, i.e., as the forward diffusion adds noise to an image at each time-step, nuanced attributes are usually lost at an earlier time-step than the spurious attributes that are visually prominent. Building on this, we propose Time-step Few-shot (TiF) learner. We train class-specific low-rank adapters for a text-conditioned DM to make up for the lost attributes, such that images can be accurately reconstructed from their noisy ones given a prompt. Hence, at a small time-step, the adapter and prompt are essentially a parameterization of only the nuanced class attributes. For a test image, we can use the parameterization to only extract the nuanced class attributes for classification. TiF learner significantly outperforms OpenCLIP and its adapters on a variety of fine-grained and customized few-shot learning tasks. Codes are in this https URL.

Comments:	Accepted by CVPR 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.02649 [cs.CV]
	(or arXiv:2403.02649v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.02649

Submission history

From: Zhongqi Yue [view email]
[v1] Tue, 5 Mar 2024 04:38:13 UTC (9,970 KB)
[v2] Wed, 27 Mar 2024 03:34:00 UTC (9,971 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Few-shot Learner Parameterization by Diffusion Time-steps

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Few-shot Learner Parameterization by Diffusion Time-steps

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators