
Transductive Zero-Shot and Few-Shot CLIP

Authors: Ségolène Martin, Yunshi Huang, Fereshteh Shakeri, Jean-Christophe Pesquet, Ismail Ben Ayed

Abstract: Transductive inference has been widely investigated in few-shot image classification, but completely overlooked in the recent, fast-growing literature on adapting vision-language models like CLIP. This paper addresses the transductive zero-shot and few-shot CLIP classification challenge, in which inference is performed jointly across a mini-batch of unlabeled query samples, rather than treating each instance independently. We initially construct informative vision-text probability features, leading to a classification problem on the unit simplex set. Inspired by Expectation-Maximization (EM), our optimization-based classification objective models the data probability distribution for each class using a Dirichlet law. The minimization problem is then tackled with a novel block Majorization-Minimization algorithm, which simultaneously estimates the distribution parameters and class assignments. Extensive numerical experiments on 11 datasets underscore the benefits and efficacy of our batch inference approach. On zero-shot tasks with test batches of 75 samples, our approach yields a nearly 20% improvement in ImageNet accuracy over CLIP's zero-shot performance. Additionally, we outperform state-of-the-art methods in the few-shot setting. The code is available at: https://github.com/SegoleneMartin/transductive-CLIP.

Submitted 8 April, 2024; originally announced May 2024.

Comments: 2024 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2024, Seattle (USA), Washington, United States

arXiv:2404.02285 [pdf, other]

LP++: A Surprisingly Strong Linear Probe for Few-Shot CLIP

Authors: Yunshi Huang, Fereshteh Shakeri, Jose Dolz, Malik Boudiaf, Houda Bahig, Ismail Ben Ayed

Abstract: In a recent, strongly emergent literature on few-shot CLIP adaptation, Linear Probe (LP) has often been reported as a weak baseline. This has motivated intensive research building convoluted prompt learning or feature adaptation strategies. In this work, we propose and examine from convex-optimization perspectives a generalization of the standard LP baseline, in which the linear classifier weights are learnable functions of the text embedding, with class-wise multipliers blending image and text knowledge. As our objective function depends on two types of variables, i.e., the class visual prototypes and the learnable blending parameters, we propose a computationally efficient block coordinate Majorize-Minimize (MM) descent algorithm. In our full-batch MM optimizer, which we coin LP++, step sizes are implicit, unlike standard gradient descent practices where learning rates are intensively searched over validation sets. By examining the mathematical properties of our loss (e.g., Lipschitz gradient continuity), we build majorizing functions yielding data-driven learning rates and derive approximations of the loss's minima, which provide data-informed initialization of the variables. Our image-language objective function, along with these non-trivial optimization insights and ingredients, yields, surprisingly, highly competitive few-shot CLIP performance. Furthermore, LP++ operates in a black-box setting, relaxes intensive validation searches for the optimization hyper-parameters, and runs orders of magnitude faster than state-of-the-art few-shot CLIP adaptation methods. Our code is available at: https://github.com/FereshteShakeri/FewShot-CLIP-Strong-Baseline.git.

Submitted 2 April, 2024; originally announced April 2024.

arXiv:2206.00092 [pdf, other]

FHIST: A Benchmark for Few-shot Classification of Histological Images

Authors: Fereshteh Shakeri, Malik Boudiaf, Sina Mohammadi, Ivaxi Sheth, Mohammad Havaei, Ismail Ben Ayed, Samira Ebrahimi Kahou

Abstract: Few-shot learning has recently attracted wide interest in image classification, but almost all the current public benchmarks are focused on natural images. The few-shot paradigm is highly relevant in medical-imaging applications due to the scarcity of labeled data, as annotations are expensive and require specialized expertise. However, in medical imaging, few-shot learning research is sparse, limited to private data sets, and at an early stage. In particular, the few-shot setting is of high interest in histology due to the diversity and fine granularity of cancer-related tissue classification tasks, and the variety of data-preparation techniques. This paper introduces a highly diversified public benchmark, gathered from various public datasets, for few-shot histology data classification. We build few-shot tasks and base-training data with various tissue types, different levels of domain shifts stemming from various cancer sites, and different class-granularity levels, thereby reflecting realistic scenarios. We evaluate the performance of state-of-the-art few-shot learning methods on our benchmark, and observe that simple fine-tuning and regularization methods achieve better results than the popular meta-learning and episodic-training paradigm. Furthermore, we introduce three scenarios based on the domain shifts between the source and target histology data: near-domain, middle-domain and out-domain. Our experiments display the potential of few-shot learning in histology classification, with state-of-the-art few-shot learning methods approaching the supervised-learning baselines in the near-domain setting. In our out-domain setting, for 5-way 5-shot, the best performing method reaches 60% accuracy. We believe that our work could help in building realistic evaluations and fair comparisons of few-shot learning methods and will further encourage research in the few-shot paradigm.

Submitted 31 May, 2022; originally announced June 2022.

Comments: Code available at: https://github.com/mboudiaf/Few-shot-histology

arXiv:1507.05786 [pdf, other]

doi 10.1051/0004-6361/201424491

Solar extreme ultraviolet variability of the quiet Sun

Authors: F. Shakeri, L. Teriaca, S. K. Solanki

Abstract: The last solar minimum has been unusually quiet compared to the previous minima (since space-based radiometric measurements are available). The Sun's magnetic flux was substantially lower during this minimum. Some studies also show that the total solar irradiance during the minimum after cycle 23 may have dropped below the values known from the two minima prior to that. For chromospheric and coronal radiation, the situation is less clear-cut. The Sun's 10.7 cm flux shows a decrease of ~4% during the solar minimum in 2008 compared to the previous minimum, but Ca II K does not. Here we consider additional wavelengths in the extreme ultraviolet (EUV), specifically transitions of He I at 584.3 Å and O V at 629.7 Å, of which the CDS spectrometer aboard SOHO has been taking regular scans along the solar central meridian since 1996. We analysed this unique dataset to verify if and how the radiance distribution undergoes measurable variations between cycle minima. To achieve this aim we determined the radiance distribution of quiet areas around the Sun centre. Concentrating on the last two solar minima, we found out that there is very little variation in the radiance distribution of the chromospheric spectral line He I between these minima. The same analysis shows a modest, although significant, 4% variation in the radiance distribution of the transition region spectral line O V. These results are comparable to those obtained by earlier studies employing other spectral features, and they confirm that chromospheric indices display a small variation, whereas in the TR a more significant reduction of the brighter features is visible.

Submitted 21 July, 2015; originally announced July 2015.

Journal ref: A&A 581, A51 (2015)

Showing 1–4 of 4 results for author: Shakeri, F