GeneCIS: A Benchmark for General Conditional Image Similarity

Vaze, Sagar; Carion, Nicolas; Misra, Ishan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.07969 (cs)

[Submitted on 13 Jun 2023]

Title:GeneCIS: A Benchmark for General Conditional Image Similarity

Authors:Sagar Vaze, Nicolas Carion, Ishan Misra

View PDF

Abstract:We argue that there are many notions of 'similarity' and that models, like humans, should be able to adapt to these dynamically. This contrasts with most representation learning methods, supervised or self-supervised, which learn a fixed embedding function and hence implicitly assume a single notion of similarity. For instance, models trained on ImageNet are biased towards object categories, while a user might prefer the model to focus on colors, textures or specific elements in the scene. In this paper, we propose the GeneCIS ('genesis') benchmark, which measures models' ability to adapt to a range of similarity conditions. Extending prior work, our benchmark is designed for zero-shot evaluation only, and hence considers an open-set of similarity conditions. We find that baselines from powerful CLIP models struggle on GeneCIS and that performance on the benchmark is only weakly correlated with ImageNet accuracy, suggesting that simply scaling existing methods is not fruitful. We further propose a simple, scalable solution based on automatically mining information from existing image-caption datasets. We find our method offers a substantial boost over the baselines on GeneCIS, and further improves zero-shot performance on related image retrieval benchmarks. In fact, though evaluated zero-shot, our model surpasses state-of-the-art supervised models on MIT-States. Project page at this https URL.

Comments:	CVPR 2023 (Highlighted Paper). Project page at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
Cite as:	arXiv:2306.07969 [cs.CV]
	(or arXiv:2306.07969v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.07969

Submission history

From: Sagar Vaze [view email]
[v1] Tue, 13 Jun 2023 17:59:58 UTC (3,752 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:GeneCIS: A Benchmark for General Conditional Image Similarity

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:GeneCIS: A Benchmark for General Conditional Image Similarity

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators