Dynamic Query Selection for Fast Visual Perceiver

Dancette, Corentin; Cord, Matthieu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2205.10873v1 (cs)

[Submitted on 22 May 2022 (this version), latest version 21 Mar 2023 (v2)]

Title:Dynamic Query Selection for Fast Visual Perceiver

Authors:Corentin Dancette, Matthieu Cord

View PDF

Abstract:Transformers have been matching deep convolutional networks for vision architectures in recent works. Most work is focused on getting the best results on large-scale benchmarks, and scaling laws seem to be the most successful strategy: bigger models, more data, and longer training result in higher performance. However, the reduction of network complexity and inference time remains under-explored. The Perceiver model offers a solution to this problem: by first performing a Cross-attention with a fixed number Q of latent query tokens, the complexity of the L-layers Transformer network that follows is bounded by O(LQ^2). In this work, we explore how to make Perceivers even more efficient, by reducing the number of queries Q during inference while limiting the accuracy drop.

Comments:	Accepted at the Transformer for Vision workshop, CVPR 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2205.10873 [cs.CV]
	(or arXiv:2205.10873v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2205.10873

Submission history

From: Corentin Dancette [view email]
[v1] Sun, 22 May 2022 17:23:51 UTC (862 KB)
[v2] Tue, 21 Mar 2023 10:53:32 UTC (862 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Dynamic Query Selection for Fast Visual Perceiver

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Dynamic Query Selection for Fast Visual Perceiver

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators