On-the-fly Object Detection using StyleGAN with CLIP Guidance

Lu, Yuzhe; Liu, Shusen; Thiagarajan, Jayaraman J.; Sakla, Wesam; Anirudh, Rushil

arXiv:2210.16742 (cs)

[Submitted on 30 Oct 2022]

Abstract: We present a fully automated framework for building object detectors on satellite imagery without requiring any human annotation or intervention. We achieve this by leveraging the combined power of modern generative models (e.g., StyleGAN) and recent advances in multi-modal learning (e.g., CLIP). While deep generative models effectively encode the key semantics pertinent to a data distribution, this information is not immediately accessible for downstream tasks, such as object detection. In this work, we exploit CLIP's ability to associate image features with text descriptions to identify neurons in the generator network, which are subsequently used to build detectors on-the-fly.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2210.16742 [cs.CV]
	(or arXiv:2210.16742v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.16742

Submission history

From: Yuzhe Lu
[v1] Sun, 30 Oct 2022 04:43:01 UTC (1,842 KB)
