Domain-Controlled Prompt Learning

Cao, Qinglong; Xu, Zhengqin; Chen, Yuntian; Ma, Chao; Yang, Xiaokang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2310.07730 (cs)

[Submitted on 30 Sep 2023 (v1), last revised 12 Dec 2023 (this version, v2)]

Title:Domain-Controlled Prompt Learning

Authors:Qinglong Cao, Zhengqin Xu, Yuntian Chen, Chao Ma, Xiaokang Yang

View PDF HTML (experimental)

Abstract:Large pre-trained vision-language models, such as CLIP, have shown remarkable generalization capabilities across various tasks when appropriate text prompts are provided. However, adapting these models to specific domains, like remote sensing images (RSIs), medical images, etc, remains unexplored and challenging. Existing prompt learning methods often lack domain-awareness or domain-transfer mechanisms, leading to suboptimal performance due to the misinterpretation of specific images in natural image patterns. To tackle this dilemma, we proposed a \textbf{Domain-Controlled Prompt Learning} for the specific domains. Specifically, the large-scale specific domain foundation model (LSDM) is first introduced to provide essential specific domain knowledge. Using lightweight neural networks, we transfer this knowledge into domain biases, which control both the visual and language branches to obtain domain-adaptive prompts in a directly incorporating manner. Simultaneously, to overcome the existing overfitting challenge, we propose a novel noisy-adding strategy, without extra trainable parameters, to help the model escape the suboptimal solution in a global domain oscillation manner. Experimental results show our method achieves state-of-the-art performance in specific domain image recognition datasets. Our code is available at this https URL.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2310.07730 [cs.CV]
	(or arXiv:2310.07730v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2310.07730

Submission history

From: Yuntian Chen [view email]
[v1] Sat, 30 Sep 2023 02:59:49 UTC (3,121 KB)
[v2] Tue, 12 Dec 2023 08:56:17 UTC (3,121 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Domain-Controlled Prompt Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Domain-Controlled Prompt Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators