BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning

Oh, Changdae; Hwang, Hyeji; Lee, Hee-young; Lim, YongTaek; Jung, Geunyoung; Jung, Jiyoung; Choi, Hosik; Song, Kyungwoo

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.14773 (cs)

[Submitted on 26 Mar 2023 (v1), last revised 8 Jul 2023 (this version, v2)]

Title:BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning

Authors:Changdae Oh, Hyeji Hwang, Hee-young Lee, YongTaek Lim, Geunyoung Jung, Jiyoung Jung, Hosik Choi, Kyungwoo Song

View PDF

Abstract:With the surge of large-scale pre-trained models (PTMs), fine-tuning these models to numerous downstream tasks becomes a crucial problem. Consequently, parameter efficient transfer learning (PETL) of large models has grasped huge attention. While recent PETL methods showcase impressive performance, they rely on optimistic assumptions: 1) the entire parameter set of a PTM is available, and 2) a sufficiently large memory capacity for the fine-tuning is equipped. However, in most real-world applications, PTMs are served as a black-box API or proprietary software without explicit parameter accessibility. Besides, it is hard to meet a large memory requirement for modern PTMs. In this work, we propose black-box visual prompting (BlackVIP), which efficiently adapts the PTMs without knowledge about model architectures and parameters. BlackVIP has two components; 1) Coordinator and 2) simultaneous perturbation stochastic approximation with gradient correction (SPSA-GC). The Coordinator designs input-dependent image-shaped visual prompts, which improves few-shot adaptation and robustness on distribution/location shift. SPSA-GC efficiently estimates the gradient of a target model to update Coordinator. Extensive experiments on 16 datasets demonstrate that BlackVIP enables robust adaptation to diverse domains without accessing PTMs' parameters, with minimal memory requirements. Code: \url{this https URL}

Comments:	Accepted to CVPR 2023 (v2: citation error was fixed)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2303.14773 [cs.CV]
	(or arXiv:2303.14773v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.14773

Submission history

From: Changdae Oh [view email]
[v1] Sun, 26 Mar 2023 16:42:05 UTC (34,678 KB)
[v2] Sat, 8 Jul 2023 12:13:50 UTC (34,678 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators