BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning

About

With the surge of large-scale pre-trained models (PTMs), fine-tuning these models to numerous downstream tasks becomes a crucial problem. Consequently, parameter efficient transfer learning (PETL) of large models has grasped huge attention. While recent PETL methods showcase impressive performance, they rely on optimistic assumptions: 1) the entire parameter set of a PTM is available, and 2) a sufficiently large memory capacity for the fine-tuning is equipped. However, in most real-world applications, PTMs are served as a black-box API or proprietary software without explicit parameter accessibility. Besides, it is hard to meet a large memory requirement for modern PTMs. In this work, we propose black-box visual prompting (BlackVIP), which efficiently adapts the PTMs without knowledge about model architectures and parameters. BlackVIP has two components; 1) Coordinator and 2) simultaneous perturbation stochastic approximation with gradient correction (SPSA-GC). The Coordinator designs input-dependent image-shaped visual prompts, which improves few-shot adaptation and robustness on distribution/location shift. SPSA-GC efficiently estimates the gradient of a target model to update Coordinator. Extensive experiments on 16 datasets demonstrate that BlackVIP enables robust adaptation to diverse domains without accessing PTMs' parameters, with minimal memory requirements. Code: \url{https://github.com/changdaeoh/BlackVIP}

Changdae Oh, Hyeji Hwang, Hee-young Lee, YongTaek Lim, Geunyoung Jung, Jiyoung Jung, Hosik Choi, Kyungwoo Song• 2023

Related benchmarks

Task	Dataset	Result
Image Classification	Food-101	Accuracy86.6	570
Image Classification	EuroSAT	Accuracy73.1	569
Image Classification	UCF101	Top-1 Acc69.1	527
Image Classification	RESISC45	Accuracy64.5	472
Image Classification	SUN397	Accuracy64.7	425
Image Classification	SVHN	Accuracy44.3	395
Image Classification	StanfordCars	--	384
Image Classification	Oxford-IIIT Pets	Accuracy89.7	378
Image Classification	ImageNet	Top-1 Accuracy67.1	366
Image Classification	Oxford Flowers 102	Accuracy70.6	234

Showing 10 of 38 rows

Other info

Code

Follow for update

@wizwand_team Discord