DePT: Decoupled Prompt Tuning

About

This work breaks through the Base-New Tradeoff (BNT) dilemma in prompt tuning, i.e., the better the tuned model generalizes to the base (or target) task, the worse it generalizes to new tasks, and vice versa. Specifically, through an in-depth analysis of the learned features of the base and new tasks, we observe that the BNT stems from a channel bias issue, i.e., the vast majority of feature channels are occupied by base-specific knowledge, resulting in the collapse of task-shared knowledge important to new tasks. To address this, we propose the Decoupled Prompt Tuning (DePT) framework, which decouples base-specific knowledge from feature channels into an isolated feature space during prompt tuning, so as to maximally preserve task-shared knowledge in the original feature space for achieving better zero-shot generalization on new tasks. Importantly, our DePT is orthogonal to existing prompt tuning methods, hence it can improve all of them. Extensive experiments on 11 datasets show the strong flexibility and effectiveness of DePT. Our code and pretrained models are available at https://github.com/Koorye/DePT.
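The decoupling idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation (see the linked repository for that); it only assumes one plausible reading of the abstract: a similarity head keeps working in the original feature space, while a second head first moves features into a separate space through a per-channel scale and shift and then classifies base classes there. All function names, the temperature, and the loss weight `lam` are hypothetical.

```python
import math


def cosine_logits(image_feat, text_feats, temperature=0.01):
    """Original-space head: cosine similarity to each class text feature, divided by a temperature."""
    def unit(v):
        n = math.sqrt(sum(x * x for x in v))
        return [x / n for x in v]

    img = unit(image_feat)
    return [sum(a * b for a, b in zip(img, unit(t))) / temperature for t in text_feats]


def channel_affine(feat, scale, shift):
    """Map a feature into the isolated space with a per-channel scale and shift (assumed form)."""
    return [s * x + b for x, s, b in zip(feat, scale, shift)]


def linear_logits(feat, weights, bias):
    """Plain linear classifier over base classes, applied in the isolated space."""
    return [sum(w * x for w, x in zip(row, feat)) + b for row, b in zip(weights, bias)]


def cross_entropy(logits, label):
    """Cross-entropy for one sample, computed with log-sum-exp for numerical stability."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]


def decoupled_loss(image_feat, text_feats, label, scale, shift, weights, bias, lam=0.5):
    """Weighted sum of the original-space loss and the isolated-space loss (lam is hypothetical)."""
    shared = cross_entropy(cosine_logits(image_feat, text_feats), label)
    isolated = channel_affine(image_feat, scale, shift)
    base = cross_entropy(linear_logits(isolated, weights, bias), label)
    return lam * shared + (1.0 - lam) * base


def predict_new(image_feat, new_text_feats):
    """New classes have no trained linear head, so only the original-space head is used."""
    logits = cosine_logits(image_feat, new_text_feats)
    return max(range(len(logits)), key=logits.__getitem__)


if __name__ == "__main__":
    image = [1.0, 0.0]
    texts = [[1.0, 0.0], [0.0, 1.0]]
    loss = decoupled_loss(image, texts, 0, [1.0, 1.0], [0.0, 0.0],
                          [[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
    print("loss:", loss, "new-class prediction:", predict_new(image, texts))
```

In this sketch the base-specific parameters (`scale`, `shift`, `weights`, `bias`) live only in the second head, so the zero-shot path for new classes never touches them.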

Ji Zhang, Shihan Wu, Lianli Gao, Heng Tao Shen, Jingkuan Song • 2023

Related benchmarks

Task	Dataset	Result
Image Classification	ImageNet	Top-1 Accuracy 72.77	366
Image Classification	FGVC-Aircraft (test)	Accuracy 24.3	322
Image Classification	Stanford Cars (test)	Accuracy 66.23	320
Image Classification	DTD (test)	Accuracy 46.6	316
Image Classification	SUN397 (test)	Top-1 Accuracy 67.3	231
Image Classification	Caltech101 (test)	Accuracy 94.23	204
Image Classification	EuroSAT (test)	Accuracy 45.83	177
Image Classification	Flowers-102 (test)	Top-1 Accuracy 72.17	152
Base-to-New Generalization	Avg over 11 datasets	Base Score 85.28	102
Image Classification	Food101 (test)	--	97

Showing 10 of 65 rows
