Dropout Prompt Learning: Towards Robust and Adaptive Vision-Language Models

About

Dropout is a widely used regularization technique which improves the generalization ability of a model by randomly dropping neurons. In light of this, we propose Dropout Prompt Learning, which aims for applying dropout to improve the robustness of the vision-language models. Different from the vanilla dropout, we apply dropout on the tokens of the textual and visual branches, where we evaluate the token significance considering both intra-modal context and inter-modal alignment, enabling flexible dropout probabilities for each token. Moreover, to maintain semantic alignment for general knowledge transfer while encouraging the diverse representations that dropout introduces, we further propose residual entropy regularization. Experiments on 15 benchmarks show our method's effectiveness in challenging scenarios like low-shot learning, long-tail classification, and out-of-distribution generalization. Notably, our method surpasses regularization-based methods including KgCoOp by 5.10% and PromptSRC by 2.13% in performance on base-to-novel generalization.

Biao Chen, Lin Zuo, Mengmeng Jing, Kunbin He, Yuchen Wang• 2025

Related benchmarks

Task	Dataset	Result
Base-to-New Generalization	Avg over 11 datasets	Base Score86.12	102
Base-to-New Generalization	DTD	Base Accuracy85.43	94
Base-to-New Generalization	ImageNet	Base Accuracy78.24	93
Base-to-New Generalization	FGVCAircraft	Base Performance49.26	90
Base-to-New Generalization	OxfordPets	Base Score96.38	76
Base-to-New Generalization	UCF101	Base Accuracy88.16	71
Base-to-New Generalization	Caltech101	Base Score98.72	70
Base-to-New Generalization	StanfordCars	Base Score82.87	69
Base-to-novel generalization	SUN397	Base Score83.82	55
Base-to-novel generalization	Flowers102	Base Accuracy98.61	55

Showing 10 of 15 rows

Other info

Follow for update

@wizwand_team Discord