Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Dropout Prompt Learning: Towards Robust and Adaptive Vision-Language Models

About

Dropout is a widely used regularization technique which improves the generalization ability of a model by randomly dropping neurons. In light of this, we propose Dropout Prompt Learning, which aims for applying dropout to improve the robustness of the vision-language models. Different from the vanilla dropout, we apply dropout on the tokens of the textual and visual branches, where we evaluate the token significance considering both intra-modal context and inter-modal alignment, enabling flexible dropout probabilities for each token. Moreover, to maintain semantic alignment for general knowledge transfer while encouraging the diverse representations that dropout introduces, we further propose residual entropy regularization. Experiments on 15 benchmarks show our method's effectiveness in challenging scenarios like low-shot learning, long-tail classification, and out-of-distribution generalization. Notably, our method surpasses regularization-based methods including KgCoOp by 5.10% and PromptSRC by 2.13% in performance on base-to-novel generalization.

Biao Chen, Lin Zuo, Mengmeng Jing, Kunbin He, Yuchen Wang• 2025

Related benchmarks

TaskDatasetResultRank
Base-to-New GeneralizationAvg over 11 datasets
Base Score86.12
90
Base-to-New GeneralizationDTD
Base Accuracy85.43
82
Base-to-New GeneralizationImageNet
Base Accuracy78.24
81
Base-to-New GeneralizationFGVCAircraft
Base Performance49.26
78
Base-to-New GeneralizationUCF101
Base Accuracy88.16
71
Base-to-New GeneralizationOxfordPets
Base Score96.38
64
Base-to-New GeneralizationCaltech101
Base Score98.72
58
Base-to-New GeneralizationStanfordCars
Base Score82.87
57
Image ClassificationImageNet to 10 Target Datasets (Caltech101, OxfordPets, StanfordCars, Flowers102, Food101, FGVCAircraft, SUN397, DTD, EuroSAT, UCF101) (test)
ImageNet Accuracy71.94
48
Base-to-novel generalizationSUN397
Base Score83.82
43
Showing 10 of 15 rows

Other info

Follow for update