Bayesian Prompt Learning for Image-Language Model Generalization

About

Foundational image-language models have generated considerable interest due to their efficient adaptation to downstream tasks by prompt learning. Prompt learning treats part of the language model input as trainable while freezing the rest, and optimizes an Empirical Risk Minimization objective. However, Empirical Risk Minimization is known to suffer from distributional shifts which hurt generalizability to prompts unseen during training. By leveraging the regularization ability of Bayesian methods, we frame prompt learning from the Bayesian perspective and formulate it as a variational inference problem. Our approach regularizes the prompt space, reduces overfitting to the seen prompts and improves the prompt generalization on unseen prompts. Our framework is implemented by modeling the input prompt space in a probabilistic manner, as an a priori distribution which makes our proposal compatible with prompt learning approaches that are unconditional or conditional on the image. We demonstrate empirically on 15 benchmarks that Bayesian prompt learning provides an appropriate coverage of the prompt space, prevents learning spurious features, and exploits transferable invariant features. This results in better generalization of unseen prompts, even across different datasets and domains. Code available at: https://github.com/saic-fi/Bayesian-Prompt-Learning

Mohammad Mahdi Derakhshani, Enrique Sanchez, Adrian Bulat, Victor Guilherme Turrisi da Costa, Cees G. M. Snoek, Georgios Tzimiropoulos, Brais Martinez• 2022

Related benchmarks

Task	Dataset	Result
Image Classification	ImageNet V2	Top-1 Acc64.23	749
Image Classification	ImageNet A	Top-1 Acc51.33	698
Image Classification	DTD	Accuracy46.1	599
Image Classification	ImageNet-R	Top-1 Acc77	581
Image Classification	Food-101	Accuracy86.3	570
Image Classification	EuroSAT	Accuracy45.87	569
Image Classification	Flowers102	Accuracy70.9	558
Image Classification	UCF101	Top-1 Acc68.67	527
Image Classification	ImageNet	--	431
Image Classification	SUN397	Accuracy67.47	425

Showing 10 of 67 rows

Other info

Code

Follow for update

@wizwand_team Discord