Prompt Learning via Meta-Regularization

About

Pre-trained vision-language models have shown impressive success on various computer vision tasks with their zero-shot generalizability. Recently, prompt learning approaches have been explored to efficiently and effectively adapt the vision-language models to a variety of downstream tasks. However, most existing prompt learning methods suffer from task overfitting since the general knowledge of the pre-trained vision-language models is forgotten while the prompts are fine-tuned on a small dataset from a specific target task. To address this issue, we propose Prompt Meta-Regularization (ProMetaR) to improve the generalizability of prompt learning for vision-language models. Specifically, ProMetaR meta-learns both the regularizer and the soft prompts to harness the task-specific knowledge from the downstream tasks and the task-agnostic general knowledge from the vision-language models. Further, ProMetaR augments the task to generate multiple virtual tasks to alleviate meta-overfitting. In addition, we provide an analysis of how ProMetaR improves the generalizability of prompt tuning from the perspective of gradient alignment. Our extensive experiments demonstrate that ProMetaR improves the generalizability of conventional prompt learning methods under base-to-base/base-to-new and domain generalization settings. The code of ProMetaR is available at https://github.com/mlvlab/ProMetaR.
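
The gradient-alignment view mentioned in the abstract can be illustrated with a toy example. The sketch below is not the ProMetaR implementation: it assumes a quadratic task loss, a quadratic regularizer that pulls a prompt vector toward a fixed anchor, and a single inner step on the regularizer. The names `cosine_alignment`, `adapted_task_loss`, `target`, `anchor`, and `alpha` are all hypothetical. Under these assumptions, a first-order expansion gives L(p - alpha * g_reg) ≈ L(p) - alpha * (g_task · g_reg), so the post-adaptation task loss drops when the two gradients point the same way and rises when they conflict.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def cosine_alignment(g_a, g_b):
    """Cosine similarity between two gradients; 0.0 if either is zero."""
    norm = math.sqrt(dot(g_a, g_a)) * math.sqrt(dot(g_b, g_b))
    return dot(g_a, g_b) / norm if norm > 0 else 0.0


def task_loss(p, target):
    """Toy task loss (assumption): 0.5 * ||p - target||^2."""
    return 0.5 * sum((a - b) ** 2 for a, b in zip(p, target))


def task_grad(p, target):
    """Gradient of the toy task loss with respect to p."""
    return [a - b for a, b in zip(p, target)]


def reg_grad(p, anchor):
    """Gradient of a toy regularizer 0.5 * ||p - anchor||^2 (assumption)."""
    return [a - b for a, b in zip(p, anchor)]


def adapted_task_loss(p, target, anchor, alpha):
    """Task loss after one inner gradient step on the regularizer."""
    g_reg = reg_grad(p, anchor)
    p_adapted = [a - alpha * g for a, g in zip(p, g_reg)]
    return task_loss(p_adapted, target)


if __name__ == "__main__":
    p, target, alpha = [1.0, 0.0], [0.0, 0.0], 0.1
    for anchor in ([0.0, 0.0], [2.0, 0.0]):
        align = cosine_alignment(task_grad(p, target), reg_grad(p, anchor))
        before = task_loss(p, target)
        after = adapted_task_loss(p, target, anchor, alpha)
        print(f"anchor={anchor} alignment={align:+.1f} "
              f"loss before={before:.3f} after={after:.3f}")
```

In this toy setting, an anchor that agrees with the task optimum lowers the task loss after the regularizer step, while a conflicting anchor raises it. A learned regularizer, as the abstract describes, would presumably be trained to avoid the second case.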

Jinyoung Park, Juyeon Ko, Hyunwoo J. Kim • 2024

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Image Classification | EuroSAT | Accuracy | 45.02 | 497 |
| Image Classification | Food-101 | Accuracy | 86.39 | 494 |
| Image Classification | DTD | Accuracy | 47.08 | 487 |
| Image Classification | Flowers102 | Accuracy | 71.13 | 478 |
| Image Classification | SUN397 | Accuracy | 67.41 | 425 |
| Image Classification | UCF101 | Top-1 Acc | 69.5 | 404 |
| Image Classification | ImageNet | Top-1 Accuracy | 71.29 | 324 |
| Image Classification | Aircraft | Accuracy | 24.78 | 302 |
| Image Classification | StanfordCars | Accuracy | 65.83 | 266 |
| Image Classification | Caltech101 | Base Accuracy | 98.11 | 129 |
Showing 10 of 24 rows
