Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TCP:Textual-based Class-aware Prompt tuning for Visual-Language Model

About

Prompt tuning represents a valuable technique for adapting pre-trained visual-language models (VLM) to various downstream tasks. Recent advancements in CoOp-based methods propose a set of learnable domain-shared or image-conditional textual tokens to facilitate the generation of task-specific textual classifiers. However, those textual tokens have a limited generalization ability regarding unseen domains, as they cannot dynamically adjust to the distribution of testing classes. To tackle this issue, we present a novel Textual-based Class-aware Prompt tuning(TCP) that explicitly incorporates prior knowledge about classes to enhance their discriminability. The critical concept of TCP involves leveraging Textual Knowledge Embedding (TKE) to map the high generalizability of class-level textual knowledge into class-aware textual tokens. By seamlessly integrating these class-aware prompts into the Text Encoder, a dynamic class-aware classifier is generated to enhance discriminability for unseen domains. During inference, TKE dynamically generates class-aware prompts related to the unseen classes. Comprehensive evaluations demonstrate that TKE serves as a plug-and-play module effortlessly combinable with existing methods. Furthermore, TCP consistently achieves superior performance while demanding less training time. Code:https://github.com/htyao89/Textual-based_Class-aware_prompt_tuning/

Hantao Yao, Rui Zhang, Changsheng Xu• 2023

Related benchmarks

TaskDatasetResultRank
Image ClassificationEuroSAT
Accuracy51.45
497
Image ClassificationFlowers102
Accuracy95.46
478
Image ClassificationImageNet--
429
Image ClassificationDTD
Accuracy69.8
419
Image ClassificationUCF101
Top-1 Acc83.64
404
Action RecognitionUCF101
Accuracy51.45
365
Image ClassificationFood101
Accuracy86.69
309
Image ClassificationAircraft
Accuracy41.09
302
Image ClassificationCIFAR-100
Accuracy59.67
302
Image ClassificationStanfordCars
Accuracy91.25
266
Showing 10 of 114 rows
...

Other info

Follow for update