Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

S-Prompts Learning with Pre-trained Transformers: An Occam's Razor for Domain Incremental Learning

About

State-of-the-art deep neural networks are still struggling to address the catastrophic forgetting problem in continual learning. In this paper, we propose one simple paradigm (named as S-Prompting) and two concrete approaches to highly reduce the forgetting degree in one of the most typical continual learning scenarios, i.e., domain increment learning (DIL). The key idea of the paradigm is to learn prompts independently across domains with pre-trained transformers, avoiding the use of exemplars that commonly appear in conventional methods. This results in a win-win game where the prompting can achieve the best for each domain. The independent prompting across domains only requests one single cross-entropy loss for training and one simple K-NN operation as a domain identifier for inference. The learning paradigm derives an image prompt learning approach and a novel language-image prompt learning approach. Owning an excellent scalability (0.03% parameter increase per domain), the best of our approaches achieves a remarkable relative improvement (an average of about 30%) over the best of the state-of-the-art exemplar-free methods for three standard DIL tasks, and even surpasses the best of them relatively by about 6% in average when they use exemplars. Source code is available at \url{https://github.com/iamwangyabin/S-Prompts}.

Yabin Wang, Zhiwu Huang, Xiaopeng Hong• 2022

Related benchmarks

TaskDatasetResultRank
Image ClassificationDomainNet (test)--
209
Audio ClassificationESC-50 (test)
Accuracy55
84
Class-incremental learningCIFAR-100 Split (test)
Avg Acc87.57
75
Class-incremental learningSplit ImageNet-R
Average Forgetting Measure4.73
57
Class-incremental learningSplit CIFAR-100 (10-task)
CAA92.25
41
Audio ClassificationUS8K (test)
R@1 Accuracy0.4257
41
Audio ClassificationSpeech Commands V2 (test)
Accuracy27.23
35
Domain-incremental learningCORe50 (test)
Test Accuracy93.19
34
Domain-incremental learningCDDB Hard (test)
Average Accuracy91.91
25
Domain-incremental learningDomainNet (test)
Average Accuracy67.78
25
Showing 10 of 39 rows

Other info

Code

Follow for update