Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

About

Continual learning can empower vision-language models to continuously acquire new knowledge, without the need for access to the entire historical dataset. However, mitigating the performance degradation in large-scale models is non-trivial due to (i) parameter shifts throughout lifelong learning and (ii) significant computational burdens associated with full-model tuning. In this work, we present a parameter-efficient continual learning framework to alleviate long-term forgetting in incremental learning with vision-language models. Our approach involves the dynamic expansion of a pre-trained CLIP model, through the integration of Mixture-of-Experts (MoE) adapters in response to new tasks. To preserve the zero-shot recognition capability of vision-language models, we further introduce a Distribution Discriminative Auto-Selector (DDAS) that automatically routes in-distribution and out-of-distribution inputs to the MoE Adapter and the original CLIP, respectively. Through extensive experiments across various settings, our proposed method consistently outperforms previous state-of-the-art approaches while concurrently reducing parameter training burdens by 60%. Our code locates at https://github.com/JiazuoYu/MoE-Adapters4CL

Jiazuo Yu, Yunzhi Zhuge, Lu Zhang, Ping Hu, Dong Wang, Huchuan Lu, You He• 2024

Related benchmarks

TaskDatasetResultRank
Class-incremental learningCIFAR-100
Average Accuracy85.27
60
Class-incremental learningImageNet-R 10-task--
44
Class-incremental learningImageNet-R 20-task
Average Accuracy83.78
33
Class-incremental learningCIFAR100 10 Tasks
Accuracy84.75
29
Class-incremental learningImageNet-R 5-task
Avg Accuracy (A_bar)83.61
27
Class-incremental learningCIFAR-100 20 tasks
Avg Acc84.34
26
Continual LearningMedXtreme (Order II)
Accuracy64.7
13
Continual LearningMedXtreme (Order I)
ACC65.3
13
Continual LearningHieraMedTransfer Order I
Transfer Performance56.8
13
Continual LearningHieraMedTransfer Order II
Transfer Score47.6
13
Showing 10 of 20 rows

Other info

Follow for update