Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

About

Continual learning can empower vision-language models to continuously acquire new knowledge, without the need for access to the entire historical dataset. However, mitigating the performance degradation in large-scale models is non-trivial due to (i) parameter shifts throughout lifelong learning and (ii) significant computational burdens associated with full-model tuning. In this work, we present a parameter-efficient continual learning framework to alleviate long-term forgetting in incremental learning with vision-language models. Our approach involves the dynamic expansion of a pre-trained CLIP model, through the integration of Mixture-of-Experts (MoE) adapters in response to new tasks. To preserve the zero-shot recognition capability of vision-language models, we further introduce a Distribution Discriminative Auto-Selector (DDAS) that automatically routes in-distribution and out-of-distribution inputs to the MoE Adapter and the original CLIP, respectively. Through extensive experiments across various settings, our proposed method consistently outperforms previous state-of-the-art approaches while concurrently reducing parameter training burdens by 60%. Our code locates at https://github.com/JiazuoYu/MoE-Adapters4CL

Jiazuo Yu, Yunzhi Zhuge, Lu Zhang, Ping Hu, Dong Wang, Huchuan Lu, You He• 2024

Related benchmarks

TaskDatasetResultRank
Image ClassificationFood101
Accuracy82.9
457
Image ClassificationCIFAR100
Accuracy68.24
301
Class-incremental learningCIFAR-100
Averaged Incremental Accuracy85.21
281
Object DetectionMS-COCO
AP5052.3
208
Class-incremental learningCIFAR-100
Average Accuracy85.27
150
Multi-Task Incremental LearningMTIL Order II
Average Acc84.1
76
Image ClassificationImageNet A--
73
Image ClassificationPlaces365--
67
Class-incremental learningCIFAR100 10 Tasks
Accuracy84.75
66
Class-incremental learningImageNet-R 5-task
Avg Accuracy (A_bar)83.61
64
Showing 10 of 111 rows
...

Other info

Follow for update