Enhancing Visual Continual Learning with Language-Guided Supervision

About

Continual learning (CL) aims to enable models to learn new tasks without forgetting previously acquired knowledge. Most prior work concentrates on architectures, replay data, regularization, etc., while the category name of each class is largely neglected. Existing methods commonly use one-hot labels and randomly initialize the classifier head. We argue that the scarce semantic information conveyed by one-hot labels hampers effective knowledge transfer across tasks. In this paper, we revisit the role of the classifier head within the CL paradigm and replace the classifier with semantic knowledge from pretrained language models (PLMs). Specifically, we use PLMs to generate semantic targets for each class, which are frozen and serve as supervision signals during training. Such targets fully consider the semantic correlation between all classes across tasks. Empirical studies show that our approach mitigates forgetting by alleviating representation drift and facilitating knowledge transfer across tasks. The proposed method is simple to implement and can be plugged into existing methods with negligible adjustments. Extensive experiments based on eleven mainstream baselines demonstrate the effectiveness and generalizability of our approach across various protocols. For example, under the class-incremental learning setting on ImageNet-100, our method significantly improves Top-1 accuracy by 3.2% to 6.1% while reducing the forgetting rate by 2.6% to 13.1%.
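
To make the idea of frozen semantic targets concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. The three-dimensional vectors stand in for PLM embeddings of class names and are made up for illustration. The cosine-similarity logits, the `temperature` value, and the helper names (`cosine_logits`, `cross_entropy`, `predict`) are all assumptions chosen to keep the example small.

```python
import math

# Hypothetical stand-ins for PLM text embeddings of class names.
# Real targets would come from a pretrained language model; these
# numbers are invented for illustration and are never updated.
SEMANTIC_TARGETS = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.8, 0.3, 0.1],
    "truck": [0.0, 0.2, 0.9],
}


def normalize(vector):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def cosine_logits(feature, targets, temperature=0.1):
    """Cosine similarity between an image feature and each frozen target,
    divided by a temperature (an assumed design choice)."""
    f = normalize(feature)
    return {
        name: sum(a * b for a, b in zip(f, normalize(target))) / temperature
        for name, target in targets.items()
    }


def cross_entropy(logits, label):
    """Softmax cross-entropy over the class logits (numerically stable)."""
    peak = max(logits.values())
    log_z = peak + math.log(sum(math.exp(v - peak) for v in logits.values()))
    return log_z - logits[label]


def predict(feature, targets):
    """Return the class whose semantic target is closest to the feature."""
    logits = cosine_logits(feature, targets)
    return max(logits, key=logits.get)


# A made-up image feature that lies close to the "cat" target.
feature = [1.0, 0.0, 0.1]
loss_correct = cross_entropy(cosine_logits(feature, SEMANTIC_TARGETS), "cat")
loss_wrong = cross_entropy(cosine_logits(feature, SEMANTIC_TARGETS), "truck")
```

In this sketch only the image encoder producing `feature` would be trained. When a new task arrives, new class names simply add entries to the target dictionary, and the existing targets stay unchanged.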

Bolin Ni, Hongbo Zhao, Chenghao Zhang, Ke Hu, Gaofeng Meng, Zhaoxiang Zhang, Shiming Xiang • 2024

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Class-incremental learning | CIFAR-100 | Averaged Incremental Accuracy: 78 | 234 |
| Class-incremental learning | CIFAR-100 10 (test) | Average Top-1 Accuracy: 78 | 75 |
| Class-incremental learning | ImageNet-100 B=50, C=10 1.0 | Avg Incremental Acc: 80.6 | 42 |
| Task-Incremental Learning | CIFAR100 (test) | -- | 31 |
| Class-incremental learning | CIFAR100 (B=5, C=5) (test) | Avg Acc: 78 | 16 |
| Class-incremental learning | CIFAR100 (B=2, C=2) (test) | Average Accuracy: 78 | 16 |
| Class-incremental learning | ImageNet100 | Average Accuracy: 46.3 | 16 |
| Class-incremental learning | ImageNet-100 B=50, C=5 1.0 | Avg Incremental Acc: 76.7 | 14 |
| Class-incremental learning | ImageNet-100 B=10, C=10 1.0 | Avg Incremental Acc: 79.5 | 14 |
| Class-incremental learning | ImageNet-100 B=5, C=5 1.0 | Average Incremental Accuracy: 76.2 | 14 |
Showing 10 of 11 rows
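
The benchmark entries report average incremental accuracy, and the abstract also cites a forgetting rate. The sketch below shows one commonly used way to compute both. The page does not say which exact definitions each benchmark uses, so treat these formulas as assumptions. The function names and the accuracy numbers are invented for illustration.

```python
def average_incremental_accuracy(step_accuracies):
    """Mean of the accuracies measured after each incremental step,
    each evaluated on all classes seen so far."""
    return sum(step_accuracies) / len(step_accuracies)


def forgetting(acc_matrix):
    """Average drop from the best past accuracy to the final accuracy.

    acc_matrix[t][k] is the accuracy on task k after training step t,
    defined for k <= t. The last task is excluded because it has had
    no chance to be forgotten yet.
    """
    final = acc_matrix[-1]
    last_step = len(acc_matrix) - 1
    drops = []
    for k in range(last_step):
        best_before_final = max(acc_matrix[t][k] for t in range(k, last_step))
        drops.append(best_before_final - final[k])
    return sum(drops) / len(drops)


# Made-up numbers for a three-step run (not results from the paper).
step_accuracies = [90.0, 80.0, 70.0]
acc_matrix = [
    [90.0],
    [80.0, 85.0],
    [70.0, 75.0, 88.0],
]

avg_inc_acc = average_incremental_accuracy(step_accuracies)  # 80.0
avg_forgetting = forgetting(acc_matrix)                      # 15.0
```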
