Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners

About

Few-Shot Class Incremental Learning (FSCIL) is a task that requires a model to learn new classes incrementally without forgetting when only a few samples for each class are given. FSCIL encounters two significant challenges: catastrophic forgetting and overfitting, and these challenges have driven prior studies to primarily rely on shallow models, such as ResNet-18. Even though their limited capacity can mitigate both forgetting and overfitting issues, it leads to inadequate knowledge transfer during few-shot incremental sessions. In this paper, we argue that large models such as vision and language transformers pre-trained on large datasets can be excellent few-shot incremental learners. To this end, we propose a novel FSCIL framework called PriViLege, Pre-trained Vision and Language transformers with prompting functions and knowledge distillation. Our framework effectively addresses the challenges of catastrophic forgetting and overfitting in large models through new pre-trained knowledge tuning (PKT) and two losses: entropy-based divergence loss and semantic knowledge distillation loss. Experimental results show that the proposed PriViLege significantly outperforms the existing state-of-the-art methods with a large margin, e.g., +9.38% in CUB200, +20.58% in CIFAR-100, and +13.36% in miniImageNet. Our implementation code is available at https://github.com/KHU-AGI/PriViLege.

Keon-Hee Park, Kyungwoo Song, Gyeong-Moon Park• 2024

Related benchmarks

Task	Dataset	Result
Few-Shot Class-Incremental Learning	CIFAR100	Accuracy (S0)90.88	89
Few-Shot Class-Incremental Learning	CUB-200	Session 1 Accuracy81.3	85
Class-incremental learning	FGVC Aircraft	Accuracy Last45.55	41
Few-Shot Class-Incremental Learning	CUB200 (incremental sessions)	Session 0 Accuracy82.21	37
Class-incremental learning	CIFAR100 rho=0.1 (test)	Alast Accuracy66.6	28
Class-incremental learning	CIFAR100 rho=0.01 (test)	Alast48.3	28
Few-Shot Class-Incremental Learning	CIFAR-100 60 base classes, 5-way 5-shot, Session 0	Accuracy90.88	20
Few-Shot Class-Incremental Learning	CIFAR-100 60 base classes 5-way 5-shot Session 1	Accuracy89.39	20
Few-Shot Class-Incremental Learning	CIFAR-100 60 base classes 5-way 5-shot Session 2	Accuracy88.97	20
Few-Shot Class-Incremental Learning	CIFAR-100 60 base classes 5-way 5-shot Session 3	Accuracy87.55	20

Showing 10 of 49 rows

Other info

Code

Follow for update

@wizwand_team Discord