FeCAM: Exploiting the Heterogeneity of Class Distributions in Exemplar-Free Continual Learning

About

Exemplar-free class-incremental learning (CIL) poses several challenges since it prohibits the rehearsal of data from previous tasks and thus suffers from catastrophic forgetting. Recent approaches to incrementally learning the classifier by freezing the feature extractor after the first task have gained much attention. In this paper, we explore prototypical networks for CIL, which generate new class prototypes using the frozen feature extractor and classify the features based on the Euclidean distance to the prototypes. In an analysis of the feature distributions of classes, we show that classification based on Euclidean metrics is successful for jointly trained features. However, when learning from non-stationary data, we observe that the Euclidean metric is suboptimal and that feature distributions are heterogeneous. To address this challenge, we revisit the anisotropic Mahalanobis distance for CIL. In addition, we empirically show that modeling the feature covariance relations is better than previous attempts at sampling features from normal distributions and training a linear classifier. Unlike existing methods, our approach generalizes to both many- and few-shot CIL settings, as well as to domain-incremental settings. Interestingly, without updating the backbone network, our method obtains state-of-the-art results on several standard continual learning benchmarks. Code is available at https://github.com/dipamgoswami/FeCAM.

Dipam Goswami, Yuyang Liu, Bart{\l}omiej Twardowski, Joost van de Weijer• 2023

Related benchmarks

Task	Dataset	Result
Class-incremental learning	CIFAR100 (test)	--	116
Class-incremental learning	CUB-200 (test)	Alast53.47	51
Class-incremental learning	CIFAR-100 10T	Avg Accuracy (A_T)86.09	40
Class-incremental learning	Stanford Cars (test)	Accuracy (Last)45.3	38
Exemplar-Free Class-Incremental Learning	ImageNet subset (test)	A_last54.8	30
Exemplar-Free Class-Incremental Learning	CIFAR-100 (test)	Accuracy Last (Alast)48.3	30
Semantic segmentation	Med JASCL-Disjoint Session 0: TS	Dice Score70	28
Continual Segmentation	Med JASCL Disjoint	Total Drop (%)94	28
Semantic segmentation	Med JASCL-Disjoint Session 2: BCV	Dice Score4.2	28
Semantic segmentation	Med JASCL-Disjoint Session 1: AMOS	Dice Score4.8	28

Showing 10 of 48 rows

Other info

Follow for update

@wizwand_team Discord