Make Continual Learning Stronger via C-Flat

About

A model's ability to generalize while incrementally acquiring dynamically updated knowledge from sequentially arriving tasks is crucial for tackling the sensitivity-stability dilemma in Continual Learning (CL). Minimizing the sharpness of the weight loss landscape, that is, seeking flat minima that lie in neighborhoods with uniformly low loss or smooth gradients, has been shown to be a strong training regime that improves model generalization compared with loss-minimization-based optimizers such as SGD. Yet only a few works have discussed this training regime for CL, showing that a dedicatedly designed zeroth-order sharpness optimizer can improve CL performance. In this work, we propose a Continual Flatness (C-Flat) method featuring a flatter loss landscape tailored for CL. C-Flat can be invoked with only one line of code and is plug-and-play with any CL method. This paper presents a general framework for applying C-Flat to all CL categories and a thorough comparison with loss-minima optimizers and flat-minima-based CL approaches, showing that our method can boost CL performance in almost all cases. Code is available at https://github.com/WanNaa/C-Flat.

Ang Bian, Wei Li, Hangjie Yuan, Chengrong Yu, Mang Wang, Zixiang Zhao, Aojun Lu, Pengliang Ji, Tao Feng • 2024
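
The flat-minima idea described in the abstract can be illustrated with a generic sharpness-aware update: perturb the weights toward a nearby higher-loss point, then descend using the gradient measured there. The sketch below is only an illustration under stated assumptions. It is not C-Flat's implementation or its one-line API; the toy loss and the names `loss`, `grad`, `sam_step`, `lr`, and `rho` are hypothetical and chosen for this example.

```python
import math

def loss(w):
    # Hypothetical toy loss: a quadratic bowl with very different curvature per axis.
    return 0.5 * (10.0 * w[0] ** 2 + 0.1 * w[1] ** 2)

def grad(w):
    # Analytic gradient of the toy loss above.
    return [10.0 * w[0], 0.1 * w[1]]

def sam_step(w, lr=0.05, rho=0.1):
    """One generic sharpness-aware step (illustrative, not C-Flat itself)."""
    g = grad(w)
    norm = math.sqrt(sum(gi * gi for gi in g)) + 1e-12
    # Move a distance rho along the normalized gradient to a nearby higher-loss point.
    w_adv = [wi + rho * gi / norm for wi, gi in zip(w, g)]
    g_adv = grad(w_adv)
    # Update the original weights with the gradient taken at the perturbed point.
    return [wi - lr * gi for wi, gi in zip(w, g_adv)]

w = [1.0, 1.0]
initial_loss = loss(w)
for _ in range(50):
    w = sam_step(w)
final_loss = loss(w)
```

In a real training loop the same two-pass pattern would wrap an existing optimizer, which is roughly what "plug-and-play" suggests, though the actual C-Flat interface should be taken from the linked repository rather than from this sketch.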

Related benchmarks

Task	Dataset	Result
Class-incremental learning	ImageNet-R B0 Inc20	Last Accuracy 77.25	98
Class-incremental learning	CIFAR-100 B0_Inc5	Average Accuracy 71.11	63
Class-incremental learning	CIFAR-100 B0_Inc10	Avg Accuracy 94.41	60
Class-incremental learning	ImageNet-100 B=50, C=10 1.0	Avg Incremental Acc 86.64	42
Class-incremental learning	CUB (B0 Inc10)	Last Accuracy 88.76	39
Semantic segmentation	Med JASCL-Disjoint Session 1: AMOS	Dice Score 17.4	28
Semantic segmentation	Med JASCL-Disjoint Session 0: TS	Dice Score 70	28
Continual Segmentation	Med JASCL Disjoint	Total Drop (%) 95.7	28
Semantic segmentation	Med JASCL-Disjoint Session 2: BCV	Dice Score 3	28
Incremental Learning	CIFAR100 T=50	Last Accuracy 84.03	19

Showing 10 of 24 rows
