Resolving Task Confusion in Dynamic Expansion Architectures for Class Incremental Learning
About
The dynamic expansion architecture is becoming popular in class incremental learning, mainly due to its advantages in alleviating catastrophic forgetting. However, task confusion is not well assessed within this framework, e.g., the discrepancy between classes of different tasks is not well learned (i.e., inter-task confusion, ITC), and certain priority is still given to the latest class batch (i.e., old-new confusion, ONC). We empirically validate the side effects of the two types of confusion. Meanwhile, a novel solution called Task Correlated Incremental Learning (TCIL) is proposed to encourage discriminative and fair feature utilization across tasks. TCIL performs a multi-level knowledge distillation to propagate knowledge learned from old tasks to the new one. It establishes information flow paths at both feature and logit levels, enabling the learning to be aware of old classes. Besides, attention mechanism and classifier re-scoring are applied to generate more fair classification scores. We conduct extensive experiments on CIFAR100 and ImageNet100 datasets. The results demonstrate that TCIL consistently achieves state-of-the-art accuracy. It mitigates both ITC and ONC, while showing advantages in battle with catastrophic forgetting even no rehearsal memory is reserved.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Class-incremental learning | CIFAR100-B0 5 steps (test) | Last Step Top-1 Acc69.58 | 31 | |
| Class-incremental learning | CIFAR100 B0 (20 steps) (test) | Last Step Top-1 Acc64.08 | 31 | |
| Class-incremental learning | S-CIFAR-100 10 Step | Avg Top-1 Acc76.33 | 19 | |
| Class-incremental learning | CIFAR100-B0 5 steps (non-rehearsal) | Average Metric77.33 | 18 | |
| Class-incremental learning | CIFAR100 10 steps B0 (test) | -- | 13 | |
| Class-incremental learning | ImageNet100 B0 (test) | Top-5 Avg Acc94.17 | 10 | |
| Class-incremental learning | S-CIFAR-100 5 Step (test) | Avg Top-1 Acc64.4 | 9 | |
| Class-incremental learning | S-CIFAR-100 20 Step | Avg Top-1 Acc74.32 | 9 | |
| Class-incremental learning | CIFAR100 B50 2 steps (test) | Avg Top-1 Acc76.42 | 8 | |
| Class-incremental learning | CIFAR100-B50 (5 steps) (test) | Avg Top-1 Accuracy0.7488 | 8 |