Class Incremental Learning with Task-Specific Batch Normalization and Out-of-Distribution Detection
About
This study focuses on incremental learning for image classification, exploring how to reduce catastrophic forgetting of all learned knowledge when access to old data is restricted. The challenge lies in balancing plasticity (learning new knowledge) and stability (retaining old knowledge). Based on whether the task identifier (task-ID) is available during testing, incremental learning is divided into task incremental learning (TIL) and class incremental learning (CIL). The TIL paradigm often uses multiple classifier heads, selecting the corresponding head based on the task-ID. Since the CIL paradigm cannot access task-ID, methods originally developed for TIL require explicit task-ID prediction to bridge this gap and enable their adaptation to the CIL paradigm. {In this study, a novel continual learning framework extends the TIL method for CIL by introducing out-of-distribution detection for task-ID prediction. Our framework utilizes task-specific Batch Normalization (BN) and task-specific classification heads to effectively adjust feature map distributions for each task, enhancing plasticity. With far fewer parameters than convolutional kernels, task-specific BN helps minimize parameter growth, preserving stability. Based on multiple task-specific classification heads, we introduce an ``unknow'' class for each head. During training, data from other tasks are mapped to this unknown class. During inference, the task-ID is predicted by selecting the classification head with the lowest probability assigned to the unknown class. Our method achieves state-of-the-art performance on two medical image datasets and two natural image datasets. The source code is available at https://github.com/z1968357787/mbn_ood_git_main.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Class-incremental learning | CIFAR-100 10T | Avg Accuracy (A_T)80.34 | 35 | |
| Class-incremental learning | CIFAR-100 T=20 (test) | Final Accuracy69.81 | 25 | |
| Incremental Learning | CIFAR100 T=50 | Last Accuracy68.15 | 19 | |
| Class-incremental learning | CUB200 (10T) | Last Accuracy42.33 | 15 | |
| Class-incremental learning | CUB200 (20T) | Last Accuracy37.21 | 15 | |
| Class-incremental learning | Path16 Order I 1.0 (train test) | Last Accuracy73.25 | 15 | |
| Class-incremental learning | Path16 Order II 1.0 (train test) | Last Accuracy72.25 | 15 | |
| Class-incremental learning | Skin8 Memory size 40 1.0 (train test) | Last Accuracy49.93 | 15 | |
| Class-incremental learning | Skin8 Memory size 16 1.0 (train test) | Last Accuracy44.81 | 15 | |
| Incremental Learning | CIFAR100 T=10 | Last Accuracy69.59 | 4 |