SAMix: Calibrated and Accurate Continual Learning via Sphere-Adaptive Mixup and Neural Collapse
About
While most continual learning methods focus on mitigating forgetting and improving accuracy, they often overlook the critical aspect of network calibration, despite its importance. Neural collapse, a phenomenon where last-layer features collapse to their class means, has demonstrated advantages in continual learning by reducing feature-classifier misalignment. Few works aim to improve the calibration of continual models for more reliable predictions. Our work goes a step further by proposing a novel method that not only enhances calibration but also improves performance by reducing overconfidence, mitigating forgetting, and increasing accuracy. We introduce Sphere-Adaptive Mixup (SAMix), an adaptive mixup strategy tailored for neural collapse-based methods. SAMix adapts the mixing process to the geometric properties of feature spaces under neural collapse, ensuring more robust regularization and alignment. Experiments show that SAMix significantly boosts performance, surpassing SOTA methods in continual learning while also improving model calibration. SAMix enhances both across-task accuracy and the broader reliability of predictions, making it a promising advancement for robust continual learning systems.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Class-incremental learning | CIFAR-10 Seq | Final Average Accuracy (FAA)78.62 | 53 | |
| Task-Incremental Learning | CIFAR-100 Seq | -- | 28 | |
| Task-Incremental Learning | Tiny ImageNet Seq | FF21.51 | 27 | |
| Class-incremental learning | TinyImageNet Seq | Average Accuracy22.95 | 25 | |
| Task-Incremental Learning | Seq-Tiny-ImageNet | Average Accuracy57.13 | 25 | |
| Task-Incremental Learning | CIFAR-10 Seq | Average Accuracy96.98 | 25 | |
| Class-incremental learning | CIFAR-100 Seq | Average Forgetting29.94 | 23 | |
| Class-incremental learning | Seq-CIFAR-100 | Average Accuracy55.85 | 23 | |
| Task-Incremental Learning | Seq-CIFAR-100 | Average Forgetting9.66 | 14 | |
| Continual Learning | Seq-CIFAR-100 | AECE0.078 | 12 |