Overcoming Catastrophic Forgetting by Incremental Moment Matching
About
Catastrophic forgetting is a problem of neural networks that loses the information of the first task after training the second task. Here, we propose a method, i.e. incremental moment matching (IMM), to resolve this problem. IMM incrementally matches the moment of the posterior distribution of the neural network which is trained on the first and the second task, respectively. To make the search space of posterior parameter smooth, the IMM procedure is complemented by various transfer learning techniques including weight transfer, L2-norm of the old and the new parameter, and a variant of dropout with the old parameter. We analyze our approach on a variety of datasets including the MNIST, CIFAR-10, Caltech-UCSD-Birds, and Lifelog datasets. The experimental results show that IMM achieves state-of-the-art performance by balancing the information between an old and a new network.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Class-incremental learning | CIFAR10 (test) | Average Accuracy32.36 | 59 | |
| Class-incremental learning | MNIST (test) | Average Accuracy67.25 | 35 | |
| Class-incremental learning | ImageNet-1000 1.0 (test) | Top-5 Acc (Avg)54.8 | 14 | |
| Continual Learning | 8-task sequence (CIFAR10, CIFAR100, FaceScrub, FashionMNIST, NotMNIST, MNIST, SVHN, TrafficSigns) after 8th task | Average Forgetting Ratio-0.49 | 10 | |
| Task-incremental Image Classification | Tiny-ImageNet 200 10 (test) | Task 1 Score50.6 | 10 | |
| Continual Learning | 8-task sequence (CIFAR10, CIFAR100, FaceScrub, FashionMNIST, NotMNIST, MNIST, SVHN, TrafficSigns) after 2nd task | Avg Forgetting Ratio-0.12 | 10 | |
| Task-Incremental Learning | Split MNIST | Avg Acc (A<=2)94.1 | 6 |