Less-forgetting Learning in Deep Neural Networks
About
A catastrophic forgetting problem makes deep neural networks forget the previously learned information, when learning data collected in new environments, such as by different sensors or in different light conditions. This paper presents a new method for alleviating the catastrophic forgetting problem. Unlike previous research, our method does not use any information from the source domain. Surprisingly, our method is very effective to forget less of the information in the source domain, and we show the effectiveness of our method using several experiments. Furthermore, we observed that the forgetting problem occurs between mini-batches when performing general training processes using stochastic gradient descent methods, and this problem is one of the factors that degrades generalization performance of the network. We also try to solve this problem using the proposed method. Finally, we show our less-forgetting learning method is also helpful to improve the performance of deep neural networks in terms of recognition rates.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Incremental Learning | TinyImageNet | Avg Incremental Accuracy60.2 | 83 | |
| Task-Incremental Learning | CIFAR-100 | Accuracy74.5 | 24 | |
| 3D Brain Lesion Segmentation | S1 Sequence (BRATS-Decathlon, ATLAS, MSSEG, ISLES, WMH) | Average Score18.3 | 20 | |
| 3D Brain Lesion Segmentation | S2 Sequence MSSEG, BRATS-Decathlon, ISLES, WMH, ATLAS | AVG Score11.16 | 20 | |
| 3D Brain Lesion Segmentation | Mean of Sequences S1 and S2 | Average Score14.73 | 20 | |
| Continual Learning | 8-task sequence (CIFAR10, CIFAR100, FaceScrub, FashionMNIST, NotMNIST, MNIST, SVHN, TrafficSigns) after 2nd task | Avg Forgetting Ratio-0.73 | 10 | |
| Continual Learning | 8-task sequence (CIFAR10, CIFAR100, FaceScrub, FashionMNIST, NotMNIST, MNIST, SVHN, TrafficSigns) after 8th task | Average Forgetting Ratio-0.92 | 10 | |
| Task-incremental Image Classification | Tiny-ImageNet 200 10 (test) | Task 1 Score32.4 | 10 |