Do Your Best and Get Enough Rest for Continual Learning
About
According to the forgetting curve theory, we can enhance memory retention by learning extensive data and taking adequate rest. This means that in order to effectively retain new knowledge, it is essential to learn it thoroughly and ensure sufficient rest so that our brain can memorize without forgetting. The main takeaway from this theory is that learning extensive data at once necessitates sufficient rest before learning the same data again. This aspect of human long-term memory retention can be effectively utilized to address the continual learning of neural networks. Retaining new knowledge for a long period of time without catastrophic forgetting is the critical problem of continual learning. Therefore, based on Ebbinghaus' theory, we introduce the view-batch model that adjusts the learning schedules to optimize the recall interval between retraining the same samples. The proposed view-batch model allows the network to get enough rest to learn extensive knowledge from the same samples with a recall interval of sufficient length. To this end, we specifically present two approaches: 1) a replay method that guarantees the optimal recall interval, and 2) a self-supervised learning that acquires extensive knowledge from a single training sample at a time. We empirically show that these approaches of our method are aligned with the forgetting curve theory, which can enhance long-term memory. In our experiments, we also demonstrate that our method significantly improves many state-of-the-art continual learning methods in various protocols and scenarios. We open-source this project at https://github.com/hankyul2/ViewBatchModel.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Domain-incremental learning | DomainNet (test) | Average Accuracy42.16 | 25 | |
| Continual Learning | S-CIFAR-10 (last) | TIL Last Top-1 Acc0.9707 | 24 | |
| Continual Learning | S-Tiny-ImageNet (last) | TIL Last Top-1 Acc68.91 | 24 | |
| Class-incremental learning | S-CIFAR-100 10 Step | Avg Top-1 Acc78.12 | 19 | |
| Class-incremental learning | CIFAR100-B0 5 steps (non-rehearsal) | Average Metric79.23 | 18 | |
| Class-incremental learning | S-CIFAR-100 20 Step | Avg Top-1 Acc76.95 | 9 | |
| Class-incremental learning | S-CIFAR-100 5 Step (test) | Avg Top-1 Acc66.4 | 9 | |
| Class-incremental learning | CIFAR-100 Sequential (test) | Final Top-1 Accuracy91.78 | 7 | |
| Class-incremental learning | ImageNet-R Sequential (test) | Last Top-1 Accuracy80.07 | 7 |