Formalizing the Generalization-Forgetting Trade-off in Continual Learning
About
We formulate the continual learning (CL) problem via dynamic programming and model the trade-off between catastrophic forgetting and generalization as a two-player sequential game. In this approach, player 1 maximizes the cost due to lack of generalization whereas player 2 minimizes the cost due to catastrophic forgetting. We show theoretically that a balance point between the two players exists for each task and that this point is stable (once the balance is achieved, the two players stay at the balance point). Next, we introduce balanced continual learning (BCL), which is designed to attain balance between generalization and forgetting and empirically demonstrate that BCL is comparable to or better than the state of the art.
Krishnan Raghavan, Prasanna Balaprakash• 2021
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Incremental Task Learning (ITL) | Permuted MNIST (test) | Retained Accuracy97.61 | 32 | |
| Incremental Task Learning (ITL) | split-MNIST (test) | Retained Accuracy99.52 | 32 | |
| Incremental Task Learning | split-CIFAR100 (test) | Retained Accuracy81.82 | 24 | |
| Incremental Domain Learning (IDL) | split-MNIST (test) | Retained Accuracy98.71 | 16 | |
| Incremental Domain Learning (IDL) | Permuted MNIST (test) | Retained Accuracy97.51 | 16 | |
| Incremental Domain Learning | CIFAR100 split (test) | Retained Accuracy62.11 | 12 |
Showing 6 of 6 rows