DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning
About
Continual learning aims to enable a single model to learn a sequence of tasks without catastrophic forgetting. Top-performing methods usually require a rehearsal buffer to store past pristine examples for experience replay, which, however, limits their practical value due to privacy and memory constraints. In this work, we present a simple yet effective framework, DualPrompt, which learns a tiny set of parameters, called prompts, to properly instruct a pre-trained model to learn tasks arriving sequentially without buffering past examples. DualPrompt presents a novel approach to attach complementary prompts to the pre-trained backbone, and then formulates the objective as learning task-invariant and task-specific "instructions". With extensive experimental validation, DualPrompt consistently sets state-of-the-art performance under the challenging class-incremental setting. In particular, DualPrompt outperforms recent advanced continual learning methods with relatively large buffer sizes. We also introduce a more challenging benchmark, Split ImageNet-R, to help generalize rehearsal-free continual learning research. Source code is available at https://github.com/google-research/l2p.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Visual Question Answering | VizWiz | Accuracy40.91 | 1043 | |
| Visual Question Answering | GQA | Accuracy30.75 | 963 | |
| Text-based Visual Question Answering | TextVQA | Accuracy31.48 | 496 | |
| Image Classification | CIFAR-100 | Accuracy84.77 | 302 | |
| Class-incremental learning | CIFAR-100 | Averaged Incremental Accuracy90.39 | 234 | |
| Science Question Answering | ScienceQA | Accuracy60.01 | 229 | |
| Image Classification | DomainNet (test) | -- | 209 | |
| Few-Shot Class-Incremental Learning | miniImageNet (test) | Accuracy (Session 1)86.85 | 173 | |
| Few-Shot Class-Incremental Learning | CIFAR100 (test) | Session 4 Top-1 Acc68.45 | 122 | |
| Class-incremental learning | ImageNet-R | Average Accuracy82.73 | 103 |