Rehearsal-Free Modular and Compositional Continual Learning for Language Models

About

Continual learning aims at incrementally acquiring new knowledge while not forgetting existing knowledge. To overcome catastrophic forgetting, methods are either rehearsal-based, i.e., store data examples from previous tasks for data replay, or isolate parameters dedicated to each task. However, rehearsal-based methods raise privacy and memory issues, and parameter-isolation continual learning does not consider interaction between tasks, thus hindering knowledge transfer. In this work, we propose MoCL, a rehearsal-free Modular and Compositional Continual Learning framework which continually adds new modules to language models and composes them with existing modules. Experiments on various benchmarks show that MoCL outperforms state of the art and effectively facilitates knowledge transfer.

Mingyang Wang, Heike Adel, Lukas Lange, Jannik Str\"otgen, Hinrich Sch\"utze• 2024

Related benchmarks

Task	Dataset	Result
Continual Learning	Standard CL Benchmark	Avg Final Acc0.782	71
Text Classification	AGNews, Amazon, DBPedia, Yahoo, and Yelp (test)	Exact Match (EM)79.7	55
Continual Learning	Long CL benchmark N=15	Long1 Performance75.2	18
Continual Learning	Standard Continual Learning Benchmark N=4	Forward Retention (FR)4.14	6
Continual Learning	Long Continual Learning Benchmark N=15	Forward Retention10.25	6

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord