Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Progressive Prompts: Continual Learning for Language Models

About

We introduce Progressive Prompts - a simple and efficient approach for continual learning in language models. Our method allows forward transfer and resists catastrophic forgetting, without relying on data replay or a large number of task-specific parameters. Progressive Prompts learns a new soft prompt for each task and sequentially concatenates it with the previously learned prompts, while keeping the base model frozen. Experiments on standard continual learning benchmarks show that our approach outperforms state-of-the-art methods, with an improvement >20% in average test accuracy over the previous best-preforming method on T5 model. We also explore a more challenging continual learning setup with longer sequences of tasks and show that Progressive Prompts significantly outperforms prior methods.

Anastasia Razdaibiedina, Yuning Mao, Rui Hou, Madian Khabsa, Mike Lewis, Amjad Almahairi• 2023

Related benchmarks

TaskDatasetResultRank
Language UnderstandingMMLU
Accuracy30.58
844
ReasoningBBH--
726
Physical Commonsense ReasoningPIQA
Accuracy53.05
696
Continual LearningTRACE
BWT (%)5.79
124
Continual LearningStandard CL Benchmark
Avg Final Acc0.751
71
Text ClassificationAGNews, Amazon, DBPedia, Yahoo, and Yelp (test)
Exact Match (EM)79.9
55
Continual LearningLarge Number of Tasks
Average Performance78
50
Continual LearningStandard CL Benchmark
BWT (Avg Order 1-3)76.31
38
Continual LearningContinual Learning Benchmark 15-Task
Average Accuracy77.9
28
Continual LearningLong Sequence (Order 1)
AP78.98
20
Showing 10 of 36 rows

Other info

Follow for update