Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MSSR: Memory-Aware Adaptive Replay for Continual LLM Fine-Tuning

About

Continual fine-tuning of large language models (LLMs) is becoming increasingly crucial as these models are deployed in dynamic environments where tasks and data distributions evolve over time. While strong adaptability enables rapid acquisition of new knowledge, it also exposes LLMs to catastrophic forgetting, where previously learned skills degrade during sequential training. Existing replay-based strategies, such as fixed interleaved replay, accuracy-supervised, and loss-driven scheduling, remain limited: some depend on heuristic rules and provide only partial mitigation of forgetting, while others improve performance but incur substantial computational overhead. Motivated by retention dynamics under sequential fine-tuning, we propose Memory-Inspired Sampler and Scheduler Replay (MSSR), an experience replay framework that estimates sample-level memory strength and schedules rehearsal at adaptive intervals to mitigate catastrophic forgetting while maintaining fast adaptation. Extensive experiments across three backbone models and 11 sequential tasks show that MSSR consistently outperforms state-of-the-art replay baselines, with particularly strong gains on reasoning-intensive and multiple-choice benchmarks.

Yiyang Lu, Yu He, Jianlong Chen, Hongyuan Zha• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical ReasoningGSM8K
Accuracy74.8
1362
Mathematical ReasoningMATH
Accuracy31.7
882
Question AnsweringSciQ
Accuracy97.2
283
Reading ComprehensionBoolQ
Accuracy91.1
279
Question AnsweringARC
Accuracy69.2
230
Text ClassificationAGNews
Accuracy78.7
61
Question AnsweringSQuAD
Score79.95
29
Mathematical ReasoningMATH
MATH1 Score86.7
21
Showing 8 of 8 rows

Other info

Follow for update