
FOREVER: Forgetting Curve-Inspired Memory Replay for Language Model Continual Learning

About

Continual learning (CL) for large language models (LLMs) aims to enable sequential knowledge acquisition without catastrophic forgetting. Memory replay methods are widely used for their practicality and effectiveness, but most rely on fixed, step-based heuristics that often misalign with the model's actual learning progress, since identical training steps can result in varying degrees of parameter change. Motivated by recent findings that LLM forgetting mirrors the Ebbinghaus human forgetting curve, we propose FOREVER (FORgEtting curVe-inspired mEmory Replay), a novel CL framework that aligns replay schedules with a model-centric notion of time. FOREVER defines model time using the magnitude of optimizer updates, allowing forgetting curve-inspired replay intervals to align with the model's internal evolution rather than raw training steps. Building on this approach, FOREVER incorporates a forgetting curve-based replay scheduler to determine when to replay and an intensity-aware regularization mechanism to adaptively control how to replay. Extensive experiments on three CL benchmarks and models ranging from 0.6B to 13B parameters demonstrate that FOREVER consistently mitigates catastrophic forgetting.
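The model-time idea can be illustrated with a small sketch. This is not the authors' implementation: it assumes that model time is the running sum of the L2 norms of parameter changes and that replay intervals grow geometrically, in the spirit of spaced repetition. The names `update_magnitude`, `ModelTimeReplayScheduler`, `first_interval`, `growth`, and `replay_steps` are hypothetical, and the intensity-aware regularization is left out because the abstract does not specify it.

```python
import math
from typing import List, Sequence


def update_magnitude(before: Sequence[float], after: Sequence[float]) -> float:
    """L2 norm of the parameter change produced by one optimizer step."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(after, before)))


class ModelTimeReplayScheduler:
    """Triggers replay at expanding intervals measured in model time.

    Hypothetical sketch: `first_interval` and `growth` are illustrative
    hyperparameters, not values taken from the paper.
    """

    def __init__(self, first_interval: float = 1.0, growth: float = 2.0):
        self.model_time = 0.0          # cumulative update magnitude so far
        self.interval = first_interval
        self.growth = growth
        self.next_replay = first_interval

    def step(self, magnitude: float) -> bool:
        """Advance model time by one update; return True if replay is due."""
        self.model_time += magnitude
        if self.model_time < self.next_replay:
            return False
        # Space the next replay further out (assumed geometric growth).
        self.interval *= self.growth
        self.next_replay = self.model_time + self.interval
        return True


def replay_steps(magnitudes: Sequence[float]) -> List[int]:
    """Return the training-step indices at which replay would fire."""
    scheduler = ModelTimeReplayScheduler(first_interval=1.0, growth=2.0)
    return [i for i, m in enumerate(magnitudes) if scheduler.step(m)]


# Same number of training steps, different update magnitudes.
print(replay_steps([0.5] * 20))   # larger updates: replay fires sooner
print(replay_steps([0.25] * 20))  # smaller updates: replay fires later
```

Under these assumptions, two runs with the same number of training steps replay at different step indices, because the schedule follows how far the parameters have moved rather than how many steps have elapsed.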

Yujie Feng, Hao Wang, Jian Li, Xu Chu, Zhaolu Kang, Yiran Liu, Yasha Wang, Philip S. Yu, Xiao-Ming Wu • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
| Mathematical Reasoning | MATH 500 | Top-1 Accuracy: 69.6 | 384 |
| Scientific Reasoning | GPQA D | Accuracy (%): 54 | 77 |
| General Capability | Aggregate (GPQA-D, GSM8K, HumanEval, MATH-500, MBPP, MMLU-Pro) | Average Accuracy: 75.9 | 66 |
| Question Answering | GPQA Diamond | Accuracy: 54 | 61 |
| Language Understanding | MMLU-Pro | MMLU-Pro Accuracy: 72.8 | 60 |
| Language Understanding | MMLU-Pro | Overall Accuracy (MMLU-Pro): 66.5 | 24 |
| Language Understanding | MMLU-Pro | Overall Score: 0.665 | 20 |
| Language Understanding and Reasoning | MMLU-Pro | Overall Score: 66.5 | 20 |
| Question Answering | MMLU-Pro (full) | Overall Accuracy (MMLU-Pro QA): 54.7 | 16 |
| Language Understanding | MMLU Pro (test) | -- | 11 |
