
UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Language Models

About

Lifelong learning enables large language models (LLMs) to adapt to evolving information by continually updating their internal knowledge. An ideal system should support efficient, wide-ranging updates while preserving existing capabilities and ensuring reliable deployment. Model editing stands out as a promising solution for this goal, offering a focused and efficient way to revise a model's internal knowledge. Although recent paradigms have made notable progress, they often struggle to meet the demands of practical lifelong adaptation at scale. To bridge this gap, we propose UltraEdit, a training-, subject-, and memory-free approach that is well-suited for ultra-scalable, real-world lifelong model editing. UltraEdit fundamentally differs from traditional paradigms by computing parameter shifts in one step using only a hidden state and its gradient, making the approach simple yet efficient. To improve scalability in lifelong settings, UltraEdit employs a lifelong normalization strategy that continuously updates feature statistics across turns, allowing it to adapt to distributional shifts and maintain consistency over time. UltraEdit achieves editing speeds more than $7\times$ faster than the previous state-of-the-art method, while requiring $4\times$ less VRAM. This makes it the only method currently capable of editing a 7B LLM on a 24GB consumer-grade GPU. Furthermore, we construct UltraEditBench, the largest dataset in the field to date with over 2M editing pairs, and demonstrate that our method supports up to 2M edits while maintaining high accuracy. Comprehensive experiments on five datasets and six models show that UltraEdit consistently achieves superior performance across diverse model editing scenarios, taking a further step towards safe and scalable lifelong learning. Our code is available at https://github.com/XiaojieGu/UltraEdit.
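The abstract describes two core ideas: a one-step parameter shift computed from a hidden state and its gradient, and a "lifelong normalization" that maintains running feature statistics across editing turns. The sketch below is an illustrative reconstruction of that recipe, not the authors' exact algorithm: it applies a rank-1 shift to a linear layer `W` using a single hidden state `h` and a loss gradient `g`, after normalizing `h` with running (Welford) statistics. All names, the update rule, and the learning rate are assumptions for illustration.

```python
import numpy as np

class LifelongNormalizer:
    """Running mean/variance of hidden states across editing turns (Welford).

    Illustrative only -- the paper's normalization details may differ.
    """
    def __init__(self, dim):
        self.n = 0
        self.mean = np.zeros(dim)
        self.m2 = np.zeros(dim)  # running sum of squared deviations

    def update(self, h):
        # Incorporate one new hidden state into the running statistics.
        self.n += 1
        delta = h - self.mean
        self.mean += delta / self.n
        self.m2 += delta * (h - self.mean)

    def normalize(self, h):
        # Standardize h with the statistics accumulated so far.
        var = self.m2 / self.n if self.n > 1 else np.ones_like(h)
        return (h - self.mean) / np.sqrt(var + 1e-6)

def one_step_edit(W, h, g, norm, lr=1.0):
    """Training-free rank-1 shift for a linear layer y = W @ h.

    h : hidden state (input activation) at the edited layer
    g : gradient of an editing loss w.r.t. the layer output y
    The closed-form outer-product update here is an assumption,
    standing in for the paper's actual one-step computation.
    """
    norm.update(h)                      # lifelong statistics update
    h_n = norm.normalize(h)             # normalized hidden state
    return W - lr * np.outer(g, h_n) / (h_n @ h_n + 1e-6)
```

Because each edit is a single closed-form shift (no optimizer state, no memory of past edits beyond the normalizer's statistics), the cost per edit is one outer product, which is consistent with the abstract's training- and memory-free framing.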

Xiaojie Gu, Ziying Huang, Jia-Chen Gu, Kai Zhang · 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Mathematical Reasoning | GSM8K | Math Score | 73 | 197 |
| Knowledge Editing | zsRE | Generality | 77.08 | 181 |
| Commonsense Reasoning | ARC-C | Accuracy | 46 | 172 |
| Model Editing | zsRE | Efficacy | 90.07 | 71 |
| Model Editing | UltraEditBench | Efficacy | 85.7 | 51 |
| Model Editing | FEVER | Efficacy | 98.23 | 49 |
| Model Editing | WikiBigEdit | Efficacy | 79.6 | 49 |
| Model Editing | WikiBigEdit | MMLU | 69.3 | 34 |
| Model Editing | zsRE | Reliability | 22.7 | 26 |
| Model Editing | CounterFact | Reliability | 18.1 | 26 |

Showing 10 of 22 rows.
