
Fine-tuning Done Right in Model Editing

About

Fine-tuning, a foundational method for adapting large language models, has long been considered ineffective for model editing. Here, we challenge this belief, arguing that the reported failure arises not from any inherent limitation of fine-tuning itself, but from how it was adapted to the sequential nature of the editing task: a single-pass depth-first pipeline that optimizes each sample to convergence before moving on. While intuitive, this depth-first pipeline coupled with sample-wise updating over-optimizes each edit and induces interference across edits. Our controlled experiments reveal that simply restoring fine-tuning to the standard breadth-first (i.e., epoch-based) pipeline with mini-batch optimization substantially improves its effectiveness for model editing. Moreover, fine-tuning in editing also suffers from suboptimal tuning parameter locations inherited from prior methods. Through systematic analysis of tuning locations, we derive LocFT-BF, a simple and effective localized editing method built on the restored fine-tuning framework. Extensive experiments across diverse LLMs and datasets demonstrate that LocFT-BF outperforms state-of-the-art methods by large margins. Notably, to our knowledge, it is the first to sustain 100K edits and 72B-parameter models, 10× beyond prior practice, without sacrificing general capabilities. By clarifying a long-standing misconception and introducing a principled localized tuning strategy, we advance fine-tuning from an underestimated baseline to a leading method for model editing, establishing a solid foundation for future research.
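The depth-first vs. breadth-first contrast described above can be illustrated with a minimal sketch. This toy example uses a 1-D least-squares model rather than an LLM, and all function names are illustrative, not from the paper; it only demonstrates the control flow difference: per-sample optimization to convergence versus epoch-based mini-batch updates over all edits jointly.

```python
# Toy illustration (not the paper's implementation): contrast the two
# editing pipelines on a 1-D least-squares model with scalar weight w.

def grad(w, x, y):
    # Gradient of the squared error (w*x - y)^2 with respect to w.
    return 2 * x * (w * x - y)

def depth_first_edit(w, edits, lr=0.1, steps=50):
    # Single-pass depth-first pipeline: optimize each edit to
    # convergence before moving on to the next one.
    for x, y in edits:
        for _ in range(steps):
            w -= lr * grad(w, x, y)
    return w

def breadth_first_edit(w, edits, lr=0.1, epochs=50, batch_size=2):
    # Standard breadth-first (epoch-based) pipeline: mini-batch
    # updates that revisit all edits every epoch.
    for _ in range(epochs):
        for i in range(0, len(edits), batch_size):
            batch = edits[i:i + batch_size]
            g = sum(grad(w, x, y) for x, y in batch) / len(batch)
            w -= lr * g
    return w

# Two conflicting edits for the same input: the depth-first pipeline
# over-optimizes the last edit it saw, while the breadth-first pipeline
# settles on a solution that balances both.
edits = [(1.0, 2.0), (1.0, 4.0)]
w_df = depth_first_edit(0.0, edits)   # converges to ~4.0 (last edit wins)
w_bf = breadth_first_edit(0.0, edits) # converges to ~3.0 (balanced)
```

In the LLM setting the interference is between different facts rather than conflicting targets for one input, but the mechanism is the same: sequential to-convergence updates let each edit overwrite the previous ones, whereas epoch-based mini-batching keeps all edits in play throughout training.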

Wanli Yang, Rui Tang, Hongyu Zang, Du Su, Qi Cao, Jingang Wang, Huawei Shen, Xueqi Cheng, Fei Sun • 2025

Related benchmarks

Task | Dataset | Result | Rank
Mathematical Reasoning | GSM8K | Math Score: 73 | 171
Commonsense Reasoning | ARC-C | Accuracy: 50 | 51
Model Editing | WikiBigEdit | MMLU: 69.2 | 34
Model Editing | CounterFact | Reliability: 61.1 | 26
Model Editing | zsRE | Reliability: 69.5 | 26
Model Editing | zsRE | Reliability: 0.535 | 16
Multi-task Language Understanding | MMLU | MMLU Score: 68 | 14
Model Editing | zsRE 3,000 samples (test) | Relational Score: 99.1 | 13
Model Editing | WikiBigEdit 3,000 samples (test) | Reliability: 99.9 | 13
Model Editing | CounterFact 3,000 samples (test) | Reliability: 9.97e+3 | 13
(10 of 15 rows shown)
