Rethinking Residual Distribution in Locate-then-Edit Model Editing

About

Model editing enables targeted updates to the knowledge of large language models (LLMs) with minimal retraining. Among existing approaches, locate-then-edit methods constitute a prominent paradigm: they first identify critical layers, then compute residuals at the final critical layer based on the target edit, and finally apply least-squares-based multi-layer updates via residual distribution. While empirically effective, we identify a counterintuitive failure mode: residual distribution, a core mechanism in these methods, introduces weight shift errors that undermine editing precision. Through theoretical and empirical analysis, we show that such errors increase with the distribution distance, batch size, and edit sequence length, ultimately leading to inaccurate or suboptimal edits. To address this, we propose the Boundary Layer UpdatE (BLUE) strategy to enhance locate-then-edit methods. Sequential batch editing experiments on three LLMs and two datasets demonstrate that BLUE not only delivers an average performance improvement of 35.59%, significantly advancing the state of the art in model editing, but also enhances the preservation of LLMs' general capabilities. Our code is available at https://github.com/xpq-tech/BLUE.
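The abstract does not spell out how residual distribution or the least-squares update is computed, so the following is only a minimal sketch under stated assumptions: a MEMIT-style schedule in which critical layer l receives the residual divided by (last - l + 1), and a single-key, unregularized least-squares update. The names `distribute_residual`, `least_squares_update`, and `matvec` are hypothetical and are not taken from the BLUE repository. Practical editors also recompute the residual after each layer is updated and add a covariance term, both of which this sketch omits.

```python
from typing import Dict, List

Vector = List[float]
Matrix = List[List[float]]


def distribute_residual(residual: Vector, critical_layers: List[int]) -> Dict[int, Vector]:
    """Split a residual computed at the last critical layer across all critical layers.

    Assumed schedule (MEMIT-style, not confirmed by the abstract):
    layer l receives residual / (last - l + 1), so the divisor shrinks
    to 1 at the last critical layer.
    """
    last = max(critical_layers)
    return {
        layer: [r / (last - layer + 1) for r in residual]
        for layer in sorted(critical_layers)
    }


def least_squares_update(key: Vector, target: Vector) -> Matrix:
    """Minimum-norm weight update `delta` such that delta @ key == target.

    Single-key case only; the regularizing covariance term used by
    practical editors is deliberately left out of this illustration.
    """
    norm_sq = sum(k * k for k in key)
    return [[t * k / norm_sq for k in key] for t in target]


def matvec(matrix: Matrix, vector: Vector) -> Vector:
    """Plain matrix-vector product, used to check the update."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


# Toy values, chosen only for illustration.
shares = distribute_residual([1.2, -0.6], critical_layers=[4, 5, 6])
delta = least_squares_update(key=[1.0, 2.0, 2.0], target=shares[4])
```

In this toy setup, `matvec(delta, [1.0, 2.0, 2.0])` reproduces the share assigned to layer 4. The farther a layer sits from the last critical layer, the smaller its assigned share, which is the "distribution distance" the abstract associates with growing weight shift errors.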

Xiaopeng Li, Shanwen Wang, Shasha Li, Shezheng Song, Bin Ji, Jun Ma, Jie Yu • 2025

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Knowledge Editing | CounterFact | Efficacy: 99.62 | 362 |
| Knowledge Editing | zsRE | Generality: 96.28 | 268 |
| Knowledge Editing | RippleEdits POPULAR (full requested-edit set) | Rel.: 98.4 | 30 |
| Knowledge Editing | Counterfact (first 2000 edits) | Accuracy: 96.9 | 17 |
| Knowledge Editing | ZsRE massive editing (2,000 items) | Efficacy Success: 94.6 | 9 |
| LLM Parameter Editing | MCF | Efficacy Success: 94.3 | 8 |
| LLM Parameter Editing | zsRE | Efficacy Success: 82.5 | 8 |
| LLM Parameter Editing | MCF (test) | Efficacy Success: 99.8 | 8 |
| LLM Parameter Editing | ZSRE (test) | Efficacy Success Rate: 99.3 | 8 |