Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

EtCon: Edit-then-Consolidate for Reliable Knowledge Editing

About

Knowledge editing aims to update specific facts in large language models (LLMs) without full retraining. Prior efforts sought to tune the knowledge layers of LLMs, achieving improved performance in controlled, teacher-forced evaluations. However, they still encounter challenges in real-world autoregressive generation scenarios, which greatly limit their practical applicability. Our empirical analysis reveals two issues: (1) Most methods degrade pre-trained capabilities after injecting new knowledge; (2) They may exhibit a discrepancy between stored parametric knowledge and inference-time autoregressive generation behavior. To this end, we propose EtCon, an edit-then-consolidate paradigm that couples targeted edits with post-edit consolidation. Specifically, our framework comprises two stages: (1) Targeted Proximal Supervised Fine-Tuning (TPSFT) performs a constrained targeted edit to update parametric knowledge while controlling policy drift. (2) Group Relative Policy Optimization (GRPO) consolidates the edit by aligning autoregressive trajectories with the intended fact. Extensive experiments demonstrate that our EtCon improves editing reliability and real-world generalization, while better preserving pre-trained capabilities.

Ruilin Li, Yibin Wang, Wenhong Zhu, Chenglin Li, Jinghao Zhang, Chenliang Li, Junchi Yan, Jiaqi Wang• 2025

Related benchmarks

TaskDatasetResultRank
Question AnsweringSQuAD
F119.6
127
Logical reasoningLogiQA
Accuracy38.4
84
General Knowledge AssessmentC-Eval
Accuracy78.5
37
Discrete reasoningDROP
Exact Match (EM)2.5
19
Lifelong Knowledge EditingzsRE
Reliability73.5
14
Lifelong Knowledge EditingCounterFact
Reliability67.1
14
Lifelong Knowledge EditingQAEdit
Reliability75.1
14
Knowledge EditingQAEdit
Reliability75.1
8
Conversational Question AnsweringCoQA
EM55.1
8
Multi-hop Knowledge EditingMQuAKE CF v2
2-hop Score24.2
6
Showing 10 of 10 rows

Other info

GitHub

Follow for update