WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models

About

Large language models (LLMs) need knowledge updates to meet the ever-growing world facts and correct the hallucinated responses, facilitating the methods of lifelong model editing. Where the updated knowledge resides in memories is a fundamental question for model editing. In this paper, we find that editing either long-term memory (direct model parameters) or working memory (non-parametric knowledge of neural network activations/representations by retrieval) will result in an impossible triangle -- reliability, generalization, and locality can not be realized together in the lifelong editing settings. For long-term memory, directly editing the parameters will cause conflicts with irrelevant pretrained knowledge or previous edits (poor reliability and locality). For working memory, retrieval-based activations can hardly make the model understand the edits and generalize (poor generalization). Therefore, we propose WISE to bridge the gap between memories. In WISE, we design a dual parametric memory scheme, which consists of the main memory for the pretrained knowledge and a side memory for the edited knowledge. We only edit the knowledge in the side memory and train a router to decide which memory to go through when given a query. For continual editing, we devise a knowledge-sharding mechanism where different sets of edits reside in distinct subspaces of parameters, and are subsequently merged into a shared memory without conflicts. Extensive experiments show that WISE can outperform previous model editing methods and overcome the impossible triangle under lifelong model editing of question answering, hallucination, and out-of-distribution settings across trending LLM architectures, e.g., GPT, LLaMA, and Mistral. Code is available at https://github.com/zjunlp/EasyEdit.

Peng Wang, Zexi Li, Ningyu Zhang, Ziwen Xu, Yunzhi Yao, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen• 2024

Related benchmarks

Task	Dataset	Result
Knowledge Editing	zsRE	Generality8.4	268
Model Editing	UltraEditBench	Efficacy52.51	78
Model Editing	zsRE	Reliability0.94	72
Model Editing	zsRE	Efficacy40.94	71
Model Editing	FEVER	Efficacy93.95	49
Model Editing	WikiBigEdit	Efficacy49.88	49
Model Editing	WikiBigEdit	--	34
Knowledge Editing	ZsRE 10,000 facts	Reliability36.88	27
Knowledge Editing	Counterfact 10,000 facts	Relational Score1.84e+3	27
Knowledge Model Editing	CounterFact	Efficacy14.73	26

Showing 10 of 54 rows

Other info

Follow for update

@wizwand_team Discord