Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Memory-Based Model Editing at Scale

About

Even the largest neural networks make errors, and once-correct predictions can become invalid as the world changes. Model editors make local updates to the behavior of base (pre-trained) models to inject updated knowledge or correct undesirable behaviors. Existing model editors have shown promise, but also suffer from insufficient expressiveness: they struggle to accurately model an edit's intended scope (examples affected by the edit), leading to inaccurate predictions for test inputs loosely related to the edit, and they often fail altogether after many edits. As a higher-capacity alternative, we propose Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model (SERAC), which stores edits in an explicit memory and learns to reason over them to modulate the base model's predictions as needed. To enable more rigorous evaluation of model editors, we introduce three challenging language model editing problems based on question answering, fact-checking, and dialogue generation. We find that only SERAC achieves high performance on all three problems, consistently outperforming existing approaches to model editing by a significant margin. Code, data, and additional project information will be made available at https://sites.google.com/view/serac-editing.

Eric Mitchell, Charles Lin, Antoine Bosselut, Christopher D. Manning, Chelsea Finn• 2022

Related benchmarks

TaskDatasetResultRank
Lifelong Knowledge EditingE-VQA Lifelong Sequential
Rel. Score89.33
72
Knowledge EditingMMEdit E-VQA
Reliability99.4
61
Knowledge EditingVLKEB
Reliability89.77
45
Knowledge EditingMMEdit E-IC 1.0 (test)
Reliability99.71
24
Knowledge EditingE-VQA MMEdit 1.0 (test)
Reliability97.6
24
Knowledge EditingMzsRE Edit: EN, Test: EN
Reliability9.13e+3
23
Multimodal Knowledge EditingMMQAKE Original Image
M-Acc6.58
18
Multimodal Knowledge EditingMMQAKE Rephrased Image
M-Acc1.04
18
Knowledge EditingMMEdit E-IC
Reliability99.7
16
Sentiment editingConvSent (OOD)
Edit Success Score62.75
16
Showing 10 of 38 rows

Other info

Follow for update