Knowledge Editing through Chain-of-Thought

About

Knowledge Editing is a technique that updates large language models (LLMs) with new information to maintain their world knowledge. This approach avoids the need to rebuild the model from scratch, thereby addressing the high costs associated with frequent retraining. Among knowledge editing approaches, the in-context editing paradigm stands out for its effectiveness in integrating new knowledge while preserving the model's original capabilities. Despite its potential, existing in-context knowledge editing methods are often task-specific, focusing primarily on multi-hop QA tasks using structured knowledge triples. Moreover, their reliance on few-shot prompting for task decomposition makes them unstable and less effective in generalizing across diverse tasks. In response to these limitations, we propose EditCoT, a novel knowledge editing framework that flexibly and efficiently updates LLMs across various tasks without retraining. EditCoT works by generating a chain-of-thought (CoT) for a given input and then iteratively refining this CoT using a CoT editor based on updated knowledge. We evaluate EditCoT across a diverse range of benchmarks, covering multiple languages and tasks. The results demonstrate that our approach achieves state-of-the-art performance while offering superior generalization, effectiveness, and stability compared to existing methods, marking a significant advancement in the field of knowledge updating. The code and data of EditCoT are available at: https://github.com/bebr2/EditCoT.
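The generate-then-refine loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (see the linked repository for that). The names `refine_cot`, `toy_generate`, and `toy_edit` are hypothetical, the stopping rule (stop when the editor leaves the CoT unchanged, or after `max_rounds`) is an assumption, and the toy functions are simple string operations standing in for LLM calls.

```python
from typing import Callable, Dict


def refine_cot(
    question: str,
    generate_cot: Callable[[str], str],
    edit_cot: Callable[[str, str, Dict[str, str]], str],
    updated_knowledge: Dict[str, str],
    max_rounds: int = 3,
) -> str:
    """Generate an initial CoT, then let an editor revise it repeatedly.

    Assumption: iteration stops once the editor returns the CoT unchanged
    or after max_rounds passes. The abstract does not state the criterion.
    """
    cot = generate_cot(question)
    for _ in range(max_rounds):
        revised = edit_cot(question, cot, updated_knowledge)
        if revised == cot:  # nothing left to edit
            break
        cot = revised
    return cot


# Toy stand-ins for the LLM and the CoT editor (hypothetical, fictional facts).
def toy_generate(question: str) -> str:
    # Pretend the model answers from outdated knowledge.
    return "The CEO of Acme is Alice. Alice lives in Paris."


def toy_edit(question: str, cot: str, knowledge: Dict[str, str]) -> str:
    # Apply at most one replacement per round, so dependent reasoning
    # steps are repaired over successive rounds.
    for old, new in knowledge.items():
        if old in cot:
            return cot.replace(old, new)
    return cot


edits = {
    "The CEO of Acme is Alice.": "The CEO of Acme is Bob.",
    "Alice lives in Paris.": "Bob lives in Rome.",
}
final_cot = refine_cot("Where does the CEO of Acme live?", toy_generate, toy_edit, edits)
print(final_cot)
```

In this toy run the first round replaces the outdated fact and the second round repairs the step that depended on it, which is the reason for iterating rather than editing once.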

Changyue Wang, Weihang Su, Qingyao Ai, Yichen Tang, Yiqun Liu • 2024

Related benchmarks

Task                Dataset           Result                           Rank
Knowledge Editing   CounterFact       Efficacy 96.73                   301
Knowledge Editing   zsRE              --                               181
Knowledge Editing   WikiUpdate        Edit Success 77.68               30
Knowledge Editing   Counterfact uns   Edit Success Rate 87.76          30
Knowledge Editing   MQuAKE            Edit Success Rate 96.73          30
Sentiment editing   ConvSent          Success Rate (1K Edits) 79.15    14