Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA

About

Large language models (LLMs) have shown great success in various Natural Language Processing (NLP) tasks, whist they still need updates after deployment to fix errors or keep pace with the changing knowledge in the world. Researchers formulate such problem as Model Editing and have developed various editors focusing on different axes of editing properties. However, current editors can hardly support all properties and rely on heavy computational resources. In this paper, we propose a plug-in Model Editing method based on neuron-indexed dynamic LoRA (MELO), which alters the behavior of language models by dynamically activating certain LoRA blocks according to the index built in an inner vector database. Our method satisfies various editing properties with high efficiency and can be easily integrated into multiple LLM backbones. Experimental results show that our proposed MELO achieves state-of-the-art editing performance on three sequential editing tasks (document classification, question answering and hallucination correction), while requires the least trainable parameters and computational cost.

Lang Yu, Qin Chen, Jie Zhou, Liang He• 2023

Related benchmarks

TaskDatasetResultRank
Image ClassificationImageNet-R
Accuracy78.98
217
Hallucination CorrectionUniEdit
Error Rate (ERR)1
24
Hallucination CorrectionWikiBigEdit
Error Rate (ERR)1
24
Hallucination CorrectionHallucination
Error Rate (ERR)17.45
10
Question AnsweringzsRE
Error Rate (ERR)72
9
Image ClassificationVTAB Sim50
Accuracy91.07
3
Image ClassificationVTAB 5T small
Accuracy87.67
3
Showing 7 of 7 rows

Other info

Follow for update