Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration

About

The swift advancement in Multimodal LLMs (MLLMs) presents significant challenges for effective knowledge editing. Current methods, including intrinsic knowledge editing and external knowledge resorting, each possess strengths and weaknesses, struggling to balance the desired properties of reliability, generality, and locality when applied to MLLMs. In this paper, we propose UniKE, a novel multimodal editing method that establishes a unified perspective and paradigm for intrinsic knowledge editing and external knowledge resorting. Both types of knowledge are conceptualized as vectorized key-value memories, with the corresponding editing processes resembling the assimilation and accommodation phases of human cognition, conducted at the same semantic levels. Within such a unified framework, we further promote knowledge collaboration by disentangling the knowledge representations into the semantic and truthfulness spaces. Extensive experiments validate the effectiveness of our method, which ensures that the post-edit MLLM simultaneously maintains excellent reliability, generality, and locality. The code for UniKE is available at https://github.com/beepkh/UniKE.

Kaihang Pan, Zhaoyu Fan, Juncheng Li, Qifan Yu, Hao Fei, Siliang Tang, Richang Hong, Hanwang Zhang, Qianru Sun • 2024
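
The abstract names three properties a post-edit model should keep: reliability, generality, and locality. As a rough illustration of how such scores are commonly defined in the knowledge-editing literature, here is a minimal sketch. It is not the UniKE evaluation code: the `Model` type, the toy lookup-table "models", and the exact-match scoring are all assumptions made for illustration, and real multimodal benchmarks operate on image-text inputs and may score at the token level.

```python
from typing import Callable, Sequence, Tuple

# Hypothetical stand-in: a "model" is any function from a prompt to an answer.
Model = Callable[[str], str]


def accuracy(model: Model, pairs: Sequence[Tuple[str, str]]) -> float:
    """Fraction of (prompt, target) pairs the model answers exactly."""
    if not pairs:
        return 0.0
    return sum(model(p) == t for p, t in pairs) / len(pairs)


def locality(pre: Model, post: Model, prompts: Sequence[str]) -> float:
    """Fraction of unrelated prompts whose answer is unchanged by the edit."""
    if not prompts:
        return 0.0
    return sum(pre(p) == post(p) for p in prompts) / len(prompts)


# Toy pre-edit and post-edit models (lookup tables, not MLLMs).
pre_answers = {"capital of X?": "A", "color of sky?": "blue", "2+2?": "4"}
post_answers = {
    **pre_answers,
    "capital of X?": "B",    # the intended edit
    "X's capital is?": "B",  # the edit generalizes to one rephrasing
    "2+2?": "5",             # an unwanted side effect on unrelated knowledge
}


def pre_model(prompt: str) -> str:
    return pre_answers.get(prompt, "unknown")


def post_model(prompt: str) -> str:
    return post_answers.get(prompt, "unknown")


edit_set = [("capital of X?", "B")]
rephrase_set = [("X's capital is?", "B"), ("which city is X's capital?", "B")]
unrelated = ["color of sky?", "2+2?"]

reliability = accuracy(post_model, edit_set)      # edited prompts answered as intended
generality = accuracy(post_model, rephrase_set)   # rephrasings of the edited prompts
loc = locality(pre_model, post_model, unrelated)  # unrelated answers left untouched

print(f"reliability={reliability:.2f} generality={generality:.2f} locality={loc:.2f}")
```

In this toy setup the edit is fully reliable, generalizes to only one of two rephrasings, and disturbs one of two unrelated prompts, which shows why the three scores have to be read together.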

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Knowledge Editing | MMEdit E-VQA | Reliability 98.8 | 61 |
| Knowledge Editing | E-VQA MMEdit 1.0 (test) | Reliability 94.32 | 24 |
| Knowledge Editing | MMEdit E-IC 1.0 (test) | Reliability 74.01 | 24 |
| Knowledge Editing | MMEdit E-IC | Reliability 98.3 | 16 |
| Multimodal Knowledge Editing | MMEdit 10-step sequential editing on VQA | Reliability 91.5 | 12 |
| Knowledge Editing | MMEdit One-Step Editing | Reliability 98 | 7 |
| Cross-task Knowledge Editing | MMEdit cross-task | Rel. Score 90.7 | 6 |
| Image Caption Editing | MMEdit 10-step sequential editing | Relevance 91.8 | 6 |
| Image Captioning Editing | MMEdit | Relevance Score 88.4 | 6 |
| Knowledge Editing | MMEdit Sequential Editing | Reliability 90.2 | 6 |
Showing 10 of 16 rows
