MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs

About

Current multimodal misinformation detection (MMD) methods often assume a single source and type of forgery for each sample, which is insufficient for real-world scenarios where multiple forgery sources coexist. The lack of a benchmark for mixed-source misinformation has hindered progress in this field. To address this, we introduce MMFakeBench, the first comprehensive benchmark for mixed-source MMD. MMFakeBench includes 3 critical sources: textual veracity distortion, visual veracity distortion, and cross-modal consistency distortion, along with 12 sub-categories of misinformation forgery types. We further conduct an extensive evaluation of 6 prevalent detection methods and 15 Large Vision-Language Models (LVLMs) on MMFakeBench under a zero-shot setting. The results indicate that current methods struggle under this challenging and realistic mixed-source MMD setting. Additionally, we propose MMD-Agent, a novel approach to integrate the reasoning, action, and tool-use capabilities of LVLM agents, significantly enhancing accuracy and generalization. We believe this study will catalyze future research into more realistic mixed-source multimodal misinformation and provide a fair evaluation of misinformation detection methods.

Xuannan Liu, Zekun Li, Peipei Li, Huaibo Huang, Shuhan Xia, Xing Cui, Linzhi Huang, Weihong Deng, Zhaofeng He• 2024

Related benchmarks

Task	Dataset	Result
Multi-modal manipulation detection	ROM NYT domain 1.0 (test)	Accuracy61.34	23
Distortion Diagnosis	ReMMDBench (500-sample split)	Precision43.08	17
Veracity Classification	ReMMDBench (500-sample split)	Accuracy26.4	17
Multimodal misinformation detection	MMFakeBench (val)	F1 Score78.8	4
Multi-modal manipulation detection	ROM Wash. domain 1.0 (test)	Accuracy51.23	3
Multi-modal manipulation detection	ROM Average across 5 domains 1.0 (test)	Accuracy57.45	3
Multimodal misinformation detection	MMFakeBench official (test)	Accuracy59.2	3

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord