End-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and Models

About

We propose end-to-end multimodal fact-checking and explanation generation. The input is a claim and a large collection of web sources, including articles, images, videos, and tweets. The goal is to assess the truthfulness of the claim by retrieving relevant evidence and predicting a truthfulness label (e.g., support, refute, or not enough information), and to generate a statement that summarizes and explains the reasoning and ruling process. To support this research, we construct Mocheg, a large-scale dataset of 15,601 claims, each annotated with a truthfulness label and a ruling statement, together with 33,880 textual paragraphs and 12,112 images as evidence. To establish baseline performance on Mocheg, we experiment with several state-of-the-art neural architectures on three pipelined subtasks: multimodal evidence retrieval, claim verification, and explanation generation. The results show that current state-of-the-art end-to-end multimodal fact-checking does not yet perform satisfactorily. To the best of our knowledge, we are the first to build a benchmark dataset and solutions for end-to-end multimodal fact-checking and explanation generation. The dataset, source code, and model checkpoints are available at https://github.com/VT-NLP/Mocheg.
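The three pipelined subtasks named in the abstract can be pictured as a simple data flow. The sketch below is illustrative only and is not taken from the Mocheg repository: the names `Evidence`, `Verdict`, and `fact_check` are hypothetical, and the retrieval, verification, and explanation stages are passed in as plain callables because the abstract does not specify the models behind them.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

# Label set as listed in the abstract; the exact strings are an assumption.
LABELS = ("support", "refute", "not enough information")


@dataclass
class Evidence:
    kind: str     # "text" or "image" (hypothetical field)
    content: str  # paragraph text, or an image path/identifier


@dataclass
class Verdict:
    label: str
    evidence: List[Evidence]
    explanation: str


# Each stage is a pluggable callable; real systems would wrap neural models.
Retriever = Callable[[str, Sequence[Evidence]], List[Evidence]]
Verifier = Callable[[str, List[Evidence]], str]
Explainer = Callable[[str, List[Evidence], str], str]


def fact_check(
    claim: str,
    corpus: Sequence[Evidence],
    retrieve: Retriever,
    verify: Verifier,
    explain: Explainer,
) -> Verdict:
    """Run the three stages in order: retrieval, verification, explanation."""
    evidence = retrieve(claim, corpus)            # stage 1: evidence retrieval
    label = verify(claim, evidence)               # stage 2: claim verification
    if label not in LABELS:
        raise ValueError(f"unexpected label: {label!r}")
    explanation = explain(claim, evidence, label)  # stage 3: explanation generation
    return Verdict(label=label, evidence=evidence, explanation=explanation)
```

Because each stage consumes the previous stage's output, errors in retrieval propagate into verification and explanation, which is one reason end-to-end scores tend to be lower than scores measured on each subtask in isolation.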

Barry Menglong Yao, Aditya Shah, Lichao Sun, Jin-Hee Cho, Lifu Huang ((1) Virginia Tech, (2) Lehigh University) • 2022

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Claim Verification | AIChartClaim | Macro F1 | 59 | 38 |
| Claim Verification | ChartCheck | Macro F1 | 0.578 | 38 |
| Claim Verification | Mocheg | Macro F1 | 45.6 | 32 |
| Claim Verification | MR2 | Macro F1 | 68 | 32 |
| Explanation Generation | AIChartClaim 1.0 (test) | ROUGE-1 | 41.5 | 9 |
| Explanation Generation | AIChartClaim | ROUGE-L | 33.4 | 9 |
| Explanation Generation | ChartCheck 1.0 (test) | ROUGE-1 | 47.1 | 9 |
| Explanation Generation | ChartCheck | ROUGE-L | 39.6 | 9 |
| Explanation Generation | AIChartClaim (test) | ROUGE-1 | 39.5 | 9 |
| Explanation Generation | ChartCheck (test) | ROUGE-1 | 45.3 | 9 |
Showing 10 of 23 rows
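
For readers unfamiliar with the Macro F1 scores reported for claim verification, the following is a minimal sketch of how the metric is commonly computed: F1 is calculated per label and then averaged with equal weight, so rare labels count as much as frequent ones. The function name `macro_f1` and the label strings are assumptions for illustration; this is not the evaluation script behind the numbers above, and the listed scores appear to mix a 0 to 100 scale with a 0 to 1 scale.

```python
from typing import Sequence


def macro_f1(y_true: Sequence[str], y_pred: Sequence[str], labels: Sequence[str]) -> float:
    """Unweighted mean of per-label F1 scores, on a 0 to 1 scale."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined here as 0 when the label never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy example with the three labels named in the abstract.
labels = ["support", "refute", "not enough information"]
gold = ["support", "support", "refute", "refute",
        "not enough information", "not enough information"]
pred = ["support", "refute", "refute", "refute",
        "not enough information", "support"]
score = macro_f1(gold, pred, labels)
```

In this toy example the per-label F1 values are 0.5, 0.8, and 2/3, so `score` is their plain average. Multiply by 100 to compare against scores reported as percentages.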
