Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FIRE: Fact-checking with Iterative Retrieval and Verification

About

Fact-checking long-form text is challenging, and it is therefore common practice to break it down into multiple atomic claims. The typical approach to fact-checking these atomic claims involves retrieving a fixed number of pieces of evidence, followed by a verification step. However, this method is usually not cost-effective, as it underutilizes the verification model's internal knowledge of the claim and fails to replicate the iterative reasoning process in human search strategies. To address these limitations, we propose FIRE, a novel agent-based framework that integrates evidence retrieval and claim verification in an iterative manner. Specifically, FIRE employs a unified mechanism to decide whether to provide a final answer or generate a subsequent search query, based on its confidence in the current judgment. We compare FIRE with other strong fact-checking frameworks and find that it achieves slightly better performance while reducing large language model (LLM) costs by an average of 7.6 times and search costs by 16.5 times. These results indicate that FIRE holds promise for application in large-scale fact-checking operations. Our code is available at https://github.com/mbzuai-nlp/fire.git.

Zhuohan Xie, Rui Xing, Yuxia Wang, Jiahui Geng, Hasan Iqbal, Dhruv Sahnan, Iryna Gurevych, Preslav Nakov• 2024

Related benchmarks

TaskDatasetResultRank
Hallucination DetectionHaluEval
F1 Score70.73
75
Hallucination DetectionMMLU-Pro
Accuracy61.05
30
Hallucination DetectionXTRUST
Accuracy58.09
30
Veracity AssessmentFactCheck-Bench
Macro-F176
26
Fact CheckingFeLMWk
F1 (True)0.77
16
Claim VerificationAmbiguousSnopes
Precision31
14
Claim VerificationExClaim
Precision (P)24
14
Fact CheckingDeepFact-Bench (test)
Accuracy58.5
13
Fact CheckingFEVER
Balanced Accuracy90.6
12
Veracity AssessmentFacTool-QA
True F189
12
Showing 10 of 13 rows

Other info

Follow for update