FIRE: Fact-checking with Iterative Retrieval and Verification

About

Fact-checking long-form text is challenging, and it is therefore common practice to break it down into multiple atomic claims. The typical approach to fact-checking these atomic claims involves retrieving a fixed number of pieces of evidence, followed by a verification step. However, this method is usually not cost-effective, as it underutilizes the verification model's internal knowledge of the claim and fails to replicate the iterative reasoning process in human search strategies. To address these limitations, we propose FIRE, a novel agent-based framework that integrates evidence retrieval and claim verification in an iterative manner. Specifically, FIRE employs a unified mechanism to decide whether to provide a final answer or generate a subsequent search query, based on its confidence in the current judgment. We compare FIRE with other strong fact-checking frameworks and find that it achieves slightly better performance while reducing large language model (LLM) costs by an average of 7.6 times and search costs by 16.5 times. These results indicate that FIRE holds promise for application in large-scale fact-checking operations. Our code is available at https://github.com/mbzuai-nlp/fire.git.

Zhuohan Xie, Rui Xing, Yuxia Wang, Jiahui Geng, Hasan Iqbal, Dhruv Sahnan, Iryna Gurevych, Preslav Nakov• 2024

Related benchmarks

Task	Dataset	Result
Hallucination Detection	HaluEval	--	135
Hallucination Detection	MMLU-Pro	--	31
Hallucination Detection	XTRUST	Accuracy58.09	30
Veracity Assessment	FactCheck-Bench	Macro-F176	26
Fact Checking	FeLMWk	F1 (True)0.77	16
Long-form Retrieval-Augmented Generation	LongFact	Information Density (Sci.)229.9	14
Long-form Factual Generation	LongFact	Fact Recall (FR) - Science79.7	14
Long-form RAG Evaluation	RAGChecker	Fact Recall70.7	14
Claim Verification	AmbiguousSnopes	Precision31	14
Claim Verification	ExClaim	Precision (P)24	14

Showing 10 of 19 rows

Other info

Follow for update

@wizwand_team Discord