AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators

About

With the rise of generative AI, automated fact-checking methods to combat misinformation are becoming more and more important. However, factual claim detection, the first step in a fact-checking pipeline, suffers from two key issues that limit its scalability and generalizability: (1) inconsistency in definitions of the task and what a claim is, and (2) the high cost of manual annotation. To address (1), we review the definitions in related work and propose a unifying definition of factual claims that focuses on verifiability. To address (2), we introduce AFaCTA (Automatic Factual Claim deTection Annotator), a novel framework that assists in the annotation of factual claims with the help of large language models (LLMs). AFaCTA calibrates its annotation confidence with consistency along three predefined reasoning paths. Extensive evaluation and experiments in the domain of political speech reveal that AFaCTA can efficiently assist experts in annotating factual claims and training high-quality classifiers, and can work with or without expert supervision. Our analyses also result in PoliClaim, a comprehensive claim detection dataset spanning diverse political topics.

Jingwei Ni, Minjing Shi, Dominik Stammbach, Mrinmaya Sachan, Elliott Ash, Markus Leippold• 2024

Related benchmarks

Task	Dataset	Result
Claim Verification	CheckThat! S (100+/100+) 2021 re-annotated (dev)	Agreement43.7	5
Factual Claim Detection	PoliClaim Perfectly consistent samples S_con (test)	Agreement83.3	4
Factual Claim Detection	PoliClaim Inconsistent samples S_inc (test)	Agreement41.8	4
Factual Claim Detection	PoliClaim Full S (test)	Agreement61.5	3
Claim Verification	CheckThat! S24_con 2021 re-annotated (dev)	Agreement58.4	2

Showing 5 of 5 rows

Other info

Code

Follow for update

@wizwand_team Discord