Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Claim Verification benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Claim Verification
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
9-dataset aggregate retrieval-free setting (test)
GPT-4.1
ROC-AUC
84
70
2mo ago
Med Claim
Direct Question
Accuracy
85
56
26d ago
PerplexityAI (test)
DYDECOMP
Verification Confidence
82.3
52
3mo ago
Strategy Claim
Direct Question
Accuracy
77
49
26d ago
Truthful Claim
ArgLLM
Accuracy
81
49
26d ago
ChartCheck
MEVER
Macro F1
0.643
38
3mo ago
AIChartClaim
MEVER
Macro F1
71.6
38
3mo ago
MR2
MEVER
Macro F1
77.7
32
3mo ago
Mocheg
MEVER
Macro F1
49.7
32
3mo ago
HoVer (test)
TOME-2
Accuracy
73.1
31
2mo ago
SciTab-OD
Llama-400B
Macro F1
77
28
1mo ago
AVeriTeC Retrieved (I) (dev)
DebateCV
Accuracy
73.6
28
1mo ago
AVeriTeC Retrieved (H) (dev)
DebateCV
Accuracy
72.8
28
1mo ago
AVeriTeC Golden (dev)
DebateCV
Accuracy
83.4
28
1mo ago
FactKG (test)
SimGRAG
Average Accuracy
86.8
20
3mo ago
PrimeFacts Five Class
Llama-3.3-70B
Macro F1 Score
42
19
26d ago
PrimeFacts Two Class
Llama-3.3-70B
Macro F1
81
19
26d ago
DIALFACT (val)
Aug-WoW
Accuracy
70.4
18
3mo ago
DIALFACT (test)
Aug-WoW
Accuracy
69.2
18
3mo ago
SemTab
MACE
Micro F1
90
14
1mo ago
AmbiguousSnopes
CO-FACTCHECKER
Precision
39
14
1mo ago
ExClaim
CO-FACTCHECKER
Precision (P)
34
14
1mo ago
Claim verification dataset
GEMINI + DACLR
Precision
74.92
12
6d ago
FinDVer (test)
Mistral-Large
Accuracy
76
12
1mo ago
FinDVer mini (test)
Qwen-2.5
Accuracy
76
12
1mo ago
Showing 25 of 54 rows
25 / page
50 / page
100 / page
1
2
3
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs