
Binary Fact-checking on Claim Verify

Macro F1: 0.896

InFi-Checker-Qwen

Updated 5mo ago

Evaluation Results

Method	Date	Macro F1
InFi-Checker-Qwen	2026.01	0.896
GPT-5	2026.01	0.877
MiniCheck	2026.01	0.856
ClearCheck (COT)	2026.01	0.854
Claude-3.7-Sonnet	2026.01	0.837
o3	2026.01	0.833
GPT-4.1	2026.01	0.816
AlignScore-large	2026.01	0.798
GPT-4o	2026.01	0.783
FactCG	2026.01	0.762
InFi-Checker-Llama	2026.01	0.759
DeepSeek-V3.2-NoThink	2026.01	0.754
Qwen3-8B	2026.01	0.661
Llama-3.1-8B-Instruct	2026.01	0.636
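
The page does not say how the Macro F1 scores are computed. For reference, here is a minimal sketch of the standard definition for a binary task: the unweighted mean of the per-class F1 scores. The label encoding (1 = supported, 0 = unsupported) and the toy predictions are assumptions for illustration only, not data from this benchmark.

```python
def f1_for_class(y_true, y_pred, cls):
    """F1 score treating `cls` as the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p == cls)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t != cls and p == cls)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p != cls)
    denom = 2 * tp + fp + fn
    # Convention: F1 is 0 when the class never appears in labels or predictions.
    return 2 * tp / denom if denom else 0.0


def macro_f1(y_true, y_pred, classes=(0, 1)):
    """Unweighted mean of per-class F1, so both classes count equally."""
    return sum(f1_for_class(y_true, y_pred, c) for c in classes) / len(classes)


# Hypothetical labels: 1 = claim supported, 0 = claim unsupported.
y_true = [1, 1, 1, 1, 0, 0]
y_pred = [1, 1, 1, 0, 0, 1]

print(macro_f1(y_true, y_pred))  # class 1 F1 = 0.75, class 0 F1 = 0.5 -> 0.625
```

Because each class contributes equally to the average, a checker that does well only on the majority class is penalized, which is presumably why the leaderboard reports this metric rather than plain accuracy.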