Fact Verification on FEVER (AUROC)
[Chart: AUROC over time. Top result: 0.737 AUROC (VERDI LR), plotted at May 11, 2026.]
Updated 21d ago
Evaluation Results
Method     Model          Date      AUROC
VERDI LR   GPT-4.1-mini   2026.05   0.737
VERDI LR   Qwen3.5-4B     2026.05   0.699
VERDI LR   GPT-5.4-mini   2026.05   0.662
VERDI LR   Qwen3.5-9B     2026.05   0.648
VERDI LR   Qwen3.5-27B    2026.05   0.56
Logprob    Qwen3.5-4B     2026.05   0.494
Logprob    Qwen3.5-27B    2026.05   0.479
Logprob    Qwen3.5-9B     2026.05   0.421
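
For readers unfamiliar with the metric, the following is a minimal sketch of how an AUROC value like those listed above can be computed from per-claim confidence scores and binary labels. It uses the pairwise (Mann-Whitney) formulation: the fraction of positive/negative pairs in which the positive example gets the higher score, with ties counted as half. The function name `auroc`, the label convention (1 = positive, 0 = negative), and the toy data are assumptions made for illustration; the page does not describe how the listed methods produce or evaluate their scores.

```python
def auroc(labels, scores):
    """AUROC as the probability that a random positive outscores a random negative.

    Ties count as half a win (Mann-Whitney U formulation).
    Label convention (assumed here): 1 = positive, 0 = negative.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy data, invented for illustration only (not FEVER results).
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2]
toy_auroc = auroc(toy_labels, toy_scores)
print(f"toy AUROC = {toy_auroc:.3f}")
```

Under this definition, 0.5 corresponds to a scorer that ranks positives and negatives no better than chance and 1.0 to a perfect ranking, which is a useful reference point when reading the table.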