Verification on SummEval (AUROC)
[Chart: AUROC by date (May 11, 2026). Best result: 0.755 AUROC, VERDI LR.]
Updated 21d ago
Evaluation Results
Method     Model          Date      AUROC
VERDI LR   GPT-5.4-mini   2026.05   0.755
VERDI LR   GPT-4.1-mini   2026.05   0.717
VERDI LR   Qwen3.5-4B     2026.05   0.681
VERDI LR   Qwen3.5-9B     2026.05   0.654
VERDI LR   Qwen3.5-27B    2026.05   0.637
Logprob    Qwen3.5-9B     2026.05   0.417
Logprob    Qwen3.5-4B     2026.05   0.373
Logprob    Qwen3.5-27B    2026.05   0.325
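
For reading the AUROC column: AUROC is the probability that a method's score ranks a randomly chosen positive example above a randomly chosen negative one, so 0.5 is chance level and a value below 0.5 (as in the Logprob rows) means the scores rank examples in the opposite direction to the labels. The sketch below computes AUROC directly from that pairwise definition. The function name `auroc` and the toy scores and labels are illustrative assumptions, not the benchmark's evaluation code or data.

```python
def auroc(scores, labels):
    """Pairwise AUROC: fraction of (positive, negative) pairs in which the
    positive example gets the higher score; ties count as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: label 1 = positive, label 0 = negative.
scores = [0.9, 0.4, 0.7, 0.2]
labels = [1, 1, 0, 0]

print(auroc(scores, labels))                # 0.75
print(auroc([-s for s in scores], labels))  # 0.25, i.e. 1 - 0.75
```

Negating the scores turns an AUROC of a into 1 - a, which is why a result well below 0.5 still carries ranking signal, only with the sign flipped.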