Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Prefill-stage hallucination risk detection on Benchmark-500 Strict Consensus Pvote = 1.0 vs. Clean
Loading...
0.6939
AUROC (Mean)
Risk-Cos
0.337076
0.429713
0.52235
0.614987
Mar 20, 2026
AUROC (Mean)
AUROC 95% CI (Lower Bound)
Updated 27d ago
Evaluation Results
Method
Method
Links
AUROC (Mean)
AUROC 95% CI (Lower Bound)
Risk-Cos
generation_mode=prefil...
2026.03
0.6939
0.58
Risk-Margin
generation_mode=prefil...
2026.03
0.685
0.57
Risk-Entropy
generation_mode=prefil...
2026.03
0.5323
0.44
Risk-Loss
generation_mode=prefil...
2026.03
0.3508
0.25
Feedback
Search any
task
Search any
task