Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Response-level Hallucination Detection on HDM-Bench
Loading...
76.3
AUROC
HallucinationProbes
54.9904
60.5227
66.055
71.5873
Apr 17, 2026
AUROC
Updated 1mo ago
Evaluation Results
Method
Method
Links
AUROC
HallucinationProbes
Detector category=Whit...
2026.04
76.3
RAGognizer
Detector category=Whit...
2026.04
72.72
MiniCheck-7B
Detector category=Blac...
2026.04
71.7
HDM-2-3B
Detector category=Blac...
2026.04
69.62
SelfCheckGPT (NLI)
Detector category=Blac...
2026.04
68.44
DeBERTa-v3 (Ent.)
Detector category=Blac...
2026.04
62.6
DeBERTa-v3 (Con.)
Detector category=Blac...
2026.04
60.9
LettuceDetect-L
Detector category=Blac...
2026.04
60.85
HHEM-2.1-Open
Detector category=Blac...
2026.04
59.97
INSIDE (EigenScore)
Detector category=Whit...
2026.04
56.35
Semantic Entropy
Detector category=Whit...
2026.04
55.81
Feedback
Search any
task
Search any
task