Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Hallucination Detection on Fact*
Loading...
78.6
AUC-ROC
SDES
53.12
59.735
66.35
72.965
May 30, 2025
AUC-ROC
Updated 4d ago
Evaluation Results
Method
Method
Links
AUC-ROC
SDES
LLM=Vicuna-13B
2025.05
78.6
CDES
LLM=Gemma-2-9B
2025.05
73.9
AvgProb
LLM=LLama-2-13B
2025.05
59.3
P(True)
LLM=LLama-2-7B
2025.05
54.1
Feedback
Search any
task
Search any
task