Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Citation Hallucination Detection on Citation strings (full dataset)
Loading...
0.876
AUC
GBM
0.72
0.7605
0.801
0.8415
Feb 7, 2026
AUC
Average Precision (AP)
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
AUC
Average Precision (AP)
Accuracy
GBM
Evaluation Protocol=5-...
2026.02
0.876
0.82
79.1
Random Forest
Evaluation Protocol=5-...
2026.02
0.857
0.797
76.2
Logistic Regression
Evaluation Protocol=5-...
2026.02
0.726
0.64
65.2
Feedback
Search any
task
Search any
task