Hallucination Detection on DiaHalu (sampled)
[Chart: Precision (Class 0) by date. CAAFC: 87.7 (May 12, 2026).]
Updated 21d ago
Evaluation Results
Method: CAAFC (Backbone=Gemma-27B, su...)
Date: 2026.05

Metric                  Value
Precision (Class 0)     87.7
Recall (Class 0)        46.7
F1 Score (Class 0)      61
Precision (Class 1)     63.9
Recall (Class 1)        93.5
F1 Score (Class 1)      75.9
Accuracy                70.2
Macro Precision         75.8
Macro Recall            70.1
Macro F1 Score          68.4
Weighted Precision      75.7
Weighted Recall         70.2
Weighted F1 Score       68.5
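
The per-class and macro figures reported for CAAFC are related by the standard definitions: F1 is the harmonic mean of precision and recall, and a macro score is the unweighted mean over the two classes. The sketch below is a consistency check on the published numbers only, not the benchmark's own evaluation code. The helper names `f1` and `macro` are hypothetical, and because the inputs are already rounded to one decimal, recomputed values can differ from the table in the last digit.

```python
# Consistency check on the reported CAAFC scores (values copied from the table).
# Assumption: standard definitions of F1 and macro averaging; helper names are illustrative.

def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

def macro(values: list[float]) -> float:
    """Unweighted mean across classes."""
    return sum(values) / len(values)

# Reported per-class precision and recall, in percent.
p0, r0 = 87.7, 46.7  # Class 0
p1, r1 = 63.9, 93.5  # Class 1

f1_0 = f1(p0, r0)
f1_1 = f1(p1, r1)

macro_precision = macro([p0, p1])
macro_recall = macro([r0, r1])
macro_f1 = macro([f1_0, f1_1])

print(f"F1 (Class 0):    {f1_0:.2f}")
print(f"F1 (Class 1):    {f1_1:.2f}")
print(f"Macro Precision: {macro_precision:.2f}")
print(f"Macro Recall:    {macro_recall:.2f}")
print(f"Macro F1:        {macro_f1:.2f}")
```

Weighted Recall matching Accuracy (both 70.2) is expected rather than coincidental: recall weighted by class support equals overall accuracy by definition.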