MedHallu

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Hallucination Detection | MedHallu | AUROC: 1 | 24 |
| Hallucination Detection | MedHallu (test) | Precision (HP): 88.2 | 11 |
| Generative Question Answering | MedHallu | HALL Score: 67.33 | 10 |