Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MedicalQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Medical Question AnsweringMedicalQA
Accuracy86
33
Hallucination DetectionMedicalQA
AUROC78.95
28
Selective PredictionMedicalQA
E-AURC0.3373
28
Question AnsweringMedicalQA
Score84.2
12
Question AnsweringMedicalQA (test)
ROUGE52.9
12
RetrievalMedicalQA
nDCG@155.3
6
Showing 6 of 6 rows