Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DDXPlus

Benchmarks

Task NameDataset NameSOTA ResultTrend
Medical Question AnsweringDDXPlus
Accuracy86.5
28
Automated Medical DiagnosisDDXPlus (test)
IL25.75
9
Medical ReasoningDDXPlus
Performance Score81.1
8
Confidence EstimationDDXPlus
AUROC0.795
7
ClassificationDDXPlus
Accuracy50.1
4
Showing 5 of 5 rows