Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Medical Diagnosis on MedR-Bench

81Diagnostic Accuracy

GPT-5.4

49.857.96674.1May 8, 2026
Updated 23d ago

Evaluation Results

MethodLinks
2026.05
81
2026.05
74
2026.05
71
2026.05
69
2026.05
68
2026.05
66
2026.05
65
2026.05
61
2026.05
51