Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Medical Diagnosis benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Medical Diagnosis
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
MIMIC-IV diagnostic evaluation set (test)
GLEAN (N=3)
Accuracy
78.33
54
3mo ago
agent-CMB
Medical-CoT*
Rounds
18.34
25
3mo ago
MedQA agent
MedKGI
Rounds
9.11
25
3mo ago
AgentClinic OOD original (test)
Aloe-Beta-70B
Similarity (Sim)
0.684
20
23d ago
MedEinst Robust 1.0
ECR-Agent (Qwen3-32B)
Robust Accuracy
24.21
18
3mo ago
MedEinst Baseline 1.0
ECR-Agent (Qwen3-32B)
Baseline Accuracy
69.49
18
3mo ago
COVID19-CT
SH-PEFT
F1 Score
83
16
3mo ago
MAU (test)
UMed-LVLM
DL Score
53
13
3mo ago
PMC-Patients
MedExAgent-8B
Similarity Score
62.6
12
23d ago
DDxPlus
MedExAgent-8B
Similarity
96.6
12
23d ago
DDXPlus n=50
BMBE + GPT-5.4-nano
Top-1 Accuracy
78
12
1mo ago
Step-CoT (test)
Ours (Teacher)
Accuracy
78.3
10
2mo ago
CXR14 (external)
DeepMedix
Precision for Edema
71.26
10
3mo ago
MedAction 300 Hard
GPT-5.4
Diag. Acc.
82
9
23d ago
MedR-Bench
GPT-5.4
Diagnostic Accuracy
81
9
23d ago
DiagnosisArena (test)
GoS
Match (LLM-as-a-Judge)
31.88
9
2mo ago
MediQ (test)
O4-MINI
Average Outcome Reward
74.67
9
2mo ago
NEJM
DDO
Rounds
17.91
9
3mo ago
IndicMedDialog Telugu 1.0 (test)
GEMMA
Diagnostic Accuracy
6.38
8
20d ago
IndicMedDialog Tamil 1.0 (test)
GEMMA
Diagnostic Accuracy
11.91
8
20d ago
IndicMedDialog Assamese 1.0 (test)
Tiny-AYA
Diagnostic Accuracy
8.08
8
20d ago
IndicMedDialog Punjabi 1.0 (test)
IndicMedLM
Diagnostic Accuracy
20.42
8
20d ago
IndicMedDialog Gujarati 1.0 (test)
Tiny-AYA
Diagnostic Accuracy
0.3702
8
20d ago
IndicMedDialog Urdu 1.0 (test)
IndicMedLM
Diagnostic Accuracy
28.51
8
20d ago
IndicMedDialog Bengali 1.0 (test)
IndicMedLM
Diagnostic Accuracy
58.72
8
20d ago
Showing 25 of 38 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs