SOTA Medical Question Answering benchmarks and papers with code | Wizwand
Medical Question Answering
Benchmarks
| Dataset Name | SOTA Method | Metric | Value | Results | Last Updated |
| --- | --- | --- | --- | --- | --- |
| MedMCQA | ProtRLSearch | Accuracy | 90.4 | 346 | 3d ago |
| MedQA | ToolTree | Accuracy | 93.88 | 153 | 18d ago |
| MedMCQA (test) | Gemini2.5-Pro | Accuracy | 84.13 | 134 | 1mo ago |
| PubMedQA | HuatuoGPT-o1-70B | Accuracy | 81.4 | 92 | 10d ago |
| MedExpQA | Gemini-2.5-Flash | Overall Accuracy | 86.19 | 70 | 1mo ago |
| MedBullets | Multi-Agent Medical Decision Consensus Matrix System | Accuracy | 84.2 | 65 | 9d ago |
| MMLU Med | MAPLE | Accuracy | 85.19 | 61 | 11d ago |
| MedMCQA | Llama-3.1-8B-Instruct | BLEU Score | 10.82 | 54 | 1mo ago |
| DDXPlus | Multi-Agent Medical Decision Consensus Matrix System | Accuracy | 86.5 | 43 | 1mo ago |
| MedQA | CascadeDebate | Accuracy | 86.44 | 40 | 3d ago |
| BioASQ | ReFilter | Accuracy | 80.74 | 38 | 11d ago |
| MedicalQA | Symphony-Coord | Accuracy | 86 | 33 | 1mo ago |
| MedXpertQA | MA-RAG-ext | Accuracy | 22.2 | 31 | 18d ago |
| HeadQA | Zero-Shot CoT | Accuracy | 92.2 | 30 | 16d ago |
| Medec | BFRS | Accuracy | 69.2 | 30 | 16d ago |
| MedCalc-Bench | Zero-Shot CoT | Accuracy | 35.3 | 30 | 16d ago |
| MedQA | HCQR | Decision-Useful Rate | 89.8 | 30 | 29d ago |
| Polish Board Certification Examinations | Meta-Llama-3.1-405B-Instruct-FP8 | Average Score | 69.2 | 30 | 1mo ago |
| MMLU-P | GPT-4.1-mini | Accuracy | 97.1 | 29 | 1mo ago |
| PubMedQA | SPPFT | Factual Accuracy (FA) | 95.63 | 28 | 3d ago |
| CV-MedExQA (test) | AU-probe | AUROC | 0.9987 | 28 | 1mo ago |
| CV-MedMCQA (test) | AU-probe | AUROC | 0.9999 | 28 | 1mo ago |
| CV-MedQA (test) | AU-probe | AUROC | 0.9998 | 28 | 1mo ago |
| Medical QA Evaluation Suite (MedQA, MedMCQA, MMLU-Med, PubMedQA, BioASQ, SEER, DDXPlus, MIMIC-IV) | SPO Planning | MedQA Score | 77.45 | 27 | 1mo ago |
| MedConceptsQA | GPT-4 | Accuracy | 94.27 | 26 | 1mo ago |
Showing 25 of 150 rows