MedMCQA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Medical Question Answering | MedMCQA | Accuracy 90.4 | 521 |
| Medical Question Answering | MedMCQA (test) | Accuracy 84.13 | 134 |
| Question Answering | MedMCQA | Accuracy 64.67 | 98 |
| Medical Reasoning | MedMCQA | Accuracy 86 | 58 |
| Medical Question Answering | MedMCQA | BLEU Score 10.82 | 54 |
| Question Answering | MedMCQA | AUC 75.95 | 51 |
| Question Answering | MedMCQA (test) | Test Error Rate 0.163 | 48 |
| Hallucination Detection | MedMCQA | AUC 75.57 | 42 |
| Multiple-choice Question Answering | MedMCQA | Accuracy 88.9 | 42 |
| Data Selection | MedMCQA (fresh candidate pool) | Accuracy 57.4 | 34 |
| Medical | MedMCQA | Accuracy (ACC) 58.2 | 33 |
| Multi-Turn Medical Dialogue | MedMCQA | Accuracy 63.31 | 32 |
| Medical Question Answering | MedMCQA | Pass@1 Accuracy 53.6 | 28 |
| Medical Knowledge Editing | MedMCQA edit | Efficacy 51 | 18 |
| Question Answering | MedMCQA | R@1 72.33 | 15 |
| Machine Unlearning | MedMCQA QF=1000 | Forget Accuracy 90 | 14 |
| LLM Routing | MEDMCQA (val) | Top-1 Acc 96.3 | 14 |
| LLM Routing | MedMCQA | Top-1 Acc 96.3 | 14 |
| Clinical Question Answering | MedMCQA | Accuracy 86.1 | 14 |
| Medical Question Answering | MedMCQA | Tau Correlation 4.3 | 13 |
| Medical information extraction and understanding | MedMCQA | Perplexity (PPL) 3.28 | 12 |
| Medical Reasoning | MedMCQA | Token Cost (tokens/question) 1,047 | 11 |
| Question Answering | MedMCQA (dev) | Accuracy 0.791 | 11 |
| Biomedical Question Answering | MedMCQA In-Domain (test) | Accuracy 90 | 10 |
| Question Answering | MedMCQA 1,000-example evaluation slice | Accuracy 39.8 | 9 |
Showing 25 of 47 rows