Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MedMCQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Medical Question AnsweringMedMCQA
Accuracy90.4
346
Medical Question AnsweringMedMCQA (test)
Accuracy84.13
134
Medical Question AnsweringMedMCQA
BLEU Score10.82
54
Question AnsweringMedMCQA (test)
Test Error Rate0.163
48
Multi-Turn Medical DialogueMedMCQA
Accuracy63.31
32
MedicalMedMCQA
Accuracy (ACC)58.2
21
Medical Knowledge EditingMedMCQA edit
Efficacy51
18
Machine UnlearningMedMCQA QF=1000
Forget Accuracy90
14
LLM RoutingMEDMCQA (val)
Top-1 Acc96.3
14
LLM RoutingMedMCQA
Top-1 Acc96.3
14
Clinical Question AnsweringMedMCQA
Accuracy86.1
14
Medical Question AnsweringMedMCQA
Tau Correlation4.3
13
Multiple-choice Question AnsweringMedMCQA
Accuracy40.97
12
Medical ReasoningMedMCQA
Token Cost (tokens/question)1,047
11
Medical ReasoningMedMCQA
Accuracy86
11
Question AnsweringMedMCQA (dev)
Accuracy0.791
11
Medical Question AnsweringMedMCQA
Pass@1 Accuracy53.6
10
Biomedical Question AnsweringMedMCQA In-Domain (test)
Accuracy90
10
Question AnsweringMedMCQA
FDR (%)6.43
9
Medical Question AnsweringMedMCQA translated (test)
Accuracy (ZH)43.2
9
Question AnsweringMedMCQA
Accuracy64.3
8
Medical ReasoningMedMCQA OOD (out-of-distribution)
Accuracy66.2
7
Out-of-Distribution DetectionMedMCQA Far-Domain
AUROC84.4
7
Missingness Bias ReductionMedMCQA
KL Divergence1.13
7
Question AnsweringMedMCQA (val)
Accuracy90
7
Showing 25 of 38 rows