Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Clinical Question Answering on MedMCQA
Loading...
86.1
Accuracy
SAG
47.204
57.302
67.4
77.498
Feb 8, 2026
Accuracy
Gap
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Gap
SAG
Backbone=Qwen-4B each,...
2026.02
86.1
7
SAG
Backbone=Qwen-4B each,...
2026.02
85.8
5.4
SAG
Backbone=Llama-3B each...
2026.02
85.2
6.3
SAG
Backbone=Llama-3B each...
2026.02
84.7
10.2
SAG
Backbone=Qwen-4B each,...
2026.02
79.6
17.9
SAG
Backbone=Llama-3B each...
2026.02
77.8
19.7
Single giant LLM
Backbone=Qwen-72B, Opt...
2026.02
75.1
15.1
Single giant LLM
Backbone=Llama-70B, Op...
2026.02
74.2
30.7
Single giant LLM
Backbone=Qwen-72B, Opt...
2026.02
72.5
19.3
Me-LLaMA
Model Type=Clinical sp...
2026.02
71.1
26.9
Single giant LLM
Backbone=Llama-70B, Op...
2026.02
70.1
18.8
Single giant LLM
Backbone=Qwen-72B, Opt...
2026.02
63
30.9
Single giant LLM
Backbone=Llama-70B, Op...
2026.02
51.2
17.4
Meditron
Model Type=Clinical sp...
2026.02
48.7
28.3
Feedback
Search any
task
Search any
task