Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Medical Knowledge Evaluation on MMLU Med
Loading...
88.1
Accuracy
Gemini 3 Pro
68.964
73.932
78.9
83.868
Dec 25, 2025
Jan 11, 2026
Jan 28, 2026
Feb 14, 2026
Mar 3, 2026
Mar 20, 2026
Apr 6, 2026
Accuracy
Updated 11d ago
Evaluation Results
Method
Method
Links
Accuracy
Gemini 3 Pro
Temperature=default
2026.04
88.1
Gemini 3 Flash
Temperature=default
2026.04
87.5
MedGemma 1
Model Scale=27B, Tempe...
2026.04
86.2
DOS-CPT
Backbone=Qwen3-14B-Bas...
2025.12
82.11
LPS-CPT
Backbone=Qwen3-14B-Bas...
2025.12
81.85
HPS-CPT
Backbone=Qwen3-14B-Bas...
2025.12
81.68
RS-CPT
Backbone=Qwen3-14B-Bas...
2025.12
81.62
Qwen3-14B-Base
Backbone=Qwen3-14B-Bas...
2025.12
81.19
Qwen3 VL
Model Scale=4B, Temper...
2026.04
78.3
MedGemma 1
Model Scale=4B, Temper...
2026.04
70
MedGemma 1.5
Model Scale=4B, Temper...
2026.04
69.7
Feedback
Search any
task
Search any
task