Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Medical Question Answering on HLE-Med (test)
Loading...
28.19
Accuracy
Tongyi-DeepResearch-30BA3B
10.9364
15.4157
19.895
24.3743
Jan 26, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Tongyi-DeepResearch-30BA3B
Model Group=DeepResear...
2026.01
28.19
DEEPMED-14B-RL
Model Group=DeepResear...
2026.01
26.84
MedResaon-8B
Model Group=Medical Re...
2026.01
22.4
Deepseek-v3.2-685B
Model Group=Awesome Ge...
2026.01
19.46
DEEPMED-14B-SFT
Model Group=DeepResear...
2026.01
19.46
Kimi-K2-Thinking-1TB
Model Group=Awesome Ge...
2026.01
18.79
M1-1K-32B
Model Group=Medical Re...
2026.01
16.78
BaiChuan-M2-32B
Model Group=Medical Re...
2026.01
16.78
Qwen3-30BA3B-Thinking
Model Group=Awesome Ge...
2026.01
14.77
M1-1K-7B
Model Group=Medical Re...
2026.01
14.77
Gemini2.5-Pro
Model Group=Awesome Ge...
2026.01
14.36
Qwen3-14B
Model Group=Awesome Ge...
2026.01
12.75
HuatuoGPT-o1-70B
Model Group=Medical Re...
2026.01
11.6
Feedback
Search any
task
Search any
task