Medical Reasoning on HLE-Med
[Chart: Pass@1 over time. Top result: Gemini 3.0 Pro, 36.91 Pass@1, Feb 13, 2026.]
Updated 4d ago
Evaluation Results

Method            | Settings                   | Date    | Pass@1
Gemini 3.0 Pro    | Decoding=Greedy            | 2026.02 | 36.91
MedXIAOHE         | Mode=Thinking mode, De...  | 2026.02 | 25.77
GPT-5.2 Thinking  | Mode=Thinking mode, De...  | 2026.02 | 24.7
Gemini 2.5 Pro    | Decoding=Greedy            | 2026.02 | 17.85