Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Medical Instruction Following on MultiChallenge (Pass@1)
Loading...
66.8
Pass@1
Gemini 3.0 Pro
56.192
58.946
61.7
64.454
Feb 13, 2026
Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
Gemini 3.0 Pro
Decoding=Greedy
2026.02
66.8
MedXIAOHE
Mode=Thinking mode, De...
2026.02
61.9
GPT-5.2 Thinking
Mode=Thinking mode, De...
2026.02
60
Gemini 2.5 Pro
Decoding=Greedy
2026.02
56.6
Feedback
Search any
task
Search any
task