Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Medical Instruction Following on MulDimIF
Loading...
78.7
Pass@1
MedXIAOHE
72.98
74.465
75.95
77.435
Feb 13, 2026
Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
MedXIAOHE
Mode=Thinking mode, De...
2026.02
78.7
GPT-5.2 Thinking
Mode=Thinking mode, De...
2026.02
78.6
Gemini 3.0 Pro
Decoding=Greedy
2026.02
77.8
Gemini 2.5 Pro
Decoding=Greedy
2026.02
73.2
Feedback
Search any
task
Search any
task