Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Medical Instruction Following on MedMTbench
Loading...
0.6375
Pass@1
MedXIAOHE
0.4425
0.493125
0.54375
0.594375
Feb 13, 2026
Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
MedXIAOHE
Mode=Thinking mode, De...
2026.02
0.6375
GPT-5.2 Thinking
Mode=Thinking mode, De...
2026.02
0.513
Gemini 3.0 Pro
Decoding=Greedy
2026.02
0.498
Gemini 2.5 Pro
Decoding=Greedy
2026.02
0.45
Feedback
Search any
task
Search any
task