Medical Reasoning on Medical-O1-Reasoning-SFT (test)

0.5127 Wins

LLM-AutoDP

[Chart omitted; date shown: Jan 28, 2026]
Updated 4d ago

Evaluation Results

Date      Wins
2026.01   0.5127   0.0963   0.391
2026.01   0.5066   0.1003   0.3931
2026.01   0.4842   0.1178   0.398
2026.01   0.4793   0.0811   0.4396
2026.01   0.4769   0.0873   0.4358
2026.01   0.4688   0.1017   0.4295
2026.01   0.4688   0.0889   0.4423
2026.01   0.4421   0.0889   0.469
2026.01   0.433    0.0108   0.5562
2026.01   0        1        0
2026.01   0        1        0
2026.01   0        1        0