Medical Reasoning on HealthBench

HealthBench Score: 66.2

Baichuan-M3

[Chart: HealthBench score; axis ticks 59.856, 61.503, 63.15, 64.797; date label Feb 6, 2026]
Updated 4d ago

Evaluation Results

Method    Links
2026.02
66.2
2026.02
65.1
63.3
2026.02
60.1