Scientific Reasoning on Materials

79.2 Average Score @16

SDPO (on-policy)

Jan 28, 2026
Updated 4d ago

Evaluation Results

Method    Links
2026.01    79.2    -       -
2026.01    78.3    -       -
2026.01    78.1    -       -
2026.01    77.1    -       -
2026.01    75.8    -       -
2026.01    75.3    -       -
2026.01    74.4    -       -
2026.01    74.3    -       -
2026.01    73.8    -       -
2026.01    73.3    -       -
2026.01    72.1    -       -
2026.01    67.9    -       -
2026.01    58.9    -       -
2026.01    36.7    -       -
2026.01    -       58.9    -
2026.01    -       74.3    77.1
2026.01    -       73.9    74.1
2026.01    -       72.1    78.4
2026.01    -       36.7    -
2026.01    -       70.9    75
2026.01    -       73.3    73.5
2026.01    -       73.7    79.1