Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Math Reasoning on Math Reasoning 1.5B model (val)

69.4Validation Accuracy

Execution-Guided Search

47.14452.92258.764.478Jan 20, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
69.4
2026.01
68.8
2026.01
48