Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on LMB Hard

46.2Accuracy

Qwen2.5-32B-Instruct + Bootcamp-SFT-RL

20.82427.4123440.588Aug 12, 2025
Updated 13d ago

Evaluation Results

MethodLinks
2025.08
46.2
2025.08
43.7
2025.08
36.8
2025.08
33.2
2025.08
22
2025.08
21.8