Mathematical Reasoning on LMB Hard
Top result: 46.2 Accuracy, Qwen2.5-32B-Instruct + Bootcamp-SFT-RL (Aug 12, 2025)
Updated 13d ago
Evaluation Results
Method                                    Model                       Date      Accuracy
Qwen2.5-32B-Instruct + Bootcamp-SFT-RL    Model=Qwen2.5-32B-Inst...   2025.08   46.2
DS-R1-Distilled-Qwen-32B + Bootcamp-RL    Model=DS-R1-Distilled-...   2025.08   43.7
DS-R1-Distilled-Qwen-32B                  Model=DS-R1-Distilled-...   2025.08   36.8
Qwen2.5-32B-Instruct + Bootcamp-SFT       Model=Qwen2.5-32B-Inst...   2025.08   33.2
Qwen2.5-32B-Instruct                      Model=Qwen2.5-32B-Inst...   2025.08   22
Qwen2.5-32B-Instruct + Bootcamp-RL        Model=Qwen2.5-32B-Inst...   2025.08   21.8