Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on OpenR1-Math-220k (unseen)
Loading...
46
Accuracy
LMNet
16.88
24.44
32
39.56
May 19, 2025
Accuracy
Updated 20d ago
Evaluation Results
Method
Method
Links
Accuracy
LMNet
Model=Qwen2.5-1.5B, Tr...
2025.05
46
SFT
Model=Qwen2.5-1.5B, Tr...
2025.05
34.7
Prompt
Model=Qwen2.5-1.5B
2025.05
29
LMNet
Model=Qwen2.5-0.5B, Tr...
2025.05
29
SFT
Model=Qwen2.5-0.5B, Tr...
2025.05
23.2
Prompt
Model=Qwen2.5-0.5B
2025.05
18
Feedback
Search any
task
Search any
task