Mathematical Reasoning on MATH (test) (Reward, Acc (%))
[Chart: Reward and Accuracy by date. Labeled point: SEA, Reward 10.83, May 26, 2025.]
Updated 1mo ago
Evaluation Results
| Method | Details | Date | Reward | Accuracy (%) |
|--------|---------|------|--------|--------------|
| SEA | Base Model=LLaMA-3.2-1... | 2025.05 | 10.83 | 32 |
| BoN-64 | Base Model=LLaMA-3.2-1... | 2025.05 | 7.41 | 16 |
| BoN-32 | Base Model=LLaMA-3.2-1... | 2025.05 | 6.49 | 15.5 |
| SFT | Base Model=LLaMA-3.2-1... | 2025.05 | 6.19 | 27.5 |
| BoN-8 | Base Model=LLaMA-3.2-1... | 2025.05 | 4.84 | 19.5 |
| RS | Base Model=LLaMA-3.2-1... | 2025.05 | 1.42 | 13 |
| CBS | Base Model=LLaMA-3.2-1... | 2025.05 | -2.02 | 0.5 |
| ARGS | Base Model=LLaMA-3.2-1... | 2025.05 | -2.33 | 7 |