Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on AMC23
Loading...
65
Accuracy
TTSV
23.4
34.2
45
55.8
Dec 4, 2025
Accuracy
Absolute Improvement
Relative Gain
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Absolute Improvement
Relative Gain
TTSV
Model=Qwen2.5-Math-7B
2025.12
65
22.5
52.94
Qwen2.5-Math-7B
Model=Qwen2.5-Math-7B
2025.12
42.5
-
-
LLaMA-3.1-8B
Model=LLaMA-3.1-8B
2025.12
25
-
-
TTSV
Model=LLaMA-3.1-8B
2025.12
25
0
0
EM
Model=Qwen2.5-Math-7B
2025.12
-
17.5
-
EM
Model=LLaMA-3.1-8B
2025.12
-
0.3
-
Feedback
Search any
task
Search any
task