Mathematical Reasoning on Olympiad Bench (Accuracy, Absolute Improvement, Relative Gain)
[Chart: Accuracy over time. Top entry: TTSV, 29.8 (Dec 4, 2025). Selectable metrics: Accuracy, Absolute Improvement, Relative Gain.]
Updated 1mo ago
Evaluation Results
| Method | Model | Date | Accuracy (%) | Absolute Improvement (points) | Relative Gain (%) |
|---|---|---|---|---|---|
| TTSV | Qwen2.5-Math-7B | 2025.12 | 29.8 | 13.1 | 78.44 |
| Qwen2.5-Math-7B | Qwen2.5-Math-7B | 2025.12 | 16.7 | - | - |
| TTSV | LLaMA-3.1-8B | 2025.12 | 14.2 | 2.6 | 22.41 |
| LLaMA-3.1-8B | LLaMA-3.1-8B | 2025.12 | 11.6 | - | - |
| EM | Qwen2.5-Math-7B | 2025.12 | - | 16 | - |
| EM | LLaMA-3.1-8B | 2025.12 | - | -1.6 | - |
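
The page does not define how the Absolute Improvement and Relative Gain columns are derived. A minimal sketch, assuming both are measured against the base model's accuracy (an assumption that matches the TTSV figures listed above); the function name `improvement` and the `rows` layout are hypothetical, introduced only for illustration:

```python
def improvement(baseline_acc, method_acc):
    """Return (absolute improvement in points, relative gain in percent).

    Assumed definitions, not stated on the page:
      absolute = method accuracy - baseline accuracy
      relative = absolute / baseline accuracy * 100
    """
    absolute = method_acc - baseline_acc
    relative = absolute / baseline_acc * 100
    return round(absolute, 1), round(relative, 2)


# (baseline accuracy, TTSV accuracy) per base model, taken from the listing.
rows = {
    "Qwen2.5-Math-7B": (16.7, 29.8),
    "LLaMA-3.1-8B": (11.6, 14.2),
}

results = {model: improvement(base, ttsv) for model, (base, ttsv) in rows.items()}

for model, (absolute, relative) in results.items():
    print(f"{model}: +{absolute} points, {relative}% relative gain")
```

Under these assumed definitions the computed values agree with the TTSV entries in the listing. The EM entries report only an absolute improvement, so their relative gain cannot be checked the same way.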