Mathematical Reasoning on MATH (Accuracy @ t1, t2 Comparison)
[Chart: Accuracy @ t1 by date. Plotted point: Prompt based, 38.4 (May 22, 2025). Available metrics: Accuracy @ t1, Accuracy @ t2, Delta (t1, t2).]
Updated 4d ago
Evaluation Results
Method        Configuration              Date     Accuracy @ t1  Accuracy @ t2  Delta (t1, t2)
Prompt based  Backbone=Gemma-2-27B-i...  2025.05  38.4           43.8           5.4
Prompt based  Backbone=Gemma-2-27B-i...  2025.05  38.4           45             6.6
Prompt based  Backbone=Gemma-2-9B-it...  2025.05  34.6           40.4           5.8
Prompt based  Backbone=Gemma-2-9B-it...  2025.05  34.6           40             5.4
ReflectEvo    Backbone=Gemma-2-9B-it...  2025.05  34.6           40             5.4
ReflectEvo    Backbone=Gemma-2-9B-it...  2025.05  34.6           35             0.4
ReflectEvo    Backbone=Gemma-2-9B-it...  2025.05  34.6           40             5.4
ReflectEvo    Backbone=Gemma-2-9B-it...  2025.05  34.6           40             5.4
SFT based     Backbone=Gemma-2-9B-it...  2025.05  18.6           -              -
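
The page does not define Delta (t1, t2), but every listed value is consistent with Accuracy @ t2 minus Accuracy @ t1. The following is a minimal Python sketch under that assumption. The function name `delta` and the `rows` list are illustrative only and are not part of the benchmark. Rows with a missing Accuracy @ t2 (shown as "-") yield no delta.

```python
from typing import Optional


def delta(acc_t1: Optional[float], acc_t2: Optional[float]) -> Optional[float]:
    """Assumed definition: Accuracy @ t2 minus Accuracy @ t1, in percentage points.

    Returns None when either value is missing (shown as "-" in the table).
    Rounded to one decimal to match the precision used on the page.
    """
    if acc_t1 is None or acc_t2 is None:
        return None
    return round(acc_t2 - acc_t1, 1)


# (method, Accuracy @ t1, Accuracy @ t2) copied from a few rows of the results.
rows = [
    ("Prompt based", 38.4, 43.8),
    ("Prompt based", 38.4, 45.0),
    ("Prompt based", 34.6, 40.4),
    ("ReflectEvo", 34.6, 35.0),
    ("SFT based", 18.6, None),  # no Accuracy @ t2 reported
]

for method, t1, t2 in rows:
    d = delta(t1, t2)
    print(f"{method}: delta = {'-' if d is None else d}")
```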