Mathematical Reasoning on SVAMP (Acc, Cost)
Chart (Accuracy, Cost): CoT, 79.53 Accuracy, Mar 14, 2026. Updated 26d ago.
Evaluation Results
Method  Model     Date     Accuracy  Cost
CoT     Gemma3    2026.03  79.53     2,504
CoT     Llama3.1  2026.03  76.21     2,257
CoT     Qwen3-8B  2026.03  75.76     2,125
DST     Llama3.1  2026.03  1.9       902
DST     Gemma3    2026.03  1.8       1,111
ToT     Llama3.1  2026.03  1.69      4,294
ToT     Gemma3    2026.03  1.64      4,702
DST     Qwen3-8B  2026.03  1.48      2,365
ToT     Qwen3-8B  2026.03  1.35      3,903
DPTS    Llama3.1  2026.03  0.98      3,049
DPTS    Qwen3-8B  2026.03  0.92      2,855
DPTS    Gemma3    2026.03  0.91      3,597