Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on SVAMP (Exact-match Accuracy)
Loading...
66.12
Exact Match Accuracy
Full-data Fine-tuning
10.2512
24.7556
39.26
53.7644
Oct 8, 2025
Exact Match Accuracy
Updated 19d ago
Evaluation Results
Method
Method
Links
Exact Match Accuracy
Full-data Fine-tuning
Backbone=LLAMA-2-7B, C...
2025.10
66.12
LESS
Backbone=LLAMA-2-7B, C...
2025.10
65.45
S2L
Backbone=LLAMA-2-7B, C...
2025.10
65.3
TRIM
Backbone=LLAMA-2-7B, C...
2025.10
65.12
TAGCOS
Backbone=LLAMA-2-7B, C...
2025.10
64.15
Random
Backbone=LLAMA-2-7B, C...
2025.10
63.3
Pretrained (no Fine-tuning)
Backbone=LLAMA-2-7B, C...
2025.10
12.4
Feedback
Search any
task
Search any
task