Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on AMC (accuracy)
Loading...
57.5
Accuracy (AMC)
Eurus-2-7B-PRIME-R-TAP
29.004
36.402
43.8
51.198
Mar 2, 2026
Accuracy (AMC)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy (AMC)
Eurus-2-7B-PRIME-R-TAP
training=R-TAP integrated
2026.03
57.5
Qwen2.5-Math-7B-Inst.
2026.03
50.6
Eurus-2-7B-PRIME
2026.03
50.6
RLOO
2026.03
47
GPT-4o
2026.03
45.8
Llama-3.1-70B-Inst.
2026.03
37.3
Eurus-2-7B-SFT
2026.03
30.1
Feedback
Search any
task
Search any
task