Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on MATH 500 (Avg@2)
Loading...
82.1
Avg@2
TTRL
48.82
57.46
66.1
74.74
Jan 21, 2026
Avg@2
Updated 4d ago
Evaluation Results
Method
Method
Links
Avg@2
TTRL
Base=Base, Verifier=Rule
2026.01
82.1
Oat-Zero
Base=Math, Verifier=Rule
2026.01
80.8
RLPR
Base=Base, Verifier=None
2026.01
78
SimpleRL-Zoo
Base=Math, Verifier=Rule
2026.01
77.1
General Reasoner
Base=Base, Verifier=Model
2026.01
77
DARL
Base=Base, Verifier=None
2026.01
76.6
RLVR
Base=Base, Verifier=Rule
2026.01
76.5
PRIME
Base=Math, Verifier=Rule
2026.01
76.4
SimpleRL-Zoo
Base=Base, Verifier=Rule
2026.01
76.3
Qwen2.5-7B-Inst
Base=-, Verifier=None
2026.01
75.4
VeriFree
Base=Base, Verifier=None
2026.01
73.5
Qwen2.5-7B
Base=-, Verifier=None
2026.01
63
RLPR
Base=Inst, Verifier=None
2026.01
54.1
DARL
Base=Inst, Verifier=None
2026.01
52.8
RLVR
Base=Inst, Verifier=Rule
2026.01
51.9
Llama3.1-8B-Inst
Base=-, Verifier=None
2026.01
50.1
Feedback
Search any
task
Search any
task