Mathematical Reasoning on MATH-500 (In-distribution)
[Chart: top Acc 77 (RLTR), Feb 9, 2026. Metrics shown: Acc, Maj@4, Maj@16, Maj@64]
Evaluation Results
Method      Notes                 Date      Acc    Maj@4   Maj@16   Maj@64
RLTR        Samples for Acc.=64   2026.02   77     79      83.8     84.2
RLVR        Samples for Acc.=64   2026.02   76.2   78.2    80.2     82.2
Base model  Samples for Acc.=64   2026.02   71     78      81.2     82.6
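The page does not define its metrics. As a rough sketch, assuming Acc is the fraction of sampled answers that match the reference (averaged over the 64 samples noted above) and Maj@k marks a problem correct when the most frequent answer among k samples matches the reference, the two could be computed per problem as below. The function names, the tie-breaking rule, and the toy answers are hypothetical and not taken from the benchmark.

```python
from collections import Counter


def mean_accuracy(samples, reference):
    """Fraction of sampled answers equal to the reference (assumed meaning of Acc)."""
    return sum(s == reference for s in samples) / len(samples)


def majority_vote_correct(samples, reference, k):
    """True if the most common answer among the first k samples equals the reference.

    Assumption: ties are broken in favour of the answer seen first,
    which is how Counter.most_common orders equal counts.
    """
    votes = Counter(samples[:k])
    top_answer, _ = votes.most_common(1)[0]
    return top_answer == reference


# Toy example with made-up answers for a single problem.
samples = ["42", "41", "42", "42", "7", "42", "41", "42"]
acc = mean_accuracy(samples, "42")
maj4 = majority_vote_correct(samples, "42", 4)
```

Benchmark-level scores would then be the average of these per-problem values over all problems in the test set, under the same assumptions.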