Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
STEM Reasoning on AIME 2024
Loading...
77.7
Score
Qwen3-8B-as-GenRM
73.54
74.62
75.7
76.78
Feb 6, 2026
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
Qwen3-8B-as-GenRM
Reward Model=Qwen3-8B-...
2026.02
77.7
Qwen3-8B
Reward Model=Baseline
2026.02
77.6
GenRM-R-Align-14B
Reward Model=GenRM-R-A...
2026.02
76.5
Qwen3-14B-as-GenRM
Reward Model=Qwen3-14B...
2026.02
76.1
GenRM-RLVR-14B
Reward Model=GenRM-RLV...
2026.02
75.5
GenRM-R-Align-8B
Reward Model=GenRM-R-A...
2026.02
75.4
GenRM-RLVR-8B
Reward Model=GenRM-RLV...
2026.02
73.7
Feedback
Search any
task
Search any
task