Mathematical Reasoning on AIME 2025 (Pass@k and Voting Metrics)

95.4 Pass@1

Qwen3-235B-A22B-Thinking-2507

[Chart: score by date; y-axis ticks 36.848, 52.049, 67.25, 82.451; x-axis label Dec 11, 2025]
Updated 4d ago

Evaluation Results

Method    Links
2025.12   95.4   96.7   97.9   98.3   100
2025.12   92.3   93.3   95.1   96.7   96.7
2025.12   87.1   88.3   87.5   90.8   96.7
2025.12   70     78.3   76.2   80     83.3
2025.12   55.9   63.8   66.6   68     76.7
2025.12   44     50     56.1   57.8   70
2025.12   39.1   49.2   56.3   55.4   66.7
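
The page does not say how its Pass@k and voting scores are computed. As a rough illustration only, the sketch below uses the commonly cited unbiased pass@k estimator and a simple majority vote over sampled answers. The function names (`pass_at_k`, `majority_vote`, `benchmark_score`) and the 30-problem example are assumptions for illustration, not taken from this leaderboard.

```python
from collections import Counter
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem: n samples drawn, c of them correct."""
    if not 0 <= c <= n or not 1 <= k <= n:
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # comb(n - c, k) / comb(n, k) is the chance that a random size-k subset
    # of the n samples contains no correct answer.
    return 1.0 - comb(n - c, k) / comb(n, k)


def majority_vote(answers: list[str]) -> str:
    """Most frequent sampled answer; ties go to the answer seen first."""
    return Counter(answers).most_common(1)[0][0]


def benchmark_score(per_problem: list[float]) -> float:
    """Average per-problem results and express them as a percentage."""
    return 100.0 * sum(per_problem) / len(per_problem)


if __name__ == "__main__":
    # Hypothetical data: 4 samples for one problem, 1 of them correct.
    print(pass_at_k(n=4, c=1, k=1))
    print(majority_vote(["70", "70", "73", "70"]))
    # Hypothetical 30-problem set with 29 problems solved.
    print(round(benchmark_score([1.0] * 29 + [0.0]), 1))
```

Under these assumptions, a score such as 96.7 would correspond to 29 of 30 problems counted as solved, while a voting score first reduces each problem's samples to a single answer with `majority_vote` and then checks that answer against the reference.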