Math Reasoning on AIME 2025 (Acc, ∆Tok)

[Chart: Accuracy over time. Highlighted entry: Qwen3-Next-80B, Accuracy 80, Oct 1, 2025]

Evaluation Results

Date     Acc   ∆Tok
2025.10  80    -
2025.10  76.2  -
2025.10  73.3  -16.7
2025.10  69.2  -
2025.10  68.3  -
2025.10  68.3  -4.3
2025.10  66.7  -35.6
2025.10  65.8  -6
2025.10  63.7  -
2025.10  62.8  -33.4
2025.10  60.4  -13.5
2025.10  60    -37.9
2025.10  59.6  -
2025.10  56.7  -16.6
2025.10  56.7  -10.6
2025.10  53.3  -28.9
2025.10  53.3  -8.6
2025.10  53.3  -5.2
2025.10  50    -27.1
2025.10  37.6  -
2025.10  33.4  -26.4
2025.10  33.3  -9.8
2025.10  33.2  -20.3
2025.10  30.4  -76
2025.10  30    -100
2025.10  29.2  -66.6
2025.10  26.7  -66.4
2025.10  26.7  -36.5
2025.10  26.2  -72
2025.10  25    -67.8
2025.10  23.3  -100
2025.10  23.3  -100
2025.10  22.9  -73.5
2025.10  20    -100
2025.10  20    -100
2025.10  3.3   -100