Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Math Reasoning on AIME 2024 (Acc, ∆Tok)

90Accuracy

Qwen3-Next-80B

-0.16823.24146.6570.059Oct 1, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.10
90-
2025.10
86.7-24.3
2025.10
77.9-7.4
2025.10
77.1-
2025.10
77.1-6.2
2025.10
75.8-
2025.10
73.3-6.4
2025.10
71.7-
2025.10
70-19.8
2025.10
68.8-
2025.10
67.5-
2025.10
67.2-33.5
2025.10
66.7-30.5
2025.10
66.7-12
2025.10
64.6-12.7
2025.10
63.3-34.3
2025.10
63.3-28.8
2025.10
63.3-9.4
2025.10
56.7-16.6
2025.10
50-28.2
2025.10
49.5-
2025.10
45.2-25
2025.10
40-32.4
2025.10
39.6-63.1
2025.10
39.2-61.5
2025.10
38.8-68
2025.10
36.7-100
2025.10
35-72.6
2025.10
33.4-31.8
2025.10
29.2-64.5
2025.10
25.4-71.4
2025.10
23.3-100
2025.10
23.3-100
2025.10
23.3-100
2025.10
16.7-100
2025.10
3.3-100