Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mathematical Reasoning on AIME24 (Accuracy)

90.31Accuracy

Qwen3-30B-A3B-Thinking-2507

0.547623.851347.15570.4587Dec 15, 2025Jan 8, 2026Feb 2, 2026Feb 27, 2026Mar 23, 2026Apr 17, 2026May 12, 2026
Updated 21d ago

Evaluation Results

MethodLinks
90.31
90
2026.05
16.25
2026.05
15.83
2026.05
14.58
2026.05
13.75
2026.05
13.75
2026.05
13.33
2026.05
12.91
2026.05
12.5
2026.05
6.67
2026.05
4