Mathematics on AIME 2025 (Score, TPF)
[Chart: Score and TPF. Highlighted: LLaDA2.1-flash, Score 63.33, Feb 9, 2026]
Updated 4d ago
Evaluation Results
| Method | Configuration | Date | Score | TPF |
|---|---|---|---|---|
| LLaDA2.1-flash | Inference Mode=S Mode | 2026.02 | 63.33 | 5.36 |
| LLaDA2.1-flash | Inference Mode=Q Mode | 2026.02 | 63.33 | 3.46 |
| Qwen3-30B-A3B-Inst-2507 | | 2026.02 | 61.88 | 1 |
| LLaDA2.0-flash | | 2026.02 | 60 | 4.57 |
| Ling-flash-2.0 | | 2026.02 | 55.89 | 1 |
| Ling-mini-2.0 | | 2026.02 | 47.66 | - |
| LLaDA2.1-mini | mode=Q Mode | 2026.02 | 43.33 | 3.29 |
| LLaDA2.0-mini | | 2026.02 | 36.67 | 2.41 |
| LLaDA2.1-mini | mode=S Mode | 2026.02 | 36.67 | 6.34 |
| Qwen3-8B | no think=true | 2026.02 | 22.08 | - |