Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on AIME 2024 (avg@8)
Loading...
90.4
avg@8 Score
Nanbeige4-3B-Thinking
75.424
79.312
83.2
87.088
Dec 6, 2025
avg@8 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
avg@8 Score
Nanbeige4-3B-Thinking
Release Identifier=2511
2025.12
90.4
Qwen3-30A3-2507
Release Identifier=2507
2025.12
89.2
Qwen3-4B-2507
Release Identifier=2507
2025.12
83.3
Qwen3-32B-2504
Release Identifier=2504
2025.12
81.4
Qwen3-14B-2504
Release Identifier=2504
2025.12
79.3
Qwen3-8B-2504
Release Identifier=2504
2025.12
76
Feedback
Search any
task
Search any
task