Mathematical Reasoning on AIME 2025 (avg@8)

85.6 Avg@8

Nanbeige4-3B-Thinking

[Chart: Avg@8 by date; Dec 6, 2025]
Updated 4d ago

Evaluation Results

Date      Avg@8
2025.12   85.6
2025.12   85
2025.12   81.3
2025.12   72.9
2025.12   70.4
2025.12   67.3
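
The page does not define avg@8. A common reading is the per-problem accuracy over 8 independently sampled responses, averaged over all problems and reported as a percentage. The sketch below assumes that definition; the function name `avg_at_k` and the input layout are hypothetical and not taken from this leaderboard.

```python
from typing import Sequence


def avg_at_k(correct: Sequence[Sequence[bool]], k: int = 8) -> float:
    """Mean accuracy (in percent) over k samples per problem.

    `correct[i][j]` is True if the j-th sampled answer to problem i
    was graded correct. Assumed definition, not confirmed by the source.
    """
    if not correct:
        raise ValueError("need at least one problem")
    per_problem = []
    for samples in correct:
        if len(samples) != k:
            raise ValueError(f"expected {k} samples, got {len(samples)}")
        # Fraction of the k samples that were correct for this problem.
        per_problem.append(sum(samples) / k)
    # Average the per-problem fractions and scale to a percentage.
    return 100.0 * sum(per_problem) / len(per_problem)


if __name__ == "__main__":
    # Two toy problems with k=2 samples each.
    print(avg_at_k([[True, False], [True, True]], k=2))
```

Under this reading, a score of 85.6 would mean that roughly 85.6% of all sampled answers were correct, which is a different quantity from pass@8 (at least one of the 8 samples correct).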