Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematics on AIME 25 (Avg@32)
Loading...
80.2
Avg@32
GLM 4.6
55.552
61.951
68.35
74.749
Dec 30, 2025
Avg@32
Updated 4d ago
Evaluation Results
Method
Method
Links
Avg@32
GLM 4.6
Evaluation Mode=Chat
2025.12
80.2
LongCat-Flash Exp-Chat
Evaluation Mode=Chat
2025.12
74.9
LongCat-Flash Chat
Evaluation Mode=Chat
2025.12
61.3
DeepSeek V3.2
Evaluation Mode=Chat
2025.12
56.5
Feedback
Search any
task
Search any
task