Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on AIME 25 (Mean@32)
Loading...
89.58
Mean@32
FlashMLA
85.2536
86.3768
87.5
88.6232
Feb 11, 2026
Mean@32
Updated 4d ago
Evaluation Results
Method
Method
Links
Mean@32
FlashMLA
Backbone=LongCat-Flash...
2026.02
89.58
SnapMLA
Backbone=LongCat-Flash...
2026.02
88.44
FlashMLA
Backbone=DeepSeek-V3.1...
2026.02
87.92
SnapMLA
Backbone=DeepSeek-V3.1...
2026.02
85.42
Feedback
Search any
task
Search any
task