Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Mathematical Reasoning on BeyondAIME (Mean@10)
Loading...
71.8
Mean@10
FlashMLA
69.408
70.029
70.65
71.271
Feb 11, 2026
Mean@10
Updated 4d ago
Evaluation Results
Method
Method
Links
Mean@10
FlashMLA
Backbone=DeepSeek-V3.1...
2026.02
71.8
SnapMLA
Backbone=LongCat-Flash...
2026.02
70.2
SnapMLA
Backbone=DeepSeek-V3.1...
2026.02
69.9
FlashMLA
Backbone=LongCat-Flash...
2026.02
69.5
Feedback
Search any
task
Search any
task