Mathematical Reasoning on AIME 25 (Avg@8 accuracy)
[Chart: AIME 25 Avg@8 Accuracy. Plotted point: MAS-Orchestra, 61.25 (Jan 21, 2026).]
Updated 4d ago
Evaluation Results
| Method | Orchestration Type | Date | AIME 25 Avg@8 Accuracy |
|---|---|---|---|
| MAS-Orchestra | Tra... | 2026.01 | 61.25 |
| DebateAgent | Sta... | 2026.01 | 57.5 |
| AFlow | Inf... | 2026.01 | 53.33 |
| SCAgent | Sta... | 2026.01 | 51.67 |
| ReflexionAgent | Sta... | 2026.01 | 50.42 |
| CoTAgent | Sta... | 2026.01 | 45 |
| MAS-GPT | Pub... | 2026.01 | 43.33 |
| MaAS | Inf... | 2026.01 | 40.83 |
| ToolOrchestra | Pub... | 2026.01 | 11.25 |
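
The page does not define Avg@8. A common reading, assumed here and not confirmed by the source, is that each problem is attempted 8 times and the reported score is the mean accuracy over all attempts, in percent. A minimal sketch under that assumption follows; the function name `avg_at_k` and the toy data are hypothetical, not taken from any of the listed methods.

```python
from typing import Sequence


def avg_at_k(correct: Sequence[Sequence[bool]]) -> float:
    """Mean accuracy in percent over all sampled attempts.

    correct[i][j] is True when the j-th sampled answer to problem i is right.
    Assumption: every problem has the same number of samples k (k = 8 for Avg@8).
    """
    if not correct:
        raise ValueError("no problems given")
    k = len(correct[0])
    if k == 0 or any(len(row) != k for row in correct):
        raise ValueError("every problem needs the same non-zero number of samples")
    # Per-problem accuracy first, then the average across problems.
    per_problem = [sum(row) / k for row in correct]
    return 100.0 * sum(per_problem) / len(per_problem)


# Hypothetical toy data: 2 problems, 8 samples each.
toy = [[True] * 8, [True] * 4 + [False] * 4]
print(avg_at_k(toy))  # 75.0
```

Averaging per problem and then across problems gives the same result as pooling all attempts when every problem has the same k, which is why the sketch rejects ragged input.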