Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Code Generation on MBPP Pro
Loading...
85.56
Pass@1
MASFly
81.4936
82.5493
83.605
84.6607
Feb 14, 2026
Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
MASFly
2026.02
85.56
AFlow
2026.02
84.74
GPTSwarm
2026.02
84.2
AgentSquare
2026.02
83.92
MegaAgent
2026.02
83.11
ReAct
2026.02
82.97
Base
2026.02
82.8
MetaGPT
2026.02
82.2
AgentVerse
2026.02
81.65
Feedback
Search any
task
Search any
task