Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General AI Assistant Tasks on GAIA Level 2
Loading...
42.9
Success Rate
EvoMAS-7
-1.716
9.867
21.45
33.033
May 9, 2026
Success Rate
Average Success Rate
Updated 22d ago
Evaluation Results
Method
Method
Links
Success Rate
Average Success Rate
EvoMAS-7
Method Category (Singl...
2026.05
42.9
65
EvoMAS-4
Method Category (Singl...
2026.05
14.2
45
GPT-4o-mini
Method Category (Singl...
2026.05
7.1
22.3
GPT-4o
Method Category (Singl...
2026.05
7.1
29.8
AFlow
Method Category (Singl...
2026.05
7.1
22.1
G-Designer
Method Category (Singl...
2026.05
7.1
34.8
MaAS
Method Category (Singl...
2026.05
7.1
34.8
GPTSwarm
Method Category (Singl...
2026.05
0
18.3
Feedback
Search any
task
Search any
task