Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General AI Assistant on GAIA
Loading...
40.8
GAIA (WS)
TOOLSELF
17.608
23.629
29.65
35.671
Feb 8, 2026
GAIA (WS)
GAIA Score
Updated 4d ago
Evaluation Results
Method
Method
Links
GAIA (WS)
GAIA Score
TOOLSELF
Base Model=Qwen3-14B
2026.02
40.8
38.8
OWL
Base Model=Qwen3-14B
2026.02
36.9
29.2
ReSum
Base Model=Qwen3-14B
2026.02
35.9
-
WebSailor
Base Model=Qwen3-14B
2026.02
35
-
Vanilla Agent
Base Model=Qwen3-14B
2026.02
33
32.1
Co-Sight
Base Model=Qwen3-14B
2026.02
31.1
-
OAgents
Base Model=Qwen3-14B
2026.02
30.1
-
TOOLSELF
Base Model=Qwen3-8B
2026.02
30.1
27.9
Co-Sight
Base Model=Qwen3-8B
2026.02
28.1
-
ReSum
Base Model=Qwen3-8B
2026.02
27.2
-
OWL
Base Model=Qwen3-8B
2026.02
23.3
21.8
OAgents
Base Model=Qwen3-8B
2026.02
22.3
-
Vanilla Agent
Base Model=Qwen3-8B
2026.02
19.4
19.7
WebSailor
Base Model=Qwen3-8B
2026.02
18.5
-
Feedback
Search any
task
Search any
task