Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Deep Research on GAIA (test)
Loading...
49.2
Mean Correct Rate
Our best agent
3.1176
15.0813
27.045
39.0087
Oct 17, 2025
Mean Correct Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Mean Correct Rate
Our best agent
Type=Live web
2025.10
49.2
WebSailor
Type=Live web
2025.10
34
Our base agent
Type=Live web
2025.10
26.2
DeepResearcher
Type=Live web
2025.10
20.63
Search-R1
Type=Static & simulate...
2025.10
18.69
ASearcher
Type=Live web
2025.10
16.91
ZeroSearch
Type=Static & simulate...
2025.10
8.37
R1-Searcher
Type=Static & simulate...
2025.10
4.89
Feedback
Search any
task
Search any
task