Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Deep Research on POP (test)
Loading...
87.9
Mean Correct Rate
WebSailor
37.772
50.786
63.8
76.814
Oct 17, 2025
Mean Correct Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Mean Correct Rate
WebSailor
Type=Live web
2025.10
87.9
Our best agent
Type=Live web
2025.10
86.3
Our base agent
Type=Live web
2025.10
83.6
ASearcher
Type=Live web
2025.10
81.9
DeepResearcher
Type=Live web
2025.10
81.05
Search-R1
Type=Static & simulate...
2025.10
77.2
R1-Searcher
Type=Static & simulate...
2025.10
65.1
ZeroSearch
Type=Static & simulate...
2025.10
39.7
Feedback
Search any
task
Search any
task