Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Deep Research on BAM (test)
Loading...
92.8
Mean Correct Rate
Our best agent
52.344
62.847
73.35
83.853
Oct 17, 2025
Mean Correct Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Mean Correct Rate
Our best agent
Type=Live web
2025.10
92.8
WebSailor
Type=Live web
2025.10
86.8
Our base agent
Type=Live web
2025.10
84.3
DeepResearcher
Type=Live web
2025.10
78.31
Search-R1
Type=Static & simulate...
2025.10
75.3
ASearcher
Type=Live web
2025.10
74.4
R1-Searcher
Type=Static & simulate...
2025.10
62.4
ZeroSearch
Type=Static & simulate...
2025.10
53.9
Feedback
Search any
task
Search any
task