WebWalkerQA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Web-based Question Answering | WebWalkerQA | Success Rate: 81.18 | 18 |
| Web Browsing and Navigation | WebWalkerQA | Average Accuracy: 71.7 | 18 |
| Deep Research | WebWalkerQA original (test) | Pass@1: 72.2 | 14 |
| Web-based Agent QA | WebWalkerQA | Pass@1: 73.53 | 13 |
| Web-based Agent Reasoning | WebWalkerQA Hard | Pass@3: 0.6333 | 8 |
| Web-based Agent Reasoning | WebWalkerQA Medium | Pass@3: 72.86 | 8 |
| Web-based Agent Reasoning | WebWalkerQA Easy | Pass@3: 72.5 | 8 |