Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WebWalker

Benchmarks

Task NameDataset NameSOTA ResultTrend
Knowledge-Intensive ReasoningWebWalker
F1 Score30.5
18
Web Navigation Question AnsweringWebWalker QA
Accuracy72.2
13
SearchWebWalker
Score59.5
7
Web SearchWebWalker
Pass@161.7
6
Showing 4 of 4 rows