
MiniWoB

Benchmarks

Task Name                  Dataset Name                     SOTA Result             Trend
Web navigation             MiniWob++                        Accuracy 53.26          15
Agent                      MiniWob++ (held-in)              Performance (%) 87.12   14
Web automation             MiniWob 45 tasks subset (test)   Mean Success Rate 86.1  6
Web-based task completion  MiniWoB++ With feedback 9 tasks  Success Rate 91.11      5
Web automation             MiniWob 35 tasks subset (test)   Mean Success Rate 67    4
enter-text navigation      MiniWoB (test)                   Success Rate 100        3