| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Browser-use | WebVoyager | Success Rate | 90.6 | 14 |
| Web Navigation | WebVoyager (test) | Success Rate | 87 | 14 |
| GUI Navigation | WebVoyager | Success Rate (Allrecipes) | 88.89 | 12 |
| Web Navigation | WebVoyager 1.0 (test) | Allrecipes | 62.5 | 12 |
| Web Navigation | WebVoyager | Success Rate | 88.8 | 9 |
| Web Navigation | WebVoyager | Spearman Correlation | 0.15 | 8 |
| Web Navigation and Task Completion | WebVoyager Live Websites | Success Rate (All Rec) | 38.7 | 7 |
| Web Browsing Task | WebVoyager Easy | Kimi K2 Success Rate | 84.3 | 5 |
| Browser Use | WebVoyager (test) | Success Rate | 87.2 | 4 |
| Autonomous Web Navigation | WebVoyager | Accuracy | 87 | 3 |
| Agent Trajectory Verification Agreement | WebVoyager | Unterminated Rate (%) | 3.4 | 2 |