| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| HTML observation reduction | WebLinx | Average Wall-Clock Time (s)0 | 11 | |
| Website Navigation | WebLINX IID 1.0 (test) | Overall Score37.4 | 11 | |
| Website Navigation | WebLINX OOD 1.0 (test) | IM84 | 11 | |
| Reranking | WebLINX CandidatesReranking (test) | MAP18.02 | 10 | |
| Single-step action prediction | WebLinx (test-iid) | Cumulative Runtime28.5 | 3 |