| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| WebShop Semantic Explicit structural drift (II) | GLOVE | Success Rate95 | 9 | 4d ago | |
| WebShop Semantic Explicit structural drift (Drift I) | GLOVE | Success Rate95 | 9 | 4d ago | |
| WebShop Explicit structural drift (Source) | GLOVE | Success Rate100 | 9 | 4d ago | |
| WebShop Standard (Source) | Generative Agent + GLOVE | Score62.5 | 9 | 4d ago | |
| WebShop Drift II v1.0 | Vanilla + GLOVE | Success Rate95 | 9 | 4d ago | |
| WebShop Drift I v1.0 | MemoryBank + GLOVE | Success Rate90 | 9 | 4d ago | |
| WebShop Source v1.0 | Vanilla + GLOVE | Success Rate100 | 9 | 4d ago |