| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| FrozenLake v1.0 (Drift I) | Vanilla + GLOVE | Success Rate7,500 | 18 | 3mo ago | |
| 2D Grid Navigation S 256/128/0.8 (OOD-sparse) | MapEM-os | Accuracy100 | 12 | 22d ago | |
| 2D Grid Navigation D 64/32/0.2 (OOD-dense) | MapWM-r2 | Accuracy100 | 12 | 22d ago | |
| 2D Grid Navigation Sequence length 128, grid width 64 (IID) | MapWM-r2 | Accuracy100 | 12 | 22d ago | |
| 1D Grid Navigation OOD-sparse S 256/128/0.8 | MapWM-r1 | Accuracy100 | 12 | 22d ago | |
| 1D Grid Navigation 64/32/0.2 (OOD-dense D) | MapWM-r1 | Accuracy100 | 12 | 22d ago | |
| 1D Grid Navigation Sequence length 128, grid width 64 (IID) | MapWM-r1 | Accuracy100 | 12 | 22d ago | |
| FrozenLake Drift II v1.0 | Vanilla + GLOVE | Success Rate75 | 9 | 3mo ago | |
| FrozenLake Source v1.0 | Generative Agent + GLOVE | Success Rate100 | 9 | 3mo ago | |
| 5D Navigation (OOD-s) | MapEM-s | Accuracy87 | 5 | 22d ago | |
| 5D Navigation (OOD-d) | MapEM-s | Accuracy100 | 5 | 22d ago | |
| 5D Navigation (IID) | MapEM-os | Accuracy100 | 5 | 22d ago | |
| 3D Navigation (OOD-s) | MapEM-os | Accuracy99 | 5 | 22d ago | |
| 3D Navigation (OOD-d) | MapEM-s | Accuracy100 | 5 | 22d ago | |
| 3D Navigation (IID) | MapWM | Accuracy100 | 5 | 22d ago |