| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| FrozenLake reward reversal Hidden drift | Vanilla + GLOVE | Score | 100 | 45 | 4d ago |
| FrozenLake (Source) | MemoryBank + GLOVE | Score | 67.5 | 36 | 4d ago |
| FrozenLake Drift II | Generative Agent + GLOVE | Success Rate | 8,500 | 18 | 4d ago |
| FrozenLake reward reversal Source | | Score | 90 | 18 | 4d ago |
| FrozenLake Implicit Hidden Drift | | Success Rate (Source) | 88.8 | 14 | 4d ago |
| FrozenLake Explicit Structural Drift II | | Success Rate (Source) | 83.7 | 14 | 4d ago |
| FrozenLake Drift I | Voyager + GLOVE | Success Rate | 85 | 9 | 4d ago |
| FrozenLake Hidden drift | Generative Agent + GLOVE | Score | 100 | 9 | 4d ago |
| FrozenLake reward reversal Implicit Drift Hidden drift | | Score | 100 | 9 | 4d ago |
| FrozenLake reward reversal Implicit Drift (Source) | Voyager + GLOVE | Score | 45 | 9 | 4d ago |
| Key Minigrid 17x17 zero-shot 1.0 | DEGen | Success Rate (SixteenRooms) | 100 | 3 | 4d ago |