| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Goal Recognition | MiniGrid Suboptimal trajectories | F1 Score100 | 36 | |
| Goal Recognition | MiniGrid (test) | F1 Score1 | 36 | |
| Reinforcement Learning | MiniGrid v0 (test) | GoToDoor-8x8 Success Rate0.944 | 9 | |
| Navigation and Procedural Generation RL | MiniGrid | GoToDoor-8x894.4 | 9 | |
| Navigation | MiniGrid Four Rooms | Average Episodic Reward0.672 | 7 | |
| Four Rooms | MiniGrid | Average Pass Rate88.7 | 7 | |
| Partially observable navigation | Minigrid PerfectMaze (M) | Solved Rate82 | 6 | |
| Partially observable navigation | Minigrid LargeCorridor | Solved Rate95 | 6 | |
| Partially observable navigation | Minigrid SmallCorridor | Solved Rate97 | 6 | |
| Partially observable navigation | Minigrid SimpleCrossing | Solved Rate88 | 6 | |
| Partially observable navigation | Minigrid Maze3 | Solved Rate96 | 6 | |
| Partially observable navigation | Minigrid Maze2 | Solved Rate97 | 6 | |
| Partially observable navigation | Minigrid Maze | Solved Rate82 | 6 | |
| Partially observable navigation | Minigrid Labyrinth2 | Solved Rate97 | 6 | |
| Partially observable navigation | Minigrid Labyrinth | Solved Rate100 | 6 | |
| Partially observable navigation | Minigrid 16Rooms2 | Solved Rate100 | 6 | |
| Partially observable navigation | Minigrid 16Rooms | Solved Rate97 | 6 | |
| Solve Rate | Minigrid | SixteenRooms Solve Rate100 | 5 | |
| UnlockPickUp | MiniGrid | Mean Return0.68 | 5 | |
| Unlock | MiniGrid | Mean Return0.75 | 5 | |
| MemoryS16 | MiniGrid | Mean Return0.51 | 5 | |
| MemoryS8 | MiniGrid | Mean Return0.52 | 5 | |
| FourRooms | MiniGrid | Mean Return0.16 | 5 | |
| EmptyRandom-16x16 | MiniGrid | Mean Return0.59 | 5 | |
| EmptyRandom-8x8 | MiniGrid | Mean Return0.77 | 5 |