| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Goal Recognition | MiniGrid Suboptimal trajectories | F1 Score100 | 36 | |
| Goal Recognition | MiniGrid (test) | F1 Score1 | 36 | |
| Reinforcement Learning | MiniGrid | Training Duration (hours)5.82 | 9 | |
| Navigation | MiniGrid held-out mazes | 16Rooms86 | 9 | |
| Reinforcement Learning | MiniGrid v0 (test) | GoToDoor-8x8 Success Rate0.944 | 9 | |
| Navigation and Procedural Generation RL | MiniGrid | GoToDoor-8x894.4 | 9 | |
| Agent Success Rate | MiniGrid full-view | MTE99 | 8 | |
| Environment Interaction | MiniGrid | Environment Steps (M)41,000,000 | 7 | |
| Navigation | MiniGrid Four Rooms | Average Episodic Reward0.672 | 7 | |
| Four Rooms | MiniGrid | Average Pass Rate88.7 | 7 | |
| Generalization to Unseen Objects | MiniGrid Case 1: Unseen Objects v1 (test) | Target Generalization Score9.6 | 6 | |
| Language Conditioned Transfer | MiniGrid Reverse Task Case 3 (target) | Target Success Count (Case 3)98 | 6 | |
| Combined Generalization | MiniGrid (Case 2) | TGT Score9.8 | 6 | |
| Partially observable navigation | Minigrid PerfectMaze (M) | Solved Rate82 | 6 | |
| Partially observable navigation | Minigrid LargeCorridor | Solved Rate95 | 6 | |
| Partially observable navigation | Minigrid SmallCorridor | Solved Rate97 | 6 | |
| Partially observable navigation | Minigrid SimpleCrossing | Solved Rate88 | 6 | |
| Partially observable navigation | Minigrid Maze3 | Solved Rate96 | 6 | |
| Partially observable navigation | Minigrid Maze2 | Solved Rate97 | 6 | |
| Partially observable navigation | Minigrid Maze | Solved Rate82 | 6 | |
| Partially observable navigation | Minigrid Labyrinth2 | Solved Rate97 | 6 | |
| Partially observable navigation | Minigrid Labyrinth | Solved Rate100 | 6 | |
| Partially observable navigation | Minigrid 16Rooms2 | Solved Rate100 | 6 | |
| Partially observable navigation | Minigrid 16Rooms | Solved Rate97 | 6 | |
| Solve Rate | Minigrid | SixteenRooms Solve Rate100 | 5 |