Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MiniGrid

Benchmarks

Task NameDataset NameSOTA ResultTrend
Goal RecognitionMiniGrid Suboptimal trajectories
F1 Score100
36
Goal RecognitionMiniGrid (test)
F1 Score1
36
Reinforcement LearningMiniGrid
Training Duration (hours)5.82
9
NavigationMiniGrid held-out mazes
16Rooms86
9
Reinforcement LearningMiniGrid v0 (test)
GoToDoor-8x8 Success Rate0.944
9
Navigation and Procedural Generation RLMiniGrid
GoToDoor-8x894.4
9
Agent Success RateMiniGrid full-view
MTE99
8
Environment InteractionMiniGrid
Environment Steps (M)41,000,000
7
NavigationMiniGrid Four Rooms
Average Episodic Reward0.672
7
Four RoomsMiniGrid
Average Pass Rate88.7
7
Generalization to Unseen ObjectsMiniGrid Case 1: Unseen Objects v1 (test)
Target Generalization Score9.6
6
Language Conditioned TransferMiniGrid Reverse Task Case 3 (target)
Target Success Count (Case 3)98
6
Combined GeneralizationMiniGrid (Case 2)
TGT Score9.8
6
Partially observable navigationMinigrid PerfectMaze (M)
Solved Rate82
6
Partially observable navigationMinigrid LargeCorridor
Solved Rate95
6
Partially observable navigationMinigrid SmallCorridor
Solved Rate97
6
Partially observable navigationMinigrid SimpleCrossing
Solved Rate88
6
Partially observable navigationMinigrid Maze3
Solved Rate96
6
Partially observable navigationMinigrid Maze2
Solved Rate97
6
Partially observable navigationMinigrid Maze
Solved Rate82
6
Partially observable navigationMinigrid Labyrinth2
Solved Rate97
6
Partially observable navigationMinigrid Labyrinth
Solved Rate100
6
Partially observable navigationMinigrid 16Rooms2
Solved Rate100
6
Partially observable navigationMinigrid 16Rooms
Solved Rate97
6
Solve RateMinigrid
SixteenRooms Solve Rate100
5
Showing 25 of 67 rows