MAZE

Benchmarks

Task Name	Dataset Name	Metric	SOTA Result
Maze Navigation	Maze (test)	Success Rate	0.8	25
Spatial decision-making	Maze size 8	Success Rate	92.71	22
Multi-Agent Path Finding	Medium Maze 25x25 world size, 32.8% static obstacle rate	Success Rate	100	20
Visual Planning	Maze	Path Length Metric	37.2	19
Maze Navigation	Maze Hard	Accuracy	97.66	18
One-step next-observation prediction	Maze (test)	Token F1	98	16
Goal Prediction	Maze OOD	Classification Accuracy	70	15
Goal Prediction	Maze ID	Classification Accuracy	93.3	15
Reasoning	Maze Hard	pass@1 Accuracy	93.7	15
Maze Navigation	Maze (Standard)	Accuracy	0.9961	14
Imitation Learning	Maze	Success Rate	94.4	12
Heuristic Search	Maze Grid Map	Runtime (ms)	258	12
Reinforcement Learning	Maze Gymnasium	Mean Best Reward	0.97	12
Sequential Planning	Maze	Score (L=8)	100	12
Planning	10x10 Maze	Validity Rate	57	12
Spatial decision-making	Maze size 16	Success Rate	64.85	11
Spatial decision-making	Maze size 12	Success Rate	78.31	11
Spatial decision-making	Maze size 4	Success Rate (%)	94.38	11
Video Reasoning	Maze (test)	Precision	82.1	11
Maze Navigation	Maze	RSR	100	11
Multi-Objective Reinforcement Learning	Maze	Mean Episode Reward (MER)	223.55	11
Navigation	Maze 2D	Success Rate	90	10
Spatial Reasoning	Maze 10×10	CR (%)	61.37	10
Logical Reasoning	Maze	Pass@1	98	10
Video Generation	Maze	Maze Flow (Base)	96.5	10

Showing 25 of 90 rows