Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Labyrinth

Benchmarks

Task NameDataset NameSOTA ResultTrend
PlanningLabyrinth unseen problems
Completion Rate100
11
PlanningLabyrinth known optimal problems
Optimal Solutions Rate100
11
PlanningLabyrinth
Completion Rate100
9
Hierarchical PlanningLabyrinth
Token Cost5,173
6
PlanningLabyrinth (test)
Average Solving Time (s)0.54
5
Transfer LearningLabyrinth
Mean Score491
5
Showing 6 of 6 rows