Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Grid-world

Benchmarks

Task NameDataset NameSOTA ResultTrend
Spatial Navigation15x15 Grid World 50 environments 20 days
Median Path Cost (Mean)15.2
8
Plan Generation6 grid world domains Seen Appearances
Success Rate (Frozenlake)95.2
6
DarkroomGrid World
Offline Training Time (hour)0.18
6
Goal-driven navigationGrid-world Overall (unseen maps)
SR100
5
Goal-driven navigationGrid-world Unseen Goals (unseen maps)
Success Rate100
5
Goal-driven navigationGrid-world Seen Goals (unseen maps)
SR100
5
Large Dark Key-to-DoorLarge Grid World
Offline Training Time (hour)3.16
3
Large Darkroom DynamicLarge Grid World
Offline Training Time (hour)2.63
3
Large Darkroom HardLarge Grid World
Offline Training Time (hour)2.78
3
Large DarkroomLarge Grid World
Offline Training Time (hour)2.38
3
Dark Key-to-DoorGrid World
Offline Training Time (hour)0.41
3
Darkroom HardGrid World
Offline Training Time (hour)0.2
3
Safety-constrained Reinforcement LearningGrid-world Time-Variant Safety Threshold (100 randomly generated environments)
Safety Violations0
2
Safety-constrained Reinforcement LearningGrid-world Time-Invariant Safety Threshold (100 randomly generated environments)
Safety Violation Count0
2
Reinforcement LearningGrid World Npick=5, Sparse (test)
Maximum Average Return0.7
2
Reinforcement LearningGrid World Npick=3 Dense (test)
Max Average Return3
2
Showing 16 of 16 rows