Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Grid-world

Benchmarks

Task NameDataset NameSOTA ResultTrend
Spatial Navigation15x15 Grid World 50 environments 20 days
Median Path Cost (Mean)15.2
8
DarkroomGrid World
Offline Training Time (hour)0.18
6
Goal-driven navigationGrid-world Overall (unseen maps)
SR100
5
Goal-driven navigationGrid-world Unseen Goals (unseen maps)
Success Rate100
5
Goal-driven navigationGrid-world Seen Goals (unseen maps)
SR100
5
Large Dark Key-to-DoorLarge Grid World
Offline Training Time (hour)3.16
3
Large Darkroom DynamicLarge Grid World
Offline Training Time (hour)2.63
3
Large Darkroom HardLarge Grid World
Offline Training Time (hour)2.78
3
Large DarkroomLarge Grid World
Offline Training Time (hour)2.38
3
Dark Key-to-DoorGrid World
Offline Training Time (hour)0.41
3
Darkroom HardGrid World
Offline Training Time (hour)0.2
3
Safety-constrained Reinforcement LearningGrid-world Time-Variant Safety Threshold (100 randomly generated environments)
Safety Violations0
2
Safety-constrained Reinforcement LearningGrid-world Time-Invariant Safety Threshold (100 randomly generated environments)
Safety Violation Count0
2
Reinforcement LearningGrid World Npick=5, Sparse (test)
Maximum Average Return0.7
2
Reinforcement LearningGrid World Npick=3 Dense (test)
Max Average Return3
2
Showing 15 of 15 rows