Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

BlocksWorld

Benchmarks

Task NameDataset NameSOTA ResultTrend
PlanningBlocksworld (test)
Accuracy97
21
PlanningBlocksworld
Blocksworld Accuracy97
21
PlanningBlocksWorld
Success Rate100
20
Agent TaskBlocksWorld
Success Rate100
17
Agent Behavior AdaptationBlocksWorld (BW) (test)
Loop Ratio51
17
Generalized PlanningBlocksworld
Scale55
12
PlanningBlocks (Blocksworld)
Accuracy100
12
PlanningBlocksworld unseen problems
Completion Rate100
11
PlanningBlocksworld known optimal problems
Optimal Rate1
11
Spatial ReasoningBlocksworld 5-7
Completion Rate30.5
10
PlanningBlocksworld
Completion Rate100
9
Planning CoverageBlocksworld 30 tasks Autoscale (test)
Coverage16
6
HTN PlanningBlocksworld GTOHP
Coverage30
6
PlanningBlocksworld (test)
Average Solving Time (s)0.39
5
Task and Motion PlanningBlocksworld n=6
Success Rate80
4
Task and Motion PlanningBlocksworld n=5
Success Rate (SR)100
4
Task and Motion PlanningBlocksworld n=4
Success Rate (%)90
4
Task and Motion PlanningBlocksworld (n=3)
Success Rate100
4
Planning EfficiencyBlocksworld Planning
Ntokens589.3
4
Heuristic Planningblocksworld p23
Expansion Rate (states/sec)619,260
3
Planningblocksworld-8b ML
Accuracy100
3
Next-token predictionblocksworld-8b (test)
Accuracy99.8
3
Next-token predictionblocksworld 8b (train)
Accuracy100
3
PlanningBlocksworld 1000 samples (test)
Plan Length40.74
2
PlanningBlocksworld 26-100 blocks (test)
Completion Rate (%)97.5
2
Showing 25 of 27 rows