Robotouille

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Embodied Task Planning | Robotouille Synchronous | Pass@1 Accuracy: 97 | 15 |
| Embodied Task Planning | Robotouille Asynchronous (test) | Pass@1 Accuracy: 86 | 15 |
| Makespan Accuracy | Robotouille | Makespan Accuracy: 20 | 12 |
| Robotic Planning | Robotouille Impossible | Solved Percentage: 100 | 7 |
| Robotic Planning | Robotouille Hard | Solved Rate: 58.1 | 7 |
| Robotic Planning | Robotouille Easy | Solved Rate: 81 | 7 |
| Asynchronous planning | Robotouille | Makespan Accuracy: 17.5 | 3 |
| Robot task code generation | Robotouille simulator (overall) | Execution Success Rate: 79 | 3 |