Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration

About

Pre-trained large language models (LLMs) show promise for robotic task planning but often struggle to guarantee correctness in long-horizon problems. Task and motion planning (TAMP) addresses this by grounding symbolic plans in low-level execution, yet it relies heavily on manually engineered planning domains. To improve long-horizon planning reliability and reduce human intervention, we present Planning Domain Derivation with LLMs (PDDLLM), a framework that automatically induces symbolic predicates and actions directly from demonstration trajectories by combining LLM reasoning with physical simulation roll-outs. Unlike prior domain-inference methods that rely on partially predefined or language descriptions of planning domains, PDDLLM constructs domains without manual domain initialization and automatically integrates them with motion planners to produce executable plans, enhancing long-horizon planning automation. Across 1,200 tasks in nine environments, PDDLLM outperforms six LLM-based planning baselines, achieving at least 20\% higher success rates, reduced token costs, and successful deployment on multiple physical robot platforms.

Jinbang Huang, Yixin Xiao, Zhanguang Zhang, Mark Coates, Jianye Hao, Yingxue Zhang• 2025

Related benchmarks

TaskDatasetResultRank
Parts AssemblyPDDLLM v1 (test)
Planning Success Rate100
6
AlignmentPDDLLM v1 (test)
Planning Success Rate100
6
Bridge BuildingPDDLLM v1 (test)
Planning Success Rate87.2
6
Burger CookingPDDLLM v1 (test)
Planning Success Rate91.7
6
Color ClassificationPDDLLM v1 (test)
Planning Success Rate100
6
OverallPDDLLM v1 (test)
Planning Success Rate93.3
6
RearrangePDDLLM v1 (test)
Planning Success Rate64.3
6
StackPDDLLM v1 (test)
Planning Success Rate97.5
6
Tower of HanoiPDDLLM v1 (test)
Planning Success Rate100
6
UnstackPDDLLM v1 (test)
Planning Success Rate0.977
6
Showing 10 of 18 rows

Other info

Follow for update