Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning

About

Conventional Task and Motion Planning (TAMP) approaches rely on manually crafted interfaces connecting symbolic task planning with continuous motion generation. These domain-specific and labor-intensive modules are limited in addressing emerging tasks in real-world settings. Here, we present LLM^3, a novel Large Language Model (LLM)-based TAMP framework featuring a domain-independent interface. Specifically, we leverage the powerful reasoning and planning capabilities of pre-trained LLMs to propose symbolic action sequences and select continuous action parameters for motion planning. Crucially, LLM^3 incorporates motion planning feedback through prompting, allowing the LLM to iteratively refine its proposals by reasoning about motion failure. Consequently, LLM^3 interfaces between task planning and motion planning, alleviating the intricate design process of handling domain-specific messages between them. Through a series of simulations in a box-packing domain, we quantitatively demonstrate the effectiveness of LLM^3 in solving TAMP problems and the efficiency in selecting action parameters. Ablation studies underscore the significant contribution of motion failure reasoning to the success of LLM^3. Furthermore, we conduct qualitative experiments on a physical manipulator, demonstrating the practical applicability of our approach in real-world settings.

Shu Wang, Muzhi Han, Ziyuan Jiao, Zeyu Zhang, Ying Nian Wu, Song-Chun Zhu, Hangxin Liu• 2024

Related benchmarks

TaskDatasetResultRank
Dual-arm task planningKitchen Scene
TEI0.934
16
Dual-arm task planningAgricultural Greenhouse Scene
TFR40
16
Unconstrained Motion PlanningKitchen
Success Rate0.00e+0
13
Dual-arm task planningX-DAPT Medium Packages
TFR32.7
8
Dual-arm task planningX-DAPT Hard Packages
TFR40.1
8
Dual-arm task planningX-DAPT Easy Packages
TFR (Task Success Fraction)12.5
8
Dual-arm task planningOffice Scene
TEI0.985
8
Dual-arm task planningFactory Scene
TEI0.913
8
Dual-arm task planningX-DAPT Office scene
TFR0.142
8
Dual-arm task planningX-DAPT Factory scene
TFR35.1
8
Showing 10 of 40 rows

Other info

Follow for update