Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MLDT: Multi-Level Decomposition for Complex Long-Horizon Robotic Task Planning with Open-Source Large Language Model

About

In the realm of data-driven AI technology, the application of open-source large language models (LLMs) in robotic task planning represents a significant milestone. Recent robotic task planning methods based on open-source LLMs typically leverage vast task planning datasets to enhance models' planning abilities. While these methods show promise, they struggle with complex long-horizon tasks, which require comprehending more context and generating longer action sequences. This paper addresses this limitation by proposing MLDT, theMulti-Level Decomposition Task planning method. This method innovatively decomposes tasks at the goal-level, task-level, and action-level to mitigate the challenge of complex long-horizon tasks. In order to enhance open-source LLMs' planning abilities, we introduce a goal-sensitive corpus generation method to create high-quality training data and conduct instruction tuning on the generated corpus. Since the complexity of the existing datasets is not high enough, we construct a more challenging dataset, LongTasks, to specifically evaluate planning ability on complex long-horizon tasks. We evaluate our method using various LLMs on four datasets in VirtualHome. Our results demonstrate a significant performance enhancement in robotic task planning, showcasing MLDT's effectiveness in overcoming the limitations of existing methods based on open-source LLMs as well as its practicality in complex, real-world scenarios.

Yike Wu, Jiatao Zhang, Nan Hu, LanLing Tang, Guilin Qi, Jun Shao, Jie Ren, Wei Song• 2024

Related benchmarks

TaskDatasetResultRank
Monkey-banana task executionDual Bananas Scene 4
Success Rate99
6
Robotic PlanningComprehensive Scene 13
SR98
6
Robotic Task PlanningClassic (Scene 2)
Success Rate85
6
Monkey-banana task executionDual Bananas Scene 5
Success Rate65
6
Robotic PlanningComprehensive (Scene 14)
SR12
6
Robotic Task PlanningClassic (Scene 1)
Success Rate (SR)98
6
Task PlanningShortsighted Monkey (Scene 7)
SR96
6
Monkey-banana task executionDual Bananas Scene 6
Success Rate90
6
Robotic PlanningScene 15 Comprehensive
SR0.00e+0
6
Robotic Task PlanningClassic (Scene 3)
Success Rate (SR)0.00e+0
6
Showing 10 of 16 rows

Other info

Follow for update