Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Vision-Language Interpreter for Robot Task Planning

About

Large language models (LLMs) are accelerating the development of language-guided robot planners. Meanwhile, symbolic planners offer the advantage of interpretability. This paper proposes a new task that bridges these two trends, namely, multimodal planning problem specification. The aim is to generate a problem description (PD), a machine-readable file used by the planners to find a plan. By generating PDs from language instruction and scene observation, we can drive symbolic planners in a language-guided framework. We propose a Vision-Language Interpreter (ViLaIn), a new framework that generates PDs using state-of-the-art LLM and vision-language models. ViLaIn can refine generated PDs via error message feedback from the symbolic planner. Our aim is to answer the question: How accurately can ViLaIn and the symbolic planner generate valid robot plans? To evaluate ViLaIn, we introduce a novel dataset called the problem description generation (ProDG) dataset. The framework is evaluated with four new evaluation metrics. Experimental results show that ViLaIn can generate syntactically correct problems with more than 99\% accuracy and valid plans with more than 58\% accuracy. Our code and dataset are available at https://github.com/omron-sinicx/ViLaIn.

Keisuke Shirai, Cristian C. Beltran-Hernandez, Masashi Hamaya, Atsushi Hashimoto, Shohei Tanaka, Kento Kawaharazuka, Kazutoshi Tanaka, Yoshitaka Ushiku, Shinsuke Mori• 2023

Related benchmarks

TaskDatasetResultRank
Predicate GroundingProDG Blocks
F1 Score98
26
Predicate GroundingProDG Cooking
F1 Score96
21
Predicate GroundingProDG Hanoi
F1 Score65.8
19
Predicate GroundingPyBullet Blocks
F1 Score100
15
Predicate GroundingReal Images Blocksworld
F1 Score100
15
Predicate GroundingPDDLGym Hanoi Color
F1 Score100
10
Object GroundingPDDLGym Blocksworld
F1 Score73.5
10
Predicate GroundingPyBullet Hanoi
F1 Score64.3
10
Goal GroundingProDG Hanoi
F140.7
7
PlanningProDG Blocks
Success Rate (%)79
7
Showing 10 of 20 rows

Other info

Follow for update