Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CLARA: Classifying and Disambiguating User Commands for Reliable Interactive Robotic Agents

About

In this paper, we focus on inferring whether the given user command is clear, ambiguous, or infeasible in the context of interactive robotic agents utilizing large language models (LLMs). To tackle this problem, we first present an uncertainty estimation method for LLMs to classify whether the command is certain (i.e., clear) or not (i.e., ambiguous or infeasible). Once the command is classified as uncertain, we further distinguish it between ambiguous or infeasible commands leveraging LLMs with situational aware context in a zero-shot manner. For ambiguous commands, we disambiguate the command by interacting with users via question generation with LLMs. We believe that proper recognition of the given commands could lead to a decrease in malfunction and undesired actions of the robot, enhancing the reliability of interactive robot agents. We present a dataset for robotic situational awareness, consisting pair of high-level commands, scene descriptions, and labels of command type (i.e., clear, ambiguous, or infeasible). We validate the proposed method on the collected dataset, pick-and-place tabletop simulation. Finally, we demonstrate the proposed approach in real-world human-robot interaction experiments, i.e., handover scenarios.

Jeongeun Park, Seungwon Lim, Joonhyung Lee, Sangbeom Park, Minsuk Chang, Youngjae Yu, Sungjoon Choi• 2023

Related benchmarks

TaskDatasetResultRank
Robotic PlanningLEMMA Single-Agent Multiplicity
Success Rate (SR)48
14
Robotic PlanningLEMMA Single-Agent Underspecified
Success Rate (SR)70
14
Robotic PlanningLEMMA Single-Agent Overall
Success Rate (SR)52
14
Robotic PlanningLEMMA Single-Agent Absence
Success Rate (SR)37
14
Robotic PlanningLEMMA Stack and Pass tasks, partially observed (test)
Success Rate45
8
Robotic Task PlanningLEMMA Single-agent
Calls per Episode10.36
4
Showing 6 of 6 rows

Other info

Follow for update