Chat with UAV -- Human-UAV Interaction Based on Large Language Models

About

UAV interaction systems are evolving from engineer-driven to user-driven designs that aim to replace traditional predefined Human-UAV Interaction (HUI). This shift enables more personalized task planning and design, yielding a higher-quality interaction experience and greater flexibility in fields such as agriculture, aerial photography, logistics, and environmental monitoring. However, because users and UAVs lack a common language, such interaction is often difficult to achieve. Large Language Models (LLMs) can understand both natural language and robot (UAV) behaviors, which makes personalized Human-UAV Interaction possible. Several LLM-based HUI frameworks have recently been proposed, but they commonly struggle with mixed task planning and execution, leading to low adaptability in complex scenarios. In this paper, we propose a novel dual-agent HUI framework. The framework constructs two independent LLM agents (a task planning agent and an execution agent) and applies different prompt engineering to each, so that the understanding, planning, and execution of tasks are handled separately. To verify the effectiveness and performance of the framework, we built a task database covering four typical UAV application scenarios and quantified the framework's performance using three independent metrics. We also compare the performance of different LLMs used to control the UAVs. Results from our user study show that the framework improves the smoothness of HUI and the flexibility of task execution in the task scenarios we set up, effectively meeting users' personalized needs.
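The dual-agent split described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the prompts, the function names (`plan`, `execute`, `run`), the `CMD(...)` command format, and the `fake_llm` stand-in are all hypothetical, chosen only to show a planning agent and an execution agent driven by different prompts.

```python
from typing import Callable, List

# Hypothetical prompts; the abstract does not give the paper's actual prompts.
PLANNER_PROMPT = "You are a UAV task planner. List the steps, one per line."
EXECUTOR_PROMPT = "You are a UAV executor. Turn the step into one command."

# An LLM is modeled as any function (system_prompt, message) -> reply.
LLM = Callable[[str, str], str]


def plan(llm: LLM, request: str) -> List[str]:
    """Planning agent: turn a natural-language request into ordered steps."""
    reply = llm(PLANNER_PROMPT, request)
    return [line.strip() for line in reply.splitlines() if line.strip()]


def execute(llm: LLM, steps: List[str]) -> List[str]:
    """Execution agent: map each planned step to one command string."""
    return [llm(EXECUTOR_PROMPT, step).strip() for step in steps]


def fake_llm(system_prompt: str, message: str) -> str:
    """Offline stand-in for a real model (assumption, for illustration)."""
    if system_prompt == PLANNER_PROMPT:
        return "1. take off\n2. fly to the field\n3. land"
    # Strip the leading "N. " numbering and wrap the step as a command.
    return "CMD(" + message.split(". ", 1)[-1] + ")"


def run(request: str, llm: LLM = fake_llm) -> List[str]:
    """Chain the two agents: plan first, then execute each step."""
    return execute(llm, plan(llm, request))


if __name__ == "__main__":
    print(run("Survey the field"))
```

Keeping the two agents behind the same callable interface means either one could be backed by a different model, which is in the spirit of the abstract's comparison of different LLMs.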

Haoran Wang, Zhuohang Chen, Guang Li, Bo Ma, Chuanghuang Li • 2025

Related benchmarks

Task	Dataset	Result
Human Preference Evaluation	User Study (Group 2)	Category CT Score: 24	3
User Preference Evaluation	User-study dataset CI tasks (Group 1)	Choice Count: 14	2
User Preference Evaluation	User-study dataset ST tasks (Group 1)	Choice Count: 16	2
User Preference Evaluation	User-study dataset SI tasks (Group 1)	Choice Count: 14	2
User Preference Evaluation	User-study dataset CT tasks (Group 1)	Choice Count: 11	2
