UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning

About

Unmanned Aerial Vehicles (UAVs) are evolving into language-interactive platforms, enabling more intuitive forms of human-drone interaction. While prior work has primarily focused on high-level planning and long-horizon navigation, we shift attention to language-guided fine-grained trajectory control, where UAVs execute short-range, reactive flight behaviors in response to language instructions. We formalize this problem as the Flying-on-a-Word (Flow) task and introduce UAV imitation learning as an effective approach. In this framework, UAVs learn fine-grained control policies by mimicking expert pilot trajectories paired with atomic language instructions. To support this paradigm, we present UAV-Flow, the first real-world benchmark for language-conditioned, fine-grained UAV control. It includes a task formulation, a large-scale dataset collected in diverse environments, a deployable control framework, and a simulation suite for systematic evaluation. Our design enables UAVs to closely imitate the precise, expert-level flight trajectories of human pilots and supports direct deployment without a sim-to-real gap. We conduct extensive experiments on UAV-Flow, benchmarking VLN and VLA paradigms. Results show that VLA models are superior to VLN baselines and highlight the critical role of spatial grounding in the fine-grained Flow setting.
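
As a rough illustration of the imitation-learning setup described in the abstract, the sketch below pairs one atomic instruction with an expert trajectory and scores a candidate trajectory by its mean squared waypoint error. The names `FlowSample` and `imitation_loss`, the (x, y, z, yaw) waypoint layout, and the choice of a mean squared error objective are all assumptions made for illustration; the page does not state the paper's actual data format or training loss.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical waypoint layout: (x, y, z, yaw). The real dataset format may differ.
Waypoint = Tuple[float, float, float, float]


@dataclass
class FlowSample:
    """One training pair: an atomic instruction and the pilot's trajectory."""
    instruction: str
    expert_trajectory: List[Waypoint]


def imitation_loss(predicted: List[Waypoint], expert: List[Waypoint]) -> float:
    """Mean squared error over all waypoint components.

    Assumed objective for illustration only. Yaw is treated as a plain
    number here, so angle wrap-around is not handled.
    """
    if not expert:
        raise ValueError("expert trajectory must not be empty")
    if len(predicted) != len(expert):
        raise ValueError("trajectories must have the same number of waypoints")
    total = 0.0
    count = 0
    for pred_wp, expert_wp in zip(predicted, expert):
        for p, e in zip(pred_wp, expert_wp):
            total += (p - e) ** 2
            count += 1
    return total / count


sample = FlowSample(
    instruction="ascend one meter",  # made-up instruction, not from the dataset
    expert_trajectory=[(0.0, 0.0, 1.0, 0.0), (0.0, 0.0, 2.0, 0.0)],
)
candidate = [(0.0, 0.0, 1.0, 0.0), (0.0, 0.0, 1.0, 0.0)]
loss = imitation_loss(candidate, sample.expert_trajectory)
```

A policy trained this way would be rewarded for staying close to the pilot's waypoints at every step, which is the sense of "mimicking" used above.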

Xiangyu Wang, Donglin Yang, Yue Liao, Wenhao Zheng, Wenjun Wu, Bin Dai, Hongsheng Li, Si Liu • 2025

Related benchmarks

Task	Dataset	Result
Zero-Shot Aerial Navigation	AerialVLN (test)	Success Rate (SR): 36.47	18
Embodied Question Answering	FG-EQA	QAS (Score): 1.49	14
Vision-Language Navigation	Various Vision-Language Navigation Datasets	Number of Trajectories: 4.08e+4	13
UAV Navigation	UAV-Flow-Sim (test)	Approach Success Rate: 42.86	12
Zero-Shot Aerial Navigation	OpenFly (test)	Success Rate: 32.14	9
UAV Navigation	Urban Canyon Traversal easy (test)	Navigation Error (m): 34.07	4
UAV Navigation	Urban Canyon Traversal Hard (test)	Navigation Error (m): 41.35	4
Long-horizon Flow	FLIGHT Long-horizon Flow	SR: 10.5	3
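
The table reports success rates and navigation errors without defining them on this page. The sketch below follows a common navigation-benchmark convention, assumed here rather than confirmed by the source: navigation error is the Euclidean distance in meters between the final position and the goal, and an episode counts as a success when that error is within a threshold. The function names and the 2.0 m threshold are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Position = Sequence[float]  # (x, y, z) in meters


def navigation_error(final_position: Position, goal_position: Position) -> float:
    """Euclidean distance in meters between where the UAV stopped and the goal."""
    return math.dist(final_position, goal_position)


def success_rate(episodes: List[Tuple[Position, Position]], threshold_m: float) -> float:
    """Percentage of episodes whose navigation error is within threshold_m."""
    if not episodes:
        raise ValueError("need at least one episode")
    successes = sum(
        1 for final, goal in episodes
        if navigation_error(final, goal) <= threshold_m
    )
    return 100.0 * successes / len(episodes)


# Two made-up episodes given as (final position, goal position).
episodes = [
    ((0.0, 0.0, 0.0), (3.0, 4.0, 0.0)),  # error 5.0 m
    ((1.0, 1.0, 1.0), (1.0, 1.0, 2.0)),  # error 1.0 m
]
errors = [navigation_error(final, goal) for final, goal in episodes]
sr = success_rate(episodes, threshold_m=2.0)  # hypothetical threshold
```

Each benchmark listed above may use its own threshold and success criterion, so values computed this way are not directly comparable to the reported numbers.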

