Layer-Guided UAV Tracking: Enhancing Efficiency and Occlusion Robustness

About

Visual object tracking (VOT) plays a pivotal role in unmanned aerial vehicle (UAV) applications. Addressing the trade-off between accuracy and efficiency, especially under challenging conditions like unpredictable occlusion, remains a significant challenge. This paper introduces LGTrack, a unified UAV tracking framework that integrates dynamic layer selection, efficient feature enhancement, and robust representation learning for occlusions. By employing a novel lightweight Global-Grouped Coordinate Attention (GGCA) module, LGTrack captures long-range dependencies and global contexts, enhancing feature discriminability with minimal computational overhead. Additionally, a lightweight Similarity-Guided Layer Adaptation (SGLA) module replaces knowledge distillation, achieving an optimal balance between tracking precision and inference efficiency. Experiments on three datasets demonstrate LGTrack's state-of-the-art real-time speed (258.7 FPS on UAVDT) while maintaining competitive tracking accuracy (82.8\% precision). Code is available at https://github.com/XiaoMoc/LGTrack

Yang Zhou, Derui Ding, Ran Sun, Ying Sun, Haohua Zhang• 2026

Related benchmarks

Task	Dataset	Result
Visual Object Tracking	UAV123 (test)	--	188
UAV Tracking	UAVDT (test)	Precision82.8	15
UAV Tracking	DTB70 (test)	Precision84	15
UAV Tracking	Average (DTB70, UAVDT, UAV123)	AP83.7	15

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord