Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TCTrack: Temporal Contexts for Aerial Tracking

About

Temporal contexts among consecutive frames are far from being fully utilized in existing visual trackers. In this work, we present TCTrack, a comprehensive framework to fully exploit temporal contexts for aerial tracking. The temporal contexts are incorporated at \textbf{two levels}: the extraction of \textbf{features} and the refinement of \textbf{similarity maps}. Specifically, for feature extraction, an online temporally adaptive convolution is proposed to enhance the spatial features using temporal information, which is achieved by dynamically calibrating the convolution weights according to the previous frames. For similarity map refinement, we propose an adaptive temporal transformer, which first effectively encodes temporal knowledge in a memory-efficient way, before the temporal knowledge is decoded for accurate adjustment of the similarity map. TCTrack is effective and efficient: evaluation on four aerial tracking benchmarks shows its impressive performance; real-world UAV tests show its high speed of over 27 FPS on NVIDIA Jetson AGX Xavier.

Ziang Cao, Ziyuan Huang, Liang Pan, Shiwei Zhang, Ziwei Liu, Changhong Fu• 2022

Related benchmarks

TaskDatasetResultRank
Object TrackingLaSoT
AUC60.5
333
Object TrackingTrackingNet
Precision (P)73.3
225
Visual Object TrackingUAV123 (test)--
188
Object TrackingGOT-10k
AO66.2
74
Visual TrackingUAV123--
41
Visual Object TrackingDTB70 (test)
AUC62.2
19
Visual Object TrackingUAVDT (test)
AUC53
19
Visual Object TrackingUAVTrack112 L (test)
AUC (%)58.3
19
Visual Object TrackingUAVTrack112 (test)
AUC59.4
19
Anti-UAV TrackingAnti-UAV318 (test)
AUC42.4
17
Showing 10 of 18 rows

Other info

Follow for update