Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Self-Supervised Tracking via Target-Aware Data Synthesis

About

While deep-learning based tracking methods have achieved substantial progress, they entail large-scale and high-quality annotated data for sufficient training. To eliminate expensive and exhaustive annotation, we study self-supervised learning for visual tracking. In this work, we develop the Crop-Transform-Paste operation, which is able to synthesize sufficient training data by simulating various appearance variations during tracking, including appearance variations of objects and background interference. Since the target state is known in all synthesized data, existing deep trackers can be trained in routine ways using the synthesized data without human annotation. The proposed target-aware data-synthesis method adapts existing tracking approaches within a self-supervised learning framework without algorithmic changes. Thus, the proposed self-supervised learning mechanism can be seamlessly integrated into existing tracking frameworks to perform training. Extensive experiments show that our method 1) achieves favorable performance against supervised learning schemes under the cases with limited annotations; 2) helps deal with various tracking challenges such as object deformation, occlusion, or background clutter due to its manipulability; 3) performs favorably against state-of-the-art unsupervised tracking methods; 4) boosts the performance of various state-of-the-art supervised learning frameworks, including SiamRPN++, DiMP, and TransT.

Xin Li, Wenjie Pei, Yaowei Wang, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang• 2021

Related benchmarks

TaskDatasetResultRank
Object TrackingLaSoT
AUC45.5
498
Object TrackingTrackingNet
Precision (P)60.6
327
Visual Object TrackingGOT-10k
AO46.7
306
Visual Object TrackingUAV123
AUC0.552
193
Visual Object TrackingOTB-100
AUC65.3
154
TrackingOTB99
AUC0.653
45
Showing 6 of 6 rows

Other info

Follow for update