Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Boosting Self-Supervised Tracking with Contextual Prompts and Noise Learning

About

Learning robust contextual knowledge from unlabeled videos is essential for advancing self-supervised tracking. However, conventional self-supervised trackers lack effective context modeling, while existing context association methods based on non-semantic queries struggle to adapt to unlabeled tracking scenarios, making it difficult to learn reliable contextual cues. In this work, we propose a novel self-supervised tracking framework, named \textbf{\tracker}, which introduces a dual-modal context association mechanism that jointly leverages fine-grained semantic prompts and contextual noise to drive the model toward learning robust tracking representations. Adherent to the easy-to-hard learning principle, our contextual association mechanism operates based on two stages. During early training, instance patch tokens (prompts) are assigned to both forward and backward tracking branches to facilitate the acquisition of tracking knowledge. As training progresses, contextual noise is gradually injected into the model to perturb feature, encouraging the tracker to learn robust tracking representations in a more complex feature space. Thus, this novel contextual association mechanism enables our self-supervised model to learn high-quality tracking representations from unlabeled videos, while being applied exclusively during training to preserve efficient inference. Extensive experiments demonstrate the superiority of our method.

Yaozong Zheng, Qihua Liang, Bineng Zhong, Shuimu Zeng, Yuanliang Xue, Ning Li, Shuxiang Song• 2026

Related benchmarks

TaskDatasetResultRank
Object TrackingLaSoT
AUC67.1
498
Object TrackingTrackingNet
Precision (P)79.7
327
Visual Object TrackingGOT-10k
AO72.7
306
Visual Object TrackingUAV123
AUC0.664
193
Visual Object TrackingTNL2K
AUC55.3
169
Visual Object TrackingOTB-100
AUC71.2
154
Visual Object TrackingLaSoText
AUC49.1
140
Visual Object TrackingVOT 2020
EAO0.522
18
Showing 8 of 8 rows

Other info

Follow for update