Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Decoupled Spatio-Temporal Consistency Learning for Self-Supervised Tracking

About

The success of visual tracking has been largely driven by datasets with manual box annotations. However, these box annotations require tremendous human effort, limiting the scale and diversity of existing tracking datasets. In this work, we present a novel Self-Supervised Tracking framework named \textbf{{\tracker}}, designed to eliminate the need of box annotations. Specifically, a decoupled spatio-temporal consistency training framework is proposed to learn rich target information across timestamps through global spatial localization and local temporal association. This allows for the simulation of appearance and motion variations of instances in real-world scenarios. Furthermore, an instance contrastive loss is designed to learn instance-level correspondences from a multi-view perspective, offering robust instance supervision without additional labels. This new design paradigm enables {\tracker} to effectively learn generic tracking representations in a self-supervised manner, while reducing reliance on extensive box annotations. Extensive experiments on nine benchmark datasets demonstrate that {\tracker} surpasses \textit{SOTA} self-supervised tracking methods, achieving an improvement of more than 25.3\%, 20.4\%, and 14.8\% in AUC (AO) score on the GOT10K, LaSOT, TrackingNet datasets, respectively. Code: https://github.com/GXNU-ZhongLab/SSTrack.

Yaozong Zheng, Bineng Zhong, Qihua Liang, Ning Li, Shuxiang Song• 2025

Related benchmarks

TaskDatasetResultRank
Object TrackingLaSoT
AUC65.9
498
Object TrackingTrackingNet
Precision (P)78.9
327
Visual Object TrackingGOT-10k
AO72.4
306
Visual Object TrackingUAV123
AUC0.655
193
Visual Object TrackingTNL2K
AUC52.1
169
Visual Object TrackingOTB-100
AUC67.9
154
Visual Object TrackingLaSoText
AUC48.5
140
TrackingOTB99
AUC0.679
45
Visual Object TrackingVOT 2020
EAO0.458
18
Visual Object TrackingUGVT UAV View (test)
Success Rate (SR)70.5
16
Showing 10 of 12 rows

Other info

Follow for update