Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Learning to Track Objects from Unlabeled Videos

About

In this paper, we propose to learn an Unsupervised Single Object Tracker (USOT) from scratch. We identify that three major challenges, i.e., moving object discovery, rich temporal variation exploitation, and online update, are the central causes of the performance bottleneck of existing unsupervised trackers. To narrow the gap between unsupervised trackers and supervised counterparts, we propose an effective unsupervised learning approach composed of three stages. First, we sample sequentially moving objects with unsupervised optical flow and dynamic programming, instead of random cropping. Second, we train a naive Siamese tracker from scratch using single-frame pairs. Third, we continue training the tracker with a novel cycle memory learning scheme, which is conducted in longer temporal spans and also enables our tracker to update online. Extensive experiments show that the proposed USOT learned from unlabeled videos performs well over the state-of-the-art unsupervised trackers by large margins, and on par with recent supervised deep trackers. Code is available at https://github.com/VISION-SJTU/USOT.

Jilai Zheng, Chao Ma, Houwen Peng, Xiaokang Yang• 2021

Related benchmarks

TaskDatasetResultRank
Object TrackingLaSoT
AUC33.7
498
Visual Object TrackingLaSOT (test)--
470
Object TrackingTrackingNet
Precision (P)55.1
327
Visual Object TrackingTNL2K
AUC30
169
Visual Object TrackingVOT 2016
EAO40.2
92
TrackingOTB99
AUC0.589
45
Visual Object TrackingVOT 2018
EAO0.344
36
Visual Object TrackingTrackingNet
Success Rate (AUC)61.6
25
Visual Object TrackingOTB 2015
Success Rate (Suc)63.9
22
Object TrackingVOT 2018
EAO0.29
22
Showing 10 of 12 rows

Other info

Follow for update