Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Do Different Tracking Tasks Require Different Appearance Models?

About

Tracking objects of interest in a video is one of the most popular and widely applicable problems in computer vision. However, with the years, a Cambrian explosion of use cases and benchmarks has fragmented the problem in a multitude of different experimental setups. As a consequence, the literature has fragmented too, and now novel approaches proposed by the community are usually specialised to fit only one specific setup. To understand to what extent this specialisation is necessary, in this work we present UniTrack, a solution to address five different tasks within the same framework. UniTrack consists of a single and task-agnostic appearance model, which can be learned in a supervised or self-supervised fashion, and multiple ``heads'' that address individual tasks and do not require training. We show how most tracking tasks can be solved within this framework, and that the same appearance model can be successfully used to obtain results that are competitive against specialised methods for most of the tasks considered. The framework also allows us to analyse appearance models obtained with the most recent self-supervised methods, thus extending their evaluation and comparison to a larger variety of important problems.

Zhongdao Wang, Hengshuang Zhao, Ya-Li Li, Shengjin Wang, Philip H.S. Torr, Luca Bertinetto• 2021

Related benchmarks

TaskDatasetResultRank
Video Object SegmentationDAVIS 2017 (val)
J mean62.8
1193
Visual Object TrackingTrackingNet (test)--
463
Visual Object TrackingLaSOT (test)
AUC35.1
446
Object TrackingLaSoT--
411
Object TrackingTrackingNet
Precision (P)51.2
270
Multi-Object TrackingMOT16 (test)
MOTA74.7
228
Video Instance SegmentationYouTube-VIS (val)
AP30.1
118
Video Object SegmentationDAVIS 2017 (test)
J (Jaccard Index)58.4
107
Video Object SegmentationDAVIS 2017
Jaccard Index (J)58.4
82
Object TrackingOTB 2015 (test)
AUC0.618
63
Showing 10 of 20 rows

Other info

Code

Follow for update