Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Towards Grand Unification of Object Tracking

About

We present a unified method, termed Unicorn, that can simultaneously solve four tracking problems (SOT, MOT, VOS, MOTS) with a single network using the same model parameters. Due to the fragmented definitions of the object tracking problem itself, most existing trackers are developed to address a single or part of tasks and overspecialize on the characteristics of specific tasks. By contrast, Unicorn provides a unified solution, adopting the same input, backbone, embedding, and head across all tracking tasks. For the first time, we accomplish the great unification of the tracking network architecture and learning paradigm. Unicorn performs on-par or better than its task-specific counterparts in 8 tracking datasets, including LaSOT, TrackingNet, MOT17, BDD100K, DAVIS16-17, MOTS20, and BDD100K MOTS. We believe that Unicorn will serve as a solid step towards the general vision model. Code is available at https://github.com/MasterBin-IIAU/Unicorn.

Bin Yan, Yi Jiang, Peize Sun, Dong Wang, Zehuan Yuan, Ping Luo, Huchuan Lu• 2022

Related benchmarks

TaskDatasetResultRank
Video Object SegmentationDAVIS 2017 (val)
J mean66.1
1130
Multiple Object TrackingMOT17 (test)
MOTA77.2
921
Video Object SegmentationDAVIS 2016 (val)
J Mean86.5
564
Visual Object TrackingTrackingNet (test)
Normalized Precision (Pnorm)86.4
460
Visual Object TrackingLaSOT (test)
AUC68.5
444
Object TrackingLaSoT
AUC68.5
333
Object TrackingTrackingNet
Precision (P)82.2
225
Video Object SegmentationDAVIS 2017 (test)
J (Jaccard Index)65.2
107
Multi-Object TrackingBDD100K (val)
mIDF154
70
Multi-Object Tracking and SegmentationBDD100K segmentation tracking (val)
mMOTSA30.7
54
Showing 10 of 33 rows

Other info

Code

Follow for update