Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

D3S -- A Discriminative Single Shot Segmentation Tracker

About

Template-based discriminative trackers are currently the dominant tracking paradigm due to their robustness, but are restricted to bounding box tracking and a limited range of transformation models, which reduces their localization accuracy. We propose a discriminative single-shot segmentation tracker - D3S, which narrows the gap between visual object tracking and video object segmentation. A single-shot network applies two target models with complementary geometric properties, one invariant to a broad range of transformations, including non-rigid deformations, the other assuming a rigid object to simultaneously achieve high robustness and online target segmentation. Without per-dataset finetuning and trained only for segmentation as the primary output, D3S outperforms all trackers on VOT2016, VOT2018 and GOT-10k benchmarks and performs close to the state-of-the-art trackers on the TrackingNet. D3S outperforms the leading segmentation tracker SiamMask on video object segmentation benchmark and performs on par with top video object segmentation algorithms, while running an order of magnitude faster, close to real-time.

Alan Luke\v{z}i\v{c}, Ji\v{r}\'i Matas, Matej Kristan• 2019

Related benchmarks

TaskDatasetResultRank
Video Object SegmentationDAVIS 2017 (val)
J mean66.1
1130
Video Object SegmentationDAVIS 2016 (val)
J Mean75.4
564
Visual Object TrackingTrackingNet (test)
Normalized Precision (Pnorm)76.8
460
Visual Object TrackingLaSOT (test)
AUC49.2
444
Visual Object TrackingGOT-10k (test)
Average Overlap59.7
378
Video Object SegmentationYouTube-VOS 2019 (val)
J-Score (Seen)60.2
231
Object TrackingTrackingNet
Precision (P)66.4
225
Visual Object TrackingVOT 2020 (test)
EAO0.439
147
Object TrackingGOT-10k
AO59.7
74
Visual Object TrackingVOT 2018 (test)
EAO0.489
54
Showing 10 of 16 rows

Other info

Follow for update