Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TSDM: Tracking by SiamRPN++ with a Depth-refiner and a Mask-generator

About

In a generic object tracking, depth (D) information provides informative cues for foreground-background separation and target bounding box regression. However, so far, few trackers have used depth information to play the important role aforementioned due to the lack of a suitable model. In this paper, a RGB-D tracker named TSDM is proposed, which is composed of a Mask-generator (M-g), SiamRPN++ and a Depth-refiner (D-r). The M-g generates the background masks, and updates them as the target 3D position changes. The D-r optimizes the target bounding box estimated by SiamRPN++, based on the spatial depth distribution difference between the target and the surrounding background. Extensive evaluation on the Princeton Tracking Benchmark and the Visual Object Tracking challenge shows that our tracker outperforms the state-of-the-art by a large margin while achieving 23 FPS. In addition, a light-weight variant can run at 31 FPS and thus it is practical for real world applications. Code and models of TSDM are available at https://github.com/lql-team/TSDM.

Pengyao Zhao, Quanli Liu, Wei Wang, Qiang Guo• 2020

Related benchmarks

TaskDatasetResultRank
RGB-D Object TrackingDepthTrack (test)
Precision39.3
145
Visual Object TrackingDepthTrack
Precision0.442
41
Visual Object TrackingCDTB
Precision64.7
12
Visual Object TrackingARKitTrack (test)
Precision38.9
12
RGB-D Object TrackingCDTB
Precision57.8
9
Showing 5 of 5 rows

Other info

Follow for update