DepthTrack : Unveiling the Power of RGBD Tracking
About
RGBD (RGB plus depth) object tracking is gaining momentum as RGBD sensors have become popular in many application fields such as robotics.However, the best RGBD trackers are extensions of the state-of-the-art deep RGB trackers. They are trained with RGB data and the depth channel is used as a sidekick for subtleties such as occlusion detection. This can be explained by the fact that there are no sufficiently large RGBD datasets to 1) train deep depth trackers and to 2) challenge RGB trackers with sequences for which the depth cue is essential. This work introduces a new RGBD tracking dataset - Depth-Track - that has twice as many sequences (200) and scene types (40) than in the largest existing dataset, and three times more objects (90). In addition, the average length of the sequences (1473), the number of deformable objects (16) and the number of annotated tracking attributes (15) have been increased. Furthermore, by running the SotA RGB and RGBD trackers on DepthTrack, we propose a new RGBD tracking baseline, namely DeT, which reveals that deep RGBD tracking indeed benefits from genuine training data. The code and dataset is available at https://github.com/xiaozai/DeT
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| RGB-D Object Tracking | VOT-RGBD 2022 (public challenge) | EAO65.7 | 167 | |
| RGB-D Object Tracking | DepthTrack (test) | Precision59 | 145 | |
| Visual Object Tracking | DepthTrack | Precision0.56 | 41 | |
| Object Tracking | FE240hz 61 (test) | RSR (HDR)52.5 | 16 | |
| Object Tracking | VisEvent 51 (test) | Rigid RSR48.9 | 16 | |
| Visual Object Tracking | CDTB | Precision67.4 | 12 | |
| Visual Object Tracking | ARKitTrack (test) | Precision42.8 | 12 | |
| RGB-D Object Tracking | CDTB | Precision67.4 | 9 | |
| RGB-D Tracking | RGBD1K (test) | Precision Rate (PR)43.8 | 7 | |
| Single Object Tracking | FE240hz | RSR54.2 | 4 |