MOTS: Multi-Object Tracking and Segmentation
About
This paper extends the popular task of multi-object tracking to multi-object tracking and segmentation (MOTS). Towards this goal, we create dense pixel-level annotations for two existing tracking datasets using a semi-automatic annotation procedure. Our new annotations comprise 65,213 pixel masks for 977 distinct objects (cars and pedestrians) in 10,870 video frames. For evaluation, we extend existing multi-object tracking metrics to this new task. Moreover, we propose a new baseline method which jointly addresses detection, tracking, and segmentation with a single convolutional network. We demonstrate the value of our datasets by achieving improvements in performance when training on MOTS annotations. We believe that our datasets, metrics and baseline will become a valuable resource towards developing multi-object tracking approaches that go beyond 2D bounding boxes. We make our annotations, code, and models available at https://www.vision.rwth-aachen.de/page/mots.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-Object Tracking | KITTI Tracking (test) | MOTA84.83 | 56 | |
| Multiple Object Tracking | 2D MOT15 (test) | MOTA69.2 | 34 | |
| Instance Segmentation Tracking | MOTS (test) | IDF142.4 | 21 | |
| Multi-Object Tracking and Segmentation | KITTI MOTS (val) | sMOTSA (Car)76.2 | 18 | |
| Multi-Object Tracking and Segmentation | MOTS 4-fold cross-validation 2020 (train) | sMOTSA52.7 | 8 | |
| Multi-Object Tracking and Segmentation | MOTSChallenge (leaving-one-out fashion) | sMOTSA52.7 | 6 | |
| Multi-Object Tracking and Segmentation | KITTI MOTS (test) | sMOTSA (Pedestrians)47.3 | 6 | |
| Multi-Object Tracking and Segmentation | MOTS | sMOTSA40.6 | 6 | |
| Multi-Object Tracking and Segmentation | CVPR MOTS Challenge 2020 (test) | sMOTSA40.6 | 5 | |
| Multi-Object Tracking and Segmentation | KITTI MOTS car (test) | HOTA56.63 | 4 |