MOTS: Multi-Object Tracking and Segmentation

About

This paper extends the popular task of multi-object tracking to multi-object tracking and segmentation (MOTS). Towards this goal, we create dense pixel-level annotations for two existing tracking datasets using a semi-automatic annotation procedure. Our new annotations comprise 65,213 pixel masks for 977 distinct objects (cars and pedestrians) in 10,870 video frames. For evaluation, we extend existing multi-object tracking metrics to this new task. Moreover, we propose a new baseline method which jointly addresses detection, tracking, and segmentation with a single convolutional network. We demonstrate the value of our datasets by achieving improvements in performance when training on MOTS annotations. We believe that our datasets, metrics and baseline will become a valuable resource towards developing multi-object tracking approaches that go beyond 2D bounding boxes. We make our annotations, code, and models available at https://www.vision.rwth-aachen.de/page/mots.

Paul Voigtlaender, Michael Krause, Aljosa Osep, Jonathon Luiten, Berin Balachandar Gnana Sekar, Andreas Geiger, Bastian Leibe• 2019

Related benchmarks

Task	Dataset	Result
Multi-Object Tracking	KITTI Tracking (test)	MOTA84.83	56
Multiple Object Tracking	2D MOT15 (test)	MOTA69.2	34
Multi-Object Tracking and Segmentation	KITTI MOTS car (test)	HOTA56.63	24
Multi-Object Tracking and Segmentation	KITTI MOTS pedestrian (test)	HOTA41.93	24
Instance Segmentation Tracking	MOTS (test)	IDF142.4	21
Multi-Object Tracking and Segmentation	KITTI MOTS (val)	sMOTSA (Car)76.2	18
Multi-Object Tracking and Segmentation	MOTS 4-fold cross-validation 2020 (train)	sMOTSA52.7	8
Multiple Object Tracking and Segmentation	MOTS20 (test)	sMOTSA40.6	8
Multi-Object Tracking and Segmentation	MOTSChallenge (leaving-one-out fashion)	sMOTSA52.7	6
Multi-Object Tracking and Segmentation	KITTI MOTS (test)	sMOTSA (Pedestrians)47.3	6

Showing 10 of 12 rows

Other info

Code

Follow for update

@wizwand_team Discord