Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TransCenter: Transformers with Dense Representations for Multiple-Object Tracking

About

Transformers have proven superior performance for a wide variety of tasks since they were introduced. In recent years, they have drawn attention from the vision community in tasks such as image classification and object detection. Despite this wave, an accurate and efficient multiple-object tracking (MOT) method based on transformers is yet to be designed. We argue that the direct application of a transformer architecture with quadratic complexity and insufficient noise-initialized sparse queries - is not optimal for MOT. We propose TransCenter, a transformer-based MOT architecture with dense representations for accurately tracking all the objects while keeping a reasonable runtime. Methodologically, we propose the use of image-related dense detection queries and efficient sparse tracking queries produced by our carefully designed query learning networks (QLN). On one hand, the dense image-related detection queries allow us to infer targets' locations globally and robustly through dense heatmap outputs. On the other hand, the set of sparse tracking queries efficiently interacts with image features in our TransCenter Decoder to associate object positions through time. As a result, TransCenter exhibits remarkable performance improvements and outperforms by a large margin the current state-of-the-art methods in two standard MOT benchmarks with two tracking settings (public/private). TransCenter is also proven efficient and accurate by an extensive ablation study and comparisons to more naive alternatives and concurrent works. For scientific interest, the code is made publicly available at https://github.com/yihongxu/transcenter.

Yihong Xu, Yutong Ban, Guillaume Delorme, Chuang Gan, Daniela Rus, Xavier Alameda-Pineda• 2021

Related benchmarks

TaskDatasetResultRank
Multiple Object TrackingMOT17 (test)
MOTA79.8
921
Multiple Object TrackingMOT20 (test)
MOTA77.9
358
Multi-Object TrackingMOT17 1.0 (test)
MOTA70
48
Multi-Object TrackingMOT 2020 (test)
MOTA61.9
44
Multi-Object TrackingBFT 1.0 (test)
Detection Accuracy66
37
Multi-Object TrackingMOT20 Private detections (test)
IDF158.7
24
Multiple Object TrackingMOT20
MOTA58.5
21
Multi-Object TrackingKITTI Cars (test)
MOTA87.3
20
Multi-Object Tracking (Person)KITTI (test)
MOTA59.1
10
Multi-Object TrackingMOT20 1.0 (test)
IDF150.4
4
Showing 10 of 10 rows

Other info

Code

Follow for update