TransTrack: Multiple Object Tracking with Transformer
About
In this work, we propose TransTrack, a simple but efficient scheme to solve the multiple object tracking problems. TransTrack leverages the transformer architecture, which is an attention-based query-key mechanism. It applies object features from the previous frame as a query of the current frame and introduces a set of learned object queries to enable detecting new-coming objects. It builds up a novel joint-detection-and-tracking paradigm by accomplishing object detection and object association in a single shot, simplifying complicated multi-step settings in tracking-by-detection methods. On MOT17 and MOT20 benchmark, TransTrack achieves 74.5\% and 64.5\% MOTA, respectively, competitive to the state-of-the-art methods. We expect TransTrack to provide a novel perspective for multiple object tracking. The code is available at: \url{https://github.com/PeizeSun/TransTrack}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multiple Object Tracking | MOT17 (test) | MOTA75.2 | 921 | |
| Multiple Object Tracking | MOT20 (test) | MOTA65 | 358 | |
| Multi-Object Tracking | DanceTrack (test) | HOTA0.455 | 355 | |
| Multi-Object Tracking | SportsMOT (test) | HOTA68.9 | 199 | |
| Multi-Object Tracking | MOT 2016 (test) | MOTA74.5 | 59 | |
| Multi-Object Tracking | MOT17 | MOTA74.5 | 55 | |
| Multi-Object Tracking | MOT17 1.0 (test) | MOTA74.5 | 48 | |
| Multi-Object Tracking | MOT 2020 (test) | MOTA65 | 44 | |
| Multi-Object Tracking | BFT 1.0 (test) | Detection Accuracy64.2 | 37 | |
| Multi-Object Tracking | MOT 2017 (test) | MOTA74.5 | 34 |