Fusion of Head and Full-Body Detectors for Multi-Object Tracking
About
In order to track all persons in a scene, the tracking-by-detection paradigm has proven to be a very effective approach. Yet, relying solely on a single detector is also a major limitation, as useful image information might be ignored. Consequently, this work demonstrates how to fuse two detectors into a tracking system. To obtain the trajectories, we propose to formulate tracking as a weighted graph labeling problem, resulting in a binary quadratic program. As such problems are NP-hard, the solution can only be approximated. Based on the Frank-Wolfe algorithm, we present a new solver that is crucial to handle such difficult problems. Evaluation on pedestrian tracking is provided for multiple scenarios, showing superior results over single detector tracking and standard QP-solvers. Finally, our tracker ranks 2nd on the MOT16 benchmark and 1st on the new MOT17 benchmark, outperforming over 90 trackers.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multiple Object Tracking | MOT17 (test) | MOTA51.3 | 921 | |
| Multi-Object Tracking | MOT16 (test) | MOTA47.8 | 228 | |
| Multi-Object Tracking | MOT17 1.0 (test) | MOTA51.3 | 48 | |
| Multi-Object Tracking and Segmentation | MOTSChallenge (leaving-one-out fashion) | sMOTSA49.3 | 6 |