Learning Association via Track-Detection Matching for Multi-Object Tracking

About

Multi-object tracking aims to maintain object identities over time by associating detections across video frames. Two dominant paradigms exist in the literature: tracking-by-detection methods, which are computationally efficient but rely on handcrafted association heuristics, and end-to-end approaches, which learn association from data at the cost of higher computational complexity. We propose Track-Detection Link Prediction (TDLP), a tracking-by-detection method that performs per-frame association via link prediction between tracks and detections, i.e., by predicting the correct continuation of each track at every frame. TDLP is designed primarily for geometric features such as bounding boxes and can optionally incorporate additional cues, including pose and appearance. Unlike heuristic-based methods, TDLP learns association directly from data without handcrafted rules, while remaining modular and computationally efficient compared to end-to-end trackers. Extensive experiments on multiple benchmarks demonstrate that TDLP consistently outperforms state-of-the-art tracking-by-detection and end-to-end methods. Finally, we provide a detailed analysis comparing link prediction with metric learning-based association and show that link prediction is more effective, particularly when handling heterogeneous features such as detection bounding boxes. Our code is available at https://github.com/Robotmurlock/TDLP.
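
To make the idea of per-frame association from link predictions concrete, here is a minimal sketch. It is not TDLP's implementation: the abstract does not say how predicted links are turned into a final assignment, so the greedy resolution, the 0.5 threshold, the hand-written score matrix, and the function name `associate` are all assumptions made for illustration. In practice the scores would come from a learned link predictor.

```python
from typing import Dict, Sequence


def associate(link_scores: Sequence[Sequence[float]],
              threshold: float = 0.5) -> Dict[int, int]:
    """Greedily match tracks (rows) to detections (columns).

    link_scores[t][d] is an assumed probability that detection d
    continues track t. Pairs below `threshold` are never matched.
    Hypothetical helper, not part of the TDLP code base.
    """
    # Collect every track-detection pair that clears the threshold.
    candidates = [
        (score, t, d)
        for t, row in enumerate(link_scores)
        for d, score in enumerate(row)
        if score >= threshold
    ]
    # Highest-scoring links are considered first.
    candidates.sort(key=lambda c: c[0], reverse=True)

    matches: Dict[int, int] = {}
    used_detections = set()
    for _, t, d in candidates:
        # Each track and each detection can be used at most once.
        if t in matches or d in used_detections:
            continue
        matches[t] = d
        used_detections.add(d)
    return matches


# Two existing tracks, three detections in the current frame.
scores = [
    [0.90, 0.10, 0.05],
    [0.20, 0.30, 0.80],
]
matches = associate(scores)
# Detections left unmatched could start new tracks.
unmatched_detections = [d for d in range(3) if d not in matches.values()]
```

Here track 0 is continued by detection 0 and track 1 by detection 2, while detection 1 remains unmatched. An optimal bipartite assignment could replace the greedy loop; which strategy the authors use is not stated in the abstract.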

Momir Adžemović • 2025

Related benchmarks

Task                      Dataset             Result       Rank
Multiple Object Tracking  MOT17 (test)        MOTA 78      921
Multi-Object Tracking     DanceTrack (test)   HOTA 0.701   355
Multi-Object Tracking     SportsMOT (test)    HOTA 81.9    199
Multi-Object Tracking     SoccerNet (test)    HOTA 56.3    23
Multi-Object Tracking     BEE24 (test)        HOTA 51.9    11
