DINTR: Tracking via Diffusion-based Interpolation

About

Object tracking is a fundamental task in computer vision, requiring the localization of objects of interest across video frames. Diffusion models have shown remarkable capabilities in visual generation, making them well-suited for addressing several requirements of the tracking problem. This work proposes a novel diffusion-based methodology to formulate the tracking task. Firstly, their conditional process allows for injecting indications of the target object into the generation process. Secondly, diffusion mechanics can be developed to inherently model temporal correspondences, enabling the reconstruction of actual frames in video. However, existing diffusion models rely on extensive and unnecessary mapping to a Gaussian noise domain, which can be replaced by a more efficient and stable interpolation process. Our proposed interpolation mechanism draws inspiration from classic image-processing techniques, offering a more interpretable, stable, and faster approach tailored specifically for the object tracking task. By leveraging the strengths of diffusion models while circumventing their limitations, our Diffusion-based INterpolation TrackeR (DINTR) presents a promising new paradigm and achieves a superior multiplicity on seven benchmarks across five indicator representations.

Pha Nguyen, Ngan Le, Jackson Cothren, Alper Yilmaz, Khoa Luu• 2024

Related benchmarks

Task	Dataset	Result
Multiple Object Tracking	MOT17 (test)	MOTA78	1038
Video Object Segmentation	DAVIS 2017	Jaccard Index (J)72.5	82
Point Tracking	DAVIS TAP-Vid	Average Jaccard (AJ)62.3	52
Point Tracking	TAP-Vid Kinetics	Overall Accuracy89.4	48
Single Object Tracking	LaSoT	Success Rate70	15
Point Tracking	TAP-Vid RGB Stacking	Avg Displacement Error77.5	13
Point Tracking	Kubric TAP-Vid	Average Jaccard85.5	9
Pose Tracking	PoseTrack 2021 (test)	mAP82.5	8
Multi-Object Tracking and Segmentation	MOTS	sMOTSA67.4	6
Multiple Object Tracking with textual prompt input	GroOT	MOTA68.9	5

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord