Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Dense Optical Tracking: Connecting the Dots

About

Recent approaches to point tracking are able to recover the trajectory of any scene point through a large portion of a video despite the presence of occlusions. They are, however, too slow in practice to track every point observed in a single frame in a reasonable amount of time. This paper introduces DOT, a novel, simple and efficient method for solving this problem. It first extracts a small set of tracks from key regions at motion boundaries using an off-the-shelf point tracking algorithm. Given source and target frames, DOT then computes rough initial estimates of a dense flow field and visibility mask through nearest-neighbor interpolation, before refining them using a learnable optical flow estimator that explicitly handles occlusions and can be trained on synthetic data with ground-truth correspondences. We show that DOT is significantly more accurate than current optical flow techniques, outperforms sophisticated "universal" trackers like OmniMotion, and is on par with, or better than, the best point tracking algorithms like CoTracker while being at least two orders of magnitude faster. Quantitative and qualitative experiments with synthetic and real videos validate the promise of the proposed approach. Code, data, and videos showcasing the capabilities of our approach are available in the project webpage: https://16lemoing.github.io/dot .

Guillaume Le Moing, Jean Ponce, Cordelia Schmid• 2023

Related benchmarks

TaskDatasetResultRank
Point TrackingDAVIS
AJ60.1
38
Point TrackingTAP-Vid-Kinetics (val)
Average Displacement Error63.8
25
2D Long-range optical flowCVO 7 frames (Final)
EPE (all)1.34
16
2D Long-range optical flowCVO Clean 7 frames
EPE (all)1.29
16
Point TrackingRGB-Stacking
Average Delta87.7
13
2D Long-range optical flowCVO Extended (48 frames)
EPE (all)4.98
10
Showing 6 of 6 rows

Other info

Follow for update