Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

UFM: A Simple Path towards Unified Dense Correspondence with Flow

About

Dense image correspondence is central to many applications, such as visual odometry, 3D reconstruction, object association, and re-identification. Historically, dense correspondence has been tackled separately for wide-baseline scenarios and optical flow estimation, despite the common goal of matching content between two images. In this paper, we develop a Unified Flow & Matching model (UFM), which is trained on unified data for pixels that are co-visible in both source and target images. UFM uses a simple, generic transformer architecture that directly regresses the (u,v) flow. It is easier to train and more accurate for large flows compared to the typical coarse-to-fine cost volumes in prior work. UFM is 28% more accurate than state-of-the-art flow methods (Unimatch), while also having 62% less error and 6.7x faster than dense wide-baseline matchers (RoMa). UFM is the first to demonstrate that unified training can outperform specialized approaches across both domains. This result enables fast, general-purpose correspondence and opens new directions for multi-modal, long-range, and real-time correspondence tasks.

Yuchen Zhang, Nikhil Keetha, Chenwei Lyu, Bhuvan Jhamb, Yutian Chen, Yuheng Qiu, Jay Karhade, Shreyas Jha, Yaoyu Hu, Deva Ramanan, Sebastian Scherer, Wenshan Wang• 2025

Related benchmarks

TaskDatasetResultRank
Optical FlowSintel (train)
AEPE (Clean)1.15
200
Relative Pose EstimationMegaDepth 1500
AUC @ 20°72.4
151
Optical FlowKITTI (train)
Fl-all0.11
84
Relative Pose EstimationScanNet 1500
AUC@5°31.3
30
Feature MatchingWxBS
mAA (10px)53.3
30
Image MatchingHardMatch
mAA@10px33.9
18
Correspondence EvaluationHardMatch
PCK@5px35.2
8
Showing 7 of 7 rows

Other info

Follow for update