Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

RANSAC-Flow: generic two-stage image alignment

About

This paper considers the generic problem of dense alignment between two images, whether they be two frames of a video, two widely different views of a scene, two paintings depicting similar content, etc. Whereas each such task is typically addressed with a domain-specific solution, we show that a simple unsupervised approach performs surprisingly well across a range of tasks. Our main insight is that parametric and non-parametric alignment methods have complementary strengths. We propose a two-stage process: first, a feature-based parametric coarse alignment using one or more homographies, followed by non-parametric fine pixel-wise alignment. Coarse alignment is performed using RANSAC on off-the-shelf deep features. Fine alignment is learned in an unsupervised way by a deep network which optimizes a standard structural similarity metric (SSIM) between the two images, plus cycle-consistency. Despite its simplicity, our method shows competitive results on a range of tasks and datasets, including unsupervised optical flow on KITTI, dense correspondences on Hpatches, two-view geometry estimation on YFCC100M, localization on Aachen Day-Night, and, for the first time, fine alignment of artworks on the Brughel dataset. Our code and data are available at http://imagine.enpc.fr/~shenx/RANSAC-Flow/

Xi Shen, Fran\c{c}ois Darmon, Alexei A. Efros, Mathieu Aubry• 2020

Related benchmarks

TaskDatasetResultRank
Optical Flow EstimationKITTI 2015 (train)
Fl-epe12.48
431
Optical FlowKITTI 2012 (train)--
115
Geometric MatchingHPatches 240 x 240
AEE (I)0.51
33
Geometric MatchingMegaDepth (test)
PCK@153.47
22
Geometric MatchingHPatches
AEE (Avg)23.84
14
Geometric MatchingRobotCar (test)
PCK@12.1
9
Two-View Camera Pose EstimationYFCC100m 4 scenes
mAP @5°64.9
8
Two-view geometry estimationYFCC100M 61 (test)
mAP @5°64.88
7
Showing 8 of 8 rows

Other info

Follow for update