Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning to Find Good Correspondences

About

We develop a deep architecture to learn to find good correspondences for wide-baseline stereo. Given a set of putative sparse matches and the camera intrinsics, we train our network in an end-to-end fashion to label the correspondences as inliers or outliers, while simultaneously using them to recover the relative pose, as encoded by the essential matrix. Our architecture is based on a multi-layer perceptron operating on pixel coordinates rather than directly on the image, and is thus simple and small. We introduce a novel normalization technique, called Context Normalization, which allows us to process each data point separately while imbuing it with global information, and also makes the network invariant to the order of the correspondences. Our experiments on multiple challenging datasets demonstrate that our method is able to drastically improve the state of the art with little training data.

Kwang Moo Yi, Eduard Trulls, Yuki Ono, Vincent Lepetit, Mathieu Salzmann, Pascal Fua• 2017

Related benchmarks

TaskDatasetResultRank
Camera pose estimationSUN3D (Known Scene)
mAP @ 5°20.6
58
Camera pose estimationYFCC100M (Unknown Scene)
mAP @ 5°48.03
36
Camera pose estimationSUN3D Unknown Scene
mAP (5°)16.4
36
Camera pose estimationYFCC100M Known Scene
mAP @ 5°34.55
36
Outlier removalSUN3D (Known Scene)
Precision53.7
28
Outlier removalYFCC100M Known Scene
Precision54.43
28
Relative Pose EstimationYFCC100M (Unknown Scene)
mAP AUC@5°23.71
22
Relative Pose EstimationSUN3D Unknown Scene
mAP AUC @ 5 deg9.73
22
Outlier removalYFCC100M
Precision0.5284
19
Outlier removalSUN3D
Precision46.11
19
Showing 10 of 15 rows

Other info

Follow for update