Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Co-Attention for Conditioned Image Matching

About

We propose a new approach to determine correspondences between image pairs in the wild under large changes in illumination, viewpoint, context, and material. While other approaches find correspondences between pairs of images by treating the images independently, we instead condition on both images to implicitly take account of the differences between them. To achieve this, we introduce (i) a spatial attention mechanism (a co-attention module, CoAM) for conditioning the learned features on both images, and (ii) a distinctiveness score used to choose the best matches at test time. CoAM can be added to standard architectures and trained using self-supervision or supervised data, and achieves a significant performance improvement under hard conditions, e.g. large viewpoint changes. We demonstrate that models using CoAM achieve state of the art or competitive results on a wide range of tasks: local matching, camera localization, 3D reconstruction, and image stylization.

Olivia Wiles, Sebastien Ehrhardt, Andrew Zisserman• 2020

Related benchmarks

TaskDatasetResultRank
Sparse 3D ReconstructionETH Local Feature Benchmark Madrid Metropolis v1.0
nReg702
17
3D ReconstructionETH local feature benchmark Gendarmenmarkt
Image Count1.07e+3
16
3D ReconstructionETH local feature benchmark Tower of London
Image Count804
16
3D ReconstructionMadrid Metropolis
Reg Images Count702
11
Sparse 3D ReconstructionETH Local Feature Benchmark Gendarmenmarkt v1.0
N Reg1.07e+3
8
Sparse 3D ReconstructionETH Local Feature Benchmark Tower of London v1.0
nReg804
8
Two-View Camera Pose EstimationYFCC100m 4 scenes
mAP @5°55.6
8
Two-view geometry estimationYFCC100M 61 (test)
mAP @5°55.58
7
Showing 8 of 8 rows

Other info

Follow for update