Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ASpanFormer: Detector-Free Image Matching with Adaptive Span Transformer

About

Generating robust and reliable correspondences across images is a fundamental task for a diversity of applications. To capture context at both global and local granularity, we propose ASpanFormer, a Transformer-based detector-free matcher that is built on hierarchical attention structure, adopting a novel attention operation which is capable of adjusting attention span in a self-adaptive manner. To achieve this goal, first, flow maps are regressed in each cross attention phase to locate the center of search region. Next, a sampling grid is generated around the center, whose size, instead of being empirically configured as fixed, is adaptively computed from a pixel uncertainty estimated along with the flow map. Finally, attention is computed across two images within derived regions, referred to as attention span. By these means, we are able to not only maintain long-range dependencies, but also enable fine-grained attention among pixels of high relevance that compensates essential locality and piece-wise smoothness in matching tasks. State-of-the-art accuracy on a wide range of evaluation benchmarks validates the strong matching capability of our method.

Hongkai Chen, Zixin Luo, Lei Zhou, Yurun Tian, Mingmin Zhen, Tian Fang, David Mckinnon, Yanghai Tsin, Long Quan• 2022

Related benchmarks

TaskDatasetResultRank
Relative Pose EstimationMegaDepth 1500
AUC @ 5°55.3
104
Relative Pose EstimationMegaDepth (test)
Pose AUC @5°60.92
83
Homography EstimationHPatches
Overall Accuracy (< 1px)46
59
Homography EstimationHPatches
AUC @3px67.4
35
Visual LocalizationAachen Day-Night 1.1 (Day)
Success Rate (0.25m, 2°)89.4
28
Pose EstimationMegaDepth 1500 (test)
AUC @ 5°55.3
27
Homography EstimationHPatches (viewpoint)
Accuracy (<1px)22
27
Pose EstimationScanNet 1500 (test)
AUC@5°25.6
26
Visual LocalizationAachen Day-Night 1.1 (Night)
SR @ 0.25m, 2°77.5
22
Feature MatchingMegaDepth extreme-scale (test)
Matching Coverage83.7
20
Showing 10 of 42 rows

Other info

Follow for update