Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed

About

We present a novel method for efficiently producing semi-dense matches across images. Previous detector-free matcher LoFTR has shown remarkable matching capability in handling large-viewpoint change and texture-poor scenarios but suffers from low efficiency. We revisit its design choices and derive multiple improvements for both efficiency and accuracy. One key observation is that performing the transformer over the entire feature map is redundant due to shared local information, therefore we propose an aggregated attention mechanism with adaptive token selection for efficiency. Furthermore, we find spatial variance exists in LoFTR's fine correlation module, which is adverse to matching accuracy. A novel two-stage correlation layer is proposed to achieve accurate subpixel correspondences for accuracy improvement. Our efficiency optimized model is $\sim 2.5\times$ faster than LoFTR which can even surpass state-of-the-art efficient sparse matching pipeline SuperPoint + LightGlue. Moreover, extensive experiments show that our method can achieve higher accuracy compared with competitive semi-dense matchers, with considerable efficiency benefits. This opens up exciting prospects for large-scale or latency-sensitive applications such as image retrieval and 3D reconstruction. Project page: https://zju3dv.github.io/efficientloftr.

Yifan Wang, Xingyi He, Sida Peng, Dongli Tan, Xiaowei Zhou• 2024

Related benchmarks

TaskDatasetResultRank
Relative Pose EstimationMegaDepth 1500
AUC @ 5°56.4
104
Relative Pose EstimationScanNet 1500 pairs (test)
AUC@5°19.2
48
Homography EstimationHPatches
AUC @3px66.5
35
Visual LocalizationAachen Day-Night 1.1 (Day)
Success Rate (0.25m, 2°)89.6
28
Visual LocalizationAachen Day-Night 1.1 (Night)
SR @ 0.25m, 2°76.4
22
Visual LocalizationSwiss-EPFL out-of-Place
Recall (2m, 2°)81.53
18
Aerial Visual LocalizationUAVD4L LoD (in-Traj.)
Accuracy (2m-2°)84.47
18
Visual LocalizationSwiss-EPFL in-Place
Recall (2m-2°)37.64
18
Aerial Visual LocalizationUAVD4L LoD (out-of-Traj.)
2m-2° Success Rate91.06
18
Relative Pose EstimationMegaDepth View
AUC @ 5°60.2
17
Showing 10 of 21 rows

Other info

Code

Follow for update