Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ALIKED: A Lighter Keypoint and Descriptor Extraction Network via Deformable Transformation

About

Image keypoints and descriptors play a crucial role in many visual measurement tasks. In recent years, deep neural networks have been widely used to improve the performance of keypoint and descriptor extraction. However, the conventional convolution operations do not provide the geometric invariance required for the descriptor. To address this issue, we propose the Sparse Deformable Descriptor Head (SDDH), which learns the deformable positions of supporting features for each keypoint and constructs deformable descriptors. Furthermore, SDDH extracts descriptors at sparse keypoints instead of a dense descriptor map, which enables efficient extraction of descriptors with strong expressiveness. In addition, we relax the neural reprojection error (NRE) loss from dense to sparse to train the extracted sparse descriptors. Experimental results show that the proposed network is both efficient and powerful in various visual measurement tasks, including image matching, 3D reconstruction, and visual relocalization.

Xiaoming Zhao, Xingming Wu, Weihai Chen, Peter C. Y. Chen, Qingsong Xu, Zhengguo Li• 2023

Related benchmarks

TaskDatasetResultRank
Relative Pose EstimationMegaDepth 1500
AUC @ 5°61.1
104
Homography EstimationHPatches
Overall Accuracy (< 1px)51.67
59
Visual LocalizationAachen Day-Night v1.1 (Night)
Success Rate (0.25m, 2°)77.5
58
Pose EstimationKITTI odometry
AUC582.14
51
Visual LocalizationAachen Day-Night v1.1 (Day)
SR (0.25m, 2°)89.1
50
Pose EstimationGraz4K (test)
AUC@555
29
Pose EstimationMegaDepth 1500 (test)
AUC @ 5°41.8
27
Relative Pose EstimationIMC 2022 (Private)
mAA63
24
Relative Pose EstimationIMC 2022 (Public)
mAA0.632
24
Local Feature Extraction EfficiencyNVIDIA Jetson Orin-NX
GFLOPs1.37
24
Showing 10 of 31 rows

Other info

Follow for update