SiLK -- Simple Learned Keypoints

About

Keypoint detection & descriptors are foundational tech-nologies for computer vision tasks like image matching, 3D reconstruction and visual odometry. Hand-engineered methods like Harris corners, SIFT, and HOG descriptors have been used for decades; more recently, there has been a trend to introduce learning in an attempt to improve keypoint detectors. On inspection however, the results are difficult to interpret; recent learning-based methods employ a vast diversity of experimental setups and design choices: empirical results are often reported using different backbones, protocols, datasets, types of supervisions or tasks. Since these differences are often coupled together, it raises a natural question on what makes a good learned keypoint detector. In this work, we revisit the design of existing keypoint detectors by deconstructing their methodologies and identifying the key components. We re-design each component from first-principle and propose Simple Learned Keypoints (SiLK) that is fully-differentiable, lightweight, and flexible. Despite its simplicity, SiLK advances new state-of-the-art on Detection Repeatability and Homography Estimation tasks on HPatches and 3D Point-Cloud Registration task on ScanNet, and achieves competitive performance to state-of-the-art on camera pose estimation in 2022 Image Matching Challenge and ScanNet.

Pierre Gleize, Weiyao Wang, Matt Feiszli• 2023

Related benchmarks

Task	Dataset	Result
Homography Estimation	HPatches	Overall Accuracy (< 1px)62	81
Relative Pose Estimation	ScanNet 1500 pairs (test)	AUC@5°18	56
Visual Localization	IMC 2022	Mean Localization Accuracy56	22
Visual Localization	Aachen Day	Localization Accuracy (0.5m, 5°)87.2	22
Visual Localization	Aachen (Night)	Localization Accuracy (0.5m, 5°)74.5	22
Relative Pose Estimation	MegaDepth 1500 pairs (test)	AUC@5°43.8	17
Relative Pose Estimation	MegaDepth 1500 outdoor pairs (test)	AUC@5°43.8	17
Relative Pose Estimation	Map-free dataset (test)	VCRE AUC0.31	15
Relative Camera Pose Estimation	Megadepth-1500 1.0 (test)	AUC@5°16.2	10
Two-View Geometry Matching	IMC 2022 25	mAA @ 1068.6	8

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord