SiLK -- Simple Learned Keypoints
About
Keypoint detection & descriptors are foundational tech-nologies for computer vision tasks like image matching, 3D reconstruction and visual odometry. Hand-engineered methods like Harris corners, SIFT, and HOG descriptors have been used for decades; more recently, there has been a trend to introduce learning in an attempt to improve keypoint detectors. On inspection however, the results are difficult to interpret; recent learning-based methods employ a vast diversity of experimental setups and design choices: empirical results are often reported using different backbones, protocols, datasets, types of supervisions or tasks. Since these differences are often coupled together, it raises a natural question on what makes a good learned keypoint detector. In this work, we revisit the design of existing keypoint detectors by deconstructing their methodologies and identifying the key components. We re-design each component from first-principle and propose Simple Learned Keypoints (SiLK) that is fully-differentiable, lightweight, and flexible. Despite its simplicity, SiLK advances new state-of-the-art on Detection Repeatability and Homography Estimation tasks on HPatches and 3D Point-Cloud Registration task on ScanNet, and achieves competitive performance to state-of-the-art on camera pose estimation in 2022 Image Matching Challenge and ScanNet.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Relative Pose Estimation | ScanNet 1500 pairs (test) | AUC@5°18 | 48 | |
| Relative Pose Estimation | MegaDepth 1500 pairs (test) | AUC@5°43.8 | 17 | |
| Relative Pose Estimation | MegaDepth 1500 outdoor pairs (test) | AUC@5°43.8 | 17 | |
| Relative Pose Estimation | Map-free dataset (test) | VCRE AUC0.31 | 15 | |
| Relative Camera Pose Estimation | Megadepth-1500 1.0 (test) | AUC@5°16.2 | 10 | |
| Two-View Geometry Matching | IMC 2022 25 | mAA @ 1068.6 | 8 | |
| Homography Estimation | HPatches illumination | MHA @ 3px78.5 | 7 |