ASLFeat: Learning Local Features of Accurate Shape and Localization
About
This work focuses on mitigating two limitations in the joint learning of local feature detectors and descriptors. First, the ability to estimate the local shape (scale, orientation, etc.) of feature points is often neglected during dense feature extraction, while the shape-awareness is crucial to acquire stronger geometric invariance. Second, the localization accuracy of detected keypoints is not sufficient to reliably recover camera geometry, which has become the bottleneck in tasks such as 3D reconstruction. In this paper, we present ASLFeat, with three light-weight yet effective modifications to mitigate above issues. First, we resort to deformable convolutional networks to densely estimate and apply local transformation. Second, we take advantage of the inherent feature hierarchy to restore spatial resolution and low-level details for accurate keypoint localization. Finally, we use a peakiness measurement to relate feature responses and derive more indicative detection scores. The effect of each modification is thoroughly studied, and the evaluation is extensively conducted across a variety of practical scenarios. State-of-the-art results are reported that demonstrate the superiority of our methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Relative Pose Estimation | MegaDepth (test) | Pose AUC @5°33.8 | 83 | |
| Visual Localization | Aachen Day-Night v1.1 (Day) | SR (0.25m, 2°)88 | 70 | |
| Image Matching | DeSurT (833 pairs total) | MS Score28 | 38 | |
| Image Matching | Kinect 2 | Matching Score (MS)0.39 | 38 | |
| Image Matching | Kinect 1 | MS0.31 | 38 | |
| Image Matching | Simulation | MS19 | 38 | |
| 3D Reconstruction | ETH local feature benchmark Tower of London | Track Length12.52 | 24 | |
| 3D Reconstruction | ETH local feature benchmark Gendarmenmarkt | Track Length8.72 | 24 | |
| Visual Localization | Aachen Day-Night v1.1 (test) | Success Rate (0.25m, 2°)72.3 | 24 | |
| Image Matching | HPatches (full) | MMA (Viewpoint)34 | 21 |