SFD2: Semantic-guided Feature Detection and Description
About
Visual localization is a fundamental task for various applications including autonomous driving and robotics. Prior methods focus on extracting large amounts of often redundant locally reliable features, resulting in limited efficiency and accuracy, especially in large-scale environments under challenging conditions. Instead, we propose to extract globally reliable features by implicitly embedding high-level semantics into both the detection and description processes. Specifically, our semantic-aware detector is able to detect keypoints from reliable regions (e.g. building, traffic lane) and suppress unreliable areas (e.g. sky, car) implicitly instead of relying on explicit semantic labels. This boosts the accuracy of keypoint matching by reducing the number of features sensitive to appearance changes and avoiding the need of additional segmentation networks at test time. Moreover, our descriptors are augmented with semantics and have stronger discriminative ability, providing more inliers at test time. Particularly, experiments on long-term large-scale visual localization Aachen Day-Night and RobotCar-Seasons datasets demonstrate that our model outperforms previous local features and gives competitive accuracy to advanced matchers but is about 2 and 3 times faster when using 2k and 4k keypoints, respectively.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Visual Localization | Aachen Day-Night v1.1 (Day) | SR (0.25m, 2°)88.2 | 70 | |
| Pose Estimation | MegaDepth 1500 (test) | AUC @ 5°45 | 38 | |
| 3D Reconstruction | ETH local feature benchmark Gendarmenmarkt | Track Length5.2 | 24 | |
| 3D Reconstruction | ETH local feature benchmark Tower of London | Track Length4.54 | 24 | |
| 3D Reconstruction | Madrid Metropolis | Track Length4.74 | 19 | |
| Visual Localization | Aachen Day-Night v1.0 (Night) | Success Rate (0.25m, 2°)87.8 | 17 | |
| 3D Reconstruction | ETH Herzjesu Small-Scale | Track Length6.21 | 16 | |
| Visual Localization | Aachen Day-Night v1.0 (Day) | Success Rate (0.25m, 2°)88.2 | 14 | |
| Visual Localization | Aachen Day-Night v1.1 | Success Rate (2°, 0.25m)78 | 12 | |
| Pose Estimation | ScanNet (test) | AUC@5°12.1 | 11 |