Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DISK: Learning local features with policy gradient

About

Local feature frameworks are difficult to learn in an end-to-end fashion, due to the discreteness inherent to the selection and matching of sparse keypoints. We introduce DISK (DIScrete Keypoints), a novel method that overcomes these obstacles by leveraging principles from Reinforcement Learning (RL), optimizing end-to-end for a high number of correct feature matches. Our simple yet expressive probabilistic model lets us keep the training and inference regimes close, while maintaining good enough convergence properties to reliably train from scratch. Our features can be extracted very densely while remaining discriminative, challenging commonly held assumptions about what constitutes a good keypoint, as showcased in Fig. 1, and deliver state-of-the-art results on three public benchmarks.

Micha{\l} J. Tyszkiewicz, Pascal Fua, Eduard Trulls• 2020

Related benchmarks

TaskDatasetResultRank
Semantic CorrespondencePF-WILLOW
PCK@0.1 (bbox)17
109
Relative Pose EstimationMegaDepth 1500
AUC @ 5°54.68
104
Relative Pose EstimationMegaDepth (test)
Pose AUC @5°45.31
83
Homography EstimationHPatches
Overall Accuracy (< 1px)51.3
59
Visual LocalizationAachen Day-Night v1.1 (Night)
Success Rate (0.25m, 2°)78
58
Pose EstimationKITTI odometry
AUC584.14
51
Visual LocalizationAachen Day-Night v1.1 (Day)
SR (0.25m, 2°)87.3
50
Image MatchingKinect 1
MS0.53
38
Image MatchingDeSurT (833 pairs total)
MS Score44
38
Image MatchingKinect 2
Matching Score (MS)0.52
38
Showing 10 of 58 rows

Other info

Code

Follow for update