SuperGlue: Learning Feature Matching with Graph Neural Networks

About

This paper introduces SuperGlue, a neural network that matches two sets of local features by jointly finding correspondences and rejecting non-matchable points. Assignments are estimated by solving a differentiable optimal transport problem, whose costs are predicted by a graph neural network. We introduce a flexible context aggregation mechanism based on attention, enabling SuperGlue to reason about the underlying 3D scene and feature assignments jointly. Compared to traditional, hand-designed heuristics, our technique learns priors over geometric transformations and regularities of the 3D world through end-to-end training from image pairs. SuperGlue outperforms other learned approaches and achieves state-of-the-art results on the task of pose estimation in challenging real-world indoor and outdoor environments. The proposed method performs matching in real-time on a modern GPU and can be readily integrated into modern SfM or SLAM systems. The code and trained weights are publicly available at https://github.com/magicleap/SuperGluePretrainedNetwork.

Paul-Edouard Sarlin, Daniel DeTone, Tomasz Malisiewicz, Andrew Rabinovich• 2019

Related benchmarks

Task	Dataset	Result
Visual Place Recognition	MSLS (val)	Recall@178.1	305
Visual Place Recognition	Tokyo24/7	Recall@188.2	229
Visual Place Recognition	Pitts30k	Recall@188.7	170
Visual Place Recognition	Nordland	Recall@129.1	163
Visual Place Recognition	MSLS Challenge	Recall@150.6	156
Relative Pose Estimation	MegaDepth 1500	AUC @ 20°76.5	151
Image Retrieval	Revisited Paris (RPar) (Hard)	mAP70.4	115
Visual Place Recognition	Pittsburgh30k (test)	Recall@187.2	106
Image Retrieval	Revisited Paris (RPar) (Medium)	mAP86.2	100
Relative Pose Estimation	MegaDepth (test)	Pose AUC @5°42.2	83

Showing 10 of 167 rows

...

Other info

Follow for update

@wizwand_team Discord