MegaLoc: One Retrieval to Place Them All

About

Retrieving images from the same location as a given query is an important component of multiple computer vision tasks, like Visual Place Recognition, Landmark Retrieval, Visual Localization, 3D reconstruction, and SLAM. However, existing solutions are built to specifically work for one of these tasks, and are known to fail when the requirements slightly change or when they meet out-of-distribution data. In this paper we combine a variety of existing methods, training techniques, and datasets to train a retrieval model, called MegaLoc, that is performant on multiple tasks. We find that MegaLoc (1) achieves state of the art on a large number of Visual Place Recognition datasets, (2) impressive results on common Landmark Retrieval datasets, and (3) sets a new state of the art for Visual Localization on the LaMAR datasets, where we only changed the retrieval method to the existing localization pipeline. The code for MegaLoc is available at https://github.com/gmberton/MegaLoc

Gabriele Berton, Carlo Masone• 2025

Related benchmarks

Task	Dataset	Result
Visual Place Recognition	MSLS (val)	Recall@193.5	305
Visual Place Recognition	Tokyo24/7	Recall@196.5	229
Visual Place Recognition	Nordland	Recall@194.2	169
Visual Place Recognition	Pittsburgh30k (test)	Recall@194.1	106
Visual Place Recognition	Oxford RobotCar (Dusk)	Recall@191.6	78
Place Recognition	nuScenes (BS)	AR@186.39	25
Place Recognition	nuScenes (SON)	AR@180.88	24
Visual Place Recognition	Nordland subsampled 2.5m (Fall query)	Recall@183.5	23
Multi-view Depth Estimation	ETH3D	Relative Error (rel)3.25	21
Place Recognition	NCLT (Query: 2012-06-15, Database: 2012-01-08)	AR@181.28	16

Showing 10 of 38 rows

Other info

Follow for update

@wizwand_team Discord