Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MegaLoc: One Retrieval to Place Them All

About

Retrieving images from the same location as a given query is an important component of multiple computer vision tasks, like Visual Place Recognition, Landmark Retrieval, Visual Localization, 3D reconstruction, and SLAM. However, existing solutions are built to specifically work for one of these tasks, and are known to fail when the requirements slightly change or when they meet out-of-distribution data. In this paper we combine a variety of existing methods, training techniques, and datasets to train a retrieval model, called MegaLoc, that is performant on multiple tasks. We find that MegaLoc (1) achieves state of the art on a large number of Visual Place Recognition datasets, (2) impressive results on common Landmark Retrieval datasets, and (3) sets a new state of the art for Visual Localization on the LaMAR datasets, where we only changed the retrieval method to the existing localization pipeline. The code for MegaLoc is available at https://github.com/gmberton/MegaLoc

Gabriele Berton, Carlo Masone• 2025

Related benchmarks

TaskDatasetResultRank
Place RecognitionnuScenes (BS)
AR@186.39
18
Place RecognitionnuScenes (SON)
AR@180.88
17
Place RecognitionNCLT (Query: 2012-06-15, Database: 2012-01-08)
AR@181.28
16
Place RecognitionnuScenes Simulated Fog (SQ)
AR@159.58
16
Place RecognitionNCLT (Query: 2013-02-23, Database: 2012-01-08)
AR@10.6082
16
Multi-view Depth EstimationETH3D
Relative Error (rel)3.25
12
Place RecognitionSelf-collected dataset
AR@171.17
11
Camera pose estimationOn-the-Go Large noise
ATE0.0637
8
Camera pose estimationPhototourism Small noise
ATE0.2689
8
Camera pose estimationOn-the-Go Small noise
ATE0.0529
8
Showing 10 of 32 rows

Other info

Follow for update