Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization

About

Visual localization is the task of estimating a camera pose in a known environment. In this paper, we utilize 3D Gaussian Splatting (3DGS)-based representations for accurate and privacy-preserving visual localization. We propose Gaussian Splatting Feature Fields (GSFFs), a scene representation for visual localization that combines an explicit geometry model (3DGS) with an implicit feature field. We leverage the dense geometric information and differentiable rasterization algorithm from 3DGS to learn robust feature representations grounded in 3D. In particular, we align a 3D scale-aware feature field and a 2D feature encoder in a common embedding space through a contrastive framework. Using a 3D structure-informed clustering procedure, we further regularize the representation learning and seamlessly convert the features to segmentations, which can be used for privacy-preserving visual localization. Pose refinement, which involves aligning either feature maps or segmentations from a query image with those rendered from the GSFFs scene representation, is used to achieve localization. The resulting privacy- and non-privacy-preserving localization pipelines, evaluated on multiple real-world datasets, show state-of-the-art performances.

Maxime Pietrantoni, Gabriela Csurka, Torsten Sattler• 2025

Related benchmarks

TaskDatasetResultRank
Visual LocalizationIndoor-6 v1 (scene2a)
MPE (m)0.06
12
Visual LocalizationIndoor-6 v1 (scene1)
Mean Pose Error (m)0.1
12
Visual LocalizationIndoor-6 v1 (scene3)
Mean Pose Error (m)0.06
12
Visual LocalizationIndoor-6 scene5 v1
MPE (m)0.14
12
Visual LocalizationIndoor-6 v1 (average)
Mean Pose Error (m)0.08
12
Visual LocalizationIndoor-6 v1 (scene4a)
Mean Pose Error (m)0.07
12
Visual LocalizationIndoor-6 v1 (scene6)
MPE (m)0.07
12
Showing 7 of 7 rows

Other info

Follow for update