Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

NICER-SLAM: Neural Implicit Scene Encoding for RGB SLAM

About

Neural implicit representations have recently become popular in simultaneous localization and mapping (SLAM), especially in dense visual SLAM. However, previous works in this direction either rely on RGB-D sensors, or require a separate monocular SLAM approach for camera tracking and do not produce high-fidelity dense 3D scene reconstruction. In this paper, we present NICER-SLAM, a dense RGB SLAM system that simultaneously optimizes for camera poses and a hierarchical neural implicit map representation, which also allows for high-quality novel view synthesis. To facilitate the optimization process for mapping, we integrate additional supervision signals including easy-to-obtain monocular geometric cues and optical flow, and also introduce a simple warping loss to further enforce geometry consistency. Moreover, to further boost performance in complicated indoor scenes, we also propose a local adaptive transformation from signed distance functions (SDFs) to density in the volume rendering equation. On both synthetic and real-world datasets we demonstrate strong performance in dense mapping, tracking, and novel view synthesis, even competitive with recent RGB-D SLAM systems.

Zihan Zhu, Songyou Peng, Viktor Larsson, Zhaopeng Cui, Martin R. Oswald, Andreas Geiger, Marc Pollefeys• 2023

Related benchmarks

TaskDatasetResultRank
Appearance reconstructionWaymo 8 scenes
PSNR16.41
54
Camera TrackingReplica
Rotation Error (rm-0)1.36
38
ReconstructionReplica average over 8 scenes
Accuracy (Dist)3.65
21
Camera pose estimation7Scenes (test)
Chess Error0.033
16
Camera pose estimationReplica
ATE RMSE (cm)1.88
15
Relative Pose Estimation7 Scenes--
12
Camera pose estimationReplica 54 (full video)
Average Error1.88
9
3D ReconstructionReplica (test)
Avg Acc3.65
9
Tracking and Mapping7Scenes--
8
TrackingWaymo--
7
Showing 10 of 38 rows

Other info

Follow for update