Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

About

Neural implicit representations have recently shown encouraging results in various domains, including promising progress in simultaneous localization and mapping (SLAM). Nevertheless, existing methods produce over-smoothed scene reconstructions and have difficulty scaling up to large scenes. These limitations are mainly due to their simple fully-connected network architecture that does not incorporate local information in the observations. In this paper, we present NICE-SLAM, a dense SLAM system that incorporates multi-level local information by introducing a hierarchical scene representation. Optimizing this representation with pre-trained geometric priors enables detailed reconstruction on large indoor scenes. Compared to recent neural implicit SLAM systems, our approach is more scalable, efficient, and robust. Experiments on five challenging datasets demonstrate competitive results of NICE-SLAM in both mapping and tracking quality. Project page: https://pengsongyou.github.io/nice-slam

Zihan Zhu, Songyou Peng, Viktor Larsson, Weiwei Xu, Hujun Bao, Zhaopeng Cui, Martin R. Oswald, Marc Pollefeys• 2021

Related benchmarks

TaskDatasetResultRank
Camera pose estimationScanNet
ATE RMSE (Avg.)9.05
61
3D Geometry ReconstructionScanNet
Accuracy3.2
54
Camera TrackingScanNet v2 (test)
ATE RMSE (cm)5.59
28
TrackingTUM RGB-D 44 (various sequences)
Average Error69.96
28
Camera TrackingBONN dynamic sequences
Balloon Error24.4
25
Absolute Trajectory EstimationTUM RGB-D
Desk Error0.027
23
TrackingBonn RGB-D dataset
Balloon266.8
23
ReconstructionReplica average over 8 scenes
Accuracy (Dist)2.373
21
Visual SLAMTUM RGB-D fr1 desk
ATE RMSE (cm)19.317
21
Visual SLAMTUM RGB-D fr2 xyz
Translation RMSE (m)0.361
21
Showing 10 of 119 rows
...

Other info

Code

Follow for update