Point-SLAM: Dense Neural Point Cloud-based SLAM
About
We propose a dense neural simultaneous localization and mapping (SLAM) approach for monocular RGBD input which anchors the features of a neural scene representation in a point cloud that is iteratively generated in an input-dependent data-driven manner. We demonstrate that both tracking and mapping can be performed with the same point-based neural scene representation by minimizing an RGBD-based re-rendering loss. In contrast to recent dense neural SLAM methods which anchor the scene features in a sparse grid, our point-based approach allows dynamically adapting the anchor point density to the information density of the input. This strategy reduces runtime and memory usage in regions with fewer details and dedicates higher point density to resolve fine details. Our approach performs either better or competitive to existing dense neural RGBD SLAM methods in tracking, mapping and rendering accuracy on the Replica, TUM-RGBD and ScanNet datasets. The source code is available at https://github.com/eriksandstroem/Point-SLAM.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Camera pose estimation | ScanNet | -- | 133 | |
| Photometric Rendering | Replica (room0-2, office0-4) | PSNR39.16 | 80 | |
| Camera Tracking | Replica | Rotation Error (rm-0)0.61 | 48 | |
| Camera Tracking | TUM RGB-D | Tracking Error (fr1/desk)2.73 | 36 | |
| Absolute Trajectory Estimation | TUM RGB-D | Desk Error0.043 | 36 | |
| Tracking | ScanNet | ATE RMSE (Seq 00)10.2 | 29 | |
| Camera pose estimation | TUM RGB-D 36 | Error (desk)4.34 | 26 | |
| Tracking | TUM RGBD (test) | fr1/desk Error2.73 | 24 | |
| Visual SLAM | TUM RGB-D fr1 desk | -- | 24 | |
| Mesh Reconstruction | Replica Room 0 | Depth L1 Error0.3 | 21 |