Compact 3D Gaussian Splatting For Dense Visual SLAM
About
Recent work has shown that 3D Gaussian-based SLAM enables high-quality reconstruction, accurate pose estimation, and real-time rendering of scenes. However, these approaches are built on a tremendous number of redundant 3D Gaussian ellipsoids, leading to high memory and storage costs, and slow training speed. To address the limitation, we propose a compact 3D Gaussian Splatting SLAM system that reduces the number and the parameter size of Gaussian ellipsoids. A sliding window-based masking strategy is first proposed to reduce the redundant ellipsoids. Then we observe that the covariance matrix (geometry) of most 3D Gaussian ellipsoids are extremely similar, which motivates a novel geometry codebook to compress 3D Gaussian geometric attributes, i.e., the parameters. Robust and accurate pose estimation is achieved by a global bundle adjustment method with reprojection loss. Extensive experiments demonstrate that our method achieves faster training and rendering speed while maintaining the state-of-the-art (SOTA) quality of the scene representation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Camera Tracking | TUM RGB-D | Tracking Error (fr1/desk)1.57 | 36 | |
| Tracking | ScanNet | ATE RMSE (Seq 00)10.81 | 29 | |
| Camera Tracking | Replica 31 | Average Error (Replica 31)0.27 | 14 | |
| Scene Reconstruction | Replica | Average Depth Error0.66 | 12 | |
| Dense SLAM | Replica 31 | Tracking Latency per Iteration13.25 | 12 | |
| Dense SLAM | ScanNet 4 | Tracking Iteration Time (ms)15.21 | 12 | |
| Image Reconstruction | Replica | Average PSNR37.81 | 11 | |
| Rendering | Long Corridor Dataset View (train) | PSNR28.9 | 10 | |
| Rendering | Long Corridor Dataset Novel View (test) | PSNR26.3 | 10 |