WaterSplat-SLAM: Photorealistic Monocular SLAM in Underwater Environment
About
Underwater monocular SLAM is a challenging problem with applications from autonomous underwater vehicles to marine archaeology. However, existing underwater SLAM methods struggle to produce maps with high-fidelity rendering. In this paper, we propose WaterSplat-SLAM, a novel monocular underwater SLAM system that achieves robust pose estimation and photorealistic dense mapping. Specifically, we couple semantic medium filtering into two-view 3D reconstruction prior to enable underwater-adapted camera tracking and depth estimation. Furthermore, we present a semantic-guided rendering and adaptive map management strategy with an online medium-aware Gaussian map, modeling underwater environment in a photorealistic and compact manner. Experiments on multiple underwater datasets demonstrate that WaterSplat-SLAM achieves robust camera tracking and high-fidelity rendering in underwater environments.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Novel View Synthesis | SeaThru-NeRF (J.G.-RedSea) | PSNR23.68 | 18 | |
| Novel View Synthesis | SeaThru-NeRF Panama | PSNR25.95 | 18 | |
| Novel View Synthesis | SeaThru-NeRF Curasao | PSNR28.91 | 17 | |
| Novel View Synthesis | SeaThru-NeRF Avg | PSNR24.14 | 7 | |
| Novel View Synthesis | WaterSplat-SLAM Pool_up2 | PSNR33.22 | 7 | |
| Novel View Synthesis | WaterSplat-SLAM Average | PSNR30.19 | 7 | |
| Camera Tracking | WaterSplat-SLAM (Pipe_local) | ATE (m)0.164 | 7 | |
| Camera Tracking | WaterSplat-SLAM Pool_up2 | ATE (m)0.224 | 7 | |
| Camera Tracking | WaterSplat-SLAM Big_gate | ATE (m)0.066 | 7 | |
| Camera Tracking | WaterSplat-SLAM Pool_up | ATE (m)0.289 | 6 |