WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
About
We present WildGS-SLAM, a robust and efficient monocular RGB SLAM system designed to handle dynamic environments by leveraging uncertainty-aware geometric mapping. Unlike traditional SLAM systems, which assume static scenes, our approach integrates depth and uncertainty information to enhance tracking, mapping, and rendering performance in the presence of moving objects. We introduce an uncertainty map, predicted by a shallow multi-layer perceptron and DINOv2 features, to guide dynamic object removal during both tracking and mapping. This uncertainty map enhances dense bundle adjustment and Gaussian map optimization, improving reconstruction accuracy. Our system is evaluated on multiple datasets and demonstrates artifact-free view synthesis. Results showcase WildGS-SLAM's superior performance in dynamic environments compared to state-of-the-art methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Tracking | TUM RGB-D 44 (various sequences) | Average Error1.51 | 28 | |
| Camera Tracking | BONN dynamic sequences | Balloon Error2.8 | 25 | |
| Tracking | TUM RGBD (test) | fr1/desk Error1.7 | 18 | |
| Camera Tracking | TUM dynamic scene sequences RGB-D (test) | f3/w_s ATE (cm)0.4 | 17 | |
| Camera pose estimation | Sintel 14-sequence | ATE18.2 | 15 | |
| Novel View Synthesis | RigScapes Aerial+Road | PSNR11.1 | 11 | |
| Novel View Synthesis | SmallCity | PSNR10.35 | 11 | |
| Tracking | Wild-SLAM MoCap Dataset 1.0 (test) | Score (ANYmal2)0.3 | 11 | |
| Pose Estimation | TUM | s.s Error0.51 | 8 | |
| Camera Tracking | Wild-SLAM MoCap Dataset | Person Tracking Error0.8 | 8 |