Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments

About

Current Simultaneous Localization and Mapping (SLAM) methods based on Neural Radiance Fields (NeRF) or 3D Gaussian Splatting excel in reconstructing static 3D scenes but struggle with tracking and reconstruction in dynamic environments, such as real-world scenes with moving elements. Existing NeRF-based SLAM approaches addressing dynamic challenges typically rely on RGB-D inputs, with few methods accommodating pure RGB input. To overcome these limitations, we propose Dy3DGS-SLAM, the first 3D Gaussian Splatting (3DGS) SLAM method for dynamic scenes using monocular RGB input. To address dynamic interference, we fuse optical flow masks and depth masks through a probabilistic model to obtain a fused dynamic mask. With only a single network iteration, this can constrain tracking scales and refine rendered geometry. Based on the fused dynamic mask, we designed a novel motion loss to constrain the pose estimation network for tracking. In mapping, we use the rendering loss of dynamic pixels, color, and depth to eliminate transient interference and occlusion caused by dynamic objects. Experimental results demonstrate that Dy3DGS-SLAM achieves state-of-the-art tracking and rendering in dynamic environments, outperforming or matching existing RGB-D methods.

Mingrui Li, Yiming Zhou, Hongxing Zhou, Xinggang Hu, Florian Roemer, Hongyu Wang, Ahmad Osman• 2025

Related benchmarks

TaskDatasetResultRank
Camera TrackingBonn ps_track
ATE (cm)5.6
15
Camera TrackingTUM fr3 w xyz
ATE (cm)5.8
15
Camera TrackingTUM fr3 w half
ATE (cm)7
15
Camera TrackingBonn balloon 2
ATE (cm)1.9
14
Camera TrackingBonn balloon
Absolute Trajectory Error (cm)4.5
14
Camera TrackingTUM fr3 s half
ATE (cm)3.4
13
Showing 6 of 6 rows

Other info

Follow for update