RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields
About
Leveraging neural implicit representations for dense RGB-D SLAM has been studied in recent years. However, this approach relies on a static-environment assumption and does not work robustly in dynamic environments due to inconsistent observations of geometry and photometry. To address the challenges posed by dynamic environments, we propose a novel dynamic SLAM framework built on neural radiance fields. Specifically, we introduce a motion mask generation method to filter out invalid sampled rays. This design fuses an optical flow mask with a semantic mask to enhance the precision of the motion mask. To further improve the accuracy of pose estimation, we design a divide-and-conquer pose optimization algorithm that distinguishes between keyframes and non-keyframes. The proposed edge warp loss effectively strengthens the geometric constraints between adjacent frames. Extensive experiments are conducted on two challenging datasets, and the results show that RoDyn-SLAM achieves state-of-the-art performance among recent neural RGB-D methods in both accuracy and robustness.
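The motion-mask fusion described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the function names, the flow-magnitude threshold, and the pixel layout of the sampled rays are all assumptions; the core idea shown is simply taking the union of a flow-based mask and a semantic mask, then discarding rays that land on dynamic pixels.

```python
import numpy as np

def fuse_motion_mask(flow, semantic_mask, flow_thresh=1.0):
    """Fuse a flow-based mask with a semantic mask (hypothetical sketch).

    flow:          (H, W, 2) per-pixel optical flow vectors.
    semantic_mask: (H, W) bool, True where a potentially dynamic
                   class (e.g. a person) was segmented.
    flow_thresh:   assumed flow-magnitude threshold (hyperparameter).
    Returns a (H, W) bool motion mask, True = dynamic pixel.
    """
    flow_mag = np.linalg.norm(flow, axis=-1)
    flow_mask = flow_mag > flow_thresh   # pixels with large apparent motion
    return flow_mask | semantic_mask     # union of the two cues

def filter_rays(ray_pixels, motion_mask):
    """Drop sampled rays whose pixel falls inside the motion mask.

    ray_pixels: (N, 2) integer (u, v) pixel coordinates of sampled rays.
    """
    u, v = ray_pixels[:, 0], ray_pixels[:, 1]
    keep = ~motion_mask[v, u]            # index as [row, col] = [v, u]
    return ray_pixels[keep]
```

In practice the two cues are complementary: semantic segmentation catches known movable classes even when they are momentarily still, while the optical flow cue catches motion from objects outside the segmenter's label set.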
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Tracking | TUM RGB-D 44 (various sequences) | Average Error: 5.26 | 28 |
| Tracking | TUM 8 dynamic scenes | f3 Walk Scale/Translation Error: 1.7 | 28 |
| Camera Tracking | BONN dynamic sequences | Balloon Error: 7.9 | 25 |
| Tracking | Bonn RGB-D dataset | Balloon2: 11.5 | 23 |
| Camera Tracking | TUM dynamic scene sequences RGB-D (test) | f3/w_s ATE (cm): 1.7 | 17 |
| Tracking | TUM-RGBD (various sequences) | Average Translational Error: 5.26 | 16 |
| Camera Tracking | TUM dynamic scene sequences | ATE Component w_x (f3): 8.3 | 15 |
| Tracking Accuracy | BONN | bal1 Score: 7.9 | 8 |
| Pose Estimation | TUM | s.s Error: 1.5 | 8 |
| 3D Mapping | Bonn person_tracking | Accuracy (cm): 10.2 | 4 |