Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping

About

Simultaneous Localization and Mapping (SLAM) is essential for precise surgical interventions and robotic tasks in minimally invasive procedures. While recent advancements in 3D Gaussian Splatting (3DGS) have improved SLAM with high-quality novel view synthesis and fast rendering, these systems struggle with accurate depth and surface reconstruction due to multi-view inconsistencies. Simply incorporating SLAM and 3DGS leads to mismatches between the reconstructed frames. In this work, we present Endo-2DTAM, a real-time endoscopic SLAM system with 2D Gaussian Splatting (2DGS) to address these challenges. Endo-2DTAM incorporates a surface normal-aware pipeline, which consists of tracking, mapping, and bundle adjustment modules for geometrically accurate reconstruction. Our robust tracking module combines point-to-point and point-to-plane distance metrics, while the mapping module utilizes normal consistency and depth distortion to enhance surface reconstruction quality. We also introduce a pose-consistent strategy for efficient and geometrically coherent keyframe sampling. Extensive experiments on public endoscopic datasets demonstrate that Endo-2DTAM achieves an RMSE of $1.87\pm 0.63$ mm for depth reconstruction of surgical scenes while maintaining computationally efficient tracking, high-quality visual appearance, and real-time rendering. Our code will be released at github.com/lastbasket/Endo-2DTAM.

Yiming Huang, Beilei Cui, Long Bai, Zhen Chen, Jinlin Wu, Zhen Li, Hongbin Liu, Hongliang Ren• 2025

Related benchmarks

TaskDatasetResultRank
Camera LocalizationStereoMIS (P2-3)
RMSE0.013
16
Camera LocalizationStereoMIS (P2-4)
RMSE31.06
16
Camera LocalizationStereoMIS Average
RMSE25.2
16
Camera LocalizationStereoMIS (P2-2)
RMSE36.36
16
Camera LocalizationStereoMIS (P2-5)
RMSE33.35
14
4D ReconstructionEndoMapper Sequence 3
PSNR15.12
14
4D ReconstructionEndoMapper Sequence 1
PSNR15.509
14
4D ReconstructionEndoMapper Sequence 2
PSNR15.686
14
Camera LocalizationC3VD c1_descending_t4_v4 v2
RMSE30.33
9
Camera LocalizationC3VD c1_sigmoid2_t4_v4 v2
RMSE30.57
9
Showing 10 of 23 rows

Other info

Follow for update