Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

DefSLAM: Tracking and Mapping of Deforming Scenes from Monocular Sequences

About

Monocular SLAM algorithms perform robustly when observing rigid scenes, however, they fail when the observed scene deforms, for example, in medical endoscopy applications. We present DefSLAM, the first monocular SLAM capable of operating in deforming scenes in real-time. Our approach intertwines Shape-from-Template (SfT) and Non-Rigid Structure-from-Motion (NRSfM) techniques to deal with the exploratory sequences typical of SLAM. A deformation tracking thread recovers the pose of the camera and the deformation of the observed map, at frame rate, by means of SfT processing a template that models the scene shape-at-rest. A deformation mapping thread runs in parallel with the tracking to update the template, at keyframe rate, by means of an isometric NRSfM processing a batch of full perspective keyframes. In our experiments, DefSLAM processes close-up sequences of deforming scenes, both in a laboratory controlled experiment and in medical endoscopy sequences, producing accurate 3D models of the scene with respect to the moving camera.

Jose Lamarca, Shaifali Parashar, Adrien Bartoli, J.M.M. Montiel• 2019

Related benchmarks

TaskDatasetResultRank
Camera LocalizationStereoMIS (P2-3)
RMSE0.026
16
Camera LocalizationStereoMIS (P2-2)
RMSE28.96
16
Camera LocalizationStereoMIS (P2-4)
RMSE33.73
16
Camera LocalizationStereoMIS Average
RMSE27.67
16
Camera LocalizationStereoMIS (P2-5)
RMSE28.46
14
Camera LocalizationC3VD c1_sigmoid2_t4_v4 v2
RMSE10.35
9
Camera LocalizationC3VD v2 (c2_transverse1_t1_v4)
RMSE13.49
9
Camera LocalizationC3VD Average v2
RMSE13.94
9
Camera LocalizationC3VD c1_descending_t4_v4 v2
RMSE21.35
9
Camera LocalizationC3VD c1_sigmoid1_t4_v4 v2
RMSE10.59
8
Showing 10 of 10 rows

Other info

Follow for update