Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

iMAP: Implicit Mapping and Positioning in Real-Time

About

We show for the first time that a multilayer perceptron (MLP) can serve as the only scene representation in a real-time SLAM system for a handheld RGB-D camera. Our network is trained in live operation without prior data, building a dense, scene-specific implicit 3D model of occupancy and colour which is also immediately used for tracking. Achieving real-time SLAM via continual training of a neural network against a live image stream requires significant innovation. Our iMAP algorithm uses a keyframe structure and multi-processing computation flow, with dynamic information-guided pixel sampling for speed, with tracking at 10 Hz and global map updating at 2 Hz. The advantages of an implicit MLP over standard dense SLAM techniques include efficient geometry representation with automatic detail control and smooth, plausible filling-in of unobserved regions such as the back surfaces of objects.

Edgar Sucar, Shikun Liu, Joseph Ortiz, Andrew J. Davison• 2021

Related benchmarks

TaskDatasetResultRank
Camera pose estimationScanNet--
119
TrackingTUM RGB-D 44 (various sequences)
Average Error97.85
41
Camera TrackingReplica
Rotation Error (rm-0)3.12
38
Absolute Trajectory EstimationTUM RGB-D
Desk Error0.049
36
Camera TrackingScanNet v2 (test)
ATE RMSE (cm)11.91
28
Camera TrackingBONN dynamic sequences--
25
TrackingBonn RGB-D dataset
Balloon267
23
Camera TrackingTUM RGB-D
Tracking Error (fr1/desk)7.2
23
ReconstructionReplica average over 8 scenes
Accuracy (Dist)3.621
21
Camera TrackingTUM RGB-D
ATE RMSE (cm)4.23
18
Showing 10 of 47 rows

Other info

Follow for update