MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
About
We present a real-time monocular dense SLAM system designed bottom-up from MASt3R, a two-view 3D reconstruction and matching prior. Equipped with this strong prior, our system is robust on in-the-wild video sequences despite making no assumption on a fixed or parametric camera model beyond a unique camera centre. We introduce efficient methods for pointmap matching, camera tracking and local fusion, graph construction and loop closure, and second-order global optimisation. With known calibration, a simple modification to the system achieves state-of-the-art performance across various benchmarks. Altogether, we propose a plug-and-play monocular SLAM system capable of producing globally-consistent poses and dense geometry while operating at 15 FPS.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Visual-Inertial Odometry | EuRoC (All sequences) | MH1 Error0.023 | 51 | |
| 3D Reconstruction | 7 Scenes | -- | 32 | |
| Absolute Trajectory Estimation | TUM RGB-D | Desk Error0.016 | 23 | |
| Tracking | TUM-RGBD (various sequences) | Average Translational Error0.03 | 16 | |
| Camera pose estimation | 7Scenes (test) | Chess Error0.053 | 16 | |
| Absolute Pose Estimation | TUM RGB-D v1 | Error (desk)0.016 | 14 | |
| Camera pose estimation | TUM RGB-D 36 | Error (360)0.07 | 9 | |
| 3D Reconstruction | TUM | CD0.057 | 8 | |
| Pose Estimation | TUM-RGBD | ATE0.082 | 8 | |
| Pose Estimation | KITTI (Sequences 00-10) | KITTI Seq 01 Result530.4 | 8 |