MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera
About
In this paper, we propose MonoRec, a semi-supervised monocular dense reconstruction architecture that predicts depth maps from a single moving camera in dynamic environments. MonoRec is based on a multi-view stereo setting which encodes the information of multiple consecutive images in a cost volume. To deal with dynamic objects in the scene, we introduce a MaskModule that predicts moving object masks by leveraging the photometric inconsistencies encoded in the cost volumes. Unlike other multi-view stereo methods, MonoRec is able to reconstruct both static and moving objects by leveraging the predicted masks. Furthermore, we present a novel multi-stage training scheme with a semi-supervised loss formulation that does not require LiDAR depth values. We carefully evaluate MonoRec on the KITTI dataset and show that it achieves state-of-the-art performance compared to both multi-view and single-view methods. With the model trained on KITTI, we further demonstrate that MonoRec is able to generalize well to both the Oxford RobotCar dataset and the more challenging TUM-Mono dataset recorded by a handheld camera. Code and related materials will be available at https://vision.in.tum.de/research/monorec.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Depth Estimation | KITTI Odometry 11 (Sequence 07) | AbsRel0.053 | 12 | |
| Depth Estimation | KITTI Odometry 11 (Sequence 00) | AbsRel5.4 | 12 | |
| Multi-view Depth Estimation | KITTI Odometry 11 (sequence 05) | Absolute Relative Error0.068 | 12 | |
| Multi-view Depth Estimation | KITTI Odometry 11 (sequence 04) | Absolute Relative Error0.075 | 12 | |
| Depth Estimation | KITTI Odometry 11 (Sequence 06) | Abs Rel Error0.063 | 12 | |
| Depth Estimation | KITTI Odometry Overall (test) | Abs Rel5 | 8 | |
| Depth Estimation | KITTI Odometry Dynamic (test) | Abs Rel0.36 | 8 | |
| Monocular Depth Estimation | DDAD (Overall) | Abs Rel0.158 | 3 | |
| Monocular Depth Estimation | DDAD Dynamic | Abs Rel0.544 | 3 |