PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image
About
This paper proposes a deep neural architecture, PlaneRCNN, that detects and reconstructs piecewise planar surfaces from a single RGB image. PlaneRCNN employs a variant of Mask R-CNN to detect planes with their plane parameters and segmentation masks. PlaneRCNN then jointly refines all the segmentation masks with a novel loss enforcing the consistency with a nearby view during training. The paper also presents a new benchmark with more fine-grained plane segmentations in the ground-truth, in which, PlaneRCNN outperforms existing state-of-the-art methods with significant margins in the plane detection, segmentation, and reconstruction metrics. PlaneRCNN makes an important step towards robust plane extraction, which would have an immediate impact on a wide range of applications including Robotics, Augmented Reality, and Virtual Reality.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Geometry Reconstruction | ScanNet | Accuracy39.6 | 54 | |
| Monocular Depth Estimation | NYU Depth Eigen v2 (test) | A.Rel0.124 | 49 | |
| 3D Reconstruction | 7 Scenes | -- | 32 | |
| Monocular Depth Estimation | ScanNet (test) | Abs Rel0.18 | 22 | |
| Scene Reconstruction | 3D-Front (test) | CD0.717 | 9 | |
| Plane Geometry Estimation | NYU V2 | Rel0.124 | 4 | |
| Planar Segmentation | ScanNet 6 (test) | VI1.809 | 4 |