Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PlanarRecon: Real-time 3D Plane Detection and Reconstruction from Posed Monocular Videos

About

We present PlanarRecon -- a novel framework for globally coherent detection and reconstruction of 3D planes from a posed monocular video. Unlike previous works that detect planes in 2D from a single image, PlanarRecon incrementally detects planes in 3D for each video fragment, which consists of a set of key frames, from a volumetric representation of the scene using neural networks. A learning-based tracking and fusion module is designed to merge planes from previous fragments to form a coherent global plane reconstruction. Such design allows PlanarRecon to integrate observations from multiple views within each fragment and temporal information across different ones, resulting in an accurate and coherent reconstruction of the scene abstraction with low-polygonal geometry. Experiments show that the proposed approach achieves state-of-the-art performances on the ScanNet dataset while being real-time.

Yiming Xie, Matheus Gadelha, Fengting Yang, Xiaowei Zhou, Huaizu Jiang• 2022

Related benchmarks

TaskDatasetResultRank
3D Geometry ReconstructionScanNet (Atlas split)
Completeness0.154
11
3D Geometry ReconstructionScanNet V2 (val)
Completeness15.4
7
3D Plane SegmentationScanNet V2 (testplanes)
VOI3.201
7
3D Plane ReconstructionScanNet V2 (testplanes)
Fidelity18.86
7
3D Plane SegmentationScanNet (test)
VOI3.622
7
3D surface reconstructionScanNet V2 (testplanes)
Chamfer Distance9.89
7
Planar ReconstructionScanNet v2 (test)
Geometry Chamfer Distance9.89
5
Planar ReconstructionScanNet++ (test)
Geometry Chamfer17.85
5
3D Plane SegmentationScanNet Atlas (val)
VOI3.622
4
3D Geometry ReconstructionScanNet PlaneAE (val)
Completeness0.143
2
Showing 10 of 11 rows

Other info

Code

Follow for update