BNV-Fusion: Dense 3D Reconstruction using Bi-level Neural Volume Fusion
About
Dense 3D reconstruction from a stream of depth images is the key to many mixed reality and robotic applications. Although methods based on Truncated Signed Distance Function (TSDF) Fusion have advanced the field over the years, the TSDF volume representation is confronted with striking a balance between the robustness to noisy measurements and maintaining the level of detail. We present Bi-level Neural Volume Fusion (BNV-Fusion), which leverages recent advances in neural implicit representations and neural rendering for dense 3D reconstruction. In order to incrementally integrate new depth maps into a global neural implicit representation, we propose a novel bi-level fusion strategy that considers both efficiency and reconstruction quality by design. We evaluate the proposed method on multiple datasets quantitatively and qualitatively, demonstrating a significant improvement over existing methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Scene Reconstruction | iTHOR FloorPlan207 | Accuracy92 | 4 | |
| 3D Scene Reconstruction | iTHOR FloorPlan210 | Accuracy93.3 | 4 | |
| 3D Scene Reconstruction | iTHOR (FloorPlan213) | Accuracy93.9 | 4 | |
| 3D Scene Reconstruction | iTHOR FloorPlan220 | Accuracy89.1 | 4 | |
| 3D Scene Reconstruction | iTHOR (FloorPlan229) | Accuracy93.1 | 4 | |
| 3D surface reconstruction | 3D-CRS Scene01 | Accuracy93.7 | 4 | |
| 3D surface reconstruction | 3D-CRS Scene09 | Accuracy89.4 | 4 | |
| 3D surface reconstruction | 3D-CRS Scene17 | Accuracy94.4 | 4 | |
| 3D surface reconstruction | 3D-CRS Scene05 | Accuracy95.2 | 4 | |
| 3D surface reconstruction | 3D-CRS Scene06 | Accuracy89.7 | 4 |