DISN: Deep Implicit Surface Network for High-quality Single-view 3D Reconstruction
About
Reconstructing 3D shapes from single-view images has been a long-standing research problem. In this paper, we present DISN, a Deep Implicit Surface Network which can generate a high-quality detail-rich 3D mesh from an 2D image by predicting the underlying signed distance fields. In addition to utilizing global image features, DISN predicts the projected location for each 3D point on the 2D image, and extracts local features from the image feature maps. Combining global and local features significantly improves the accuracy of the signed distance field prediction, especially for the detail-rich areas. To the best of our knowledge, DISN is the first method that constantly captures details such as holes and thin structures present in 3D shapes from single-view images. DISN achieves the state-of-the-art single-view reconstruction performance on a variety of shape categories reconstructed from both synthetic and real images. Code is available at https://github.com/xharlie/DISN The supplementary can be found at https://xharlie.github.io/images/neurips_2019_supp.pdf
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Reconstruction from a single 2D image | ShapeNet (test) | Volumetric IoU (Airplane)57.5 | 11 | |
| Single-view 3D Reconstruction | ShapeNet Core (test) | IoU (plane)61.7 | 8 | |
| Single-view 3D Reconstruction | ShapeNet chairs | Chamfer Distance (CD)3.02 | 8 | |
| Single-view 3D Reconstruction | ShapeNet Airplanes | CD3.19 | 7 | |
| Single-view Reconstruction | ShapeNet (test) | Chamfer Distance13 | 6 | |
| Single-view 3D Reconstruction | ShapeNet Core all categories | F-Score @ 0.5%7.9 | 4 | |
| Single-view 3D Reconstruction | ShapeNet | CD (car)0.0012 | 4 | |
| Single-view 3D Reconstruction | Objaverse | CD28.2 | 4 | |
| Single-view 3D Reconstruction | Objaverse (test) | Chamfer Distance (CD)28.2 | 4 | |
| Camera pose estimation | ShapeNet (test) | -- | 3 |