3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction
About
Inspired by the recent success of methods that employ shape priors to achieve robust 3D reconstructions, we propose a novel recurrent neural network architecture that we call the 3D Recurrent Reconstruction Neural Network (3D-R2N2). The network learns a mapping from images of objects to their underlying 3D shapes from a large collection of synthetic data. Our network takes in one or more images of an object instance from arbitrary viewpoints and outputs a reconstruction of the object in the form of a 3D occupancy grid. Unlike most of the previous works, our network does not require any image annotations or object class labels for training or testing. Our extensive experimental analysis shows that our reconstruction framework i) outperforms the state-of-the-art methods for single view reconstruction, and ii) enables the 3D reconstruction of objects in situations when traditional SFM/SLAM methods fail (because of lack of texture and/or wide baseline).
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-view 3D Reconstruction | ShapeNet (test) | IoU0.636 | 209 | |
| Multi-view 3D Reconstruction | ShapeNetr2n2 (test) | mIoU65.1 | 160 | |
| Multi-view 3D Reconstruction | ModelNet40 (test) | mIoU46.4 | 112 | |
| Multi-view 3D Reconstruction | ShapeNet | IoU0.636 | 110 | |
| Multi-view 3D Reconstruction | ShapeNet r2n2 13 categories (test) | mIoU65.4 | 80 | |
| 3D Object Reconstruction | ShapeNet (test) | Mean IoU0.636 | 80 | |
| 3D Reconstruction | ShapeNet (test) | EMD0.606 | 74 | |
| Multi-view 3D Reconstruction | ShapeNet ism (test) | mIoU47.2 | 72 | |
| Multi-view 3D object reconstruction | Things3D | IoU33.4 | 32 | |
| Silhouette Prediction | Blobby | mIoU86.5 | 32 |