DeepMVS: Learning Multi-view Stereopsis
About
We present DeepMVS, a deep convolutional neural network (ConvNet) for multi-view stereo reconstruction. Taking an arbitrary number of posed images as input, we first produce a set of plane-sweep volumes and use the proposed DeepMVS network to predict high-quality disparity maps. The key contributions that enable these results are (1) supervised pretraining on a photorealistic synthetic dataset, (2) an effective method for aggregating information across a set of unordered images, and (3) integrating multi-layer feature activations from the pre-trained VGG-19 network. We validate the efficacy of DeepMVS using the ETH3D Benchmark. Our results show that DeepMVS compares favorably against state-of-the-art conventional MVS algorithms and other ConvNet based methods, particularly for near-textureless regions and thin structures.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-view 3D Reconstruction | ShapeNetr2n2 (test) | mIoU68.4 | 160 | |
| Multi-view 3D Reconstruction | ModelNet40 (test) | mIoU52.5 | 112 | |
| Multi-view 3D Reconstruction | ShapeNet r2n2 13 categories (test) | mIoU68.9 | 80 | |
| Multi-view 3D Reconstruction | ShapeNet ism (test) | mIoU51.8 | 72 | |
| Single-view 3D Reconstruction | ShapeNet-R2N2 (test) | mIoU62 | 22 | |
| Depth Estimation | Sun3D (test) | Abs Rel28.2 | 22 | |
| Depth Estimation | Scenes11 (test) | L1 Relative Error0.21 | 12 | |
| Depth Estimation | RGBD-SLAM (test) | Abs Rel0.2938 | 10 | |
| Multi-view 3D Reconstruction | ShapeNet_R2N2 13 categories (test) | Reconstruction Score (1 view)0.524 | 8 | |
| Depth Estimation | Scenes11 (Synthetic) | AbsRel0.21 | 7 |