DeepMVS: Learning Multi-view Stereopsis

About

We present DeepMVS, a deep convolutional neural network (ConvNet) for multi-view stereo reconstruction. Taking an arbitrary number of posed images as input, we first produce a set of plane-sweep volumes and use the proposed DeepMVS network to predict high-quality disparity maps. The key contributions that enable these results are (1) supervised pretraining on a photorealistic synthetic dataset, (2) an effective method for aggregating information across a set of unordered images, and (3) integrating multi-layer feature activations from the pre-trained VGG-19 network. We validate the efficacy of DeepMVS using the ETH3D Benchmark. Our results show that DeepMVS compares favorably against state-of-the-art conventional MVS algorithms and other ConvNet based methods, particularly for near-textureless regions and thin structures.

Po-Han Huang, Kevin Matzen, Johannes Kopf, Narendra Ahuja, Jia-Bin Huang• 2018

Related benchmarks

Task	Dataset	Result
Multi-view 3D Reconstruction	ShapeNetr2n2 (test)	mIoU68.4	160
Multi-view 3D Reconstruction	ModelNet40 (test)	mIoU52.5	112
Multi-view 3D Reconstruction	ShapeNet r2n2 13 categories (test)	mIoU68.9	80
Multi-view 3D Reconstruction	ShapeNet ism (test)	mIoU51.8	72
Single-view 3D Reconstruction	ShapeNet-R2N2 (test)	mIoU62	22
Depth Estimation	Sun3D (test)	Abs Rel28.2	22
Depth Estimation	Scenes11 (test)	L1 Relative Error0.21	12
Depth Estimation	RGBD-SLAM (test)	Abs Rel0.2938	10
Multi-view 3D Reconstruction	ShapeNet_R2N2 13 categories (test)	Reconstruction Score (1 view)0.524	8
Depth Estimation	Scenes11 (Synthetic)	AbsRel0.21	7

Showing 10 of 20 rows

Other info

Follow for update

@wizwand_team Discord