SurfaceNet: An End-to-end 3D Neural Network for Multiview Stereopsis
About
This paper proposes an end-to-end learning framework for multiview stereopsis. We term the network SurfaceNet. It takes a set of images and their corresponding camera parameters as input and directly infers the 3D model. The key advantage of the framework is that both photo-consistency as well geometric relations of the surface structure can be directly learned for the purpose of multiview stereopsis in an end-to-end fashion. SurfaceNet is a fully 3D convolutional network which is achieved by encoding the camera parameters together with the images in a 3D voxel representation. We evaluate SurfaceNet on the large-scale DTU benchmark.
Mengqi Ji, Juergen Gall, Haitian Zheng, Yebin Liu, Lu Fang• 2017
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-view Stereo | DTU (test) | -- | 61 | |
| Multi-view Stereo | DTU 1 (evaluation) | Accuracy Error (mm)0.45 | 51 | |
| Multi-view Stereo Reconstruction | DTU (evaluation) | Mean Distance (mm) - Acc.0.45 | 35 | |
| Point Cloud Reconstruction | DTU (test) | -- | 15 |
Showing 4 of 4 rows