UniFuse: Unidirectional Fusion for 360$^{\circ}$ Panorama Depth Estimation
About
Learning depth from spherical panoramas is becoming a popular research topic because a panorama has a full field-of-view of the environment and provides a relatively complete description of a scene. However, applying well-studied CNNs for perspective images to the standard representation of spherical panoramas, i.e., the equirectangular projection, is suboptimal, as this projection becomes increasingly distorted towards the poles. Another representation is the cubemap projection, which is distortion-free but discontinuous at the face edges and limited in field-of-view per face. This paper introduces a new framework to fuse features from the two projections, unidirectionally feeding the cubemap features to the equirectangular features only at the decoding stage. Unlike the recent bidirectional fusion approach, which operates at both the encoding and decoding stages, our fusion scheme is much more efficient. We also design a more effective fusion module for our fusion scheme. Experiments verify the effectiveness of our proposed fusion strategy and module, and our model achieves state-of-the-art performance on four popular datasets. Additional experiments show that our model also has advantages in model complexity and generalization capability. The code is available at https://github.com/alibaba/UniFuse-Unidirectional-Fusion.
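
The distortion mentioned in the abstract can be made concrete with a small sketch. It is not taken from the UniFuse code; it only uses the standard property of the equirectangular projection that every image row spans the full 360° of longitude, so a row at latitude φ is stretched horizontally by roughly 1/cos(φ). The function name `horizontal_stretch` and the row-center convention are assumptions made for illustration.

```python
import numpy as np

def horizontal_stretch(height):
    """Approximate horizontal stretch factor for each row of an
    equirectangular image with `height` rows (hypothetical helper).

    Row centers are mapped to latitudes from just below +pi/2 (top row)
    to just above -pi/2 (bottom row). A circle of latitude phi has
    circumference proportional to cos(phi), yet it is drawn with the
    same pixel width as the equator, hence the 1 / cos(phi) factor.
    """
    rows = np.arange(height)
    lat = (0.5 - (rows + 0.5) / height) * np.pi
    return 1.0 / np.cos(lat)

if __name__ == "__main__":
    s = horizontal_stretch(512)
    # The stretch is close to 1 near the equator and grows quickly
    # towards the top and bottom rows (the poles).
    print(s[256], s[64], s[0])
```

The cubemap projection avoids this growth because each face is an ordinary perspective view, at the cost of seams between the six faces.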
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Monocular Depth Estimation | Stanford2D3D (test) | δ1 Accuracy | 91.3 | 71 |
| Monocular Depth Estimation | Matterport3D (test) | Delta Acc (< 1.25) | 88.97 | 48 |
| Depth Estimation | Matterport3D | delta1 | 88.97 | 35 |
| Monocular 360 Depth Estimation | Matterport3D official (test) | Delta Acc (1.25x) | 89 | 20 |
| Depth Estimation | Structure3D (test) | AbsRel | 0.1506 | 18 |
| Monocular panoramic depth estimation | Stanford2D3D | Delta 1 Accuracy | 87.11 | 13 |
| Depth Estimation | Stanford2D3D | Abs Rel | 0.1114 | 13 |
| Monocular Depth Estimation | PanoSunCG | RMSE | 0.2802 | 11 |
| 360 Depth Estimation | 3D60 (test) | Abs Rel | 0.0466 | 11 |
| Monocular Depth Estimation | Matterport3D 2K resolution (2048x1024) (test) | AbsRel | 0.2 | 9 |
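
For readers unfamiliar with the metrics in the table above, the following is a minimal sketch of their commonly used definitions: absolute relative error (AbsRel), root mean squared error (RMSE), and δ1 accuracy (the fraction of pixels whose prediction is within a factor of 1.25 of the ground truth). It is not the evaluation code of any of the listed benchmarks; the function name `depth_metrics`, the `gt > 0` validity mask, and the assumption that predictions are strictly positive are illustrative choices. Individual benchmarks may differ in masking, depth range, and whether δ1 is reported as a fraction or a percentage.

```python
import numpy as np

def depth_metrics(pred, gt, threshold=1.25):
    """Compute AbsRel, RMSE and delta1 for one depth map (hypothetical helper).

    Assumes `pred` is strictly positive and treats pixels with gt <= 0
    as invalid (missing ground truth).
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)

    valid = gt > 0                      # keep only pixels with ground truth
    pred, gt = pred[valid], gt[valid]

    abs_rel = np.mean(np.abs(pred - gt) / gt)
    rmse = np.sqrt(np.mean((pred - gt) ** 2))

    # delta1: share of pixels where max(pred/gt, gt/pred) < threshold
    ratio = np.maximum(pred / gt, gt / pred)
    delta1 = np.mean(ratio < threshold)

    return {"abs_rel": abs_rel, "rmse": rmse, "delta1": delta1}

if __name__ == "__main__":
    gt = np.array([1.0, 2.0, 4.0, 0.0])     # last pixel has no ground truth
    pred = np.array([1.0, 2.2, 6.0, 3.0])
    print(depth_metrics(pred, gt))
```

Lower is better for AbsRel and RMSE, and higher is better for δ1.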