Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PanoFormer: Panorama Transformer for Indoor 360 Depth Estimation

About

Existing panoramic depth estimation methods based on convolutional neural networks (CNNs) focus on removing panoramic distortions, failing to perceive panoramic structures efficiently due to the fixed receptive field in CNNs. This paper proposes the panorama transformer (named PanoFormer) to estimate the depth in panorama images, with tangent patches from spherical domain, learnable token flows, and panorama specific metrics. In particular, we divide patches on the spherical tangent domain into tokens to reduce the negative effect of panoramic distortions. Since the geometric structures are essential for depth estimation, a self-attention module is redesigned with an additional learnable token flow. In addition, considering the characteristic of the spherical domain, we present two panorama-specific metrics to comprehensively evaluate the panoramic depth estimation models' performance. Extensive experiments demonstrate that our approach significantly outperforms the state-of-the-art (SOTA) methods. Furthermore, the proposed method can be effectively extended to solve semantic panorama segmentation, a similar pixel2pixel task. Code will be available.

Zhijie Shen, Chunyu Lin, Kang Liao, Lang Nie, Zishuo Zheng, Yao Zhao• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationStanford2D3DS (3-fold cross-validation)
mIoU57.03
90
Monocular Depth EstimationStanford2D3D (test)
δ1 Accuracy88.74
71
Monocular Depth EstimationMatterport3D (test)
Delta Acc (< 1.25)89.08
48
Depth EstimationMatterport3D
delta191.84
35
Semantic segmentationStructured3D (test)
mIoU64.47
21
Semantic segmentationStanford2D3D official (3-fold average)
mIoU0.5703
20
Depth EstimationStructure3D (test)
AbsRel0.0861
18
Semantic segmentationStructured3D (val)
mIoU60.98
17
360 Depth EstimationStanford2D3D 1.0 (test)
Abs Rel Error0.0721
14
Monocular panoramic depth estimationStanford2D3D
Delta 1 Accuracy93.94
13
Showing 10 of 22 rows

Other info

Code

Follow for update