Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

EGformer: Equirectangular Geometry-biased Transformer for 360 Depth Estimation

About

Estimating the depths of equirectangular (i.e., 360) images (EIs) is challenging given the distorted 180 x 360 field-of-view, which is hard to be addressed via convolutional neural network (CNN). Although a transformer with global attention achieves significant improvements over CNN for EI depth estimation task, it is computationally inefficient, which raises the need for transformer with local attention. However, to apply local attention successfully for EIs, a specific strategy, which addresses distorted equirectangular geometry and limited receptive field simultaneously, is required. Prior works have only cared either of them, resulting in unsatisfactory depths occasionally. In this paper, we propose an equirectangular geometry-biased transformer termed EGformer. While limiting the computational cost and the number of network parameters, EGformer enables the extraction of the equirectangular geometry-aware local attention with a large receptive field. To achieve this, we actively utilize the equirectangular geometry as the bias for the local attention instead of struggling to reduce the distortion of EIs. As compared to the most recent EI depth estimation studies, the proposed approach yields the best depth outcomes overall with the lowest computational cost and the fewest parameters, demonstrating the effectiveness of the proposed methods.

Ilwi Yun, Chanyong Shin, Hyunku Lee, Hyuk-Jae Lee, Chae Eun Rhee• 2023

Related benchmarks

TaskDatasetResultRank
Monocular Depth EstimationStanford2D3D (test)
δ1 Accuracy81.85
81
Depth EstimationMatterport3D
delta190.82
50
Monocular Depth EstimationMatterport3D (test)
Delta Acc (< 1.25)81.58
48
Depth EstimationStanford2D3D
Abs Rel0.0929
27
Monocular 360 Depth EstimationMatterport3D official (test)
Delta Acc (1.25x)81.6
20
Depth EstimationStructured3D Indoor
Abs Rel Error2.73
12
Depth EstimationPanoCity Outdoor
Abs Rel0.0363
12
Panoramic Depth EstimationMatterport3D (test)
Abs Rel0.147
12
Depth EstimationStructured3D (val)
δ1 Accuracy79.79
9
Depth EstimationStanford2D3D sphere rank 7 256x512 (test)
MAE0.17
7
Showing 10 of 13 rows

Other info

Follow for update