Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Discrete Cosine Transform Network for Guided Depth Map Super-Resolution

About

Guided depth super-resolution (GDSR) is an essential topic in multi-modal image processing, which reconstructs high-resolution (HR) depth maps from low-resolution ones collected with suboptimal conditions with the help of HR RGB images of the same scene. To solve the challenges in interpreting the working mechanism, extracting cross-modal features and RGB texture over-transferred, we propose a novel Discrete Cosine Transform Network (DCTNet) to alleviate the problems from three aspects. First, the Discrete Cosine Transform (DCT) module reconstructs the multi-channel HR depth features by using DCT to solve the channel-wise optimization problem derived from the image domain. Second, we introduce a semi-coupled feature extraction module that uses shared convolutional kernels to extract common information and private kernels to extract modality-specific information. Third, we employ an edge attention mechanism to highlight the contours informative for guided upsampling. Extensive quantitative and qualitative evaluations demonstrate the effectiveness of our DCTNet, which outperforms previous state-of-the-art methods with a relatively small number of parameters. The code is available at \url{https://github.com/Zhaozixiang1228/GDSR-DCTNet}.

Zixiang Zhao, Jiangshe Zhang, Shuang Xu, Zudi Lin, Hanspeter Pfister• 2021

Related benchmarks

TaskDatasetResultRank
Depth Super-ResolutionNYU v2 (test)
RMSE13.33
136
Joint Depth Super-Resolution and DenoisingNYU v2 (test)
RMSE6.41
78
Depth Map Super-ResolutionRGB-D-D (test)
RMSE5.99
42
Saliency map super-resolutionDUT-OMRON
F-score98.74
26
Depth Super-ResolutionLu Bicubic downsampling synthetic (test)
RMSE (x4)0.88
20
Depth Super-ResolutionMiddlebury Bicubic downsampling synthetic (test)
RMSE (x4)1.1
20
Depth Super-ResolutionNYU Bicubic downsampling synthetic v2 (test)
RMSE (x4)1.59
20
Depth Super-ResolutionRGB-D-D Bicubic downsampling synthetic (test)
RMSE (4x)1.08
19
RGB-guided depth Super-ResolutionMiddlebury x4 scale 2006
RMSE2.16
17
Depth Super-ResolutionMiddlebury Nearest-neighbor downsampling synthetic (test)
RMSE (x4)1.81
17
Showing 10 of 27 rows

Other info

Follow for update