Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Depth-aware CNN for RGB-D Segmentation

About

Convolutional neural networks (CNN) are limited by the lack of capability to handle geometric information due to the fixed grid kernel structure. The availability of depth data enables progress in RGB-D semantic segmentation with CNNs. State-of-the-art methods either use depth as additional images or process spatial information in 3D volumes or point clouds. These methods suffer from high computation and memory cost. To address these issues, we present Depth-aware CNN by introducing two intuitive, flexible and effective operations: depth-aware convolution and depth-aware average pooling. By leveraging depth similarity between pixels in the process of information propagation, geometry is seamlessly incorporated into CNN. Without introducing any additional parameters, both operators can be easily integrated into existing CNNs. Extensive experiments and ablation studies on challenging RGB-D semantic segmentation benchmarks validate the effectiveness and flexibility of our approach.

Weiyue Wang, Ulrich Neumann• 2018

Related benchmarks

TaskDatasetResultRank
Semantic segmentationNYU v2 (test)
mIoU43.9
248
Semantic segmentationSUN RGB-D (test)
mIoU42
191
Semantic segmentationNYUD v2 (test)
mIoU48.4
187
Semantic segmentationNYU Depth V2 (test)
mIoU43.9
172
Semantic segmentationNYUDv2 40-class (test)
mIoU48.4
99
Semantic segmentationSUN-RGBD (test)
mIoU42
77
Semantic segmentationMFNet nighttime (test)
mIoU43.2
42
Scene ParsingNYUDv2 (test)
mIoU43.9
35
Semantic segmentationMFNet daytime (test)
mIoU42.4
30
Semantic segmentationSUN-RGBD 37 classes (test)
mIoU42
28
Showing 10 of 15 rows

Other info

Follow for update