Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments

About

Semantic scene understanding is essential for mobile agents acting in various environments. Although semantic segmentation already provides a lot of information, details about individual objects as well as the general scene are missing but required for many real-world applications. However, solving multiple tasks separately is expensive and cannot be accomplished in real time given limited computing and battery capabilities on a mobile platform. In this paper, we propose an efficient multi-task approach for RGB-D scene analysis~(EMSANet) that simultaneously performs semantic and instance segmentation~(panoptic segmentation), instance orientation estimation, and scene classification. We show that all tasks can be accomplished using a single neural network in real time on a mobile platform without diminishing performance - by contrast, the individual tasks are able to benefit from each other. In order to evaluate our multi-task approach, we extend the annotations of the common RGB-D indoor datasets NYUv2 and SUNRGB-D for instance segmentation and orientation estimation. To the best of our knowledge, we are the first to provide results in such a comprehensive multi-task setting for indoor scene analysis on NYUv2 and SUNRGB-D.

Daniel Seichter, S\"ohnke Benedikt Fischedick, Mona K\"ohler, Horst-Michael Gro{\ss}• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationNYU v2 (test)
mIoU56.55
248
Semantic segmentationScanNet (val)
mIoU64.11
231
Surface Normal EstimationNYU v2 (test)--
206
Semantic segmentationSUN RGB-D (test)
mIoU49.31
191
Semantic segmentationNYU Depth V2 (test)
mIoU51
172
Semantic segmentationSUN-RGBD (test)
mIoU48.4
77
Semantic segmentationNYU V2
mIoU59
74
Semantic segmentationSUN RGB-D
mIoU48.4
45
Instance SegmentationScanNet (val)--
39
Semantic segmentationNYU Depth V2
mIoU51
27
Showing 10 of 31 rows

Other info

Code

Follow for update