Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion

About

RGB-D salient object detection (SOD) is usually formulated as a problem of classification or regression over two modalities, i.e., RGB and depth. Hence, effective RGBD feature modeling and multi-modal feature fusion both play a vital role in RGB-D SOD. In this paper, we propose a depth-sensitive RGB feature modeling scheme using the depth-wise geometric prior of salient objects. In principle, the feature modeling scheme is carried out in a depth-sensitive attention module, which leads to the RGB feature enhancement as well as the background distraction reduction by capturing the depth geometry prior. Moreover, to perform effective multi-modal feature fusion, we further present an automatic architecture search approach for RGB-D SOD, which does well in finding out a feasible architecture from our specially designed multi-modal multi-scale search space. Extensive experiments on seven standard benchmarks demonstrate the effectiveness of the proposed approach against the state-of-the-art.

Peng Sun, Wenhu Zhang, Huanyu Wang, Songyuan Li, Xi Li• 2021

Related benchmarks

TaskDatasetResultRank
RGB-D Salient Object DetectionSTERE
S-measure (Sα)0.897
198
RGB-D Salient Object DetectionNJU2K (test)
S-measure (Sα)0.903
137
Saliency Object DetectionSIP
F_beta Score0.891
79
Salient Object DetectionNLPR (test)
F-beta89.7
76
RGB-D Salient Object DetectionNLPR (test)
S-measure (Sα)91.8
71
Saliency DetectionNJUD (test)
MAE0.039
68
Salient Object DetectionNLPR
MAE0.024
52
Salient Object DetectionNJUD
MAE3.9
52
Salient Object DetectionRGBD135 (test)
Sx (Structure Measure)0.92
49
RGB saliency detectionLFSD (test)
F_beta Score88.2
43
Showing 10 of 15 rows

Other info

Follow for update