Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion

About

RGB-D salient object detection (SOD) is usually formulated as a problem of classification or regression over two modalities, i.e., RGB and depth. Hence, effective RGBD feature modeling and multi-modal feature fusion both play a vital role in RGB-D SOD. In this paper, we propose a depth-sensitive RGB feature modeling scheme using the depth-wise geometric prior of salient objects. In principle, the feature modeling scheme is carried out in a depth-sensitive attention module, which leads to the RGB feature enhancement as well as the background distraction reduction by capturing the depth geometry prior. Moreover, to perform effective multi-modal feature fusion, we further present an automatic architecture search approach for RGB-D SOD, which does well in finding out a feasible architecture from our specially designed multi-modal multi-scale search space. Extensive experiments on seven standard benchmarks demonstrate the effectiveness of the proposed approach against the state-of-the-art.

Peng Sun, Wenhu Zhang, Huanyu Wang, Songyuan Li, Xi Li• 2021

Related benchmarks

Task	Dataset	Result
RGB-D Salient Object Detection	STERE	S-measure (Sα)0.897	232
RGB-D Salient Object Detection	NJU2K (test)	S-measure (Sα)0.903	137
Saliency Object Detection	SIP	F_beta Score0.891	79
Salient Object Detection	NLPR (test)	F-beta89.7	76
RGB-D Salient Object Detection	NLPR (test)	S-measure (Sα)91.8	71
Saliency Detection	NJUD (test)	MAE0.039	68
Salient Object Detection	NLPR	MAE0.024	52
Salient Object Detection	NJUD	MAE3.9	52
Salient Object Detection	RGBD135 (test)	Sx (Structure Measure)0.92	49
RGB saliency detection	LFSD (test)	F_beta Score88.2	43

Showing 10 of 15 rows

Other info

Follow for update

@wizwand_team Discord