Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion
About
RGB-D salient object detection (SOD) is usually formulated as a problem of classification or regression over two modalities, i.e., RGB and depth. Hence, effective RGBD feature modeling and multi-modal feature fusion both play a vital role in RGB-D SOD. In this paper, we propose a depth-sensitive RGB feature modeling scheme using the depth-wise geometric prior of salient objects. In principle, the feature modeling scheme is carried out in a depth-sensitive attention module, which leads to the RGB feature enhancement as well as the background distraction reduction by capturing the depth geometry prior. Moreover, to perform effective multi-modal feature fusion, we further present an automatic architecture search approach for RGB-D SOD, which does well in finding out a feasible architecture from our specially designed multi-modal multi-scale search space. Extensive experiments on seven standard benchmarks demonstrate the effectiveness of the proposed approach against the state-of-the-art.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| RGB-D Salient Object Detection | STERE | S-measure (Sα)0.897 | 198 | |
| RGB-D Salient Object Detection | NJU2K (test) | S-measure (Sα)0.903 | 137 | |
| Saliency Object Detection | SIP | F_beta Score0.891 | 79 | |
| Salient Object Detection | NLPR (test) | F-beta89.7 | 76 | |
| RGB-D Salient Object Detection | NLPR (test) | S-measure (Sα)91.8 | 71 | |
| Saliency Detection | NJUD (test) | MAE0.039 | 68 | |
| Salient Object Detection | NLPR | MAE0.024 | 52 | |
| Salient Object Detection | NJUD | MAE3.9 | 52 | |
| Salient Object Detection | RGBD135 (test) | Sx (Structure Measure)0.92 | 49 | |
| RGB saliency detection | LFSD (test) | F_beta Score88.2 | 43 |