Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Revisiting Image Pyramid Structure for High Resolution Salient Object Detection

About

Salient object detection (SOD) has been in the spotlight recently, yet has been studied less for high-resolution (HR) images. Unfortunately, HR images and their pixel-level annotations are certainly more labor-intensive and time-consuming compared to low-resolution (LR) images and annotations. Therefore, we propose an image pyramid-based SOD framework, Inverse Saliency Pyramid Reconstruction Network (InSPyReNet), for HR prediction without any of HR datasets. We design InSPyReNet to produce a strict image pyramid structure of saliency map, which enables to ensemble multiple results with pyramid-based image blending. For HR prediction, we design a pyramid blending method which synthesizes two different image pyramids from a pair of LR and HR scale from the same image to overcome effective receptive field (ERF) discrepancy. Our extensive evaluations on public LR and HR SOD benchmarks demonstrate that InSPyReNet surpasses the State-of-the-Art (SotA) methods on various SOD metrics and boundary accuracy.

Taehun Kim, Kunhee Kim, Joonyeong Lee, Dongmin Cha, Jiho Lee, Daijin Kim• 2022

Related benchmarks

TaskDatasetResultRank
Salient Object DetectionDUTS (test)
M (MAE)0.024
302
Salient Object DetectionPASCAL-S (test)
MAE0.048
149
Salient Object DetectionHKU-IS (test)
MAE0.021
137
Salient Object DetectionDUT-OMRON
MAE0.045
120
Salient Object DetectionECSSD (test)
S-measure (Sa)0.949
104
Salient Object DetectionDUT-OMRON (test)
MAE0.045
92
Salient Object DetectionHRSOD (test)
F-beta0.949
65
Salient Object DetectionDAVIS S
F_beta95.9
36
Dichotomous Image SegmentationDIS5K (DIS-VD)
S_alpha0.9
30
Dichotomous Image SegmentationDIS5K (DIS-TE1)
S_alpha87.3
16
Showing 10 of 27 rows

Other info

Code

Follow for update