Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Bilateral Reference for High-Resolution Dichotomous Image Segmentation

About

We introduce a novel bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS). It comprises two essential components: the localization module (LM) and the reconstruction module (RM) with our proposed bilateral reference (BiRef). The LM aids in object localization using global semantic information. Within the RM, we utilize BiRef for the reconstruction process, where hierarchical patches of images provide the source reference and gradient maps serve as the target reference. These components collaborate to generate the final predicted maps. We also introduce auxiliary gradient supervision to enhance focus on regions with finer details. Furthermore, we outline practical training strategies tailored for DIS to improve map quality and training process. To validate the general applicability of our approach, we conduct extensive experiments on four tasks to evince that BiRefNet exhibits remarkable performance, outperforming task-specific cutting-edge methods across all benchmarks. Our codes are available at https://github.com/ZhengPeng7/BiRefNet.

Peng Zheng, Dehong Gao, Deng-Ping Fan, Li Liu, Jorma Laaksonen, Wanli Ouyang, Nicu Sebe• 2024

Related benchmarks

TaskDatasetResultRank
Salient Object DetectionDUTS (test)
M (MAE)0.018
357
Camouflaged Object DetectionCOD10K
S-measure (S_alpha)0.913
217
Salient Object DetectionDUT-OMRON
MAE0.038
137
Camouflaged Object DetectionNC4K (test)
Sm0.914
89
Salient Object DetectionHRSOD (test)
F-beta0.963
78
Camouflaged Object DetectionCAMO 250 (test)
M (Mean Score)0.03
65
Salient Object DetectionDAVIS S
F_beta97.9
49
Concealed Object DetectionNC4K
M2.3
46
Camouflaged Object DetectionCHAMELEON 76 (test)
Sm0.932
44
Dichotomous Image SegmentationDIS5K (DIS-VD)
F_beta (Weighted)0.854
44
Showing 10 of 51 rows

Other info

Code

Follow for update