Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation

About

Image segmentation plays an important role in vision understanding. Recently, the emerging vision foundation models continuously achieved superior performance on various tasks. Following such success, in this paper, we prove that the Segment Anything Model 2 (SAM2) can be a strong encoder for U-shaped segmentation models. We propose a simple but effective framework, termed SAM2-UNet, for versatile image segmentation. Specifically, SAM2-UNet adopts the Hiera backbone of SAM2 as the encoder, while the decoder uses the classic U-shaped design. Additionally, adapters are inserted into the encoder to allow parameter-efficient fine-tuning. Preliminary experiments on various downstream tasks, such as camouflaged object detection, salient object detection, marine animal segmentation, mirror detection, and polyp segmentation, demonstrate that our SAM2-UNet can simply beat existing specialized state-of-the-art methods without bells and whistles. Project page: \url{https://github.com/WZH0120/SAM2-UNet}.

Xinyu Xiong, Zihuang Wu, Shuangyi Tan, Wenxue Li, Feilong Tang, Ying Chen, Siying Li, Jie Ma, Guanbin Li• 2024

Related benchmarks

TaskDatasetResultRank
Salient Object DetectionDUTS (test)
M (MAE)0.02
302
Salient Object DetectionECSSD
MAE0.02
202
Polyp SegmentationCVC-ClinicDB (test)
DSC90.7
196
Salient Object DetectionPASCAL-S
MAE0.043
186
Salient Object DetectionHKU-IS
MAE0.019
155
Salient Object DetectionDUT-OMRON
MAE0.039
120
Polyp SegmentationKvasir-SEG (test)
mIoU0.879
87
Polyp SegmentationETIS (test)
Mean Dice79.6
86
Camouflaged Object DetectionCAMO (test)
S_alpha0.884
85
Medical Image SegmentationKvasir-SEG (test)
mIoU87.9
78
Showing 10 of 59 rows

Other info

Code

Follow for update