Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Part-aware Prompted Segment Anything Model for Adaptive Segmentation

About

Precision medicine, such as patient-adaptive treatments assisted by medical image analysis, poses new challenges for segmentation algorithms in adapting to new patients, due to the large variability across different patients and the limited availability of annotated data for each patient. In this work, we propose a data-efficient segmentation algorithm, namely Part-aware Prompted Segment Anything Model ($P^2SAM$). Without any model fine-tuning, $P^2SAM$ enables seamless adaptation to any new patients relying only on one-shot patient-specific data. We introduce a novel part-aware prompt mechanism to select multiple-point prompts based on the part-level features of the one-shot data, which can be extensively integrated into different promptable segmentation models, such as SAM and SAM 2. Moreover, to determine the optimal number of parts for each specific case, we propose a distribution-guided retrieval approach that further enhances the robustness of the part-aware prompt mechanism. $P^2SAM$ improves the performance by +8.0% and +2.0% mean Dice score for two different patient-adaptive segmentation applications, respectively. In addition, $P^2SAM$ also exhibits impressive generalizability in other adaptive segmentation tasks in the natural image domain, e.g., +6.4% mIoU within personalized object segmentation task. The code is available at: https://github.com/Zch0414/p2sam

Chenhui Zhao, Liyue Shen• 2024

Related benchmarks

TaskDatasetResultRank
Polyp SegmentationCVC-ClinicDB (test)
DSC88.76
196
Polyp SegmentationCVC-ClinicDB
Dice Coefficient88.76
81
Medical Image SegmentationCVC-ClinicDB
Dice Score88.76
68
Semantic segmentationCOCO 20i 1-shot--
41
NSCLC segmentation4D-Lung (test)
Mean Dice Score67.23
28
Semantic segmentationFSS-1000 1-shot
mIoU82.4
10
SegmentationCVC-ClinicDB (test)
mIoU88.76
6
Medical Image Segmentation4D-Lung
Dice Score67.23
5
Semantic segmentationLVIS-92i one-shot
mIoU13.7
3
Semantic segmentationPerSeg one-shot
mIoU95.7
3
Showing 10 of 10 rows

Other info

Code

Follow for update