ProtoSAM: One-Shot Medical Image Segmentation With Foundational Models
About
This work introduces a new framework, ProtoSAM, for one-shot medical image segmentation. It combines the use of prototypical networks, known for few-shot segmentation, with SAM - a natural image foundation model. The method proposed creates an initial coarse segmentation mask using the ALPnet prototypical network, augmented with a DINOv2 encoder. Following the extraction of an initial mask, prompts are extracted, such as points and bounding boxes, which are then input into the Segment Anything Model (SAM). State-of-the-art results are shown on several medical image datasets and demonstrate automated segmentation capabilities using a single image example (one shot) with no need for fine-tuning of the foundation model. Our code is available at: https://github.com/levayz/ProtoSAM
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Polyp Segmentation | Kvasir | Dice Score81.54 | 143 | |
| Polyp Segmentation | CVC-ClinicDB | Dice Coefficient76.83 | 96 | |
| Polyp Segmentation | CVC-ColonDB | mDice63.45 | 81 | |
| Medical Image Segmentation | Abd-MRI | Dice (LK)82.75 | 41 | |
| Medical Image Segmentation | Abd-CT | Dice (LK)71.31 | 25 | |
| Polyp Segmentation | PolypGen Data Centre 2 | Dice63.03 | 20 | |
| Medical Image Segmentation | CHAOS-MRI | Spleen Score76.51 | 15 | |
| Polyp Segmentation | PolypGen C3 | mIoU69.79 | 14 | |
| Medical Image Segmentation | Skin-DS | Mel Score75.33 | 13 | |
| Medical Image Segmentation | Synapse-CT | Spleen Score65.5 | 8 |