SAM-Med3D: Towards General-purpose Segmentation Models for Volumetric Medical Images
About
Existing volumetric medical image segmentation models are typically task-specific, excelling at specific target but struggling to generalize across anatomical structures or modalities. This limitation restricts their broader clinical use. In this paper, we introduce SAM-Med3D for general-purpose segmentation on volumetric medical images. Given only a few 3D prompt points, SAM-Med3D can accurately segment diverse anatomical structures and lesions across various modalities. To achieve this, we gather and process a large-scale 3D medical image dataset, SA-Med3D-140K, from a blend of public sources and licensed private datasets. This dataset includes 22K 3D images and 143K corresponding 3D masks. Then SAM-Med3D, a promptable segmentation model characterized by the fully learnable 3D structure, is trained on this dataset using a two-stage procedure and exhibits impressive performance on both seen and unseen segmentation targets. We comprehensively evaluate SAM-Med3D on 16 datasets covering diverse medical scenarios, including different anatomical structures, modalities, targets, and zero-shot transferability to new/unseen tasks. The evaluation shows the efficiency and efficacy of SAM-Med3D, as well as its promising application to diverse downstream tasks as a pre-trained model. Our approach demonstrates that substantial medical resources can be utilized to develop a general-purpose medical AI for various potential applications. Our dataset, code, and models are available at https://github.com/uni-medical/SAM-Med3D.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Segmentation | Pancreas-CT (test) | Dice79.37 | 44 | |
| Aortic Dissection Segmentation | ImageTBAD (test) | True Lumen DSC33.56 | 33 | |
| Organ Segmentation | PET images | Liver DSC (%)85.56 | 22 | |
| Interactive Organ Segmentation | PET Invisible Organs (train) | DSC (Aorta)31.92 | 18 | |
| 3D Liver Segmentation | CBCT 3D (test) | F1 Score65.3 | 15 | |
| Interactive Segmentation | AutoPET-Organ (unseen out-of-distribution) | Liver Performance78.47 | 12 | |
| Tumor Segmentation | AutoPET unseen out-of-distribution | Tumor DSC16.32 | 12 | |
| Organ Segmentation | FLARE22 | DSC Liv.85.4 | 9 | |
| Multimodal Segmentation | MSD Colon | DSC48.21 | 8 | |
| Multimodal Segmentation | KITS | DSC67.15 | 8 |