UNet 3+: A Full-Scale Connected UNet for Medical Image Segmentation
About
Recently, a growing interest has been seen in deep learning-based semantic segmentation. UNet, which is one of deep learning networks with an encoder-decoder architecture, is widely used in medical image segmentation. Combining multi-scale features is one of important factors for accurate segmentation. UNet++ was developed as a modified Unet by designing an architecture with nested and dense skip connections. However, it does not explore sufficient information from full scales and there is still a large room for improvement. In this paper, we propose a novel UNet 3+, which takes advantage of full-scale skip connections and deep supervisions. The full-scale skip connections incorporate low-level details with high-level semantics from feature maps in different scales; while the deep supervision learns hierarchical representations from the full-scale aggregated feature maps. The proposed method is especially benefiting for organs that appear at varying scales. In addition to accuracy improvements, the proposed UNet 3+ can reduce the network parameters to improve the computation efficiency. We further propose a hybrid loss function and devise a classification-guided module to enhance the organ boundary and reduce the over-segmentation in a non-organ image, yielding more accurate segmentation results. The effectiveness of the proposed method is demonstrated on two datasets. The code is available at: github.com/ZJUGiveLab/UNet-Version
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Retinal Vessel Segmentation | CHASE DB1 | Sensitivity (SE)85.74 | 47 | |
| Retinal Vessel Segmentation | STARE | F1 Score84.829 | 40 | |
| Semantic segmentation | CSDD original (test) | F1 Score (w/bg)72.12 | 34 | |
| Semantic segmentation | CSDD (test) | F1 Score (w/bg)78.5 | 34 | |
| Medical Image Segmentation | Spleen Task-5 | Dice Score0.8274 | 27 | |
| Medical Image Segmentation | Heart Task-1 | Dice Score87.52 | 26 | |
| Medical Image Segmentation | Prostate Task-3 | -- | 18 | |
| Semantic segmentation | CSDD without data preprocessing original (test) | F1 Score (w/bg)72.1 | 17 | |
| Semantic segmentation | Dataset original (test) | F1 Score (w/bg)72.12 | 17 | |
| Semantic segmentation | Dataset described in Section 4.5 100% 1.0 (train) | F1 Score (w/bg)78.5 | 17 |