Ambiguous Medical Image Segmentation using Diffusion Models
About
Collective insights from a group of experts have always proven to outperform an individual's best diagnostic for clinical tasks. For the task of medical image segmentation, existing research on AI-based alternatives focuses more on developing models that can imitate the best individual rather than harnessing the power of expert groups. In this paper, we introduce a single diffusion model-based approach that produces multiple plausible outputs by learning a distribution over group insights. Our proposed model generates a distribution of segmentation masks by leveraging the inherent stochastic sampling process of diffusion using only minimal additional learning. We demonstrate on three different medical image modalities- CT, ultrasound, and MRI that our model is capable of producing several possible variants while capturing the frequencies of their occurrences. Comprehensive results show that our proposed approach outperforms existing state-of-the-art ambiguous segmentation networks in terms of accuracy while preserving naturally occurring variation. We also propose a new metric to evaluate the diversity as well as the accuracy of segmentation predictions that aligns with the interest of clinical practice of collective insights.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Medical Image Segmentation | LIDC-IDRI (test) | GED0.297 | 24 | |
| Medical Image Segmentation | MMIS (test) | GED0.235 | 12 | |
| Ambiguous Segmentation | LIDC (test) | HM IoU59.2 | 12 | |
| Biomedical Image Segmentation | QUBIQ Prostate 1 2021 (test) | Soft Dice91.64 | 10 | |
| Biomedical Image Segmentation | QUBIQ Kidney 2021 (test) | Soft Dice68.53 | 10 | |
| Biomedical Image Segmentation | QUBIQ Prostate 2 2021 (test) | Soft Dice0.2191 | 10 | |
| Biomedical Image Segmentation | QUBIQ Tumor 2021 (test) | Soft Dice0.9295 | 9 | |
| Biomedical Image Segmentation | QUBIQ Brain 2021 (test) | Soft Dice74.09 | 9 | |
| Medical Image Segmentation | Stanford COCA (test) | GED0.625 | 5 | |
| Medical Image Segmentation | RACER Home (test) | GED0.758 | 5 |