Brain Imaging Generation with Latent Diffusion Models
About
Deep neural networks have brought remarkable breakthroughs in medical image analysis. However, due to their data-hungry nature, the modest dataset sizes in medical imaging projects might be hindering their full potential. Generating synthetic data provides a promising alternative, allowing to complement training datasets and conducting medical image research at a larger scale. Diffusion models recently have caught the attention of the computer vision community by producing photorealistic synthetic images. In this study, we explore using Latent Diffusion Models to generate synthetic images from high-resolution 3D brain images. We used T1w MRI images from the UK Biobank dataset (N=31,740) to train our models to learn about the probabilistic distribution of brain images, conditioned on covariables, such as age, sex, and brain structure volumes. We found that our models created realistic data, and we could use the conditioning variables to control the data generation effectively. Besides that, we created a synthetic dataset with 100,000 brain images and made it openly available to the scientific community.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Segmentation | BraTS 2020 | Dice0.81 | 36 | |
| 3D Medical Image Synthesis | 3D MRI (test) | FID0.359 | 36 | |
| Classification | BraTS 2020 (test) | AUROC0.885 | 18 | |
| Brain Age Prediction | Brain Age Age ≥ 44 (train) | Absolute Error0.98 | 15 | |
| Brain Age Prediction | Brain Age ≥ 44 (test) | Absolute Error5.3 | 15 | |
| Abdominal lymph node segmentation | ABD-LN (test) | DSC14.95 | 15 | |
| Region-Based Anatomical Plausibility | Brain MRIs 95 Regions of Interest (test) | iMAE46.93 | 11 | |
| Brain MRI Synthesis | Brain MRI 1000 samples (val) | Sex Accuracy79 | 8 | |
| Text-to-Image Generation | MIMIC-CXR Cardiomegaly | FID7.42 | 1 | |
| Text-to-Image Generation | MIMIC-CXR Pleural effusion | FID6.57 | 1 |