MAISI: Medical AI for Synthetic Imaging
About
Medical imaging analysis faces challenges such as data scarcity, high annotation costs, and privacy concerns. This paper introduces the Medical AI for Synthetic Imaging (MAISI), an innovative approach using the diffusion model to generate synthetic 3D computed tomography (CT) images to address those challenges. MAISI leverages the foundation volume compression network and the latent diffusion model to produce high-resolution CT images (up to a landmark volume dimension of 512 x 512 x 768 ) with flexible volume dimensions and voxel spacing. By incorporating ControlNet, MAISI can process organ segmentation, including 127 anatomical structures, as additional conditions and enables the generation of accurately annotated synthetic images that can be used for various downstream tasks. Our experiment results show that MAISI's capabilities in generating realistic, anatomically accurate images for diverse regions and conditions reveal its promising potential to mitigate challenges using synthetic data.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Synthesis | TotalSegmentator | SSIM75.5 | 18 | |
| Medical Image Synthesis | VoCo 10k (train/test) | FID0.9139 | 16 | |
| 3D CT Segmentation | Task06 Lung | DSC71 | 10 | |
| Autoencoder Reconstruction | BrainScape (test) | LPIPS0.0415 | 10 | |
| 3D CT Segmentation | Task07 Pancreas | Dice Similarity Coefficient (DSC)80.3 | 10 | |
| 3D CT Segmentation | LiTS | Dice Score 1 (DSC1)93.6 | 10 | |
| 3D CT Segmentation | KiTS 19 | DSC194.4 | 10 | |
| 3D CT Reconstruction | Task07 Pancreas | PSNR36.97 | 9 | |
| 3D CT Reconstruction | LiTS | PSNR37.35 | 9 | |
| 3D CT Reconstruction | KiTS19 | PSNR37.08 | 9 |