Unified Cross-Modal Medical Image Synthesis with Hierarchical Mixture of Product-of-Experts
About
We propose a deep mixture of multimodal hierarchical variational auto-encoders called MMHVAE that synthesizes missing images from observed images in different modalities. MMHVAE's design focuses on tackling four challenges: (i) creating a complex latent representation of multimodal data to generate high-resolution images; (ii) encouraging the variational distributions to estimate the missing information needed for cross-modal image synthesis; (iii) learning to fuse multimodal information in the context of missing data; (iv) leveraging dataset-level information to handle incomplete data sets at training time. Extensive experiments are performed on the challenging problem of pre-operative brain multi-parametric magnetic resonance and intra-operative ultrasound imaging.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Brain Tumor Segmentation | BraTS 2023 (test) | -- | 49 | |
| MRI Synthesis | BraTS 2023 | PSNR (dB)24.76 | 38 | |
| MRI Synthesis | IXI | PSNR (dB)29.88 | 18 |