Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MedVAR: Towards Scalable and Efficient Medical Image Generation via Next-scale Autoregressive Prediction

About

Medical image generation is pivotal in applications like data augmentation for low-resource clinical tasks and privacy-preserving data sharing. However, developing a scalable generative backbone for medical imaging requires architectural efficiency, sufficient multi-organ data, and principled evaluation, yet current approaches leave these aspects unresolved. Therefore, we introduce MedVAR, the first autoregressive-based foundation model that adopts the next-scale prediction paradigm to enable fast and scale-up-friendly medical image synthesis. MedVAR generates images in a coarse-to-fine manner and produces structured multi-scale representations suitable for downstream use. To support hierarchical generation, we curate a harmonized dataset of around 440,000 CT and MRI images spanning six anatomical regions. Comprehensive experiments across fidelity, diversity, and scalability show that MedVAR achieves state-of-the-art generative performance and offers a promising architectural direction for future medical generative foundation models.

Zhicheng He, Yunpeng Zhao, Junde Wu, Ziwei Niu, Zijun Li, Bohan Li, Lanfen Lin, Yueming Jin• 2026

Related benchmarks

TaskDatasetResultRank
Medical Image GenerationMedical Image Dataset
Efficiency Score11.94
25
Image GenerationBrain MRI
RadFID0.19
7
Medical Image GenerationMRI Medical Imaging (val)
KID (Brain)0.018
7
Medical Image GenerationCT Chest
RadFID0.08
6
Medical Image GenerationCT Medical Imaging (val)
KID (Chest)0.012
6
Medical Image GenerationCT Abdomen
RadFID0.05
5
Medical Image GenerationMRI Abdomen
RadFID0.11
5
Medical Image GenerationCT Heart
RadFID0.46
4
Medical Image GenerationCT Spine
RadFID0.07
4
Medical Image GenerationMRI Heart
RadFID0.25
4
Showing 10 of 12 rows

Other info

Follow for update