Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LEAF: Latent Diffusion with Efficient Encoder Distillation for Aligned Features in Medical Image Segmentation

About

Leveraging the powerful capabilities of diffusion models has yielded quite effective results in medical image segmentation tasks. However, existing methods typically transfer the original training process directly without specific adjustments for segmentation tasks. Furthermore, the commonly used pre-trained diffusion models still have deficiencies in feature extraction. Based on these considerations, we propose LEAF, a medical image segmentation model grounded in latent diffusion models. During the fine-tuning process, we replace the original noise prediction pattern with a direct prediction of the segmentation map, thereby reducing the variance of segmentation results. We also employ a feature distillation method to align the hidden states of the convolutional layers with the features from a transformer-based vision encoder. Experimental results demonstrate that our method enhances the performance of the original diffusion model across multiple segmentation datasets for different disease types. Notably, our approach does not alter the model architecture, nor does it increase the number of parameters or computation during the inference phase, making it highly efficient.

Qilin Huang, Tianyu Lin, Zhiguang Chen, Fudan Zheng• 2025

Related benchmarks

TaskDatasetResultRank
Medical Image SegmentationISIC 2018
Dice Score90.5
92
Medical Image SegmentationCVC-ClinicDB
Dice Score95.2
68
Medical Image SegmentationREF
Dice89.5
6
Medical Image SegmentationQaTa
Dice80.2
6
Showing 4 of 4 rows

Other info

Code

Follow for update