DerMAE: Improving skin lesion classification through conditioned latent diffusion and MAE distillation
About
Skin lesion classification datasets often suffer from severe class imbalance, with malignant cases significantly underrepresented, leading to biased decision boundaries during deep learning training. We address this challenge using class-conditioned diffusion models to generate synthetic dermatological images, followed by self-supervised MAE pretraining to enable huge ViT models to learn robust, domain-relevant features. To support deployment in practical clinical settings, where lightweight models are required, we apply knowledge distillation to transfer these representations to a smaller ViT student suitable for mobile devices. Our results show that MAE pretraining on synthetic data, combined with distillation, improves classification performance while enabling efficient on-device inference for practical clinical use.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Skin lesion classification | HAM10000 (test) | Accuracy90.51 | 83 | |
| Categorical skin lesion classification | HAM10000 original (test) | Accuracy0.8915 | 12 |