SDXL-Lightning: Progressive Adversarial Diffusion Distillation
About
We propose a diffusion distillation method that achieves new state-of-the-art in one-step/few-step 1024px text-to-image generation based on SDXL. Our method combines progressive and adversarial distillation to achieve a balance between quality and mode coverage. In this paper, we discuss the theoretical analysis, discriminator design, model formulation, and training techniques. We open-source our distilled SDXL-Lightning models both as LoRA and full UNet weights.
Shanchuan Lin, Anran Wang, Xiao Yang• 2024
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Text-to-Image Generation | GenEval | Overall Score53 | 467 | |
| Text-to-Image Generation | GenEval (test) | Two Obj. Acc61 | 169 | |
| Text-to-Image Generation | HPSv2 | HPSv2 Score32.18 | 35 | |
| Text-to-Image Generation | COCO 30k | FID24.46 | 29 | |
| Text-to-Image Generation | COCO 2014 (val) | -- | 25 | |
| Text-to-Image Generation | CompBench | CompBench Score0.4445 | 12 | |
| Text-to-Image Generation | SDXL | FID28.48 | 10 | |
| Text-to-Image Synthesis | COCO 10K prompts 2014 | FID23.92 | 10 | |
| Text-to-Image Generation | COCO 10k-sample 2017 | Precision (P)89 | 8 | |
| Text-to-Image Generation | LAION | P Score0.88 | 8 |
Showing 10 of 14 rows