DemoFusion: Democratising High-Resolution Image Generation With No $$$
About
High-resolution image generation with Generative Artificial Intelligence (GenAI) has immense potential but, due to the enormous capital investment required for training, it is increasingly centralised to a few large corporations, and hidden behind paywalls. This paper aims to democratise high-resolution GenAI by advancing the frontier of high-resolution generation while remaining accessible to a broad audience. We demonstrate that existing Latent Diffusion Models (LDMs) possess untapped potential for higher-resolution image generation. Our novel DemoFusion framework seamlessly extends open-source GenAI models, employing Progressive Upscaling, Skip Residual, and Dilated Sampling mechanisms to achieve higher-resolution image generation. The progressive nature of DemoFusion requires more passes, but the intermediate results can serve as "previews", facilitating rapid prompt iteration.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| High-Resolution Image Generation | LAION-5B 3x3 scaling factor (test) | FID68.82 | 7 | |
| High-Resolution Image Generation | LAION-5B 4x4 scaling factor (test) | FID65.89 | 7 | |
| High-Resolution Image Generation | LAION 5B 2x2 scaling factor (test) | FID63.24 | 7 | |
| High-Resolution Image Generation | High-resolution Image Generation | FID_r81.69 | 6 | |
| High-Resolution Image Generation | Resolution 4096 x 4096 | FID74.75 | 5 |