CRS-Diff: Controllable Remote Sensing Image Generation with Diffusion Model
About
The emergence of generative models has revolutionized the field of remote sensing (RS) image generation. Despite generating high-quality images, existing methods are limited in relying mainly on text control conditions, and thus do not always generate images accurately and stably. In this paper, we propose CRS-Diff, a new RS generative framework specifically tailored for RS image generation, leveraging the inherent advantages of diffusion models while integrating more advanced control mechanisms. Specifically, CRS-Diff can simultaneously support text-condition, metadata-condition, and image-condition control inputs, thus enabling more precise control to refine the generation process. To effectively integrate multiple condition control information, we introduce a new conditional control mechanism to achieve multi-scale feature fusion, thus enhancing the guiding effect of control conditions. To our knowledge, CRS-Diff is the first multiple-condition controllable RS generative model. Experimental results in single-condition and multiple-condition cases have demonstrated the superior ability of our CRS-Diff to generate RS images both quantitatively and qualitatively compared with previous methods. Additionally, our CRS-Diff can serve as a data engine that generates high-quality training data for downstream tasks, e.g., road extraction. The code is available at https://github.com/Sonettoo/CRS-Diff.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Satellite image generation | Git-Spatial 15k | FID93.13 | 14 | |
| AI-generated image detection | Git-Spatial-15k (test) | F1 Score (Fake Class)98.25 | 14 | |
| Satellite image generation | Git-Rand 15k | FID58.13 | 14 | |
| AI-generated image detection | Git-Rand-15k (test) | F1 Score (Fake Class)96.55 | 14 | |
| Satellite image generation | fMoW | FID45.06 | 13 | |
| Text-to-Image Generation | RSICD | FID50.72 | 13 | |
| AI-generated image detection | RSICD (test) | F1 Score (Fake Class)99.53 | 12 | |
| Satellite image generation | RSICD | FID32.99 | 12 | |
| AI-generated image detection | fMoW (test) | F1 Score (Fake Class)96.98 | 12 |