RSEdit: Text-Guided Image Editing for Remote Sensing
About
In this paper, we explore text-guided image editing in the remote sensing domain using generative modeling. We propose \rsedit, a collection of models from U-Net to DiT with various configurations. Specifically, we present the first comprehensive study of conditioning strategies for building image editing models from off-the-shelf text-to-image ones. Our experiments show that \rsedit achieves the best instruction-faithful edits while preserving geospatial structure. We release the code at \url{https://github.com/Bili-Sakura/RSEdit-Preview} and checkpoints at \url{https://huggingface.co/collections/BiliSakura/rsedit}.
Chen Zhenyuan, Zhang Zechuan, Zhang Feng• 2026
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic Change Generation | RSCC re-split (300 images) (test) | F1dam34.11 | 9 | |
| Text-Guided Image Editing | SECOND-CC (test) | SC4.6 | 4 | |
| Text-Guided Image Editing | LEVIR-CC (test) | SC3.88 | 4 |
Showing 3 of 3 rows