Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis

About

We present GeoSynth, a model for synthesizing satellite images with global style and image-driven layout control. The global style control is via textual prompts or geographic location. These enable the specification of scene semantics or regional appearance respectively, and can be used together. We train our model on a large dataset of paired satellite imagery, with automatically generated captions, and OpenStreetMap data. We evaluate various combinations of control inputs, including different types of layout controls. Results demonstrate that our model can generate diverse, high-quality images and exhibits excellent zero-shot generalization. The code and model checkpoints are available at https://github.com/mvrl/GeoSynth.

Srikumar Sastry, Subash Khanal, Aayush Dhakal, Nathan Jacobs• 2024

Related benchmarks

TaskDatasetResultRank
AI-generated image detectionGit-Rand-15k (test)
F1 Score (Fake Class)99.81
14
AI-generated image detectionGit-Spatial-15k (test)
F1 Score (Fake Class)99.84
14
Satellite image generationGit-Rand 15k
FID45.59
14
Satellite image generationGit-Spatial 15k
FID85.8
14
Satellite image generationfMoW
FID70.55
13
AI-generated image detectionfMoW (test)
F1 Score (Fake Class)99.8
12
AI-generated image detectionRSICD (test)
F1 Score (Fake Class)99.44
12
Satellite image generationRSICD
FID47.84
12
Satellite Image SynthesisSatellite Imagery OSM-to-Image
Time/batch (ms)923
4
InpaintingGit-Rand-15k (test)
FID21.3
4
Showing 10 of 10 rows

Other info

Follow for update