Spatial-Temporal Super-Resolution of Satellite Imagery via Conditional Pixel Synthesis
About
High-resolution satellite imagery has proven useful for a broad range of tasks, including measurement of global human population, local economic livelihoods, and biodiversity, among many others. Unfortunately, high-resolution imagery is both infrequently collected and expensive to purchase, making it hard to efficiently and effectively scale these downstream tasks over both time and space. We propose a new conditional pixel synthesis model that uses abundant, low-cost, low-resolution imagery to generate accurate high-resolution imagery at locations and times in which it is unavailable. We show that our model attains photo-realistic sample quality and outperforms competing baselines on a key downstream task -- object counting -- particularly in geographic locations where conditions on the ground are changing rapidly.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Future image generation | Texas housing data (test) | SSIM0.5338 | 6 | |
| Past image generation | Texas housing data (test) | SSIM0.657 | 6 | |
| Super-Resolution | fMoW-Sentinel2 crop field dataset | SSIM0.3905 | 6 | |
| Object Counting | Texas housing dataset (test) | R2 (Buildings, Mean)0.9174 | 4 | |
| Image Generation Quality | Texas housing dataset (test) | Selection Rate (Similarity)53.25 | 3 |