Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Towards Geospatial Foundation Models via Continual Pretraining

About

Geospatial technologies are becoming increasingly essential in our world for a wide range of applications, including agriculture, urban planning, and disaster response. To help improve the applicability and performance of deep learning models on these geospatial tasks, various works have begun investigating foundation models for this domain. Researchers have explored two prominent approaches for introducing such models in geospatial applications, but both have drawbacks in terms of limited performance benefit or prohibitive training cost. Therefore, in this work, we propose a novel paradigm for building highly effective geospatial foundation models with minimal resource cost and carbon impact. We first construct a compact yet diverse dataset from multiple sources to promote feature diversity, which we term GeoPile. Then, we investigate the potential of continual pretraining from large-scale ImageNet-22k models and propose a multi-objective continual pretraining paradigm, which leverages the strong representations of ImageNet while simultaneously providing the freedom to learn valuable in-domain features. Our approach outperforms previous state-of-the-art geospatial pretraining methods in an extensive evaluation on seven downstream datasets covering various tasks such as change detection, classification, multi-label classification, semantic segmentation, and super-resolution.

Matias Mendieta, Boran Han, Xingjian Shi, Yi Zhu, Chen Chen• 2023

Related benchmarks

TaskDatasetResultRank
Change DetectionLEVIR-CD
F1 Score91.73
188
Scene ClassificationAID TR=50%
Accuracy97.09
94
Scene ClassificationAID TR=20%
Accuracy95.47
93
Semantic segmentationiSAID
mIoU68.71
68
Scene ClassificationRESISC-45 (TR=10%)
Accuracy92.73
63
Change DetectionLEVIR
F1 Score89.8
62
Object DetectionDIOR
mAP5072.84
50
Oriented Object DetectionDIOR-R
mAP5067.67
44
ClassificationAID (test)
Top-1 Accuracy97.09
41
Scene ClassificationRESISC-45 (TR=20%)
Accuracy94.64
40
Showing 10 of 39 rows

Other info

Follow for update