Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation

About

Self-supervised pre-training bears potential to generate expressive representations without human annotation. Most pre-training in Earth observation (EO) are based on ImageNet or medium-size, labeled remote sensing (RS) datasets. We share an unlabeled RS dataset SSL4EO-S12 (Self-Supervised Learning for Earth Observation - Sentinel-1/2) to assemble a large-scale, global, multimodal, and multi-seasonal corpus of satellite imagery from the ESA Sentinel-1 \& -2 satellite missions. For EO applications we demonstrate SSL4EO-S12 to succeed in self-supervised pre-training for a set of methods: MoCo-v2, DINO, MAE, and data2vec. Resulting models yield downstream performance close to, or surpassing accuracy measures of supervised learning. In addition, pre-training on SSL4EO-S12 excels compared to existing datasets. We make openly available the dataset, related source code, and pre-trained models at https://github.com/zhu-xlab/SSL4EO-S12.

Yi Wang, Nassim Ait Ali Braham, Zhitong Xiong, Chenying Liu, Conrad M Albrecht, Xiao Xiang Zhu• 2022

Related benchmarks

TaskDatasetResultRank
Change DetectionLEVIR-CD
F1 Score89.05
275
Semantic segmentationiSAID
mIoU64.01
146
Semantic segmentationPotsdam
mIoU63.633
110
Object DetectionDIOR
mAP5064.82
62
Change DetectionOSCD
F1 Score35.08
51
Semantic segmentationSen1Floods11
mIoU (macro)86.71
45
Semantic segmentationMADOS
mIoU60.773
42
ClassificationAID (test)
Top-1 Accuracy94.74
41
Object DetectionDIOR-R
mAP61.23
35
Scene ClassificationRESISC-45 (test)
OA91.27
32
Showing 10 of 23 rows

Other info

Code

Follow for update