Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning

About

The volume of unlabelled Earth observation (EO) data is huge, but many important applications lack labelled training data. However, EO data offers the unique opportunity to pair data from different modalities and sensors automatically based on geographic location and time, at virtually no human labor cost. We seize this opportunity to create MMEarth, a diverse multi-modal pretraining dataset at global scale. Using this new corpus of 1.2 million locations, we propose a Multi-Pretext Masked Autoencoder (MP-MAE) approach to learn general-purpose representations for optical satellite images. Our approach builds on the ConvNeXt V2 architecture, a fully convolutional masked autoencoder (MAE). Drawing upon a suite of multi-modal pretext tasks, we demonstrate that our MP-MAE approach outperforms both MAEs pretrained on ImageNet and MAEs pretrained on domain-specific satellite images. This is shown on several downstream tasks including image classification and semantic segmentation. We find that pretraining with multi-modal pretext tasks notably improves the linear probing performance compared to pretraining on optical satellite images only. This also leads to better label efficiency and parameter efficiency which are crucial aspects in global scale applications.

Vishal Nedungadi, Ankit Kariryaa, Stefan Oehmcke, Serge Belongie, Christian Igel, Nico Lang• 2024

Related benchmarks

TaskDatasetResultRank
Semantic Segmentation (Cropland)AI4SmallFarms
mIoU37.85
42
Semantic Segmentation (Burn Scars)AI4SmallFarms
mIoU82.51
42
Segmentationm-SA crop-type
Mean mIoU38.2
27
Classificationm-so2sat GEO-Bench
Overall Accuracy54.6
22
Segmentationm-cashew GeoBench
mIoU79.8
14
Multi-Label Classificationm-bigearthnet GeoBench
F1 Score67.1
14
ClassificationBigEarthNet 20k
F1 Score67.1
8
ClassificationSo2Sat20k
Accuracy54.6
8
Semantic segmentationSAcrop3k
IoU38.2
4
Semantic segmentationCashew1k
IoU79.8
4
Showing 10 of 10 rows

Other info

Follow for update