Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SpectralEarth: Training Hyperspectral Foundation Models at Scale

About

Foundation models have triggered a paradigm shift in computer vision and are increasingly being adopted in remote sensing, particularly for multispectral imagery. Yet, their potential in hyperspectral imaging (HSI) remains untapped due to the absence of comprehensive and globally representative hyperspectral datasets. To close this gap, we introduce SpectralEarth, a large-scale multitemporal dataset designed to pretrain hyperspectral foundation models leveraging data from the environmental mapping and analysis program (EnMAP). SpectralEarth comprises 538 974 image patches covering 415 153 unique locations from 11 636 globally distributed EnMAP scenes spanning two years of archive. In addition, 17.5% of these locations include multiple timestamps, enabling multitemporal HSI analysis. Utilizing state-of-the-art self-supervised learning algorithms, we pretrain a series of foundation models on SpectralEarth, integrating a spectral adapter into classical vision backbones to accommodate the unique characteristics of HSI. In tandem, we construct nine downstream datasets for land-cover, crop-type mapping, and tree-species classification, providing benchmarks for model evaluation. Experimental results support the versatility of our models and their generalizability across different tasks and sensors. We also highlight computational efficiency during model fine-tuning.

Nassim Ait Ali Braham, Conrad M Albrecht, Julien Mairal, Jocelyn Chanussot, Yi Wang, Xiao Xiang Zhu• 2024

Related benchmarks

TaskDatasetResultRank
Land Cover ClassificationSpectralEarth CORINE (test)
F1 Score78.67
18
Cloud Top Height (CTH) RegressionHyperFM 250k
MSE5.1727
14
Cloud Water Path (CWP) RegressionHyperFM250k
MSE1.2545
14
Cloud Effective Radius (CER) RegressionHyperFM250k
MSE84.287
14
Cloud Optical Thickness (COT) RegressionHyperFM250k
MSE0.3404
14
Scene ClassificationHRSSC (test)
OA84.61
11
Semantic segmentationEnMAP BD-Foret
mIoU69.7
11
Semantic segmentationEnMAP NLCD
mIoU42.9
11
Semantic segmentationEnMAP TreeMap
mIoU41.3
11
Semantic segmentationGF-5 Wuhan
mIoU43.9
11
Showing 10 of 24 rows

Other info

Follow for update