SpectralEarth: Training Hyperspectral Foundation Models at Scale

About

Foundation models have triggered a paradigm shift in computer vision and are increasingly being adopted in remote sensing, particularly for multispectral imagery. Yet, their potential in hyperspectral imaging (HSI) remains untapped due to the absence of comprehensive and globally representative hyperspectral datasets. To close this gap, we introduce SpectralEarth, a large-scale multitemporal dataset designed to pretrain hyperspectral foundation models leveraging data from the environmental mapping and analysis program (EnMAP). SpectralEarth comprises 538 974 image patches covering 415 153 unique locations from 11 636 globally distributed EnMAP scenes spanning two years of archive. In addition, 17.5% of these locations include multiple timestamps, enabling multitemporal HSI analysis. Utilizing state-of-the-art self-supervised learning algorithms, we pretrain a series of foundation models on SpectralEarth, integrating a spectral adapter into classical vision backbones to accommodate the unique characteristics of HSI. In tandem, we construct nine downstream datasets for land-cover, crop-type mapping, and tree-species classification, providing benchmarks for model evaluation. Experimental results support the versatility of our models and their generalizability across different tasks and sensors. We also highlight computational efficiency during model fine-tuning.

Nassim Ait Ali Braham, Conrad M Albrecht, Julien Mairal, Jocelyn Chanussot, Yi Wang, Xiao Xiang Zhu• 2024

Related benchmarks

Task	Dataset	Result
Land Cover Classification	SpectralEarth CORINE (test)	F1 Score78.67	18
Cloud Top Height (CTH) Regression	HyperFM 250k	MSE5.1727	14
Cloud Water Path (CWP) Regression	HyperFM250k	MSE1.2545	14
Cloud Effective Radius (CER) Regression	HyperFM250k	MSE84.287	14
Cloud Optical Thickness (COT) Regression	HyperFM250k	MSE0.3404	14
Scene Classification	HRSSC (test)	OA84.61	11
Semantic segmentation	EnMAP BD-Foret	mIoU69.7	11
Semantic segmentation	EnMAP NLCD	mIoU42.9	11
Semantic segmentation	EnMAP TreeMap	mIoU41.3	11
Semantic segmentation	GF-5 Wuhan	mIoU43.9	11

Showing 10 of 26 rows

Other info

Follow for update

@wizwand_team Discord