HyperFree: A Channel-adaptive and Tuning-free Foundation Model for Hyperspectral Remote Sensing Imagery
About
Advanced interpretation of hyperspectral remote sensing images benefits many precise Earth observation tasks. Recently, visual foundation models have promoted the remote sensing interpretation but concentrating on RGB and multispectral images. Due to the varied hyperspectral channels,existing foundation models would face image-by-image tuning situation, imposing great pressure on hardware and time resources. In this paper, we propose a tuning-free hyperspectral foundation model called HyperFree, by adapting the existing visual prompt engineering. To process varied channel numbers, we design a learned weight dictionary covering full-spectrum from $0.4 \sim 2.5 \, \mu\text{m}$, supporting to build the embedding layer dynamically. To make the prompt design more tractable, HyperFree can generate multiple semantic-aware masks for one prompt by treating feature distance as semantic-similarity. After pre-training HyperFree on constructed large-scale high-resolution hyperspectral images, HyperFree (1 prompt) has shown comparable results with specialized models (5 shots) on 5 tasks and 11 datasets.Code and dataset are accessible at https://rsidea.whu.edu.cn/hyperfree.htm.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cloud Top Height (CTH) Regression | HyperFM 250k | MSE6.4175 | 14 | |
| Cloud Water Path (CWP) Regression | HyperFM250k | MSE1.3451 | 14 | |
| Cloud Effective Radius (CER) Regression | HyperFM250k | MSE86.8931 | 14 | |
| Cloud Optical Thickness (COT) Regression | HyperFM250k | MSE0.3523 | 14 | |
| Hyperspectral Salient Object Detection | HSSOD | AUC95.9 | 7 | |
| Hyperspectral Semantic Segmentation | HSI Drive v2.0 (test) | Accuracy (mu)96.56 | 7 | |
| Hyperspectral Semantic Segmentation | Hyperspectral City 2.0 (test) | Mean Accuracy (mu)90.14 | 7 | |
| Urban scene semantic segmentation | HSICity (test) | mIoU44.6 | 7 |