A Spatial-Spectral-Frequency Interactive Network for Multimodal Remote Sensing Classification
About
Deep learning-based methods have achieved significant success in remote sensing Earth observation data analysis. Numerous feature fusion techniques address multimodal remote sensing image classification by integrating global and local features. However, these techniques often struggle to extract structural and detail features from heterogeneous and redundant multimodal images. With the goal of introducing frequency domain learning to model key and sparse detail features, this paper introduces the spatial-spectral-frequency interaction network (S$^2$Fin), which integrates pairwise fusion modules across the spatial, spectral, and frequency domains. Specifically, we propose a high-frequency sparse enhancement transformer that employs sparse spatial-spectral attention to optimize the parameters of the high-frequency filter. Subsequently, a two-level spatial-frequency fusion strategy is introduced, comprising an adaptive frequency channel module that fuses low-frequency structures with enhanced high-frequency details, and a high-frequency resonance mask that emphasizes sharp edges via phase similarity. In addition, a spatial-spectral attention fusion module further enhances feature extraction at intermediate layers of the network. Experiments on four benchmark multimodal datasets with limited labeled data demonstrate that S$^2$Fin performs superior classification, outperforming state-of-the-art methods. The code is available at https://github.com/HaoLiu-XDU/SSFin.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Remote Sensing Image Classification | Augsburg | Parameters (M)0.63 | 20 | |
| Remote Sensing Image Classification | Yellow River Estuary | Params (M)0.7 | 20 | |
| Remote Sensing Image Classification | LCZ HK | Params (M)0.65 | 20 | |
| Multimodal Remote Sensing Classification | Yellow River Estuary | Overall Accuracy (OA)67.54 | 12 | |
| Classification | LCZ HK (test) | Overall Accuracy (OA)72.26 | 9 | |
| Hyperspectral Image Classification | Houston 2013 (test) | Overall Accuracy (OA)89.19 | 9 | |
| Land Cover Classification | Augsburg 10 train samples per class (test) | Overall Accuracy (OA)79.91 | 9 | |
| Classification | Yellow River Estuary (test) | Accuracy (Spartina Alterniflora)75.71 | 9 | |
| Image Classification | Houston 2013 | Parameters (M)0.7 | 8 | |
| Multimodal Remote Sensing Classification | Houston 2013 | Overall Accuracy89.19 | 2 |