Dual-stage Hyperspectral Image Classification Model with Spectral Supertoken
About
Hyperspectral image classification, a task that assigns pre-defined classes to each pixel in a hyperspectral image of remote sensing scenes, often faces challenges due to the neglect of correlations between spectrally similar pixels. This oversight can lead to inaccurate edge definitions and difficulties in managing minor spectral variations in contiguous areas. To address these issues, we introduce the novel Dual-stage Spectral Supertoken Classifier (DSTC), inspired by superpixel concepts. DSTC employs spectrum-derivative-based pixel clustering to group pixels with similar spectral characteristics into spectral supertokens. By projecting the classification of these tokens onto the image space, we achieve pixel-level results that maintain regional classification consistency and precise boundary. Moreover, recognizing the diversity within tokens, we propose a class-proportion-based soft label. This label adaptively assigns weights to different categories based on their prevalence, effectively managing data distribution imbalances and enhancing classification performance. Comprehensive experiments on WHU-OHS, IP, KSC, and UP datasets corroborate the robust classification capabilities of DSTC and the effectiveness of its individual components. Code will be publicly available at https://github.com/laprf/DSTC.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Hyperspectral Image Classification | Indian Pines (test) | Overall Accuracy (OA)96.15 | 100 | |
| Hyperspectral Image Classification | KSC (test) | Average Accuracy98.02 | 32 | |
| Hyperspectral Classification | WHU-Hi Hanchuan (test) | Average Accuracy98.85 | 31 | |
| Semantic segmentation | Five-Billion-Pixels (Cross-Regional) | mIoU46.27 | 16 | |
| Semantic segmentation | LoveDA Cross-Style | mIoU47.78 | 16 | |
| Semantic segmentation | Potsdam&Vaihingen Cross Spectral Band | mIoU16.62 | 16 | |
| Semantic segmentation | OpenEarthMap (Cross-Continent) | mIoU46.49 | 16 | |
| Semantic segmentation | FLAIR Cross-Regional | mIoU53.42 | 16 | |
| Semantic segmentation | Five-Billion-Pixels Cross-sensor | mIoU29.79 | 16 | |
| Hyperspectral Image Classification | WHU-OHS | Overall Accuracy (OA)79.9 | 15 |