S2M-Net: Spectral-Spatial Mixing for Medical Image Segmentation with Morphology-Aware Adaptive Loss
About
Medical image segmentation requires balancing local precision for boundary-critical clinical applications, global context for anatomical coherence, and computational efficiency for deployment on limited data and hardware a trilemma that existing architectures fail to resolve. Although convolutional networks provide local precision at $\mathcal{O}(n)$ cost but limited receptive fields, vision transformers achieve global context through $\mathcal{O}(n^2)$ self-attention at prohibitive computational expense, causing overfitting on small clinical datasets. We propose S2M-Net, a 4.7M-parameter architecture that achieves $\mathcal{O}(HW \log HW)$ global context through two synergistic innovations: (i) Spectral-Selective Token Mixer (SSTM), which exploits the spectral concentration of medical images via truncated 2D FFT with learnable frequency filtering and content-gated spatial projection, avoiding quadratic attention cost while maintaining global receptive fields; and (ii) Morphology-Aware Adaptive Segmentation Loss (MASL), which automatically analyzes structure characteristics (compactness, tubularity, irregularity, scale) to modulate five complementary loss components through constrained learnable weights, eliminating manual per-dataset tuning. Comprehensive evaluation in 16 medical imaging datasets that span 8 modalities demonstrates state-of-the-art performance: 96.12\% Dice on polyp segmentation, 83.77\% on surgical instruments (+17.85\% over the prior art) and 80.90\% on brain tumors, with consistent 3-18\% improvements over specialized baselines while using 3.5--6$\times$ fewer parameters than transformer-based methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Medical Image Segmentation | BUSI (test) | Dice84.34 | 121 | |
| Medical Image Segmentation | Kvasir-SEG (test) | -- | 78 | |
| Medical Image Segmentation | CVC-ClinicDB (test) | Dice95.43 | 60 | |
| Medical Image Segmentation | ISIC 2018 (test) | Dice Score89.99 | 57 | |
| Medical Image Segmentation | CVC-ColonDB (test) | Dice Score0.9075 | 50 | |
| Medical Image Segmentation | GlaS (test) | Dice Score93.54 | 44 | |
| Segmentation | STARE (test) | Soft Dice81.45 | 31 | |
| Medical Image Segmentation | DRIVE (test) | Dice Score83.2 | 26 | |
| Medical Image Segmentation | BraTS 2020 (test) | Dice Score80.9 | 18 | |
| Medical Image Segmentation | PH2 (test) | Dice Score94.45 | 18 |