Lightweight Frequency Masker for Cross-Domain Few-Shot Semantic Segmentation
About
Cross-domain few-shot segmentation (CD-FSS) is proposed to first pre-train the model on a large-scale source-domain dataset, and then transfer the model to data-scarce target-domain datasets for pixel-level segmentation. The significant domain gap between the source and target datasets leads to a sharp decline in the performance of existing few-shot segmentation (FSS) methods in cross-domain scenarios. In this work, we discover an intriguing phenomenon: simply filtering different frequency components for target domains can lead to a significant performance improvement, sometimes even as high as 14% mIoU. Then, we delve into this phenomenon for an interpretation, and find such improvements stem from the reduced inter-channel correlation in feature maps, which benefits CD-FSS with enhanced robustness against domain gaps and larger activated regions for segmentation. Based on this, we propose a lightweight frequency masker, which further reduces channel correlations by an Amplitude-Phase Masker (APM) module and an Adaptive Channel Phase Attention (ACPA) module. Notably, APM introduces only 0.01% additional parameters but improves the average performance by over 10%, and ACPA imports only 2.5% parameters but further improves the performance by over 1.5%, which significantly surpasses the state-of-the-art CD-FSS methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | LoveDA | mIoU45.04 | 192 | |
| Semantic segmentation | iSAID | -- | 146 | |
| Few-shot Segmentation | DeepGlobe | mIoU44.9 | 83 | |
| Few-shot Segmentation | Chest X-ray | mIoU82.8 | 82 | |
| Medical Image Segmentation | ISIC | DICE39.85 | 79 | |
| Few-shot Semantic Segmentation | Average Deepglobe, ISIC, Chest X-Ray, FSS-1000 | mIoU65.2 | 54 | |
| Few-shot Segmentation | ISIC 2018 | mIoU51.2 | 51 | |
| Few-shot Semantic Segmentation | CD-FSS 1-shot 1.0 (test) | mIoU (Average)60 | 34 | |
| Semantic segmentation | FSS-1000 1-shot | mIoU79.3 | 32 | |
| Semantic segmentation | FSS-1000 5-shot | mIoU81.9 | 29 |