A Strong Baseline for Generalized Few-Shot Semantic Segmentation
About
This paper introduces a generalized few-shot segmentation framework with a straightforward training process and an easy-to-optimize inference phase. In particular, we propose a simple yet effective model based on the well-known InfoMax principle, where the Mutual Information (MI) between the learned feature representations and their corresponding predictions is maximized. In addition, the terms derived from our MI-based formulation are coupled with a knowledge distillation term to retain the knowledge on base classes. With a simple training process, our inference model can be applied on top of any segmentation network trained on base classes. The proposed inference yields substantial improvements on the popular few-shot segmentation benchmarks, PASCAL-$5^i$ and COCO-$20^i$. Particularly, for novel classes, the improvement gains range from 7% to 26% (PASCAL-$5^i$) and from 3% to 12% (COCO-$20^i$) in the 1-shot and 5-shot scenarios, respectively. Furthermore, we propose a more challenging setting, where performance gaps are further exacerbated. Our code is publicly available at https://github.com/sinahmr/DIaM.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Few-shot Semantic Segmentation | PASCAL-5^i 1-shot | mIoU53 | 37 | |
| Generalized Few-Shot Semantic Segmentation | PASCAL-5^i (test) | Base mIoU71.13 | 34 | |
| Generalized Few-Shot Semantic Segmentation | COCO 20^i (test) | Base mIoU0.4863 | 19 | |
| Generalized Few-Shot Segmentation | COCO 20 1-shot i | Base mIoU48.28 | 9 | |
| Generalized Few-Shot Semantic Segmentation | PASCAL-10^i (novel) | Base mIoU70.26 | 6 | |
| Generalized Few-Shot Segmentation | PASCAL-5i | Base Accuracy70.89 | 6 | |
| Generalized Few-Shot Segmentation | COCO-20i | Base Score48.37 | 6 |