CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
About
State-of-the-art semantic segmentation methods were almost exclusively trained on images within a fixed resolution range. These segmentations are inaccurate for very high-resolution images since using bicubic upsampling of low-resolution segmentation does not adequately capture high-resolution details along object boundaries. In this paper, we propose a novel approach to address the high-resolution segmentation problem without using any high-resolution training data. The key insight is our CascadePSP network which refines and corrects local boundaries whenever possible. Although our network is trained with low-resolution segmentation data, our method is applicable to any resolution even for very high-resolution images larger than 4K. We present quantitative and qualitative studies on different datasets to show that CascadePSP can reveal pixel-accurate segmentation boundaries using our novel refinement module without any finetuning. Thus, our method can be regarded as class-agnostic. Finally, we demonstrate the application of our model to scene parsing in multi-class segmentation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | ADE20K (val) | mIoU43.83 | 2731 | |
| Semantic segmentation | INRIA Aerial Image Labeling (test) | Overall IoU69.4 | 33 | |
| Semantic segmentation | BIG dataset (test) | IoU93.93 | 24 | |
| Image Segmentation | BIG (test) | IoU93.93 | 20 | |
| Semantic segmentation | DeepGlobe (test) | mIoU68.5 | 20 | |
| Semantic segmentation | relabeled PASCAL VOC 2012 (test) | IoU92.86 | 16 | |
| Instance Matting | HIM2K Synthetic | IMQmad40.85 | 16 | |
| Instance Matting | HIM2K Natural | IMQmad64.58 | 16 | |
| Semantic segmentation | PASCAL VOC 2012 relabeled (val) | IoU92.86 | 8 | |
| Semantic segmentation | BIG 1.0 (val test) | IoU93.93 | 8 |