CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement

About

State-of-the-art semantic segmentation methods were almost exclusively trained on images within a fixed resolution range. These segmentations are inaccurate for very high-resolution images since using bicubic upsampling of low-resolution segmentation does not adequately capture high-resolution details along object boundaries. In this paper, we propose a novel approach to address the high-resolution segmentation problem without using any high-resolution training data. The key insight is our CascadePSP network which refines and corrects local boundaries whenever possible. Although our network is trained with low-resolution segmentation data, our method is applicable to any resolution even for very high-resolution images larger than 4K. We present quantitative and qualitative studies on different datasets to show that CascadePSP can reveal pixel-accurate segmentation boundaries using our novel refinement module without any finetuning. Thus, our method can be regarded as class-agnostic. Finally, we demonstrate the application of our model to scene parsing in multi-class segmentation.

Ho Kei Cheng, Jihoon Chung, Yu-Wing Tai, Chi-Keung Tang• 2020

Related benchmarks

Task	Dataset	Result
Semantic segmentation	ADE20K (val)	mIoU43.83	3089
Polyp Segmentation	CVC-ClinicDB	--	101
Polyp Segmentation	CVC-ColonDB	mIoU57.92	100
Polyp Segmentation	Kvasir-Seg	mIoU0.8632	54
Semantic segmentation	INRIA Aerial Image Labeling (test)	Overall IoU69.4	33
Semantic segmentation	BIG dataset (test)	IoU93.93	24
Image Segmentation	BIG (test)	IoU93.93	20
Semantic segmentation	DeepGlobe (test)	mIoU68.5	20
Semantic segmentation	relabeled PASCAL VOC 2012 (test)	IoU92.86	16
Instance Matting	HIM2K Synthetic	IMQmad40.85	16

Showing 10 of 16 rows

Other info

Code

Follow for update

@wizwand_team Discord