
Open-Vocabulary Segmentation with Semantic-Assisted Calibration

About

This paper studies open-vocabulary segmentation (OVS) by calibrating the in-vocabulary and domain-biased embedding space with the generalized contextual prior of CLIP. As the core of open-vocabulary understanding, the alignment of visual content with the semantics of unbounded text has become the bottleneck of this field. To address this challenge, recent works use CLIP as an additional classifier and aggregate model predictions with CLIP classification results. Despite their remarkable progress, the performance of OVS methods in relevant scenarios is still unsatisfactory compared with supervised counterparts. We attribute this to the in-vocabulary embedding and the domain-biased CLIP prediction. To this end, we present a Semantic-assisted CAlibration Network (SCAN). In SCAN, we incorporate the generalized semantic prior of CLIP into the proposal embedding to avoid collapsing onto known categories. In addition, a contextual shift strategy is applied to mitigate the lack of global context and unnatural background noise. With the above designs, SCAN achieves state-of-the-art performance on all popular open-vocabulary segmentation benchmarks. Furthermore, we address the problem that the existing evaluation system ignores semantic duplication across categories, and propose a new metric called Semantic-Guided IoU (SG-IoU).
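The abstract mentions that recent works aggregate model predictions with CLIP classification results, without saying how. One plausible form of such aggregation is a weighted geometric mean of the two per-class probability vectors. The sketch below illustrates that idea only; the function name `ensemble_scores`, the `weight` parameter, and the choice of a geometric mean are assumptions for illustration, not something the source confirms for SCAN or for the works it refers to.

```python
from typing import List


def ensemble_scores(
    model_probs: List[float],
    clip_probs: List[float],
    weight: float = 0.5,
) -> List[float]:
    """Fuse two per-class probability vectors with a weighted geometric mean.

    `weight` is a hypothetical mixing coefficient: 0.0 keeps only the
    segmentation model's scores, 1.0 keeps only the CLIP scores.
    """
    if len(model_probs) != len(clip_probs):
        raise ValueError("both score vectors must cover the same classes")

    # Geometric interpolation per class: m^(1 - w) * c^w
    fused = [
        (m ** (1.0 - weight)) * (c ** weight)
        for m, c in zip(model_probs, clip_probs)
    ]

    # Renormalize so the fused scores sum to 1 again.
    total = sum(fused)
    return [f / total for f in fused] if total > 0 else fused


if __name__ == "__main__":
    # The model is undecided between two classes; CLIP prefers the first one.
    print(ensemble_scores([0.5, 0.5], [0.9, 0.1], weight=0.5))
```

A geometric mean lets a confident classifier pull an undecided one toward its answer, while a class that either classifier scores near zero stays near zero after fusion.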

Yong Liu, Sule Bai, Guanbin Li, Yitong Wang, Yansong Tang • 2023

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Semantic segmentation | PASCAL VOC (val) | mIoU 97.2 | 362 |
| Semantic segmentation | ADE20K A-150 | mIoU 33.5 | 217 |
| Semantic segmentation | Pascal Context 59 | mIoU 59.3 | 204 |
| Semantic segmentation | LoveDA | mIoU 23.2 | 166 |
| Semantic segmentation | PC-59 | mIoU 59.3 | 148 |
| Semantic segmentation | Vaihingen | mIoU 15.23 | 140 |
| Semantic segmentation | PASCAL-Context 59 class (val) | mIoU 59.3 | 125 |
| Semantic segmentation | iSAID | mIoU 64.28 | 122 |
| Semantic segmentation | ADE20K 847 | mIoU 1.40e+3 | 105 |
| Open Vocabulary Semantic Segmentation | Pascal VOC 20 | mIoU 97 | 104 |
Showing 10 of 46 rows
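
The results above are reported as mIoU (mean intersection over union). As a reminder of how this metric is typically computed, here is a minimal sketch in plain Python. The function name `mean_iou`, the flat lists of per-pixel labels, and the `ignore_index=255` convention are illustrative assumptions; this is not the evaluation code behind the listed numbers, and it does not implement the paper's SG-IoU.

```python
from typing import Sequence


def mean_iou(
    pred: Sequence[int],
    target: Sequence[int],
    num_classes: int,
    ignore_index: int = 255,
) -> float:
    """Return mIoU in percent over classes present in pred or target."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    intersection = [0] * num_classes
    pred_count = [0] * num_classes
    target_count = [0] * num_classes

    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # skip pixels without a valid ground-truth label
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            intersection[p] += 1

    ious = []
    for c in range(num_classes):
        union = pred_count[c] + target_count[c] - intersection[c]
        if union > 0:  # classes absent from both pred and target are skipped
            ious.append(intersection[c] / union)

    return 100.0 * sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    # Class 0: IoU = 1/2, class 1: IoU = 2/3, so mIoU is about 58.33.
    print(round(mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2), 2))
```

Benchmark toolkits usually accumulate the intersection and union counts over the whole dataset before dividing, rather than averaging per-image scores; the same function covers that case if the labels of all images are concatenated first.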
