CoCo-SAM3: Harnessing Concept Conflict in Open-Vocabulary Semantic Segmentation

About

SAM3 advances open-vocabulary semantic segmentation by introducing a prompt-driven mask generation paradigm. However, in multi-class open-vocabulary scenarios, masks generated independently from different category prompts lack a unified and inter-class comparable evidence scale, often resulting in overlapping coverage and unstable competition. Moreover, synonymous expressions of the same concept tend to activate inconsistent semantic and spatial evidence, leading to intra-class drift that exacerbates inter-class conflicts and compromises overall inference stability. To address these issues, we propose CoCo-SAM3 (Concept-Conflict SAM3), which explicitly decouples inference into intra-class enhancement and inter-class competition. Our method first aligns and aggregates evidence from synonymous prompts to strengthen concept consistency. It then performs inter-class competition on a unified comparable scale, enabling direct pixel-wise comparisons among all candidate classes. This mechanism stabilizes multi-class inference and effectively mitigates inter-class conflicts. Without requiring any additional training, CoCo-SAM3 achieves consistent improvements across eight open-vocabulary semantic segmentation benchmarks.
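The two-stage inference described above can be illustrated with a minimal sketch. The abstract does not specify how evidence is aligned, aggregated, or normalized, so everything here is an assumption made for illustration: per-prompt evidence is taken to be a dense score map, synonyms are merged by a plain mean, and the "unified comparable scale" is approximated by a softmax across classes. The function names (`aggregate_synonyms`, `compete`, `segment`) are hypothetical and are not part of SAM3 or CoCo-SAM3.

```python
import numpy as np

def aggregate_synonyms(scores):
    """Intra-class step (assumed): average score maps over synonymous prompts.

    scores has shape (S, H, W), one map per synonym; returns shape (H, W).
    """
    return np.mean(scores, axis=0)

def compete(class_scores, temperature=1.0):
    """Inter-class step (assumed): softmax over the class axis, then argmax.

    class_scores has shape (C, H, W); returns labels (H, W) and probs (C, H, W).
    """
    logits = class_scores / temperature
    # Subtract the per-pixel maximum for numerical stability.
    logits = logits - logits.max(axis=0, keepdims=True)
    probs = np.exp(logits)
    probs /= probs.sum(axis=0, keepdims=True)
    return probs.argmax(axis=0), probs

def segment(prompt_scores):
    """prompt_scores maps a class name to an array of shape (S_c, H, W)."""
    names = list(prompt_scores)
    stacked = np.stack([aggregate_synonyms(prompt_scores[n]) for n in names])
    labels, probs = compete(stacked)
    return names, labels, probs

# Toy example: a 1x2 image, two synonyms for "cat", one prompt for "dog".
toy = {
    "cat": np.array([[[2.0, 0.0]], [[4.0, 0.0]]]),
    "dog": np.array([[[1.0, 1.0]]]),
}
names, labels, probs = segment(toy)
```

In this toy case the two "cat" maps average to 3.0 and 0.0, so the first pixel is assigned to "cat" and the second to "dog". Because every pixel receives exactly one label from a shared distribution, overlapping coverage between classes cannot occur in the output.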

Yanhui Chen, Baoyao Yang, Siqi Liu, Jingchao Wang • 2026

Related benchmarks

Task | Dataset | Result | Rank
Semantic segmentation | ADE20K | mIoU 38.3 | 567
Semantic segmentation | Cityscapes | mIoU 70.7 | 497
Semantic segmentation | PASCAL VOC with background category VOC21 2012 | mIoU 86.8 | 51
Semantic segmentation | Average Overall | mIoU 64.3 | 46
Semantic segmentation | Pascal Context 60 with background | mIoU 50.5 | 43
Semantic segmentation | Pascal VOC without background 2012 V20 | mIoU 95.2 | 42
Semantic segmentation | COCO-Object with background class | mIoU 67.9 | 34
Semantic segmentation | Pascal Context 59 (PC59) without background | mIoU 61.2 | 20
Semantic segmentation | COCO-S without background | mIoU 43.6 | 19