
Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation

About

Learning semantic segmentation from weakly-labeled (e.g., image tags only) data is challenging since it is hard to infer dense object regions from sparse semantic tags. Despite being broadly studied, most current efforts learn directly from the limited semantic annotations carried by individual images or image pairs, and struggle to obtain integral localization maps. Our work alleviates this from a novel perspective, by exploring rich semantic contexts synergistically among abundant weakly-labeled training data for network learning and inference. In particular, we propose regional semantic contrast and aggregation (RCA). RCA is equipped with a regional memory bank that stores massive, diverse object patterns appearing in the training data, which acts as strong support for exploring dataset-level semantic structure. Specifically, we propose i) semantic contrast to drive network learning by contrasting massive categorical object regions, leading to a more holistic understanding of object patterns, and ii) semantic aggregation to gather diverse relational contexts in the memory to enrich semantic representations. In this manner, RCA gains a strong capability for fine-grained semantic understanding, and eventually establishes new state-of-the-art results on two popular benchmarks, i.e., PASCAL VOC 2012 and COCO 2014.
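
The two components named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the InfoNCE-style form of the contrast loss, the dot-product attention used for aggregation, the temperature value, and all function and variable names (`region_contrast_loss`, `aggregate_context`, `memory`, `memory_labels`) are assumptions chosen for illustration only.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-12):
    # Scale vectors to unit length so dot products become cosine similarities.
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)

def region_contrast_loss(region, label, memory, memory_labels, temperature=0.1):
    """Assumed InfoNCE-style loss: memory entries sharing `label` are positives,
    all other entries are negatives. Requires at least one positive entry."""
    q = l2_normalize(np.asarray(region, dtype=float))
    m = l2_normalize(np.asarray(memory, dtype=float))
    logits = m @ q / temperature            # similarity to every memory entry, shape (N,)
    logits = logits - logits.max()          # subtract the max for numerical stability
    exp = np.exp(logits)
    positives = exp[np.asarray(memory_labels) == label].sum()
    return float(-np.log(positives / exp.sum()))

def aggregate_context(pixel_feats, memory):
    """Assumed attention-style aggregation: each pixel feature attends over the
    memory entries, and the attended context is concatenated to the feature."""
    pixel_feats = np.asarray(pixel_feats, dtype=float)
    memory = np.asarray(memory, dtype=float)
    scores = pixel_feats @ memory.T / np.sqrt(pixel_feats.shape[1])   # (P, N)
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)            # softmax over memory
    context = weights @ memory                                        # (P, D)
    return np.concatenate([pixel_feats, context], axis=1)             # (P, 2D)

# Toy memory bank: two entries for class 0, two for class 1 (hypothetical data).
memory = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
memory_labels = np.array([0, 0, 1, 1])
region = np.array([1.0, 0.0])

loss_same_class = region_contrast_loss(region, 0, memory, memory_labels)
loss_other_class = region_contrast_loss(region, 1, memory, memory_labels)
enriched = aggregate_context(np.ones((3, 2)), memory)
```

In this toy setup the loss is small when the region embedding resembles memory entries of its own class and large otherwise, which is the behaviour a contrastive objective is meant to encourage; the aggregated features have twice the original channel width.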

Tianfei Zhou, Meijie Zhang, Fang Zhao, Jianwu Li • 2022

Related benchmarks

Task | Dataset | Result | Rank
Semantic segmentation | PASCAL VOC 2012 (val) | Mean IoU 72.2 | 2040
Semantic segmentation | PASCAL VOC 2012 (test) | mIoU 72.8 | 1342
Semantic segmentation | PASCAL VOC (val) | mIoU 72.2 | 338
Semantic segmentation | COCO 2014 (val) | mIoU 36.8 | 251
Semantic segmentation | Pascal VOC (test) | mIoU 72.8 | 236
Weakly supervised semantic segmentation | PASCAL VOC 2012 (test) | mIoU 72.8 | 158
Weakly supervised semantic segmentation | PASCAL VOC 2012 (val) | mIoU 72.2 | 154
Semantic segmentation | COCO (val) | mIoU 36.8 | 135
Semantic segmentation | VOC 2012 (val) | mIoU 72.2 | 67
Weakly supervised semantic segmentation | PASCAL VOC 2012 (train) | mIoU (Mask) 74.1 | 53
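
For readers unfamiliar with the metric reported in the table, mIoU (mean intersection over union) is commonly computed from a class confusion matrix. The sketch below is a generic illustration, not the evaluation code behind these numbers; it assumes integer labels in the range [0, num_classes) and omits the ignore label that benchmark protocols such as PASCAL VOC use. The function name `mean_iou` is hypothetical.

```python
import numpy as np

def mean_iou(pred, target, num_classes):
    """Average per-class IoU over classes that appear in pred or target."""
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    # Confusion matrix: rows are ground-truth classes, columns are predictions.
    conf = np.bincount(
        target * num_classes + pred, minlength=num_classes ** 2
    ).reshape(num_classes, num_classes)
    intersection = np.diag(conf)
    union = conf.sum(axis=0) + conf.sum(axis=1) - intersection
    valid = union > 0                      # skip classes absent from both maps
    return float((intersection[valid] / union[valid]).mean())

# Four pixels, two classes: class 0 has IoU 1/2, class 1 has IoU 2/3.
score = mean_iou(pred=[0, 0, 1, 1], target=[0, 1, 1, 1], num_classes=2)
```

The table reports this quantity as a percentage, so a value of 72.2 corresponds to a mean IoU of 0.722.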
