Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Training-Free Class Purification for Open-Vocabulary Semantic Segmentation

About

Fine-tuning pre-trained vision-language models has emerged as a powerful approach for enhancing open-vocabulary semantic segmentation (OVSS). However, the substantial computational and resource demands associated with training on large datasets have prompted interest in training-free methods for OVSS. Existing training-free approaches primarily focus on modifying model architectures and generating prototypes to improve segmentation performance. However, they often neglect the challenges posed by class redundancy, where multiple categories are not present in the current test image, and visual-language ambiguity, where semantic similarities among categories create confusion in class activation. These issues can lead to suboptimal class activation maps and affinity-refined activation maps. Motivated by these observations, we propose FreeCP, a novel training-free class purification framework designed to address these challenges. FreeCP focuses on purifying semantic categories and rectifying errors caused by redundancy and ambiguity. The purified class representations are then leveraged to produce final segmentation predictions. We conduct extensive experiments across eight benchmarks to validate FreeCP's effectiveness. Results demonstrate that FreeCP, as a plug-and-play module, significantly boosts segmentation performance when combined with other OVSS methods.

Qi Chen, Lingxiao Yang, Yun Chen, Nailong Zhao, Jianhuang Lai, Jie Shao, Xiaohua Xie• 2025

Related benchmarks

TaskDatasetResultRank
Open Vocabulary Semantic SegmentationADE20K without background
mIoU18.4
72
Open Vocabulary Semantic SegmentationCOCO Stuff without background
mIoU24.9
71
Open Vocabulary Semantic SegmentationPASCAL Context Context60 with background
mIoU35.3
69
Open Vocabulary Semantic SegmentationCOCO Object with background
mIoU37.2
68
Open Vocabulary Semantic SegmentationPASCAL Context 59 without background
mIoU38
67
Open Vocabulary Semantic SegmentationCityscapes without background
mIoU33.3
67
Open Vocabulary Semantic SegmentationPascal VOC 20 With Background
mIoU84.3
21
Open Vocabulary Semantic SegmentationPascal VOC 21 (With Background)
mIoU65.8
20
Showing 8 of 8 rows

Other info

Follow for update