ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation

About

Training-free open-vocabulary semantic segmentation (OVS) aims to segment images given a set of arbitrary textual categories without costly model fine-tuning. Existing solutions often explore attention mechanisms of pre-trained models, such as CLIP, or generate synthetic data and design complex retrieval processes to perform OVS. However, their performance is limited by the capability of the models they rely on or by the suboptimal quality of their reference sets. In this work, we investigate the largely overlooked data quality problem for this challenging dense scene understanding task, and identify that a high-quality reference set can significantly benefit training-free OVS. With this observation, we introduce a data-quality-oriented framework, comprising a data pipeline to construct a reference set with well-paired segment-text embeddings and a simple similarity-based retrieval to unveil the essential effect of data. Remarkably, extensive evaluations on ten benchmark datasets demonstrate that our method outperforms all existing training-free OVS approaches, highlighting the importance of data-centric design for advancing OVS without training. Our code is available at https://github.com/xiweix/ReME.

Xiwei Xuan, Ziquan Deng, Kwan-Liu Ma • 2025
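
The abstract describes labeling segments by similarity-based retrieval against a reference set of paired segment-text embeddings. The sketch below illustrates that general idea only; it is not the authors' implementation (see their repository for that). The function names, the toy 2-D embeddings, the top-k choice, and the similarity-weighted vote are all assumptions made for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def label_segment(query, reference, candidate_labels, k=3):
    """Assign a label to one query segment embedding.

    reference: list of (embedding, label) pairs (hypothetical reference set).
    candidate_labels: the open vocabulary requested for this image.
    Only reference entries whose label is in the vocabulary are considered;
    the k most similar entries then cast similarity-weighted votes.
    """
    scored = [
        (cosine(query, emb), label)
        for emb, label in reference
        if label in candidate_labels
    ]
    if not scored:
        return None  # nothing in the reference set matches the vocabulary
    scored.sort(key=lambda pair: pair[0], reverse=True)
    votes = {}
    for sim, label in scored[:k]:
        votes[label] = votes.get(label, 0.0) + sim
    return max(votes, key=votes.get)


# Toy reference set with made-up 2-D embeddings (assumption, for illustration).
reference = [
    ([1.0, 0.0], "cat"),
    ([0.9, 0.1], "cat"),
    ([0.0, 1.0], "dog"),
    ([0.1, 0.9], "dog"),
]

print(label_segment([0.8, 0.2], reference, {"cat", "dog"}))
```

In a real system the embeddings would come from a pre-trained encoder and the search would typically be vectorized; the point here is only that, once the reference pairs are clean, the retrieval step itself can stay simple.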

Related benchmarks

| Task | Dataset | Result | Rank |
| --- | --- | --- | --- |
| Open Vocabulary Semantic Segmentation | COCOStuff (val) | mIoU 33.3 | 60 |
| Open Vocabulary Semantic Segmentation | Cityscapes (val) | mIoU 59 | 37 |
| Open Vocabulary Semantic Segmentation | PASCAL Context 59 (val) | mIoU 53.1 | 32 |
| Open-Vocabulary Segmentation | Pascal VOC 21 2012 (val) | mIoU 82.2 | 27 |
| Open-Vocabulary Segmentation | Pascal Context 60 (val) | mIoU 44.6 | 26 |
| Open-Vocabulary Segmentation | COCO-Object (COCO-O) (val) | mIoU 48.2 | 25 |
| Open-Vocabulary Segmentation | ADE20K (ADE) (val) | mIoU 28.2 | 25 |
| Open-Vocabulary Segmentation | Pascal VOC 20 2012 (val) | mIoU 93.2 | 23 |
| Open-Vocabulary Segmentation | Natural-scene (NS) benchmark suite V21, PC60, COCO-O, V20, PC59, COCO-S, City, ADE | V21 mIoU (with background) 82.2 | 18 |
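
The results above are reported as mIoU (mean intersection over union), shown as percentages. As a reminder of what that number measures, here is a minimal sketch of the metric over flattened label lists. The function name, the ignore label 255, and the toy labels are assumptions for illustration; benchmark toolkits usually accumulate intersections and unions over the whole dataset before averaging, and the paper's exact evaluation protocol is not given here.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over the classes that appear in pred or target.

    pred, target: equal-length sequences of integer class ids (flattened pixels).
    Pixels whose target is ignore_index or outside [0, num_classes) are skipped.
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if t == ignore_index or not (0 <= t < num_classes):
            continue
        if p == t:
            intersection[t] += 1
            union[t] += 1
        else:
            union[t] += 1  # missed pixel of the true class
            if 0 <= p < num_classes:
                union[p] += 1  # false positive for the predicted class
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else float("nan")


# Toy example: per-class IoU is 1/3, 2/3, and 1/2.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {100 * score:.1f}")
```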
