Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation

About

We propose a training-free method for open-vocabulary semantic segmentation using Vision-and-Language Models (VLMs). Our approach enhances the initial per-patch predictions of VLMs through label propagation, which jointly optimizes predictions by incorporating patch-to-patch relationships. Since VLMs are primarily optimized for cross-modal alignment and not for intra-modal similarity, we use a Vision Model (VM) that is observed to better capture these relationships. We address resolution limitations inherent to patch-based encoders by applying label propagation at the pixel level as a refinement step, significantly improving segmentation accuracy near class boundaries. Our method, called LPOSS+, performs inference over the entire image, avoiding window-based processing and thereby capturing contextual interactions across the full image. LPOSS+ achieves state-of-the-art performance among training-free methods, across a diverse set of datasets. Code: https://github.com/vladan-stojnic/LPOSS

Vladan Stojni\'c, Yannis Kalantidis, Ji\v{r}\'i Matas, Giorgos Tolias• 2025

Related benchmarks

TaskDatasetResultRank
Open Vocabulary Semantic SegmentationPascal VOC 20
mIoU89.6
104
Open Vocabulary Semantic SegmentationPascal Context PC-59
mIoU35.2
89
Open Vocabulary Semantic SegmentationADE20K without background
mIoU21.8
72
Open Vocabulary Semantic SegmentationCOCO Stuff without background
mIoU25.9
71
Open Vocabulary Semantic SegmentationPASCAL Context Context60 with background
mIoU34.6
69
Open Vocabulary Semantic SegmentationCOCO Object with background
mIoU33.4
68
Open Vocabulary Semantic SegmentationCityscapes without background
mIoU37.3
67
Open Vocabulary Semantic SegmentationPASCAL Context 59 without background
mIoU37.8
67
Open Vocabulary Semantic SegmentationCOCOStuff (val)
mIoU25.9
60
Open Vocabulary Semantic SegmentationPASCAL Context 59 (val)
mIoU37.8
49
Showing 10 of 23 rows

Other info

Follow for update