Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Semantic Alignment in Hyperbolic Space for Open-Vocabulary Semantic Segmentation

About

Open-vocabulary semantic segmentation requires adapting image-level vision-language models such as CLIP to dense pixel-level prediction, which is challenging due to the mismatch between hierarchical structure and semantic alignment in the embedding space. While recent works leverage hyperbolic geometry to model hierarchical relationships, they align embeddings across hierarchical levels but overlook semantic misalignment among embeddings within the same level. In this work, we propose HyRo, a hyperbolic fine-tuning framework that decouples hierarchical and semantic alignment in the Poincar\'e ball model. HyRo aligns hierarchical levels by adjusting the hyperbolic radius and refines semantic relationships through angular alignment using an orthogonal transformation that theoretically preserves the hyperbolic radius. Experiments on standard open-vocabulary semantic segmentation benchmarks demonstrate that HyRo achieves state-of-the-art performance over prior methods.

Hoang M. Truong, Hai Nguyen-Truong, Dang Huynh• 2026

Related benchmarks

TaskDatasetResultRank
Open Vocabulary Semantic SegmentationPascal Context PC-59
mIoU57.3
99
Open Vocabulary Semantic SegmentationADE20K A-150
mIoU31.2
79
Open Vocabulary Semantic SegmentationPASCAL-Context PC-459
mIoU18.9
31
Open Vocabulary Semantic SegmentationADE20K A-847
mIoU12
23
Open Vocabulary Semantic SegmentationPASCAL-VOC PAS-20b
mIoU76.7
12
Open Vocabulary Semantic SegmentationPASCAL VOC PAS-20
mIoU95
10
Showing 6 of 6 rows

Other info

Follow for update