Understanding Multi-Granularity for Open-Vocabulary Part Segmentation
About
Open-vocabulary part segmentation (OVPS) is an emerging research area focused on segmenting fine-grained entities using diverse and previously unseen vocabularies. Our study highlights the inherent complexities of part segmentation due to intricate boundaries and diverse granularity, reflecting the knowledge-based nature of part identification. To address these challenges, we propose PartCLIPSeg, a novel framework utilizing generalized parts and object-level contexts to mitigate the lack of generalization in fine-grained parts. PartCLIPSeg integrates competitive part relationships and attention control, alleviating ambiguous boundaries and underrepresented parts. Experimental results demonstrate that PartCLIPSeg outperforms existing state-of-the-art OVPS methods, offering refined segmentation and an advanced understanding of part relationships within images. Through extensive experiments, our model demonstrated a significant improvement over the state-of-the-art models on the Pascal-Part-116, ADE20K-Part-234, and PartImageNet datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Part Segmentation | Pascal-Part-116 (test) | mIoU (Unseen)31.67 | 18 | |
| Open-Vocabulary Part Segmentation | Pascal-Part-116 zero-shot | mIoU (Seen)50.02 | 13 | |
| Part Segmentation | PartImageNet | Seen56.26 | 12 | |
| Part Segmentation | ADE20K Part-234 | Seen Performance0.3837 | 11 | |
| Open-Vocabulary Part Segmentation | ADE20K Part zero-shot 234 | Seen Recall53.31 | 10 | |
| Part Segmentation | PartImageNet OOD (test) | mIoU (Unseen)59.16 | 8 | |
| Open-Vocabulary Part Segmentation | Pascal-Part-116 (test) | Seen Recall58.97 | 5 | |
| Open-Vocabulary Part Segmentation | Pascal-Part zero-shot 116 | Seen Recall58.46 | 5 | |
| Part Segmentation | Pascal-Part 116 | Seen Boundary IoU36.15 | 4 | |
| Zero-shot Part Segmentation | Pascal-Part-116 cross-dataset (trained on PartImageNet) (test) | mIoU (Pred-All)14.74 | 2 |