Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation

About

Recently, open-vocabulary learning has emerged to accomplish segmentation for arbitrary categories of text-based descriptions, which popularizes the segmentation system to more general-purpose application scenarios. However, existing methods devote to designing specialized architectures or parameters for specific segmentation tasks. These customized design paradigms lead to fragmentation between various segmentation tasks, thus hindering the uniformity of segmentation models. Hence in this paper, we propose FreeSeg, a generic framework to accomplish Unified, Universal and Open-Vocabulary Image Segmentation. FreeSeg optimizes an all-in-one network via one-shot training and employs the same architecture and parameters to handle diverse segmentation tasks seamlessly in the inference procedure. Additionally, adaptive prompt learning facilitates the unified model to capture task-aware and category-sensitive concepts, improving model robustness in multi-task and varied scenarios. Extensive experimental results demonstrate that FreeSeg establishes new state-of-the-art results in performance and generalization on three segmentation tasks, which outperforms the best task-specific architectures by a large margin: 5.5% mIoU on semantic segmentation, 17.6% mAP on instance segmentation, 20.1% PQ on panoptic segmentation for the unseen class on COCO.

Jie Qin, Jie Wu, Pengxiang Yan, Ming Li, Ren Yuxi, Xuefeng Xiao, Yitong Wang, Rui Wang, Shilei Wen, Xin Pan, Xingang Wang• 2023

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K (val)
mIoU17.9
2731
Semantic segmentationADE20K--
936
Instance SegmentationCOCO (val)
APmk22.8
472
Semantic segmentationPascal VOC (test)
mIoU91.9
236
Panoptic SegmentationCOCO (val)
PQ16.5
219
Semantic segmentationCoco-Stuff (test)--
184
Panoptic SegmentationCOCO 2017 (val)
PQ21.7
172
Semantic segmentationPASCAL-Context 59 class (val)
mIoU34.4
125
Semantic segmentationCOCO
mIoU21.7
96
Panoptic SegmentationADE20K (val)
PQ16.3
89
Showing 10 of 41 rows

Other info

Code

Follow for update