Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement

About

Traditional methods for reasoning segmentation rely on supervised fine-tuning with categorical labels and simple descriptions, limiting its out-of-domain generalization and lacking explicit reasoning processes. To address these limitations, we propose Seg-Zero, a novel framework that demonstrates remarkable generalizability and derives explicit chain-of-thought reasoning through cognitive reinforcement. Seg-Zero introduces a decoupled architecture consisting of a reasoning model and a segmentation model. The reasoning model interprets user intentions, generates explicit reasoning chains, and produces positional prompts, which are subsequently used by the segmentation model to generate precious pixel-level masks. We design a sophisticated reward mechanism that integrates both format and accuracy rewards to effectively guide optimization directions. Trained exclusively via reinforcement learning with GRPO and without explicit reasoning data, Seg-Zero achieves robust zero-shot generalization and exhibits emergent test-time reasoning capabilities. Experiments show that Seg-Zero-7B achieves a zero-shot performance of 57.5 on the ReasonSeg benchmark, surpassing the prior LISA-7B by 18\%. This significant improvement highlights Seg-Zero's ability to generalize across domains while presenting an explicit reasoning process. Code is available at https://github.com/dvlab-research/Seg-Zero.

Yuqi Liu, Bohao Peng, Zhisheng Zhong, Zihao Yue, Fanbin Lu, Bei Yu, Jiaya Jia• 2025

Related benchmarks

TaskDatasetResultRank
Referring Expression SegmentationRefCOCO (testA)
cIoU80.3
217
Referring Expression SegmentationRefCOCO+ (testA)
cIoU76.2
190
Reasoning SegmentationReasonSeg (val)
cIoU62
145
Referring Expression SegmentationRefCOCOg (val)
cIoU72.6
107
Reasoning SegmentationReasonSeg (test)
gIoU61.41
102
Referring Expression SegmentationRefCOCOg (test)--
78
Generalized Referring Expression SegmentationgRefCOCO v1 (val)
cIoU65.9
33
Medical Reasoning GroundingU-MRG-14K (test)
IoU (General)16.14
16
Reasoning SegmentationEarthReason (val)
gIoU63
15
Reasoning SegmentationEarthReason (test)
gIoU63.16
15
Showing 10 of 33 rows

Other info

Follow for update