Prompt Group-Aware Training for Robust Text-Guided Nuclei Segmentation

About

Foundation models such as Segment Anything Model 3 (SAM3) enable flexible text-guided medical image segmentation, yet their predictions remain highly sensitive to prompt formulation. Even semantically equivalent descriptions can yield inconsistent masks, limiting reliability in clinical and pathology workflows. We reformulate prompt sensitivity as a group-wise consistency problem. Semantically related prompts are organized into \emph{prompt groups} sharing the same ground-truth mask, and a prompt group-aware training framework is introduced for robust text-guided nuclei segmentation. The approach combines (i) a quality-guided group regularization that leverages segmentation loss as an implicit ranking signal, and (ii) a logit-level consistency constraint with a stop-gradient strategy to align predictions within each group. The method requires no architectural modification and leaves inference unchanged. Extensive experiments on multi-dataset nuclei benchmarks show consistent gains under textual prompting and markedly reduced performance variance across prompt quality levels. On six zero-shot cross-dataset tasks, our method improves Dice by an average of 2.16 points. These results demonstrate improved robustness and generalization for vision-language segmentation in computational pathology.

Yonghuang Wu, Zhenyang Liang, Wenwen Zeng, Xuan Xie, Jinhua Yu• 2026

Related benchmarks

Task	Dataset	Result
Nuclear Instance Segmentation	CPM 17	--	48
Nuclei Instance Segmentation	Kumar 1.0 (All)	Dice84.78	15
Nuclei Segmentation	PanNuke	Dice (All Nuclei)79.42	11
Nuclei Segmentation	CoNSeP	Dice (All Nuclei)76.81	9
all-nuclei segmentation	Histology	Dice Score74.61	4
all-nuclei segmentation	CPM15	Dice Score79.56	4
all-nuclei segmentation	CryoNuSeg	Dice Score77.58	4

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord