Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Prompt Group-Aware Training for Robust Text-Guided Nuclei Segmentation

About

Foundation models such as Segment Anything Model 3 (SAM3) enable flexible text-guided medical image segmentation, yet their predictions remain highly sensitive to prompt formulation. Even semantically equivalent descriptions can yield inconsistent masks, limiting reliability in clinical and pathology workflows. We reformulate prompt sensitivity as a group-wise consistency problem. Semantically related prompts are organized into \emph{prompt groups} sharing the same ground-truth mask, and a prompt group-aware training framework is introduced for robust text-guided nuclei segmentation. The approach combines (i) a quality-guided group regularization that leverages segmentation loss as an implicit ranking signal, and (ii) a logit-level consistency constraint with a stop-gradient strategy to align predictions within each group. The method requires no architectural modification and leaves inference unchanged. Extensive experiments on multi-dataset nuclei benchmarks show consistent gains under textual prompting and markedly reduced performance variance across prompt quality levels. On six zero-shot cross-dataset tasks, our method improves Dice by an average of 2.16 points. These results demonstrate improved robustness and generalization for vision-language segmentation in computational pathology.

Yonghuang Wu, Zhenyang Liang, Wenwen Zeng, Xuan Xie, Jinhua Yu• 2026

Related benchmarks

TaskDatasetResultRank
Nuclear Instance SegmentationCPM 17--
33
Nuclei Instance SegmentationKumar 1.0 (All)
Dice84.78
15
Nuclei SegmentationPanNuke
Dice (All Nuclei)79.42
11
Nuclei SegmentationCoNSeP
Dice (All Nuclei)76.81
9
all-nuclei segmentationHistology
Dice Score74.61
4
all-nuclei segmentationCPM15
Dice Score79.56
4
all-nuclei segmentationCryoNuSeg
Dice Score77.58
4
Showing 7 of 7 rows

Other info

Follow for update