Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Exclusivity-Guided Mask Learning for Semi-Supervised Crowd Instance Segmentation and Counting

About

Semi-supervised crowd analysis is a prominent area of research, as unlabeled data are typically abundant and inexpensive to obtain. However, traditional point-based annotations constrain performance because individual regions are inherently ambiguous, and consequently, learning fine-grained structural semantics from sparse anno tations remains an unresolved challenge. In this paper, we first propose an Exclusion-Constrained Dual-Prompt SAM (EDP-SAM), based on our Nearest Neighbor Exclusion Circle (NNEC) constraint, to generate mask supervision for current datasets. With the aim of segmenting individuals in dense scenes, we then propose Exclusivity-Guided Mask Learning (XMask), which enforces spatial separation through a discriminative mask objective. Gaussian smoothing and a differentiable center sampling strategy are utilized to improve feature continuity and training stability. Building on XMask, we present a semi-supervised crowd counting framework that uses instance mask priors as pseudo-labels, which contain richer shape information than traditional point cues. Extensive experiments on the ShanghaiTech A, UCF-QNRF, and JHU++ datasets (using 5%, 10%, and 40% labeled data) verify that our end-to-end model achieves state-of-the-art semi-supervised segmentation and counting performance, effectively bridging the gap between counting and instance segmentation within a unified framework.

Jiyang Huang, Hongru Cheng, Wei Lin, Jia Wan, Antoni B. Chan• 2026

Related benchmarks

TaskDatasetResultRank
Crowd CountingShanghaiTech Part A (test)
MAE56.2
271
Crowd CountingUCF-QNRF (test)
MAE84
113
Crowd CountingJHU-CROWD++ (test)
MAE61.9
57
Crowd Instance SegmentationShTech A (5% labeled)
IoU@0.531.4
5
Crowd Instance SegmentationShTech A (10% labeled)
mIoU@0.531.8
5
Crowd Instance SegmentationShTech A (40% labeled)
IoU@0.533.3
5
Crowd Instance SegmentationUCF-QNRF (5% labeled)
IoU@0.528.2
5
Crowd Instance SegmentationUCF-QNRF (10% labeled)
IoU@0.529
5
Crowd Instance SegmentationUCF-QNRF (40% labeled)
IoU@0.529.7
5
Crowd Instance SegmentationJHU++ (5% labeled)
IoU@0.525.8
5
Showing 10 of 12 rows

Other info

Follow for update