Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

NuNext: Reframing Nucleus Detection as Next-Point Detection

About

Nucleus detection in histopathology is pivotal for a wide range of clinical applications. Existing approaches either regress nuclear proxy maps that require complex post-processing, or employ dense anchors or queries that introduce severe foreground-background imbalance. In this work, we reformulate nucleus detection as next-point prediction, wherein a multimodal large language model is developed to directly output foreground nucleus centroids from the input image. The model is trained in two stages. In the supervised learning stage, we propose spatial-aware soft supervision to relax strict centroid matching and a chain-of-visual-thought strategy to incorporate visual priors that facilitate coordinate prediction. In the reinforcement fine-tuning stage, we design distribution matching reward, low-variance group filtering, and fine-grained advantage shaping to further improve the model's detection quality. Extensive experiments on nine widely used benchmarks demonstrate the superiority of our method. Code will be released soon.

Zhongyi Shui, Honglin Li, Xiaozhong Ji, Ye Zhang, Zijiang Yang, Chenglu Zhu, Yuxuan Sun, Kai Yao, Conghui He, Cheng Tan• 2026

Related benchmarks

TaskDatasetResultRank
Nuclear Instance SegmentationCPM 17
AJI72.3
33
Nuclear Instance SegmentationCoNSeP
AJI58.2
32
Nuclear Instance SegmentationKumar
AJI65.4
24
Nuclei SegmentationGLySAC
AJI0.599
20
Nuclear Instance SegmentationTNBC
AJI69.8
18
Nucleus Instance SegmentationCPM-15
AJI0.656
10
Nucleus Instance SegmentationBRCA-M2C
AJI0.713
10
Nucleus Instance SegmentationCryoNuSeg
AJI0.523
10
Nucleus Instance SegmentationPanNuke
Adrenal bPQ74.17
7
Showing 9 of 9 rows

Other info

Follow for update