Point-to-Mask: From Arbitrary Point Annotations to Mask-Level Infrared Small Target Detection

About

Infrared small target detection (IRSTD) methods predominantly formulate the task as pixel-level segmentation, which requires costly dense annotations and is not well suited to tiny targets with weak texture and ambiguous boundaries. To address this issue, we propose Point-to-Mask, a framework that bridges low-cost point supervision and mask-level detection through two components: a Physics-driven Adaptive Mask Generation (PAMG) module that converts point annotations into compact target masks and geometric cues, and a lightweight Radius-aware Point Regression Network (RPR-Net) that reformulates IRSTD as target center localization and effective radius regression using spatiotemporal motion cues. The two modules form a closed loop: PAMG generates pseudo masks and geometric supervision during training, while the geometric predictions of RPR-Net are fed back to PAMG for pixel-level mask recovery during inference. To facilitate systematic evaluation, we further construct SIRSTD-Pixel, a sequential dataset with refined pixel-level annotations. Experiments show that the proposed framework achieves strong pseudo-label quality, high detection accuracy, and efficient inference, approaching full-supervision performance under point-supervised settings with substantially lower annotation cost. Code and datasets will be available at: https://github.com/GaoScience/point-to-mask.

Weihua Gao, Wenlong Niu, Jie Tang, Man Yang, Jiafeng Zhang, Xiaodong Peng• 2026

Related benchmarks

Task	Dataset	Result
Infrared Small Target Detection	SIRSTD-Pixel	AUC95.77	14
Pseudo-label generation quality	SIRSTD	mIoU88.34	9
Pseudo-label generation quality	NUDT-SIRST	mIoU75.11	9
Pseudo-label generation quality	IRSTD-1K	mIoU71.63	9

Showing 4 of 4 rows

Other info

Follow for update

@wizwand_team Discord