Point-to-Mask: From Arbitrary Point Annotations to Mask-Level Infrared Small Target Detection
About
Infrared small target detection (IRSTD) methods predominantly formulate the task as pixel-level segmentation, which requires costly dense annotations and is not well suited to tiny targets with weak texture and ambiguous boundaries. To address this issue, we propose Point-to-Mask, a framework that bridges low-cost point supervision and mask-level detection through two components: a Physics-driven Adaptive Mask Generation (PAMG) module that converts point annotations into compact target masks and geometric cues, and a lightweight Radius-aware Point Regression Network (RPR-Net) that reformulates IRSTD as target center localization and effective radius regression using spatiotemporal motion cues. The two modules form a closed loop: PAMG generates pseudo masks and geometric supervision during training, while the geometric predictions of RPR-Net are fed back to PAMG for pixel-level mask recovery during inference. To facilitate systematic evaluation, we further construct SIRSTD-Pixel, a sequential dataset with refined pixel-level annotations. Experiments show that the proposed framework achieves strong pseudo-label quality, high detection accuracy, and efficient inference, approaching full-supervision performance under point-supervised settings with substantially lower annotation cost. Code and datasets will be available at: https://github.com/GaoScience/point-to-mask.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Infrared Small Target Detection | SIRSTD-Pixel | AUC95.77 | 14 | |
| Pseudo-label generation quality | SIRSTD | mIoU88.34 | 9 | |
| Pseudo-label generation quality | NUDT-SIRST | mIoU75.11 | 9 | |
| Pseudo-label generation quality | IRSTD-1K | mIoU71.63 | 9 |