Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances

About

With the rapidly increasing demand for oriented object detection (OOD), recent research involving weakly-supervised detectors for learning OOD from point annotations has gained great attention. In this paper, we rethink this challenging task setting with the layout among instances and present Point2RBox-v2. At the core are three principles: 1) Gaussian overlap loss. It learns an upper bound for each instance by treating objects as 2D Gaussian distributions and minimizing their overlap. 2) Voronoi watershed loss. It learns a lower bound for each instance through watershed on Voronoi tessellation. 3) Consistency loss. It learns the size/rotation variation between two output sets with respect to an input image and its augmented view. Supplemented by a few devised techniques, e.g. edge loss and copy-paste, the detector is further enhanced. To our best knowledge, Point2RBox-v2 is the first approach to explore the spatial layout among instances for learning point-supervised OOD. Our solution is elegant and lightweight, yet it is expected to give a competitive performance especially in densely packed scenes: 62.61%/86.15%/34.71% on DOTA/HRSC/FAIR1M. Code is available at https://github.com/VisionXLab/point2rbox-v2.

Yi Yu, Botao Ren, Peiyuan Zhang, Mingxin Liu, Junwei Luo, Shaofeng Zhang, Feipeng Da, Junchi Yan, Xue Yang• 2025

Related benchmarks

TaskDatasetResultRank
Oriented Object DetectionDOTA v1.0 (test)
SV79.6
378
Oriented Object DetectionDOTA v1.0
AP5062.61
16
Oriented Object DetectionDIOR
AP5044.45
15
Oriented Object DetectionDOTA v1.5
AP@5054.06
14
Oriented Object DetectionDOTA v2.0
AP5038.79
14
Oriented Object DetectionFAIR1M
AP5034.71
14
Oriented Object DetectionSTAR
AP5014.2
13
Oriented Object DetectionRSAR
AP5030.9
13
Oriented Object DetectionHRSC
AP5086.15
10
Oriented Object DetectionSKU110K
AP5065.64
10
Showing 10 of 10 rows

Other info

Code

Follow for update