Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning RoI Transformer for Detecting Oriented Objects in Aerial Images

About

Object detection in aerial images is an active yet challenging task in computer vision because of the birdview perspective, the highly complex backgrounds, and the variant appearances of objects. Especially when detecting densely packed objects in aerial images, methods relying on horizontal proposals for common object detection often introduce mismatches between the Region of Interests (RoIs) and objects. This leads to the common misalignment between the final object classification confidence and localization accuracy. Although rotated anchors have been used to tackle this problem, the design of them always multiplies the number of anchors and dramatically increases the computational complexity. In this paper, we propose a RoI Transformer to address these problems. More precisely, to improve the quality of region proposals, we first designed a Rotated RoI (RRoI) learner to transform a Horizontal Region of Interest (HRoI) into a Rotated Region of Interest (RRoI). Based on the RRoIs, we then proposed a Rotated Position Sensitive RoI Align (RPS-RoI-Align) module to extract rotation-invariant features from them for boosting subsequent classification and regression. Our RoI Transformer is with light weight and can be easily embedded into detectors for oriented object detection. A simple implementation of the RoI Transformer has achieved state-of-the-art performances on two common and challenging aerial datasets, i.e., DOTA and HRSC2016, with a neglectable reduction to detection speed. Our RoI Transformer exceeds the deformable Position Sensitive RoI pooling when oriented bounding-box annotations are available. Extensive experiments have also validated the flexibility and effectiveness of our RoI Transformer. The results demonstrate that it can be easily integrated with other detector architectures and significantly improve the performances.

Jian Ding, Nan Xue, Yang Long, Gui-Song Xia, Qikai Lu• 2018

Related benchmarks

TaskDatasetResultRank
Oriented Object DetectionDOTA v1.0 (test)
SV68.81
378
Object DetectionDOTA 1.0 (test)
Plane AP88.64
256
Oriented Object DetectionDOTA (test)
AP (Plane)88.64
92
Oriented Object DetectionDOTA v1.5 (test)
mAP65.03
58
Oriented Object DetectionDIOR-R
mAP5063.87
44
Oriented Object DetectionHRSC2016
mAP86.2
35
Object DetectionDOTA
AP (Plane)88.64
28
Oriented Object DetectionDOTA v2.0 (test-dev)
AP (Plane)71.81
18
Oriented Object DetectionRSAR (test)
mAP35.02
9
Showing 9 of 9 rows

Other info

Follow for update