Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Poly Kernel Inception Network for Remote Sensing Detection

About

Object detection in remote sensing images (RSIs) often suffers from several increasing challenges, including the large variation in object scales and the diverse-ranging context. Prior methods tried to address these challenges by expanding the spatial receptive field of the backbone, either through large-kernel convolution or dilated convolution. However, the former typically introduces considerable background noise, while the latter risks generating overly sparse feature representations. In this paper, we introduce the Poly Kernel Inception Network (PKINet) to handle the above challenges. PKINet employs multi-scale convolution kernels without dilation to extract object features of varying scales and capture local context. In addition, a Context Anchor Attention (CAA) module is introduced in parallel to capture long-range contextual information. These two components work jointly to advance the performance of PKINet on four challenging remote sensing detection benchmarks, namely DOTA-v1.0, DOTA-v1.5, HRSC2016, and DIOR-R.

Xinhao Cai, Qiuxia Lai, Yuwei Wang, Wenguan Wang, Zeren Sun, Yazhou Yao• 2024

Related benchmarks

TaskDatasetResultRank
Object DetectionCOCO 2017 (val)
AP43.4
2454
Oriented Object DetectionDOTA v1.0 (test)--
378
Object DetectionDOTA 1.0 (test)
Plane AP89.72
256
Object DetectionHRSC 2016 (test)
mAP@0.0790.7
72
Oriented Object DetectionDOTA v1.5 (test)
mAP71.47
58
Object DetectionDOTA v1.5 (test)
mAP71.47
34
Oriented Object DetectionDIOR-R (test)--
28
Object DetectionDOTA v1.0
Overall mAP81.06
24
Oriented Object DetectionDOTA 1.0 (train test)
mAP 5078.4
23
Object DetectionDIOR-R (test)
mAP67.03
22
Showing 10 of 11 rows

Other info

Code

Follow for update