Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Grid R-CNN

About

This paper proposes a novel object detection framework named Grid R-CNN, which adopts a grid guided localization mechanism for accurate object detection. Different from the traditional regression based methods, the Grid R-CNN captures the spatial information explicitly and enjoys the position sensitive property of fully convolutional architecture. Instead of using only two independent points, we design a multi-point supervision formulation to encode more clues in order to reduce the impact of inaccurate prediction of specific points. To take the full advantage of the correlation of points in a grid, we propose a two-stage information fusion strategy to fuse feature maps of neighbor grid points. The grid guided localization approach is easy to be extended to different state-of-the-art detection frameworks. Grid R-CNN leads to high quality object localization, and experiments demonstrate that it achieves a 4.1% AP gain at IoU=0.8 and a 10.0% AP gain at IoU=0.9 on COCO benchmark compared to Faster R-CNN with Res50 backbone and FPN architecture.

Xin Lu, Buyu Li, Yuxin Yue, Quanquan Li, Junjie Yan• 2018

Related benchmarks

TaskDatasetResultRank
Object DetectionCOCO (test-dev)
mAP43.2
1195
Object DetectionMS COCO (test-dev)
mAP@.563
677
Object DetectionCOCO (minival)
mAP41.3
184
Object DetectionPascal VOC
mAP55.3
88
Object DetectionMS COCO 2017 (minival)
AP39.1
50
Object DetectionSAR-Aircraft v1.0 (test)
mAP (AP'07)64.15
27
Object DetectionSARDet-100K (test)
MAP50.05
27
SAR Object DetectionSSDD
mAP5088.9
27
Object DetectionMSAR AP'12 protocol (test)
mAP57.3
24
Object DetectionMSAR AP'07 protocol (test)
mAP55.82
24
Showing 10 of 12 rows

Other info

Follow for update