Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Meta Faster R-CNN: Towards Accurate Few-Shot Object Detection with Attentive Feature Alignment

About

Few-shot object detection (FSOD) aims to detect objects using only a few examples. How to adapt state-of-the-art object detectors to the few-shot domain remains challenging. Object proposal is a key ingredient in modern object detectors. However, the quality of proposals generated for few-shot classes using existing methods is far worse than that of many-shot classes, e.g., missing boxes for few-shot classes due to misclassification or inaccurate spatial locations with respect to true objects. To address the noisy proposal problem, we propose a novel meta-learning based FSOD model by jointly optimizing the few-shot proposal generation and fine-grained few-shot proposal classification. To improve proposal generation for few-shot classes, we propose to learn a lightweight metric-learning based prototype matching network, instead of the conventional simple linear object/nonobject classifier, e.g., used in RPN. Our non-linear classifier with the feature fusion network could improve the discriminative prototype matching and the proposal recall for few-shot classes. To improve the fine-grained few-shot proposal classification, we propose a novel attentive feature alignment method to address the spatial misalignment between the noisy proposals and few-shot classes, thus improving the performance of few-shot object detection. Meanwhile we learn a separate Faster R-CNN detection head for many-shot base classes and show strong performance of maintaining base-classes knowledge. Our model achieves state-of-the-art performance on multiple FSOD benchmarks over most of the shots and metrics.

Guangxing Han, Shiyuan Huang, Jiawei Ma, Yicheng He, Shih-Fu Chang• 2021

Related benchmarks

TaskDatasetResultRank
Object DetectionPASCAL VOC (Novel Set 1)--
71
Object DetectionPascal VOC (Novel Split 2)
nAP5051.4
65
Object DetectionPascal VOC (Novel Split 3)
AP5059.9
65
Object DetectionPascal-5i 2010 (Novel Split 1)
nAP5066.1
54
Object DetectionCOCO-FSOD 30-shot COCO-20
nAP16.6
47
Few-shot Object DetectionMS-COCO 10-shot (novel classes)
nAP12.7
34
Few-shot Object DetectionMS-COCO 30-shot (novel classes)
nAP (Novel)16.6
34
Few-shot Object DetectionCOCO 2014 (novel)
nAP16.6
31
Few-shot Object DetectionCOCO FSOD 10-shot Standard
nAP12.7
17
Object DetectionCOCO-20i 10-shot
nAP12.7
16
Showing 10 of 13 rows

Other info

Follow for update