Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Few-Shot Object Detection with Fully Cross-Transformer

About

Few-shot object detection (FSOD), with the aim to detect novel objects using very few training examples, has recently attracted great research interest in the community. Metric-learning based methods have been demonstrated to be effective for this task using a two-branch based siamese network, and calculate the similarity between image regions and few-shot examples for detection. However, in previous works, the interaction between the two branches is only restricted in the detection head, while leaving the remaining hundreds of layers for separate feature extraction. Inspired by the recent work on vision transformers and vision-language transformers, we propose a novel Fully Cross-Transformer based model (FCT) for FSOD by incorporating cross-transformer into both the feature backbone and detection head. The asymmetric-batched cross-attention is proposed to aggregate the key information from the two branches with different batch sizes. Our model can improve the few-shot similarity learning between the two branches by introducing the multi-level interactions. Comprehensive experiments on both PASCAL VOC and MSCOCO FSOD benchmarks demonstrate the effectiveness of our model.

Guangxing Han, Jiawei Ma, Shiyuan Huang, Long Chen, Shih-Fu Chang• 2022

Related benchmarks

TaskDatasetResultRank
Object DetectionPASCAL VOC (Novel Set 1)
mAP@5064.3
223
Object DetectionPASCAL VOC Novel Set 3 2007+2012
mAP5058.7
139
Object DetectionMS COCO novel classes
nAP2.14e+3
132
Object DetectionPASCAL VOC Set 2 (novel)--
110
Object DetectionPASCAL VOC 2007+2012 (Novel Set 1)--
75
Object DetectionPASCAL VOC Novel Set 2 2007+2012--
75
Object DetectionPASCAL VOC (Novel Set 1)
AP50 (shot=1)49.9
71
Object DetectionPASCAL VOC Set 3 (novel)
AP50 (shot=1)39.5
71
Object DetectionPascal VOC (Novel Split 2)
nAP5051.2
65
Object DetectionPascal VOC (Novel Split 3)
AP5058.7
65
Showing 10 of 21 rows

Other info

Follow for update