Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Few-shot Object Counting and Detection

About

We tackle a new task of few-shot object counting and detection. Given a few exemplar bounding boxes of a target object class, we seek to count and detect all objects of the target class. This task shares the same supervision as the few-shot object counting but additionally outputs the object bounding boxes along with the total object count. To address this challenging problem, we introduce a novel two-stage training strategy and a novel uncertainty-aware few-shot object detector: Counting-DETR. The former is aimed at generating pseudo ground-truth bounding boxes to train the latter. The latter leverages the pseudo ground-truth provided by the former but takes the necessary steps to account for the imperfection of pseudo ground-truth. To validate the performance of our method on the new task, we introduce two new datasets named FSCD-147 and FSCD-LVIS. Both datasets contain images with complex scenes, multiple object classes per image, and a huge variation in object shapes, sizes, and appearance. Our proposed approach outperforms very strong baselines adapted from few-shot object counting and few-shot object detection with a large margin in both counting and detection metrics. The code and models are available at https://github.com/VinAIResearch/Counting-DETR.

Thanh Nguyen, Chau Pham, Khoi Nguyen, Minh Hoai• 2022

Related benchmarks

TaskDatasetResultRank
Object CountingFSC-147 (test)
MAE16.79
297
Object CountingFSC-147 (val)
MAE20.38
211
Object CountingFSC-147 1.0 (test)
MAE16.79
50
Object CountingFSC-147 1.0 (val)
MAE20.38
50
Object DetectionFSCD-147 (test)
AP22.66
29
Object CountingFSCD-LVIS (test)
MAE18.51
21
CountingFSCD-147 (test)
MAE16.79
21
Few-shot Object CountingFSC147 1.0 (test)
MAE16.79
19
Few-shot Object CountingFSC147 1.0 (val)
MAE20.38
19
Object DetectionFSCD-LVIS unseen classes (test)
AP3.85
17
Showing 10 of 15 rows

Other info

Code

Follow for update