Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning To Count Everything

About

Existing works on visual counting primarily focus on one specific category at a time, such as people, animals, and cells. In this paper, we are interested in counting everything, that is to count objects from any category given only a few annotated instances from that category. To this end, we pose counting as a few-shot regression task. To tackle this task, we present a novel method that takes a query image together with a few exemplar objects from the query image and predicts a density map for the presence of all objects of interest in the query image. We also present a novel adaptation strategy to adapt our network to any novel visual category at test time, using only a few exemplar objects from the novel category. We also introduce a dataset of 147 object categories containing over 6000 images that are suitable for the few-shot counting task. The images are annotated with two types of annotation, dots and bounding boxes, and they can be used for developing few-shot counting models. Experiments on this dataset shows that our method outperforms several state-of-the-art object detectors and few-shot counting approaches. Our code and dataset can be found at https://github.com/cvlab-stonybrook/LearningToCountEverything.

Viresh Ranjan, Udbhav Sharma, Thu Nguyen, Minh Hoai• 2021

Related benchmarks

TaskDatasetResultRank
Object CountingFSC-147 (test)
MAE22.08
297
Object CountingFSC-147 (val)
MAE23.75
211
Crowd CountingShanghaiTech Part B
MAE24.8
160
Crowd CountingShanghaiTech Part A
MAE159.1
138
Car Object CountingCARPK (test)
MAE18.19
116
Crowd CountingWorldExpo'10 (test)
Scene 1 Error4.5
80
Object CountingFSC-147 1.0 (val)
MAE23.75
50
Object CountingFSC-147 1.0 (test)
MAE22.08
50
CountingCARPK
MAE18.19
41
Car CountingPUCPR+ (test)
MAE14.68
31
Showing 10 of 41 rows

Other info

Code

Follow for update