Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Context-Aware Crowd Counting

About

State-of-the-art methods for counting people in crowded scenes rely on deep networks to estimate crowd density. They typically use the same filters over the whole image or over large image patches. Only then do they estimate local scale to compensate for perspective distortion. This is typically achieved by training an auxiliary classifier to select, for predefined image patches, the best kernel size among a limited set of choices. As such, these methods are not end-to-end trainable and restricted in the scope of context they can leverage. In this paper, we introduce an end-to-end trainable deep architecture that combines features obtained using multiple receptive field sizes and learns the importance of each such feature at each image location. In other words, our approach adaptively encodes the scale of the contextual information required to accurately predict crowd density. This yields an algorithm that outperforms state-of-the-art crowd counting methods, especially when perspective effects are strong.

Weizhe Liu, Mathieu Salzmann, Pascal Fua• 2018

Related benchmarks

TaskDatasetResultRank
Crowd CountingShanghaiTech Part A (test)
MAE62.3
227
Crowd CountingShanghaiTech Part B (test)
MAE7.7
191
Crowd CountingShanghaiTech Part B
MAE7.8
160
Crowd CountingShanghaiTech Part A
MAE62.3
138
Crowd CountingUCF-QNRF (test)
MAE107
95
Crowd CountingWorldExpo'10 (test)
Scene 1 Error2.4
80
Crowd CountingUCF_CC_50 (test)
MAE212.2
66
Crowd CountingUCF_CC_50
MAE212.2
60
Crowd CountingUCF-QNRF
MAE107
48
Crowd CountingUCF_CC_50 (5-fold cross-validation)
MAE212.2
43
Showing 10 of 19 rows

Other info

Code

Follow for update