Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

NAS-Count: Counting-by-Density with Neural Architecture Search

About

Most of the recent advances in crowd counting have evolved from hand-designed density estimation networks, where multi-scale features are leveraged to address the scale variation problem, but at the expense of demanding design efforts. In this work, we automate the design of counting models with Neural Architecture Search (NAS) and introduce an end-to-end searched encoder-decoder architecture, Automatic Multi-Scale Network (AMSNet). Specifically, we utilize a counting-specific two-level search space. The encoder and decoder in AMSNet are composed of different cells discovered from micro-level search, while the multi-path architecture is explored through macro-level search. To solve the pixel-level isolation issue in MSE loss, AMSNet is optimized with an auto-searched Scale Pyramid Pooling Loss (SPPLoss) that supervises the multi-scale structural information. Extensive experiments on four datasets show AMSNet produces state-of-the-art results that outperform hand-designed models, fully demonstrating the efficacy of NAS-Count.

Yutao Hu, Xiaolong Jiang, Xuhui Liu, Baochang Zhang, Jungong Han, Xianbin Cao, David Doermann• 2020

Related benchmarks

TaskDatasetResultRank
Crowd CountingShanghaiTech Part A (test)
MAE56.7
227
Crowd CountingShanghaiTech Part B (test)
MAE6.7
191
Crowd CountingShanghaiTech Part B
MAE6.7
160
Crowd CountingShanghaiTech Part A
MAE56.7
138
Crowd CountingUCF-QNRF (test)
MAE101.8
95
Crowd CountingUCF_CC_50 (test)
MAE208.4
66
Crowd CountingUCF_CC_50
MAE208.4
60
Crowd CountingUCF-QNRF
MAE101.8
14
Crowd CountingSHHA 55 (test)
MAE56.7
13
Crowd CountingSHHB 55 (test)
MAE6.7
12
Showing 10 of 11 rows

Other info

Follow for update