Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Crowd Counting and Density Estimation by Trellis Encoder-Decoder Network

About

Crowd counting has recently attracted increasing interest in computer vision but remains a challenging problem. In this paper, we propose a trellis encoder-decoder network (TEDnet) for crowd counting, which focuses on generating high-quality density estimation maps. The major contributions are four-fold. First, we develop a new trellis architecture that incorporates multiple decoding paths to hierarchically aggregate features at different encoding stages, which can handle large variations of objects. Second, we design dense skip connections interleaved across paths to facilitate sufficient multi-scale feature fusions and to absorb the supervision information. Third, we propose a new combinatorial loss to enforce local coherence and spatial correlation in density maps. By distributedly imposing this combinatorial loss on intermediate outputs, gradient vanishing can be largely alleviated for better back-propagation and faster convergence. Finally, our TEDnet achieves new state-of-the art performance on four benchmarks, with an improvement up to 14% in terms of MAE.

Xiaolong Jiang, Zehao Xiao, Baochang Zhang, Xiantong Zhen, Xianbin Cao, David Doermann, Ling Shao• 2019

Related benchmarks

TaskDatasetResultRank
Crowd CountingShanghaiTech Part A (test)
MAE64.2
227
Crowd CountingShanghaiTech Part B (test)
MAE8.2
191
Crowd CountingShanghaiTech Part B
MAE8.2
160
Crowd CountingShanghaiTech Part A
MAE64.2
138
Crowd CountingUCF-QNRF (test)
MAE113
95
Crowd CountingUCF_CC_50
MAE249.4
60
Crowd CountingUCF-QNRF
MAE113
48
Crowd CountingUCF_CC_50 (5-fold cross-validation)
MAE249.4
43
Showing 8 of 8 rows

Other info

Follow for update