Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Video Individual Counting for Moving Drones

About

Video Individual Counting (VIC) has received increasing attention for its importance in intelligent video surveillance. Existing works are limited in two aspects, i.e., dataset and method. Previous datasets are captured with fixed or rarely moving cameras with relatively sparse individuals, restricting evaluation for a highly varying view and time in crowded scenes. Existing methods rely on localization followed by association or classification, which struggle under dense and dynamic conditions due to inaccurate localization of small targets. To address these issues, we introduce the MovingDroneCrowd Dataset, featuring videos captured by fast-moving drones in crowded scenes under diverse illuminations, shooting heights and angles. We further propose a Shared Density map-guided Network (SDNet) using a Depth-wise Cross-Frame Attention (DCFA) module to directly estimate shared density maps between consecutive frames, from which the inflow and outflow density maps are derived by subtracting the shared density maps from the global density maps. The inflow density maps across frames are summed up to obtain the number of unique pedestrians in a video. Experiments on our datasets and publicly available ones show the superiority of our method over the state of the arts in highly dynamic and complex crowded scenes. Our dataset and codes have been released publicly.

Yaowu Fan, Jia Wan, Tao Han, Antoni B. Chan, Andy J. Ma• 2025

Related benchmarks

TaskDatasetResultRank
Video Individual CountingCroHD (test)
MAE128.9
26
Video Individual CountingSenseCrowd (test)
MAE8.6
23
Video-level crowd countingMovingDroneCrowd++
MAE76.24
11
Video Individual CountingMovingDroneCrowd
MAE41
10
Video-level crowd countingVSCrowd
MAE8.6
9
Video Individual CountingWuhanMetroCrowd (test)
MAE166
6
Video Individual CountingWuhanMetroCrowd
FPS2.46
6
Showing 7 of 7 rows

Other info

Follow for update