Repulsion Loss: Detecting Pedestrians in a Crowd
About
Detecting individual pedestrians in a crowd remains a challenging problem since the pedestrians often gather together and occlude each other in real-world scenarios. In this paper, we first explore how a state-of-the-art pedestrian detector is harmed by crowd occlusion via experimentation, providing insights into the crowd occlusion problem. Then, we propose a novel bounding box regression loss specifically designed for crowd scenes, termed repulsion loss. This loss is driven by two motivations: the attraction by target, and the repulsion by other surrounding objects. The repulsion term prevents the proposal from shifting to surrounding objects thus leading to more crowd-robust localization. Our detector trained by repulsion loss outperforms all the state-of-the-art methods with a significant improvement in occlusion cases.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Pedestrian Detection | CityPersons (val) | AP (Reasonable)13.7 | 85 | |
| Pedestrian Detection | CrowdHuman (val) | MR^-245.7 | 61 | |
| Pedestrian Detection | CityPersons 1.0 (val) | Miss Rate (Reasonable)13.2 | 21 | |
| Pedestrian Detection | CrowdHuman (test) | MR42.4 | 16 | |
| Pedestrian Detection | CityPersons highly occluded (HO) | Miss Rate56.9 | 16 | |
| Pedestrian Detection | CityPersons original image size (1024x2048 pixels) (test) | AP (Reasonable)13.2 | 11 | |
| Pedestrian Detection | Caltech Reasonable | Miss Rate5 | 11 | |
| Pedestrian Detection | CityPersons (val) | MR-2 (Reasonable)11.6 | 10 | |
| Object Detection | CrowdHuman full-body annotations (val) | Recall90.74 | 10 | |
| Pedestrian Detection | CityPersons reasonable (R) | Miss Rate13.2 | 9 |