Occlusion-aware R-CNN: Detecting Pedestrians in a Crowd

About

Pedestrian detection in crowded scenes is a challenging problem since the pedestrians often gather together and occlude each other. In this paper, we propose a new occlusion-aware R-CNN (OR-CNN) to improve the detection accuracy in the crowd. Specifically, we design a new aggregation loss to enforce proposals to be close and locate compactly to the corresponding objects. Meanwhile, we use a new part occlusion-aware region of interest (PORoI) pooling unit to replace the RoI pooling layer in order to integrate the prior structure information of human body with visibility prediction into the network to handle occlusion. Our detector is trained in an end-to-end fashion, which achieves state-of-the-art results on three pedestrian detection datasets, i.e., CityPersons, ETH, and INRIA, and performs on-pair with the state-of-the-arts on Caltech.

Shifeng Zhang, Longyin Wen, Xiao Bian, Zhen Lei, Stan Z. Li• 2018

Related benchmarks

Task	Dataset	Result
Pedestrian Detection	CityPersons (val)	AP (Reasonable)12.8	85
Pedestrian Detection	CityPersons 1.0 (val)	Miss Rate (Reasonable)12.8	21
Pedestrian Detection	CityPersons original image size (1024x2048 pixels) (test)	AP (Reasonable)12.8	11
Pedestrian Detection	CityPersons (val)	MR-2 (Reasonable)11	10
Oriented Object Detection	RSAR (test)	mAP64.82	9
Pedestrian Detection	CityPersons Reasonable	Miss Rate12.8	9
Pedestrian Detection	CityPersons Reasonable (test)	MR-211.32	8
Pedestrian Detection	CityPersons Reasonable-Small (test)	MR-214.19	3

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord