Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization

About

Pedestrian attribute recognition has been an emerging research topic in the area of video surveillance. To predict the existence of a particular attribute, it is demanded to localize the regions related to the attribute. However, in this task, the region annotations are not available. How to carve out these attribute-related regions remains challenging. Existing methods applied attribute-agnostic visual attention or heuristic body-part localization mechanisms to enhance the local feature representations, while neglecting to employ attributes to define local feature areas. We propose a flexible Attribute Localization Module (ALM) to adaptively discover the most discriminative regions and learns the regional features for each attribute at multiple levels. Moreover, a feature pyramid architecture is also introduced to enhance the attribute-specific localization at low-levels with high-level semantic guidance. The proposed framework does not require additional region annotations and can be trained end-to-end with multi-level deep supervision. Extensive experiments show that the proposed method achieves state-of-the-art results on three pedestrian attribute datasets, including PETA, RAP, and PA-100K.

Chufeng Tang, Lu Sheng, Zhaoxiang Zhang, Xiaolin Hu• 2019

Related benchmarks

Task	Dataset	Result
Pedestrian Attribute Recognition	PA-100K	mA80.68	92
Pedestrian Attribute Recognition	PETA	mA86.3	52
Pedestrian Attribute Recognition	PA-100K (test)	mA80.68	40
Pedestrian Attribute Recognition	EventPAR (test)	mA57.18	40
Pedestrian Attribute Recognition	RAP	mA81.87	26
Pedestrian Attribute Recognition	PETA existing vs. zero-shot (multiple)	mA84.24	26
Attribute Recognition	RAP zero-shot	mA74.28	18
Pedestrian Attribute Recognition	RAP existing vs. zero-shot v2 (multiple)	mA78.21	8

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord