Parsing-based View-aware Embedding Network for Vehicle Re-Identification

About

Vehicle Re-Identification is to find images of the same vehicle from various views in the cross-camera scenario. The main challenges of this task are the large intra-instance distance caused by different views and the subtle inter-instance discrepancy caused by similar vehicles. In this paper, we propose a parsing-based view-aware embedding network (PVEN) to achieve the view-aware feature alignment and enhancement for vehicle ReID. First, we introduce a parsing network to parse a vehicle into four different views, and then align the features by mask average pooling. Such alignment provides a fine-grained representation of the vehicle. Second, in order to enhance the view-aware features, we design a common-visible attention to focus on the common visible views, which not only shortens the distance among intra-instances, but also enlarges the discrepancy of inter-instances. The PVEN helps capture the stable discriminative information of vehicle under different views. The experiments conducted on three datasets show that our model outperforms state-of-the-art methods by a large margin.

Dechao Meng, Liang Li, Xuejing Liu, Yadong Li, Shijie Yang, Zhengjun Zha, Xingyu Gao, Shuhui Wang, Qingming Huang• 2020

Related benchmarks

Task	Dataset	Result
Vehicle Re-identification	VeRi-776 (test)	Rank-195.6	232
Vehicle Re-identification	VehicleID (Small)	R-184.7	61
Vehicle Re-identification	VehicleID (Large)	R-177.8	39
Vehicle Re-identification	VehicleID	Rank-1 Accuracy84.7	26
Vehicle Re-identification	VERI-Wild (Small)	mAP79.8	23
Vehicle Re-identification	VERI-Wild (Medium)	mAP73.91	21
Vehicle Re-identification	VERI-Wild Large	mAP66.2	21
Vehicle Re-identification	VehicleID small (test)	Rank-1 Accuracy84.7	17
Vehicle Re-identification	VeRi-776	mAP79.5	12
Vehicle Re-identification	VehicleID (Medium)	Rank-180.6	9

Showing 10 of 10 rows

Other info

Follow for update

@wizwand_team Discord