Parsing-based View-aware Embedding Network for Vehicle Re-Identification
About
Vehicle Re-Identification is to find images of the same vehicle from various views in the cross-camera scenario. The main challenges of this task are the large intra-instance distance caused by different views and the subtle inter-instance discrepancy caused by similar vehicles. In this paper, we propose a parsing-based view-aware embedding network (PVEN) to achieve the view-aware feature alignment and enhancement for vehicle ReID. First, we introduce a parsing network to parse a vehicle into four different views, and then align the features by mask average pooling. Such alignment provides a fine-grained representation of the vehicle. Second, in order to enhance the view-aware features, we design a common-visible attention to focus on the common visible views, which not only shortens the distance among intra-instances, but also enlarges the discrepancy of inter-instances. The PVEN helps capture the stable discriminative information of vehicle under different views. The experiments conducted on three datasets show that our model outperforms state-of-the-art methods by a large margin.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Vehicle Re-identification | VeRi-776 (test) | Rank-195.6 | 232 | |
| Vehicle Re-identification | VehicleID (Small) | R-184.7 | 61 | |
| Vehicle Re-identification | VehicleID (Large) | R-177.8 | 39 | |
| Vehicle Re-identification | VehicleID | Rank-1 Accuracy84.7 | 23 | |
| Vehicle Re-identification | VERI-Wild (Small) | mAP79.8 | 23 | |
| Vehicle Re-identification | VERI-Wild (Medium) | mAP73.91 | 21 | |
| Vehicle Re-identification | VERI-Wild Large | mAP66.2 | 21 | |
| Vehicle Re-identification | VehicleID small (test) | Rank-1 Accuracy84.7 | 17 | |
| Vehicle Re-identification | VehicleID (Medium) | Rank-180.6 | 9 | |
| Vehicle Re-identification | VeRi-776 | mAP79.5 | 9 |