STA: Spatial-Temporal Attention for Large-Scale Video-based Person Re-Identification
About
In this work, we propose a novel Spatial-Temporal Attention (STA) approach to tackle the large-scale person re-identification task in videos. Different from the most existing methods, which simply compute representations of video clips using frame-level aggregation (e.g. average pooling), the proposed STA adopts a more effective way for producing robust clip-level feature representation. Concretely, our STA fully exploits those discriminative parts of one target person in both spatial and temporal dimensions, which results in a 2-D attention score matrix via inter-frame regularization to measure the importances of spatial parts across different frames. Thus, a more robust clip-level feature representation can be generated according to a weighted sum operation guided by the mined 2-D attention score matrix. In this way, the challenging cases for video-based person re-identification such as pose variation and partial occlusion can be well tackled by the STA. We conduct extensive experiments on two large-scale benchmarks, i.e. MARS and DukeMTMC-VideoReID. In particular, the mAP reaches 87.7% on MARS, which significantly outperforms the state-of-the-arts with a large margin of more than 11.6%.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Video Person Re-ID | MARS | Rank-1 Acc86.3 | 106 | |
| Person Re-Identification | MARS (test) | Rank-186.3 | 72 | |
| Person Re-Identification | MARS | Rank-186.2 | 67 | |
| Video Person Re-Identification | MARS (test) | Rank-186.3 | 35 | |
| Video Person Re-Identification | DukeMTMC-VideoReID | Rank-1 Accuracy96.2 | 26 | |
| Video-to-Video Person Re-identification | MARS (test) | Top-1 Accuracy86.3 | 22 | |
| Video Person Re-Identification | MARS v1 (test) | mAP85.1 | 21 | |
| Video Person Re-Identification | Market-1501 v1 (test) | Rank-186.3 | 21 | |
| Image-to-Video Person Re-identification | DukeMTMC-VideoReID (test) | Top-1 Acc96.2 | 16 | |
| Video-based Person Re-identification | DukeV | R196 | 15 |