Semantically Self-Aligned Network for Text-to-Image Part-aware Person Re-identification
About
Text-to-image person re-identification (ReID) aims to search for images containing a person of interest using textual descriptions. However, due to the significant modality gap and the large intra-class variance in textual descriptions, text-to-image ReID remains a challenging problem. Accordingly, in this paper, we propose a Semantically Self-Aligned Network (SSAN) to handle the above problems. First, we propose a novel method that automatically extracts semantically aligned part-level features from the two modalities. Second, we design a multi-view non-local network that captures the relationships between body parts, thereby establishing better correspondences between body parts and noun phrases. Third, we introduce a Compound Ranking (CR) loss that makes use of textual descriptions for other images of the same identity to provide extra supervision, thereby effectively reducing the intra-class variance in textual features. Finally, to expedite future research in text-to-image ReID, we build a new database named ICFG-PEDES. Extensive experiments demonstrate that SSAN outperforms state-of-the-art approaches by significant margins. Both the new ICFG-PEDES database and the SSAN code are available at https://github.com/zifyloo/SSAN.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Text-to-image Person Re-identification | CUHK-PEDES (test) | Rank-1 Accuracy (R-1)61.37 | 150 | |
| Text-based Person Search | CUHK-PEDES (test) | Rank-161.37 | 142 | |
| Text-based Person Search | ICFG-PEDES (test) | R@154.23 | 104 | |
| Text-to-Image Retrieval | CUHK-PEDES (test) | Recall@161.37 | 96 | |
| Text-based Person Search | RSTPReid (test) | R@143.5 | 85 | |
| Text-to-image Person Re-identification | ICFG-PEDES (test) | Rank-10.5423 | 81 | |
| Text-based Person Search | CUHK-PEDES | Recall@161.4 | 61 | |
| Text-based Person Re-identification | RSTPReid (test) | Rank-1 Acc43.5 | 52 | |
| Person Search | CUHK-PEDES (test) | Recall@161.37 | 47 | |
| Text-to-image Person Re-identification | CUHK-PEDES | Rank-161.37 | 34 |