Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences
About
We address the problem of visible-infrared person re-identification (VI-reID), that is, retrieving a set of person images, captured by visible or infrared cameras, in a cross-modal setting. Two main challenges in VI-reID are intra-class variations across person images, and cross-modal discrepancies between visible and infrared images. Assuming that the person images are roughly aligned, previous approaches attempt to learn coarse image- or rigid part-level person representations that are discriminative and generalizable across different modalities. However, the person images, typically cropped by off-the-shelf object detectors, are not necessarily well-aligned, which distract discriminative person representation learning. In this paper, we introduce a novel feature learning framework that addresses these problems in a unified way. To this end, we propose to exploit dense correspondences between cross-modal person images. This allows to address the cross-modal discrepancies in a pixel-level, suppressing modality-related features from person representations more effectively. This also encourages pixel-wise associations between cross-modal local features, further facilitating discriminative feature learning for VI-reID. Extensive experiments and analyses on standard VI-reID benchmarks demonstrate the effectiveness of our approach, which significantly outperforms the state of the art.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Cross-modality Person Re-identification | SYSU-MM01 (All Search) | Recall@155.4 | 142 | |
| Visible-Thermal Person Re-identification | RegDB Visible to Thermal | Rank-174.2 | 140 | |
| Cross-modality Person Re-identification | SYSU-MM01 (Indoor Search) | Rank-158.5 | 114 | |
| Visible-Thermal Person Re-identification | RegDB Thermal to Visible | Rank-167.5 | 79 | |
| Vehicle Re-identification | MSVR310 | mAP18.67 | 29 | |
| Ship Re-identification | HOSS-ReID Optical to SAR | mAP11.9 | 19 | |
| Ship Re-identification | HOSS-ReID SAR to Optical | mAP8.5 | 19 | |
| Ship Re-identification | HOSS-ReID (All) | mAP33 | 19 | |
| Infrared-to-Visible Video Person Re-identification | BUPTCampus | Rank-10.391 | 18 | |
| Visible-to-Infrared Video Person Re-identification | BUPTCampus | Rank-132.1 | 18 |