FlexiReID: Adaptive Mixture of Expert for Multi-Modal Person Re-Identification
About
Multimodal person re-identification (Re-ID) aims to match pedestrian images across different modalities. However, most existing methods focus on limited cross-modal settings and fail to support arbitrary query-retrieval combinations, hindering practical deployment. We propose FlexiReID, a flexible framework that supports seven retrieval modes across four modalities: rgb, infrared, sketches, and text. FlexiReID introduces an adaptive mixture-of-experts (MoE) mechanism to dynamically integrate diverse modality features and a cross-modal query fusion module to enhance multimodal feature extraction. To facilitate comprehensive evaluation, we construct CIRS-PEDES, a unified dataset extending four popular Re-ID datasets to include all four modalities. Extensive experiments demonstrate that FlexiReID achieves state-of-the-art performance and offers strong generalization in complex scenarios.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Person Re-Identification | Market 1501 | mAP92.1 | 999 | |
| Person Re-Identification | MSMT17 | mAP0.675 | 404 | |
| Text-to-image Person Re-identification | CUHK-PEDES (test) | Rank-1 Accuracy (R-1)69.2 | 150 | |
| Text-to-image Person Re-identification | ICFG-PEDES (test) | Rank-10.6134 | 81 | |
| Text-based Person Re-identification | RSTPReid (test) | Rank-1 Acc55.79 | 52 | |
| Cross-modal Person Re-identification | CUHK-PEDES (test) | Rank@184.92 | 24 | |
| Sketch-to-Real Person Re-identification | ICFG-PEDES (test) | Rank-1 Accuracy (R1)79.28 | 7 | |
| Sketch-to-Real Person Re-identification | RSTPReid (test) | Rank-1 Accuracy (R1)66.79 | 7 | |
| Infrared-to-Real Person Re-identification | CUHK-PEDES (test) | Rank-1 Accuracy (R1)85.26 | 6 | |
| Infrared-to-Real Person Re-identification | ICFG-PEDES (test) | Rank-1 (R1)82.03 | 6 |