Vision-Language Attribute Disentanglement and Reinforcement for Lifelong Person Re-Identification
About
Lifelong person re-identification (LReID) aims to learn from varying domains to obtain a unified person retrieval model. Existing LReID approaches typically focus on learning from scratch or a visual classification-pretrained model, while the Vision-Language Model (VLM) has shown generalizable knowledge in a variety of tasks. Although existing methods can be directly adapted to the VLM, since they only consider global-aware learning, the fine-grained attribute knowledge is underleveraged, leading to limited acquisition and anti-forgetting capacity. To address this problem, we introduce a novel VLM-driven LReID approach named Vision-Language Attribute Disentanglement and Reinforcement (VLADR). Our key idea is to explicitly model the universally shared human attributes to improve inter-domain knowledge transfer, thereby effectively utilizing historical knowledge to reinforce new knowledge learning and alleviate forgetting. Specifically, VLADR includes a Multi-grain Text Attribute Disentanglement mechanism that mines the global and diverse local text attributes of an image. Then, an Inter-domain Cross-modal Attribute Reinforcement scheme is developed, which introduces cross-modal attribute alignment to guide visual attribute extraction and adopts inter-domain attribute alignment to achieve fine-grained knowledge transfer. Experimental results demonstrate that our VLADR outperforms the state-of-the-art methods by 1.9\%-2.2\% and 2.1\%-2.5\% on anti-forgetting and generalization capacity. Our source code is available at https://github.com/zhoujiahuan1991/CVPR2026-VLADR
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Person Re-Identification | Market 1501 | mAP80.1 | 1071 | |
| Person Re-Identification | MSMT17 | mAP0.408 | 514 | |
| Person Re-Identification | CUHK03 | R169.6 | 284 | |
| Person Re-Identification | Seen-domain average (s) | mAP70.4 | 60 | |
| Person Re-Identification | UnSeen Avg | mAP78.1 | 56 | |
| Person Re-Identification | CUHK-SYSU | mAP92 | 50 | |
| Person Re-Identification | SYSU | Rank-1 Accuracy93.6 | 32 | |
| Person Re-Identification | Market | Rank-1 Acc91.3 | 20 | |
| Person Re-Identification | LPW s2 | mAP67.8 | 16 | |
| Person Re-Identification | MSMT17 V2 | mAP45.5 | 16 |