Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Thinking Before Matching: A Reinforcement Reasoning Paradigm Towards General Person Re-Identification

About

Learning identity-discriminative representations with multi-scene generality has become a critical objective in person re-identification (ReID). However, mainstream perception-driven paradigms tend to identify fitting from massive annotated data rather than identity-causal cues understanding, which presents a fragile representation against multiple disruptions. In this work, ReID-R is proposed as a novel reasoning-driven paradigm that achieves explicit identity understanding and reasoning by incorporating chain-of-thought into the ReID pipeline. Specifically, ReID-R consists of a two-stage contribution: (i) Discriminative reasoning warm-up, where a model is trained in a CoT label-free manner to acquire identity-aware feature understanding; and (ii) Efficient reinforcement learning, which proposes a non-trivial sampling to construct scene-generalizable data. On this basis, ReID-R leverages high-quality reward signals to guide the model toward focusing on ID-related cues, achieving accurate reasoning and correct responses. Extensive experiments on multiple ReID benchmarks demonstrate that ReID-R achieves competitive identity discrimination as superior methods using only 14.3K non-trivial data (20.9% of the existing data scale). Furthermore, benefit from inherent reasoning, ReID-R can provide high-quality interpretation for results.

Quan Zhang, Jingze Wu, Jialong Wang, Xiaohua Xie, Jianhuang Lai, Hongbo Chen• 2026

Related benchmarks

TaskDatasetResultRank
Person Re-IdentificationPRCC Clothes-Changing (CC) (test)
R-1 Accuracy46.4
26
Clothing-Change Person Re-identificationVC-Clothes (test)
Rank-190.2
13
Person Re-IdentificationMarket1501 53 (test)
mAP92.5
6
Person Re-IdentificationMSMT17 34 (test)
mAP69.3
5
Person Re-IdentificationCUHK03 20 (test)
mAP85.6
4
Showing 5 of 5 rows

Other info

Follow for update