Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CAIBC: Capturing All-round Information Beyond Color for Text-based Person Retrieval

About

Given a natural language description, text-based person retrieval aims to identify images of a target person from a large-scale person image database. Existing methods generally face a \textbf{color over-reliance problem}, which means that the models rely heavily on color information when matching cross-modal data. Indeed, color information is an important decision-making accordance for retrieval, but the over-reliance on color would distract the model from other key clues (e.g. texture information, structural information, etc.), and thereby lead to a sub-optimal retrieval performance. To solve this problem, in this paper, we propose to \textbf{C}apture \textbf{A}ll-round \textbf{I}nformation \textbf{B}eyond \textbf{C}olor (\textbf{CAIBC}) via a jointly optimized multi-branch architecture for text-based person retrieval. CAIBC contains three branches including an RGB branch, a grayscale (GRS) branch and a color (CLR) branch. Besides, with the aim of making full use of all-round information in a balanced and effective way, a mutual learning mechanism is employed to enable the three branches which attend to varied aspects of information to communicate with and learn from each other. Extensive experimental analysis is carried out to evaluate our proposed CAIBC method on the CUHK-PEDES and RSTPReid datasets in both \textbf{supervised} and \textbf{weakly supervised} text-based person retrieval settings, which demonstrates that CAIBC significantly outperforms existing methods and achieves the state-of-the-art performance on all the three tasks.

Zijie Wang, Aichun Zhu, Jingyi Xue, Xili Wan, Chao Liu, Tian Wang, Yifeng Li• 2022

Related benchmarks

TaskDatasetResultRank
Text-based Person SearchCUHK-PEDES (test)
Rank-164.43
171
Text-to-image Person Re-identificationCUHK-PEDES (test)
Rank-1 Accuracy (R-1)64.43
150
Text-based Person SearchRSTPReid (test)
R@147.35
136
Text-to-Image RetrievalCUHK-PEDES (test)
Recall@164.43
114
Text-based Person SearchCUHK-PEDES
Recall@164.43
90
Text-to-image person retrievalRSTPReid
Rank-1 Accuracy47.35
66
Text-based Person Re-identificationRSTPReid (test)
Rank-1 Acc47.35
52
Text to ImageCUHK-PEDES
Rank-164.43
28
Text-to-image person retrievalCUHK-PEDES
R@164.43
28
Text-based Person RetrievalCUHK-PEDES 1.0 (test)
R@164.43
15
Showing 10 of 10 rows

Other info

Follow for update