Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Person Search with Natural Language Description

About

Searching persons in large-scale image databases with the query of natural language description has important applications in video surveillance. Existing methods mainly focused on searching persons with image-based or attribute-based queries, which have major limitations for a practical usage. In this paper, we study the problem of person search with natural language description. Given the textual description of a person, the algorithm of the person search is required to rank all the samples in the person database then retrieve the most relevant sample corresponding to the queried description. Since there is no person dataset or benchmark with textual description available, we collect a large-scale person description dataset with detailed natural language annotations and person samples from various sources, termed as CUHK Person Description Dataset (CUHK-PEDES). A wide range of possible models and baselines have been evaluated and compared on the person search benchmark. An Recurrent Neural Network with Gated Neural Attention mechanism (GNA-RNN) is proposed to establish the state-of-the art performance on person search.

Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, Xiaogang Wang• 2017

Related benchmarks

TaskDatasetResultRank
Text-to-image Person Re-identificationCUHK-PEDES (test)
Rank-1 Accuracy (R-1)19.05
150
Text-based Person SearchCUHK-PEDES (test)
Rank-119.05
142
Text-to-Image RetrievalCUHK-PEDES (test)
Recall@119.05
96
Text-based Person SearchCUHK-PEDES
Recall@119.05
61
Person SearchCUHK-PEDES (test)
Recall@119.05
47
Text-to-image Person Re-identificationCUHK-PEDES
Rank-119.05
34
Text to ImageCUHK-PEDES
Rank-119.05
28
Cross-modal Person Re-identificationCUHK-PEDES (test)
Rank@119.05
24
Showing 8 of 8 rows

Other info

Follow for update