Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

PLOT: Text-based Person Search with Part Slot Attention for Corresponding Part Discovery

About

Text-based person search, employing free-form text queries to identify individuals within a vast image collection, presents a unique challenge in aligning visual and textual representations, particularly at the human part level. Existing methods often struggle with part feature extraction and alignment due to the lack of direct part-level supervision and reliance on heuristic features. We propose a novel framework that leverages a part discovery module based on slot attention to autonomously identify and align distinctive parts across modalities, enhancing interpretability and retrieval accuracy without explicit part-level correspondence supervision. Additionally, text-based dynamic part attention adjusts the importance of each part, further improving retrieval outcomes. Our method is evaluated on three public benchmarks, significantly outperforming existing methods.

Jicheol Park, Dongwon Kim, Boseung Jeong, Suha Kwak• 2024

Related benchmarks

TaskDatasetResultRank
Text-to-image Person Re-identificationCUHK-PEDES (test)
Rank-1 Accuracy (R-1)75.28
150
Text-based Person SearchCUHK-PEDES (test)
Rank-175.28
142
Text-based Person SearchICFG-PEDES (test)
R@165.76
104
Text-based Person SearchRSTPReid (test)
R@161.8
85
Text-to-image Person Re-identificationICFG-PEDES 58 (test)
Rank-165.76
15
Text-to-image Person Re-identificationRSTPReid 59 (test)
Rank-1 Recall61.8
12
Showing 6 of 6 rows

Other info

Follow for update