
Video-based Person Re-identification with Spatial and Temporal Memory Networks

About

Video-based person re-identification (reID) aims to retrieve person videos with the same identity as a query person across multiple cameras. Spatial and temporal distractors in person videos, such as background clutter and partial occlusions over frames, respectively, make this task much more challenging than image-based person reID. We observe that spatial distractors appear consistently in a particular location, and that temporal distractors show several patterns, e.g., partial occlusions occur in the first few frames; such patterns provide informative cues for predicting which frames to focus on (i.e., temporal attentions). Based on this, we introduce novel Spatial and Temporal Memory Networks (STMN). The spatial memory stores features for spatial distractors that frequently emerge across video frames, while the temporal memory saves attentions that are optimized for typical temporal patterns in person videos. We leverage the spatial memory to refine frame-level person representations and the temporal memory to aggregate the refined frame-level features into a sequence-level person representation, effectively handling spatial and temporal distractors in person videos. We also introduce a memory spread loss that prevents our model from addressing only particular items in the memories. Experimental results on standard benchmarks, including MARS, DukeMTMC-VideoReID, and LS-VID, demonstrate the effectiveness of our method.
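The two memory roles described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the key-value layout, the dot-product addressing, the subtraction used for refinement, and all names (`read_memory`, `refine_frame`, `aggregate_frames`, `query`) are assumptions made only for illustration, since the abstract does not specify these details.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def read_memory(query, keys, values):
    """Soft-address a key-value memory (assumed layout).

    Each stored value is weighted by how well its key matches the query.
    Returns the weighted read-out and the addressing weights.
    """
    weights = softmax([dot(query, k) for k in keys])
    dim = len(values[0])
    read = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return read, weights


def refine_frame(frame_feat, keys, distractor_values):
    """Assumed refinement: subtract the retrieved distractor feature."""
    distractor, _ = read_memory(frame_feat, keys, distractor_values)
    return [f - d for f, d in zip(frame_feat, distractor)]


def aggregate_frames(frame_feats, query, keys, attn_values):
    """Pool T frame features into one sequence-level feature.

    Each temporal memory item is assumed to hold T attention logits,
    one per frame; the read-out is normalized to sum to 1 over frames.
    """
    attn_logits, weights = read_memory(query, keys, attn_values)
    attn = softmax(attn_logits)
    dim = len(frame_feats[0])
    pooled = [sum(a * f[i] for a, f in zip(attn, frame_feats)) for i in range(dim)]
    return pooled, attn, weights
```

In this sketch, a temporal memory whose items all hold identical logits reduces `aggregate_frames` to plain average pooling, so the stored attention patterns are what let some frames count more than others. The returned addressing weights are the kind of quantity a loss such as the memory spread loss would act on, although its form is not given in the abstract.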

Chanho Eom, Geon Lee, Junghyup Lee, Bumsub Ham • 2021

Related benchmarks

Task | Dataset | Result | Rank
Video Person Re-ID | MARS | Rank-1 Acc 90.5 | 106
Video Person Re-Identification | G2A-VReID Ground to Aerial | mAP 66.7 | 25
Video Person Re-Identification | AG-VPReID Aerial to Ground | mAP 61.6 | 20
Video-based Person Re-identification | DukeV | R1 97 | 15
Person Identification | NTU RGB-AB View+ (same-activity) | Rank-1 Acc 72.98 | 15
Person Identification | Charades-AB (same-activity) | Rank 1 38.72 | 15
Person Identification | ACC-MM1-Activities (same-activity) | Rank-1 Acc 59.44 | 15
Person Identification | PKU MMD-AB View+ (same-activity) | Rank-1 Acc 76.55 | 15
Video Person Re-Identification | LS-VID Ground to Ground | mAP 69.2 | 14
Video-based Person Re-identification | LS-VID | Rank-1 Acc 82.1 | 10
