Pose-aware Multi-level Feature Network for Human Object Interaction Detection

About

Reasoning about human-object interactions is a core problem in human-centric scene understanding, and detecting such relations poses a unique challenge to vision systems due to large variations in human-object configurations, multiple co-occurring relation instances and subtle visual differences between relation categories. To address these challenges, we propose a multi-level relation detection strategy that uses human pose cues both to capture the global spatial configuration of a relation and as an attention mechanism to dynamically zoom into relevant regions at the human part level. Specifically, we develop a multi-branch deep network to learn a pose-augmented relation representation at three semantic levels, incorporating interaction context, object features and detailed semantic part cues. As a result, our approach is capable of generating robust predictions on fine-grained human-object interactions with interpretable outputs. Extensive experimental evaluations on public benchmarks show that our model outperforms prior methods by a considerable margin, demonstrating its efficacy in handling complex scenes.
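To make the idea of pose-guided part attention more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the function names, the use of a softmax over pose-derived scores, the toy feature vectors and the fusion by simple concatenation are all assumptions made for illustration. It only shows the general pattern the abstract describes, in which part-level features are reweighted by pose-derived attention and then combined with interaction-level and object-level features.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def pose_attended_parts(part_features, pose_scores):
    """Reweight each human-part feature vector by an attention weight.

    part_features: list of K feature vectors, one per body part (hypothetical).
    pose_scores:   list of K relevance scores assumed to come from pose cues.
    """
    weights = softmax(pose_scores)
    return [[w * x for x in feat] for w, feat in zip(weights, part_features)]


def fuse_levels(interaction_feat, object_feat, attended_parts):
    """Concatenate the three levels into one relation representation.

    Plain concatenation is an assumption here; the abstract does not
    specify how the branches are combined.
    """
    flat_parts = [x for feat in attended_parts for x in feat]
    return interaction_feat + object_feat + flat_parts


# Toy inputs: three body parts with 2-D features; the third part is
# assumed to be the most relevant one according to the pose cue.
part_features = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
pose_scores = [0.0, 0.0, 2.0]

attended = pose_attended_parts(part_features, pose_scores)
relation = fuse_levels([0.5, 0.5], [0.1], attended)
```

In this toy setup the third part receives the largest attention weight, so its features dominate the part-level slice of the fused vector, which is one simple way such a representation could stay interpretable.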

Bo Wan, Desen Zhou, Yongfei Liu, Rongjie Li, Xuming He • 2019

Related benchmarks

Task | Dataset | Metric | Result | Rank
Human-Object Interaction Detection | HICO-DET (test) | mAP (full) | 17.46 | 493
Human-Object Interaction Detection | V-COCO (test) | AP (Role, Scenario 1) | 52 | 270
Human-Object Interaction Detection | HICO-DET | mAP (Full) | 17.46 | 233
Human-Object Interaction Detection | HICO-DET Known Object (test) | mAP (Full) | 20.34 | 112
Human-Object Interaction Detection | HICO-DET 1 (test) | Full mAP | 20.34 | 33
Human-Object Interaction Detection | V-COCO | Box mAP (Scenario 1) | 52 | 32
HOI Detection | HICO-DET (test) | Box mAP (Full) | 17.46 | 32
Human-Object Interaction Detection | V-COCO | AP (Role) | 52 | 23
Human-Object Interaction Detection | HICO-DET 9 (test) | mAP (Full) | 20.34 | 21
Human-Object Interaction Detection | HOI-VP | mAP | 62.3 | 11
Showing 10 of 12 rows
