
ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection

About

We consider the problem of Human-Object Interaction (HOI) Detection, which aims to locate and recognize HOI instances in the form of <human, action, object> in images. Most existing works treat HOIs as individual interaction categories and thus cannot handle the long-tail distribution and polysemy of action labels. We argue that multi-level consistencies among objects, actions and interactions are strong cues for generating semantic representations of rare or previously unseen HOIs. Leveraging the compositional and relational peculiarities of HOI labels, we propose ConsNet, a knowledge-aware framework that explicitly encodes the relations among objects, actions and interactions into an undirected graph called a consistency graph, and exploits Graph Attention Networks (GATs) to propagate knowledge among HOI categories as well as their constituents. Our model takes visual features of candidate human-object pairs and word embeddings of HOI labels as inputs, maps them into a visual-semantic joint embedding space, and obtains detection results by measuring their similarities. We extensively evaluate our model on the challenging V-COCO and HICO-DET datasets, and the results validate that our approach outperforms state-of-the-art methods under both fully-supervised and zero-shot settings. Code is available at https://github.com/yeliudev/ConsNet.
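The two ideas in the abstract, an undirected graph linking HOI labels to their constituents and detection by similarity in a joint embedding space, can be illustrated with a minimal sketch. This is not the ConsNet implementation: the label sets, the function names (`build_consistency_graph`, `rank_interactions`), and the 3-dimensional embeddings are all made up for illustration, the graph contains only compositional edges (interaction to its action and object), cosine similarity is assumed as the similarity measure, and the GAT propagation step is omitted entirely.

```python
import math
from itertools import product

# Hypothetical toy label sets; real datasets have many more categories.
objects = ["bicycle", "horse"]
actions = ["ride", "wash"]
# Each interaction is an (action, object) pair.
interactions = list(product(actions, objects))


def build_consistency_graph(objects, actions, interactions):
    """Return an undirected adjacency dict that links every interaction
    node to its constituent action node and object node."""
    nodes = (
        [("obj", o) for o in objects]
        + [("act", a) for a in actions]
        + [("hoi", i) for i in interactions]
    )
    adj = {node: set() for node in nodes}
    for act, obj in interactions:
        hoi = ("hoi", (act, obj))
        for other in (("act", act), ("obj", obj)):
            adj[hoi].add(other)
            adj[other].add(hoi)  # undirected: store both directions
    return adj


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank_interactions(visual_embedding, label_embeddings):
    """Score one human-object pair against every HOI label and
    return (label, score) pairs sorted from best to worst."""
    scores = {
        label: cosine(visual_embedding, emb)
        for label, emb in label_embeddings.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


graph = build_consistency_graph(objects, actions, interactions)

# Made-up vectors, assumed to be already projected into the joint space.
label_embeddings = {
    ("ride", "bicycle"): [1.0, 0.0, 0.0],
    ("ride", "horse"): [0.0, 1.0, 0.0],
    ("wash", "bicycle"): [0.0, 0.0, 1.0],
    ("wash", "horse"): [0.0, 1.0, 1.0],
}
visual_embedding = [0.9, 0.1, 0.0]  # hypothetical human-object pair feature

ranking = rank_interactions(visual_embedding, label_embeddings)
best_label, best_score = ranking[0]
```

Because every label has an embedding regardless of whether training images exist for it, this kind of scoring extends to unseen HOI categories, which is what makes the zero-shot setting possible in principle.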

Ye Liu, Junsong Yuan, Chang Wen Chen • 2020

Related benchmarks

Task | Dataset | Metric | Result | Rank
Human-Object Interaction Detection | HICO-DET (test) | mAP (full) | 24.39 | 493
Human-Object Interaction Detection | V-COCO (test) | AP (Role, Scenario 1) | 53.2 | 270
Human-Object Interaction Detection | HICO-DET | mAP (Full) | 24.39 | 233
Human-Object Interaction Detection | V-COCO 1.0 (test) | AP_role (#1) | 53.2 | 76
HOI Detection | HICO-DET (test) | Box mAP (Full) | 25.94 | 32
Visual Grounding | Who's Waldo (test) | Accuracy | 46.4 | 31
Human-Object Interaction Detection | V-COCO | AP (Role) | 53.2 | 23
HOI Detection | HICO-DET Unseen Combination | Unseen mAP | 16.99 | 19
HOI Detection | HICO-Det Unseen Object | Unseen Performance | 19.27 | 8
