Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Discovering Human-Object Interaction Concepts via Self-Compositional Learning

About

A comprehensive understanding of human-object interaction (HOI) requires detecting not only a small portion of predefined HOI concepts (or categories) but also other reasonable HOI concepts, while current approaches usually fail to explore a huge portion of unknown HOI concepts (i.e., unknown but reasonable combinations of verbs and objects). In this paper, 1) we introduce a novel and challenging task for a comprehensive HOI understanding, which is termed as HOI Concept Discovery; and 2) we devise a self-compositional learning framework (or SCL) for HOI concept discovery. Specifically, we maintain an online updated concept confidence matrix during training: 1) we assign pseudo-labels for all composite HOI instances according to the concept confidence matrix for self-training; and 2) we update the concept confidence matrix using the predictions of all composite HOI instances. Therefore, the proposed method enables the learning on both known and unknown HOI concepts. We perform extensive experiments on several popular HOI datasets to demonstrate the effectiveness of the proposed method for HOI concept discovery, object affordance recognition and HOI detection. For example, the proposed self-compositional learning framework significantly improves the performance of 1) HOI concept discovery by over 10% on HICO-DET and over 3% on V-COCO, respectively; 2) object affordance recognition by over 9% mAP on MS-COCO and HICO-DET; and 3) rare-first and non-rare-first unknown HOI detection relatively over 30% and 20%, respectively. Code is publicly available at https://github.com/zhihou7/HOI-CL.

Zhi Hou, Baosheng Yu, Dacheng Tao• 2022

Related benchmarks

TaskDatasetResultRank
Human-Object Interaction DetectionHICO-DET (test)
mAP (full)29.75
493
Human-Object Interaction DetectionV-COCO (test)--
270
Human-Object Interaction DetectionHICO-DET Zero-Shot
mAP (Default Unseen)17.19
33
Human-Object Interaction DetectionHICO-DET Rare First v1.0 (test)
Unseen mAP19.07
27
Object affordance recognitionCOCO 2017 (val)
mAP72.08
25
Human-Object Interaction DetectionHICO-DET Non-rare First v1.0 (test)
mAP (Unseen)21.73
22
Object affordance recognitionObject365 COCO-labels v1 (val)
mAP57.53
13
Object affordance recognitionHICO-DET v1 (val)
mAP82.47
13
Object affordance recognitionObject365 Novel classes v1 (val)
mAP18.55
13
HOI concept discoveryHICO-DET
AP (Unknown)33.58
7
Showing 10 of 14 rows

Other info

Code

Follow for update