Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

End-to-End Human Object Interaction Detection with HOI Transformer

About

We propose HOI Transformer to tackle human object interaction (HOI) detection in an end-to-end manner. Current approaches either decouple HOI task into separated stages of object detection and interaction classification or introduce surrogate interaction problem. In contrast, our method, named HOI Transformer, streamlines the HOI pipeline by eliminating the need for many hand-designed components. HOI Transformer reasons about the relations of objects and humans from global image context and directly predicts HOI instances in parallel. A quintuple matching loss is introduced to force HOI predictions in a unified way. Our method is conceptually much simpler and demonstrates improved accuracy. Without bells and whistles, HOI Transformer achieves $26.61\% $ $ AP $ on HICO-DET and $52.9\%$ $AP_{role}$ on V-COCO, surpassing previous methods with the advantage of being much simpler. We hope our approach will serve as a simple and effective alternative for HOI tasks. Code is available at https://github.com/bbepoch/HoiTransformer .

Cheng Zou, Bohan Wang, Yue Hu, Junqi Liu, Qian Wu, Yu Zhao, Boxun Li, Chenguang Zhang, Chi Zhang, Yichen Wei, Jian Sun• 2021

Related benchmarks

TaskDatasetResultRank
Human-Object Interaction DetectionHICO-DET (test)
mAP (full)26.61
493
Human-Object Interaction DetectionV-COCO (test)
AP (Role, Scenario 1)52.9
270
Human-Object Interaction DetectionHICO-DET
mAP (Full)26.6
233
Human-Object Interaction DetectionHICO-DET Known Object (test)
mAP (Full)29.13
112
Human-Object Interaction DetectionV-COCO 1.0 (test)
AP_role (#1)52.9
76
Human-Object Interaction DetectionHICO-DET 1 (test)
Full mAP29.13
33
Human-Object Interaction DetectionV-COCO
Box mAP (Scenario 1)52.9
32
HOI DetectionHICO-DET v1.0 (test)
mAP (Default, Full)26.61
29
Human-Object Interaction DetectionV-COCO standard (test)
AP (Role 1)52.9
18
Few-shot classificationBongard-HOI (test)
Accuracy (Unseen Act / Unseen Obj)62.87
12
Showing 10 of 11 rows

Other info

Code

Follow for update