Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Detecting Visual Relationships with Deep Relational Networks

About

Relationships among objects play a crucial role in image understanding. Despite the great success of deep learning techniques in recognizing individual objects, reasoning about the relationships among objects remains a challenging task. Previous methods often treat this as a classification problem, considering each type of relationship (e.g. "ride") or each distinct visual phrase (e.g. "person-ride-horse") as a category. Such approaches are faced with significant difficulties caused by the high diversity of visual appearance for each kind of relationships or the large number of distinct visual phrases. We propose an integrated framework to tackle this problem. At the heart of this framework is the Deep Relational Network, a novel formulation designed specifically for exploiting the statistical dependencies between objects and their relationships. On two large datasets, the proposed method achieves substantial improvement over state-of-the-art.

Bo Dai, Yuqi Zhang, Dahua Lin• 2017

Related benchmarks

TaskDatasetResultRank
Relation DetectionVRD (test)
R@5017.73
75
Phrase DetectionVRD (test)
R@5019.93
36
Relationship Phrase DetectionVRD
Recall@5080.78
20
Relationship Phrase DetectionVRD (test)
Recall@10081.9
17
Union Box DetectionVRD
Recall@500.1993
9
Scene Graph GenerationVRD (test)
Recall@5017.73
6
Visual Phrase DetectionVRD (test)
Recall@5019.93
6
Scene Graph GenerationsVG-a (test)
Average Similarity0.2271
5
Union Box DetectionsVG
Recall@5023.95
5
Phrase DetectionVisual Genome (other split)
R@5023.95
4
Showing 10 of 16 rows

Other info

Code

Follow for update