Graph R-CNN for Scene Graph Generation
About
We propose a novel scene graph generation model called Graph R-CNN, that is both effective and efficient at detecting objects and their relations in images. Our model contains a Relation Proposal Network (RePN) that efficiently deals with the quadratic number of potential relations between objects in an image. We also propose an attentional Graph Convolutional Network (aGCN) that effectively captures contextual information between objects and relations. Finally, we introduce a new evaluation metric that is more holistic and realistic than existing metrics. We report state-of-the-art performance on scene graph generation as evaluated using both existing and our proposed metrics.
Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh• 2018
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Scene Graph Generation | Visual Genome (test) | R@5011.4 | 86 | |
| Scene Graph Generation | Open Images v6 (test) | wmAPrel33.51 | 74 | |
| Scene Graph Classification | Visual Genome (test) | Recall@10037 | 63 | |
| Predicate Classification | Visual Genome | Recall@5065.4 | 54 | |
| Predicate Classification | Visual Genome (test) | R@5054.2 | 50 | |
| Predicate Classification | Visual Genome (VG) 150 object categories, 50 relationship categories (test) | -- | 44 | |
| Story Ending Generation | ROCStories (test) | BLEU-117.6 | 43 | |
| Scene Graph Detection | Visual Genome (VG) (test) | mR@505.8 | 29 | |
| Predicate Classification | Visual Genome 1.0 (test) | R@10059.1 | 22 | |
| Scene Graph Detection (SGDet) | Visual Genome (VG) | R@5029.7 | 21 |
Showing 10 of 17 rows