Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation

About

To generate "accurate" scene graphs, almost all existing methods predict pairwise relationships in a deterministic manner. However, we argue that visual relationships are often semantically ambiguous. Specifically, inspired by linguistic knowledge, we classify the ambiguity into three types: Synonymy Ambiguity, Hyponymy Ambiguity, and Multi-view Ambiguity. The ambiguity naturally leads to the issue of \emph{implicit multi-label}, motivating the need for diverse predictions. In this work, we propose a novel plug-and-play Probabilistic Uncertainty Modeling (PUM) module. It models each union region as a Gaussian distribution, whose variance measures the uncertainty of the corresponding visual content. Compared to the conventional deterministic methods, such uncertainty modeling brings stochasticity of feature representation, which naturally enables diverse predictions. As a byproduct, PUM also manages to cover more fine-grained relationships and thus alleviates the issue of bias towards frequent relationships. Extensive experiments on the large-scale Visual Genome benchmark show that combining PUM with newly proposed ResCAGCN can achieve state-of-the-art performances, especially under the mean recall metric. Furthermore, we prove the universal effectiveness of PUM by plugging it into some existing models and provide insightful analysis of its ability to generate diverse yet plausible visual relationships.

Gengcong Yang, Jingyi Zhang, Yong Zhang, Baoyuan Wu, Yujiu Yang• 2021

Related benchmarks

TaskDatasetResultRank
Predicate ClassificationVisual Genome
Recall@5066.6
54
Predicate ClassificationVisual Genome 1.0 (test)
R@10068.3
22
Scene Graph Detection (SGDet)Visual Genome (VG)
R@5028.1
21
Scene Graph Classification (SGCls)Visual Genome
R@10039
19
Predicate ClassificationVisual Genome v1.4 (test)
mR@5020.2
12
Scene Graph ClassificationVisual Genome 1.0 (test)
Recall@10039
12
Scene Graph ClassificationVisual Genome v1.4 (test)
mR@500.119
12
Scene Graph DetectionVisual Genome v1.4 (test)
mR@507.9
12
Scene Graph GenerationVisual Genome v1.4 (test)
Mean Score13.9
12
Scene Graph DetectionVisual Genome v1.0 (test)
R@10031.3
12
Showing 10 of 10 rows

Other info

Follow for update