Language-Conditioned Graph Networks for Relational Reasoning

About

Solving grounded language tasks often requires reasoning about relationships between objects in the context of a given task. For example, to answer the question "What color is the mug on the plate?" we must check the color of the specific mug that satisfies the "on" relationship with respect to the plate. Recent work has proposed various methods capable of complex relational reasoning. However, most of their power is in the inference structure, while the scene is represented with simple local appearance features. In this paper, we take an alternate approach and build contextualized representations for objects in a visual scene to support relational reasoning. We propose a general framework of Language-Conditioned Graph Networks (LCGN), where each node represents an object, and is described by a context-aware representation from related objects through iterative message passing conditioned on the textual input. E.g., conditioning on the "on" relationship to the plate, the object "mug" gathers messages from the object "plate" to update its representation to "mug on the plate", which can be easily consumed by a simple classifier for answer prediction. We experimentally show that our LCGN approach effectively supports relational reasoning and improves performance across several tasks and datasets. Our code is available at http://ronghanghu.com/lcgn.
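The iterative, text-conditioned message passing described above can be illustrated with a small NumPy sketch. This is not the authors' implementation (see the code link in the abstract for that); it is a simplified illustration under several assumptions: a single text vector `c` is reused for every round, edge weights come from a softmax over dot products, and all weight matrices are random. The names `message_passing_round`, `run_sketch`, and the parameter shapes are hypothetical.

```python
import numpy as np


def softmax(z, axis=-1):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def message_passing_round(x, c, params):
    """One round of text-conditioned message passing (illustrative only).

    x: (N, d) object (node) features
    c: (d,) text vector that modulates which messages are sent
    params: tuple of hypothetical weight matrices (W_q, W_k, W_v, W_u)
    """
    W_q, W_k, W_v, W_u = params
    q = x @ W_q                # queries from receiving nodes, (N, d)
    k = (x @ W_k) * c          # keys from sending nodes, modulated by text
    v = (x @ W_v) * c          # message contents, modulated by text
    # scores[i, j]: how strongly node i attends to a message from node j
    scores = q @ k.T / np.sqrt(x.shape[1])
    w = softmax(scores, axis=1)
    m = w @ v                  # aggregated incoming messages, (N, d)
    # Update each node from its own features plus gathered context
    return np.tanh(np.concatenate([x, m], axis=1) @ W_u)


def run_sketch(num_objects=4, dim=8, rounds=3, seed=0):
    """Run a few rounds on random features; returns (num_objects, dim)."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=(num_objects, dim))
    c = rng.normal(size=dim)
    params = (
        rng.normal(size=(dim, dim)) * 0.1,      # W_q (assumed shape)
        rng.normal(size=(dim, dim)) * 0.1,      # W_k (assumed shape)
        rng.normal(size=(dim, dim)) * 0.1,      # W_v (assumed shape)
        rng.normal(size=(2 * dim, dim)) * 0.1,  # W_u (assumed shape)
    )
    for _ in range(rounds):
        x = message_passing_round(x, c, params)
    return x


if __name__ == "__main__":
    print(run_sketch().shape)
```

After the final round, each row of the returned array plays the role of a context-aware object representation (for example "mug on the plate") that a simple classifier could consume.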

Ronghang Hu, Anna Rohrbach, Trevor Darrell, Kate Saenko • 2019

Related benchmarks

Task | Dataset | Result | Rank
Visual Question Answering | GQA | Accuracy 55.8 | 963
Visual Question Answering | GQA (test-dev) | Accuracy 55.8 | 178
Visual Question Answering | GQA (test) | Accuracy 56.1 | 119
Visual Question Answering | CLEVR (test) | Overall Accuracy 97.9 | 61
Video Question Answering | STAR (test) | Interaction Score 39.01 | 42
Visual Question Answering | GQA balanced (test-dev) | Accuracy 55.8 | 32
Visual Question Answering | GQA (val) | Accuracy 63.9 | 22
Referring Expression Comprehension | CLEVR-Ref+ | Accuracy 74.8 | 7
