The Scattering Compositional Learner: Discovering Objects, Attributes, Relationships in Analogical Reasoning
About
In this work, we focus on an analogical reasoning task that contains rich compositional structures, Raven's Progressive Matrices (RPM). To discover compositional structures of the data, we propose the Scattering Compositional Learner (SCL), an architecture that composes neural networks in a sequence. Our SCL achieves state-of-the-art performance on two RPM datasets, with a 48.7% relative improvement on Balanced-RAVEN and 26.4% on PGM over the previous state-of-the-art. We additionally show that our model discovers compositional representations of objects' attributes (e.g., shape color, size), and their relationships (e.g., progression, union). We also find that the compositional representation makes the SCL significantly more robust to test-time domain shifts and greatly improves zero-shot generalization to previously unseen analogies.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Abstract Visual Reasoning | SVRT reformulated four-choice (test) | Accuracy68.2 | 28 | |
| Compositional Visual Reasoning | CVR | Accuracy (Joint)78.9 | 16 | |
| Abstract Visual Reasoning | MC2R (train) | Accuracy12.5 | 12 | |
| Abstract Visual Reasoning | MC2R 10,000 samples (train) | Accuracy30.4 | 12 | |
| Abstract Visual Reasoning | MC2R 20 samples (train) | Accuracy10.1 | 12 | |
| Abstract Visual Reasoning | MC2R 50 samples (train) | Accuracy10 | 12 | |
| Abstract Visual Reasoning | MC2R (100 train samples) | Accuracy10.1 | 12 | |
| Abstract Visual Reasoning | MC2R 200 (train) | Accuracy10.2 | 12 | |
| Abstract Visual Reasoning | MC2R 500 samples (train) | Accuracy0.105 | 12 | |
| Relational Reasoning | RAVEN (test) | Average Accuracy91.6 | 5 |