Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Attention-based Iterative Decomposition for Tensor Product Representation

About

In recent research, Tensor Product Representation (TPR) is applied for the systematic generalization task of deep neural networks by learning the compositional structure of data. However, such prior works show limited performance in discovering and representing the symbolic structure from unseen test data because their decomposition to the structural representations was incomplete. In this work, we propose an Attention-based Iterative Decomposition (AID) module designed to enhance the decomposition operations for the structured representations encoded from the sequential input data with TPR. Our AID can be easily adapted to any TPR-based model and provides enhanced systematic decomposition through a competitive attention mechanism between input features and structured representations. In our experiments, AID shows effectiveness by significantly improving the performance of TPR-based prior works on the series of systematic generalization tasks. Moreover, in the quantitative and qualitative evaluations, AID produces more compositional and well-bound structural representations than other works.

Taewon Park, Inchul Choi, Minho Lee• 2024

Related benchmarks

TaskDatasetResultRank
Language ModelingWikiText-103 (test)
Perplexity37.151
524
Language ModelingWikiText-103 (val)
PPL36.159
180
sys-bAbI tasksys-bAbI original (test)
Gap7.95
22
Relational Reasoningsort-of-CLEVR
Unary Accuracy98.9
8
Visual Relational Reasoningsort-of-CLEVR (test)
Unary Accuracy98.9
6
Showing 5 of 5 rows

Other info

Follow for update