NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization

About

Compositional generalization is crucial for artificial intelligence agents to solve complex vision-language reasoning tasks. Neuro-symbolic approaches have demonstrated promise in capturing compositional structures, but they face critical challenges: (a) reliance on predefined predicates for symbolic representations that limit adaptability, (b) difficulty in extracting predicates from raw data, and (c) using non-differentiable operations for combining primitive concepts. To address these issues, we propose NeSyCoCo, a neuro-symbolic framework that leverages large language models (LLMs) to generate symbolic representations and map them to differentiable neural computations. NeSyCoCo introduces three innovations: (a) augmenting natural language inputs with dependency structures to enhance the alignment with symbolic representations, (b) employing distributed word representations to link diverse, linguistically motivated logical predicates to neural modules, and (c) using the soft composition of normalized predicate scores to align symbolic and differentiable reasoning. Our framework achieves state-of-the-art results on the ReaSCAN and CLEVR-CoGenT compositional generalization benchmarks and demonstrates robust performance with novel concepts in the CLEVR-SYN benchmark.

Danial Kamali, Elham J. Barezi, Parisa Kordjamshidi• 2024

Related benchmarks

Task	Dataset	Result
Abstract Reasoning	CLEVR-RPM (test)	Accuracy100	7
Multi-step Reasoning	CLEVR-Puzzle (test)	Accuracy95	7
Object Localization	ReaSCAN (test)	Success Rate A199.5	6
Referring Expressions	CLEVR-Ref (test)	Accuracy100	5
Visual Question Answering	CLEVR 1.0 (val)	Accuracy99.7	4
Vision-Language Reasoning	CLEVR CoGenT (Split B)	Accuracy78.8	3
Vision-Language Reasoning	CLEVR-CoGenT (Split A)	Accuracy99.6	3
Vision-Language Reasoning	CLEVR-SYN easy (val)	Accuracy92.1	2
Vision-Language Reasoning	CLEVR-SYN medium (val)	Accuracy81.2	2
Vision-Language Reasoning	CLEVR-SYN hard (val)	Accuracy73.4	2

Showing 10 of 10 rows

Other info

Code

Follow for update

@wizwand_team Discord