Native Hierarchical and Compositional Representations with Subspace Embeddings

About

Traditional embeddings represent datapoints as vectors, which makes similarity easy to compute but limits how well they capture hierarchies and compositionality. We propose a fundamentally different approach: representing concepts as linear subspaces. By spanning multiple dimensions, subspaces can model broader concepts with higher-dimensional regions and nest more specific concepts within them. This geometry naturally captures generality through dimension, hierarchy through inclusion, and enables an emergent structure for composition via linear algebraic operations. To make this paradigm trainable, we introduce a differentiable subspace parameterization via soft projection matrices, allowing the effective dimension of each subspace to be learned. Our method not only achieves state-of-the-art performance on hierarchical and natural language inference benchmarks but also provides a geometrically-grounded model of entailment. Further, we demonstrate that while standard vector embeddings degrade to near-random performance on negated queries, subspace embeddings natively capture logical composition without explicit supervision, while preserving compatibility with efficient Euclidean vector search.

Gabriel Moreira, Zita Marinho, Manuel Marques, Jo\~ao Paulo Costeira, Chenyan Xiong• 2025

Related benchmarks

Task	Dataset	Result
Lexical Entailment	HYPERLEX noun part	Spearman's Rho0.683	12
Entailment	Compositional Entailment Conjunction	ROC AUC96.55	10
Entailment	Compositional Entailment Negation	ROC AUC95.76	8
Link Prediction	WordNet Noun Hierarchy	F1 Score (0% Coverage)53.4	8
Hierarchical Reconstruction	WordNet Nouns full transitive closure (test)	mAP98.6	5
Hierarchical Reconstruction	WordNet Verbs full transitive closure (test)	mAP99.9	5

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord