Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Native Hierarchical and Compositional Representations with Subspace Embeddings

About

Traditional embeddings represent datapoints as vectors, which makes similarity easy to compute but limits how well they capture hierarchies and compositionality. We propose a fundamentally different approach: representing concepts as linear subspaces. By spanning multiple dimensions, subspaces can model broader concepts with higher-dimensional regions and nest more specific concepts within them. This geometry naturally captures generality through dimension, hierarchy through inclusion, and enables an emergent structure for composition via linear algebraic operations. To make this paradigm trainable, we introduce a differentiable subspace parameterization via soft projection matrices, allowing the effective dimension of each subspace to be learned. Our method not only achieves state-of-the-art performance on hierarchical and natural language inference benchmarks but also provides a geometrically-grounded model of entailment. Further, we demonstrate that while standard vector embeddings degrade to near-random performance on negated queries, subspace embeddings natively capture logical composition without explicit supervision, while preserving compatibility with efficient Euclidean vector search.

Gabriel Moreira, Zita Marinho, Manuel Marques, Jo\~ao Paulo Costeira, Chenyan Xiong• 2025

Related benchmarks

TaskDatasetResultRank
Lexical EntailmentHYPERLEX noun part
Spearman's Rho0.683
12
EntailmentCompositional Entailment Conjunction
ROC AUC96.55
10
EntailmentCompositional Entailment Negation
ROC AUC95.76
8
Link PredictionWordNet Noun Hierarchy
F1 Score (0% Coverage)53.4
8
Hierarchical ReconstructionWordNet Nouns full transitive closure (test)
mAP98.6
5
Hierarchical ReconstructionWordNet Verbs full transitive closure (test)
mAP99.9
5
Showing 6 of 6 rows

Other info

Follow for update