Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Modality Alignment across Trees on Heterogeneous Hyperbolic Manifolds

About

Modality alignment is critical for vision-language models (VLMs) to effectively integrate information across modalities. However, existing methods extract hierarchical features from text while representing each image with a single feature, leading to asymmetric and suboptimal alignment. To address this, we propose Alignment across Trees, a method that constructs and aligns tree-like hierarchical features for both image and text modalities. Specifically, we introduce a semantic-aware visual feature extraction framework that applies a cross-attention mechanism to visual class tokens from intermediate Transformer layers, guided by textual cues to extract visual features with coarse-to-fine semantics. We then embed the feature trees of the two modalities into hyperbolic manifolds with distinct curvatures to effectively model their hierarchical structures. To align across the heterogeneous hyperbolic manifolds with different curvatures, we formulate a KL distance measure between distributions on heterogeneous manifolds, and learn an intermediate manifold for manifold alignment by minimizing the distance. We prove the existence and uniqueness of the optimal intermediate manifold. Experiments on taxonomic open-set classification tasks across multiple image datasets demonstrate that our method consistently outperforms strong baselines under few-shot and cross-domain settings.

Wei Wu, Xiaomeng Fan, Yuwei Wu, Zhi Gao, Pengxiang Li, Yunde Jia, Mehrtash Harandi• 2025

Related benchmarks

TaskDatasetResultRank
TOS classificationCIFAR100
Leaf Accuracy85.58
41
TOS classificationSUN
Leaf Accuracy76.54
29
Hierarchical classificationSUN (base)
Leaf Accuracy (LA)83.4
18
Hierarchical classificationCifar100 (base)
LA85.58
18
TOS classificationImageNet
Leaf Accuracy71.63
17
Taxonomic Open Set ClassificationImageNet
Leaf Accuracy71.67
15
Hierarchical classificationSUN novel
LA Score78.72
12
Taxonomic Open Set ClassificationRare Species
LA69.96
12
Hierarchical classificationSUN
Leaf Accuracy72.04
6
TOS classificationSUN HM
LA80.99
6
Showing 10 of 19 rows

Other info

Follow for update