
PSQE: A Theoretical-Practical Approach to Pseudo Seed Quality Enhancement for Unsupervised Multimodal Entity Alignment

About

Multimodal Entity Alignment (MMEA) aims to identify equivalent entities across different data modalities, enabling structural data integration that in turn improves the performance of various large language model applications. To remove the need for labeled seed pairs, which are difficult to obtain, recent methods have shifted to an unsupervised paradigm that uses pseudo-alignment seeds. However, unsupervised entity alignment in multimodal settings remains underexplored, mainly because incorporating multimodal information often results in imbalanced coverage of pseudo seeds within the knowledge graph. To overcome this, we propose PSQE (Pseudo-Seed Quality Enhancement), which improves the precision and graph-coverage balance of pseudo seeds via multimodal information and clustering-resampling. Our theoretical analysis reveals how pseudo seeds affect existing contrastive learning-based MMEA models. In particular, pseudo seeds can influence the attraction and repulsion terms of the contrastive loss simultaneously, whereas imbalanced graph coverage causes models to prioritize high-density regions, thereby weakening their ability to learn entities in sparse regions. Experimental results validate our theoretical findings and show that PSQE, as a plug-and-play module, can improve the performance of baselines by considerable margins.
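
As a reading aid, the attraction and repulsion terms mentioned in the abstract can be made explicit with a standard InfoNCE-style loss. This is the generic textbook form, not necessarily the exact objective analyzed in the paper. The symbols $\mathcal{S}$ (pseudo-seed set), $e_i$ and $e'_j$ (entity embeddings in the two graphs), $\mathrm{sim}$ (similarity function), and $\tau$ (temperature) are assumptions introduced here for illustration only.

```latex
\mathcal{L}
= -\frac{1}{|\mathcal{S}|} \sum_{(i,j)\in\mathcal{S}}
  \log \frac{\exp\big(\mathrm{sim}(e_i, e'_j)/\tau\big)}
            {\sum_{k} \exp\big(\mathrm{sim}(e_i, e'_k)/\tau\big)}
= \frac{1}{|\mathcal{S}|} \sum_{(i,j)\in\mathcal{S}}
  \Big[
    \underbrace{-\,\mathrm{sim}(e_i, e'_j)/\tau}_{\text{attraction}}
    + \underbrace{\log \sum_{k} \exp\big(\mathrm{sim}(e_i, e'_k)/\tau\big)}_{\text{repulsion}}
  \Big]
```

Under this generic form, an incorrect pseudo seed $(i,j)$ pulls a non-equivalent pair together through the first term, and the seed set also decides which anchors $i$ contribute a repulsion term at all. That is one way to see why seed quality and seed coverage can both matter.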

Yunpeng Hong, Chenyang Bu, Jie Zhang, Yi He, Di Wu, Xindong Wu • 2026

Related benchmarks

Task                          Dataset        Result         Rank
Multimodal Entity Alignment   DBP15K ZH-EN   Hits@1: 84.2   11
Multimodal Entity Alignment   DBP15K JA-EN   Hits@1: 89.2   11
Multimodal Entity Alignment   DBP15K FR-EN   Hits@1: 93.2   11
Entity Alignment              DWY15K DW-V1   Hits@1: 95.4   7
Entity Alignment              DWY15K DW-V2   Hits@1: 93.9   7
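
For readers unfamiliar with the metric, Hits@1 (written H@1 or Hits@1) is the fraction of source entities whose top-ranked candidate is the correct counterpart; the figures listed here are presumably percentages. The following is a minimal sketch of how such a score is commonly computed, not the evaluation code used in the paper. The function name `hits_at_k` and the toy similarity scores are hypothetical.

```python
def hits_at_k(similarity, gold, k=1):
    """Fraction of source entities whose gold target is among the top-k candidates.

    similarity[i][j] is the score between source entity i and target entity j;
    gold[i] is the index of the correct target for source entity i.
    """
    if not similarity:
        raise ValueError("similarity matrix is empty")
    hits = 0
    for i, row in enumerate(similarity):
        # Rank target indices by descending similarity and keep the first k.
        top_k = sorted(range(len(row)), key=lambda j: row[j], reverse=True)[:k]
        if gold[i] in top_k:
            hits += 1
    return hits / len(similarity)


# Toy example with 3 source and 3 target entities (made-up scores).
sim = [
    [0.9, 0.1, 0.3],  # source 0: best candidate is target 0 (correct)
    [0.2, 0.4, 0.8],  # source 1: best candidate is target 2 (gold is 1, a miss)
    [0.1, 0.2, 0.7],  # source 2: best candidate is target 2 (correct)
]
gold = [0, 1, 2]

hits1 = hits_at_k(sim, gold, k=1)  # 2 of 3 sources are hit at rank 1
hits2 = hits_at_k(sim, gold, k=2)  # all 3 sources are hit within the top 2
```

Multiplying the returned fraction by 100 gives a value on the same scale as the results listed above.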
