Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data

About

Discovering genes with similar functions across diverse biomedical contexts poses a significant challenge in gene representation learning due to data heterogeneity. In this study, we resolve this problem by introducing a novel model called Multimodal Similarity Learning Graph Neural Network, which combines Multimodal Machine Learning and Deep Graph Neural Networks to learn gene representations from single-cell sequencing and spatial transcriptomic data. Leveraging 82 training datasets from 10 tissues, three sequencing techniques, and three species, we create informative graph structures for model training and gene representations generation, while incorporating regularization with weighted similarity learning and contrastive learning to learn cross-data gene-gene relationships. This novel design ensures that we can offer gene representations containing functional similarity across different contexts in a joint space. Comprehensive benchmarking analysis shows our model's capacity to effectively capture gene function similarity across multiple modalities, outperforming state-of-the-art methods in gene representation learning by up to 97.5%. Moreover, we employ bioinformatics tools in conjunction with gene representations to uncover pathway enrichment, regulation causal networks, and functions of disease-associated or dosage-sensitive genes. Therefore, our model efficiently produces unified gene representations for the analysis of gene functions, tissue functions, diseases, and species evolution.

Tianyu Liu, Yuge Wang, Rex Ying, Hongyu Zhao• 2023

Related benchmarks

TaskDatasetResultRank
Hazard PredictionBreast Cancer GRN
C-Index0.639
11
BP ClassificationBreast Cancer GRN
Subset Accuracy23.8
11
Gene embeddingHeart
Average Rank2.67
9
Gene embeddingLung
Average Rank2.17
9
Gene embeddingLiver
Average Rank2.5
9
Gene embeddingKidney
Average Rank2.83
9
Gene embeddingThymus
Average Rank2.67
9
Gene embeddingSpleen
Average Rank2
9
Gene embeddingPancreas
Avg Rank2.83
9
Gene embeddingCerebrum
Average Rank1.67
9
Showing 10 of 22 rows

Other info

Code

Follow for update