Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Bridging the Gap between Community and Node Representations: Graph Embedding via Community Detection

About

Graph embedding has become a key component of many data mining and analysis systems. Current graph embedding approaches either sample a large number of node pairs from a graph to learn node embeddings via stochastic optimization or factorize a high-order proximity/adjacency matrix of the graph via computationally expensive matrix factorization techniques. These approaches typically require significant resources for the learning process and rely on multiple parameters, which limits their applicability in practice. Moreover, most of the existing graph embedding techniques operate effectively in one specific metric space only (e.g., the one produced with cosine similarity), do not preserve higher-order structural features of the input graph and cannot automatically determine a meaningful number of embedding dimensions. Typically, the produced embeddings are not easily interpretable, which complicates further analyses and limits their applicability. To address these issues, we propose DAOR, a highly efficient and parameter-free graph embedding technique producing metric space-robust, compact and interpretable embeddings without any manual tuning. Compared to a dozen state-of-the-art graph embedding algorithms, DAOR yields competitive results on both node classification (which benefits form high-order proximity) and link prediction (which relies on low-order proximity mostly). Unlike existing techniques, however, DAOR does not require any parameter tuning and improves the embeddings generation speed by several orders of magnitude. Our approach has hence the ambition to greatly simplify and speed up data analysis tasks involving graph representation learning.

Artem Lutov, Dingqi Yang, Philippe Cudr\'e-Mauroux• 2019

Related benchmarks

TaskDatasetResultRank
Node ClassificationDBLP
Micro-F187.86
94
Node ClassificationPPI
Micro F119.07
29
Node ClassificationWiki
Micro F10.5324
23
Node Embedding LearningPPI
Time (s)0.2
20
Node Embedding LearningBlog
Runtime (s)1.6
14
Node Embedding LearningWiki
Time (s)0.4
14
Node Embedding LearningDBLP
Time (s)0.2
14
Link PredictionPPI
Precision@1001.75
14
Link PredictionWiki
Precision@1001.64
14
Node ClassificationBlog
Micro-F133.05
14
Showing 10 of 13 rows

Other info

Code

Follow for update