Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Network Embedding as Matrix Factorization: Unifying DeepWalk, LINE, PTE, and node2vec

About

Since the invention of word2vec, the skip-gram model has significantly advanced the research of network embedding, such as the recent emergence of the DeepWalk, LINE, PTE, and node2vec approaches. In this work, we show that all of the aforementioned models with negative sampling can be unified into the matrix factorization framework with closed forms. Our analysis and proofs reveal that: (1) DeepWalk empirically produces a low-rank transformation of a network's normalized Laplacian matrix; (2) LINE, in theory, is a special case of DeepWalk when the size of vertices' context is set to one; (3) As an extension of LINE, PTE can be viewed as the joint factorization of multiple networks' Laplacians; (4) node2vec is factorizing a matrix related to the stationary distribution and transition probability tensor of a 2nd-order random walk. We further provide the theoretical connections between skip-gram based network embedding algorithms and the theory of graph Laplacian. Finally, we present the NetMF method as well as its approximation algorithm for computing network embedding. Our method offers significant improvements over DeepWalk and LINE for conventional network mining tasks. This work lays the theoretical foundation for skip-gram based network embedding methods, leading to a better understanding of latent network representation learning.

Jiezhong Qiu, Yuxiao Dong, Hao Ma, Jian Li, Kuansan Wang, Jie Tang• 2017

Related benchmarks

TaskDatasetResultRank
Node ClassificationOgbn-arxiv
Accuracy76.21
191
Node ClusteringCora
Accuracy81.75
115
Node ClusteringCiteseer--
110
Node ClassificationCora (semi-supervised)
Accuracy84.44
103
Node ClassificationDBLP
Micro-F193.59
94
ClusteringPubmed
Accuracy81.74
61
Node ClassificationCite semi-supervised
Accuracy71.18
61
Semi-supervised node classificationPubmed
Accuracy82.56
60
Node ClassificationPPI
Micro F115.03
29
Node ClassificationWiki
Micro F10.5762
23
Showing 10 of 30 rows

Other info

Follow for update