Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

LINE: Large-scale Information Network Embedding

About

This paper studies the problem of embedding very large information networks into low-dimensional vector spaces, which is useful in many tasks such as visualization, node classification, and link prediction. Most existing graph embedding methods do not scale for real world information networks which usually contain millions of nodes. In this paper, we propose a novel network embedding method called the "LINE," which is suitable for arbitrary types of information networks: undirected, directed, and/or weighted. The method optimizes a carefully designed objective function that preserves both the local and global network structures. An edge-sampling algorithm is proposed that addresses the limitation of the classical stochastic gradient descent and improves both the effectiveness and the efficiency of the inference. Empirical experiments prove the effectiveness of the LINE on a variety of real-world information networks, including language networks, social networks, and citation networks. The algorithm is very efficient, which is able to learn the embedding of a network with millions of vertices and billions of edges in a few hours on a typical single machine. The source code of the LINE is available online.

Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, Qiaozhu Mei• 2015

Related benchmarks

TaskDatasetResultRank
Node ClassificationPubmed--
742
Link PredictionFB15K (test)--
164
Link PredictionCiteseer
AUC79.1
146
RecommendationGowalla (test)
Recall@200.1335
126
Link PredictionPubmed
AUC84.9
123
Link PredictionCora
AUC0.844
116
RecommendationAmazon-Book (test)
Recall@200.041
101
Node ClassificationDBLP
Micro-F186.83
94
RecommendationYelp 2018 (test)
Recall@205.49
90
Relation ExtractionNYT (test)
F1 Score33.2
85
Showing 10 of 217 rows
...

Other info

Code

Follow for update