Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

DGL-KE: Training Knowledge Graph Embeddings at Scale

About

Knowledge graphs have emerged as a key abstraction for organizing information in diverse domains and their embeddings are increasingly used to harness their information in various information retrieval and machine learning tasks. However, the ever growing size of knowledge graphs requires computationally efficient algorithms capable of scaling to graphs with millions of nodes and billions of edges. This paper presents DGL-KE, an open-source package to efficiently compute knowledge graph embeddings. DGL-KE introduces various novel optimizations that accelerate training on knowledge graphs with millions of nodes and billions of edges using multi-processing, multi-GPU, and distributed parallelism. These optimizations are designed to increase data locality, reduce communication overhead, overlap computations with memory accesses, and achieve high operation efficiency. Experiments on knowledge graphs consisting of over 86M nodes and 338M edges show that DGL-KE can compute embeddings in 100 minutes on an EC2 instance with 8 GPUs and 30 minutes on an EC2 cluster with 4 machines with 48 cores/machine. These results represent a 2x~5x speedup over the best competing approaches. DGL-KE is available on https://github.com/awslabs/dgl-ke.

Da Zheng, Xiang Song, Chao Ma, Zeyuan Tan, Zihao Ye, Jin Dong, Hao Xiong, Zheng Zhang, George Karypis• 2020

Related benchmarks

TaskDatasetResultRank
Link PredictionWikiKG90M v2
Hits@1046.94
15
Link PredictionYAGO 4.5+T
Hits@1078.21
12
Link PredictionYAGO3
Hits@1082.93
10
Link PredictionYAGO 4.5
Hits@1074.46
8
Link PredictionYAGO4
Hits@100.3786
6
Link PredictionFreebase
Hits@1039.64
6
US elections predictionFreebase
Normalized Mean CV Score0.962
5
Housing prices predictionFreebase
Mean CV Score (Normalized)0.445
5
Movie revenues predictionFreebase
Normalized Mean CV Score61
5
US accidents predictionFreebase
Normalized Mean CV Score68.6
5
Showing 10 of 10 rows

Other info

Follow for update