Representation Learning for Heterogeneous Information Networks via Embedding Events
About
Network representation learning (NRL) has been widely used to help analyze large-scale networks through mapping original networks into a low-dimensional vector space. However, existing NRL methods ignore the impact of properties of relations on the object relevance in heterogeneous information networks (HINs). To tackle this issue, this paper proposes a new NRL framework, called Event2vec, for HINs to consider both quantities and properties of relations during the representation learning process. Specifically, an event (i.e., a complete semantic unit) is used to represent the relation among multiple objects, and both event-driven first-order and second-order proximities are defined to measure the object relevance according to the quantities and properties of relations. We theoretically prove how event-driven proximities can be preserved in the embedding space by Event2vec, which utilizes event embeddings to facilitate learning the object embeddings. Experimental studies demonstrate the advantages of Event2vec over state-of-the-art algorithms on four real-world datasets and three network analysis tasks (including network reconstruction, link prediction, and node classification).
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Link Prediction | DBLP 20% edges (test) | AUC90.1 | 15 | |
| Link Prediction | Yelp 20% edges (test) | AUC86.2 | 15 | |
| Link Prediction | Douban 20% edges (test) | AUC82.3 | 6 | |
| Network reconstruction | DBLP | AUC0.982 | 6 | |
| Network reconstruction | IMDB | AUC0.987 | 6 | |
| Link Prediction | IMDB 20% edges (test) | AUC0.894 | 6 | |
| Network reconstruction | Yelp | AUC92.4 | 6 | |
| Network reconstruction | Douban | AUC87.2 | 6 |