Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

graph2vec: Learning Distributed Representations of Graphs

About

Recent works on representation learning for graph structured data predominantly focus on learning distributed representations of graph substructures such as nodes and subgraphs. However, many graph analytics tasks such as graph classification and clustering require representing entire graphs as fixed length feature vectors. While the aforementioned approaches are naturally unequipped to learn such representations, graph kernels remain as the most effective way of obtaining them. However, these graph kernels use handcrafted features (e.g., shortest paths, graphlets, etc.) and hence are hampered by problems such as poor generalization. To address this limitation, in this work, we propose a neural embedding framework named graph2vec to learn data-driven distributed representations of arbitrary sized graphs. graph2vec's embeddings are learnt in an unsupervised manner and are task agnostic. Hence, they could be used for any downstream task such as graph classification, clustering and even seeding supervised representation learning approaches. Our experiments on several benchmark and large real-world datasets show that graph2vec achieves significant improvements in classification and clustering accuracies over substructure representation learning approaches and are competitive with state-of-the-art graph kernels.

Annamalai Narayanan, Mahinthan Chandramohan, Rajasekar Venkatesan, Lihui Chen, Yang Liu, Shantanu Jaiswal• 2017

Related benchmarks

TaskDatasetResultRank
Graph ClassificationPROTEINS
Accuracy73.3
994
Graph ClassificationMUTAG
Accuracy83.2
862
Graph ClassificationNCI1
Accuracy76.2
501
Graph ClassificationIMDB-B
Accuracy71.1
378
Graph ClassificationIMDB-M
Accuracy50.44
275
Graph ClassificationNCI109
Accuracy74.26
223
Graph ClassificationMUTAG (10-fold cross-validation)
Accuracy83.15
219
Graph ClassificationMutag (test)
Accuracy83.15
217
Graph ClassificationPTC-MR
Accuracy60.2
197
Graph ClassificationPROTEINS (test)
Accuracy73.3
180
Showing 10 of 41 rows

Other info

Code

Follow for update