
Learning metrics for persistence-based summaries and applications for graph classification

About

Recently, a new feature representation and data analysis methodology based on a topological tool called persistent homology (and its corresponding persistence diagram summary) has started to gain momentum. A series of methods have been developed to map a persistence diagram to a vector representation so as to facilitate the downstream use of machine learning tools. In these approaches, the importance (weight) of different persistence features is often preset. In practice, however, the choice of weight function should depend on the nature of the specific type of data one considers, and it is thus highly desirable to learn the best weight function (and thus a metric for persistence diagrams) from labelled data. We study this problem and develop a new weighted kernel, called WKPI, for persistence summaries, as well as an optimization framework to learn a good metric for persistence summaries. Both our kernel and optimization problem have nice properties. We further apply the learned kernel to the challenging task of graph classification, and show that our WKPI-based classification framework obtains similar or (sometimes significantly) better results than the best results from a range of previous graph classification frameworks on a collection of benchmark datasets.
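
To make the idea of a weighted kernel on vectorized persistence summaries more concrete, here is a minimal sketch. It assumes each persistence diagram has already been converted to a fixed-size vector (for example, a flattened persistence image) and that every entry of that vector has a non-negative weight. The function name `weighted_pi_kernel`, the per-entry Gaussian form, and the `sigma` parameter are illustrative assumptions, not details taken from the abstract; the paper's actual WKPI definition and its weight-learning procedure may differ.

```python
import numpy as np


def weighted_pi_kernel(pi_a, pi_b, weights, sigma=1.0):
    """Weighted sum of per-entry Gaussian similarities between two vectors.

    Illustrative assumption: pi_a and pi_b are vectorized persistence
    summaries of equal size, and weights holds one non-negative
    importance value per entry.
    """
    pi_a = np.asarray(pi_a, dtype=float).ravel()
    pi_b = np.asarray(pi_b, dtype=float).ravel()
    weights = np.asarray(weights, dtype=float).ravel()

    if not (pi_a.shape == pi_b.shape == weights.shape):
        raise ValueError("pi_a, pi_b and weights must have the same size")
    if np.any(weights < 0):
        raise ValueError("weights must be non-negative")
    if sigma <= 0:
        raise ValueError("sigma must be positive")

    # Each entry contributes a Gaussian similarity scaled by its weight.
    sq_diff = (pi_a - pi_b) ** 2
    return float(np.sum(weights * np.exp(-sq_diff / (2.0 * sigma ** 2))))


# Hypothetical toy inputs: two 2-entry summaries and one weight per entry.
a = [0.0, 1.0]
b = [0.0, 3.0]
w = [1.0, 2.0]
k_ab = weighted_pi_kernel(a, b, w, sigma=1.0)
k_aa = weighted_pi_kernel(a, a, w, sigma=1.0)
```

In this sketch, comparing a summary with itself returns the sum of the weights, and entries with larger weights influence the similarity more. A learned metric, as the abstract describes, would correspond to fitting `weights` from labelled data rather than fixing them in advance.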

Qi Zhao, Yusu Wang • 2019

Related benchmarks

Task                  Dataset                              Result         Rank
Graph Classification  PROTEINS                             Accuracy 78.5  1252
Graph Classification  MUTAG                                Accuracy 88.3  1103
Graph Classification  NCI1                                 Accuracy 87.5  658
Graph Classification  IMDB-B                               Accuracy 75.1  425
Graph Classification  DD                                   Accuracy 82    300
Graph Classification  NCI109                               Accuracy 87.4  267
Graph Classification  MUTAG (10-fold cross-validation)     Accuracy 88.3  227
Graph Classification  Mutag (test)                         Accuracy 88.3  224
Graph Classification  PROTEINS (10-fold cross-validation)  Accuracy 78.5  223
Graph Classification  NCI1 (test)                          Accuracy 87.5  177
Showing 10 of 40 rows
