ToMAToMP: Robust and Multi-Parameter Topological Clustering

About

Topological clustering, and its main algorithm ToMATo, is a clustering method from Topological Data Analysis (TDA) which has been applied successfully in several applications during the last few years. This is due to its high versatility, as clusters are detected from the persistent components in the sublevel sets of any user-defined function (gene expression, pixel values, etc.), and efficiency, as topological clustering enjoys robustness guarantees. However, ToMATo is also limited in several ways. First, a graph on the data points needs to be provided as a hyper-parameter of the method (whose fine-tuning is left to the user). Second, ToMATo is known to be very sensitive to outlier values in the function range. Finally, and most importantly, ToMATo can only handle one function at a time, whereas it is critical to use several functions in various applications. In this article, we introduce ToMAToMP: the first topological clustering method able to handle several functions at the same time with theoretical guarantees. More specifically, we leverage a recent tool from multi-parameter persistent homology, called MMA decomposition, to design our clustering algorithm, and prove that it enjoys robustness properties. As corollaries, we show that it can be used to make ToMATo independent of graph tuning, and robust to outliers. Finally, we provide a set of numerical experiments showcasing the efficiency and quality of the clusterings produced by ToMAToMP, by showing strong improvement over non-topological and topological baselines for various datasets.
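
To make the single-function setting concrete, here is a minimal sketch of persistence-based clustering on a graph, in the spirit of ToMATo with one function and the sublevel-set convention used above. It is not the ToMAToMP algorithm and not the authors' implementation. The function name `tomato_sublevel`, the adjacency dictionary `neighbors`, and the persistence threshold `tau` are hypothetical names chosen for illustration. As a simplification, each vertex is attached to the oldest neighboring component rather than following a pseudo-gradient.

```python
from typing import Dict, List, Sequence


def tomato_sublevel(values: Sequence[float],
                    neighbors: Dict[int, List[int]],
                    tau: float) -> List[int]:
    """Label every vertex with the local minimum that roots its cluster.

    Illustrative sketch only: vertices are swept by increasing function
    value, each local minimum starts a component, and a component is merged
    into an older one when its persistence (merge value minus birth value)
    is below `tau`.
    """
    n = len(values)
    parent = [-1] * n  # -1 marks a vertex that has not been processed yet

    def find(i: int) -> int:
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for v in sorted(range(n), key=lambda i: values[i]):
        # Roots of the components already present around v.
        roots = {find(u) for u in neighbors.get(v, []) if parent[u] != -1}
        if not roots:
            parent[v] = v  # local minimum: a new component is born
            continue
        # Attach v to the oldest neighboring component (lowest birth value).
        oldest = min(roots, key=lambda r: values[r])
        parent[v] = oldest
        for r in roots:
            # Merge younger components whose persistence is below tau.
            if r != oldest and values[v] - values[r] < tau:
                parent[r] = oldest
    return [find(i) for i in range(n)]


# Path graph 0-1-2-3-4 with two local minima (vertices 0 and 3).
values = [0.0, 1.0, 3.0, 0.5, 1.5]
neighbors = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
low_tau_labels = tomato_sublevel(values, neighbors, tau=1.0)   # [0, 0, 0, 3, 3]
high_tau_labels = tomato_sublevel(values, neighbors, tau=3.0)  # [0, 0, 0, 0, 0]
```

In this toy example the component born at vertex 3 has persistence 3.0 - 0.5 = 2.5, so it survives as its own cluster when `tau` is 1.0 and is absorbed when `tau` is 3.0. The sketch also shows the two dependencies noted in the abstract: the result changes with the graph passed in as `neighbors`, and only one function (`values`) is used.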

Ludo Andrianirina, Mathieu Carrière • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Clustering | Synthetic | ARI 0.9551 | 17 |
| Clustering | Synthetic | AMI 92.68 | 17 |
| Clustering | 3dshape | ARI 73.33 | 9 |
| Clustering | Image | AMI 0.9311 | 6 |
| Gene ranking | kpmp 1-g split with outliers | Pearson Correlation 0.9938 | 4 |
| Gene ranking | spat 1-gr split with outliers | Pearson Correlation 0.9797 | 4 |
| Gene ranking | kpmp 2-g split with outliers | Pearson Correlation 0.1442 | 4 |
| Gene ranking | KPMP 1-g (test) | TopHits@10 99.43 | 4 |
| Gene ranking | 1-g spat (test) | TopHits@10 97.65 | 4 |
| Gene ranking | kpmp 2-g (test) | TopHits@10 50.59 | 4 |
Showing 10 of 19 rows
