Understanding and Improving UMAP with Geometric and Topological Priors: The JORC-UMAP Algorithm
About
Nonlinear dimensionality reduction techniques, particularly UMAP, are widely used for visualizing high-dimensional data. However, UMAP's local Euclidean distance assumption often fails to capture intrinsic manifold geometry, leading to topological tearing and structural collapse. We identify UMAP's sensitivity to the k-nearest neighbor graph as a key cause. To address this, we introduce Ollivier-Ricci curvature as a geometric prior, reinforcing edges at geometric bottlenecks and reducing redundant links. Since curvature estimation is noise-sensitive, we also incorporate a topological prior using Jaccard similarity to ensure neighborhood consistency. The resulting method, JORC-UMAP, better distinguishes true manifold structure from spurious connections. Experiments on synthetic and real-world datasets show that JORC-UMAP reduces tearing and collapse more effectively than standard UMAP and other DR methods, as measured by SVM accuracy and triplet preservation scores, while maintaining computational efficiency. This work offers a geometry-aware enhancement to UMAP for more faithful data visualization.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Centroid Triplet Accuracy | USPS | Centroid Triplet Accuracy (CTE)81.4 | 5 | |
| Centroid Triplet Accuracy | MNIST | Centroid Triplet Accuracy (CTE)72.4 | 5 | |
| Centroid Triplet Accuracy | Fashion MNIST | Centroid Triplet Accuracy84.1 | 5 | |
| Centroid Triplet Accuracy | 20Newsgroups | Centroid Triplet Accuracy (CTE)0.792 | 5 | |
| Dimensionality Reduction | Fashion MNIST | SVM Accuracy74.6 | 5 | |
| Dimensionality Reduction | MNIST | Random Triplet Accuracy (RTE)62 | 5 | |
| Dimensionality Reduction | Fashion MNIST | RTE77.4 | 5 | |
| Centroid Triplet Accuracy | Olivetti Faces | Centroid Triplet Accuracy (CTE)74.1 | 5 | |
| Dimensionality Reduction | 20Newsgroups | SVM Accuracy44.2 | 5 | |
| Dimensionality Reduction | USPS | RTE (Accuracy)66.2 | 5 |