CA-PCA: Manifold Dimension Estimation, Adapted for Curvature
About
The success of algorithms in the analysis of high-dimensional data is often attributed to the manifold hypothesis, which supposes that this data lie on or near a manifold of much lower dimension. It is often useful to determine or estimate the dimension of this manifold before performing dimension reduction, for instance. Existing methods for dimension estimation are calibrated using a flat unit ball. In this paper, we develop CA-PCA, a version of local PCA based instead on a calibration of a quadratic embedding, acknowledging the curvature of the underlying manifold. Numerous careful experiments show that this adaptation improves the estimator in a wide range of settings.
Anna C. Gilbert, Kevin O'Neill• 2023
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Intrinsic Dimension Estimation | MNIST | Intrinsic Dimension Estimate84.66 | 13 | |
| Intrinsic Dimension Estimation | Isolet | Dimension Estimate54.22 | 10 | |
| Intrinsic Dimension Estimation | M21 manifold d=5 | Mean Dimension Estimate5 | 10 | |
| Intrinsic Dimension Estimation | M41 manifold d=3 | Mean Estimated Dimension5.83 | 10 | |
| Intrinsic Dimension Estimation | M42 manifold d=3 | Mean Estimated Dimension5.83 | 10 | |
| Intrinsic Dimension Estimation | M6 manifold d=1 | Mean Dimension Estimate2.75 | 10 | |
| Intrinsic Dimension Estimation | M7 manifold d=2 | Mean Dimension Estimate2.99 | 10 | |
| Intrinsic Dimension Estimation | M9 manifold d=2 | Mean Estimated Dimension3 | 10 | |
| Intrinsic Dimension Estimation | Manifold M21 (d=5) n=2000 (uniform samples) | Mean Dimension Estimate5 | 10 | |
| Intrinsic Dimension Estimation | ISOMAP sculpture face images | Dimension Estimate25.71 | 10 |
Showing 10 of 54 rows