Intrinsic dimension estimation of data by principal component analysis

About

Estimating the intrinsic dimensionality of data is a classic problem in pattern recognition and statistics. Principal Component Analysis (PCA) is a powerful tool for discovering the dimensionality of data sets with a linear structure; it becomes ineffective, however, when the data have a nonlinear structure. In this paper, we propose a new PCA-based method to estimate the intrinsic dimension of data with nonlinear structures. Our method works by first finding a minimal cover of the data set, then performing PCA locally on each subset in the cover, and finally producing the estimate by checking the data variance on all small neighborhood regions. The proposed method uses the whole data set to estimate its intrinsic dimension and is convenient for incremental learning. In addition, our new PCA procedure can filter out noise in the data and converges to a stable estimate as the neighborhood region size increases. Experiments on synthetic and real-world data sets show the effectiveness of the proposed method.
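
The three steps named in the abstract (cover the data, run PCA on each subset, inspect the local variance) can be illustrated with a rough sketch. This is not the authors' algorithm: the greedy k-nearest-neighbor cover, the 95% explained-variance cutoff, and the majority vote across patches are all assumptions chosen for illustration, and the function names `greedy_knn_cover`, `local_dimension`, and `estimate_intrinsic_dimension` are hypothetical.

```python
import numpy as np


def greedy_knn_cover(X, k):
    """Cover X with k-nearest-neighbor patches picked greedily.

    Assumption: a simple heuristic stand-in for the paper's minimal cover;
    it does not guarantee minimality.
    """
    n = X.shape[0]
    uncovered = np.ones(n, dtype=bool)
    patches = []
    while uncovered.any():
        center = np.flatnonzero(uncovered)[0]
        dists = np.linalg.norm(X - X[center], axis=1)
        patch = np.argsort(dists)[:k]
        patches.append(patch)
        uncovered[patch] = False
        uncovered[center] = False  # guarantees progress even with duplicate points
    return patches


def local_dimension(points, var_threshold):
    """Number of principal components needed to reach var_threshold of the variance."""
    centered = points - points.mean(axis=0)
    singular_values = np.linalg.svd(centered, compute_uv=False)
    eigvals = singular_values ** 2  # proportional to the PCA eigenvalues
    total = eigvals.sum()
    if total == 0:
        return 0  # degenerate patch: all points coincide
    ratio = np.cumsum(eigvals) / total
    count = int(np.searchsorted(ratio, var_threshold)) + 1
    return min(count, len(eigvals))


def estimate_intrinsic_dimension(X, k=15, var_threshold=0.95):
    """Majority vote over local PCA dimensions (an assumed aggregation rule)."""
    X = np.asarray(X, dtype=float)
    dims = [local_dimension(X[p], var_threshold) for p in greedy_knn_cover(X, k)]
    return int(np.bincount(dims).argmax())


if __name__ == "__main__":
    # Points on a 2-sphere embedded in R^3: nonlinear globally, 2-dimensional locally.
    rng = np.random.default_rng(0)
    v = rng.normal(size=(1000, 3))
    sphere = v / np.linalg.norm(v, axis=1, keepdims=True)
    print(estimate_intrinsic_dimension(sphere, k=15))
```

In this sketch the patch size `k` plays the role of the neighborhood region size mentioned in the abstract. Patches that are too large pick up curvature and can inflate the estimate, while patches that are too small are dominated by noise.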

Mingyu Fan, Nannan Gu, Hong Qiao, Bo Zhang • 2010

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Intrinsic Dimensionality Estimation | Benchmark Manifolds | MPE 18.4 | 76 |
| Intrinsic Dimensionality Estimation | 6D sphere (S6) embedded in R11 with Gaussian noise synthetic (test) | Average Estimated Dimension 7 | 42 |
| Intrinsic Dimension Estimation | S10 manifold embedded in R11 sigma = 0.0 | Average Estimated Dimension 11 | 14 |
| Intrinsic Dimension Estimation | S10 manifold embedded in R11 sigma = 0.01 | Average Estimated Dimension 11 | 14 |
| Intrinsic Dimension Estimation | S10 manifold embedded in R11 sigma = 0.1 | Average Estimated Dimension 11 | 14 |