Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Optimal whitening and decorrelation

About

Whitening, or sphering, is a common preprocessing step in statistical analysis to transform random variables to orthogonality. However, due to rotational freedom there are infinitely many possible whitening procedures. Consequently, there is a diverse range of sphering methods in use, for example based on principal component analysis (PCA), Cholesky matrix decomposition and zero-phase component analysis (ZCA), among others. Here we provide an overview of the underlying theory and discuss five natural whitening procedures. Subsequently, we demonstrate that investigating the cross-covariance and the cross-correlation matrix between sphered and original variables allows to break the rotational invariance and to identify optimal whitening transformations. As a result we recommend two particular approaches: ZCA-cor whitening to produce sphered variables that are maximally similar to the original variables, and PCA-cor whitening to obtain sphered variables that maximally compress the original variables.

Agnan Kessy, Alex Lewin, Korbinian Strimmer• 2015

Related benchmarks

TaskDatasetResultRank
Batch CorrectionDEEP 1
Connectivity Score0.3
11
Batch CorrectionJUMP 1
Conn.0.48
11
Batch CorrectionDEEP 3
Connectivity0.3
9
Batch CorrectionDEEP 2 low diversity batch setting
Connectivity0.29
9
Batch CorrectionJUMP 2
Connectivity0.34
9
Batch CorrectionJUMP 4
Connectivity0.58
9
Batch CorrectionJUMP 3 1.0 (supplementary)
Connectivity0.74
9
Batch CorrectionJUMP large, diverse batch setting 5
Conn.0.34
7
Showing 8 of 8 rows

Other info

Follow for update