Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions
About
Low-rank matrix approximations, such as the truncated singular value decomposition and the rank-revealing QR decomposition, play a central role in data analysis and scientific computing. This work surveys and extends recent research which demonstrates that randomization offers a powerful tool for performing low-rank matrix approximation. These techniques exploit modern computational architectures more fully than classical methods and open the possibility of dealing with truly massive data sets. This paper presents a modular framework for constructing randomized algorithms that compute partial matrix decompositions. These methods use random sampling to identify a subspace that captures most of the action of a matrix. The input matrix is then compressed---either explicitly or implicitly---to this subspace, and the reduced matrix is manipulated deterministically to obtain the desired low-rank factorization. In many cases, this approach beats its classical competitors in terms of accuracy, speed, and robustness. These claims are supported by extensive numerical experiments and a detailed error analysis.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Oil Prediction | Corn | MSE0.3511 | 12 | |
| Protein Prediction | Corn | MSE0.463 | 12 | |
| Starch Prediction | Corn | MSE0.3508 | 12 | |
| Moisture Prediction | Corn | MSE0.1486 | 12 | |
| Grain composition prediction | Corn Link-2 scenario (Client 3) | MSE0.4531 | 4 | |
| Protein Prediction | Tecator Client 3 | MSE0.313 | 4 | |
| Water Prediction | Tecator Client 3 | MSE0.463 | 4 | |
| Fat Prediction | Tecator Client 3 | MSE0.2564 | 4 | |
| Grain composition prediction | Corn Link-1 scenario (Client 2) | MSE1.5381 | 4 | |
| Grain composition prediction | Corn Link-1 scenario (Client 3) | MSE1.168 | 4 |