Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Bayesian CP Factorization of Incomplete Tensors with Automatic Rank Determination

About

CANDECOMP/PARAFAC (CP) tensor factorization of incomplete data is a powerful technique for tensor completion through explicitly capturing the multilinear latent factors. The existing CP algorithms require the tensor rank to be manually specified, however, the determination of tensor rank remains a challenging problem especially for CP rank. In addition, existing approaches do not take into account uncertainty information of latent factors, as well as missing entries. To address these issues, we formulate CP factorization using a hierarchical probabilistic model and employ a fully Bayesian treatment by incorporating a sparsity-inducing prior over multiple latent factors and the appropriate hyperpriors over all hyperparameters, resulting in automatic rank determination. To learn the model, we develop an efficient deterministic Bayesian inference algorithm, which scales linearly with data size. Our method is characterized as a tuning parameter-free approach, which can effectively infer underlying multilinear factors with a low-rank constraint, while also providing predictive distributions over missing entries. Extensive simulations on synthetic data illustrate the intrinsic capability of our method to recover the ground-truth of CP rank and prevent the overfitting problem, even when a large amount of entries are missing. Moreover, the results from real-world applications, including image inpainting and facial image synthesis, demonstrate that our method outperforms state-of-the-art approaches for both tensor factorization and tensor completion in terms of predictive performance.

Qibin Zhao, Liqing Zhang, Andrzej Cichocki• 2014

Related benchmarks

TaskDatasetResultRank
Image CompletionStandard Images SR 30% (test)
PSNR26.69
48
Image CompletionStandard Images SR 20% (test)
PSNR25.34
48
3D Sound Speed Field ReconstructionSSF data
RMSE0.458
36
Tensor completionSynthetic Discrete Data tensor size 30x30x30 (20% Sampling Rate)
RRSE0.14
12
Tensor completionSynthetic Discrete Data tensor size 30x30x30 (30% Sampling Rate)
RRSE0.109
12
Video CompletionSalesman
MPSNR29.077
8
Video CompletionNews
MPSNR28.234
8
Video CompletionSilent
MPSNR30.126
8
Multidimensional Time Series RecoveryAbilene Pattern-2, 30% missing
MAE2.35
7
Multidimensional Time Series RecoveryAbilene Pattern-2 70% missing
MAE1.67
7
Showing 10 of 18 rows

Other info

Follow for update