Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Optimal Sketching for Kronecker Product Regression and Low Rank Approximation

About

We study the Kronecker product regression problem, in which the design matrix is a Kronecker product of two or more matrices. Given $A_i \in \mathbb{R}^{n_i \times d_i}$ for $i=1,2,\dots,q$ where $n_i \gg d_i$ for each $i$, and $b \in \mathbb{R}^{n_1 n_2 \cdots n_q}$, let $\mathcal{A} = A_1 \otimes A_2 \otimes \cdots \otimes A_q$. Then for $p \in [1,2]$, the goal is to find $x \in \mathbb{R}^{d_1 \cdots d_q}$ that approximately minimizes $\|\mathcal{A}x - b\|_p$. Recently, Diao, Song, Sun, and Woodruff (AISTATS, 2018) gave an algorithm which is faster than forming the Kronecker product $\mathcal{A}$ Specifically, for $p=2$ their running time is $O(\sum_{i=1}^q \text{nnz}(A_i) + \text{nnz}(b))$, where nnz$(A_i)$ is the number of non-zero entries in $A_i$. Note that nnz$(b)$ can be as large as $n_1 \cdots n_q$. For $p=1,$ $q=2$ and $n_1 = n_2$, they achieve a worse bound of $O(n_1^{3/2} \text{poly}(d_1d_2) + \text{nnz}(b))$. In this work, we provide significantly faster algorithms. For $p=2$, our running time is $O(\sum_{i=1}^q \text{nnz}(A_i) )$, which has no dependence on nnz$(b)$. For $p<2$, our running time is $O(\sum_{i=1}^q \text{nnz}(A_i) + \text{nnz}(b))$, which matches the prior best running time for $p=2$. We also consider the related all-pairs regression problem, where given $A \in \mathbb{R}^{n \times d}, b \in \mathbb{R}^n$, we want to solve $\min_{x} \|\bar{A}x - \bar{b}\|_p$, where $\bar{A} \in \mathbb{R}^{n^2 \times d}, \bar{b} \in \mathbb{R}^{n^2}$ consist of all pairwise differences of the rows of $A,b$. We give an $O(\text{nnz}(A))$ time algorithm for $p \in[1,2]$, improving the $\Omega(n^2)$ time needed to form $\bar{A}$. Finally, we initiate the study of Kronecker product low rank and low $t$-rank approximation. For input $\mathcal{A}$ as above, we give $O(\sum_{i=1}^q \text{nnz}(A_i))$ time algorithms, which is much faster than computing $\mathcal{A}$.

Huaian Diao, Rajesh Jayaram, Zhao Song, Wen Sun, David P. Woodruff• 2019

Related benchmarks

TaskDatasetResultRank
Kronecker regressionRandom n^2 x d^2 design matrix
Loss4.00e-4
118
Kronecker regressionSynthetic n^2 x d^2 design matrix (test)
Running Time (s)0.0051
118
Low-rank tensor decompositionCardiac MRI tensor
Avg Iteration Time (s)1.3
64
Tucker decompositionCardiac MRI tensor
Relative Reconstruction Error0.409
63
Tensor DecompositionHyperspectral image tensor
Avg Iteration Time (s)2.203
48
Tensor DecompositionCOIL-100
Average Iteration Time (s)10.131
48
Tensor ReconstructionCOIL-100 1.0 (test)
Relative Reconstruction Error0.356
48
Hyperspectral Tensor ReconstructionHyperspectral radiance image tensor (full)
Relative Reconstruction Error0.139
47
Kronecker regressionSynthetic Kronecker product matrices d=64
Loss0.035
15
Showing 9 of 9 rows

Other info

Follow for update