A Multi-resolution Low-rank Tensor Decomposition

Abstract

The (efficient and parsimonious) decomposition of higher-order tensors is a fundamental problem with numerous applications in a variety of fields. Several methods have been proposed in the literature to that end, with the Tucker and PARAFAC decompositions being the most prominent ones. Inspired by the latter, in this work we propose a multi-resolution low-rank tensor decomposition to describe (approximate) a tensor in a hierarchical fashion. The central idea of the decomposition is to recast the tensor into multiple lower-dimensional tensors to exploit the structure at different levels of resolution. The method is first explained, an alternating least squares algorithm is discussed, and preliminary simulations illustrating the potential practical relevance are provided.

Index Terms—  Tensor decomposition, Low-rank approximation, Kronecker decomposition, multi-resolution approximation.

1 Introduction

We live in a digital age where common-life devices, from smartphones to cars, generate massive amounts of data that provide researchers and practitioners a range of opportunities. Processing contemporary information comes, however, at a cost, since data sources are messy and heterogeneous. In this context, parsimonious models emerge as an ideal tool to enhance efficiency when processing such vast amounts of information. This can be done by leveraging the structure of the data, as is the case of information living in multiple (possibly many) dimensions. Multi-dimensional data are prevalent in numerous fields, with representative examples including chemometrics, bioengineering, communications, hyper-spectral imaging, or psychometrics [1, 2]. Traditionally, matrices were used to model those datasets, but tensor-representation models have been recently breaking through. Multi-dimensional arrays, or tensors, are data structures that generalize the concept of vectors and matrices to highly-dimensional domains. In recent years, tensors have also been applied to address numerous data science and machine learning tasks, from simple interpolation to supervised classification [3].

In this data-science context, a problem of particular interest is that of tensor decomposition, which tries to estimate a set of latent factors that summarize the tensor. Many tensor decompositions were developed as the generalization of well-known matrix-decomposition methods to high-dimensional domains [4, 5]. This was the case of the PARAFAC tensor decomposition [6] and its generalization, the Tucker tensor decomposition [7], which can be both understood as higher-order generalizations of the SVD decomposition of a matrix. More specifically, these decompositions aim at describing (approximating) the tensor as a sum of rank-1 tensors, decomposing it as a sum of outer products of vectors (called factors). The PARAFAC decomposition is conceptually simple and its representation complexity scales gracefully (the number of parameters grows linearly with the rank). The Tucker decomposition enjoys additional degrees of freedom at the cost of greater complexity (exponential dependence of the number of parameters with respect to the rank). Hierarchical tensor decompositions, such as the Tensor Train (TT) decomposition [8] or a hierarchical Tucker (hTucker) decomposition [9], try to alleviate this problem. The former unwraps the tensor into a chain of three-dimensional tensors, and the latter generalizes the same idea by organizing the dimensions in a binary tree. Furthermore, in recent years significant effort has been devoted to modify existing decomposition algorithms to deal with factor constraints (e.g., non-negativeness), promote certain priors (e.g., factor sparsity), or be robust to imperfections [10] [11] [12].

However, little to no work has been carried out to study the tensor decomposition from a multi-resolution perspective. This can be specially interesting for tensor signals such as videos, where 2-, 3-, and 4-dimensional components are mixed in a single tensor. In this work, we postulate a simple but novel multi-resolution low-rank decomposition method. More specifically, this paper:

  • Introduces a new multi-resolution tensor decomposition to exploit the low-rank structure of a tensor at different resolutions.

  • Proposes an algorithm to implement the decomposition.

  • Tests the benefits of the model via numerical simulations.

Regarding the first contribution, rather than postulating a low-rank decomposition of the tensor using the original multidimensional representation, we 1) consider a collection of lower-order multidimensional representations of the tensor (where several of the original modes of the tensor are combined into a single one); 2) postulate a low-rank decomposition for each of the lower-dimensional representations; 3) map each of the representations back to the original tensor domain; and 4) model the original tensor as the sum of such low-rank representations. As illustrated in detail in the manuscript, this results in an efficient decomposition method capable of combining low-rank structures present at different resolutions.

Section 2 introduces notation and tensor preliminaries. Section 3 presents our decomposition method. A simple algorithmic approach to address the decomposition is described in Section 4. Illustrative numerical experiments are provided in Section 5.

2 Notation and tensor preliminaries

The entries of a (column) vector 𝐱𝐱{\mathbf{x}}bold_x, a matrix 𝐗𝐗{\mathbf{X}}bold_X and a tensor 𝐗𝐗\mathbf{X}bold_X  are denoted by [𝐱]nsubscriptdelimited-[]𝐱𝑛[{\mathbf{x}}]_{n}[ bold_x ] start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, [𝐗]n1,n2subscriptdelimited-[]𝐗subscript𝑛1subscript𝑛2[{\mathbf{X}}]_{n_{1},n_{2}}[ bold_X ] start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and [ 𝐗  ]n1,n2,,nIsubscriptdelimited-[] 𝐗  subscript𝑛1subscript𝑛2subscript𝑛𝐼[{\leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$\mathbf% {X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt% }}}\hskip 3.375pt}}]_{n_{1},n_{2},...,n_{I}}[ roman_X ] start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_n start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT end_POSTSUBSCRIPT, respectively, with I𝐼Iitalic_I denoting the order of tensor 𝐗𝐗\mathbf{X}bold_X . Moreover, the n𝑛nitalic_nth column of matrix 𝐗𝐗{\mathbf{X}}bold_X is denoted by [𝐗]nsubscriptdelimited-[]𝐗𝑛[{\mathbf{X}}]_{n}[ bold_X ] start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. Sets are represented by calligraphic capital letters. The cardinality of a set 𝒮𝒮{\mathcal{S}}caligraphic_S is denoted by |𝒮|𝒮|{\mathcal{S}}|| caligraphic_S |. When a set 𝒮𝒮{\mathcal{S}}caligraphic_S is ordered, we use the notation 𝒮(i)𝒮𝑖{\mathcal{S}}(i)caligraphic_S ( italic_i ) with 1i|𝒮|1𝑖𝒮1\leq i\leq|{\mathcal{S}}|1 ≤ italic_i ≤ | caligraphic_S | to denote the i𝑖iitalic_ith element of the set. The vertical concatenation of the columns of matrix 𝐗𝐗{\mathbf{X}}bold_X is denoted by vec(𝐗)vec𝐗\mathrm{vec}({\mathbf{X}})roman_vec ( bold_X ). 𝐗Fsubscriptnorm𝐗𝐹\|{\mathbf{X}}\|_{F}∥ bold_X ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT is the Frobenious norm of matrix 𝐗𝐗{\mathbf{X}}bold_X, which can be equivalently written as vec(𝐗)2subscriptnormvec𝐗2\|\mathrm{vec}({\mathbf{X}})\|_{2}∥ roman_vec ( bold_X ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

2.1 Tensor to matrix unfolding

Given a tensor 𝐗𝐗\mathbf{X}bold_X  of order I𝐼Iitalic_I and size N1××NIsubscript𝑁1subscript𝑁𝐼N_{1}\times...\times N_{I}italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT × … × italic_N start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT, there are many ways to unfold the entries of the tensor into a matrix 𝐗𝐗{\mathbf{X}}bold_X. In this section, we are interested in unfoldings where the columns of matrix 𝐗𝐗{\mathbf{X}}bold_X represent one of the original modes of 𝐗𝐗\mathbf{X}bold_X  and the rows of 𝐗𝐗{\mathbf{X}}bold_X represent all the other modes of the tensor. Mathematically, we define the matrix unfolding operator as

𝐗=𝐗absent\displaystyle{\mathbf{X}}=bold_X = matp( 𝐗  )(N1Np1Np+1NI)×Npwheresubscriptmat𝑝 𝐗  superscriptsubscript𝑁1subscript𝑁𝑝1subscript𝑁𝑝1subscript𝑁𝐼subscript𝑁𝑝where\displaystyle\mathrm{mat}_{p}({\leavevmode\hbox{\set@color\hskip 3.375pt% \leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{% \set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}})\in{\mathbb{R}}^{(N_{1}% ...N_{p-1}N_{p+1}...N_{I})\times N_{p}}\;\mathrm{where}\hfillroman_mat start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( roman_X ) ∈ blackboard_R start_POSTSUPERSCRIPT ( italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT … italic_N start_POSTSUBSCRIPT italic_p - 1 end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_p + 1 end_POSTSUBSCRIPT … italic_N start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) × italic_N start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_where (1)
[𝐗]k,np=[ 𝐗  ]n1,,nIandsubscriptdelimited-[]𝐗𝑘subscript𝑛𝑝subscriptdelimited-[] 𝐗  subscript𝑛1subscript𝑛𝐼and\displaystyle[{\mathbf{X}}]_{k,n_{p}}=[{\leavevmode\hbox{\set@color\hskip 3.37% 5pt\leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode% \hbox{\set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}}]_{n_{1},...,n_{I}% }\;\text{and}\;[ bold_X ] start_POSTSUBSCRIPT italic_k , italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT = [ roman_X ] start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_n start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT end_POSTSUBSCRIPT and
k=n1+i=2,ipI(ni1)j=2,jpi1Nj.𝑘subscript𝑛1superscriptsubscriptformulae-sequence𝑖2𝑖𝑝𝐼subscript𝑛𝑖1superscriptsubscriptproductformulae-sequence𝑗2𝑗𝑝𝑖1subscript𝑁𝑗\displaystyle k=n_{1}+\sum_{i=2,i\neq p}^{I}(n_{i}-1)\prod_{j=2,j\neq p}^{i-1}% N_{j}.italic_k = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 2 , italic_i ≠ italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT ( italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 1 ) ∏ start_POSTSUBSCRIPT italic_j = 2 , italic_j ≠ italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT .

where pI𝑝𝐼p\leq Iitalic_p ≤ italic_I and, to simplify exposition, we have assumed that p>1𝑝1p>1italic_p > 1.

2.2 Tensor to lower-order tensor unfolding

Consider a tensor 𝐗𝐗\mathbf{X}bold_X , of order I𝐼Iitalic_I, and let :={1,2,,I}assign12𝐼{\mathcal{I}}:=\{1,2,...,I\}caligraphic_I := { 1 , 2 , … , italic_I } denote the set containing the indexes of all the modes of 𝐗𝐗\mathbf{X}bold_X .

Definition 1

The ordered set 𝒫={𝒫1,,𝒫P}𝒫subscript𝒫1subscript𝒫𝑃{\mathcal{P}}=\{{\mathcal{P}}_{1},...,{\mathcal{P}}_{P}\}caligraphic_P = { caligraphic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , caligraphic_P start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT } is a partition of the set {\mathcal{I}}caligraphic_I if it holds that: 𝒫psubscript𝒫𝑝{\mathcal{P}}_{p}\neq\emptysetcaligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≠ ∅ for all p𝑝pitalic_p, 𝒫p𝒫p=subscript𝒫𝑝subscript𝒫superscript𝑝{\mathcal{P}}_{p}\cap{\mathcal{P}}_{p^{\prime}}=\emptysetcaligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ∩ caligraphic_P start_POSTSUBSCRIPT italic_p start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = ∅ for all ppsuperscript𝑝𝑝p^{\prime}\neq pitalic_p start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ italic_p, and p=1P𝒫p=superscriptsubscript𝑝1𝑃subscript𝒫𝑝\bigcup_{p=1}^{P}{\mathcal{P}}_{p}={\mathcal{I}}⋃ start_POSTSUBSCRIPT italic_p = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P end_POSTSUPERSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = caligraphic_I.

We are interested in resha** the entries of the I𝐼Iitalic_Ith order tensor 𝐗𝐗\mathbf{X}bold_X  of size N1××NIsubscript𝑁1subscript𝑁𝐼N_{1}\times...\times N_{I}italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT × … × italic_N start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT to generate a lower-order tensor  𝐗  ˇˇ 𝐗  \check{{\leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$% \mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0% .2906pt}}}\hskip 3.375pt}}}overroman_ˇ start_ARG roman_X end_ARG, with order P<I𝑃𝐼P<Iitalic_P < italic_I and according to a given partition 𝒫={𝒫1,,𝒫P}𝒫subscript𝒫1subscript𝒫𝑃{\mathcal{P}}=\{{\mathcal{P}}_{1},...,{\mathcal{P}}_{P}\}caligraphic_P = { caligraphic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , caligraphic_P start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT } as specified next

 𝐗  ˇ=ten𝒫( 𝐗  )j=1|𝒫1||𝒫1(j)|××j=1|𝒫P||𝒫P(j)|ˇ 𝐗  subscriptten𝒫 𝐗  superscriptsuperscriptsubscriptproduct𝑗1subscript𝒫1subscript𝒫1𝑗superscriptsubscriptproduct𝑗1subscript𝒫𝑃subscript𝒫𝑃𝑗\displaystyle\check{{\leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox% {\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{% 3.09999pt}{0.2906pt}}}\hskip 3.375pt}}}=\mathrm{ten}_{\mathcal{P}}({% \leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$\mathbf{X% }$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}% }\hskip 3.375pt}})\in{\mathbb{R}}^{\prod_{j=1}^{|{\mathcal{P}}_{1}|}|{\mathcal% {P}}_{1}(j)|\times...\times\prod_{j=1}^{|{\mathcal{P}}_{P}|}|{\mathcal{P}}_{P}% (j)|}\hfilloverroman_ˇ start_ARG roman_X end_ARG = roman_ten start_POSTSUBSCRIPT caligraphic_P end_POSTSUBSCRIPT ( roman_X ) ∈ blackboard_R start_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_j ) | × … × ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_j ) | end_POSTSUPERSCRIPT (2)
[ 𝐗  ˇ]k1,,k|𝒫|=[ 𝐗  ]n1,,nIandsubscriptdelimited-[]ˇ 𝐗  subscript𝑘1subscript𝑘𝒫subscriptdelimited-[] 𝐗  subscript𝑛1subscript𝑛𝐼and\displaystyle\hskip 14.22636pt[\check{{\leavevmode\hbox{\set@color\hskip 3.375% pt\leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode% \hbox{\set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}}}]_{k_{1},...,k_{|% {\mathcal{P}}|}}=[{\leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{% \set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3% .09999pt}{0.2906pt}}}\hskip 3.375pt}}]_{n_{1},...,n_{I}}\;\;\mathrm{and}[ overroman_ˇ start_ARG roman_X end_ARG ] start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_k start_POSTSUBSCRIPT | caligraphic_P | end_POSTSUBSCRIPT end_POSTSUBSCRIPT = [ roman_X ] start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_n start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_and
kp=n𝒫p(1)if|𝒫p|=1subscript𝑘𝑝subscript𝑛subscript𝒫𝑝1ifsubscript𝒫𝑝1\displaystyle\hskip 14.22636ptk_{p}=n_{{\mathcal{P}}_{p}(1)}\;\;\mathrm{if}\;% \;|{\mathcal{P}}_{p}|=1italic_k start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_n start_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( 1 ) end_POSTSUBSCRIPT roman_if | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | = 1
kp=n𝒫p(1)+i=2|𝒫p|(n𝒫p(i)1)j=1i1N𝒫p(j)if|𝒫p|>1subscript𝑘𝑝subscript𝑛subscript𝒫𝑝1superscriptsubscript𝑖2subscript𝒫𝑝subscript𝑛subscript𝒫𝑝𝑖1superscriptsubscriptproduct𝑗1𝑖1subscript𝑁subscript𝒫𝑝𝑗ifsubscript𝒫𝑝1\displaystyle\hskip 14.22636ptk_{p}=n_{{\mathcal{P}}_{p}(1)}+\sum_{i=2}^{|{% \mathcal{P}}_{p}|}(n_{{\mathcal{P}}_{p}(i)}-1)\prod_{j=1}^{i-1}N_{{\mathcal{P}% }_{p}(j)}\;\;\mathrm{if}\;\;|{\mathcal{P}}_{p}|>1italic_k start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_n start_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( 1 ) end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT ( italic_n start_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_i ) end_POSTSUBSCRIPT - 1 ) ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_j ) end_POSTSUBSCRIPT roman_if | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | > 1

Note that, according to definition of the ten𝒫()subscriptten𝒫\mathrm{ten}_{\mathcal{P}}(\cdot)roman_ten start_POSTSUBSCRIPT caligraphic_P end_POSTSUBSCRIPT ( ⋅ ) operator, the indexes along the p𝑝pitalic_pth mode of  𝐗  ˇˇ 𝐗  \check{{\leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$% \mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0% .2906pt}}}\hskip 3.375pt}}}overroman_ˇ start_ARG roman_X end_ARG represent tuples (m𝒫p(1),,m𝒫p(|𝒫p|))subscript𝑚subscript𝒫𝑝1subscript𝑚subscript𝒫𝑝subscript𝒫𝑝(m_{{\mathcal{P}}_{p}(1)},...,m_{{\mathcal{P}}_{p}(|{\mathcal{P}}_{p}|)})( italic_m start_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( 1 ) end_POSTSUBSCRIPT , … , italic_m start_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | ) end_POSTSUBSCRIPT ) of indexes of the original tensor 𝐗𝐗\mathbf{X}bold_X .

Clearly, if 𝒫={}𝒫{\mathcal{P}}=\{{\mathcal{I}}\}caligraphic_P = { caligraphic_I }, so that P=|𝒫|=1𝑃𝒫1P=|{\mathcal{P}}|=1italic_P = | caligraphic_P | = 1 and |𝒫1|=Isubscript𝒫1𝐼|{\mathcal{P}}_{1}|=I| caligraphic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | = italic_I, we have that ten𝒫( 𝐗  )=vec( 𝐗  )subscriptten𝒫 𝐗  vec 𝐗  \mathrm{ten}_{\mathcal{P}}({\leavevmode\hbox{\set@color\hskip 3.375pt% \leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{% \set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}})=\mathrm{vec}({% \leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$\mathbf{X% }$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}% }\hskip 3.375pt}})roman_ten start_POSTSUBSCRIPT caligraphic_P end_POSTSUBSCRIPT ( roman_X ) = roman_vec ( roman_X ). On the other hand, if 𝒫={{1},{2},,{I}}𝒫12𝐼{\mathcal{P}}=\{\{1\},\{2\},...,\{I\}\}caligraphic_P = { { 1 } , { 2 } , … , { italic_I } }, so that P=|𝒫|=I𝑃𝒫𝐼P=|{\mathcal{P}}|=Iitalic_P = | caligraphic_P | = italic_I and |𝒫p|=1subscript𝒫𝑝1|{\mathcal{P}}_{p}|=1| caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | = 1 for all p𝑝pitalic_p, we have that ten𝒫( 𝐗  )= 𝐗  ˇsubscriptten𝒫 𝐗  ˇ 𝐗  \mathrm{ten}_{\mathcal{P}}({\leavevmode\hbox{\set@color\hskip 3.375pt% \leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{% \set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}})=\check{{\leavevmode% \hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$\mathbf{X}$}% \raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}% \hskip 3.375pt}}}roman_ten start_POSTSUBSCRIPT caligraphic_P end_POSTSUBSCRIPT ( roman_X ) = overroman_ˇ start_ARG roman_X end_ARG.

Finally, the inverse operator of (2), which recovers the original tensor 𝐗𝐗\mathbf{X}bold_X  using as input the reshaped  𝐗  ˇ=ten𝒫( 𝐗  )ˇ 𝐗  subscriptten𝒫 𝐗  \check{{\leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$% \mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0% .2906pt}}}\hskip 3.375pt}}}=\mathrm{ten}_{\mathcal{P}}({\leavevmode\hbox{% \set@color\hskip 3.375pt\leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4% 906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}})overroman_ˇ start_ARG roman_X end_ARG = roman_ten start_POSTSUBSCRIPT caligraphic_P end_POSTSUBSCRIPT ( roman_X ), is denoted by unten𝒫( 𝐗  ˇ)= 𝐗  subscriptunten𝒫ˇ 𝐗   𝐗  \mathrm{unten}_{\mathcal{P}}(\check{{\leavevmode\hbox{\set@color\hskip 3.375pt% \leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{% \set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}}})={\leavevmode\hbox{% \set@color\hskip 3.375pt\leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4% 906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}}roman_unten start_POSTSUBSCRIPT caligraphic_P end_POSTSUBSCRIPT ( overroman_ˇ start_ARG roman_X end_ARG ) = roman_X. Since the definition of unten𝒫()subscriptunten𝒫\mathrm{unten}_{\mathcal{P}}(\cdot)roman_unten start_POSTSUBSCRIPT caligraphic_P end_POSTSUBSCRIPT ( ⋅ ) starting from (2) is straightforward, it is omitted for conciseness.

2.3 Low-rank PARAFAC tensor decomposition

Consider the I𝐼Iitalic_Ith order tensor 𝐗𝐗\mathbf{X}bold_X  along with the matrices 𝐅iNi×Rsubscript𝐅𝑖superscriptsubscript𝑁𝑖𝑅{\mathbf{F}}_{i}\in{\mathbb{R}}^{N_{i}\times R}bold_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT × italic_R end_POSTSUPERSCRIPT for i=1,,I𝑖1𝐼i=1,...,Iitalic_i = 1 , … , italic_I. Then, 𝐗𝐗\mathbf{X}bold_X  is said to have rank R𝑅Ritalic_R if it can be written as

 𝐗  =r=1R[𝐅1]r[𝐅2]r[𝐅I]r 𝐗  superscriptsubscript𝑟1𝑅subscriptdelimited-[]subscript𝐅1𝑟subscriptdelimited-[]subscript𝐅2𝑟subscriptdelimited-[]subscript𝐅𝐼𝑟{\leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$\mathbf{% X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}% }}\hskip 3.375pt}}=\sum_{r=1}^{R}[{\mathbf{F}}_{1}]_{r}\circledcirc[{\mathbf{F% }}_{2}]_{r}\circledcirc...\circledcirc[{\mathbf{F}}_{I}]_{r}roman_X = ∑ start_POSTSUBSCRIPT italic_r = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT [ bold_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ⊚ [ bold_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ⊚ … ⊚ [ bold_F start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT (3)

where \circledcirc is the generalization of the outer product for more than two vectors. That is, if 𝐱N1𝐱superscriptsubscript𝑁1{\mathbf{x}}\in{\mathbb{R}}^{N_{1}}bold_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, 𝐲N2𝐲superscriptsubscript𝑁2{\mathbf{y}}\in{\mathbb{R}}^{N_{2}}bold_y ∈ blackboard_R start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, 𝐳N3𝐳superscriptsubscript𝑁3{\mathbf{z}}\in{\mathbb{R}}^{N_{3}}bold_z ∈ blackboard_R start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT are three generic vectors, then 𝐱𝐲𝐳𝐱𝐲𝐳{\mathbf{x}}\circledcirc{\mathbf{y}}\circledcirc{\mathbf{z}}bold_x ⊚ bold_y ⊚ bold_z is a tensor of order I=3𝐼3I=3italic_I = 3 satisfying [𝐱𝐲𝐳]n1,n2,n3=[𝐱]n1[𝐲]n2[𝐳]n3subscriptdelimited-[]𝐱𝐲𝐳subscript𝑛1subscript𝑛2subscript𝑛3subscriptdelimited-[]𝐱subscript𝑛1subscriptdelimited-[]𝐲subscript𝑛2subscriptdelimited-[]𝐳subscript𝑛3[{\mathbf{x}}\circledcirc{\mathbf{y}}\circledcirc{\mathbf{z}}]_{n_{1},n_{2},n_% {3}}=[{\mathbf{x}}]_{n_{1}}[{\mathbf{y}}]_{n_{2}}[{\mathbf{z}}]_{n_{3}}\in{% \mathbb{R}}[ bold_x ⊚ bold_y ⊚ bold_z ] start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = [ bold_x ] start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ bold_y ] start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ bold_z ] start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ blackboard_R.

The decomposition in (3) is oftentimes referred to as canonical polyadic decomposition or PARAFAC decomposition, with matrices 𝐅isubscript𝐅𝑖{\mathbf{F}}_{i}bold_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT being referred to as factors. As in the case of matrices, moderate values of R𝑅Ritalic_R induce a parsimonious description of the tensor, since the i=1INisuperscriptsubscriptproduct𝑖1𝐼subscript𝑁𝑖\prod_{i=1}^{I}N_{i}∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT values in 𝐗𝐗\mathbf{X}bold_X  can be equivalently represented by the i=1IRNisuperscriptsubscript𝑖1𝐼𝑅subscript𝑁𝑖\sum_{i=1}^{I}RN_{i}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_R italic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT entries in {𝐅i}i=1Isuperscriptsubscriptsubscript𝐅𝑖𝑖1𝐼\{{\mathbf{F}}_{i}\}_{i=1}^{I}{ bold_F start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT.

Using the Khatri-Rao product, denoted as direct-product\odot, and the different unfolding operators introduced in the previous sections, we have that

matp( 𝐗  )subscriptmat𝑝 𝐗  \displaystyle\mathrm{mat}_{p}({\leavevmode\hbox{\set@color\hskip 3.375pt% \leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{% \set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}})\!\!\!\!roman_mat start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( roman_X ) =\displaystyle== r=1Rmatp([𝐅1]r[𝐅2]r[𝐅I]r)superscriptsubscript𝑟1𝑅subscriptmat𝑝subscriptdelimited-[]subscript𝐅1𝑟subscriptdelimited-[]subscript𝐅2𝑟subscriptdelimited-[]subscript𝐅𝐼𝑟\displaystyle\!\!\!\!\!\!\sum_{r=1}^{R}\mathrm{mat}_{p}([{\mathbf{F}}_{1}]_{r}% \circledcirc[{\mathbf{F}}_{2}]_{r}\circledcirc...\circledcirc[{\mathbf{F}}_{I}% ]_{r})∑ start_POSTSUBSCRIPT italic_r = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT roman_mat start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( [ bold_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ⊚ [ bold_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ⊚ … ⊚ [ bold_F start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ) (4)
=\displaystyle== (𝐅I𝐅p+1𝐅p1𝐅1)(𝐅p)Tdirect-productsubscript𝐅𝐼subscript𝐅𝑝1subscript𝐅𝑝1subscript𝐅1superscriptsubscript𝐅𝑝𝑇\displaystyle\!\!\!\!\!\!({\mathbf{F}}_{I}\odot...\odot{\mathbf{F}}_{p+1}\odot% {\mathbf{F}}_{p-1}\odot...\odot{\mathbf{F}}_{1})({\mathbf{F}}_{p})^{T}~{}~{}~{% }~{}( bold_F start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ⊙ … ⊙ bold_F start_POSTSUBSCRIPT italic_p + 1 end_POSTSUBSCRIPT ⊙ bold_F start_POSTSUBSCRIPT italic_p - 1 end_POSTSUBSCRIPT ⊙ … ⊙ bold_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ( bold_F start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT
ten𝒫( 𝐗  )subscriptten𝒫 𝐗  \displaystyle\mathrm{ten}_{{\mathcal{P}}}({\leavevmode\hbox{\set@color\hskip 3% .375pt\leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode% \hbox{\set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}})\!\!roman_ten start_POSTSUBSCRIPT caligraphic_P end_POSTSUBSCRIPT ( roman_X ) =\displaystyle== r=1Rten𝒫([𝐅1]r[𝐅2]r[𝐅I]r)superscriptsubscript𝑟1𝑅subscriptten𝒫subscriptdelimited-[]subscript𝐅1𝑟subscriptdelimited-[]subscript𝐅2𝑟subscriptdelimited-[]subscript𝐅𝐼𝑟\displaystyle\!\!\!\sum_{r=1}^{R}\mathrm{ten}_{{\mathcal{P}}}([{\mathbf{F}}_{1% }]_{r}\circledcirc[{\mathbf{F}}_{2}]_{r}\circledcirc...\circledcirc[{\mathbf{F% }}_{I}]_{r})∑ start_POSTSUBSCRIPT italic_r = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT roman_ten start_POSTSUBSCRIPT caligraphic_P end_POSTSUBSCRIPT ( [ bold_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ⊚ [ bold_F start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ⊚ … ⊚ [ bold_F start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT )
=\displaystyle== r=1R([𝐅ˇ1]r[𝐅ˇ2]r[𝐅ˇP]r)superscriptsubscript𝑟1𝑅subscriptdelimited-[]subscriptˇ𝐅1𝑟subscriptdelimited-[]subscriptˇ𝐅2𝑟subscriptdelimited-[]subscriptˇ𝐅𝑃𝑟\displaystyle\!\!\!\sum_{r=1}^{R}([\check{{\mathbf{F}}}_{1}]_{r}\circledcirc[% \check{{\mathbf{F}}}_{2}]_{r}\circledcirc...\circledcirc[\check{{\mathbf{F}}}_% {P}]_{r})∑ start_POSTSUBSCRIPT italic_r = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT ( [ overroman_ˇ start_ARG bold_F end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ⊚ [ overroman_ˇ start_ARG bold_F end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ⊚ … ⊚ [ overroman_ˇ start_ARG bold_F end_ARG start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT )
withwith\displaystyle\mathrm{with}\!\!roman_with 𝐅ˇp=𝐅𝒫p(|𝒫p|)𝐅𝒫p(2)𝐅𝒫p(1).subscriptˇ𝐅𝑝direct-productsubscript𝐅subscript𝒫𝑝subscript𝒫𝑝subscript𝐅subscript𝒫𝑝2subscript𝐅subscript𝒫𝑝1\displaystyle\!\!\!\check{{\mathbf{F}}}_{p}={\mathbf{F}}_{{\mathcal{P}}_{p}(|{% \mathcal{P}}_{p}|)}\odot...\odot{\mathbf{F}}_{{\mathcal{P}}_{p}(2)}\odot{% \mathbf{F}}_{{\mathcal{P}}_{p}(1)}.~{}~{}~{}~{}\hfilloverroman_ˇ start_ARG bold_F end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = bold_F start_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT | ) end_POSTSUBSCRIPT ⊙ … ⊙ bold_F start_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( 2 ) end_POSTSUBSCRIPT ⊙ bold_F start_POSTSUBSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( 1 ) end_POSTSUBSCRIPT . (5)

These expressions will be leveraged in the next section.

3 Multi-resolution low-rank decomposition

Consider a collection of partitions 𝒫(1)superscript𝒫1{\mathcal{P}}^{(1)}caligraphic_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT,…,𝒫(L)superscript𝒫𝐿{\mathcal{P}}^{(L)}caligraphic_P start_POSTSUPERSCRIPT ( italic_L ) end_POSTSUPERSCRIPT, with |𝒫(l)||𝒫(l)|superscript𝒫𝑙superscript𝒫superscript𝑙|{\mathcal{P}}^{(l)}|\leq|{\mathcal{P}}^{(l^{\prime})}|| caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT | ≤ | caligraphic_P start_POSTSUPERSCRIPT ( italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT | for l<l𝑙superscript𝑙l<l^{\prime}italic_l < italic_l start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. Given the I𝐼Iitalic_Ith order tensor 𝐗𝐗\mathbf{X}bold_X  and the collection of partitions 𝒫(1)superscript𝒫1{\mathcal{P}}^{(1)}caligraphic_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT,…,𝒫(L)superscript𝒫𝐿{\mathcal{P}}^{(L)}caligraphic_P start_POSTSUPERSCRIPT ( italic_L ) end_POSTSUPERSCRIPT, we propose the following decomposition for the tensor at hand

𝐗𝐗\mathbf{X}bold_X  =l=1L 𝐙  l,withrank(ten𝒫(l)( 𝐙  l))Rl,formulae-sequenceabsentsuperscriptsubscript𝑙1𝐿subscript 𝐙  𝑙withranksubscripttensuperscript𝒫𝑙subscript 𝐙  𝑙subscript𝑅𝑙\displaystyle=\sum_{l=1}^{L}{\leavevmode\hbox{\set@color\hskip 2.75pt% \leavevmode\hbox{\set@color$\mathbf{Z}$}\raisebox{-1.4906pt}{\leavevmode\hbox{% \set@color\rule{3.09999pt}{0.2906pt}}}\hskip 2.75pt}}_{l},\;\;\mathrm{with}\;% \;\mathrm{rank}\big{(}\mathrm{ten}_{{\mathcal{P}}^{(l)}}({\leavevmode\hbox{% \set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z}$}\raisebox{-1.49% 06pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}\hskip 2.75pt}}_{% l})\big{)}\leq R_{l},\hfill= ∑ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , roman_with roman_rank ( roman_ten start_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) ) ≤ italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , (6)

which can be equivalently written as

𝐗𝐗\mathbf{X}bold_X  =l=1Lunten𝒫(l)( 𝐙  ˇl),withrank( 𝐙  ˇl)Rl.formulae-sequenceabsentsuperscriptsubscript𝑙1𝐿subscriptuntensuperscript𝒫𝑙subscriptˇ 𝐙  𝑙withranksubscriptˇ 𝐙  𝑙subscript𝑅𝑙\displaystyle=\sum_{l=1}^{L}\mathrm{unten}_{{\mathcal{P}}^{(l)}}(\check{{% \leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z}% $}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}% \hskip 2.75pt}}}_{l}),\;\;\mathrm{with}\;\;\mathrm{rank}\big{(}\check{{% \leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z}% $}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}% \hskip 2.75pt}}}_{l}\big{)}\leq R_{l}.\hfill= ∑ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT roman_unten start_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( overroman_ˇ start_ARG roman_Z end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) , roman_with roman_rank ( overroman_ˇ start_ARG roman_Z end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) ≤ italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT . (7)

where Rlsubscript𝑅𝑙R_{l}italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT is the rank of the tensor associated to the l𝑙litalic_l partition.

Number of parameters: As already explained, one of the most meaningful implications of low-rank tensor models is the fact that they provide a parsimonious description of the tensor, reducing its implicit number of degrees of freedom. The same is true for the decomposition in (6). To be concrete, the tensor  𝐙  ˇl=ten𝒫(l)( 𝐙  l)subscriptˇ 𝐙  𝑙subscripttensuperscript𝒫𝑙subscript 𝐙  𝑙\check{{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$% \mathbf{Z}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0% .2906pt}}}\hskip 2.75pt}}}_{l}=\mathrm{ten}_{{\mathcal{P}}^{(l)}}({\leavevmode% \hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z}$}\raisebox% {-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}\hskip 2.75% pt}}_{l})overroman_ˇ start_ARG roman_Z end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = roman_ten start_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) has order P(l)=|𝒫(l)|superscript𝑃𝑙superscript𝒫𝑙P^{(l)}=|{\mathcal{P}}^{(l)}|italic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT = | caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT |, with the dimension of the p𝑝pitalic_pth mode being j=1|𝒫p(l)||𝒫p(l)(j)|superscriptsubscriptproduct𝑗1superscriptsubscript𝒫𝑝𝑙superscriptsubscript𝒫𝑝𝑙𝑗\prod_{j=1}^{|{\mathcal{P}}_{p}^{(l)}|}|{\mathcal{P}}_{p}^{(l)}(j)|∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT | end_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT ( italic_j ) |. As a result,  𝐙  ˇlsubscriptˇ 𝐙  𝑙\check{{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$% \mathbf{Z}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0% .2906pt}}}\hskip 2.75pt}}}_{l}overroman_ˇ start_ARG roman_Z end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT having rank Rlsubscript𝑅𝑙R_{l}italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT implies that

Rlp=1P(l)j=1|𝒫p(l)||𝒫p(l)(j)|subscript𝑅𝑙superscriptsubscript𝑝1superscript𝑃𝑙superscriptsubscriptproduct𝑗1superscriptsubscript𝒫𝑝𝑙superscriptsubscript𝒫𝑝𝑙𝑗R_{l}\sum_{p=1}^{P^{(l)}}\prod_{j=1}^{|{\mathcal{P}}_{p}^{(l)}|}|{\mathcal{P}}% _{p}^{(l)}(j)|italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_p = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT | end_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT ( italic_j ) |

parameters suffice to fully describe the i=1INisuperscriptsubscriptproduct𝑖1𝐼subscript𝑁𝑖\prod_{i=1}^{I}N_{i}∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT entries in  𝐙  ˇlsubscriptˇ 𝐙  𝑙\check{{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$% \mathbf{Z}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0% .2906pt}}}\hskip 2.75pt}}}_{l}overroman_ˇ start_ARG roman_Z end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT. Summing across the different L𝐿Litalic_L factors implies that

l=1LRlp=1P(l)j=1|𝒫p(l)||𝒫p(l)(j)|superscriptsubscript𝑙1𝐿subscript𝑅𝑙superscriptsubscript𝑝1superscript𝑃𝑙superscriptsubscriptproduct𝑗1superscriptsubscript𝒫𝑝𝑙superscriptsubscript𝒫𝑝𝑙𝑗\sum_{l=1}^{L}R_{l}\sum_{p=1}^{P^{(l)}}\prod_{j=1}^{|{\mathcal{P}}_{p}^{(l)}|}% |{\mathcal{P}}_{p}^{(l)}(j)|∑ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_p = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT | end_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT ( italic_j ) |

parameters suffice to fully describe the i=1INisuperscriptsubscriptproduct𝑖1𝐼subscript𝑁𝑖\prod_{i=1}^{I}N_{i}∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT entries in 𝐙𝐙\mathbf{Z}bold_Z .

4 Algorithmic implementation

The decomposition introduced in (6) can be obtained by solving the following minimization problem:

min 𝐙  1 𝐙  L 𝐗  l=1L 𝐙  lFsubscriptsubscript 𝐙  1subscript 𝐙  𝐿subscriptnorm 𝐗  superscriptsubscript𝑙1𝐿subscript 𝐙  𝑙𝐹\displaystyle\min_{\footnotesize{\leavevmode\hbox{\set@color\hskip 2.44444pt% \leavevmode\hbox{\set@color$\mathbf{Z}$}\raisebox{-1.45831pt}{\leavevmode\hbox% {\set@color\rule{2.75555pt}{0.25832pt}}}\hskip 2.44444pt}}_{1}...{\leavevmode% \hbox{\set@color\hskip 2.44444pt\leavevmode\hbox{\set@color$\mathbf{Z}$}% \raisebox{-1.45831pt}{\leavevmode\hbox{\set@color\rule{2.75555pt}{0.25832pt}}}% \hskip 2.44444pt}}_{L}}\Big{\|}{\leavevmode\hbox{\set@color\hskip 3.375pt% \leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{% \set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}}-\sum_{l=1}^{L}{% \leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z}% $}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}% \hskip 2.75pt}}_{l}\Big{\|}_{F}roman_min start_POSTSUBSCRIPT roman_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT … roman_Z start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ roman_X - ∑ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT (8)
s. t.rank(ten𝒫(l)( 𝐙  l))Rl.s. t.ranksubscripttensuperscript𝒫𝑙subscript 𝐙  𝑙subscript𝑅𝑙\displaystyle\text{s. t.}\;\;\mathrm{rank}\big{(}\mathrm{ten}_{{\mathcal{P}}^{% (l)}}({\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$% \mathbf{Z}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0% .2906pt}}}\hskip 2.75pt}}_{l})\big{)}\leq R_{l}.s. t. roman_rank ( roman_ten start_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) ) ≤ italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT .

The approach proposed in this section is to estimate each of the L𝐿Litalic_L tensors sequentially, so that when optimizing with respect to  𝐙  isubscript 𝐙  𝑖{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z% }$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}% }\hskip 2.75pt}}_{i}roman_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT the remaining tensors  𝐙  lsubscript 𝐙  𝑙{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z% }$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}% }\hskip 2.75pt}}_{l}roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT with li𝑙𝑖l\neq iitalic_l ≠ italic_i are kept fixed. As a result, the minimization problem to be solved in the i𝑖iitalic_ith step is:

min 𝐙  i 𝐗  liL 𝐙  l 𝐙  iFsubscriptsubscript 𝐙  𝑖subscriptnorm 𝐗  superscriptsubscript𝑙𝑖𝐿subscript 𝐙  𝑙subscript 𝐙  𝑖𝐹\displaystyle\min_{\footnotesize{\leavevmode\hbox{\set@color\hskip 2.44444pt% \leavevmode\hbox{\set@color$\mathbf{Z}$}\raisebox{-1.45831pt}{\leavevmode\hbox% {\set@color\rule{2.75555pt}{0.25832pt}}}\hskip 2.44444pt}}_{i}}\Big{\|}{% \leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$\mathbf{X% }$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}% }\hskip 3.375pt}}-\sum_{l\neq i}^{L}{\leavevmode\hbox{\set@color\hskip 2.75pt% \leavevmode\hbox{\set@color$\mathbf{Z}$}\raisebox{-1.4906pt}{\leavevmode\hbox{% \set@color\rule{3.09999pt}{0.2906pt}}}\hskip 2.75pt}}_{l}-{\leavevmode\hbox{% \set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z}$}\raisebox{-1.49% 06pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}\hskip 2.75pt}}_{% i}\Big{\|}_{F}roman_min start_POSTSUBSCRIPT roman_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ roman_X - ∑ start_POSTSUBSCRIPT italic_l ≠ italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT - roman_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT (9)
s. t.rank(ten𝒫(i)( 𝐙  i))Ris. t.ranksubscripttensuperscript𝒫𝑖subscript 𝐙  𝑖subscript𝑅𝑖\displaystyle\text{s. t.}\;\;\mathrm{rank}\big{(}\mathrm{ten}_{{\mathcal{P}}^{% (i)}}({\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$% \mathbf{Z}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0% .2906pt}}}\hskip 2.75pt}}_{i})\big{)}\leq R_{i}s. t. roman_rank ( roman_ten start_POSTSUBSCRIPT caligraphic_P start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( roman_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ≤ italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT

for i=1,,L𝑖1𝐿i=1,...,Litalic_i = 1 , … , italic_L. The constraint in (9) can be handled using a PARAFAC decomposition

 𝐙  i=j=1Ri[𝐇1i]j[𝐇Jii]j,subscript 𝐙  𝑖superscriptsubscript𝑗1subscript𝑅𝑖subscriptdelimited-[]superscriptsubscript𝐇1𝑖𝑗subscriptdelimited-[]superscriptsubscript𝐇subscript𝐽𝑖𝑖𝑗{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z% }$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}% }\hskip 2.75pt}}_{i}=\sum_{j=1}^{R_{i}}[{\mathbf{H}}_{1}^{i}]_{j}\circledcirc.% ..\circledcirc[{\mathbf{H}}_{J_{i}}^{i}]_{j},roman_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT [ bold_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⊚ … ⊚ [ bold_H start_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , (10)

so that (9) can be equivalently formulated as:

min𝐇1i,,𝐇Jii 𝐗  liL 𝐙  lj=1Ri[𝐇1i]j[𝐇Jii]jF.subscriptsuperscriptsubscript𝐇1𝑖superscriptsubscript𝐇subscript𝐽𝑖𝑖subscriptnorm 𝐗  superscriptsubscript𝑙𝑖𝐿subscript 𝐙  𝑙superscriptsubscript𝑗1subscript𝑅𝑖subscriptdelimited-[]superscriptsubscript𝐇1𝑖𝑗subscriptdelimited-[]superscriptsubscript𝐇subscript𝐽𝑖𝑖𝑗𝐹\min_{{\mathbf{H}}_{1}^{i},...,{\mathbf{H}}_{J_{i}}^{i}}\Big{\|}{\leavevmode% \hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$\mathbf{X}$}% \raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}% \hskip 3.375pt}}-\sum_{l\neq i}^{L}{\leavevmode\hbox{\set@color\hskip 2.75pt% \leavevmode\hbox{\set@color$\mathbf{Z}$}\raisebox{-1.4906pt}{\leavevmode\hbox{% \set@color\rule{3.09999pt}{0.2906pt}}}\hskip 2.75pt}}_{l}-\sum_{j=1}^{R_{i}}[{% \mathbf{H}}_{1}^{i}]_{j}\circledcirc...\circledcirc[{\mathbf{H}}_{J_{i}}^{i}]_% {j}\Big{\|}_{F}.roman_min start_POSTSUBSCRIPT bold_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , … , bold_H start_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ roman_X - ∑ start_POSTSUBSCRIPT italic_l ≠ italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT [ bold_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⊚ … ⊚ [ bold_H start_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT . (11)

The above problem is non-convex, but fixing all but one of the factors (say the j𝑗jitalic_jth one), it becomes linear in 𝐇jisuperscriptsubscript𝐇𝑗𝑖{\mathbf{H}}_{j}^{i}bold_H start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT. Under this approach and unfolding the tensor into a matrix 𝐗^i=matp( 𝐗  liL 𝐙  l)superscript^𝐗𝑖subscriptmat𝑝 𝐗  superscriptsubscript𝑙𝑖𝐿subscript 𝐙  𝑙\hat{{\mathbf{X}}}^{i}=\mathrm{mat}_{p}({\leavevmode\hbox{\set@color\hskip 3.3% 75pt\leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode% \hbox{\set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}}-\sum_{l\neq i}^{L% }{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{% Z}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}% }}\hskip 2.75pt}}_{l})over^ start_ARG bold_X end_ARG start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT = roman_mat start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( roman_X - ∑ start_POSTSUBSCRIPT italic_l ≠ italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ), we have the following update rule to constructing an Alternating Least Squares (ALS) algorithm:

min𝐇ji𝐗^i(𝐇Jii𝐇j+1i𝐇j1i𝐇1i)(𝐇ji)TF,subscriptsuperscriptsubscript𝐇𝑗𝑖subscriptnormsuperscript^𝐗𝑖direct-productsuperscriptsubscript𝐇subscript𝐽𝑖𝑖superscriptsubscript𝐇𝑗1𝑖superscriptsubscript𝐇𝑗1𝑖superscriptsubscript𝐇1𝑖superscriptsuperscriptsubscript𝐇𝑗𝑖𝑇𝐹\min_{{\mathbf{H}}_{j}^{i}}||\hat{{\mathbf{X}}}^{i}-({\mathbf{H}}_{J_{i}}^{i}% \odot...\odot{\mathbf{H}}_{j+1}^{i}\odot{\mathbf{H}}_{j-1}^{i}\odot...\odot{% \mathbf{H}}_{1}^{i})({\mathbf{H}}_{j}^{i})^{T}||_{F},roman_min start_POSTSUBSCRIPT bold_H start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | | over^ start_ARG bold_X end_ARG start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT - ( bold_H start_POSTSUBSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ⊙ … ⊙ bold_H start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ⊙ bold_H start_POSTSUBSCRIPT italic_j - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ⊙ … ⊙ bold_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) ( bold_H start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT | | start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT , (12)

for all j=1,,Ji𝑗1subscript𝐽𝑖j=1,...,{J_{i}}italic_j = 1 , … , italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Once the Jisubscript𝐽𝑖{J_{i}}italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT factors {𝐇ji}j=1Jisuperscriptsubscriptsuperscriptsubscript𝐇𝑗𝑖𝑗1subscript𝐽𝑖\{{\mathbf{H}}_{j}^{i}\}_{j=1}^{J_{i}}{ bold_H start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT have been obtained, then a) the i𝑖iitalic_ith tensor  𝐙  isubscript 𝐙  𝑖{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z% }$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}% }\hskip 2.75pt}}_{i}roman_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is found using (10) and b) the problem in (9) is solved for the next i𝑖iitalic_i, with i=1,,L𝑖1𝐿i=1,...,Litalic_i = 1 , … , italic_L. As a result, i=1LJisuperscriptsubscript𝑖1𝐿subscript𝐽𝑖\sum_{i=1}^{L}J_{i}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT italic_J start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT instances of (12) need to be run. Note that, when solving (8) via (9)-(12), the order matters. The first  𝐙  lsubscript 𝐙  𝑙{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z% }$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}% }\hskip 2.75pt}}_{l}roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT to be estimated provides the main (coarser) approximation, while the subsequent ones try to fit the residual error between the main tensor 𝐗𝐗\mathbf{X}bold_X  and the sum of the previously estimated components  𝐙  lsubscript 𝐙  𝑙{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z% }$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}% }\hskip 2.75pt}}_{l}roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT, providing a finer approximation. Due to the structure 𝒫(l)superscript𝒫𝑙{\mathcal{P}}^{(l)}caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT, which carries over  𝐙  lsubscript 𝐙  𝑙{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf{Z% }$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}% }\hskip 2.75pt}}_{l}roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT, the order in which the tensors { 𝐙  l}l=1Lsuperscriptsubscriptsubscript 𝐙  𝑙𝑙1𝐿\{{\leavevmode\hbox{\set@color\hskip 2.75pt\leavevmode\hbox{\set@color$\mathbf% {Z}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt% }}}\hskip 2.75pt}}_{l}\}_{l=1}^{L}{ roman_Z start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT are approximated is expected to generate variations in the results.

4.1 Constructing the partitions

The algorithm in the previous section assumes that the partitions 𝒫(1)superscript𝒫1{\mathcal{P}}^{(1)}caligraphic_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT,…,𝒫(L)superscript𝒫𝐿{\mathcal{P}}^{(L)}caligraphic_P start_POSTSUPERSCRIPT ( italic_L ) end_POSTSUPERSCRIPT are given. A simple generic approach to design 𝒫(1)superscript𝒫1{\mathcal{P}}^{(1)}caligraphic_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT,…,𝒫(L)superscript𝒫𝐿{\mathcal{P}}^{(L)}caligraphic_P start_POSTSUPERSCRIPT ( italic_L ) end_POSTSUPERSCRIPT is to rely on a regular multiresolution construction that splits the index set ={1,2,,I}12𝐼{\mathcal{I}}=\{1,2,...,I\}caligraphic_I = { 1 , 2 , … , italic_I } into smaller sets with the same cardinality. More specifically, one can implement a sequential design with L=I1𝐿𝐼1L=I-1italic_L = italic_I - 1 steps for which, at step l{1,,L}𝑙1𝐿l\in\{1,...,L\}italic_l ∈ { 1 , … , italic_L } we split {\mathcal{I}}caligraphic_I into l+1𝑙1l+1italic_l + 1 index sets with (approximately) the same number of elements. The collection of L=I1𝐿𝐼1L=I-1italic_L = italic_I - 1 partitions 𝒫(1)superscript𝒫1{\mathcal{P}}^{(1)}caligraphic_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT,…,𝒫(L)superscript𝒫𝐿{\mathcal{P}}^{(L)}caligraphic_P start_POSTSUPERSCRIPT ( italic_L ) end_POSTSUPERSCRIPT is then naturally given by grou** together the sets obtained in each of those steps. To be more clear, let \lfloor\cdot\rfloor⌊ ⋅ ⌋ and \lceil\cdot\rceil⌈ ⋅ ⌉ be the floor and ceil operators and consider the collection of partitions 𝒫(1)superscript𝒫1{\mathcal{P}}^{(1)}caligraphic_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT,…,𝒫(L)superscript𝒫𝐿{\mathcal{P}}^{(L)}caligraphic_P start_POSTSUPERSCRIPT ( italic_L ) end_POSTSUPERSCRIPT with L=I1𝐿𝐼1L=I-1italic_L = italic_I - 1 and where the l𝑙litalic_lth element is given by

𝒫(l)superscript𝒫𝑙\displaystyle{\mathcal{P}}^{(l)}\!\!\!\!caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT =\displaystyle== {𝒫n(l)}n=1l+1,withsuperscriptsubscriptsubscriptsuperscript𝒫𝑙𝑛𝑛1𝑙1with\displaystyle\!\!\!\!\big{\{}{\mathcal{P}}^{(l)}_{n}\big{\}}_{n=1}^{l+1},\;% \text{with}{ caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l + 1 end_POSTSUPERSCRIPT , with
𝒫n(l)={(n1)I/(l+1),,nI/(l+1)}.subscriptsuperscript𝒫𝑙𝑛𝑛1𝐼𝑙1𝑛𝐼𝑙1\displaystyle{\mathcal{P}}^{(l)}_{n}\!=\!\big{\{}\lceil(n-1)I/(l+1)\rceil,...,% \lfloor nI/(l+1)\rfloor\big{\}}.caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = { ⌈ ( italic_n - 1 ) italic_I / ( italic_l + 1 ) ⌉ , … , ⌊ italic_n italic_I / ( italic_l + 1 ) ⌋ } .

In the above definition we have adopted the convention that, if x𝑥xitalic_x is a whole positive number, x=x𝑥𝑥\lfloor x\rfloor=x⌊ italic_x ⌋ = italic_x and x=x+1𝑥𝑥1\lceil x\rceil=x+1⌈ italic_x ⌉ = italic_x + 1. Clearly, the partition design in (4.1) is regular in the sense that it achieves |𝒫(l)|=l+1superscript𝒫𝑙𝑙1|{\mathcal{P}}^{(l)}|=l+1| caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT | = italic_l + 1 for all l𝑙litalic_l and |𝒫(l)(n)|I/(l+1)superscript𝒫𝑙𝑛𝐼𝑙1|{\mathcal{P}}^{(l)}(n)|\approx I/(l+1)| caligraphic_P start_POSTSUPERSCRIPT ( italic_l ) end_POSTSUPERSCRIPT ( italic_n ) | ≈ italic_I / ( italic_l + 1 ) for n=1,,l+1𝑛1𝑙1n=1,...,l+1italic_n = 1 , … , italic_l + 1.

To gain insights, suppose for simplicity that our tensor 𝐗𝐗\mathbf{X}bold_X  of order I𝐼Iitalic_I has size η××η𝜂𝜂\eta\times...\times\etaitalic_η × … × italic_η, i.e., that the value of Nisubscript𝑁𝑖N_{i}italic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the same across modes, then the number of parameters required to represent 𝐗𝐗\mathbf{X}bold_X  using the model in (6) and the partitions in (4.1) is approximately

l=1I1Rl(l+1)ηI/(l+l),superscriptsubscript𝑙1𝐼1subscript𝑅𝑙𝑙1superscript𝜂𝐼𝑙𝑙\displaystyle\sum_{l=1}^{I-1}R_{l}(l+1)\eta^{I/(l+l)},∑ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I - 1 end_POSTSUPERSCRIPT italic_R start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ( italic_l + 1 ) italic_η start_POSTSUPERSCRIPT italic_I / ( italic_l + italic_l ) end_POSTSUPERSCRIPT , (14)

which contrasts with the i=1INi=ηIsuperscriptsubscriptproduct𝑖1𝐼subscript𝑁𝑖superscript𝜂𝐼\prod_{i=1}^{I}N_{i}=\eta^{I}∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_η start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT entries in 𝐗𝐗\mathbf{X}bold_X .

Clearly, alternative ways to build the partitions 𝒫(1)superscript𝒫1{\mathcal{P}}^{(1)}caligraphic_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT,…,𝒫(L)superscript𝒫𝐿{\mathcal{P}}^{(L)}caligraphic_P start_POSTSUPERSCRIPT ( italic_L ) end_POSTSUPERSCRIPT are possible. This is especially relevant when prior knowledge exists and one can leverage it to group indexes based on known (di-)similarities among the underlying dimensions. Due to space limitations discussing such alternative partition techniques is out of the scope of this manuscript, but it is part of our ongoing work.

5 Numerical experiments

The multi-resolution low-rank (MRLR) tensor decomposition scheme is numerically tested in three different scenarios: the first dealing with an amino acids dataset [13], the second one with a video signal [14], and the third one to approximate a multivariate function. The amino acids dataset is a three-mode tensor of size 5×201×615201615\times 201\times 615 × 201 × 61. The video signal is composed of 173 frames of 1080×72010807201080\times 7201080 × 720 pixels each and three channels (R, G, and B). To reduce the computational and memory complexity requirements of the problem, the frames have been sub-sampled and the resolution has been lowered, resulting in a final four-mode tensor of size 9×36×54×39365439\times 36\times 54\times 39 × 36 × 54 × 3. Finally, the multidimensional function in the last scenario has 3superscript3{\mathbb{R}}^{3}blackboard_R start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT as its domain, with each of the three dimensions being discretized using 100 points, so that a tensor with 106superscript10610^{6}10 start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT entries is obtained. The Tensorly Python package is used to benchmark the MRLR tensor decomposition against other tensor decomposition algorithms [15].

The amino acids tensor 𝐗𝐗\mathbf{X}bold_X  is approximated using a hierarchical structure of a matrix plus a three-mode tensor. The matrix can be build by unfolding the 5×201×615201615\times 201\times 615 × 201 × 61 tensor in different ways. Here, two reshapes have been studied, a 201×305201305201\times 305201 × 305 unfolding (res-1), and a 1005×611005611005\times 611005 × 61 unfolding (aka res-2). The structure of the algorithm resembles that of a gradient-boosting-like approach [16]. First, the initial tensor is approximated by a low-rank structure. Then, the residual is approximated by a low-rank structure too. Subsequent residuals are also approximated if necessary. This sequential process can be started from the coarser unfolding, the matrix, or the other way around (reverse). In this experiment, both alternatives have been tested. The rank of the matrix unfolding is fixed while the rank of the three-mode tensor is gradually increased.

Refer to caption
Fig. 1: Normalized Squared Frobenius Error (15) between the original 5×201×615201615\times 201\times 615 × 201 × 61 amino acids tensor and its approximation obtained via the MRLR and the PARAFAC tensor decompositions when the number of parameters (tensor rank) is increased.

The performance of the algorithms has been measured in terms of Normalized Frobenius Error (NFENFE\mathrm{NFE}roman_NFE) between the true tensor 𝐗𝐗\mathbf{X}bold_X  and the approximation  𝐗  ˇˇ 𝐗  \check{{\leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{\set@color$% \mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3.09999pt}{0% .2906pt}}}\hskip 3.375pt}}}overroman_ˇ start_ARG roman_X end_ARG, which is given by

NFE= 𝐗   𝐗  ˇF/ 𝐗  F.NFEsubscriptnorm 𝐗  ˇ 𝐗  𝐹subscriptnorm 𝐗  𝐹\mathrm{NFE}={||{\leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{% \set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3% .09999pt}{0.2906pt}}}\hskip 3.375pt}}-\check{{\leavevmode\hbox{\set@color% \hskip 3.375pt\leavevmode\hbox{\set@color$\mathbf{X}$}\raisebox{-1.4906pt}{% \leavevmode\hbox{\set@color\rule{3.09999pt}{0.2906pt}}}\hskip 3.375pt}}}||_{F}% }\big{/}{||{\leavevmode\hbox{\set@color\hskip 3.375pt\leavevmode\hbox{% \set@color$\mathbf{X}$}\raisebox{-1.4906pt}{\leavevmode\hbox{\set@color\rule{3% .09999pt}{0.2906pt}}}\hskip 3.375pt}}||_{F}}.roman_NFE = | | roman_X - overroman_ˇ start_ARG roman_X end_ARG | | start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT / | | roman_X | | start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT . (15)

The results are reported in Fig. 1. The MRLR decomposition is compared to the PARAFAC decomposition. The res-1 unfolding of the matrix (square-like unfolding) seems to perform better than the res-2 unfolding (tall unfolding). Then, the approximation from the coarser to the finer arrangement beats the reverse one. Moreover, all the MRLR schemes outperform the PARAFAC one in terms of NFENFE\mathrm{NFE}roman_NFE for the same number of parameters. Indeed, the best-performing MRLR algorithm obtains roughly the same NFENFE\mathrm{NFE}roman_NFE as the PARAFAC decomposition using 10,0001000010,00010 , 000 parameters less approximately.

In the second test case, the four-mode video tensor 𝐗𝐗\mathbf{X}bold_X  is unfolded into a 324×162324162324\times 162324 × 162 matrix and a 9×36×1629361629\times 36\times 1629 × 36 × 162 three-mode tensor. The ranks of the matrix and the three-mode tensors have been fixed to 1. The rank of the four-mode tensor approximation is gradually increased. The results are provided in Fig. 2. Again, the coarser-to-finer arrangement outperforms both, the reverse (finer-to-coarser) arrangement, and the PARAFAC decomposition. It needs approximately 1,50015001,5001 , 500 parameters less to achieve the same NFENFE\mathrm{NFE}roman_NFE.

Refer to caption
Fig. 2: Normalized Squared Frobenius Error (15) between the original 9×36×54×39365439\times 36\times 54\times 39 × 36 × 54 × 3 video signal tensor and its approximation obtained via the MRLR and the PARAFAC tensor decompositions when the number of parameters (tensor rank) is increased.

Finally, we tested the MRLR tensor decomposition in a third test case to approximate a multivariate function. Given a set of I𝐼Iitalic_I input variables, with xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denoting the i𝑖iitalic_ith input variable and 𝒳isubscript𝒳𝑖{\mathcal{X}}_{i}caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT the set of all possible values of xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, we are interested in functions that map any element (x1,,xI)𝒳subscript𝑥1subscript𝑥𝐼𝒳(x_{1},...,x_{I})\in{\mathcal{X}}( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) ∈ caligraphic_X into a real value. When these functions are discrete, tensors can be used to model them efficiently. Continuous functions can be discretized/quantized. Tensor decomposition methods can then be leveraged for applications such as approximation, or denoising [17]. In such a context, we tested the MRLR tensor decomposition algorithm to model the following multivariate continuous function f:3:𝑓maps-tosuperscript3f:{\mathbb{R}}^{3}\mapsto{\mathbb{R}}italic_f : blackboard_R start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ↦ blackboard_R:

f(x1,x2,x3)=x12+x22e|x2+x3|.𝑓subscript𝑥1subscript𝑥2subscript𝑥3superscriptsubscript𝑥12superscriptsubscript𝑥22superscript𝑒subscript𝑥2subscript𝑥3f(x_{1},x_{2},x_{3})=\frac{x_{1}^{2}+x_{2}^{2}}{e^{|x_{2}+x_{3}|}}.italic_f ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) = divide start_ARG italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_e start_POSTSUPERSCRIPT | italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT end_ARG . (16)

Sampling a three dimensional grid of discrete values ranging from 55-5- 5 to 5555 with an step-size of 0.10.10.10.1 leads to a 100×100×100100100100100\times 100\times 100100 × 100 × 100 tensor 𝐗𝐗\mathbf{X}bold_X  that summarizes the multivariate function in (16). The tensor 𝐗𝐗\mathbf{X}bold_X  can be approximated using the MRLR tensor decomposition to leverage parsimony. The tensor 𝐗𝐗\mathbf{X}bold_X  is unfolded into a 10000×1001000010010000\times 10010000 × 100 matrix, and the coarser-to-finer setup has been implemented. The performance of the MRLR tensor decomposition is again compared to that of the PARAFAC decomposition in terms of NFENFE\mathrm{NFE}roman_NFE for an increasing number of parameters. The results are shown in Fig. 3. As in previous scenarios, the MRLR decomposition outperforms the PARAFAC decomposition for the same number of parameters consistently. At some points, the difference between both algorithms is particularly high. For example, the MRLR tensor decomposition needs roughly 15,0001500015,00015 , 000 parameters to achieve 1%percent11\%1 % of NFENFE\mathrm{NFE}roman_NFE, while the PARAFAC decomposition needs more than 30,0003000030,00030 , 000 parameters.

Refer to caption
Fig. 3: Normalized Squared Frobenius Error (15) between the 100×100×100100100100100\times 100\times 100100 × 100 × 100 tensor sampled from the multivariate function in (16) and its approximation obtained via the MRLR and the PARAFAC tensor decompositions when the number of parameters (tensor rank) is increased.

6 Conclusions

This paper presented a parsimonious multi-resolution low-rank (MRLR) tensor decomposition to approximate a tensor as a sum of low-order tensor unfoldings. An Alternating Least Squares (ALS) algorithm was proposed to implement the MRLR tensor decomposition. Then, the MRLR tensor decomposition was compared against the PARAFAC decomposition in two real-case scenarios, and also in a multivariate function approximation problem. The MRLR tensor decomposition outperformed the PARAFAC decomposition for the same number of parameters, showing that it can efficiently leverage information defined at different dimensional orders.

References

  • [1] R. Bro, “Parafac. tutorial and applications,” Chemometrics and Intelligent Laboratory Systems, vol. 38, no. 2, pp. 149–171, 1997.
  • [2] R. B. Cattell, “Parallel proportional profiles and other principles for determining the choice of factors by rotation,” Psychometrika, vol. 9, no. 4, pp. 267–283, 1944.
  • [3] E. E. Papalexakis, C. Faloutsos, and N. D. Sidiropoulos, “Tensors for data mining and data fusion: Models, applications, and scalable algorithms,” ACM Transactions on Intelligent Systems and Technology (TIST), vol. 8, no. 2, pp. 1–44, 2016.
  • [4] T. G. Kolda and B. W. Bader, “Tensor decompositions and applications,” SIAM Review, vol. 51, no. 3, pp. 455–500, 2009.
  • [5] N. D. Sidiropoulos, L. De Lathauwer, X. Fu, K. Huang, E. E. Papalexakis, and C. Faloutsos, “Tensor decomposition for signal processing and machine learning,” IEEE Transactions on Signal Processing, vol. 65, no. 13, pp. 3551–3582, 2017.
  • [6] R. A. Harshman, “Foundations of the parafac procedure: Models and conditions for an “explanatory” multimodal factor analysis,” UCLA Working Papers Phonetics, vol. 16, pp. 1–84, 1970.
  • [7] L. R. Tucker, “Some mathematical notes on three-mode factor analysis,” Psychometrika, vol. 31, no. 3, pp. 279–311, 1966.
  • [8] I. V. Oseledets, “Tensor-train decomposition,” SIAM Journal on Scientific Computing, vol. 33, no. 5, pp. 2295–2317, 2011.
  • [9] L. Grasedyck, D. Kressner, and C. Tobler, “A literature survey of low-rank tensor approximation techniques,” GAMM-Mitteilungen, vol. 36, no. 1, pp. 53–78, 2013.
  • [10] D. Wang, F. Cong, and T. Ristaniemi, “Higher-order nonnegative candecomp/parafac tensor decomposition using proximal algorithm,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).   IEEE, 2019, pp. 3457–3461.
  • [11] Q. Xie, Q. Zhao, D. Meng, and Z. Xu, “Kronecker-basis-representation based tensor sparsity and its applications to tensor recovery,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 40, no. 8, pp. 1888–1902, 2017.
  • [12] O. Kaya and B. Uçar, “Parallel candecomp/parafac decomposition of sparse tensors using dimension trees,” SIAM Journal on Scientific Computing, vol. 40, no. 1, pp. C99–C130, 2018.
  • [13] R. Bro, “Multi-way analysis in the food industry-models, algorithms, and applications,” Ph.D. dissertation, University of Amsterdam (NL), 1998.
  • [14] S. Rozada, “Multi-resolution low-rank tensor decomposition,” https://github.com/sergiorozada12/multiresolution-tensor-decomposition, 2021.
  • [15] J. Kossaifi, Y. Panagakis, A. Anandkumar, and M. Pantic, “Tensorly: Tensor learning in python,” arXiv preprint arXiv:1610.09555, 2016.
  • [16] J. H. Friedman, “Greedy function approximation: a gradient boosting machine,” Annals of Statistics, pp. 1189–1232, 2001.
  • [17] N. Kargas and N. D. Sidiropoulos, “Supervised learning and canonical decomposition of multivariate functions,” IEEE Transactions on Signal Processing, vol. 69, pp. 1097–1107, 2021.