Octonion Phase Retrieval

Roman Jacome, Kumar Vijay Mishra, Brian M. Sadler, and Henry Arguello

R. J. and H. A. are with Universidad Industrial de Santander, Bucaramanga, Santander 680002 Colombia, e-mail: {roman2162474@correo., henarfu@}uis.edu.co. K. V. M. and B. M. S. are with the United States DEVCOM Army Research Laboratory, Adelphi, MD 20783 USA. The research was sponsored by the Army Research Office/Laboratory under Grant Number W911NF-21-1-0099, and the VIE project entitled “Dual blind deconvolution for joint radar-communications processing”. K. V. M. acknowledges support from the National Academies of Sciences, Engineering, and Medicine via the Army Research Laboratory Harry Diamond Distinguished Fellowship. This work was supported by ICETEX and MINCIENCIAS through the CTO 2022-0716 under Grant 8284.

Abstract

Signal processing over hypercomplex numbers arises in many optical imaging applications. In particular, spectral image or color stereo data are often processed using octonion algebra. Recently, phase recovery for eight-band multispectral images has gained salience, wherein the goal is to recover the eight bands from phaseless measurements. In this paper, we tackle this hitherto unaddressed hypercomplex variant of the popular phase retrieval (PR) problem. We propose octonion Wirtinger flow (OWF) to recover an octonion signal from its intensity-only observations. However, contrary to the complex-valued Wirtinger flow, the non-associative nature of octonion algebra and the consequent lack of octonion derivatives make the extension to OWF non-trivial. We resolve this using the pseudo-real-matrix representation of octonions to compute the derivatives in each OWF update. We demonstrate that our approach recovers the octonion signal up to a right-octonion phase factor. Numerical experiments validate OWF-based PR with high accuracy under both noiseless and noisy measurements.

Index Terms:

Hypercomplex signal processing, phase retrieval, optical imaging, octonion, quaternion.

I Introduction

In several engineering problems pertaining to imaging [1], array processing [2, 3], wireless communications [4, 5, 6], filtering [7], and neural networks [8, 9], the signals of interest are hypercomplex, that is, they are elements of some algebras over the field of real numbers [10]. Unlike vector spaces that only allow addition and scalar multiplication, algebras admit both addition and multiplication between the elements of the algebra [11]. Some common examples of hypercomplex signals include quaternions [12], coquaternions or split-quaternions [13], biquaternions [14], and octonions [15]. Instead of tackling each dimension independently, hypercomplex signal processing exploits the corresponding algebra to process all signal dimensions jointly. The quaternion approaches have been successfully applied to color image processing [16] where the color channels are mutually correlated via quaternion algebra. Quaternion signal processing tools have also been extended to Fourier transform [17], neural networks [18], and adaptive filtering [19]. Applying the Cayley-Dickson construction [11] to quaternions for higher dimensions yields an octonion representation [20]. In this paper, we focus on octonion-valued signals.

A recent application of octonions is multispectral image processing [21], wherein each pixel of a 7-color-channel image has a vector-valued representation such that each channel corresponds to a different complex-variable dimension. Octonions have also been exploited for color-stereo image analysis [22], where the channels of two 3-color-channel images are represented in different imaginary dimensions. The mutual processing along the color channels and two stereo images has been shown to improve the analysis. Recently, there has been broad interest in the recovery of the phase of a multispectral image, which is represented using octonion-valued signals, from its phaseless measurements [23].

Conventional phase retrieval (PR) is a long-standing signal processing problem, where we want to recover a complex-valued signal $\bm{x}\in\mathbb{C}^{n}$ given its phaseless measurements $\bm{y}\in\mathbb{R}_{+}^{m}$ as $\bm{y}=|\bm{Ax}|^{2}$, where the known measurement matrix $\bm{A}\in\mathbb{C}^{m\times n}$ is also complex-valued. This problem arises in several areas such as diffractive imaging [24], X-ray crystallography [25], astronomy [26], and radar waveform design [27]. A plethora of algorithms have been proposed for precise PR solutions and the literature is too expansive to summarize here (see, e.g., [28] for a recent survey, and references therein). Broadly, the PR algorithms follow two approaches: either exploit prior knowledge of the signal structure or make additional measurements of the magnitude via, for example, the Fourier transform.

In the context of hypercomplex signals, quaternion PR (QPR) was recently proposed for vision applications in [29], where the signal and the measurement matrix were quaternion- and real-valued, respectively. This was later extended to a quaternion-valued sensing matrix in [30], where a quaternion Wirtinger flow (QWF) algorithm was proposed to solve the QPR problem. The QWF is an extension of the popular complex-valued Wirtinger flow algorithm in [31]. Another QPR application has been reported in multiple image encryption that employs the quaternion gyrator transform [32]. In this work, we focus on the hitherto unaddressed octonion PR (OPR) problem that is encountered in the reconstruction of multispectral images.

However, unlike QWF, it is not straightforward to extend Wirtinger flow (WF) [31] to octonions because octonion algebra lacks the associative property. Hence, unlike quaternions, deriving Wirtinger-like derivatives for octonion-valued variables is very challenging [33, 34]. We address this problem by employing a pseudo-real-matrix representation [35] of the octonion variables to formulate our octonion WF (OWF). We identify trivial ambiguities in OPR and derive the recovery guarantees. Our numerical experiments with synthetic data as well as real eight-channel multispectral image data show accurate OPR with the proposed OWF under noiseless and noisy scenarios.

Throughout this paper, we reserve boldface lowercase, boldface uppercase, and calligraphic letters for vectors, matrices, and index sets, respectively. The set of octonion numbers is denoted as $\mathbb{O}$ . We denote the transpose, conjugate, and Hermitian by $(\cdot)^{T}$ , $(\cdot)^{*}$ , and $(\cdot)^{H}$ , respectively. The identity matrix of size $N\times N$ is $\mathbf{I}_{N}$ . $||\cdot||_{p}$ is the $\ell_{p}$ norm. We denote $|\cdot|$ as the cardinality of a set, $\mathbb{E}\left[\cdot\right]$ is the statistical expectation function, and $\mathbb{P}$ denotes the probability. The functions max and min output their arguments’ maximum and minimum values, respectively. The sign function is defined as $\operatorname{sign}(c)=\frac{c}{|c|}$ .

II Desiderata for Octonion Algebra

We begin with the theoretical desiderata. An octonion number $x$ is defined as $x=x_{0}+\sum_{i=1}^{7}x_{i}{e}_{i}$, where $x_{i}$ are real-valued coefficients and $e_{i}$ are the octonion units such that ${e}_{i}^{2}=-1$ for $i=1,\dots,7$. The conjugate is ${x^{*}=x_{0}-\sum_{i=1}^{7}x_{i}{e}_{i}}$. The ‘real part’ of $x$ is $x_{0}$. Octonions are obtained by applying the Cayley-Dickson construction to quaternions. Octonion algebra is non-associative and non-commutative, that is, for three octonion numbers $x,y,z\in\mathbb{O}$, we have, in general, $(x\cdot y)\cdot z\neq x\cdot(y\cdot z)$ and $x\cdot y\neq y\cdot x$. The ‘purely imaginary’ part of the octonion is $\operatorname{Im}x=\sum_{i=1}^{7}x_{i}{e}_{i}$. The magnitude of an octonion number is $|{x}|=\sqrt{\sum_{i=0}^{7}x_{i}^{2}}$. The norm of an octonion vector $\mathbf{x}\in\mathbb{O}^{n}$ is $\|\mathbf{x}\|=\sqrt{\sum_{k=1}^{n}|\mathbf{x}_{k}|^{2}}$. For a real-valued vector $\mathbf{t}\in\mathbb{R}^{n}$, its $\ell_{2}$ norm is $\|\mathbf{t}\|_{2}=\sqrt{\sum_{k=1}^{n}\mathbf{t}_{k}^{2}}$. The octonion-valued Gaussian distribution is represented by $\mathcal{N}_{\mathcal{O}}$, which is defined as $\mathcal{N}_{\mathcal{O}}=\mathcal{N}(0,1)+\sum_{i=1}^{7}\mathcal{N}(0,1)e_{i}$, where $\mathcal{N}(0,1)$ is the standard normal distribution. The octonion Gaussian distribution of an $n$-dimensional octonion random variable is $\mathcal{N}_{\mathcal{O}}^{n}$, where each vector element is drawn from $\mathcal{N}_{\mathcal{O}}$. For further details on octonion algebra, we refer the interested reader to [15].
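The definitions above can be checked numerically. The following is a minimal sketch, assuming the recursive Cayley-Dickson product $(a,b)(c,d)=(ac-d^{*}b,\,da+bc^{*})$ and an octonion stored as a list of its eight real coefficients; the helper names `conj`, `mul`, `magnitude`, and `unit` are illustrative and not taken from any toolbox. The ordering and signs of the units $e_{i}$ depend on the chosen basis convention, so products of individual units may differ in sign from other references.

```python
import math
import random


def conj(x):
    """Conjugate: keep the real part, negate every imaginary coefficient."""
    return [x[0]] + [-v for v in x[1:]]


def mul(x, y):
    """Cayley-Dickson product (a, b)(c, d) = (ac - d*b, da + bc*), applied recursively."""
    if len(x) == 1:
        return [x[0] * y[0]]
    h = len(x) // 2
    a, b, c, d = x[:h], x[h:], y[:h], y[h:]
    lo = [p - q for p, q in zip(mul(a, c), mul(conj(d), b))]
    hi = [p + q for p, q in zip(mul(d, a), mul(b, conj(c)))]
    return lo + hi


def magnitude(x):
    """|x| = sqrt(sum of squared coefficients)."""
    return math.sqrt(sum(v * v for v in x))


def unit(i):
    """unit(0) is the real unit 1; unit(i) is e_i for i = 1..7."""
    return [1.0 if k == i else 0.0 for k in range(8)]


e1, e2, e4 = unit(1), unit(2), unit(4)
left = mul(mul(e1, e2), e4)    # (e1 e2) e4
right = mul(e1, mul(e2, e4))   # e1 (e2 e4)
print("non-commutative:", mul(e1, e2) != mul(e2, e1))
print("non-associative:", left != right)

# The magnitude is multiplicative even though the product is not associative.
random.seed(0)
x = [random.gauss(0, 1) for _ in range(8)]
y = [random.gauss(0, 1) for _ in range(8)]
print("|xy| - |x||y| =", magnitude(mul(x, y)) - magnitude(x) * magnitude(y))
```

In this sketch, the triple $(e_{1},e_{2},e_{4})$ is chosen because $e_{4}$ lies outside the quaternion subalgebra generated by $e_{1}$ and $e_{2}$, which is where associativity fails.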

It follows from the non-associative octonion algebra that, unlike in quaternion algebra, a real-matrix representation of an octonion number does not exist. However, [35] proposed a pseudo-real matrix representation that has been successfully employed by many octonion-valued signal applications [21]. To obtain this representation, define the real representation of the octonion number $x\in\mathbb{O}$ as $\aleph(x)=[x_{0},x_{1},\dots,x_{7}]^{T}\in\mathbb{R}^{8}$. Then, the injective mapping $\gimel:\mathbb{O}\rightarrow\mathbb{R}^{8\times 8}$ is the pseudo-real matrix representation of an octonion number [35]:

\gimel(x)=\begin{bmatrix}x_{0}&-x_{1}&-x_{2}&-x_{3}&-x_{4}&-x_{5}&-x_{6}&-x_{7}\\ x_{1}&x_{0}&x_{3}&-x_{2}&x_{5}&-x_{4}&-x_{7}&x_{6}\\ x_{2}&-x_{3}&x_{0}&x_{1}&x_{6}&x_{7}&-x_{4}&-x_{5}\\ x_{3}&x_{2}&-x_{1}&x_{0}&x_{7}&-x_{6}&x_{5}&-x_{4}\\ x_{4}&-x_{5}&-x_{6}&-x_{7}&x_{0}&x_{1}&x_{2}&x_{3}\\ x_{5}&x_{4}&-x_{7}&x_{6}&-x_{1}&x_{0}&-x_{3}&x_{2}\\ x_{6}&x_{7}&x_{4}&-x_{5}&-x_{2}&x_{3}&x_{0}&-x_{1}\\ x_{7}&-x_{6}&x_{5}&x_{4}&-x_{3}&-x_{2}&x_{1}&x_{0}\end{bmatrix}.

Both representations $\aleph$ and $\gimel$ are also easily extended to vector/matrix octonion variables, i.e., given $\mathbf{A}\in\mathbb{O}^{m\times n}$, we have $\aleph(\mathbf{A})\in\mathbb{R}^{8n\times m}$ and $\gimel({\mathbf{A}})\in\mathbb{R}^{8m\times 8n}$. For $\mathbf{x}\in\mathbb{O}^{n}$ and $\mathbf{A}\in\mathbb{O}^{m\times n}$, it holds that $\aleph(\mathbf{Ax})=\gimel(\mathbf{A})\aleph(\mathbf{x})$ and $\|\mathbf{x}\|=\|\aleph(\mathbf{x})\|_{2}$. This allows us to convert the octonion product into a real-valued matrix/vector multiplication that obeys the octonion product rules. We later employ this representation for gradient-based algorithms for OPR.
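The identity $\aleph(xy)=\gimel(x)\aleph(y)$ for a single pair of octonions can be illustrated as follows. This is a sketch under stated assumptions: the 8×8 matrix is built column by column from a recursive Cayley-Dickson product (column $j$ holds the coefficients of $x\cdot e_{j}$) rather than copied from the matrix of [35], so its first row and column match the matrix above while the remaining signs depend on the basis convention. The names `left_matrix`, `matvec`, and `matmul` are hypothetical helpers.

```python
import random


def conj(x):
    return [x[0]] + [-v for v in x[1:]]


def mul(x, y):
    """Cayley-Dickson product (a, b)(c, d) = (ac - d*b, da + bc*), applied recursively."""
    if len(x) == 1:
        return [x[0] * y[0]]
    h = len(x) // 2
    a, b, c, d = x[:h], x[h:], y[:h], y[h:]
    lo = [p - q for p, q in zip(mul(a, c), mul(conj(d), b))]
    hi = [p + q for p, q in zip(mul(d, a), mul(b, conj(c)))]
    return lo + hi


def unit(i):
    return [1.0 if k == i else 0.0 for k in range(8)]


def left_matrix(x):
    """8x8 real matrix M with M @ aleph(y) = aleph(x * y); column j is aleph(x * e_j)."""
    cols = [mul(x, unit(j)) for j in range(8)]
    return [[cols[j][i] for j in range(8)] for i in range(8)]


def matvec(M, v):
    return [sum(m * t for m, t in zip(row, v)) for row in M]


def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B))) for j in range(len(B[0]))]
            for i in range(len(A))]


random.seed(0)
x = [random.gauss(0, 1) for _ in range(8)]
y = [random.gauss(0, 1) for _ in range(8)]

product_direct = mul(x, y)                    # octonion product
product_matrix = matvec(left_matrix(x), y)    # real matrix-vector product

# The map is only "pseudo" real: it does not turn octonion products into
# matrix products, because octonion multiplication is not associative.
e1, e2 = unit(1), unit(2)
is_homomorphism = matmul(left_matrix(e1), left_matrix(e2)) == left_matrix(mul(e1, e2))
print("matrix of a product equals product of matrices:", is_homomorphism)
```

The columns of `left_matrix(x)` are mutually orthogonal with squared norm $|x|^{2}$, which is the single-octonion version of the norm-preservation property stated above.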

III OPR and Trivial Ambiguity

Consider the octonion-valued signal $\mathbf{x}\in\mathbb{O}^{n}$ and its phaseless measurements $\mathbf{y}=|\mathbf{A}\mathbf{x}|^{2}\in\mathbb{R}_{+}^{m}$, where $\mathbf{A}\in\mathbb{O}^{m\times n}$ is the octonion-valued sensing matrix. Our goal is to recover the octonion-valued signal $\mathbf{x}$ from its phaseless measurements $\mathbf{y}$. Traditional WF for high-dimensional signals would require concatenating all signal components, thereby discarding any interaction between them. It is, therefore, desired to devise an OPR method that also obeys octonion algebra.

As in conventional PR problems, there also exists an intrinsic trivial ambiguity in OPR, as explained below.
Trivial Ambiguity: Given a unit octonion $q$ with $|q|=1$, scaling the signal $\mathbf{x}$ by a global right octonion factor, i.e., right-multiplying every element of $\mathbf{x}$ by $q$, leads to the same measurements, i.e., $|\mathbf{Ax}q|^{2}=|\mathbf{Ax}|^{2}$. However, since the octonion algebra is non-commutative, in general $|\mathbf{A}q\mathbf{x}|^{2}\neq|\mathbf{Ax}|^{2}$. Our goal is to recover $\mathbf{x}$ up to the trivial ambiguity of a right octonion phase factor.

To this end, first define $\mathbf{x}=\mathbf{y}q$. We show that $|\mathbf{a}_{\ell}^{H}\mathbf{x}|^{2}=|\mathbf{a}_{\ell}^{H}\mathbf{y}|^{2}$ for all $\ell=1,\dots,m$ holds with high probability. We have $|\mathbf{a}_{\ell}^{H}\mathbf{x}|^{2}-|\mathbf{a}_{\ell}^{H}\mathbf{y}|^{2}=\langle\mathbf{x}\mathbf{x}^{H}-\mathbf{y}\mathbf{y}^{H},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}$. Further,

\sum_{\ell=1}^{m}\left(|\mathbf{a}_{\ell}^{H}\mathbf{x}|^{2}-|\mathbf{a}_{\ell}^{H}\mathbf{y}|^{2}\right)^{2}\geq\left\langle\mathbf{x}\mathbf{x}^{H}-\mathbf{y}\mathbf{y}^{H},\sum_{\ell=1}^{m}\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\right\rangle_{\mathbb{R}}.

(1)

To lower bound the left-hand side term, which is a quadratic stochastic process, we employ the small ball method [36]. Recall the following Proposition 1:

Proposition 1 (Lower bound on quadratic stochastic process).

[36, Theorem 2.1] Assume $\beta_{\ell}$, $\ell=1,\dots,m$, to be independent copies of $\beta$. Denote by $\mathcal{F}$ a family of functions that satisfies a uniform small-ball estimate, i.e., for a constant $\tau>0$, we have $Q_{\mathcal{F}}(\tau)=\inf_{f\in\mathcal{F}}\mathbb{P}\left[|f|\geq\tau\right]>0$. Define the expectation of the Rademacher process as $R_{m}(\mathcal{F})=\mathbb{E}\left[\sup_{f\in\mathcal{F}}\left|\frac{1}{m}\sum_{\ell=1}^{m}\varepsilon_{\ell}f(\mathbf{\beta_{\ell}})\right|\right]$, where $\{\varepsilon_{\ell}\}_{\ell=1}^{m}$ are independent, symmetric, binary-valued random variables $\varepsilon_{\ell}\in\{-1,1\}$. Then, with probability at least $1-e^{-2t^{2}}$ for a constant $t>0$,

\inf_{f\in\mathcal{F}}\left|\sum_{\ell=1}^{m}\varepsilon_{\ell}f(\mathbf{\beta_{\ell}})\right|\geq\tau^{2}\left(Q_{\mathcal{F}}(2\tau)-\frac{4}{\tau}R_{m}(\mathcal{F})-\frac{t}{\sqrt{m}}\right).

(2)

Due to the randomness of the sensing matrix $\mathbf{A}$ and assuming that the octonion signal follows an octonion Gaussian distribution, we employ Proposition 1 to establish the following result about the trivial ambiguity of OPR.

Theorem 2 (Trivial ambiguity of right-octonion phase factor).

Consider the octonion variables $\mathbf{y}$ and $\mathbf{x}=\mathbf{y}q$, where $\mathbf{y}\sim\mathcal{N}_{\mathcal{O}}^{n}$ and $q\in\mathbb{O}$. Define the sensing matrix $\mathbf{A}\sim\mathcal{N}_{\mathcal{O}}^{m\times n}$ with rows $\mathbf{a}_{\ell}\in\mathbb{O}^{n}$ for $\ell=1,\dots,m$. Then, with probability at least $1-e^{-\frac{1}{2}\tilde{c}^{2}m}$ for some positive constant $\tilde{c}$, $\sum_{\ell=1}^{m}(|\mathbf{a}_{\ell}^{H}\mathbf{x}|-|\mathbf{a}_{\ell}^{H}\mathbf{y}|)\geq\tilde{c}m\|\mathbf{xx}^{H}-\mathbf{yy}^{H}\|_{F}^{2}$.

Proof:

See Appendix. ∎

IV Recovery Algorithm

Our proposed OWF algorithm performs Wirtinger-like iterations. These are similar to a gradient descent approach but also take into account the octonion algebra. However, the non-convexity of the OPR problem requires a suitable initialization of the algorithm. To this end, we employ a spectral initialization approach.
OWF Algorithm: The octonion algebra is non-associative and, hence, lacks a clear definition of derivatives for octonion-valued variables [37]. This affects the chain rule, high-dimensional gradients, and gradient-based methods such as the WF. Optimization-based methods that employ an octonion representation, as in singular value decomposition (SVD) [21] or deep octonion neural networks [38], usually resort to the pseudo-real-matrix representation to perform optimization over the real-valued variable. Inspired by this approach, we propose using this representation to solve the following OPR optimization:

\displaystyle\mathbf{x}^{\ast}=\operatorname*{arg\,min}_{\widetilde{\mathbf{x}}\in\mathbb{O}^{n}}\sum_{\ell=1}^{m}\left(|\mathbf{a}_{\ell}^{H}\widetilde{\mathbf{x}}|^{2}-\mathbf{y}_{\ell}\right)^{2}.

(3)

Employing the real matrix representation, the problem of recovery of the octonion signal becomes

\displaystyle\mathbf{x}^{\ast}=\aleph^{-1}\left(\operatorname*{arg\,min}_{\widetilde{\mathbf{x}}\in\mathbb{R}^{8n}}\overbrace{\sum_{\ell=1}^{m}\left(\|\gimel\left(\mathbf{a}_{\ell}^{H}\right)\aleph(\widetilde{\mathbf{x}})\|_{2}^{2}-\mathbf{y}_{\ell}\right)^{2}}^{f(\widetilde{\mathbf{x}})}\right)

(4)

where the $\ell_{2}$ norm comes from the observation that the norm of an octonion variable is the norm of its real representation.

Then, (4) is solved by gradient descent steps. Here, the key difference with respect to the traditional complex-valued approach lies in the gradient computation. In the complex-valued case, the measurements are $\mathbf{y}=|\mathbf{A}\mathbf{x}|^{2}$ with the signal $\mathbf{x}\in\mathbb{C}^{8n}$ and sensing matrix $\mathbf{A}\in\mathbb{C}^{m\times 8n}$, and the gradient is $\nabla_{\mathbf{x}}\left(|\mathbf{a}_{\ell}^{*}{\mathbf{x}}|^{2}-\mathbf{y}_{\ell}\right)^{2}=(|\mathbf{a}_{\ell}^{*}\mathbf{x}|^{2}-\mathbf{y}_{\ell})(\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{*})\mathbf{x}$. In contrast, the octonion real-matrix representation in the OWF considers the interaction among all signal components, which is desired for multispectral imaging. Up to a constant factor that is absorbed into the step size, the OWF gradient of the cost function is $\nabla_{\widetilde{\mathbf{x}}}f(\widetilde{\mathbf{x}})=\sum_{\ell=1}^{m}(\|\gimel\left(\mathbf{a}_{\ell}^{H}\right)\aleph(\widetilde{\mathbf{x}})\|_{2}^{2}-\mathbf{y}_{\ell})\gimel\left(\mathbf{a}_{\ell}^{H}\right)^{T}\gimel\left(\mathbf{a}_{\ell}^{H}\right)\aleph(\widetilde{\mathbf{x}})$. Then, the OWF update in the $i$-th iteration, where $i\in\{1,\dots,I\}$ and $I$ is the maximum number of iterations, becomes $\widetilde{\mathbf{x}}^{(i)}=\widetilde{\mathbf{x}}^{(i-1)}-\alpha\nabla f(\widetilde{\mathbf{x}}^{(i-1)})$, where $\alpha$ is a suitably selected gradient step size. From the inverse real representation of $\widetilde{\mathbf{x}}$, the octonion signal is $\mathbf{x}^{\ast}=\aleph^{-1}(\widetilde{\mathbf{x}}^{(I)})$.
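The cost $f$ in (4) and one gradient step can be sketched as below. This is an illustrative sketch, not the authors' implementation: it assumes that $\gimel(\mathbf{a}_{\ell}^{H})$ is the $8\times 8n$ block row of left-multiplication matrices of the conjugated row entries (built from a recursive Cayley-Dickson product), that $\aleph(\widetilde{\mathbf{x}})$ stacks the eight coefficients of each signal entry consecutively, and that the exact gradient carries the constant factor 4 from differentiating the squared residual. The names `sensing_block`, `cost`, `gradient`, and `owf_step` are hypothetical. A central finite difference is used only to check the analytic gradient.

```python
import random


def conj(x):
    return [x[0]] + [-v for v in x[1:]]


def mul(x, y):
    """Cayley-Dickson product (a, b)(c, d) = (ac - d*b, da + bc*), applied recursively."""
    if len(x) == 1:
        return [x[0] * y[0]]
    h = len(x) // 2
    a, b, c, d = x[:h], x[h:], y[:h], y[h:]
    lo = [p - q for p, q in zip(mul(a, c), mul(conj(d), b))]
    hi = [p + q for p, q in zip(mul(d, a), mul(b, conj(c)))]
    return lo + hi


def left_matrix(x):
    """8x8 matrix M with M @ coeffs(y) = coeffs(x * y)."""
    cols = [mul(x, [1.0 if k == j else 0.0 for k in range(8)]) for j in range(8)]
    return [[cols[j][i] for j in range(8)] for i in range(8)]


def sensing_block(a_row):
    """Assumed real form of a_l^H: the 8 x 8n block row [M(a_1^*), ..., M(a_n^*)]."""
    blocks = [left_matrix(conj(a)) for a in a_row]
    return [[v for blk in blocks for v in blk[i]] for i in range(8)]


def matvec(M, v):
    return [sum(m * t for m, t in zip(row, v)) for row in M]


def cost(G_list, y, x):
    """f(x) = sum_l (||G_l x||^2 - y_l)^2."""
    return sum((sum(t * t for t in matvec(G, x)) - y_l) ** 2 for G, y_l in zip(G_list, y))


def gradient(G_list, y, x):
    """Exact gradient: sum_l 4 (||G_l x||^2 - y_l) G_l^T G_l x."""
    g = [0.0] * len(x)
    for G, y_l in zip(G_list, y):
        Gx = matvec(G, x)
        r = sum(t * t for t in Gx) - y_l
        for j in range(len(x)):
            g[j] += 4.0 * r * sum(G[i][j] * Gx[i] for i in range(8))
    return g


def owf_step(G_list, y, x, alpha):
    """One update x <- x - alpha * grad f(x)."""
    return [xi - alpha * gi for xi, gi in zip(x, gradient(G_list, y, x))]


random.seed(1)
n, m = 2, 5
A = [[[random.gauss(0, 1) for _ in range(8)] for _ in range(n)] for _ in range(m)]
G_list = [sensing_block(row) for row in A]
x_true = [random.gauss(0, 1) for _ in range(8 * n)]
y = [sum(t * t for t in matvec(G, x_true)) for G in G_list]   # noiseless intensities
x0 = [random.gauss(0, 1) for _ in range(8 * n)]

g = gradient(G_list, y, x0)
h = 1e-5
g_fd = []
for j in range(8 * n):
    xp, xm = list(x0), list(x0)
    xp[j] += h
    xm[j] -= h
    g_fd.append((cost(G_list, y, xp) - cost(G_list, y, xm)) / (2 * h))
max_err = max(abs(p - q) for p, q in zip(g, g_fd))
print("max |analytic - finite difference| =", max_err)
```

Because the factor 4 only rescales the gradient, dropping it, as in the expression above, is equivalent to using a step size four times larger.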
Initialization: A key step in most nonconvex PR approaches is the initialization of the algorithm because spurious stationary points of the cost function can trap the iterations in local minima. Here, we employ the popular spectral initialization [39], wherein our goal is to obtain the initial estimate of the true signal by computing the leading eigenvector of the octonion-valued matrix $\mathbf{Y}=\frac{1}{m}\sum_{\ell=1}^{m}\mathbf{y}_{\ell}\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\in\mathbb{O}^{n\times n}$. This may be achieved by solving an octonionic right eigenvalue decomposition. In [40], this was solved for small octonion-valued matrices ($n<4$). However, [40] cannot be extended to larger matrices. Therefore, we propose to adapt the power method for the right quaternion eigenvalue decomposition [41] to compute the leading eigenvector of $\mathbf{Y}$. This method employs power iterations over the real matrix representation and applies the inverse real representation operator $\aleph^{-1}(\cdot)$ to yield the equivalent octonion leading eigenvector.
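A power iteration over a real matrix representation can be sketched as follows. The sketch makes an assumption that is not stated in the text: the real surrogate of $\mathbf{Y}$ is taken to be $\frac{1}{m}\sum_{\ell}\mathbf{y}_{\ell}\mathbf{G}_{\ell}^{T}\mathbf{G}_{\ell}\in\mathbb{R}^{8n\times 8n}$, where $\mathbf{G}_{\ell}$ is the $8\times 8n$ real block row of $\mathbf{a}_{\ell}^{H}$, chosen so that its quadratic form equals $\frac{1}{m}\sum_{\ell}\mathbf{y}_{\ell}|\mathbf{a}_{\ell}^{H}\mathbf{x}|^{2}$. The adaptation of [41] used by the authors may differ, and all function names here are hypothetical.

```python
import math
import random


def conj(x):
    return [x[0]] + [-v for v in x[1:]]


def mul(x, y):
    """Cayley-Dickson product (a, b)(c, d) = (ac - d*b, da + bc*), applied recursively."""
    if len(x) == 1:
        return [x[0] * y[0]]
    h = len(x) // 2
    a, b, c, d = x[:h], x[h:], y[:h], y[h:]
    lo = [p - q for p, q in zip(mul(a, c), mul(conj(d), b))]
    hi = [p + q for p, q in zip(mul(d, a), mul(b, conj(c)))]
    return lo + hi


def left_matrix(x):
    cols = [mul(x, [1.0 if k == j else 0.0 for k in range(8)]) for j in range(8)]
    return [[cols[j][i] for j in range(8)] for i in range(8)]


def sensing_block(a_row):
    """Assumed real form of a_l^H: the 8 x 8n block row [M(a_1^*), ..., M(a_n^*)]."""
    blocks = [left_matrix(conj(a)) for a in a_row]
    return [[v for blk in blocks for v in blk[i]] for i in range(8)]


def matvec(M, v):
    return [sum(m * t for m, t in zip(row, v)) for row in M]


def spectral_matrix(G_list, y):
    """Assumed real surrogate of Y: (1/m) sum_l y_l G_l^T G_l (symmetric, PSD)."""
    d, m = len(G_list[0][0]), len(G_list)
    Y = [[0.0] * d for _ in range(d)]
    for G, y_l in zip(G_list, y):
        for i in range(d):
            for j in range(d):
                Y[i][j] += y_l * sum(G[r][i] * G[r][j] for r in range(8)) / m
    return Y


def rayleigh(Y, v):
    return sum(a * b for a, b in zip(v, matvec(Y, v))) / sum(a * a for a in v)


def power_iteration(Y, v0, iters=100):
    """Repeatedly apply Y and renormalize to approach a leading eigenvector."""
    v = list(v0)
    for _ in range(iters):
        w = matvec(Y, v)
        nrm = math.sqrt(sum(t * t for t in w))
        v = [t / nrm for t in w]
    return v


random.seed(2)
n, m = 2, 20
A = [[[random.gauss(0, 1) for _ in range(8)] for _ in range(n)] for _ in range(m)]
G_list = [sensing_block(row) for row in A]
x_true = [random.gauss(0, 1) for _ in range(8 * n)]
y = [sum(t * t for t in matvec(G, x_true)) for G in G_list]

Y_real = spectral_matrix(G_list, y)
v0 = [random.gauss(0, 1) for _ in range(8 * n)]
v = power_iteration(Y_real, v0)
print("Rayleigh quotient: start", rayleigh(Y_real, v0), "-> end", rayleigh(Y_real, v))
```

The returned unit vector would then be rescaled and mapped back through $\aleph^{-1}(\cdot)$ to serve as the initial octonion estimate; the scaling rule is left out because the text does not specify it.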

To measure the error between the estimated octonion signal $\mathbf{x}^{\ast}$ and its true value $\mathbf{x}$, define the distance $d(\mathbf{x},\mathbf{x}^{\ast})=\min_{z}\|\mathbf{x}^{\ast}-\mathbf{x}z\|$, where $z\in\{z\in\mathbb{O}:|z|=1\}$ is a phase-only octonion factor. We represent this distance in terms of the pseudo-real-matrix representation of octonions as $d(\mathbf{x},\mathbf{x}^{\ast})=\min_{z}\|\aleph(\mathbf{x}^{\ast})-\gimel(\mathbf{x})\aleph(z)\|$, using the property $\|\mathbf{x}\|=\|\aleph(\mathbf{x})\|$. After some simple algebra, we get $d(\mathbf{x},\mathbf{x}^{\ast})=\|\aleph(\mathbf{x}^{\ast})-\gimel(\mathbf{x})g(\mathbf{x}^{\ast})\|$, where $g(\mathbf{x}^{\ast})=\operatorname{sign}\left(\left(\gimel(\mathbf{x})^{T}\gimel(\mathbf{x})\right)^{-1}\gimel(\mathbf{x})^{T}\aleph(\mathbf{x}^{\ast})\right)$.
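The distance can be computed in closed form under the following assumptions, which belong to this sketch rather than to the text: $\gimel(\mathbf{x})$ for a vector $\mathbf{x}\in\mathbb{O}^{n}$ is the $8n\times 8$ stack of left-multiplication matrices of its entries, so that $\gimel(\mathbf{x})\aleph(z)=\aleph(\mathbf{x}z)$ and $\gimel(\mathbf{x})^{T}\gimel(\mathbf{x})=\|\mathbf{x}\|^{2}\mathbf{I}_{8}$. Expanding $\|\aleph(\mathbf{x}^{\ast})-\gimel(\mathbf{x})\aleph(z)\|^{2}=\|\mathbf{x}^{\ast}\|^{2}-2\aleph(z)^{T}\gimel(\mathbf{x})^{T}\aleph(\mathbf{x}^{\ast})+\|\mathbf{x}\|^{2}$ over unit $z$ shows that the minimizer is the normalized vector $\gimel(\mathbf{x})^{T}\aleph(\mathbf{x}^{\ast})$. The function name `distance` and the helper names are hypothetical.

```python
import math
import random


def conj(x):
    return [x[0]] + [-v for v in x[1:]]


def mul(x, y):
    """Cayley-Dickson product (a, b)(c, d) = (ac - d*b, da + bc*), applied recursively."""
    if len(x) == 1:
        return [x[0] * y[0]]
    h = len(x) // 2
    a, b, c, d = x[:h], x[h:], y[:h], y[h:]
    lo = [p - q for p, q in zip(mul(a, c), mul(conj(d), b))]
    hi = [p + q for p, q in zip(mul(d, a), mul(b, conj(c)))]
    return lo + hi


def left_matrix(x):
    cols = [mul(x, [1.0 if k == j else 0.0 for k in range(8)]) for j in range(8)]
    return [[cols[j][i] for j in range(8)] for i in range(8)]


def distance(x, x_est):
    """Return (min over unit z of ||x_est - x z||, minimizing z).

    x and x_est are lists of octonions (each a list of 8 floats).
    """
    # c = gimel(x)^T aleph(x_est) = sum_k M(x_k)^T aleph(x_est_k)
    c = [0.0] * 8
    for xk, ek in zip(x, x_est):
        M = left_matrix(xk)
        for j in range(8):
            c[j] += sum(M[i][j] * ek[i] for i in range(8))
    nrm = math.sqrt(sum(t * t for t in c))
    z = [t / nrm for t in c] if nrm > 0 else [1.0] + [0.0] * 7
    diff = [e - p for xk, ek in zip(x, x_est) for e, p in zip(ek, mul(xk, z))]
    return math.sqrt(sum(t * t for t in diff)), z


random.seed(3)
n = 3
x = [[random.gauss(0, 1) for _ in range(8)] for _ in range(n)]

# A unit octonion q and the signal right-multiplied by q, entry by entry.
q = [random.gauss(0, 1) for _ in range(8)]
q_norm = math.sqrt(sum(t * t for t in q))
q = [t / q_norm for t in q]
x_rot = [mul(xk, q) for xk in x]

d_rot, z_rot = distance(x, x_rot)
print("distance between x and x q:", d_rot)

# For an unrelated estimate, the aligned distance never exceeds the plain one (z = 1).
x_other = [[random.gauss(0, 1) for _ in range(8)] for _ in range(n)]
d_other, _ = distance(x, x_other)
d_plain = math.sqrt(sum((e - t) ** 2 for xk, ek in zip(x, x_other) for t, e in zip(xk, ek)))
```

In this sketch a signal and its right-multiplied copy are at distance zero up to rounding, which is the sense in which the right phase factor is treated as a trivial ambiguity in the error metric.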

V Numerical Experiments

Figure 1: (a) Success rate of OWF and concatenated WF for different values of sampling complexity $m/n$ with $n=100$. (b) Convergence rate of OWF for $m/n=20$ for 2000 iterations.

We validated our OWF algorithm through various numerical experiments using the quaternion and octonion toolbox for MATLAB [42]. Unless otherwise noted, the sensing matrix was drawn from an octonion Gaussian distribution, i.e., $\mathbf{A}{\sim}\mathcal{N}_{\mathcal{O}}^{m\times n}$. The maximum number of OWF iterations was set to $I=2000$. We set $\alpha=\frac{5m}{\sum_{\ell=1}^{m}\mathbf{y}_{\ell}}$ [30].
Synthetic Data: We experimented with synthetic data, wherein the signal $\mathbf{x}\in\mathbb{O}^{n\times 1}$ was generated as $\mathbf{x}\sim\mathcal{N}_{\mathcal{O}}^{n}$ and normalized such that $\|\mathbf{x}\|=1$, with signal dimension $n=100$. Over 100 Monte Carlo simulations, we declare signal recovery a “success” when $d(\mathbf{x},\mathbf{x}^{\ast})\leq 10^{-5}$. Figure 1(a) shows the success rate (the mean success of the 100 experiments) for varying sample complexity $m/n$. We also compared OWF with the traditional WF (using the PhasePack library [43]), wherein the signal is a concatenation of all eight components, i.e., $\mathbf{x}=[\mathbf{x}_{0}^{T},\dots,\mathbf{x}_{7}^{T}]^{T}\in\mathbb{C}^{8n}$ with a random complex-valued sensing matrix $\mathbf{A}\in\mathbb{C}^{m\times 8n}$. We used $d(\mathbf{x},\mathbf{x}^{*})=\|\mathbf{x}^{*}-\mathbf{x}\operatorname{sign}(\mathbf{x}^{H}\mathbf{x}^{*})\|$ to assess reconstruction quality. The OWF achieved almost perfect recovery for $m/n>10$ while traditional WF required $m/n>20$. Figure 1(b) plots the distance function $d(\mathbf{x},\mathbf{x}^{\ast})$ for each iteration for $m/n=20$, showing linear convergence of the OWF algorithm. Next, we tested the OWF algorithm for measurements corrupted by additive Gaussian noise, i.e., $\mathbf{y}=|\mathbf{Ax}|^{2}+\bm{\omega}$, where $\bm{\omega}\in\mathbb{R}^{m}$ is sampled from $\bm{\omega}\sim\mathcal{N}\left(0,\frac{\|\mathbf{y}\|_{2}^{2}}{10^{\frac{\textrm{SNR}}{10}}}\mathbf{I}_{m}\right)$. We employed the same number of iterations and $n=100$. We varied the sample complexity $m/n$ and the signal-to-noise ratio (SNR) from 0 to 30 dB in steps of 5 dB. Figure 2 demonstrates that OWF recovers the octonion signal with high accuracy for $m/n>17$ and SNR $>20$ dB.
The absence of a distinct phase transition in both Figures 1 and 2 can be attributed to the utilization of a pseudo-real matrix representation in the algorithm, as precise octonion calculus tools are unavailable.
Real Data: We also validated OWF-based OPR with real data. We used a spectral image (Figure 3a) from the CAVE multispectral image dataset [44]. We employed a central crop of $32\times 32$ pixels and selected 8 equispaced spectral bands from the 31 original spectral bands ranging from 400 to 700 nm. Each band was vectorized and selected as a dimension of the octonion signal; thus, in this case, the octonion signal dimension was $n=1024$. We compared OWF reconstruction with the gradient descent (GD) algorithm [45]. We concatenated all color channels to form the signal $\mathbf{x}^{r}\in\mathbb{R}^{8n}$, the sensing matrix $\mathbf{A}^{r}\in\mathbb{R}^{m\times 8n}$ and, hence, the measurements $\mathbf{y}^{r}=|\mathbf{A}^{r}\mathbf{x}^{r}|^{2}$. Note that, unlike (4), this method does not use the real-matrix representation for the product between the rows $\mathbf{a}_{\ell}^{r}$ of $\mathbf{A}^{r}$ and the signal $\mathbf{x}^{r}$. We employed the Lanczos algorithm [46] with $100$ power iterations for initialization. Figure 3a depicts the success rate ($d(\mathbf{x},\hat{\mathbf{x}})<10^{-3}$) for 64 images of the dataset with varying sample complexity $m/n=\{1,5,10,15,20,25,30\}$. Similar to synthetic data, the OWF (concatenation method) for real data shows perfect recovery for $m/n>10$ ($m/n>20$).

We examined spectral image recovery with $m/n=15$ and conducted OWF for $I=2000$ iterations using the sensing matrix $\mathbf{A}$ generated as in previous experiments. Figures 3b, c, and d depict the ground-truth image, OWF reconstruction, and reconstruction via concatenation, respectively. Quality assessment utilized the peak signal-to-noise ratio (PSNR) $=20\log\left(\frac{\operatorname{max}\left(\mathbf{x},\mathbf{x}^{\ast}\right)}{\frac{1}{8n}\sum_{i=1}^{n}|\mathbf{x}_{i}-\mathbf{x}^{\ast}_{i}|^{2}}\right)$ [47]. Validation involved examining the recovery of spectral signatures, specifically vectors with eight octonion components at predefined pixel coordinates. Figure 3e demonstrates superior recovery performance with OWF over the real-valued approach with GD at coordinate (10,10) in the reference and reconstructed images.

VI Summary

We introduced an OPR algorithm for recovering 8-dimensional signals from phaseless measurements. The proposed OWF algorithm, derived from the pseudo-real-matrix representation of octonions, was validated through diverse experiments across different scenarios, sample sizes, noise levels, and real signals with multispectral images. This contributes to the advancement of hypercomplex PR applications [48].

Appendix: Proof of Theorem 2

From equation (1), we have $f(\cdot)=|\cdot|$. First, we find the small-ball estimate bound of $Q_{\mathcal{F}}(\tau)$. Define $\mathbf{W}=\mathbf{x}\mathbf{x}^{H}-\mathbf{y}\mathbf{y}^{H}$. Using the Paley-Zygmund inequality [49, Lemma 8.26], we get

\displaystyle Q_{\mathcal{F}}(\tau)=\inf_{\mathbf{W}}\mathbb{P}\left[|\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}|^{2}\geq\tau^{2}\right]\geq\inf_{\mathbf{W}}\frac{\left(\mathbb{E}\left[|\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}|^{2}\right]-\tau^{2}\right)^{2}}{\mathbb{E}\left[|\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}|^{4}\right]},\quad 0<\tau<1.

(5)

Then, we need to upper (lower) bound the denominator (numerator). Since $\mathbf{W}$ is a rank-2 matrix, we have $\mathbf{W}=\lambda_{1}\mathbf{h}\mathbf{h}^{H}+\lambda_{2}\mathbf{b}\mathbf{b}^{H}$, where $\lambda_{1},\lambda_{2}$ with $\lambda_{1}+\lambda_{2}=1$ and $\mathbf{h},\mathbf{b}$ with $\|\mathbf{h}\|=\|\mathbf{b}\|=1$ are the normalized eigenvalues and eigenvectors, respectively. Then, $\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}=\lambda_{1}|\mathbf{h}^{H}\mathbf{a}_{\ell}|^{2}+\lambda_{2}|\mathbf{b}^{H}\mathbf{a}_{\ell}|^{2}$. Since $\mathbf{A}\sim\mathcal{N}_{\mathcal{O}}^{m\times n}$ and $\mathbf{x}\sim\mathcal{N}_{\mathcal{O}}^{n}$, following the rotation invariance property of the octonion product, $\mathbf{h}^{H}\mathbf{a}_{\ell}$ and $\mathbf{b}^{H}\mathbf{a}_{\ell}$ are independent copies of the octonion Gaussian distribution. We observe that $|\mathbf{h}^{H}\mathbf{a}_{\ell}|^{2}$ and $|\mathbf{b}^{H}\mathbf{a}_{\ell}|^{2}$ conform to the $\chi^{2}$ distribution with 8 degrees of freedom. Leveraging computations of high-order moments [50] for the $\chi^{2}$ distribution and after some algebraic manipulations, we get $\mathbb{E}\left[|\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}|^{4}\right]\leq\frac{1}{4^{4}}\mathbb{E}\left[8|\mathbf{h}^{H}\mathbf{a}_{\ell}|^{2}+8|\mathbf{b}^{H}\mathbf{a}_{\ell}|^{2}\right]=\frac{2^{4}\Gamma(16)}{4^{4}\Gamma(8)}\coloneqq c_{0}$. On the other hand, $\mathbb{E}\left[|\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}|^{2}\right]\geq\frac{1}{16}\left(\mathbb{E}\left[8|\mathbf{h}^{H}\mathbf{a}_{\ell}|^{2}\right]-\mathbb{E}\left[8|\mathbf{b}^{H}\mathbf{a}_{\ell}|^{2}\right]\right)=\frac{1}{4}\coloneqq c_{1}$. Define the constant $c=\frac{c_{1}}{c_{0}}$.
Using the property ${|\langle\mathbf{P},\mathbf{D}\rangle|}\leq\sqrt{\operatorname{rank}(\mathbf{P})}\|\mathbf{P}\|_{F}\|\mathbf{D}\|_{2}$ from the real-matrix representation of octonion numbers, we have

\displaystyle R_{m}=\mathbb{E}\left[\sup_{\mathbf{W}}\frac{1}{m}\sum_{\ell=1}^{m}\varepsilon_{\ell}\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle\right]\leq\frac{\sqrt{2}}{n}\mathbb{E}\left[\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\right\|\right].

We can decompose the octonion variables following the Cayley-Dickson octonion construction: $\mathbf{a}_{\ell}=(\bm{\alpha}_{\ell}+\bm{\beta}_{\ell}e_{2})+(\bm{\gamma}_{\ell}+\bm{\eta}_{\ell}e_{2})e_{4}$. After some tedious algebra on the bound above using the aforementioned octonion representation, we obtain

\displaystyle\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\right\|=\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\left((\bm{\alpha}_{\ell}+\bm{\beta}_{\ell}e_{2})+(\bm{\gamma}_{\ell}+\bm{\eta}_{\ell}e_{2})e_{4}\right)\cdot\left((\bm{\alpha}_{\ell}+\bm{\beta}_{\ell}e_{2})+(\bm{\gamma}_{\ell}+\bm{\eta}_{\ell}e_{2})e_{4}\right)^{*}\right\|.

(6)

Re-arranging the terms yields

\displaystyle\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\right\|=\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\left(\bm{\alpha}_{\ell}\bm{\alpha}_{\ell}^{H}+\bm{\beta}_{\ell}\bm{\beta}_{\ell}^{H}+\bm{\gamma}_{\ell}\bm{\gamma}_{\ell}^{H}+\bm{\eta}_{\ell}\bm{\eta}_{\ell}^{H}+2\bm{\beta}_{\ell}\bm{\alpha}^{H}_{\ell}+2\bm{\eta}_{\ell}\bm{\gamma}^{H}_{\ell}\right)\right\|.

(7)

From random matrix theory, we have an upper bound on the Rademacher Gaussian series. Using [51, Theorem 4.1.1] in (7) gives $\mathbb{E}\left[\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\bm{\alpha}_{\ell}\bm{\alpha}_{\ell}^{*}\right\|\right]\leq\sqrt{2{\sum_{\ell}\|\bm{\alpha}_{\ell}\bm{\alpha}_{\ell}^{*}\|}\log n}$. Upper bounding the spectral norm [52] yields $\sum_{\ell}\|\bm{\alpha}_{\ell}\bm{\alpha}_{\ell}^{*}\|\leq\mathcal{O}(\log\log n)$. Then, for a universal constant $C$, we have $R_{m}(\mathcal{F})\leq Cn\log m$.

Putting all terms together, with probability at least $1-e^{-\frac{1}{2}c^{2}m}$, we obtain

\displaystyle\inf_{\mathbf{W}}\frac{1}{m}\sum_{\ell=1}^{m}|\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}|\geq\frac{1}{16}c-C\log m-\frac{c}{32\sqrt{m}}\geq\frac{c}{64}m.

This leads to $\sum_{\ell=1}^{m}(|\mathbf{a}_{\ell}^{H}\mathbf{x}|-|\mathbf{a}_{\ell}^{H}\mathbf{y}|)\geq\tilde{c}m\|\mathbf{xx}^{H}-\mathbf{yy}^{H}\|_{F}^{2}$, for a constant $\tilde{c}>0$. This proves that for sufficiently small $\tilde{c}$, the only trivial ambiguity is the right-octonion factor.

References

[1] B. Augereau and P. Carré, “Hypercomplex polynomial wavelet-filter bank transform for color image,” Signal Processing, vol. 136, pp. 16–28, 2017.
[2] N. Le Bihan and J. Mars, “Singular value decomposition of quaternion matrices: A new tool for vector-sensor signal processing,” Signal Processing, vol. 84, no. 7, pp. 1177–1199, 2004.
[3] W. Liu, “Channel equalization and beamforming for quaternion-valued wireless communication systems,” Journal of the Franklin Institute, vol. 354, no. 18, pp. 8721–8733, 2017.
[4] V. Tarokh, H. Jafarkhani, and A. R. Calderbank, “Space-time block codes from orthogonal designs,” IEEE Transactions on Information Theory, vol. 45, no. 5, pp. 1456–1467, 1999.
[5] O. Tirkkonen and A. Hottinen, “Square-matrix embeddable space-time block codes for complex signal constellations,” IEEE Transactions on Information Theory, vol. 48, no. 2, pp. 384–395, 2002.
[6] A. Buvarp, K. V. Mishra, A. I. Zaghloul, and L. M. Mili, “Quaternion-neural-networks-based decoder for RIS-aided polarization-space modulation,” in IEEE International Symposium on Antennas and Propagation and USNC-URSI Radio Science Meeting, 2023, pp. 1–2.
[7] F. Ortolani, D. Comminiello, M. Scarpiniti, and A. Uncini, “Frequency domain quaternion adaptive filters: Algorithms and convergence performance,” Signal Processing, vol. 136, pp. 69–80, 2017.
[8] M. Kobayashi, “Uniqueness theorem for quaternionic neural networks,” Signal Processing, vol. 136, pp. 102–106, 2017.
[9] ——, “Fixed points of split quaternionic Hopfield neural networks,” Signal Processing, vol. 136, pp. 38–42, 2017.
[10] T. A. Ell and S. J. Sangwine, “Hypercomplex Fourier transforms of color images,” IEEE Transactions on Image Processing, vol. 16, no. 1, pp. 22–35, 2006.
[11] I. L. Kantor and A. S. Solodovnikov, Hypercomplex numbers: An elementary introduction to algebras. Springer, 1989.
[12] J. Voight, Quaternion algebras. Springer Nature, 2021.
[13] M. Erdoğdu and M. Özdemir, “On complex split quaternion matrices,” Advances in Applied Clifford Algebras, vol. 23, pp. 625–638, 2013.
[14] M. Knus, T. Lam, D. Shapiro, and J. Tignol, “Discriminants of involutions on biquaternion algebras,” in $K$ -Theory and Algebraic Geometry: Connections with Quadratic Forms and Division Algebras: Connections with Quadratic Forms and Division Algebras, ser. Proceedings of Symposia in Pure Mathematics, B. Jacob and A. Rosenberg, Eds. American Mathematical Society, 1995, vol. 58 (2), pp. 279–303.
[15] S. Okubo, Introduction to octonion and other non-associative algebras in physics. Cambridge University Press, 1995, no. 2.
[16] N. Le Bihan and S. J. Sangwine, “Quaternion principal component analysis of color images,” in IEEE International Conference on Image Processing, vol. 1, 2003, pp. I–809.
[17] T. A. Ell, N. Le Bihan, and S. J. Sangwine, Quaternion Fourier transforms for signal and image processing. John Wiley & Sons, 2014.
[18] T. Isokawa, T. Kusakabe, N. Matsui, and F. Peper, “Quaternion neural network and its application,” in International Conference on International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, 2003, pp. 318–324.
[19] G. Wang and R. Xue, “Quaternion filtering based on quaternion involutions and its application in signal processing,” IEEE Access, vol. 7, pp. 149 068–149 079, 2019.
[20] J. Baez, “The octonions,” Bulletin of the american mathematical society, vol. 39, no. 2, pp. 145–205, 2002.
[21] S. Lazendić, H. De Bie, and A. Pižurica, “Octonion sparse representation for color and multispectral image processing,” in European Signal Processing Conference, 2018, pp. 608–612.
[22] M. Yamni, H. Karmouni, M. Sayyouri, H. Qjidaa, and J. Flusser, “Novel octonion moments for color stereo image analysis,” Digital Signal Processing, vol. 108, p. 102878, 2021.
[23] V. Katkovnik, I. Shevkunov, and K. Egiazarian, “ADMM and spectral proximity operators in hyperspectral broadband phase retrieval for quantitative phase imaging,” Signal Processing, p. 109095, 2023.
[24] J. Bacca, S. Pinilla, and H. Arguello, “Super-resolution phase retrieval from designed coded diffraction patterns,” IEEE Transactions on Image Processing, vol. 29, pp. 2598–2609, 2019.
[25] S. Pinilla, J. Bacca, and H. Arguello, “Phase retrieval algorithm via nonconvex minimization using a smoothing function,” IEEE Transactions on Signal Processing, vol. 66, no. 17, pp. 4574–4584, 2018.
[26] C. Fienup and J. Dainty, “Phase retrieval and image reconstruction for astronomy,” in Image recovery: Theory and application, H. Stark, Ed. Academic Press, 1987, pp. 231–275.
[27] S. Pinilla, K. V. Mishra, B. M. Sadler, and H. Arguello, “BanRaW: Band-limited radar waveform design via phase retrieval,” in IEEE International Conference on Acoustics, Speech and Signal Processing, 2021, pp. 5449–5453.
[28] S. Pinilla, K. V. Mishra, I. Shevkunov, M. Soltanalian, V. Katkovnik, and K. Egiazarian, “Unfolding-aided bootstrapped phase retrieval in optical imaging: Explainable ai reveals new imaging frontiers,” IEEE Signal Processing Magazine, vol. 40, no. 2, pp. 46–60, 2023.
[29] Y. Chen, C. Cheng, and Q. Sun, “Phase retrieval of complex and vector-valued functions,” Journal of Functional Analysis, vol. 283, no. 7, p. 109593, 2022.
[30] J. Chen and M. K. Ng, “Phase retrieval of quaternion signal via Wirtinger flow,” IEEE Transactions on Signal Processing, vol. 71, pp. 2863–2878, 2023.
[31] E. J. Candès, X. Li, and M. Soltanolkotabi, “Phase retrieval from coded diffraction patterns,” Applied and Computational Harmonic Analysis, vol. 39, no. 2, pp. 277–299, 2015.
[32] Z. Shao, H. Shu, J. Wu, Z. Dong, G. Coatrieux, and J. L. Coatrieux, “Double color image encryption using iterative phase retrieval algorithm in quaternion gyrator domain,” Optics Express, vol. 22, no. 5, pp. 4932–4943, 2014.
[33] D. Xu and D. P. Mandic, “The theory of quaternion matrix derivatives,” IEEE Transactions on Signal Processing, vol. 63, no. 6, pp. 1543–1556, 2015.
[34] L. Qi, Z. Luo, Q.-W. Wang, and X. Zhang, “Quaternion matrix optimization: Motivation and analysis,” Journal of Optimization Theory and Applications, vol. 193, no. 1-3, pp. 621–648, 2022.
[35] L. Rodman, “Hermitian octonion matrices and numerical ranges,” The Electronic Journal of Linear Algebra, vol. 27, pp. 515–533, 2014.
[36] V. Koltchinskii and S. Mendelson, “Bounding the smallest singular value of a random matrix without concentration,” International Mathematics Research Notices, vol. 2015, no. 23, pp. 12 991–13 008, 2015.
[37] M. Bouchard and J. Khalid, “Calculus for eight-dimensional hypercomplex algebras,” Dec. 2022, working paper or preprint. [Online]. Available: https://hal.science/hal-03558753
[38] J. Wu, L. Xu, F. Wu, Y. Kong, L. Senhadji, and H. Shu, “Deep octonion networks,” Neurocomputing, vol. 397, pp. 179–191, 2020.
[39] E. J. Candès, X. Li, and M. Soltanolkotabi, “Phase retrieval via Wirtinger flow: Theory and algorithms,” IEEE Transactions on Information Theory, vol. 61, no. 4, pp. 1985–2007, 2015.
[40] T. Dray and C. A. Manogue, “The octonionic eigenvalue problem,” Advances in Applied Clifford Algebras, vol. 8, pp. 341–364, 1998.
[41] Y. Li, M. Wei, F. Zhang, and J. Zhao, “On the power method for quaternion right eigenvalue problem,” Journal of Computational and Applied Mathematics, vol. 345, pp. 59–69, 2019.
[42] S. Sangwine and N. Le Bihan, “Quaternion and octonion toolbox for MATLAB,” 2016. [Online]. Available: https://qtfm.sourceforge.io/
[43] R. Chandra, T. Goldstein, and C. Studer, “Phasepack: A phase retrieval library,” in IEEE International conference on Sampling Theory and Applications, 2019, pp. 1–5.
[44] F. Yasuma, T. Mitsunaga, D. Iso, and S. Nayar, “Generalized Assorted Pixel Camera: Post-Capture Control of Resolution, Dynamic Range and Spectrum,” epartment of Computer Science, Columbia University, Tech. Rep. CUCS-061-08, Nov 2008.
[45] H. Zhang, L. Zhang, and H. Shen, “A super-resolution reconstruction algorithm for hyperspectral images,” Signal Processing, vol. 92, no. 9, pp. 2082–2096, 2012.
[46] C. Lanczos, “An iteration method for the solution of the eigenvalue problem of linear differential and integral operators,” Journal of Research of the National Bureau of Standards, vol. 45, no. 4, 1950.
[47] A. Hore and D. Ziou, “Image quality metrics: PSNR vs. SSIM,” in IEEE International Conference on Pattern Recognition, 2010, pp. 2366–2369.
[48] R. Jacome, K. V. Mishra, B. M. Sadler, and H. Arguello, “An invitation to hypercomplex phase retrieval: Theory and applications,” IEEE Signal Processng Magazine, 2024, in press.
[49] A. Zygmund, Trigonometric series. Cambridge university press, 2002, vol. 1.
[50] S. Nadarajah, “Explicit expressions for moments of $\chi^{2}$ order statistics,” Bulletin of the Institute of Mathematics, Academia Sinica (New Series), vol. 3, no. 3, pp. 433–444, 2008.
[51] J. A. Tropp, “An introduction to matrix concentration inequalities,” Foundations and Trends® in Machine Learning, vol. 8, no. 1-2, pp. 1–230, 2015.
[52] R. Van Handel, “On the spectral norm of Gaussian random matrices,” Transactions of the American Mathematical Society, vol. 369, no. 11, pp. 8161–8178, 2017.

$$\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\right\|=\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\big((\bm{\alpha}_{\ell}+\bm{\beta}_{\ell}e_{2})+(\bm{\gamma}_{\ell}+\bm{\eta}_{\ell}e_{2})e_{4}\big)\cdot\big((\bm{\alpha}_{\ell}+\bm{\beta}_{\ell}e_{2})+(\bm{\gamma}_{\ell}+\bm{\eta}_{\ell}e_{2})e_{4}\big)^{*}\right\|. \tag{6}$$

$$\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\right\|=\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\big(\bm{\alpha}_{\ell}\bm{\alpha}_{\ell}^{H}+\bm{\beta}_{\ell}\bm{\beta}_{\ell}^{H}+\bm{\gamma}_{\ell}\bm{\gamma}_{\ell}^{H}+\bm{\eta}_{\ell}\bm{\eta}_{\ell}^{H}+2\bm{\beta}_{\ell}\bm{\alpha}_{\ell}^{H}+2\bm{\eta}_{\ell}\bm{\gamma}_{\ell}^{H}\big)\right\|. \tag{7}$$
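
The Cayley-Dickson splitting $(\bm{\alpha}_{\ell}+\bm{\beta}_{\ell}e_{2})+(\bm{\gamma}_{\ell}+\bm{\eta}_{\ell}e_{2})e_{4}$ that appears in (6) can be illustrated numerically for a single scalar octonion (one entry of $\mathbf{a}_{\ell}$). The following is a minimal Python sketch, not the implementation used in this paper. It assumes one common Cayley-Dickson doubling rule, $(a,b)(c,d)=(ac-d^{*}b,\,da+bc^{*})$, and stores an octonion as eight real coefficients ordered $e_0,\dots,e_7$; this basis ordering and the helper names `conj`, `mul`, `embed`, `unit`, and `from_parts` are assumptions made for illustration and may differ from the multiplication table adopted elsewhere in the text.

```python
import numpy as np

def conj(x):
    """Conjugate of a Cayley-Dickson number: negate every imaginary coefficient."""
    x = np.asarray(x, dtype=float)
    if x.size == 1:
        return x.copy()
    h = x.size // 2
    return np.concatenate([conj(x[:h]), -x[h:]])

def mul(x, y):
    """Product via the (assumed) doubling rule (a,b)(c,d) = (ac - d*b, da + bc*)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.size == 1:
        return x * y
    h = x.size // 2
    a, b = x[:h], x[h:]
    c, d = y[:h], y[h:]
    return np.concatenate([mul(a, c) - mul(conj(d), b),
                           mul(d, a) + mul(b, conj(c))])

def embed(z):
    """Embed a complex number z into the octonions (real part on e0, imaginary on e1)."""
    z = complex(z)
    out = np.zeros(8)
    out[0], out[1] = z.real, z.imag
    return out

def unit(k):
    """Basis octonion e_k under the assumed coefficient ordering."""
    e = np.zeros(8)
    e[k] = 1.0
    return e

def from_parts(alpha, beta, gamma, eta):
    """Assemble (alpha + beta e2) + (gamma + eta e2) e4 from four complex numbers."""
    e2, e4 = unit(2), unit(4)
    left = embed(alpha) + mul(embed(beta), e2)
    right = embed(gamma) + mul(embed(eta), e2)
    return left + mul(right, e4)

# One octonion built from four complex parts, as in the splitting of (6).
a = from_parts(1 + 2j, 3 - 1j, -2 + 0.5j, 0.25 + 4j)
a_times_conj = mul(a, conj(a))  # real-valued: only the e0 coefficient is nonzero

# Norm multiplicativity |xy| = |x||y| for two random octonions.
rng = np.random.default_rng(0)
x, y = rng.standard_normal(8), rng.standard_normal(8)
norm_gap = abs(np.linalg.norm(mul(x, y)) - np.linalg.norm(x) * np.linalg.norm(y))

# Non-associativity: (e1 e2) e4 and e1 (e2 e4) differ by a sign.
lhs = mul(mul(unit(1), unit(2)), unit(4))
rhs = mul(unit(1), mul(unit(2), unit(4)))
```

Under this convention the four complex parts occupy consecutive coefficient pairs of the eight-dimensional real vector, so the product of an octonion with its conjugate collapses to a real scalar, while the sign flip between `lhs` and `rhs` shows the non-associativity that prevents a direct extension of the complex-valued argument.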