Octonion Phase Retrieval

Roman Jacome, Kumar Vijay Mishra, Brian M. Sadler and Henry Arguello R. J., and H. A. are with Universidad Industrial de Santander, Bucaramanga, Santander 680002 Colombia, e-mail: {roman2162474@correo., henarfu@}uis.edu.co.K. V. M. and B. M. S. are with the United States DEVCOM Army Research Laboratory, Adelphi, MD 20783 USA, e-mail: [email protected], [email protected] research was sponsored by the Army Research Office/Laboratory under Grant Number W911NF-21-1-0099, and the VIE project entitled “Dual blind deconvolution for joint radar-communications processing”. K. V. M. acknowledges support from the National Academies of Sciences, Engineering, and Medicine via the Army Research Laboratory Harry Diamond Distinguished Fellowship. This work was supported by ICETEX and MINCIENCIAS through the CTO 2022-0716 under Grant 8284
Abstract

Signal processing over hypercomplex numbers arises in many optical imaging applications. In particular, spectral image or color stereo data are often processed using octonion algebra. Recently, the eight-band multispectral image phase recovery has gained salience, wherein it is desired to recover the eight bands from the phaseless measurements. In this paper, we tackle this hitherto unaddressed hypercomplex variant of the popular phase retrieval (PR) problem. We propose octonion Wirtinger flow (OWF) to recover an octonion signal from its intensity-only observation. However, contrary to the complex-valued Wirtinger flow, the non-associative nature of octonion algebra and the consequent lack of octonion derivatives make the extension to OWF non-trivial. We resolve this using the pseudo-real-matrix representation of octonion to perform the derivatives in each OWF update. We demonstrate that our approach recovers the octonion signal up to a right-octonion phase factor. Numerical experiments validate OWF-based PR with high accuracy under both noiseless and noisy measurements.

Index Terms:
Hypercomplex signal processing, phase retrieval, optical imaging, octonion, quaternion.

I Introduction

In several engineering problems pertaining to imaging [1], array processing [2, 3], wireless communications [4, 5, 6], filtering [7], and neural networks [8, 9], the signals of interest are hypercomplex, that is, they are elements of some algebras over the field of real numbers [10]. Unlike vector spaces that only allow addition and scalar multiplication, algebras admit both addition and multiplication between the elements of the algebra [11]. Some common examples of hypercomplex signals include quaternions [12], coquaternions or split-quaternions [13], biquaternions [14], and octonions [15]. Instead of tackling each dimension independently, hypercomplex signal processing exploits the corresponding algebra to process all signal dimensions jointly. The quaternion approaches have been successfully applied to color image processing [16] where the color channels are mutually correlated via quaternion algebra. Quaternion signal processing tools have also been extended to Fourier transform [17], neural networks [18], and adaptive filtering [19]. Applying the Cayley-Dickson construction [11] to quaternions for higher dimensions yields an octonion representation [20]. In this paper, we focus on octonion-valued signals.

A recent application of octonions is multispectral image processing [21], wherein each pixel 7-color channel image has a vector-valued representation such that each channel corresponds to different complex-variable dimensions. Octonions have also been exploited for color-stereo image analysis [22], where two 3-color channel images are represented in a different imaginary dimension. The mutual processing along the color channels and two stereo images has been shown to improve the analysis. Recently, there has been broad interest in the recovery of the phase of a multispectral image, which is represented using octonion-valued signals, from its phaseless measurements [23].

Conventional phase retrieval (PR) is a long-standing signal processing problem, where we want to recover a complex-valued signal 𝒙n𝒙superscript𝑛\bm{x}\in\mathbb{C}^{n}bold_italic_x ∈ blackboard_C start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT given its phaseless measurements 𝒚m𝒚superscript𝑚\bm{y}\in\mathbb{C}^{m}bold_italic_y ∈ blackboard_C start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT as 𝒚=|𝑨𝒙|2𝒚superscript𝑨𝒙2\bm{y}=|\bm{Ax}|^{2}bold_italic_y = | bold_italic_A bold_italic_x | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, where the known measurement matrix 𝑨m×n𝑨superscript𝑚𝑛\bm{A}\in\mathbb{C}^{m\times n}bold_italic_A ∈ blackboard_C start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT is also complex-valued. This problem arises in several areas such as diffractive imaging [24], X-ray crystallography [25], astronomy [26], and radar waveform design [27]. A plethora of algorithms have been proposed for precise PR solutions and the literature is too expansive to summarize here (see, e.g., [28] for recent surveys, and references therein). Broadly, the PR algorithms follow two approaches: either exploit prior knowledge of the signal structure or make additional measurements of the magnitude via, for example, the Fourier transform.

In the context of hypercomplex signals, recently quaternion PR (QPR) was proposed for vision applications in [29], where the signal and the measurement matrix were quaternion- and real-valued, respectively. This was later extended to a quaternion-valued sensing matrix in [30], a quaternion Wirtinger flow (QWF) algorithm was proposed to solve the QPR problem. The QWF is an extension of its popular complex-valued PR algorithm in [31]. Another QPR application has been reported in multiple image encryption that employs quaternion gyrator transform [32]. In this work, we focus on the hitherto unaddressed octonion PR (OPR) problem that is encountered in the reconstruction of multispectral images.

However, unlike QWF, it is not straightforward to extend Wirtinger flow (WF) [31] to octonions because octonion algebra lacks associative property. Hence, unlike quaternions, deriving Wirtinger-like derivatives for octonion-valued variables is very challenging [33, 34]. We address this problem by employing a pseudo-real-matrix representation [35] of the octonion variables to formulate our octonion WF (OWF). We identify trivial ambiguities in OPR and derive the recovery guarantees. Our numerical experiments with synthetic as well as eight-channel multispectral image real data show accurate OPR with the proposed OWF under noiseless and noisy scenarios.

Throughout this paper, we reserve boldface lowercase, boldface uppercase, and calligraphic letters for vectors, matrices, and index sets, respectively. The set of octonion numbers is denoted as 𝕆𝕆\mathbb{O}blackboard_O. We denote the transpose, conjugate, and Hermitian by ()Tsuperscript𝑇(\cdot)^{T}( ⋅ ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, ()superscript(\cdot)^{*}( ⋅ ) start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, and ()Hsuperscript𝐻(\cdot)^{H}( ⋅ ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT, respectively. The identity matrix of size N×N𝑁𝑁N\times Nitalic_N × italic_N is 𝐈Nsubscript𝐈𝑁\mathbf{I}_{N}bold_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT. ||||p||\cdot||_{p}| | ⋅ | | start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT is the psubscript𝑝\ell_{p}roman_ℓ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT norm. We denote |||\cdot|| ⋅ | as the cardinality of a set, 𝔼[]𝔼delimited-[]\mathbb{E}\left[\cdot\right]blackboard_E [ ⋅ ] is the statistical expectation function, and \mathbb{P}blackboard_P denotes the probability. The functions max and min output their arguments’ maximum and minimum values, respectively. The sign function is defined as sign(c)=c|c|sign𝑐𝑐𝑐\operatorname{sign}(c)=\frac{c}{|c|}roman_sign ( italic_c ) = divide start_ARG italic_c end_ARG start_ARG | italic_c | end_ARG.

II Desiderata for Octonion Algebra

We begin with the theoretical desiderata. An octonion number x𝑥xitalic_x is defined as x=x0+i=17xiei𝑥subscript𝑥0superscriptsubscript𝑖17subscript𝑥𝑖subscript𝑒𝑖x=x_{0}+\sum_{i=1}^{7}x_{i}{e}_{i}italic_x = italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, where xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are real-valued coefficients and eisubscript𝑒𝑖e_{i}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are the octonion units such that ei2=1superscriptsubscript𝑒𝑖21{e}_{i}^{2}=-1italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = - 1 for i=1,,7𝑖17i=1,\dots,7italic_i = 1 , … , 7. The conjugate is x=x017xieisuperscript𝑥subscript𝑥0superscriptsubscript17subscript𝑥𝑖subscript𝑒𝑖{x^{*}=x_{0}-\sum_{1}^{7}x_{i}{e}_{i}}italic_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. The ‘real part’ of x𝑥xitalic_x is x0subscript𝑥0x_{0}italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Octonions are obtained from Cayley Dickson’s construction of quaternions. Octonion algebra is non-associative and non-commutative, that is, for given three octonion numbers x,y,z𝕆𝑥𝑦𝑧𝕆x,y,z\in\mathbb{O}italic_x , italic_y , italic_z ∈ blackboard_O, we have (xy)zx(yz)𝑥𝑦𝑧𝑥𝑦𝑧(x\cdot y)\cdot z\neq x\cdot(y\cdot z)( italic_x ⋅ italic_y ) ⋅ italic_z ≠ italic_x ⋅ ( italic_y ⋅ italic_z ) and xyyx𝑥𝑦𝑦𝑥x\cdot y\neq y\cdot xitalic_x ⋅ italic_y ≠ italic_y ⋅ italic_x. The ‘purely imaginary’ part of the octonion is Imx=i=17xieiIm𝑥superscriptsubscript𝑖17subscript𝑥𝑖subscript𝑒𝑖\operatorname{Im}x=\sum_{i=1}^{7}x_{i}{e}_{i}roman_Im italic_x = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. The magnitude of an octonion number is |x|=i=07xi2𝑥superscriptsubscript𝑖07superscriptsubscript𝑥𝑖2|{x}|=\sqrt{\sum_{i=0}^{7}x_{i}^{2}}| italic_x | = square-root start_ARG ∑ start_POSTSUBSCRIPT italic_i = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG. The norm of an octonion vector 𝐱𝕆n𝐱superscript𝕆𝑛\mathbf{x}\in\mathbb{O}^{n}bold_x ∈ blackboard_O start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is 𝐱=k=1n|𝐱k|2norm𝐱superscriptsubscript𝑘1𝑛superscriptsubscript𝐱𝑘2\|\mathbf{x}\|=\sqrt{\sum_{k=1}^{n}|\mathbf{x}_{k}|^{2}}∥ bold_x ∥ = square-root start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | bold_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG. For a real-valued vector 𝐭n𝐭superscript𝑛\mathbf{t}\in\mathbb{R}^{n}bold_t ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, its 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT norm is 𝐭2=k=1n𝐭k2subscriptnorm𝐭2superscriptsubscript𝑘1𝑛superscriptsubscript𝐭𝑘2\|\mathbf{t}\|_{2}=\sqrt{\sum_{k=1}^{n}\mathbf{t}_{k}^{2}}∥ bold_t ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = square-root start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT bold_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG. The octonion-valued Gaussian distribution is represented by 𝒩𝒪subscript𝒩𝒪\mathcal{N}_{\mathcal{O}}caligraphic_N start_POSTSUBSCRIPT caligraphic_O end_POSTSUBSCRIPT which is defined as 𝒩𝒪=𝒩(0,1)+i=17𝒩(0,1)eisubscript𝒩𝒪𝒩01superscriptsubscript𝑖17𝒩01subscript𝑒𝑖\mathcal{N}_{\mathcal{O}}=\mathcal{N}(0,1)+\sum_{i=1}^{7}\mathcal{N}(0,1)e_{i}caligraphic_N start_POSTSUBSCRIPT caligraphic_O end_POSTSUBSCRIPT = caligraphic_N ( 0 , 1 ) + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT caligraphic_N ( 0 , 1 ) italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, where 𝒩(0,1)𝒩01\mathcal{N}(0,1)caligraphic_N ( 0 , 1 ) is standard normal distribution. The octonion Gaussian distribution of a n𝑛nitalic_n-dimensional octonion random variable is 𝒩𝒪nsuperscriptsubscript𝒩𝒪𝑛\mathcal{N}_{\mathcal{O}}^{n}caligraphic_N start_POSTSUBSCRIPT caligraphic_O end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, where each vector element is drawn from 𝒩𝒪subscript𝒩𝒪\mathcal{N}_{\mathcal{O}}caligraphic_N start_POSTSUBSCRIPT caligraphic_O end_POSTSUBSCRIPT. For further details on octonion algebra, we refer the interested reader to [15].

It follows from the non-associative octonion algebra that, unlike in quaternion algebra, a real-matrix representation of an octonion number does not exist. However, [35] proposed a pseudo-real matrix representation that has been successfully employed by many octonion-valued signal applications [21]. To obtain this representation, define the real representation of the octonion number x𝕆𝑥𝕆x\in\mathbb{O}italic_x ∈ blackboard_O as (x)=[x0,x1,,x7]T8𝑥superscriptsubscript𝑥0subscript𝑥1subscript𝑥7𝑇superscript8\aleph(x)=[x_{0},x_{1},\dots,x_{7}]^{T}\in\mathbb{R}^{8}roman_ℵ ( italic_x ) = [ italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT 8 end_POSTSUPERSCRIPT. Then, the injective map** :𝕆8×8:𝕆superscript88\gimel:\mathbb{O}\rightarrow\mathbb{R}^{8\times 8}roman_ℷ : blackboard_O → blackboard_R start_POSTSUPERSCRIPT 8 × 8 end_POSTSUPERSCRIPT is the real matrix representation of an octonion number [35]:

(x)=[x0x1x2x3x4x5x6x7x1x0x3x2x5x4x7x6x2x3x0x1x6x7x4x5x3x2x1x0x7x6x5x4x4x5x6x7x0x1x2x3x5x4x7x6x1x0x3x2x6x7x4x5x2x3x0x1x7x6x5x4x3x2x1x0].𝑥matrixsubscript𝑥0subscript𝑥1subscript𝑥2subscript𝑥3subscript𝑥4subscript𝑥5subscript𝑥6subscript𝑥7subscript𝑥1subscript𝑥0subscript𝑥3subscript𝑥2subscript𝑥5subscript𝑥4subscript𝑥7subscript𝑥6subscript𝑥2subscript𝑥3subscript𝑥0subscript𝑥1subscript𝑥6subscript𝑥7subscript𝑥4subscript𝑥5subscript𝑥3subscript𝑥2subscript𝑥1subscript𝑥0subscript𝑥7subscript𝑥6subscript𝑥5subscript𝑥4subscript𝑥4subscript𝑥5subscript𝑥6subscript𝑥7subscript𝑥0subscript𝑥1subscript𝑥2subscript𝑥3subscript𝑥5subscript𝑥4subscript𝑥7subscript𝑥6subscript𝑥1subscript𝑥0subscript𝑥3subscript𝑥2subscript𝑥6subscript𝑥7subscript𝑥4subscript𝑥5subscript𝑥2subscript𝑥3subscript𝑥0subscript𝑥1subscript𝑥7subscript𝑥6subscript𝑥5subscript𝑥4subscript𝑥3subscript𝑥2subscript𝑥1subscript𝑥0\gimel(x)=\begin{bmatrix}x_{0}&-x_{1}&-x_{2}&-x_{3}&-x_{4}&-x_{5}&-x_{6}&-x_{7% }\\ x_{1}&x_{0}&x_{3}&-x_{2}&x_{5}&-x_{4}&-x_{7}&x_{6}\\ x_{2}&-x_{3}&x_{0}&x_{1}&x_{6}&x_{7}&-x_{4}&-x_{5}\\ x_{3}&x_{2}&-x_{1}&x_{0}&x_{7}&-x_{6}&x_{5}&-x_{4}\\ x_{4}&-x_{5}&-x_{6}&-x_{7}&x_{0}&x_{1}&x_{2}&x_{3}\\ x_{5}&x_{4}&-x_{7}&x_{6}&-x_{1}&x_{0}&-x_{3}&x_{2}\\ x_{6}&x_{7}&x_{4}&-x_{5}&-x_{2}&x_{3}&x_{0}&-x_{1}\\ x_{7}&-x_{6}&x_{5}&x_{4}&-x_{3}&-x_{2}&x_{1}&x_{0}\end{bmatrix}.roman_ℷ ( italic_x ) = [ start_ARG start_ROW start_CELL italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_CELL start_CELL - italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] .

Both representations \alephroman_ℵ and \gimelroman_ℷ are also easily extended to vector/matrix octonion variables i.e., given 𝐀𝕆m×n𝐀superscript𝕆𝑚𝑛\mathbf{A}\in\mathbb{O}^{m\times n}bold_A ∈ blackboard_O start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT, we have (𝐀)8n×m𝐀superscript8𝑛𝑚\aleph(\mathbf{A})\in\mathbb{R}^{8n\times m}roman_ℵ ( bold_A ) ∈ blackboard_R start_POSTSUPERSCRIPT 8 italic_n × italic_m end_POSTSUPERSCRIPT and (𝐀)8m×8n𝐀superscript8𝑚8𝑛\gimel({\mathbf{A}})\in\mathbb{R}^{8m\times 8n}roman_ℷ ( bold_A ) ∈ blackboard_R start_POSTSUPERSCRIPT 8 italic_m × 8 italic_n end_POSTSUPERSCRIPT. Consider 𝐱𝕆n𝐱superscript𝕆𝑛\mathbf{x}\in\mathbb{O}^{n}bold_x ∈ blackboard_O start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and 𝐀𝕆m×n𝐀superscript𝕆𝑚𝑛\mathbf{A}\in\mathbb{O}^{m\times n}bold_A ∈ blackboard_O start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT, it holds (𝐀𝐱)=(𝐀)(𝐱)𝐀𝐱𝐀𝐱\aleph(\mathbf{Ax})=\gimel(\mathbf{A})\aleph(\mathbf{x})roman_ℵ ( bold_Ax ) = roman_ℷ ( bold_A ) roman_ℵ ( bold_x ) and 𝐱2=(𝐱)2subscriptnorm𝐱2subscriptnorm𝐱2\|\mathbf{x}\|_{2}=\|\aleph(\mathbf{x})\|_{2}∥ bold_x ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ∥ roman_ℵ ( bold_x ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. This allows us to convert the octonion product into a real-valued matrix/vector multiplication that obeys the octonion product rules. We later employ this representation for gradient-based algorithms for OPR.

III OPR and Trivial Ambiguity

Consider the octonion-valued signal 𝒙𝕆n𝒙superscript𝕆𝑛\bm{x}\in\mathbb{O}^{n}bold_italic_x ∈ blackboard_O start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and its phaseless measurements 𝐲=|𝐀𝐱|2+m𝐲superscript𝐀𝐱2superscriptsubscript𝑚\mathbf{y}=|\mathbf{A}\mathbf{x}|^{2}\in\mathbb{R}_{+}^{m}bold_y = | bold_Ax | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT where 𝐀𝕆m×n𝐀superscript𝕆𝑚𝑛\mathbf{A}\in\mathbb{O}^{m\times n}bold_A ∈ blackboard_O start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT is the octonion-valued sensing matrix. Our goal is to recover the octonion-valued signal 𝐱𝐱\mathbf{x}bold_x from its phaseless measurements 𝐲𝐲\mathbf{y}bold_y. Traditional WF for high dimensional signals would require concatenating all signal components thereby discarding any interaction between them. It is, therefore, desired to devise OPR recovery that also obeys octonion algebra.

As in conventional PR problems, there also exists an intrinsic trivial ambiguity in OPR as explained below.
Trivial Ambiguity: Given a unit octonion q𝑞qitalic_q, |q|=1𝑞1|q|=1| italic_q | = 1, the signal 𝐱𝐱\mathbf{x}bold_x scaled by a global right octonion factor i.e., q𝑞qitalic_q is right-multiplied to all the elements of signal 𝐱𝐱\mathbf{x}bold_x leads to the same measurements, i.e., |𝐀𝐱q|2=|𝐀𝐱|2superscript𝐀𝐱𝑞2superscript𝐀𝐱2|\mathbf{Ax}q|^{2}=|\mathbf{Ax}|^{2}| bold_Ax italic_q | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = | bold_Ax | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. However, since the octonion algebra is non-commutative, we have |𝐀q𝐱|2|𝐀𝐱|2superscript𝐀𝑞𝐱2superscript𝐀𝐱2|\mathbf{A}q\mathbf{x}|^{2}\neq|\mathbf{Ax}|^{2}| bold_A italic_q bold_x | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≠ | bold_Ax | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Our goal is to recover 𝐱𝐱\mathbf{x}bold_x up to a trivial ambiguity of only on the right octonion phase factor.

To this end, first define 𝐱=𝐲q𝐱𝐲𝑞\mathbf{x}=\mathbf{y}qbold_x = bold_y italic_q. We show that |𝐚H𝐱|2=|𝐚H𝐲|2superscriptsuperscriptsubscript𝐚𝐻𝐱2superscriptsuperscriptsubscript𝐚𝐻𝐲2|\mathbf{a}_{\ell}^{H}\mathbf{x}|^{2}=|\mathbf{a}_{\ell}^{H}\mathbf{y}|^{2}| bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_x | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = | bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_y | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT for all =1,,n1𝑛\ell=1,\dots,nroman_ℓ = 1 , … , italic_n holds with high probability. We have |𝐚H𝐱|2|𝐚H𝐲|2=𝐱𝐱H𝐲𝐲H,𝐚𝐚Hsuperscriptsuperscriptsubscript𝐚𝐻𝐱2superscriptsuperscriptsubscript𝐚𝐻𝐲2subscriptsuperscript𝐱𝐱𝐻superscript𝐲𝐲𝐻subscript𝐚superscriptsubscript𝐚𝐻|\mathbf{a}_{\ell}^{H}\mathbf{x}|^{2}-|\mathbf{a}_{\ell}^{H}\mathbf{y}|^{2}=% \langle\mathbf{x}\mathbf{x}^{H}-\mathbf{y}\mathbf{y}^{H},\mathbf{a}_{\ell}% \mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}| bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_x | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - | bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_y | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ⟨ bold_xx start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT - bold_yy start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT , bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT. Further,

=1m(|𝐚H𝐱|2|𝐚H𝐲|2)2𝐱𝐱H𝐲𝐲H,=1m𝐚𝐚H.superscriptsubscript1𝑚superscriptsuperscriptsuperscriptsubscript𝐚𝐻𝐱2superscriptsuperscriptsubscript𝐚𝐻𝐲22subscriptsuperscript𝐱𝐱𝐻superscript𝐲𝐲𝐻superscriptsubscript1𝑚subscript𝐚superscriptsubscript𝐚𝐻\sum_{\ell=1}^{m}\left(|\mathbf{a}_{\ell}^{H}\mathbf{x}|^{2}-|\mathbf{a}_{\ell% }^{H}\mathbf{y}|^{2}\right)^{2}\geq\left\langle\mathbf{x}\mathbf{x}^{H}-% \mathbf{y}\mathbf{y}^{H},{\sum_{\ell=1}^{m}}\mathbf{a}_{\ell}\mathbf{a}_{\ell}% ^{H}\right\rangle_{\mathbb{R}}.∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( | bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_x | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - | bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_y | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ ⟨ bold_xx start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT - bold_yy start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT , ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT . (1)

To lower bound the left-hand side term – a quadratic stochastic process – we, therefore, employ the small ball method [36]. Recall the following Proposition 1:

Proposition 1 (Lower bound on quadratic stochastic process).

[36, Theorem 2.1] Assume βsubscript𝛽\beta_{\ell}italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT where =1,,m1𝑚\ell=1,\dots,mroman_ℓ = 1 , … , italic_m to be independent copies of β𝛽\betaitalic_β. Denote a family of functions that satisfy a uniform small-ball estimation by \mathcal{F}caligraphic_F. For a constant τ>0𝜏0\tau>0italic_τ > 0, we have Q(τ)=inff[|f|τ]>0subscript𝑄𝜏subscriptinf𝑓delimited-[]𝑓𝜏0Q_{\mathcal{F}}(\tau)=\operatorname{inf}_{f\in\mathcal{F}}\mathbb{P}\left[|f|% \geq\tau\right]>0italic_Q start_POSTSUBSCRIPT caligraphic_F end_POSTSUBSCRIPT ( italic_τ ) = roman_inf start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT blackboard_P [ | italic_f | ≥ italic_τ ] > 0 and based on the expectation of Rademacher process Rm()=𝔼[supf|1m=1mεf(β)|]subscript𝑅𝑚𝔼delimited-[]subscriptsupremum𝑓1𝑚superscriptsubscript1𝑚subscript𝜀𝑓subscript𝛽R_{m}(\mathcal{F})=\mathbb{E}\left[\sup_{f\in\mathcal{F}}\left|\frac{1}{m}\sum% _{\ell=1}^{m}\varepsilon_{\ell}f(\mathbf{\beta_{\ell}})\right|\right]italic_R start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( caligraphic_F ) = blackboard_E [ roman_sup start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT | divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_f ( italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) | ], where {ε}=1nsuperscriptsubscriptsubscript𝜀1𝑛\{\varepsilon_{\ell}\}_{\ell=1}^{n}{ italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT } start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT are independent, symmetric, binary-valued random variables ε{1,1}subscript𝜀11\varepsilon_{\ell}\in\{-1,1\}italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∈ { - 1 , 1 }. Then, for probability at least 1e2t21superscript𝑒2superscript𝑡21-e^{-2t^{2}}1 - italic_e start_POSTSUPERSCRIPT - 2 italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT for constant t>0𝑡0t>0italic_t > 0,

inff|=1mεf(β)|τ2(Q(2τ)4τRmtn).subscriptinfimum𝑓superscriptsubscript1𝑚subscript𝜀𝑓subscript𝛽superscript𝜏2subscript𝑄2𝜏4𝜏subscript𝑅𝑚𝑡𝑛\inf_{f\in\mathcal{F}}\left|\sum_{\ell=1}^{m}\varepsilon_{\ell}f(\mathbf{\beta% _{\ell}})\right|\leq\tau^{2}\left(Q_{\mathcal{F}}(2\tau)-\frac{4}{\tau}R_{m}{% \mathcal{F}}-\frac{t}{\sqrt{n}}\right).roman_inf start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT | ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_f ( italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) | ≤ italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_Q start_POSTSUBSCRIPT caligraphic_F end_POSTSUBSCRIPT ( 2 italic_τ ) - divide start_ARG 4 end_ARG start_ARG italic_τ end_ARG italic_R start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT caligraphic_F - divide start_ARG italic_t end_ARG start_ARG square-root start_ARG italic_n end_ARG end_ARG ) . (2)

Due to the randomness of the sensing matrix 𝐀𝐀\mathbf{A}bold_A and assuming that the octonion signal follows an octonion Gaussian distribution, we employ Proposition 1 to establish the following result about the trivial ambiguity of OPR.

Theorem 2 (Trivial ambiguity of right-octonion phase factor).

Consider the octonion variables 𝐲𝐲\mathbf{y}bold_y and 𝐱=𝐲q𝐱𝐲𝑞\mathbf{x}=\mathbf{y}qbold_x = bold_y italic_q, where 𝐲𝒩𝒪nsimilar-to𝐲superscriptsubscript𝒩𝒪𝑛\mathbf{y}\sim\mathcal{N}_{\mathcal{O}}^{n}bold_y ∼ caligraphic_N start_POSTSUBSCRIPT caligraphic_O end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT with q𝕆𝑞𝕆q\in\mathbb{O}italic_q ∈ blackboard_O. Define the sensing matrix 𝐀𝒩𝒪m×nsimilar-to𝐀superscriptsubscript𝒩𝒪𝑚𝑛\mathbf{A}\sim\mathcal{N}_{\mathcal{O}}^{m\times n}bold_A ∼ caligraphic_N start_POSTSUBSCRIPT caligraphic_O end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT with rows 𝐚𝕆nsubscript𝐚superscript𝕆𝑛\mathbf{a}_{\ell}\in\mathbb{O}^{n}bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∈ blackboard_O start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT for =1,,m1𝑚\ell=1,\dots,mroman_ℓ = 1 , … , italic_m. Then, with a probability 1e12c~2m1superscript𝑒12superscript~𝑐2𝑚1-e^{-\frac{1}{2}\tilde{c}^{2}m}1 - italic_e start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG over~ start_ARG italic_c end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT for some positive constant c~~𝑐\tilde{c}over~ start_ARG italic_c end_ARG, =1m(|𝐚H𝐱||𝐚H𝐲|)c~m𝐱𝐱H𝐲𝐲HF2superscriptsubscript1𝑚superscriptsubscript𝐚𝐻𝐱superscriptsubscript𝐚𝐻𝐲~𝑐𝑚superscriptsubscriptnormsuperscript𝐱𝐱𝐻superscript𝐲𝐲𝐻𝐹2\sum_{\ell=1}^{m}(|\mathbf{a}_{\ell}^{H}\mathbf{x}|-|\mathbf{a}_{\ell}^{H}% \mathbf{y}|)\geq\tilde{c}m\|\mathbf{xx}^{H}-\mathbf{yy}^{H}\|_{F}^{2}∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( | bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_x | - | bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_y | ) ≥ over~ start_ARG italic_c end_ARG italic_m ∥ bold_xx start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT - bold_yy start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, with c~>0~𝑐0\tilde{c}>0over~ start_ARG italic_c end_ARG > 0.

Proof:

See Appendix. ∎

IV Recovery Algorithm

Our proposed OWF algorithm performs Wirtinger-like iterations. These are similar to a gradient descent approach but also take into account the octonion algebra. However, the non-convexity of the OPR problem implies that we also suitably initialize the algorithm. To this end, we employ a spectral initialization approach.
OWF Algorithm: The octonion algebra is non-associative and, hence, lacks a clear definition of derivatives for octonion-valued variables [37] including chain rule, high-dimensional gradients, and gradient-based methods such as the WF. Optimization-based methods that employ octonion representation, as in singular value decomposition (SVD) [21] or deep octonion neural networks [38], usually resort to the pseudo-real-matrix representation to perform optimization over the real-valued variable. Inspired by this approach, we propose using this representation to solve the following OPR optimization:

𝐱=argmin𝐱~𝕆n=1m(|𝐚H𝐱~|2𝐲)2.superscript𝐱subscriptargmin~𝐱superscript𝕆𝑛superscriptsubscript1𝑚superscriptsuperscriptsuperscriptsubscript𝐚𝐻~𝐱2subscript𝐲2\displaystyle\mathbf{x}^{\ast}=\operatorname*{arg\,min}_{\widetilde{\mathbf{x}% }\in\mathbb{O}^{n}}\sum_{\ell=1}^{m}\left(|\mathbf{a}_{\ell}^{H}\widetilde{% \mathbf{x}}|^{2}-\mathbf{y}_{\ell}\right)^{2}.bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT over~ start_ARG bold_x end_ARG ∈ blackboard_O start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( | bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over~ start_ARG bold_x end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - bold_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (3)

Employing the real matrix representation, the problem of recovery of the octonion signal becomes

𝐱superscript𝐱\displaystyle\mathbf{x}^{\ast}bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT =1(argmin𝐱~8n=1m((𝐚H)(𝐱~)22𝐲)2f(𝐱~))absentsuperscript1subscriptargmin~𝐱superscript8𝑛superscriptsuperscriptsubscript1𝑚superscriptsuperscriptsubscriptnormsuperscriptsubscript𝐚𝐻~𝐱22subscript𝐲2𝑓~𝐱\displaystyle=\aleph^{-1}\left(\operatorname*{arg\,min}_{\widetilde{\mathbf{x}% }\in\mathbb{R}^{8n}}\overbrace{\sum_{\ell=1}^{m}\left(\|\gimel\left(\mathbf{a}% _{\ell}^{H}\right)\aleph(\widetilde{\mathbf{x}})\|_{2}^{2}-\mathbf{y}_{\ell}% \right)^{2}}^{f(\widetilde{\mathbf{x}})}\right)= roman_ℵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT over~ start_ARG bold_x end_ARG ∈ blackboard_R start_POSTSUPERSCRIPT 8 italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT over⏞ start_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( ∥ roman_ℷ ( bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) roman_ℵ ( over~ start_ARG bold_x end_ARG ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - bold_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_POSTSUPERSCRIPT italic_f ( over~ start_ARG bold_x end_ARG ) end_POSTSUPERSCRIPT ) (4)

where the 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT norm comes from the observation that the norm of an octonion variable is the norm of its real representation.

Then, (4) is solved by gradient descent steps. Here, the key difference with respect to the traditional complex-valued approach lies in the gradient computation. In the complex-valued case, wherein the measurements are 𝐲=|𝐀𝐱|2𝐲superscript𝐀𝐱2\mathbf{y}=|\mathbf{A}\mathbf{x}|^{2}bold_y = | bold_Ax | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT with the signal 𝐱8n𝐱superscript8𝑛\mathbf{x}\in\mathbb{C}^{8n}bold_x ∈ blackboard_C start_POSTSUPERSCRIPT 8 italic_n end_POSTSUPERSCRIPT and sensing matrix 𝐀m×8n𝐀superscript𝑚8𝑛\mathbf{A}\in\mathbb{C}^{m\times 8n}bold_A ∈ blackboard_C start_POSTSUPERSCRIPT italic_m × 8 italic_n end_POSTSUPERSCRIPT. Here, the gradient update is 𝐱(|𝐚𝐱|2𝐲)2=(𝐚𝐱2y)(𝐚𝐚)𝐱\nabla_{\mathbf{x}}\left(|\mathbf{a}_{\ell}^{*}{\mathbf{x}}|^{2}-\mathbf{y}_{% \ell}\right)^{2}=(\|\mathbf{a}_{\ell}^{*}\mathbf{x}\|^{2}-y_{\ell})(\mathbf{a}% _{\ell}\mathbf{a}_{\ell}^{*})\mathbf{x}∇ start_POSTSUBSCRIPT bold_x end_POSTSUBSCRIPT ( | bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT bold_x | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - bold_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ( ∥ bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT bold_x ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) ( bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_x. However, the octonion real-matrix representation in the OWF considers the interaction among all signal components, which is desired for multispectral we imaging. The OWF gradient of the cost function is 𝐱~f(𝐱~)==1m((𝐚H)(𝐱~)22𝐲)(𝐚H)T(𝐚H)(𝐱~)subscript~𝐱𝑓~𝐱superscriptsubscript1𝑚superscriptsubscriptnormsuperscriptsubscript𝐚𝐻~𝐱22subscript𝐲superscriptsuperscriptsubscript𝐚𝐻𝑇superscriptsubscript𝐚𝐻~𝐱\nabla_{\widetilde{\mathbf{x}}}f(\widetilde{\mathbf{x}})=\sum_{\ell=1}^{m}(\|% \gimel\left(\mathbf{a}_{\ell}^{H}\right)\aleph(\widetilde{\mathbf{x}})\|_{2}^{% 2}-\mathbf{y}_{\ell})\gimel\left(\mathbf{a}_{\ell}^{H}\right)^{T}\gimel\left(% \mathbf{a}_{\ell}^{H}\right)\aleph(\widetilde{\mathbf{x}})∇ start_POSTSUBSCRIPT over~ start_ARG bold_x end_ARG end_POSTSUBSCRIPT italic_f ( over~ start_ARG bold_x end_ARG ) = ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( ∥ roman_ℷ ( bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) roman_ℵ ( over~ start_ARG bold_x end_ARG ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - bold_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) roman_ℷ ( bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_ℷ ( bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) roman_ℵ ( over~ start_ARG bold_x end_ARG ). Then, the OWF update process in the i𝑖iitalic_i-th iteration, where i{1,,I}𝑖1𝐼i\in\{1,\dots,I\}italic_i ∈ { 1 , … , italic_I } such that I𝐼Iitalic_I is the maximum number of iterations, becomes 𝐱~(i)=𝐱~(i1)αf(𝐱~(i1))superscript~𝐱𝑖superscript~𝐱𝑖1𝛼𝑓superscript~𝐱𝑖1\widetilde{\mathbf{x}}^{(i)}=\widetilde{\mathbf{x}}^{(i-1)}-\alpha\nabla f(% \widetilde{\mathbf{x}}^{(i-1)})over~ start_ARG bold_x end_ARG start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT = over~ start_ARG bold_x end_ARG start_POSTSUPERSCRIPT ( italic_i - 1 ) end_POSTSUPERSCRIPT - italic_α ∇ italic_f ( over~ start_ARG bold_x end_ARG start_POSTSUPERSCRIPT ( italic_i - 1 ) end_POSTSUPERSCRIPT ), where α𝛼\alphaitalic_α is a suitable selected gradient step size. From inverse real representation of 𝐱~~𝐱\widetilde{\mathbf{x}}over~ start_ARG bold_x end_ARG, the octonion signal is 𝐱=1(𝐱~(I))superscript𝐱superscript1superscript~𝐱𝐼\mathbf{x}^{\ast}=\aleph^{-1}(\widetilde{\mathbf{x}}^{(I)})bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_ℵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( over~ start_ARG bold_x end_ARG start_POSTSUPERSCRIPT ( italic_I ) end_POSTSUPERSCRIPT ).
Initialization: A key step in most nonconvex PR approaches is the initialization of the algorithm because spurious points in the cost function can lead to local minima. Here, we employ the popular spectral initialization [39], wherein our goal is to obtain the initial estimate of the true signal by computing the leading eigenvector of the octonion-valued matrix 𝐘=1m=1m𝐲𝐚𝐚H𝕆n×n𝐘1𝑚superscriptsubscript1𝑚subscript𝐲subscript𝐚superscriptsubscript𝐚𝐻superscript𝕆𝑛𝑛\mathbf{Y}=\frac{1}{m}\sum_{\ell=1}^{m}\mathbf{y}_{\ell}\mathbf{a}_{\ell}% \mathbf{a}_{\ell}^{H}\in\mathbb{O}^{n\times n}bold_Y = divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT bold_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ∈ blackboard_O start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT. This may be achieved by solving an octonionic right eigenvalue decomposition. In [40], this was solved for small octonion-valued matrices (n<4𝑛4n<4italic_n < 4). However, [40] cannot be extended to larger matrices. Therefore, we propose to adapt the power method for the right quaternion eigenvalue decomposition [41] to compute the leading eigenvalue of 𝐘𝐘\mathbf{Y}bold_Y. This method employs power iterations over the real matrix representation and computes the inverse real representation operator 1()superscript1\aleph^{-1}(\cdot)roman_ℵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( ⋅ ) to yield the equivalent octonion leading eigenvalue.

To measure the error between the estimated octonion signal 𝐱superscript𝐱\mathbf{x}^{\ast}bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and its true value 𝐱𝐱\mathbf{x}bold_x, define the distance d(𝐱,𝐱)=minz𝐱𝐱z𝑑𝐱superscript𝐱subscript𝑧normsuperscript𝐱𝐱𝑧d(\mathbf{x},\mathbf{x}^{\ast})=\min_{z}\|\mathbf{x}^{\ast}-\mathbf{x}z\|italic_d ( bold_x , bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = roman_min start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT ∥ bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - bold_x italic_z ∥ where z{z𝕆||z|=1}𝑧conditional-set𝑧𝕆𝑧1z\in\{z\in\mathbb{O}||z|=1\}italic_z ∈ { italic_z ∈ blackboard_O | | italic_z | = 1 } is only-phase octonion factor. We represent this distance in terms of the pseudo-real-matrix representation of octonions as d(𝐱,𝐱)=minz(𝐱)(𝐱))(z)d(\mathbf{x}^{\ast},\mathbf{x})=\min_{z}\|\aleph(\mathbf{x}^{\ast})-\gimel(% \mathbf{x}))\aleph(z)\|italic_d ( bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_x ) = roman_min start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT ∥ roman_ℵ ( bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - roman_ℷ ( bold_x ) ) roman_ℵ ( italic_z ) ∥, using the property 𝐱=(𝐱)norm𝐱norm𝐱\|\mathbf{x}\|=\|\aleph(\mathbf{x})\|∥ bold_x ∥ = ∥ roman_ℵ ( bold_x ) ∥. After some simple algebra, we get d(𝐱,𝐱)=(𝐱)(𝐱))g(𝐱)d(\mathbf{x},\mathbf{x}^{\ast})=\|\aleph(\mathbf{x}^{\ast})-\gimel(\mathbf{x})% )g(\mathbf{x}^{\ast})\|italic_d ( bold_x , bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = ∥ roman_ℵ ( bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - roman_ℷ ( bold_x ) ) italic_g ( bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ∥, where g(𝐱)=sign(((𝐱)T(𝐱))((𝐱)T(𝐱)1)g(\mathbf{x}^{\ast})=\operatorname{sign}\left(\left(\gimel(\mathbf{x})^{T}% \aleph(\mathbf{x}^{\ast})\right)\left(\gimel(\mathbf{x})^{T}\gimel(\mathbf{x}% \right)^{-1}\right)italic_g ( bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = roman_sign ( ( roman_ℷ ( bold_x ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_ℵ ( bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) ( roman_ℷ ( bold_x ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_ℷ ( bold_x ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ).

V Numerical Experiments

Refer to caption
Figure 1: (a) Success rate of OWF and concatenated WF for different value of sampling complexity m/n𝑚𝑛m/nitalic_m / italic_n with n=100𝑛100n=100italic_n = 100. (b) Convergence rate of OWF for m/n=20𝑚𝑛20m/n=20italic_m / italic_n = 20 for 2000 iterations.

We validated our OWF algorithm through various numerical experiments using the quaternion and octonion toolbox for MATLAB [42]. Unless otherwise noted, the sensing matrix was drawn from an octonion Gaussian distribution i.e., 𝐀𝒩Om×nsimilar-to𝐀superscriptsubscript𝒩𝑂𝑚𝑛\mathbf{A}{\sim}\mathcal{N}_{O}^{m\times n}bold_A ∼ caligraphic_N start_POSTSUBSCRIPT italic_O end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT. The maximum OWF iterations were set to I=2000𝐼2000I=2000italic_I = 2000. We set α=5m=1m𝐲𝛼5𝑚superscriptsubscript1𝑚subscript𝐲\alpha=\frac{5m}{\sum_{\ell=1}^{m}\mathbf{y}_{\ell}}italic_α = divide start_ARG 5 italic_m end_ARG start_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT bold_y start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG [30].
Synthetic Data: We experimented with synthetic data, wherein the signal 𝐱𝕆n×1𝐱superscript𝕆𝑛1\mathbf{x}\in\mathbb{O}^{n\times 1}bold_x ∈ blackboard_O start_POSTSUPERSCRIPT italic_n × 1 end_POSTSUPERSCRIPT was generated as 𝐱𝒩𝒪nsimilar-to𝐱superscriptsubscript𝒩𝒪𝑛\mathbf{x}\sim\mathcal{N}_{\mathcal{O}}^{n}bold_x ∼ caligraphic_N start_POSTSUBSCRIPT caligraphic_O end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, where we normalized the signal such that 𝐱=1norm𝐱1\|\mathbf{x}\|=1∥ bold_x ∥ = 1 and the signal dimension n=100𝑛100n=100italic_n = 100. Over 100 Monte Carlo simulations, we declare signal recovery a “success” when d(𝐱,𝐱)1e5𝑑𝐱superscript𝐱1superscript𝑒5d(\mathbf{x},\mathbf{x}^{\ast})\leq 1e^{-5}italic_d ( bold_x , bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ 1 italic_e start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT. Figure 1 (a) shows the success rate (the mean success of the 100 experiments) for varying sample complexity m/n𝑚𝑛m/nitalic_m / italic_n. We also compared OWF with the traditional WF (using PhasePack library [43]), wherein the signal is concatenation of all eight components, i.e., 𝐱=[𝐱0T,,𝐱7T]T8n𝐱superscriptsuperscriptsubscript𝐱0𝑇superscriptsubscript𝐱7𝑇𝑇superscript8𝑛\mathbf{x}=[\mathbf{x}_{0}^{T},\dots,\mathbf{x}_{7}^{T}]^{T}\in\mathbb{C}^{8n}bold_x = [ bold_x start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , … , bold_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT 8 italic_n end_POSTSUPERSCRIPT with a random complex-valued sensing matrix 𝐀m×8n𝐀superscript𝑚8𝑛\mathbf{A}\in\mathbb{C}^{m\times 8n}bold_A ∈ blackboard_C start_POSTSUPERSCRIPT italic_m × 8 italic_n end_POSTSUPERSCRIPT. We used d(𝐱,𝐱)=𝐱𝐱sign(𝐱H𝐱)𝑑𝐱superscript𝐱normsuperscript𝐱𝐱signsuperscript𝐱𝐻𝐱d(\mathbf{x},\mathbf{x}^{*})=\|\mathbf{x}^{*}-\mathbf{x}\operatorname{sign}(% \mathbf{x}^{H}\mathbf{x})\|italic_d ( bold_x , bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = ∥ bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - bold_x roman_sign ( bold_x start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_x ) ∥ to assess reconstruction quality. The OWF achieved almost perfect recovery for m/n>10𝑚𝑛10m/n>10italic_m / italic_n > 10 while traditional WF required m/n>20𝑚𝑛20m/n>20italic_m / italic_n > 20. Figure 1(b) plots the distance function d(𝐱,𝐱)𝑑𝐱superscript𝐱d(\mathbf{x},\mathbf{x}^{\ast})italic_d ( bold_x , bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) for each iteration for m/n=20𝑚𝑛20m/n=20italic_m / italic_n = 20 showing linear convergence of the OWF algorithm. Next, we tested the OWF algorithm for measurements corrupted by additive Gaussian noise i.e., 𝐲=|𝐀𝐱|2+𝝎𝐲superscript𝐀𝐱2𝝎\mathbf{y}=|\mathbf{Ax}|^{2}+\bm{\omega}bold_y = | bold_Ax | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + bold_italic_ω where 𝝎m𝝎superscript𝑚\bm{\omega}\in\mathbb{R}^{m}bold_italic_ω ∈ blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT is sampled from 𝝎𝒩(0,𝐲2210SNR10𝐈m)similar-to𝝎𝒩0superscriptsubscriptnorm𝐲22superscript10SNR10subscript𝐈𝑚\bm{\omega}\sim\mathcal{N}\left(0,\frac{\|\mathbf{y}\|_{2}^{2}}{10^{\frac{% \textrm{SNR}}{10}}}\mathbf{I}_{m}\right)bold_italic_ω ∼ caligraphic_N ( 0 , divide start_ARG ∥ bold_y ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 start_POSTSUPERSCRIPT divide start_ARG SNR end_ARG start_ARG 10 end_ARG end_POSTSUPERSCRIPT end_ARG bold_I start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ). We employed the same number of iterations and n=100𝑛100n=100italic_n = 100. We varied the sample complexity m/n𝑚𝑛m/nitalic_m / italic_n and the signal-to-noise-ratio (SNR) from 0 to 30 dB in steps of 5 dB. Figure 2 demonstrates that OWF recovers the octonion signal with high accuracy with m/n>17𝑚𝑛17m/n>17italic_m / italic_n > 17 and SNR>20absent20>20> 20 dB. The absence of a distinct phase transition in both Figures 1 and 2 can be attributed to the utilization of a pseudo-real matrix representation in the algorithm, as precise octonion calculus tools are unavailable.
Real Data: We also validated OWF-based OPR with real data. We used a spectral image (Figure 3a) from the CAVE multispectral image dataset [44]. We employ a central crop of 32×32323232\times 3232 × 32 pixel and select 8 equispaced spectral bands from the 31 original spectral bands ranging from 400 to 700 nm. Each band was vectorized and selected as a dimension of the octonion signal, thus, in this case, the octonion signal dimension was n=1024𝑛1024n=1024italic_n = 1024. We compared OWF reconstruction with the gradient descent (GD) algorithm [45]. We concatenated all color channels to form the signal 𝐱r8nsuperscript𝐱𝑟superscript8𝑛\mathbf{x}^{r}\in\mathbb{R}^{8n}bold_x start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT 8 italic_n end_POSTSUPERSCRIPT, the sensing matrix 𝐀rm×8nsuperscript𝐀𝑟superscript𝑚8𝑛\mathbf{A}^{r}\in\mathbb{R}^{m\times 8n}bold_A start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_m × 8 italic_n end_POSTSUPERSCRIPT and, hence, the measurements 𝐲r=|𝐀r𝐱r|2superscript𝐲𝑟superscriptsuperscript𝐀𝑟superscript𝐱𝑟2\mathbf{y}^{r}=|\mathbf{A}^{r}\mathbf{x}^{r}|^{2}bold_y start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT = | bold_A start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT bold_x start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Note that, unlike (4), this method doesn’t use the real-matrix representation for the product between the rows of 𝐀rsuperscript𝐀𝑟\mathbf{A}^{r}bold_A start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT (𝐚rsuperscriptsubscript𝐚𝑟\mathbf{a}_{\ell}^{r}bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT) and the signal 𝐱rsuperscript𝐱𝑟\mathbf{x}^{r}bold_x start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT. We employed the Lanzcos algorithm [46] with 100100100100 power iterations for initialization. Figure 3a depicts the success rate (d(𝐱,𝐱^)<103𝑑𝐱^𝐱superscript103d(\mathbf{x},\hat{\mathbf{x}})<10^{-3}italic_d ( bold_x , over^ start_ARG bold_x end_ARG ) < 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT) for 64 images of the dataset with varying sample complexity m/n={1,5,10,15,20,25,30}𝑚𝑛151015202530m/n=\{1,5,10,15,20,25,30\}italic_m / italic_n = { 1 , 5 , 10 , 15 , 20 , 25 , 30 }. Similar to synthetic data, the OWF (concatenation method) for real data shows perfect recovery for m/n>10𝑚𝑛10m/n>10italic_m / italic_n > 10 (m/n>20𝑚𝑛20m/n>20italic_m / italic_n > 20).

We examined spectral image recovery with m/n=15𝑚𝑛15m/n=15italic_m / italic_n = 15 and conducted OWF for I=2000𝐼2000I=2000italic_I = 2000 iterations using the sensing matrix 𝐀𝐀\mathbf{A}bold_A generated as in previous experiments. Figures 3b, c, and d depict the ground-truth image, OWF reconstruction, and reconstruction via concatenation, respectively. Quality assessment utilized the peak signal-to-noise ratio (PSNR) =20log(max(𝐱,𝐱)18ni=1n|𝐱i𝐱i|2)absent20max𝐱superscript𝐱18𝑛superscriptsubscript𝑖1𝑛superscriptsubscript𝐱𝑖subscriptsuperscript𝐱𝑖2=20\log\left(\frac{\operatorname{max}\left(\mathbf{x},\mathbf{x}^{\ast}\right)% }{\frac{1}{8n}\sum_{i=1}^{n}|\mathbf{x}_{i}-\mathbf{x}^{\ast}_{i}|^{2}}\right)= 20 roman_log ( divide start_ARG roman_max ( bold_x , bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_ARG start_ARG divide start_ARG 1 end_ARG start_ARG 8 italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - bold_x start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) [47]. Validation involved examining the recovery of spectral signatures, specifically vectors with eight octonion components at predefined pixel coordinates. Figure 3e demonstrates superior recovery performance with OWF over the real-valued approach with GD at coordinate (10,10) in the reference and reconstructed images.

Refer to caption
Figure 2: Success rate of OWF for different values of sampling complexity m/n𝑚𝑛m/nitalic_m / italic_n with n=30𝑛30n=30italic_n = 30 with measurements under additive Gaussian noise
Refer to caption
Figure 3: Reconstruction with real data. (a) Statistical performance for 64 images of the spectral image dataset. (b) RGB representation of the 8-channel spectral image and its individual 8 components on the right panel. (c) OWF-reconstructed image and its components; recovered image’s PSNR =39.01absent39.01=39.01= 39.01 dB (d) GD-reconstructed image and its components; recovered image’s PSNR =24.16absent24.16=24.16= 24.16 dB. (e) Recovered spectral signature.

VI Summary

We introduced an OPR algorithm for recovering 8-dimensional signals from phaseless measurements. The proposed OWF algorithm, derived from the pseudo-real-matrix representation of octonions, was validated through diverse experiments across different scenarios, sample sizes, noise levels, and real signals with multispectral images. This contributes to the advancement of hypercomplex PR applications [48].

Appendix: Proof of Theorem 2

From equation (1), we have f()=||f(\cdot)=|\cdot|italic_f ( ⋅ ) = | ⋅ |. First, we find the small-ball estimate bound of Q(τ)subscript𝑄𝜏Q_{\mathcal{F}}(\tau)italic_Q start_POSTSUBSCRIPT caligraphic_F end_POSTSUBSCRIPT ( italic_τ ). Define 𝐖=𝐱𝐱H𝐲𝐲𝐖superscript𝐱𝐱𝐻superscript𝐲𝐲\mathbf{W}=\mathbf{x}\mathbf{x}^{H}-\mathbf{y}\mathbf{y}^{*}bold_W = bold_xx start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT - bold_yy start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Using Paley-Zygmund inequality [49, Lemma 8.26], we get

Q(τ)=subscript𝑄𝜏absent\displaystyle Q_{\mathcal{F}}(\tau)=italic_Q start_POSTSUBSCRIPT caligraphic_F end_POSTSUBSCRIPT ( italic_τ ) = inf𝐖[|𝐖,𝐚𝐚H|2τ]subscriptinfimum𝐖delimited-[]superscriptsubscript𝐖subscript𝐚superscriptsubscript𝐚𝐻2𝜏\displaystyle\inf_{\mathbf{W}}\mathbb{P}[|\langle\mathbf{W},\mathbf{a}_{\ell}% \mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}|^{2}\leq\tau]roman_inf start_POSTSUBSCRIPT bold_W end_POSTSUBSCRIPT blackboard_P [ | ⟨ bold_W , bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_τ ]
inf𝐖𝔼[|𝐖,𝐚𝐚H|2]τ2𝔼[|𝐖,𝐚𝐚H|4],0<τ<1.formulae-sequenceabsentsubscriptinfimum𝐖𝔼delimited-[]superscriptsubscript𝐖subscript𝐚superscriptsubscript𝐚𝐻2superscript𝜏2𝔼delimited-[]superscriptsubscript𝐖subscript𝐚superscriptsubscript𝐚𝐻40𝜏1\displaystyle\leq\inf_{\mathbf{W}}\frac{\mathbb{E}\left[|\langle\mathbf{W},% \mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}|^{2}\right]-\tau^{2% }}{\mathbb{E}\left[|\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}% \rangle_{\mathbb{R}}|^{4}\right]},{\color[rgb]{0,0,0}{0<\tau<1}}.≤ roman_inf start_POSTSUBSCRIPT bold_W end_POSTSUBSCRIPT divide start_ARG blackboard_E [ | ⟨ bold_W , bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] - italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG blackboard_E [ | ⟨ bold_W , bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ] end_ARG , 0 < italic_τ < 1 . (5)

Then, we need to upper (lower) bound the denominator (numerator). Since 𝐖𝐖\mathbf{W}bold_W is a rank-2 matrix, we have 𝐖=λ1𝐡𝐡H+λ2𝐛𝐛H𝐖subscript𝜆1superscript𝐡𝐡𝐻subscript𝜆2superscript𝐛𝐛𝐻\mathbf{W}=\lambda_{1}\mathbf{h}\mathbf{h}^{H}+\lambda_{2}\mathbf{b}\mathbf{b}% ^{H}bold_W = italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_hh start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT + italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT bold_bb start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT, where λ1+λ2=1subscript𝜆1subscript𝜆21\lambda_{1}+\lambda_{2}=1italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 1 and 𝐡=𝐛=1norm𝐡norm𝐛1\|\mathbf{h}\|=\|\mathbf{b}\|=1∥ bold_h ∥ = ∥ bold_b ∥ = 1 are normalized eigenvalues and eigenvectors, respectively. Then,𝐖,𝐚𝐚H=λ1|𝐡H𝐚|2+λ2|𝐛H𝐚|2.subscript𝐖subscript𝐚superscriptsubscript𝐚𝐻subscript𝜆1superscriptsuperscript𝐡𝐻subscript𝐚2subscript𝜆2superscriptsuperscript𝐛𝐻subscript𝐚2\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}=% \lambda_{1}|\mathbf{h}^{H}\mathbf{a}_{\ell}|^{2}+\lambda_{2}|\mathbf{b}^{H}% \mathbf{a}_{\ell}|^{2}.⟨ bold_W , bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT = italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . Since 𝐀𝒩𝒪m×nsimilar-to𝐀superscriptsubscript𝒩𝒪𝑚𝑛\mathbf{A}\sim\mathcal{N}_{\mathcal{O}}^{m\times n}bold_A ∼ caligraphic_N start_POSTSUBSCRIPT caligraphic_O end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m × italic_n end_POSTSUPERSCRIPT and 𝐱𝒩𝒪nsimilar-to𝐱superscriptsubscript𝒩𝒪𝑛\mathbf{x}\sim\mathcal{N}_{\mathcal{O}}^{n}bold_x ∼ caligraphic_N start_POSTSUBSCRIPT caligraphic_O end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, then following the rotation invariance property of octonion product, 𝐡H𝐚superscript𝐡𝐻subscript𝐚\mathbf{h}^{H}\mathbf{a}_{\ell}bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT and 𝐛H𝐚superscript𝐛𝐻subscript𝐚{\color[rgb]{0,0,0}{\mathbf{b}^{H}}}\mathbf{a}_{\ell}bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT are independent copies of the octonion Gaussian distribution. We observe that |𝐡H𝐚|2superscriptsuperscript𝐡𝐻subscript𝐚2|\mathbf{h}^{H}\mathbf{a}_{\ell}|^{2}| bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and |𝐛𝐚|2superscriptsuperscript𝐛subscript𝐚2|\mathbf{b}^{*}\mathbf{a}_{\ell}|^{2}| bold_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT conform to the χ2superscript𝜒2\chi^{2}italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT distribution with 8 degrees of freedom. Leveraging computations of high-order moments [50] for the χ2superscript𝜒2\chi^{2}italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT distribution and after some algebraic manipulations, we get 𝔼[|𝐖,𝐚𝐚H|4]144𝔼[8|𝐡H𝐚|2+8|𝐛𝐚|2]=24Γ(16)44Γ(8)c0𝔼delimited-[]superscriptsubscript𝐖subscript𝐚superscriptsubscript𝐚𝐻41superscript44𝔼delimited-[]8superscriptsuperscript𝐡𝐻subscript𝐚28superscriptsuperscript𝐛subscript𝐚2superscript24Γ16superscript44Γ8subscript𝑐0\mathbb{E}\left[|\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}% \rangle_{\mathbb{R}}|^{4}\right]\leq\frac{1}{4^{4}}\mathbb{E}\left[8|\mathbf{h% }^{H}\mathbf{a}_{\ell}|^{2}+8|\mathbf{b}^{*}\mathbf{a}_{\ell}|^{2}\right]=% \frac{2^{4}\Gamma(16)}{4^{4}\Gamma(8)}\coloneqq c_{0}blackboard_E [ | ⟨ bold_W , bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ] ≤ divide start_ARG 1 end_ARG start_ARG 4 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG blackboard_E [ 8 | bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 8 | bold_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = divide start_ARG 2 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_Γ ( 16 ) end_ARG start_ARG 4 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_Γ ( 8 ) end_ARG ≔ italic_c start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. On the other hand, 𝔼[|𝐖,𝐚𝐚H|2]116(𝔼[8|𝐡H𝐚|2]𝔼[8|𝐛𝐚|2])=14c1𝔼delimited-[]superscriptsubscript𝐖subscript𝐚superscriptsubscript𝐚𝐻2116𝔼delimited-[]8superscriptsuperscript𝐡𝐻subscript𝐚2𝔼delimited-[]8superscriptsuperscript𝐛subscript𝐚214subscript𝑐1\mathbb{E}\left[|\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}% \rangle_{\mathbb{R}}|^{2}\right]\geq\frac{1}{16}\left(\mathbb{E}\left[8|% \mathbf{h}^{H}\mathbf{a}_{\ell}|^{2}\right]-\mathbb{E}\left[8|\mathbf{b}^{*}% \mathbf{a}_{\ell}|^{2}\right]\right)=\frac{1}{4}\coloneqq c_{1}blackboard_E [ | ⟨ bold_W , bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ≥ divide start_ARG 1 end_ARG start_ARG 16 end_ARG ( blackboard_E [ 8 | bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] - blackboard_E [ 8 | bold_b start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ) = divide start_ARG 1 end_ARG start_ARG 4 end_ARG ≔ italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Define the constant c=c1c0𝑐subscript𝑐1subscript𝑐0c=\frac{c_{1}}{c_{0}}italic_c = divide start_ARG italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_c start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG. Using the property |𝐏,𝐃|rank(𝐏)𝐏F𝐃2𝐏𝐃rank𝐏subscriptnorm𝐏𝐹subscriptnorm𝐃2{|\langle\mathbf{P},\mathbf{D}\rangle|}\leq\sqrt{\operatorname{rank}(\mathbf{P% })}\|\mathbf{P}\|_{F}\|\mathbf{D}\|_{2}| ⟨ bold_P , bold_D ⟩ | ≤ square-root start_ARG roman_rank ( bold_P ) end_ARG ∥ bold_P ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ∥ bold_D ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT from the real-matrix representation of octonion numbers, we have

Rmsubscript𝑅𝑚\displaystyle R_{m}italic_R start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT =𝔼[sup𝐖1m=1mε𝐖,𝐚𝐚H]2n𝔼[=1mε𝐚𝐚H].absent𝔼delimited-[]subscriptsupremum𝐖1𝑚superscriptsubscript1𝑚subscript𝜀𝐖subscript𝐚superscriptsubscript𝐚𝐻2𝑛𝔼delimited-[]normsuperscriptsubscript1𝑚subscript𝜀subscript𝐚superscriptsubscript𝐚𝐻\displaystyle=\mathbb{E}\left[\sup_{\mathbf{W}}\frac{1}{m}\sum_{\ell=1}^{m}% \varepsilon_{\ell}\langle\mathbf{W},\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}% \rangle\right]\leq\frac{\sqrt{2}}{n}\mathbb{E}\left[\left\|\sum_{\ell=1}^{m}% \varepsilon_{\ell}\mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\right\|\right].= blackboard_E [ roman_sup start_POSTSUBSCRIPT bold_W end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⟨ bold_W , bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ⟩ ] ≤ divide start_ARG square-root start_ARG 2 end_ARG end_ARG start_ARG italic_n end_ARG blackboard_E [ ∥ ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ∥ ] .

We can decompose the octonion variables following the Carley-Dickson octonion construction: 𝐚=(𝜶+𝜷e2)+(𝜸+𝜼e2)e4subscript𝐚subscript𝜶subscript𝜷subscript𝑒2subscript𝜸subscript𝜼subscript𝑒2subscript𝑒4\mathbf{a}_{\ell}=(\bm{\alpha}_{\ell}+\bm{\beta}_{\ell}e_{2})+(\bm{\gamma}_{% \ell}+\bm{\eta}_{\ell}e_{2})e_{4}bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = ( bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT + bold_italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) + ( bold_italic_γ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT + bold_italic_η start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) italic_e start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT. After some tedious algebra on (Appendix: Proof of Theorem 2) using the aforementioned octonion representation, we obtain

=1mε𝐚𝐚H=normsuperscriptsubscript1𝑚subscript𝜀subscript𝐚superscriptsubscript𝐚𝐻absent\displaystyle\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\mathbf{a}_{\ell}% \mathbf{a}_{\ell}^{H}\right\|=∥ ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ∥ = =1mε((𝜶+𝜷e2)+(𝜸+𝜼e2)e4)\displaystyle\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}((\bm{\alpha}_{\ell}+% \bm{\beta}_{\ell}e_{2})+(\bm{\gamma}_{\ell}+\bm{\eta}_{\ell}e_{2})e_{4})\cdot\right.∥ ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ( ( bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT + bold_italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) + ( bold_italic_γ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT + bold_italic_η start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) italic_e start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ) ⋅
((𝜶+𝜷e2)+(𝜸+𝜼e2)e4).\displaystyle\left.((\bm{\alpha}_{\ell}+\bm{\beta}_{\ell}e_{2})+(\bm{\gamma}_{% \ell}+\bm{\eta}_{\ell}e_{2})e_{4})^{*}\right\|.( ( bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT + bold_italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) + ( bold_italic_γ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT + bold_italic_η start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) italic_e start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ . (6)

Re-arranging the terms yields

=1mε𝐚𝐚H=normsuperscriptsubscript1𝑚subscript𝜀subscript𝐚superscriptsubscript𝐚𝐻absent\displaystyle\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\mathbf{a}_{\ell}% \mathbf{a}_{\ell}^{H}\right\|=∥ ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ∥ = =1mε(𝜶𝜶+𝜷𝜷+𝜸𝜸+𝜼𝜼+\displaystyle\left\|\sum_{\ell=1}^{m}\varepsilon_{\ell}(\bm{\alpha}_{\ell}\bm{% \alpha}_{\ell}^{*}+\bm{\beta}_{\ell}\bm{\beta}_{\ell}^{*}+\bm{\gamma}_{\ell}% \bm{\gamma}_{\ell}^{*}+\bm{\eta}_{\ell}\bm{\eta}_{\ell}^{*}+\right.∥ ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ( bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + bold_italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + bold_italic_γ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_italic_γ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + bold_italic_η start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_italic_η start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT +
2𝜷𝜶H+2𝜼𝜸H).\displaystyle\left.2\bm{\beta}_{\ell}\bm{\alpha}^{H}_{\ell}+2\bm{\eta}_{\ell}% \bm{\gamma}^{H}_{\ell})\right\|.2 bold_italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_italic_α start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT + 2 bold_italic_η start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_italic_γ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) ∥ . (7)

From random matrix theory, we have an upper bound on the Rademacher Gaussian series. Using [51, Theorem 4.1.1] in (Appendix: Proof of Theorem 2) gives 𝔼[=1mε𝜶𝜶]2𝜶𝜶logn\mathbb{E}\left[\|\sum_{\ell=1}^{m}\varepsilon_{\ell}\bm{\alpha}_{\ell}\bm{% \alpha}_{\ell}^{*}\right]\leq\sqrt{2{\sum_{\ell}\|\bm{\alpha}_{\ell}\bm{\alpha% }_{\ell}^{*}\|}\log n}blackboard_E [ ∥ ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ] ≤ square-root start_ARG 2 ∑ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∥ bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ roman_log italic_n end_ARG. Upper bounding the spectral norm [52] yields 𝜶𝜶𝒪(loglogn)subscriptnormsubscript𝜶superscriptsubscript𝜶𝒪𝑛\sum_{\ell}\|\bm{\alpha}_{\ell}\bm{\alpha}_{\ell}^{*}\|\leq\mathcal{O}(\log% \log n)∑ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∥ bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_italic_α start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ ≤ caligraphic_O ( roman_log roman_log italic_n ). Then, for universal constant C𝐶Citalic_C, we have Rm()C2nlogmsubscript𝑅𝑚subscript𝐶2𝑛𝑚R_{m}(\mathcal{F})\leq C_{2}n\log mitalic_R start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( caligraphic_F ) ≤ italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_n roman_log italic_m.

Putting all terms together, we have that with a probability at least 1e12c2m1superscript𝑒12superscript𝑐2𝑚1-e^{-\frac{1}{2}c^{2}m}1 - italic_e start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_c start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT, we obtain

inf𝐖1m=1m|𝐖,𝐚𝐚H|116cClogmc32mc64m.subscriptinfimum𝐖1𝑚superscriptsubscript1𝑚subscript𝐖subscript𝐚superscriptsubscript𝐚𝐻116𝑐𝐶𝑚𝑐32𝑚𝑐64𝑚\displaystyle\inf_{\mathbf{W}}\frac{1}{m}\sum_{\ell=1}^{m}|\langle\mathbf{W},% \mathbf{a}_{\ell}\mathbf{a}_{\ell}^{H}\rangle_{\mathbb{R}}|\geq\frac{1}{16}c-C% \log m-\frac{c}{32\sqrt{m}}\geq\frac{c}{64}m.roman_inf start_POSTSUBSCRIPT bold_W end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_m end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT | ⟨ bold_W , bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ⟩ start_POSTSUBSCRIPT blackboard_R end_POSTSUBSCRIPT | ≥ divide start_ARG 1 end_ARG start_ARG 16 end_ARG italic_c - italic_C roman_log italic_m - divide start_ARG italic_c end_ARG start_ARG 32 square-root start_ARG italic_m end_ARG end_ARG ≥ divide start_ARG italic_c end_ARG start_ARG 64 end_ARG italic_m .

This leads to =1m(|𝐚H𝐱||𝐚H𝐲|)c~m𝐱𝐱𝐲𝐲F2superscriptsubscript1𝑚superscriptsubscript𝐚𝐻𝐱superscriptsubscript𝐚𝐻𝐲~𝑐𝑚superscriptsubscriptnormsuperscript𝐱𝐱superscript𝐲𝐲𝐹2\sum_{\ell=1}^{m}(|\mathbf{a}_{\ell}^{H}\mathbf{x}|-|\mathbf{a}_{\ell}^{H}% \mathbf{y}|)\geq\tilde{c}m\|\mathbf{xx}^{*}-\mathbf{yy}^{*}\|_{F}^{2}∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( | bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_x | - | bold_a start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_y | ) ≥ over~ start_ARG italic_c end_ARG italic_m ∥ bold_xx start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - bold_yy start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, for a constant c~>0~𝑐0\tilde{c}>0over~ start_ARG italic_c end_ARG > 0. This proves that for sufficiently small c~~𝑐\tilde{c}over~ start_ARG italic_c end_ARG, the only trivial ambiguity is the right-octonion factor.

References

  • [1] B. Augereau and P. Carré, “Hypercomplex polynomial wavelet-filter bank transform for color image,” Signal Processing, vol. 136, pp. 16–28, 2017.
  • [2] N. Le Bihan and J. Mars, “Singular value decomposition of quaternion matrices: A new tool for vector-sensor signal processing,” Signal Processing, vol. 84, no. 7, pp. 1177–1199, 2004.
  • [3] W. Liu, “Channel equalization and beamforming for quaternion-valued wireless communication systems,” Journal of the Franklin Institute, vol. 354, no. 18, pp. 8721–8733, 2017.
  • [4] V. Tarokh, H. Jafarkhani, and A. R. Calderbank, “Space-time block codes from orthogonal designs,” IEEE Transactions on Information Theory, vol. 45, no. 5, pp. 1456–1467, 1999.
  • [5] O. Tirkkonen and A. Hottinen, “Square-matrix embeddable space-time block codes for complex signal constellations,” IEEE Transactions on Information Theory, vol. 48, no. 2, pp. 384–395, 2002.
  • [6] A. Buvarp, K. V. Mishra, A. I. Zaghloul, and L. M. Mili, “Quaternion-neural-networks-based decoder for RIS-aided polarization-space modulation,” in IEEE International Symposium on Antennas and Propagation and USNC-URSI Radio Science Meeting, 2023, pp. 1–2.
  • [7] F. Ortolani, D. Comminiello, M. Scarpiniti, and A. Uncini, “Frequency domain quaternion adaptive filters: Algorithms and convergence performance,” Signal Processing, vol. 136, pp. 69–80, 2017.
  • [8] M. Kobayashi, “Uniqueness theorem for quaternionic neural networks,” Signal Processing, vol. 136, pp. 102–106, 2017.
  • [9] ——, “Fixed points of split quaternionic Hopfield neural networks,” Signal Processing, vol. 136, pp. 38–42, 2017.
  • [10] T. A. Ell and S. J. Sangwine, “Hypercomplex Fourier transforms of color images,” IEEE Transactions on Image Processing, vol. 16, no. 1, pp. 22–35, 2006.
  • [11] I. L. Kantor and A. S. Solodovnikov, Hypercomplex numbers: An elementary introduction to algebras.   Springer, 1989.
  • [12] J. Voight, Quaternion algebras.   Springer Nature, 2021.
  • [13] M. Erdoğdu and M. Özdemir, “On complex split quaternion matrices,” Advances in Applied Clifford Algebras, vol. 23, pp. 625–638, 2013.
  • [14] M. Knus, T. Lam, D. Shapiro, and J. Tignol, “Discriminants of involutions on biquaternion algebras,” in K𝐾Kitalic_K-Theory and Algebraic Geometry: Connections with Quadratic Forms and Division Algebras: Connections with Quadratic Forms and Division Algebras, ser. Proceedings of Symposia in Pure Mathematics, B. Jacob and A. Rosenberg, Eds.   American Mathematical Society, 1995, vol. 58 (2), pp. 279–303.
  • [15] S. Okubo, Introduction to octonion and other non-associative algebras in physics.   Cambridge University Press, 1995, no. 2.
  • [16] N. Le Bihan and S. J. Sangwine, “Quaternion principal component analysis of color images,” in IEEE International Conference on Image Processing, vol. 1, 2003, pp. I–809.
  • [17] T. A. Ell, N. Le Bihan, and S. J. Sangwine, Quaternion Fourier transforms for signal and image processing.   John Wiley & Sons, 2014.
  • [18] T. Isokawa, T. Kusakabe, N. Matsui, and F. Peper, “Quaternion neural network and its application,” in International Conference on International Conference on Knowledge-Based and Intelligent Information and Engineering Systems, 2003, pp. 318–324.
  • [19] G. Wang and R. Xue, “Quaternion filtering based on quaternion involutions and its application in signal processing,” IEEE Access, vol. 7, pp. 149 068–149 079, 2019.
  • [20] J. Baez, “The octonions,” Bulletin of the american mathematical society, vol. 39, no. 2, pp. 145–205, 2002.
  • [21] S. Lazendić, H. De Bie, and A. Pižurica, “Octonion sparse representation for color and multispectral image processing,” in European Signal Processing Conference, 2018, pp. 608–612.
  • [22] M. Yamni, H. Karmouni, M. Sayyouri, H. Qjidaa, and J. Flusser, “Novel octonion moments for color stereo image analysis,” Digital Signal Processing, vol. 108, p. 102878, 2021.
  • [23] V. Katkovnik, I. Shevkunov, and K. Egiazarian, “ADMM and spectral proximity operators in hyperspectral broadband phase retrieval for quantitative phase imaging,” Signal Processing, p. 109095, 2023.
  • [24] J. Bacca, S. Pinilla, and H. Arguello, “Super-resolution phase retrieval from designed coded diffraction patterns,” IEEE Transactions on Image Processing, vol. 29, pp. 2598–2609, 2019.
  • [25] S. Pinilla, J. Bacca, and H. Arguello, “Phase retrieval algorithm via nonconvex minimization using a smoothing function,” IEEE Transactions on Signal Processing, vol. 66, no. 17, pp. 4574–4584, 2018.
  • [26] C. Fienup and J. Dainty, “Phase retrieval and image reconstruction for astronomy,” in Image recovery: Theory and application, H. Stark, Ed.   Academic Press, 1987, pp. 231–275.
  • [27] S. Pinilla, K. V. Mishra, B. M. Sadler, and H. Arguello, “BanRaW: Band-limited radar waveform design via phase retrieval,” in IEEE International Conference on Acoustics, Speech and Signal Processing, 2021, pp. 5449–5453.
  • [28] S. Pinilla, K. V. Mishra, I. Shevkunov, M. Soltanalian, V. Katkovnik, and K. Egiazarian, “Unfolding-aided bootstrapped phase retrieval in optical imaging: Explainable ai reveals new imaging frontiers,” IEEE Signal Processing Magazine, vol. 40, no. 2, pp. 46–60, 2023.
  • [29] Y. Chen, C. Cheng, and Q. Sun, “Phase retrieval of complex and vector-valued functions,” Journal of Functional Analysis, vol. 283, no. 7, p. 109593, 2022.
  • [30] J. Chen and M. K. Ng, “Phase retrieval of quaternion signal via Wirtinger flow,” IEEE Transactions on Signal Processing, vol. 71, pp. 2863–2878, 2023.
  • [31] E. J. Candès, X. Li, and M. Soltanolkotabi, “Phase retrieval from coded diffraction patterns,” Applied and Computational Harmonic Analysis, vol. 39, no. 2, pp. 277–299, 2015.
  • [32] Z. Shao, H. Shu, J. Wu, Z. Dong, G. Coatrieux, and J. L. Coatrieux, “Double color image encryption using iterative phase retrieval algorithm in quaternion gyrator domain,” Optics Express, vol. 22, no. 5, pp. 4932–4943, 2014.
  • [33] D. Xu and D. P. Mandic, “The theory of quaternion matrix derivatives,” IEEE Transactions on Signal Processing, vol. 63, no. 6, pp. 1543–1556, 2015.
  • [34] L. Qi, Z. Luo, Q.-W. Wang, and X. Zhang, “Quaternion matrix optimization: Motivation and analysis,” Journal of Optimization Theory and Applications, vol. 193, no. 1-3, pp. 621–648, 2022.
  • [35] L. Rodman, “Hermitian octonion matrices and numerical ranges,” The Electronic Journal of Linear Algebra, vol. 27, pp. 515–533, 2014.
  • [36] V. Koltchinskii and S. Mendelson, “Bounding the smallest singular value of a random matrix without concentration,” International Mathematics Research Notices, vol. 2015, no. 23, pp. 12 991–13 008, 2015.
  • [37] M. Bouchard and J. Khalid, “Calculus for eight-dimensional hypercomplex algebras,” Dec. 2022, working paper or preprint. [Online]. Available: https://hal.science/hal-03558753
  • [38] J. Wu, L. Xu, F. Wu, Y. Kong, L. Senhadji, and H. Shu, “Deep octonion networks,” Neurocomputing, vol. 397, pp. 179–191, 2020.
  • [39] E. J. Candès, X. Li, and M. Soltanolkotabi, “Phase retrieval via Wirtinger flow: Theory and algorithms,” IEEE Transactions on Information Theory, vol. 61, no. 4, pp. 1985–2007, 2015.
  • [40] T. Dray and C. A. Manogue, “The octonionic eigenvalue problem,” Advances in Applied Clifford Algebras, vol. 8, pp. 341–364, 1998.
  • [41] Y. Li, M. Wei, F. Zhang, and J. Zhao, “On the power method for quaternion right eigenvalue problem,” Journal of Computational and Applied Mathematics, vol. 345, pp. 59–69, 2019.
  • [42] S. Sangwine and N. Le Bihan, “Quaternion and octonion toolbox for MATLAB,” 2016. [Online]. Available: https://qtfm.sourceforge.io/
  • [43] R. Chandra, T. Goldstein, and C. Studer, “Phasepack: A phase retrieval library,” in IEEE International conference on Sampling Theory and Applications, 2019, pp. 1–5.
  • [44] F. Yasuma, T. Mitsunaga, D. Iso, and S. Nayar, “Generalized Assorted Pixel Camera: Post-Capture Control of Resolution, Dynamic Range and Spectrum,” epartment of Computer Science, Columbia University, Tech. Rep. CUCS-061-08, Nov 2008.
  • [45] H. Zhang, L. Zhang, and H. Shen, “A super-resolution reconstruction algorithm for hyperspectral images,” Signal Processing, vol. 92, no. 9, pp. 2082–2096, 2012.
  • [46] C. Lanczos, “An iteration method for the solution of the eigenvalue problem of linear differential and integral operators,” Journal of Research of the National Bureau of Standards, vol. 45, no. 4, 1950.
  • [47] A. Hore and D. Ziou, “Image quality metrics: PSNR vs. SSIM,” in IEEE International Conference on Pattern Recognition, 2010, pp. 2366–2369.
  • [48] R. Jacome, K. V. Mishra, B. M. Sadler, and H. Arguello, “An invitation to hypercomplex phase retrieval: Theory and applications,” IEEE Signal Processng Magazine, 2024, in press.
  • [49] A. Zygmund, Trigonometric series.   Cambridge university press, 2002, vol. 1.
  • [50] S. Nadarajah, “Explicit expressions for moments of χ2superscript𝜒2\chi^{2}italic_χ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT order statistics,” Bulletin of the Institute of Mathematics, Academia Sinica (New Series), vol. 3, no. 3, pp. 433–444, 2008.
  • [51] J. A. Tropp, “An introduction to matrix concentration inequalities,” Foundations and Trends® in Machine Learning, vol. 8, no. 1-2, pp. 1–230, 2015.
  • [52] R. Van Handel, “On the spectral norm of Gaussian random matrices,” Transactions of the American Mathematical Society, vol. 369, no. 11, pp. 8161–8178, 2017.