Joint Channel and Data Estimation for Multiuser Extremely Large-Scale MIMO Systems

Kabuto Arai, , Koji Ishibashi, ,
Hiroki Iimori, , Paulo Valente Klaine, and Szabolcs Malomsoky
K. Arai and K. Ishibashi are with the Advanced Wireless and Communication Research Center (AWCC), The University of Electro-Communications, Tokyo 182-8285, Japan (e-mail: [email protected], [email protected])H. Iimori, P. V. Klaine, and S. Malomsoky are with Ericsson Research, Ericsson Japan K.K., (e-mail: {hiroki.iimori, paulo.valente.klaine, szabolcs.malomsoky}@ericsson.com)
Abstract

This paper proposes a joint channel and data estimation (JCDE) algorithm for uplink multiuser extremely large-scale multiple-input-multiple-output (XL-MIMO) systems. The initial channel estimation is formulated as a sparse reconstruction problem based on the angle and distance sparsity under the near-field propagation condition. This problem is solved using non-orthogonal pilots through an efficient low complexity two-stage compressed sensing algorithm. Furthermore, the initial channel estimates are refined by employing a JCDE framework driven by both non-orthogonal pilots and estimated data. The JCDE problem is solved by sequential expectation propagation (EP) algorithms, where the channel and data are alternately updated in an iterative manner. In the channel estimation phase, integrating Bayesian inference with a model-based deterministic approach provides precise estimations to effectively exploit the near-field characteristics in the beam-domain. In the data estimation phase, a linear minimum mean square error (LMMSE)-based filter is designed at each sub-array to address the correlation due to energy leakage in the beam-domain arising from the near-field effects. Numerical simulations reveal that the proposed initial channel estimation and JCDE algorithm outperforms the state-of-the-art approaches in terms of channel estimation, data detection, and computational complexity.

Index Terms:
Extremely large-scale-MIMO (XL-MIMO), near-field, joint channel and data estimation, compressed sensing

I Introduction

To meet the demands for high spectral efficiency in future 6G systems [1], it is essential to further exploit spatial multiplexing and abundant spectral resources at mid/high frequency bands such as centimeter-wave (cmWave), milimeter-wave (mmWave), and sub-terahertz (sub-THz). In light of these requirements, extremely large-scale multiple-input-multiple-output (XL-MIMO) [2, 3, 4] has emerged as a promising technology, enabling sharp directive beamforming and extensive spatial multiplexing. However, the significant increase in antenna aperture leads to an expansion of the Rayleigh distance [5, 6], defined as the border between the near-field and far-field regions. Thus, the near-field effects in XL-MIMO systems may not be negligible in some practically-relevant circumstances, such as in small area coverage with high carrier frequency bands [7].

Unlike the conventional far-field channel, the near-field channel depends not only on gains and angles (e.g., angles of arrivals (AoAs)) but also on distances from signal sources such as user equipments (UEs) and scatterers. Hence, conventional channel estimation methods such as [8, 9], which exploit the beam-domain sparsity under the assumption of planar wavefront, experience significant performance degradation in the near-field due to energy leakage effects in the beam-domain. To tackle this issue, the authors in [7] have proposed a polar-domain simultaneous orthogonal matching pursuit (P-SOMP) algorithm, which leverages the angle and distance sparsity known as polar-domain sparsity arising from the aforementioned near-field peculiar characteristics. In P-SOMP, polar (angle-distance) grids are generated by spatially quantizing the polar-domain to utilize compressed sensing techniques, however, in multiuser systems, P-SOMP requires orthogonal pilots to separate multiple UEs. As such, the proposed approach results in non-negligible overhead as the number of UEs grows, especially in XL-MIMO systems capable of spatially multiplexing many UEs.

Considering these challenges, a near-field channel estimation algorithm, which works even with non-orthogonal pilots, has recently been proposed in [10] in the context of grant-free XL-MIMO systems111Note that while this method was originally developed for jointly active user detection and channel estimation, it is also applicable to sole channel estimation problems without active user detection.. However, due to the non-orthogonality among pilots, inter-user interference still remains, so it is necessary to jointly estimate all UE channel components. As a result, this joint estimation significantly increases computational complexity because it requires UE-wise polar grids, which leads to a large grid size. Therefore, the authors in [10] proposed a 2D-compressive sampling matching pursuit (CoSaMP) algorithm, which is based on the CoSaMP algorithm in the polar-UE 2D domain constructed by UE-wise polar grids. While the 2D-CoSaMP algorithm can mitigate computational complexity, its estimation performance is hindered by overfitting to noisy measurements, which results from the inverse operation on over-sampled estimates. Consequently, subsequent data detection suffers from severe performance deterioration, particularly with high-order modulation.

One of the prospective solutions to obtain accurate channel estimate with non-orthogonal pilots is joint channel and data estimation (JCDE) [11, 12, 13, 14, 15], where not only pilot sequences but also estimated data symbols are utilized as pilot replicas, leveraging their statistical quasi-orthogonality. The JCDE problem can be formulated as a bilinear inference problem (BIP). One of the prominent algorithms based on a Bayesian framework for BIP is bilinear generalized approximate message passing (BiGAMP) [16]. BiGAMP is an extension of GAMP, originally designed for a high-dimensional generalized-linear problem by utilizing loopy belief propagation (BP) with central limit theorem (CLT) and Taylor-series approximations based on large system limit to simplify the BP update. Due to the heavy dependency on the large system assumption of BiGAMP, the convergence performance deteriorates significantly in case the system is too small, the pilots are too short or when there is an improper prior distribution [11]. To address these issues, the authors in [11] have proposed bilinear Gaussian belief propagation (BiGaBP), which relaxes the BP update rules of BiGAMP, based on GaBP [17], without heavily relying on the approximation under a large system limit assumption. This relaxation of the approximation leads to performance improvements while maintaining the same complexity order as BiGAMP.

Bilinear inference algorithms that exploit physical model structures, such as channel sparsity in the beam-domain, have been investigated in [13, 14]. In these papers, the channel sparsity is modeled using a Bernoulli Gaussian (BG) prior distribution because this prior is analytically tractable with a closed-form posterior. However, the sparse structure cannot be exactly expressed by the analytically tractable prior, which leads to modeling errors resulting in performance deterioration. To tackle this issue, the authors of [12] have integrated a model-based deterministic approach [18] into a Bayesian inference framework [11], referred to as AoA-aided BiGaBP. This deterministic approach rectifies the model mismatch caused by the use of the tractable prior.

However, since this method assumes a far-field model, the model correction by the deterministic estimation is insufficient for the near-field region in XL-MIMO systems. Moreover, AoA-aided BiGaBP relies on a maximum ratio combining (MRC)-based detector, which cannot address the correlation caused by energy leakage in the beam-domain due to near-field effects. In addition, the computational complexity of the data denoising process in AoA-aided BiGaBP based on prior information for modulation constellations scales proportionally to not only the modulation order but also the number of antennas. This increase in complexity stems from the fact that BiGaBP suppresses the self-feedback of messages in the algorithmic iteration by generating antenna-wise extrinsic values based on BP rules without the Onsager correction term [11]. Consequently, the number of inputs to the denoiser function, based on modulation constellations, increases proportionally to the number of antennas.

Within the context outlined above, we propose a JCDE algorithm for multiuser XL-MIMO systems with non-orthogonal pilots. Our contributions are summarized as follows.

  • Initial channel estimation for JCDE: A novel initialization mechanism for the multiuser near-field channel estimation problem with pilot contamination due to non-orthogonal pilots is proposed, enabling an accurate initial estimate that is then used in the subsequent JCDE algorithm. The proposed initial channel estimation algorithm consists of two stages to maintain low computational complexity. In the first stage, angle and distance parameters for all UEs are estimated from the polar grids using the simultaneous orthogonal matching pursuit (SOMP) algorithm without pairing between each estimated path and the corresponding UE. Subsequently, the second stage involves the UE-path pairing using 2D-OMP [19] with a reduced number of grids constructed on the angle and distance parameters derived from the first stage. Owing to the above two-stage procedure, our proposed initial channel estimation outperforms the existing state-of-the-art scheme [10], while maintaining comparable computational complexity.

  • JCDE algorithm with model-based estimation: A novel bilinear JCDE inference algorithm is proposed, which integrates a model-based deterministic estimation mechanism with a Bayesian inference framework to address possible performance degradation due to modeling errors in the prior knowledge assumed in the Bayesian inference, as in [11]. In contrast to the state-of-the-art AoA-aided BiGaBP under the assumption of far-field propagation, our algorithm estimates the channels as an aggregation of two distinct quantities: 1) a model-based estimate that captures the near-field channel structure and 2) its modeling error that captures how much different the current estimate is from the true channel. The model-based estimate is alternately updated through a matching pursuit algorithm exploiting the near-field model structures, whereas the residual modeling errors and data symbols are jointly estimated by the expectation propagation (EP) algorithm [20], where an approximate posterior is calculated by minimizing the Kullback-Leibler (KL) divergence between the true posterior and the approximate posterior. To tackle the spatial correlation caused by the energy leakage across neighboring beams in the beam domain while reducing the complexity, we introduce a novel posterior calculation design that enables the implementation of a sub-array-wise linear minimum mean square error (LMMSE)-based filter, allowing parallel computation of the matrix inversion with a much smaller dimension than the array size. This design indeed results in lower computational complexity compared to the state-of-the-art method, owing to the modification of an extrinsic value generation that does not rely on BP rules, as shown in the simulation results.

Notation: The notation [𝐀]i,jsubscriptdelimited-[]𝐀𝑖𝑗[\mathbf{A}]_{i,j}[ bold_A ] start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT indicates the (i,j)𝑖𝑗(i,j)( italic_i , italic_j ) element of the matrix 𝐀𝐀\mathbf{A}bold_A. For a random variable x𝑥xitalic_x and a probabilistic density function p(x)𝑝𝑥p(x)italic_p ( italic_x ), 𝔼p(x)[x]subscript𝔼𝑝𝑥delimited-[]𝑥\mathbb{E}_{p(x)}[x]blackboard_E start_POSTSUBSCRIPT italic_p ( italic_x ) end_POSTSUBSCRIPT [ italic_x ] indicate the expectation of x𝑥xitalic_x over p(x)𝑝𝑥p(x)italic_p ( italic_x ). For any function f(𝐳)𝑓𝐳f(\mathbf{z})italic_f ( bold_z ), 𝐳/zif(𝐳)subscript𝐳subscript𝑧𝑖𝑓𝐳\int_{\mathbf{z}/z_{i}}f(\mathbf{z})∫ start_POSTSUBSCRIPT bold_z / italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( bold_z ) denotes the integral of f(𝐳)𝑓𝐳f(\mathbf{z})italic_f ( bold_z ) with respect to 𝐳𝐳\mathbf{z}bold_z except for zisubscript𝑧𝑖z_{i}italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. The operator tensor-product\otimes denotes the Kronecker product. For the index sets ={1,2,,I}12𝐼\mathcal{I}=\{1,2,\ldots,I\}caligraphic_I = { 1 , 2 , … , italic_I } and 𝒥={1,2,,J}𝒥12𝐽\mathcal{J}=\{1,2,\ldots,J\}caligraphic_J = { 1 , 2 , … , italic_J }, ×𝒥𝒥\mathcal{I}\times\mathcal{J}caligraphic_I × caligraphic_J denotes the cartesian product of \mathcal{I}caligraphic_I and 𝒥𝒥\mathcal{J}caligraphic_J. i𝑖\mathcal{I}\setminus icaligraphic_I ∖ italic_i represent the set {1,,i1,i+1,,I}1𝑖1𝑖1𝐼\{1,\ldots,i-1,i+1,\ldots,I\}{ 1 , … , italic_i - 1 , italic_i + 1 , … , italic_I }.

II System Model

Refer to caption
Figure 1: Near-field channel model.
Refer to caption
Figure 2: The amplitude of the channel vector in the beam-domain, 𝐡usubscript𝐡𝑢\mathbf{h}_{u}bold_h start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT for N=200,L=1,fc=200GHzformulae-sequence𝑁200formulae-sequence𝐿1subscript𝑓c200GHzN=200,L=1,f_{\mathrm{c}}=200\ \mathrm{GHz}italic_N = 200 , italic_L = 1 , italic_f start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT = 200 roman_GHz. One UE is located r𝑟ritalic_r meters away from the BS (r=𝑟r=\inftyitalic_r = ∞ corresponds to the far-field case).

We consider an uplink XL-MIMO system, where a base station (BS) has a uniform linear array (ULA) with N𝑁Nitalic_N-antennas, serving U𝑈Uitalic_U single antenna UEs. The ULA is positioned along the y𝑦yitalic_y-axis, where y(n)=(n1)d(N1)d2,n=1,2,,Nformulae-sequencesuperscript𝑦𝑛𝑛1𝑑𝑁1𝑑2𝑛12𝑁y^{(n)}=(n-1)d-\frac{(N-1)d}{2},\ n=1,2,\ldots,Nitalic_y start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT = ( italic_n - 1 ) italic_d - divide start_ARG ( italic_N - 1 ) italic_d end_ARG start_ARG 2 end_ARG , italic_n = 1 , 2 , … , italic_N is the n𝑛nitalic_n-th antenna coordinate, and d=λ/2𝑑𝜆2d=\lambda/2italic_d = italic_λ / 2 is antenna spacing with wavelength λ𝜆\lambdaitalic_λ, as shown in Fig.1.

II-A Channel Model

The near-field channel in the spatial domain between a BS and the u𝑢uitalic_u-th UE is modeled as

𝐡u𝒮=l=1Lu𝐚(θu,l,ru,l)zu,l=𝐀(𝜽u,𝐫u)𝐳u,superscriptsubscript𝐡𝑢𝒮superscriptsubscript𝑙1subscript𝐿𝑢𝐚subscript𝜃𝑢𝑙subscript𝑟𝑢𝑙subscript𝑧𝑢𝑙𝐀subscript𝜽𝑢subscript𝐫𝑢subscript𝐳𝑢\displaystyle\mathbf{h}_{u}^{\mathcal{S}}=\sum_{l=1}^{L_{u}}\mathbf{a}(\theta_% {u,l},r_{u,l})z_{u,l}=\mathbf{A}(\bm{\theta}_{u},\mathbf{r}_{u})\mathbf{z}_{u},bold_h start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT bold_a ( italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ) italic_z start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT = bold_A ( bold_italic_θ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , bold_r start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) bold_z start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , (1)

where θu,lsubscript𝜃𝑢𝑙\theta_{u,l}\in\mathbb{R}italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ∈ blackboard_R, and zu,lsubscript𝑧𝑢𝑙z_{u,l}\in\mathbb{C}italic_z start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ∈ blackboard_C denote the AoA and path gain of the l𝑙litalic_l-th path and the u𝑢uitalic_u-th UE, respectively [21].

Without loss of generality, l=1𝑙1l=1italic_l = 1 represents the line-of-sight (LoS) component and l{2,L^u}𝑙2subscript^𝐿𝑢l\in\{2,\ldots\hat{L}_{u}\}italic_l ∈ { 2 , … over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT } represents the non-line-of-sight (NLoS) components. Let Lu=1ULu𝐿superscriptsubscript𝑢1𝑈subscript𝐿𝑢L\triangleq\sum_{u=1}^{U}L_{u}italic_L ≜ ∑ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_U end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT denote the total number of path including U𝑈Uitalic_U-UEs. Accordingly, ru,1subscript𝑟𝑢1r_{u,1}\in\mathbb{R}italic_r start_POSTSUBSCRIPT italic_u , 1 end_POSTSUBSCRIPT ∈ blackboard_R denotes the distance between the BS and the u𝑢uitalic_u-th UE, and ru,l,l1formulae-sequencesubscript𝑟𝑢𝑙𝑙1r_{u,l}\in\mathbb{R},\ l\neq 1italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ∈ blackboard_R , italic_l ≠ 1 is the distance between the BS and the l𝑙litalic_l-th scatterer around the u𝑢uitalic_u-th UE. Besides, 𝐚(θu,l,ru,l)N×1𝐚subscript𝜃𝑢𝑙subscript𝑟𝑢𝑙superscript𝑁1\mathbf{a}(\theta_{u,l},r_{u,l})\in\mathbb{C}^{N\times 1}bold_a ( italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × 1 end_POSTSUPERSCRIPT denotes the array response vector defined as

[𝐚(θu,l,ru,l)]n=exp[j2πλ(ru,l(n)ru,l)],subscriptdelimited-[]𝐚subscript𝜃𝑢𝑙subscript𝑟𝑢𝑙𝑛𝑗2𝜋𝜆superscriptsubscript𝑟𝑢𝑙𝑛subscript𝑟𝑢𝑙\displaystyle[\mathbf{a}(\theta_{u,l},r_{u,l})]_{n}=\exp\left[-j\frac{2\pi}{% \lambda}\left(r_{u,l}^{(n)}-r_{u,l}\right)\right],[ bold_a ( italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = roman_exp [ - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT - italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ) ] , (2)

where ru,l(n)=ru,l2+y(n)22ru,ly(n)sinθu,lsuperscriptsubscript𝑟𝑢𝑙𝑛superscriptsubscript𝑟𝑢𝑙2superscript𝑦𝑛22subscript𝑟𝑢𝑙superscript𝑦𝑛subscript𝜃𝑢𝑙r_{u,l}^{(n)}=\sqrt{r_{u,l}^{2}+y^{(n)2}-2r_{u,l}y^{(n)}\sin\theta_{u,l}}italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT = square-root start_ARG italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_y start_POSTSUPERSCRIPT ( italic_n ) 2 end_POSTSUPERSCRIPT - 2 italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT roman_sin italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT end_ARG is the distance between the n𝑛nitalic_n-th antenna and the u𝑢uitalic_u-th UE or scatterers.

For the u𝑢uitalic_u-th UE, let us define the collections of AoAs, distances, and path gains as 𝜽u{θu,l}l=1Lusubscript𝜽𝑢superscriptsubscriptsubscript𝜃𝑢𝑙𝑙1subscript𝐿𝑢\bm{\theta}_{u}\triangleq\{\theta_{u,l}\}_{l=1}^{L_{u}}bold_italic_θ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ≜ { italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, 𝐫u{ru,l}l=1Lusubscript𝐫𝑢superscriptsubscriptsubscript𝑟𝑢𝑙𝑙1subscript𝐿𝑢\mathbf{r}_{u}\triangleq\{r_{u,l}\}_{l=1}^{L_{u}}bold_r start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ≜ { italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, and 𝐳u[zu,1,,zu,Lu]TLu×1subscript𝐳𝑢superscriptmatrixsubscript𝑧𝑢1subscript𝑧𝑢subscript𝐿𝑢Tsuperscriptsubscript𝐿𝑢1\mathbf{z}_{u}\triangleq\begin{bmatrix}z_{u,1},\ldots,z_{u,L_{u}}\end{bmatrix}% ^{\mathrm{T}}\in\mathbb{C}^{L_{u}\times 1}bold_z start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ≜ [ start_ARG start_ROW start_CELL italic_z start_POSTSUBSCRIPT italic_u , 1 end_POSTSUBSCRIPT , … , italic_z start_POSTSUBSCRIPT italic_u , italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT, respectively, and the corresponding array response matrix is defined as 𝐀(𝜽u,𝐫u)[𝐚(θu,1,ru,1),,𝐚(θu,Lu,ru,Lu)]N×Lu𝐀subscript𝜽𝑢subscript𝐫𝑢matrix𝐚subscript𝜃𝑢1subscript𝑟𝑢1𝐚subscript𝜃𝑢subscript𝐿𝑢subscript𝑟𝑢subscript𝐿𝑢superscript𝑁subscript𝐿𝑢\mathbf{A}(\bm{\theta}_{u},\mathbf{r}_{u})\triangleq\begin{bmatrix}\mathbf{a}(% \theta_{u,1},r_{u,1}),\ldots,\mathbf{a}(\theta_{u,L_{u}},r_{u,L_{u}})\end{% bmatrix}\in\mathbb{C}^{N\times L_{u}}bold_A ( bold_italic_θ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , bold_r start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) ≜ [ start_ARG start_ROW start_CELL bold_a ( italic_θ start_POSTSUBSCRIPT italic_u , 1 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT italic_u , 1 end_POSTSUBSCRIPT ) , … , bold_a ( italic_θ start_POSTSUBSCRIPT italic_u , italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT italic_u , italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL end_ROW end_ARG ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Then, the channel matrix 𝐇𝒮[𝐡1𝒮,,𝐡U𝒮]N×Usuperscript𝐇𝒮matrixsuperscriptsubscript𝐡1𝒮superscriptsubscript𝐡𝑈𝒮superscript𝑁𝑈\mathbf{H}^{\mathcal{S}}\triangleq\begin{bmatrix}\mathbf{h}_{1}^{\mathcal{S}},% \ldots,\mathbf{h}_{U}^{\mathcal{S}}\end{bmatrix}\in\mathbb{C}^{N\times U}bold_H start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT ≜ [ start_ARG start_ROW start_CELL bold_h start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , … , bold_h start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_U end_POSTSUPERSCRIPT is written as

𝐇𝒮=𝐀(𝜽,𝐫)𝐙,superscript𝐇𝒮𝐀𝜽𝐫𝐙\displaystyle\mathbf{H}^{\mathcal{S}}=\mathbf{A}(\bm{\theta},\mathbf{r})% \mathbf{Z},bold_H start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT = bold_A ( bold_italic_θ , bold_r ) bold_Z , (3)

where 𝐀(𝜽,𝐫)[𝐀(𝜽1,𝐫1),,𝐀(𝜽U,𝐫U)]N×L𝐀𝜽𝐫matrix𝐀subscript𝜽1subscript𝐫1𝐀subscript𝜽𝑈subscript𝐫𝑈superscript𝑁𝐿\mathbf{A}(\bm{\theta},\mathbf{r})\triangleq\begin{bmatrix}\mathbf{A}(\bm{% \theta}_{1},\mathbf{r}_{1}),\ldots,\mathbf{A}(\bm{\theta}_{U},\mathbf{r}_{U})% \end{bmatrix}\in\mathbb{C}^{N\times L}bold_A ( bold_italic_θ , bold_r ) ≜ [ start_ARG start_ROW start_CELL bold_A ( bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , bold_A ( bold_italic_θ start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT , bold_r start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT ) end_CELL end_ROW end_ARG ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_L end_POSTSUPERSCRIPT is the array response matrix consisting of U𝑈Uitalic_U-UEs with AoAs, distances and path gains defined as 𝜽{𝜽u}u=1U𝜽superscriptsubscriptsubscript𝜽𝑢𝑢1𝑈\bm{\theta}\triangleq\{\bm{\theta}_{u}\}_{u=1}^{U}bold_italic_θ ≜ { bold_italic_θ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_U end_POSTSUPERSCRIPT, 𝐫{𝐫u}u=1U𝐫superscriptsubscriptsubscript𝐫𝑢𝑢1𝑈\mathbf{r}\triangleq\{\mathbf{r}_{u}\}_{u=1}^{U}bold_r ≜ { bold_r start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_U end_POSTSUPERSCRIPT, and 𝐙blkdiag(𝐳1,𝐳2,,𝐳U)L×U𝐙blkdiagmatrixsubscript𝐳1subscript𝐳2subscript𝐳𝑈superscript𝐿𝑈\mathbf{Z}\triangleq\mathrm{blkdiag}\begin{pmatrix}\mathbf{z}_{1},\mathbf{z}_{% 2},\ldots,\mathbf{z}_{U}\end{pmatrix}\in\mathbb{C}^{L\times U}bold_Z ≜ roman_blkdiag ( start_ARG start_ROW start_CELL bold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , bold_z start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_L × italic_U end_POSTSUPERSCRIPT, respectively.

The array response vector in the far-field region, i.e., when ru,lsubscript𝑟𝑢𝑙r_{u,l}\rightarrow\inftyitalic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT → ∞ is expressed as [𝐚(θu,l,)]n=exp[j2πy(n)λsinθu,l]subscriptdelimited-[]𝐚subscript𝜃𝑢𝑙𝑛𝑗2𝜋superscript𝑦𝑛𝜆subscript𝜃𝑢𝑙[\mathbf{a}(\theta_{u,l},\infty)]_{n}=\exp\left[j\frac{2\pi y^{(n)}}{\lambda}% \sin\theta_{u,l}\right][ bold_a ( italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT , ∞ ) ] start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = roman_exp [ italic_j divide start_ARG 2 italic_π italic_y start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT end_ARG start_ARG italic_λ end_ARG roman_sin italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ] from (2). As the far-field array response depends only on the angle, the far-field channel 𝐡u𝒮superscriptsubscript𝐡𝑢𝒮\mathbf{h}_{u}^{\mathcal{S}}bold_h start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT can be converted into a sparse beam-domain channel 𝐡u=𝐃N𝐡u𝒮subscript𝐡𝑢subscript𝐃𝑁superscriptsubscript𝐡𝑢𝒮\mathbf{h}_{u}=\mathbf{D}_{N}\mathbf{h}_{u}^{\mathcal{S}}bold_h start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = bold_D start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT bold_h start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT with the discrete Fourier transform (DFT) matrix 𝐃NN×Nsubscript𝐃𝑁superscript𝑁𝑁\mathbf{D}_{N}\in\mathbb{C}^{N\times N}bold_D start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT. In contrast, as the far-field approximation does not hold in the near-field region, the beam-domain near-field channel 𝐡usubscript𝐡𝑢\mathbf{h}_{u}bold_h start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT exhibits not a simple sparse structure but rather a cluster sparse structure, which is caused by energy leakage due to a model mismatch between the DFT matrix 𝐃Nsubscript𝐃𝑁\mathbf{D}_{N}bold_D start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT and the near-field array response 𝐚(θu,l,ru,l)𝐚subscript𝜃𝑢𝑙subscript𝑟𝑢𝑙\mathbf{a}(\theta_{u,l},r_{u,l})bold_a ( italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ). To illustrate the energy leakage effects, Fig. 2 depicts the amplitude of the beam-domain channel vector in the near-field and far-field regions. It can be seen that the far-field channel possesses a distinct sparse structure with a peaky spike. On the other hand, the near-field channel exhibits a clustered sparsity with flatter peaks due to energy leakage. Hence, conventional channel estimation methods exploiting the beam-domain sparsity [8, 9, 18] encounter significant performance degradation in the near-field.

II-B Received Signal Model

To estimate the near-field channel, the u𝑢uitalic_u-th UE transmits a non-orthogonal pilot sequence 𝐱p,uKp×1subscript𝐱p𝑢superscriptsubscript𝐾p1\mathbf{x}_{\mathrm{p},u}\in\mathbb{C}^{K_{\mathrm{p}}\times 1}bold_x start_POSTSUBSCRIPT roman_p , italic_u end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT and data symbol 𝐱d,uKd×1subscript𝐱d𝑢superscriptsubscript𝐾d1\mathbf{x}_{\mathrm{d},u}\in\mathbb{C}^{K_{\mathrm{d}}\times 1}bold_x start_POSTSUBSCRIPT roman_d , italic_u end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT subsequently, where Kp(<U)annotatedsubscript𝐾pabsent𝑈K_{\mathrm{p}}(<U)italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ( < italic_U ) and Kdsubscript𝐾dK_{\mathrm{d}}italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT are the length of pilots and data symbols. Each entry of 𝐱d,usubscript𝐱d𝑢\mathbf{x}_{\mathrm{d},u}bold_x start_POSTSUBSCRIPT roman_d , italic_u end_POSTSUBSCRIPT is randomly generated from a Q𝑄Qitalic_Q-quadrature amplitude modulation (QAM) constellation 𝒳{𝒳1,,𝒳Q}𝒳subscript𝒳1subscript𝒳𝑄\mathcal{X}\triangleq\{\mathcal{X}_{1},\ldots,\mathcal{X}_{Q}\}caligraphic_X ≜ { caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , caligraphic_X start_POSTSUBSCRIPT italic_Q end_POSTSUBSCRIPT } with average symbol energy Essubscript𝐸sE_{\mathrm{s}}italic_E start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT. Then, the received pilot 𝐘p𝒮N×Kpsuperscriptsubscript𝐘p𝒮superscript𝑁subscript𝐾p\mathbf{Y}_{\mathrm{p}}^{\mathcal{S}}\in\mathbb{C}^{N\times K_{\mathrm{p}}}bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT end_POSTSUPERSCRIPT and data 𝐘d𝒮N×Kdsuperscriptsubscript𝐘d𝒮superscript𝑁subscript𝐾d\mathbf{Y}_{\mathrm{d}}^{\mathcal{S}}\in\mathbb{C}^{N\times K_{\mathrm{d}}}bold_Y start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT end_POSTSUPERSCRIPT in the spatial domain are given by

𝐘p𝒮=𝐇𝒮𝐗p+𝐍p𝒮,𝐘d𝒮=𝐇𝒮𝐗d+𝐍d𝒮,formulae-sequencesuperscriptsubscript𝐘p𝒮superscript𝐇𝒮subscript𝐗psuperscriptsubscript𝐍p𝒮superscriptsubscript𝐘d𝒮superscript𝐇𝒮subscript𝐗dsuperscriptsubscript𝐍d𝒮\displaystyle\mathbf{Y}_{\mathrm{p}}^{\mathcal{S}}=\mathbf{H}^{\mathcal{S}}% \mathbf{X}_{\mathrm{p}}+\mathbf{N}_{\mathrm{p}}^{\mathcal{S}},\ \ \mathbf{Y}_{% \mathrm{d}}^{\mathcal{S}}=\mathbf{H}^{\mathcal{S}}\mathbf{X}_{\mathrm{d}}+% \mathbf{N}_{\mathrm{d}}^{\mathcal{S}},bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT = bold_H start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT + bold_N start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , bold_Y start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT = bold_H start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT bold_X start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT + bold_N start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , (4)

where 𝐗p[𝐱p,1,,𝐱p,U]TU×Kpsubscript𝐗psuperscriptmatrixsubscript𝐱p1subscript𝐱p𝑈Tsuperscript𝑈subscript𝐾p\mathbf{X}_{\mathrm{p}}\triangleq\begin{bmatrix}\mathbf{x}_{\mathrm{p},1},% \ldots,\mathbf{x}_{\mathrm{p},U}\end{bmatrix}^{\mathrm{T}}\in\mathbb{C}^{U% \times K_{\mathrm{p}}}bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ≜ [ start_ARG start_ROW start_CELL bold_x start_POSTSUBSCRIPT roman_p , 1 end_POSTSUBSCRIPT , … , bold_x start_POSTSUBSCRIPT roman_p , italic_U end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_U × italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT end_POSTSUPERSCRIPT and 𝐗d[𝐱d,1,,𝐱d,U]TU×Kdsubscript𝐗dsuperscriptmatrixsubscript𝐱d1subscript𝐱d𝑈Tsuperscript𝑈subscript𝐾d\mathbf{X}_{\mathrm{d}}\triangleq\begin{bmatrix}\mathbf{x}_{\mathrm{d},1},% \ldots,\mathbf{x}_{\mathrm{d},U}\end{bmatrix}^{\mathrm{T}}\in\mathbb{C}^{U% \times K_{\mathrm{d}}}bold_X start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ≜ [ start_ARG start_ROW start_CELL bold_x start_POSTSUBSCRIPT roman_d , 1 end_POSTSUBSCRIPT , … , bold_x start_POSTSUBSCRIPT roman_d , italic_U end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_U × italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT end_POSTSUPERSCRIPT are the transmitted pilot matrix and data matrix. 𝐍pN×Kpsubscript𝐍psuperscript𝑁subscript𝐾p\mathbf{N}_{\mathrm{p}}\in\mathbb{C}^{N\times K_{\mathrm{p}}}bold_N start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT end_POSTSUPERSCRIPT and 𝐍dN×Kdsubscript𝐍dsuperscript𝑁subscript𝐾d\mathbf{N}_{\mathrm{d}}\in\mathbb{C}^{N\times K_{\mathrm{d}}}bold_N start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT end_POSTSUPERSCRIPT are the additive white Gaussian noise (AWGN) matrices, whose entries are generated from 𝒞𝒩(0,σ2)𝒞𝒩0superscript𝜎2\mathcal{CN}(0,\sigma^{2})caligraphic_C caligraphic_N ( 0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) with noise variance σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. By stacking the received pilot 𝐘p𝒮superscriptsubscript𝐘p𝒮\mathbf{Y}_{\mathrm{p}}^{\mathcal{S}}bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT and data 𝐘d𝒮superscriptsubscript𝐘d𝒮\mathbf{Y}_{\mathrm{d}}^{\mathcal{S}}bold_Y start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT, the effective received signal becomes 𝐘𝒮[𝐘p𝒮,𝐘d𝒮]N×Ksuperscript𝐘𝒮matrixsuperscriptsubscript𝐘p𝒮superscriptsubscript𝐘d𝒮superscript𝑁𝐾\mathbf{Y}^{\mathcal{S}}\triangleq\begin{bmatrix}\mathbf{Y}_{\mathrm{p}}^{% \mathcal{S}},\mathbf{Y}_{\mathrm{d}}^{\mathcal{S}}\end{bmatrix}\in\mathbb{C}^{% N\times K}bold_Y start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT ≜ [ start_ARG start_ROW start_CELL bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , bold_Y start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_K end_POSTSUPERSCRIPT, and the sum length of pilots and data KKp+Kd𝐾subscript𝐾psubscript𝐾dK\triangleq K_{\mathrm{p}}+K_{\mathrm{d}}italic_K ≜ italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT, is formulated as

𝐘𝒮=𝐇𝒮𝐗+𝐍𝒮,superscript𝐘𝒮superscript𝐇𝒮𝐗superscript𝐍𝒮\displaystyle\mathbf{Y}^{\mathcal{S}}=\mathbf{H}^{\mathcal{S}}\mathbf{X}+% \mathbf{N}^{\mathcal{S}},bold_Y start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT = bold_H start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT bold_X + bold_N start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , (5)

with 𝐍𝒮[𝐍p𝒮,𝐍d𝒮]N×Ksuperscript𝐍𝒮matrixsuperscriptsubscript𝐍p𝒮superscriptsubscript𝐍d𝒮superscript𝑁𝐾\mathbf{N}^{\mathcal{S}}\triangleq\begin{bmatrix}\mathbf{N}_{\mathrm{p}}^{% \mathcal{S}},\mathbf{N}_{\mathrm{d}}^{\mathcal{S}}\end{bmatrix}\in\mathbb{C}^{% N\times K}bold_N start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT ≜ [ start_ARG start_ROW start_CELL bold_N start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , bold_N start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_K end_POSTSUPERSCRIPT and 𝐗[𝐗p,𝐗d]U×K𝐗matrixsubscript𝐗psubscript𝐗dsuperscript𝑈𝐾\mathbf{X}\triangleq\begin{bmatrix}\mathbf{X}_{\mathrm{p}},\mathbf{X}_{\mathrm% {d}}\end{bmatrix}\in\mathbb{C}^{U\times K}bold_X ≜ [ start_ARG start_ROW start_CELL bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT , bold_X start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_U × italic_K end_POSTSUPERSCRIPT. For the sake of future convenience, let us define the pilot and data index set as 𝒦𝒦p𝒦d𝒦subscript𝒦psubscript𝒦d\mathcal{K}\triangleq\mathcal{K}_{\mathrm{p}}\cup\mathcal{K}_{\mathrm{d}}caligraphic_K ≜ caligraphic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ∪ caligraphic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT with 𝒦p{1,2,,Kp}subscript𝒦p12subscript𝐾p\mathcal{K}_{\mathrm{p}}\triangleq\{1,2,\ldots,K_{\mathrm{p}}\}caligraphic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ≜ { 1 , 2 , … , italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT } and 𝒦d{Kp+1,,Kp+Kd}subscript𝒦dsubscript𝐾p1subscript𝐾psubscript𝐾d\mathcal{K}_{\mathrm{d}}\triangleq\{K_{\mathrm{p}}+1,\ldots,K_{\mathrm{p}}+K_{% \mathrm{d}}\}caligraphic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ≜ { italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT + 1 , … , italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT }.

From (3) and (4), the received pilot 𝐘p𝒮superscriptsubscript𝐘p𝒮\mathbf{Y}_{\mathrm{p}}^{\mathcal{S}}bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT is rewritten as

𝐘p𝒮subscriptsuperscript𝐘𝒮p\displaystyle\mathbf{Y}^{\mathcal{S}}_{\mathrm{p}}bold_Y start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT =𝐀(𝜽,𝐫)𝐕+𝐍p𝒮,absent𝐀𝜽𝐫𝐕superscriptsubscript𝐍p𝒮\displaystyle=\mathbf{A}(\bm{\theta},\mathbf{r})\mathbf{V}+\mathbf{N}_{\mathrm% {p}}^{\mathcal{S}},= bold_A ( bold_italic_θ , bold_r ) bold_V + bold_N start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , (6)

where 𝐕𝐙𝐗=[𝐱1𝐳1T,𝐱2𝐳2T,,𝐱U𝐳UT]TL×Kp𝐕𝐙𝐗superscriptmatrixsubscript𝐱1superscriptsubscript𝐳1Tsubscript𝐱2superscriptsubscript𝐳2Tsubscript𝐱𝑈superscriptsubscript𝐳𝑈TTsuperscript𝐿subscript𝐾p\mathbf{V}\triangleq\mathbf{Z}\mathbf{X}=\begin{bmatrix}\mathbf{x}_{1}\mathbf{% z}_{1}^{\mathrm{T}},\mathbf{x}_{2}\mathbf{z}_{2}^{\mathrm{T}},\ldots,\mathbf{x% }_{U}\mathbf{z}_{U}^{\mathrm{T}}\end{bmatrix}^{\mathrm{T}}\in\mathbb{C}^{L% \times K_{\mathrm{p}}}bold_V ≜ bold_ZX = [ start_ARG start_ROW start_CELL bold_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT , bold_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT bold_z start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT , … , bold_x start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT bold_z start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_L × italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the matrix composed of path gains and pilots.

III Overview of the Proposed Algorithm

This section describes the overview of the proposed algorithm. The overall procedures of the proposed algorithm are illustrated in Fig. 3. As shown in the figure, the proposed algorithm mainly consists of two parts: the initial channel estimation part and subsequent JCDE part. The initial channel estimation part yields an accurate near-field channel estimate to support the convergence of the subsequent JCDE algorithm, and is composed of two stages to reduce the computational complexity. In the first stage, the angle and distance candidates from large-size polar-grids are estimated by utilizing the SOMP algorithm. In the second stage, the pairing between the path candidates obtained in the first stage and corresponding UEs is performed via the 2D-OMP algorithm by using UE specific pilot sequences. The first and second stages for initial channel estimation are described in Section IV-A and Section IV-B, respectively.

In the subsequent JCDE process, the channel and data are jointly estimated via the EP algorithm with a deterministic model-based estimation approach using the initial channel estimate. To exploit the near-field model structures, the beam-domain channel matrix 𝐇N×U𝐇superscript𝑁𝑈\mathbf{H}\in\mathbb{C}^{N\times U}bold_H ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_U end_POSTSUPERSCRIPT is decomposed into a model-based estimate 𝐒^N×U^𝐒superscript𝑁𝑈\hat{\mathbf{S}}\in\mathbb{C}^{N\times U}over^ start_ARG bold_S end_ARG ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_U end_POSTSUPERSCRIPT and residual channel error 𝐄N×U𝐄superscript𝑁𝑈\mathbf{E}\in\mathbb{C}^{N\times U}bold_E ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_U end_POSTSUPERSCRIPT. 𝐄𝐄\mathbf{E}bold_E and 𝐗𝐗\mathbf{X}bold_X are jointly estimated by the EP algorithm, where the approximate joint posterior for 𝐄𝐄\mathbf{E}bold_E and 𝐗𝐗\mathbf{X}bold_X is calculated as described in Section V-C and V-D. The model-based estimate 𝐒^^𝐒\hat{\mathbf{S}}over^ start_ARG bold_S end_ARG is determined by the initial channel estimate and adaptively updated in the algorithm iterations to further improve estimation performance as described in Section V-F.

Refer to caption
Figure 3: Overflow of the proposed algorithm.

IV Proposed Initial Channel Estimation

IV-A Angle and Distance Estimation

To leverage the near-field channel sparsity, the virtual channel representation in the polar-domain [8] is utilized with polar-grids. The polar-grids are designed by spatially quantizing the angle and distance domain into GθGrsubscript𝐺𝜃subscript𝐺𝑟G_{\theta}G_{r}italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT grid points as 𝜽~{θ~gθ|gθ{1,,Gθ}}~𝜽conditional-setsubscript~𝜃subscript𝑔𝜃subscript𝑔𝜃1subscript𝐺𝜃\tilde{\bm{\theta}}\triangleq\{\tilde{\theta}_{g_{\theta}}|g_{\theta}\in\{1,% \ldots,G_{\theta}\}\}over~ start_ARG bold_italic_θ end_ARG ≜ { over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT | italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ∈ { 1 , … , italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT } } and 𝐫~{r~gr,gθ|gr{1,,Gr},gθ{1,,Gθ}}~𝐫conditional-setsubscript~𝑟subscript𝑔𝑟subscript𝑔𝜃formulae-sequencesubscript𝑔𝑟1subscript𝐺𝑟subscript𝑔𝜃1subscript𝐺𝜃\tilde{\mathbf{r}}\triangleq\left\{\tilde{r}_{g_{r},g_{\theta}}|g_{r}\in\{1,% \ldots,G_{r}\},\ g_{\theta}\in\{1,\ldots,G_{\theta}\}\right\}over~ start_ARG bold_r end_ARG ≜ { over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT | italic_g start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ∈ { 1 , … , italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT } , italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ∈ { 1 , … , italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT } } with θ~gθ[π,π]subscript~𝜃subscript𝑔𝜃𝜋𝜋\tilde{\theta}_{g_{\theta}}\in[-\pi,\pi]over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ [ - italic_π , italic_π ] and r~gr,gθ[0,]subscript~𝑟subscript𝑔𝑟subscript𝑔𝜃0\tilde{r}_{g_{r},g_{\theta}}\in[0,\infty]over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ [ 0 , ∞ ]. Using the polar-grids 𝜽~~𝜽\tilde{\bm{\theta}}over~ start_ARG bold_italic_θ end_ARG and 𝒓~~𝒓\tilde{\bm{r}}over~ start_ARG bold_italic_r end_ARG, the polar-domain dictionary (i.e., virtual array response matrix) is designed as

𝐀~(\displaystyle\tilde{\mathbf{A}}(over~ start_ARG bold_A end_ARG ( 𝜽~,𝐫~)=[𝐚(θ~1,r~1,1),𝐚(θ~1,r~Gr,1),\displaystyle\tilde{\bm{\theta}},\tilde{\mathbf{r}})=\left[\mathbf{a}(\tilde{% \theta}_{1},\tilde{r}_{1,1}),\ldots\mathbf{a}(\tilde{\theta}_{1},\tilde{r}_{G_% {r},1}),\ldots\right.over~ start_ARG bold_italic_θ end_ARG , over~ start_ARG bold_r end_ARG ) = [ bold_a ( over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT ) , … bold_a ( over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , 1 end_POSTSUBSCRIPT ) , …
,𝐚(θ~Gθ,r~1,Gθ),,𝐚(θ~Gθ,r~Gr,Gθ)]N×GθGr.\displaystyle\!\!\left.\ldots,\mathbf{a}(\tilde{\theta}_{G_{\theta}},\tilde{r}% _{1,G_{\theta}}),\ldots,\mathbf{a}(\tilde{\theta}_{G_{\theta}},\tilde{r}_{G_{r% },G_{\theta}})\right]\in\mathbb{C}^{N\times G_{\theta}G_{r}}.… , bold_a ( over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT , over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT 1 , italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , … , bold_a ( over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT , over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUPERSCRIPT . (7)

From (6) and (IV-A), the received pilot signal 𝐘p𝒮superscriptsubscript𝐘p𝒮\mathbf{Y}_{\mathrm{p}}^{\mathcal{S}}bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT is given by

𝐘p𝒮superscriptsubscript𝐘p𝒮\displaystyle\mathbf{Y}_{\mathrm{p}}^{\mathcal{S}}bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT 𝐀~(𝜽~,𝐫~)𝐕~+𝐍p𝒮,similar-to-or-equalsabsent~𝐀~𝜽~𝐫~𝐕subscriptsuperscript𝐍𝒮p\displaystyle\simeq\tilde{\mathbf{A}}(\tilde{\bm{\theta}},\tilde{\mathbf{r}})% \tilde{\mathbf{V}}+\mathbf{N}^{\mathcal{S}}_{\mathrm{p}},≃ over~ start_ARG bold_A end_ARG ( over~ start_ARG bold_italic_θ end_ARG , over~ start_ARG bold_r end_ARG ) over~ start_ARG bold_V end_ARG + bold_N start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT , (8)

where 𝐕~GθGr×Kp~𝐕superscriptsubscript𝐺𝜃subscript𝐺𝑟subscript𝐾p\tilde{\mathbf{V}}\in\mathbb{C}^{G_{\theta}G_{r}\times K_{\mathrm{p}}}over~ start_ARG bold_V end_ARG ∈ blackboard_C start_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT × italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the row sparse matrix such that the number of nonzero rows is only L𝐿Litalic_L and other GθGrLsubscript𝐺𝜃subscript𝐺𝑟𝐿G_{\theta}G_{r}-Litalic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT - italic_L rows are zero since the channel is composed of a total of L𝐿Litalic_L paths defined as in (1), with a sufficiently large number of grids, i.e., GθGrLmuch-greater-thansubscript𝐺𝜃subscript𝐺𝑟𝐿G_{\theta}G_{r}\gg Litalic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ≫ italic_L. Equation (8) exactly holds only if there is no quantization errors in polar grids. In actual environments, however, it approximately holds due to the presence of quantization errors. Therefore, to compensate the quantization errors, we overestimate the number of paths L^>L^𝐿𝐿\hat{L}>Lover^ start_ARG italic_L end_ARG > italic_L based on the propagation environment in the considered carrier frequency [7]. To estimate L^^𝐿\hat{L}over^ start_ARG italic_L end_ARG path candidates from GθGrsubscript𝐺𝜃subscript𝐺𝑟G_{\theta}G_{r}italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT grids, the sparse reconstruction problem for 𝐕~~𝐕\tilde{\mathbf{V}}over~ start_ARG bold_V end_ARG is formulated as

minimize𝐕~𝐘p𝒮𝐀~(𝜽~,𝐫~)𝐕~F2~𝐕minimizesuperscriptsubscriptnormsuperscriptsubscript𝐘p𝒮~𝐀~𝜽~𝐫~𝐕F2\displaystyle\underset{\tilde{\mathbf{V}}}{\text{minimize}}\ \ \left\|\mathbf{% Y}_{\mathrm{p}}^{\mathcal{S}}-\tilde{\mathbf{A}}(\tilde{\bm{\theta}},\tilde{% \mathbf{r}})\tilde{\mathbf{V}}\right\|_{\mathrm{F}}^{2}start_UNDERACCENT over~ start_ARG bold_V end_ARG end_UNDERACCENT start_ARG minimize end_ARG ∥ bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT - over~ start_ARG bold_A end_ARG ( over~ start_ARG bold_italic_θ end_ARG , over~ start_ARG bold_r end_ARG ) over~ start_ARG bold_V end_ARG ∥ start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
subject to𝐕~2,0=L^,subject tosubscriptnorm~𝐕20^𝐿\displaystyle\text{subject to}\ \ \|\tilde{\mathbf{V}}\|_{2,0}=\hat{L},subject to ∥ over~ start_ARG bold_V end_ARG ∥ start_POSTSUBSCRIPT 2 , 0 end_POSTSUBSCRIPT = over^ start_ARG italic_L end_ARG , (9)

where 𝐕~2,0subscriptnorm~𝐕20\|\tilde{\mathbf{V}}\|_{2,0}∥ over~ start_ARG bold_V end_ARG ∥ start_POSTSUBSCRIPT 2 , 0 end_POSTSUBSCRIPT denotes the number of non-zero rows of 𝐕~~𝐕\tilde{\mathbf{V}}over~ start_ARG bold_V end_ARG.

The problem in (IV-A) can be approximately solved by a compressed sensing algorithm for multiple measurement vectors (MMV) problems, e.g., SOMP [8]. The computational complexity of SOMP at the t𝑡titalic_t-th iteration in a naive implementation is 𝒪(GrGθNKp+Nt+Nt2+t3)𝒪subscript𝐺𝑟subscript𝐺𝜃𝑁subscript𝐾p𝑁𝑡𝑁superscript𝑡2superscript𝑡3\mathcal{O}(G_{r}G_{\theta}NK_{\mathrm{p}}+Nt+Nt^{2}+t^{3})caligraphic_O ( italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_N italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT + italic_N italic_t + italic_N italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ). Its complexity can be further reduced by using the matrix inversion lemma (MIL) to 𝒪(GrGθKpt+Nt)𝒪subscript𝐺𝑟subscript𝐺𝜃subscript𝐾p𝑡𝑁𝑡\mathcal{O}(G_{r}G_{\theta}K_{\mathrm{p}}t+Nt)caligraphic_O ( italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT italic_t + italic_N italic_t ) [22, 23]. Solving the problem (IV-A) yields the angle and distance candidates corresponding to the non-zero rows of 𝐕~~𝐕\tilde{\mathbf{V}}over~ start_ARG bold_V end_ARG, defined as 𝜽ˇ{θˇl}l=1L^ˇ𝜽superscriptsubscriptsubscriptˇ𝜃𝑙𝑙1^𝐿\check{\bm{\theta}}\triangleq\{\check{\theta}_{l}\}_{l=1}^{\hat{L}}overroman_ˇ start_ARG bold_italic_θ end_ARG ≜ { overroman_ˇ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG end_POSTSUPERSCRIPT and 𝐫ˇ{rˇl}l=1L^ˇ𝐫superscriptsubscriptsubscriptˇ𝑟𝑙𝑙1^𝐿\check{\mathbf{r}}\triangleq\{\check{r}_{l}\}_{l=1}^{\hat{L}}overroman_ˇ start_ARG bold_r end_ARG ≜ { overroman_ˇ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG end_POSTSUPERSCRIPT.

IV-B UE-Path Pairing

The path candidate set {θˇl,rˇl}l=1L^superscriptsubscriptsubscriptˇ𝜃𝑙subscriptˇ𝑟𝑙𝑙1^𝐿\{\check{\theta}_{l},\check{r}_{l}\}_{l=1}^{\hat{L}}{ overroman_ˇ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , overroman_ˇ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG end_POSTSUPERSCRIPT obtained in the first stage does not specify the association of individual paths with each UE. To estimate individual channels for each UE, the second stage performs UE-path pairing, where the estimated path candidates are associated with each user using UE-specific non-orthogonal pilot sequences {𝐱p,u}u=1Usuperscriptsubscriptsubscript𝐱p𝑢𝑢1𝑈\{\mathbf{x}_{\mathrm{p},u}\}_{u=1}^{U}{ bold_x start_POSTSUBSCRIPT roman_p , italic_u end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_U end_POSTSUPERSCRIPT 222In case of orthogonal piloting, one can readily imagine that this is a straightforward task.. The usage of limited path candidates 𝜽ˇˇ𝜽\check{\bm{\theta}}overroman_ˇ start_ARG bold_italic_θ end_ARG, 𝐫ˇˇ𝐫\check{\mathbf{r}}overroman_ˇ start_ARG bold_r end_ARG, rather than large-size polar-grids 𝜽~~𝜽\tilde{\bm{\theta}}over~ start_ARG bold_italic_θ end_ARG, 𝐫~~𝐫\tilde{\mathbf{r}}over~ start_ARG bold_r end_ARG sampling the entire polar domain, can lead to a complexity reduction. Using the path set {θˇl,rˇl}l=1L^superscriptsubscriptsubscriptˇ𝜃𝑙subscriptˇ𝑟𝑙𝑙1^𝐿\{\check{\theta}_{l},\check{r}_{l}\}_{l=1}^{\hat{L}}{ overroman_ˇ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , overroman_ˇ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG end_POSTSUPERSCRIPT, the polar-domain dictionary matrix is designed as

𝐀ˇ(𝜽ˇ,𝐫ˇ)[𝐚(θˇ1,rˇ1),𝐚(θˇ2,rˇ2),𝐚(θˇL^,rˇL^)]N×L^.ˇ𝐀ˇ𝜽ˇ𝐫matrix𝐚subscriptˇ𝜃1subscriptˇ𝑟1𝐚subscriptˇ𝜃2subscriptˇ𝑟2𝐚subscriptˇ𝜃^𝐿subscriptˇ𝑟^𝐿superscript𝑁^𝐿\displaystyle\check{\mathbf{A}}(\check{\bm{\theta}},\check{\mathbf{r}})\!% \triangleq\!\begin{bmatrix}\mathbf{a}(\check{\theta}_{1},\check{r}_{1}),% \mathbf{a}(\check{\theta}_{2},\check{r}_{2}),\ldots\mathbf{a}(\check{\theta}_{% \hat{L}},\check{r}_{\hat{L}})\end{bmatrix}\in\mathbb{C}^{N\times\hat{L}}.overroman_ˇ start_ARG bold_A end_ARG ( overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG ) ≜ [ start_ARG start_ROW start_CELL bold_a ( overroman_ˇ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , overroman_ˇ start_ARG italic_r end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , bold_a ( overroman_ˇ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , overroman_ˇ start_ARG italic_r end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , … bold_a ( overroman_ˇ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_L end_ARG end_POSTSUBSCRIPT , overroman_ˇ start_ARG italic_r end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_L end_ARG end_POSTSUBSCRIPT ) end_CELL end_ROW end_ARG ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × over^ start_ARG italic_L end_ARG end_POSTSUPERSCRIPT . (10)

Reducing the size of the polar grids from GrGθsubscript𝐺𝑟subscript𝐺𝜃G_{r}G_{\theta}italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT in (IV-A) to L^(GrGθ)annotated^𝐿much-less-thanabsentsubscript𝐺𝑟subscript𝐺𝜃\hat{L}(\ll G_{r}G_{\theta})over^ start_ARG italic_L end_ARG ( ≪ italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ) in (10) can effectively lower the complexity in the following compressed sensing algorithm. Then, the channel vector for the u𝑢uitalic_u-th UE can be approximated with the polar-domain dictionary 𝐀ˇ(𝜽ˇ,𝐫ˇ)ˇ𝐀ˇ𝜽ˇ𝐫\check{\mathbf{A}}(\check{\bm{\theta}},\check{\mathbf{r}})overroman_ˇ start_ARG bold_A end_ARG ( overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG ) as

𝐡u𝒮=𝐀(𝜽u,𝐫u)𝐳u𝐀ˇ(𝜽ˇ,𝐫ˇ)𝐳ˇu,superscriptsubscript𝐡𝑢𝒮𝐀subscript𝜽𝑢subscript𝐫𝑢subscript𝐳𝑢similar-to-or-equalsˇ𝐀ˇ𝜽ˇ𝐫subscriptˇ𝐳𝑢\displaystyle\mathbf{h}_{u}^{\mathcal{S}}=\mathbf{A}(\bm{\theta}_{u},\mathbf{r% }_{u})\mathbf{z}_{u}\simeq\check{\mathbf{A}}(\check{\bm{\theta}},\check{% \mathbf{r}})\check{\mathbf{z}}_{u},bold_h start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT = bold_A ( bold_italic_θ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , bold_r start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) bold_z start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ≃ overroman_ˇ start_ARG bold_A end_ARG ( overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG ) overroman_ˇ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , (11)

where 𝐳ˇuL^×1subscriptˇ𝐳𝑢superscript^𝐿1\check{\mathbf{z}}_{u}\in\mathbb{C}^{\hat{L}\times 1}overroman_ˇ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG × 1 end_POSTSUPERSCRIPT is the virtual path gain vector.

From (11), the received pilot 𝐘p𝒮superscriptsubscript𝐘p𝒮\mathbf{Y}_{\mathrm{p}}^{\mathcal{S}}bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT is approximated as

𝐘p𝒮superscriptsubscript𝐘p𝒮\displaystyle\mathbf{Y}_{\mathrm{p}}^{\mathcal{S}}bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT u=1U𝐀ˇ(𝜽ˇ,𝐫ˇ)𝐳ˇu𝐱p,uT+𝐍p=𝐀ˇ(𝜽ˇ,𝐫ˇ)𝐙ˇ𝐗p+𝐍p𝒮,similar-to-or-equalsabsentsuperscriptsubscript𝑢1𝑈ˇ𝐀ˇ𝜽ˇ𝐫subscriptˇ𝐳𝑢superscriptsubscript𝐱p𝑢Tsubscript𝐍pˇ𝐀ˇ𝜽ˇ𝐫ˇ𝐙subscript𝐗psuperscriptsubscript𝐍p𝒮\displaystyle\simeq\sum_{u=1}^{U}\check{\mathbf{A}}(\check{\bm{\theta}},\check% {\mathbf{r}})\check{\mathbf{z}}_{u}\mathbf{x}_{\mathrm{p},u}^{\mathrm{T}}\!+\!% \mathbf{N}_{\mathrm{p}}=\check{\mathbf{A}}(\check{\bm{\theta}},\check{\mathbf{% r}})\check{\mathbf{Z}}\mathbf{X}_{\mathrm{p}}+\mathbf{N}_{\mathrm{p}}^{% \mathcal{S}},≃ ∑ start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_U end_POSTSUPERSCRIPT overroman_ˇ start_ARG bold_A end_ARG ( overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG ) overroman_ˇ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT bold_x start_POSTSUBSCRIPT roman_p , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT + bold_N start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT = overroman_ˇ start_ARG bold_A end_ARG ( overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG ) overroman_ˇ start_ARG bold_Z end_ARG bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT + bold_N start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , (12)

with 𝐙ˇ[𝐳ˇ1,𝐳ˇ2,,𝐳ˇU]L^×Uˇ𝐙matrixsubscriptˇ𝐳1subscriptˇ𝐳2subscriptˇ𝐳𝑈superscript^𝐿𝑈\check{\mathbf{Z}}\triangleq\begin{bmatrix}\check{\mathbf{z}}_{1},\check{% \mathbf{z}}_{2},\ldots,\ \check{\mathbf{z}}_{U}\end{bmatrix}\in\mathbb{C}^{% \hat{L}\times U}overroman_ˇ start_ARG bold_Z end_ARG ≜ [ start_ARG start_ROW start_CELL overroman_ˇ start_ARG bold_z end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , overroman_ˇ start_ARG bold_z end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , overroman_ˇ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] ∈ blackboard_C start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG × italic_U end_POSTSUPERSCRIPT.

The equation (12) can be transformed into a 1D linear equation as 𝐲p𝒮𝚽p𝐳ˇsimilar-to-or-equalssuperscriptsubscript𝐲p𝒮subscript𝚽pˇ𝐳\mathbf{y}_{\mathrm{p}}^{\mathcal{S}}\simeq\mathbf{\Phi}_{\mathrm{p}}\check{% \mathbf{z}}bold_y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT ≃ bold_Φ start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT overroman_ˇ start_ARG bold_z end_ARG with 𝐲p𝒮vec(𝐘p𝒮)superscriptsubscript𝐲p𝒮vecsubscriptsuperscript𝐘𝒮p\mathbf{y}_{\mathrm{p}}^{\mathcal{S}}\triangleq\mathrm{vec}(\mathbf{Y}^{% \mathcal{S}}_{\mathrm{p}})bold_y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT ≜ roman_vec ( bold_Y start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ), 𝐳ˇvec(𝐙ˇ)L^U×1ˇ𝐳vecˇ𝐙superscript^𝐿𝑈1\check{\mathbf{z}}\triangleq\mathrm{vec}(\check{\mathbf{Z}})\in\mathbb{C}^{% \hat{L}U\times 1}overroman_ˇ start_ARG bold_z end_ARG ≜ roman_vec ( overroman_ˇ start_ARG bold_Z end_ARG ) ∈ blackboard_C start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG italic_U × 1 end_POSTSUPERSCRIPT and 𝚽p(𝐗pT𝐀ˇ(𝜽ˇ,𝐫ˇ))NKp×L^Usubscript𝚽ptensor-productsuperscriptsubscript𝐗pTˇ𝐀ˇ𝜽ˇ𝐫superscript𝑁subscript𝐾p^𝐿𝑈\mathbf{\Phi}_{\mathrm{p}}\triangleq\left(\mathbf{X}_{\mathrm{p}}^{\mathrm{T}}% \otimes\check{\mathbf{A}}(\check{\bm{\theta}},\check{\mathbf{r}})\right)\in% \mathbb{C}^{NK_{\mathrm{p}}\times\hat{L}U}bold_Φ start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ≜ ( bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ⊗ overroman_ˇ start_ARG bold_A end_ARG ( overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG ) ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT × over^ start_ARG italic_L end_ARG italic_U end_POSTSUPERSCRIPT. Although the estimation for 𝐳ˇˇ𝐳\check{\mathbf{z}}overroman_ˇ start_ARG bold_z end_ARG from the vectorized observation 𝐲p𝒮superscriptsubscript𝐲p𝒮\mathbf{y}_{\mathrm{p}}^{\mathcal{S}}bold_y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT can be simply addressed by various methods such as OMP [22], this significantly increases the complexity due to the large-size dictionary 𝚽psubscript𝚽p\mathbf{\Phi}_{\mathrm{p}}bold_Φ start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT. Hence, to circumvent the high computational burden, the 2D signal representation in (12) is directly addressed without the vectorized 1D representation. Then, the sparse reconstruction problem for 𝐙ˇˇ𝐙\check{\mathbf{Z}}overroman_ˇ start_ARG bold_Z end_ARG in (12) is formulated as

minimize𝐙ˇ𝐘p𝒮𝐀ˇ(𝜽ˇ,𝐫ˇ)𝐙ˇ𝐗p22ˇ𝐙minimizesuperscriptsubscriptnormsuperscriptsubscript𝐘p𝒮ˇ𝐀ˇ𝜽ˇ𝐫ˇ𝐙subscript𝐗p22\displaystyle\underset{\check{\mathbf{Z}}}{\text{minimize}}\ \ \left\|\mathbf{% Y}_{\mathrm{p}}^{\mathcal{S}}-\check{\mathbf{A}}(\check{\bm{\theta}},\check{% \mathbf{r}})\check{\mathbf{Z}}\mathbf{X}_{\mathrm{p}}\right\|_{\mathrm{2}}^{2}start_UNDERACCENT overroman_ˇ start_ARG bold_Z end_ARG end_UNDERACCENT start_ARG minimize end_ARG ∥ bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT - overroman_ˇ start_ARG bold_A end_ARG ( overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG ) overroman_ˇ start_ARG bold_Z end_ARG bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
subject to𝐳ˇ0=L^,subject tosubscriptnormˇ𝐳0^𝐿\displaystyle\text{subject to}\ \ \left\|\check{\mathbf{z}}\right\|_{0}=\hat{L},subject to ∥ overroman_ˇ start_ARG bold_z end_ARG ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = over^ start_ARG italic_L end_ARG , (13)

with 𝐳ˇvec(𝐙ˇ)L^U×1ˇ𝐳vecˇ𝐙superscript^𝐿𝑈1\check{\mathbf{z}}\triangleq\mathrm{vec}(\check{\mathbf{Z}})\in\mathbb{C}^{% \hat{L}U\times 1}overroman_ˇ start_ARG bold_z end_ARG ≜ roman_vec ( overroman_ˇ start_ARG bold_Z end_ARG ) ∈ blackboard_C start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG italic_U × 1 end_POSTSUPERSCRIPT.

The optimization problem (IV-B) is solved via a two-dimensional compressed sensing algorithm. The conventional method [10] tackles this problem with the large-size polar dictionary 𝐀~(𝜽~,𝐫~)~𝐀~𝜽~𝐫\tilde{\mathbf{A}}(\tilde{\bm{\theta}},\tilde{\mathbf{r}})over~ start_ARG bold_A end_ARG ( over~ start_ARG bold_italic_θ end_ARG , over~ start_ARG bold_r end_ARG ) in (IV-A) instead of 𝐀ˇ(𝜽ˇ,𝐫ˇ)ˇ𝐀ˇ𝜽ˇ𝐫\check{\mathbf{A}}(\check{\bm{\theta}},\check{\mathbf{r}})overroman_ˇ start_ARG bold_A end_ARG ( overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG ) in (10) via the 2D-CoSaMP algorithm, which sacrifices estimation performance for complexity reduction compared to 2D-OMP [19]. In contrast, our proposed method solves the optimization problem (IV-B) via the 2D-OMP algorithm using the small-size polar-domain dictionary 𝐀ˇ(𝜽ˇ,𝐫ˇ)ˇ𝐀ˇ𝜽ˇ𝐫\check{\mathbf{A}}(\check{\bm{\theta}},\check{\mathbf{r}})overroman_ˇ start_ARG bold_A end_ARG ( overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG ) constructed by the path candidates {θˇl,rˇl}l=1L^superscriptsubscriptsubscriptˇ𝜃𝑙subscriptˇ𝑟𝑙𝑙1^𝐿\{\check{\theta}_{l},\check{r}_{l}\}_{l=1}^{\hat{L}}{ overroman_ˇ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , overroman_ˇ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG end_POSTSUPERSCRIPT in the first stage. As a result, the proposed method possesses the prominent capability to overcome the conventional approach [10] while retaining comparable computational complexity. Detailed discussions regarding the complexity of the proposed algorithm are presented in Section VI.

Solving the problem (IV-B) yields the estimated path gain vector 𝐳^uL^u×1subscript^𝐳𝑢superscriptsubscript^𝐿𝑢1\hat{\mathbf{z}}_{u}\in\mathbb{C}^{\hat{L}_{u}\times 1}over^ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT, angle 𝜽^uL^u×1subscript^𝜽𝑢superscriptsubscript^𝐿𝑢1\hat{\bm{\theta}}_{u}\in\mathbb{R}^{\hat{L}_{u}\times 1}over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT, and distance 𝐫^uL^u×1subscript^𝐫𝑢superscriptsubscript^𝐿𝑢1\hat{\mathbf{r}}_{u}\in\mathbb{R}^{\hat{L}_{u}\times 1}over^ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT corresponding to the non-zero elements of 𝐳ˇuL^×1subscriptˇ𝐳𝑢superscript^𝐿1\check{\mathbf{z}}_{u}\in\mathbb{C}^{\hat{L}\times 1}overroman_ˇ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG × 1 end_POSTSUPERSCRIPT, where L^usubscript^𝐿𝑢\hat{L}_{u}over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT is the estimated number of paths for the u𝑢uitalic_u-th UE. Given the estimates, the initial channel estimate can be obtained as

𝐇^0𝒮=[𝐡^1𝒮,,𝐡^U𝒮]N×U,with 𝐡^u𝒮=𝐀^(𝜽^u,𝐫^u)𝐳^u,formulae-sequencesubscriptsuperscript^𝐇𝒮0matrixsuperscriptsubscript^𝐡1𝒮superscriptsubscript^𝐡𝑈𝒮superscript𝑁𝑈with superscriptsubscript^𝐡𝑢𝒮^𝐀subscript^𝜽𝑢subscript^𝐫𝑢subscript^𝐳𝑢\displaystyle\hat{\mathbf{H}}^{\mathcal{S}}_{0}\!=\!\begin{bmatrix}\hat{% \mathbf{h}}_{1}^{\mathcal{S}},\ldots,\hat{\mathbf{h}}_{U}^{\mathcal{S}}\end{% bmatrix}\in\mathbb{C}^{N\times U},\text{with }\hat{\mathbf{h}}_{u}^{\mathcal{S% }}\!=\!\hat{\mathbf{A}}(\hat{\bm{\theta}}_{u},\hat{\mathbf{r}}_{u})\hat{% \mathbf{z}}_{u},over^ start_ARG bold_H end_ARG start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL over^ start_ARG bold_h end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , … , over^ start_ARG bold_h end_ARG start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_U end_POSTSUPERSCRIPT , with over^ start_ARG bold_h end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT = over^ start_ARG bold_A end_ARG ( over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over^ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) over^ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , (14)

where 𝐀^(𝜽^u,𝐫^u)=[𝐚(θ^u,1,r^u,1),,𝐚(θ^u,L^u,r^u,L^u)]N×L^u^𝐀subscript^𝜽𝑢subscript^𝐫𝑢matrix𝐚subscript^𝜃𝑢1subscript^𝑟𝑢1𝐚subscript^𝜃𝑢subscript^𝐿𝑢subscript^𝑟𝑢subscript^𝐿𝑢superscript𝑁subscript^𝐿𝑢\hat{\mathbf{A}}(\hat{\bm{\theta}}_{u},\hat{\mathbf{r}}_{u})=\begin{bmatrix}% \mathbf{a}(\hat{\theta}_{u,1},\hat{r}_{u,1}),\ldots,\mathbf{a}(\hat{\theta}_{u% ,\hat{L}_{u}},\hat{r}_{u,\hat{L}_{u}})\end{bmatrix}\in\mathbb{C}^{N\times\hat{% L}_{u}}over^ start_ARG bold_A end_ARG ( over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over^ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) = [ start_ARG start_ROW start_CELL bold_a ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , 1 end_POSTSUBSCRIPT , over^ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , 1 end_POSTSUBSCRIPT ) , … , bold_a ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT , over^ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL end_ROW end_ARG ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the estimated array response. The proposed initial channel estimation method is summarized in Algorithm 1.

Algorithm 1 Proposed channel estimation algorithm
1:Input: 𝐘p𝒮,𝐗p,L^superscriptsubscript𝐘p𝒮subscript𝐗p^𝐿\mathbf{Y}_{\mathrm{p}}^{\mathcal{S}},\ \mathbf{X}_{\mathrm{p}},\ \hat{L}bold_Y start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT , over^ start_ARG italic_L end_ARG
2:Output: 𝐇^0𝒮,{𝜽^u,𝐫^u,𝐳^u,}u=1U\hat{\mathbf{H}}_{0}^{\mathcal{S}},\{\hat{\bm{\theta}}_{u},\hat{\mathbf{r}}_{u% },\hat{\mathbf{z}}_{u},\}_{u=1}^{U}over^ start_ARG bold_H end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT , { over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over^ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over^ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , } start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_U end_POSTSUPERSCRIPT
3:
4:// First Stage - Angle and distance estimation
5:Calculate polar-domain dictionary 𝐀~(𝜽~,𝐫~)~𝐀~𝜽~𝐫\tilde{\mathbf{A}}(\tilde{\bm{\theta}},\tilde{\mathbf{r}})over~ start_ARG bold_A end_ARG ( over~ start_ARG bold_italic_θ end_ARG , over~ start_ARG bold_r end_ARG ) in (IV-A)
6:Estimate 𝜽ˇ,𝐫ˇˇ𝜽ˇ𝐫\check{\bm{\theta}},\check{\mathbf{r}}overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG by solving (IV-A) via SOMP with MIL [22]
7:// Second Stage - UE-path pairing
8:Calculate polar-domain dictionary 𝐀ˇ(𝜽ˇ,𝐫ˇ)ˇ𝐀ˇ𝜽ˇ𝐫\check{\mathbf{A}}(\check{\bm{\theta}},\check{\mathbf{r}})overroman_ˇ start_ARG bold_A end_ARG ( overroman_ˇ start_ARG bold_italic_θ end_ARG , overroman_ˇ start_ARG bold_r end_ARG ) in (10)
9:Estimate {𝜽^u,𝐫^u,𝐳^u}u=1Usuperscriptsubscriptsubscript^𝜽𝑢subscript^𝐫𝑢subscript^𝐳𝑢𝑢1𝑈\{\hat{\bm{\theta}}_{u},\hat{\mathbf{r}}_{u},\hat{\mathbf{z}}_{u}\}_{u=1}^{U}{ over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over^ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over^ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_U end_POSTSUPERSCRIPT by solving (IV-B) via 2D-OMP [19]
10:Estimate channel matrix 𝐇^0𝒮superscriptsubscript^𝐇0𝒮\hat{\mathbf{H}}_{0}^{\mathcal{S}}over^ start_ARG bold_H end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT in (14)

V Proposed joint channel and data estimation

Given the initial estimates obtained from Algorithm 1, we aim to improve both the channel estimation performance as well as the data estimation accuracy by jointly processing the channel estimation and data detection while considering near-field properties. This section elaborates on the proposed JCDE algorithm with the initial channel estimate.

V-A Pre-processing for Channel and Data Estimation

V-A1 Pre-processing for Channel Estimation

To exploit the channel sparsity, the received signal and channel matrix in the spatial-domain are transformed in the beam-domain as 𝐘𝐃N𝐘𝒮N×K𝐘subscript𝐃𝑁superscript𝐘𝒮superscript𝑁𝐾\mathbf{Y}\triangleq\mathbf{D}_{N}\mathbf{Y}^{\mathcal{S}}\in\mathbb{C}^{N% \times K}bold_Y ≜ bold_D start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT bold_Y start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_K end_POSTSUPERSCRIPT and 𝐇𝐃N𝐇𝒮N×U𝐇subscript𝐃𝑁superscript𝐇𝒮superscript𝑁𝑈\mathbf{H}\triangleq\mathbf{D}_{N}\mathbf{H}^{\mathcal{S}}\in\mathbb{C}^{N% \times U}bold_H ≜ bold_D start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT bold_H start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_U end_POSTSUPERSCRIPT , where 𝐃NN×Nsubscript𝐃Nsuperscript𝑁𝑁\mathbf{D}_{\mathrm{N}}\in\mathbb{C}^{N\times N}bold_D start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT is the DFT matrix . As described in Section II-A, the near-field channel has a cluster sparse structure due to energy leakage, thus, to tackle this problem, the channel matrix 𝐇𝐇\mathbf{H}bold_H is first considered as the aggregation of the model-based estimate 𝐒^N×U^𝐒superscript𝑁𝑈\hat{\mathbf{S}}\in\mathbb{C}^{N\times U}over^ start_ARG bold_S end_ARG ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_U end_POSTSUPERSCRIPT and the residual channel estimation error 𝐄𝐇𝐒^N×U𝐄𝐇^𝐒superscript𝑁𝑈\mathbf{E}\triangleq\mathbf{H}-\hat{\mathbf{S}}\in\mathbb{C}^{N\times U}bold_E ≜ bold_H - over^ start_ARG bold_S end_ARG ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_U end_POSTSUPERSCRIPT, resulting in

𝐇=𝐒^+𝐄.𝐇^𝐒𝐄\displaystyle\mathbf{H}=\hat{\mathbf{S}}+\mathbf{E}.bold_H = over^ start_ARG bold_S end_ARG + bold_E . (15)

An initial value for the model-based estimate 𝐒^^𝐒\hat{\mathbf{S}}over^ start_ARG bold_S end_ARG is determined with the proposed initial channel estimate 𝐇^0𝒮superscriptsubscript^𝐇0𝒮\hat{\mathbf{H}}_{0}^{\mathcal{S}}over^ start_ARG bold_H end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT in (14) as 𝐒^=𝐃N𝐇^0𝒮^𝐒subscript𝐃𝑁superscriptsubscript^𝐇0𝒮\hat{\mathbf{S}}=\mathbf{D}_{N}\hat{\mathbf{H}}_{0}^{\mathcal{S}}over^ start_ARG bold_S end_ARG = bold_D start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT over^ start_ARG bold_H end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT, and it is adaptively updated based on the near-field model structure as described in Section V-F. As the residual error 𝐄𝐄\mathbf{E}bold_E is defined by subtracting the current estimate 𝐒^^𝐒\hat{\mathbf{S}}over^ start_ARG bold_S end_ARG from the beam-domain channel 𝐇𝐇\mathbf{H}bold_H as in (15), this subtraction results in a sparser domain representation compared to the original beam-domain channel 𝐇𝐇\mathbf{H}bold_H. The dominant path components are removed from 𝐇𝐇\mathbf{H}bold_H by 𝐒^^𝐒\hat{\mathbf{S}}over^ start_ARG bold_S end_ARG, facilitating the sparse matrix reconstruction by considering 𝐄𝐄\mathbf{E}bold_E (instead of 𝐇𝐇\mathbf{H}bold_H) as the variable to be estimated by a Bayesian inference framework.

V-A2 Pre-processing for Data Estimation

For low-complexity data estimation, the conventional methods based on the far-field assumption, such as [13, 12, 14], utilize MRC-based detectors that are effective in the far-field region since the beam-domain channel exhibits a simple sparse structure with a peaky spike and no correlation between the beam indices. However, these detectors are ineffective in the near-field scenario because the near-field channel has cluster sparsity due to energy leakage, and the leaked energy is correlated in the beam-domain. Although LMMSE-based detection methods such as [24, 25] are effective to deal with the correlation, these methods require matrix inversion with the size N𝑁Nitalic_N, which is computationally expensive especially in XL-MIMO systems. To balance the computational complexity and detection performance, the array is virtually divided into multiple sub-arrays, and a sub-array-wise LMMSE-based detector is designed similarly to [26]. In contrast to [26], which assumes perfect channel state information (CSI), the proposed method considers the channel estimation error while jointly estimating data and channel, exploiting the near-field model structures.

Accordingly, the extra-large array with N𝑁Nitalic_N antennas are partitioned into C𝐶Citalic_C sub-arrays, and the sub-array c𝒞{1,2,,C}𝑐𝒞12𝐶c\in\mathcal{C}\triangleq\{1,2,\ldots,C\}italic_c ∈ caligraphic_C ≜ { 1 , 2 , … , italic_C } has Ncsubscript𝑁𝑐N_{c}italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT antennas satisfying N=c=1CNc𝑁superscriptsubscript𝑐1𝐶subscript𝑁𝑐N=\sum_{c=1}^{C}N_{c}italic_N = ∑ start_POSTSUBSCRIPT italic_c = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_C end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT. The received signals, residual channel errors, and model-based estimates can be also seen as 𝐘=[𝐘1T,𝐘2T,,𝐘CT]T𝐘superscriptmatrixsuperscriptsubscript𝐘1Tsuperscriptsubscript𝐘2Tsuperscriptsubscript𝐘𝐶TT\mathbf{Y}=\begin{bmatrix}\mathbf{Y}_{1}^{\mathrm{T}},\mathbf{Y}_{2}^{\mathrm{% T}},\dots,\mathbf{Y}_{C}^{\mathrm{T}}\end{bmatrix}^{\mathrm{T}}bold_Y = [ start_ARG start_ROW start_CELL bold_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT , bold_Y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT , … , bold_Y start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT, 𝐄=[𝐄1T,𝐄2T,,𝐄CT]T𝐄superscriptmatrixsuperscriptsubscript𝐄1Tsuperscriptsubscript𝐄2Tsuperscriptsubscript𝐄𝐶TT\mathbf{E}=\begin{bmatrix}\mathbf{E}_{1}^{\mathrm{T}},\mathbf{E}_{2}^{\mathrm{% T}},\dots,\mathbf{E}_{C}^{\mathrm{T}}\end{bmatrix}^{\mathrm{T}}bold_E = [ start_ARG start_ROW start_CELL bold_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT , bold_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT , … , bold_E start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT, and 𝐒^=[𝐒^1T,𝐒^2T,,𝐒^CT]T^𝐒superscriptmatrixsuperscriptsubscript^𝐒1Tsuperscriptsubscript^𝐒2Tsuperscriptsubscript^𝐒𝐶TT\hat{\mathbf{S}}=\begin{bmatrix}\hat{\mathbf{S}}_{1}^{\mathrm{T}},\hat{\mathbf% {S}}_{2}^{\mathrm{T}},\dots,\hat{\mathbf{S}}_{C}^{\mathrm{T}}\end{bmatrix}^{% \mathrm{T}}over^ start_ARG bold_S end_ARG = [ start_ARG start_ROW start_CELL over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT , over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT , … , over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT, with 𝐘cNc×Ksubscript𝐘𝑐superscriptsubscript𝑁𝑐𝐾\mathbf{Y}_{c}\in\mathbb{C}^{N_{c}\times K}bold_Y start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT × italic_K end_POSTSUPERSCRIPT, 𝐄cNc×Usubscript𝐄𝑐superscriptsubscript𝑁𝑐𝑈\mathbf{E}_{c}\in\mathbb{C}^{N_{c}\times U}bold_E start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT × italic_U end_POSTSUPERSCRIPT, and 𝐒^cNc×Usubscript^𝐒𝑐superscriptsubscript𝑁𝑐𝑈\hat{\mathbf{S}}_{c}\in\mathbb{C}^{N_{c}\times U}over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT × italic_U end_POSTSUPERSCRIPT. The received signals 𝐘𝐘\mathbf{Y}bold_Y and 𝐘csubscript𝐘𝑐\mathbf{Y}_{c}bold_Y start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT can then be rewritten as

𝐘=𝐄𝐗+𝐒^𝐗+𝐍, with 𝐘c=𝐄c𝐗+𝐒^c𝐗+𝐍c.formulae-sequence𝐘𝐄𝐗^𝐒𝐗𝐍 with subscript𝐘𝑐subscript𝐄𝑐𝐗subscript^𝐒𝑐𝐗subscript𝐍𝑐\displaystyle\mathbf{Y}=\mathbf{E}\mathbf{X}+\hat{\mathbf{S}}\mathbf{X}+% \mathbf{N},\text{ with }\mathbf{Y}_{c}=\mathbf{E}_{c}\mathbf{X}+\hat{\mathbf{S% }}_{c}\mathbf{X}+\mathbf{N}_{c}.bold_Y = bold_EX + over^ start_ARG bold_S end_ARG bold_X + bold_N , with bold_Y start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT = bold_E start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT bold_X + over^ start_ARG bold_S end_ARG start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT bold_X + bold_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT . (16)

For convenience, let us define 𝒩{1,2,,N}𝒩12𝑁\mathcal{N}\triangleq\{1,2,\ldots,N\}caligraphic_N ≜ { 1 , 2 , … , italic_N } as the antenna index set, and 𝒩c{nc(1),nc(2),,nc(Nc)}𝒩subscript𝒩𝑐subscript𝑛𝑐1subscript𝑛𝑐2subscript𝑛𝑐𝑁𝑐𝒩\mathcal{N}_{c}\triangleq\left\{{n_{c(1)}},n_{c(2)},\ldots,n_{c(Nc)}\right\}% \subset\mathcal{N}caligraphic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ≜ { italic_n start_POSTSUBSCRIPT italic_c ( 1 ) end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT italic_c ( 2 ) end_POSTSUBSCRIPT , … , italic_n start_POSTSUBSCRIPT italic_c ( italic_N italic_c ) end_POSTSUBSCRIPT } ⊂ caligraphic_N as the antenna index set at the c𝑐citalic_c-th sub-array such that 𝒩1𝒩2𝒩C=𝒩subscript𝒩1subscript𝒩2subscript𝒩𝐶𝒩\mathcal{N}_{1}\cup\mathcal{N}_{2}\cup\cdots\cup\mathcal{N}_{C}=\mathcal{N}caligraphic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ caligraphic_N start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∪ ⋯ ∪ caligraphic_N start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT = caligraphic_N and 𝒩i𝒩j=,ij𝒞formulae-sequencesubscript𝒩𝑖subscript𝒩𝑗𝑖𝑗𝒞\mathcal{N}_{i}\cap\mathcal{N}_{j}=\emptyset,\ i\neq j\in\mathcal{C}caligraphic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∩ caligraphic_N start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ∅ , italic_i ≠ italic_j ∈ caligraphic_C.

V-B Bayesian Inference Formulation

Based on the linear observation in (16) with the deterministic variable 𝐒^^𝐒\hat{\mathbf{S}}over^ start_ARG bold_S end_ARG and random variables 𝐗𝐗\mathbf{X}bold_X and 𝐄𝐄\mathbf{E}bold_E, the likelihood function for 𝐗𝐗\mathbf{X}bold_X and 𝐄𝐄\mathbf{E}bold_E can be expressed as

p(𝐘|𝐄,𝐗)=n𝒩k𝒦p(yn,k|𝐱¯k,𝐞¯n),𝑝conditional𝐘𝐄𝐗subscriptproduct𝑛𝒩subscriptproduct𝑘𝒦𝑝conditionalsubscript𝑦𝑛𝑘subscript¯𝐱𝑘subscript¯𝐞𝑛\displaystyle p(\mathbf{Y}|\mathbf{E},\mathbf{X})\!=\!\prod_{n\in\mathcal{N}}% \prod_{k\in\mathcal{K}}p(y_{n,k}|\bar{\mathbf{x}}_{k},\bar{\mathbf{e}}_{n}),italic_p ( bold_Y | bold_E , bold_X ) = ∏ start_POSTSUBSCRIPT italic_n ∈ caligraphic_N end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_k ∈ caligraphic_K end_POSTSUBSCRIPT italic_p ( italic_y start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT | over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , over¯ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) , (17)

where p(yn,k|𝐞¯n,𝐱¯k)=𝒞𝒩((𝐞¯n+𝐬¯n)T𝐱¯k,σ2)𝑝conditionalsubscript𝑦𝑛𝑘subscript¯𝐞𝑛subscript¯𝐱𝑘𝒞𝒩superscriptsubscript¯𝐞𝑛subscript¯𝐬𝑛Tsubscript¯𝐱𝑘superscript𝜎2p(y_{n,k}|\bar{\mathbf{e}}_{n},\bar{\mathbf{x}}_{k})=\mathcal{CN}((\bar{% \mathbf{e}}_{n}+\bar{\mathbf{s}}_{n})^{\mathrm{T}}\bar{\mathbf{x}}_{k},\ % \sigma^{2})italic_p ( italic_y start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT | over¯ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = caligraphic_C caligraphic_N ( ( over¯ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + over¯ start_ARG bold_s end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) with 𝐞¯n=[en,1,,en,U]TU×1subscript¯𝐞𝑛superscriptsubscript𝑒𝑛1subscript𝑒𝑛𝑈Tsuperscript𝑈1\bar{\mathbf{e}}_{n}=[e_{n,1},\ldots,e_{n,U}]^{\mathrm{T}}\in\mathbb{C}^{U% \times 1}over¯ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = [ italic_e start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT , … , italic_e start_POSTSUBSCRIPT italic_n , italic_U end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_U × 1 end_POSTSUPERSCRIPT, 𝐬¯n=[s^n,1,,s^n,U]TU×1subscript¯𝐬𝑛superscriptsubscript^𝑠𝑛1subscript^𝑠𝑛𝑈Tsuperscript𝑈1\bar{\mathbf{s}}_{n}=[\hat{s}_{n,1},\ldots,\hat{s}_{n,U}]^{\mathrm{T}}\in% \mathbb{C}^{U\times 1}over¯ start_ARG bold_s end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = [ over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_n , 1 end_POSTSUBSCRIPT , … , over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_n , italic_U end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_U × 1 end_POSTSUPERSCRIPT, and 𝐱¯k=[x1,k,,xU,k]TU×1subscript¯𝐱𝑘superscriptsubscript𝑥1𝑘subscript𝑥𝑈𝑘Tsuperscript𝑈1\bar{\mathbf{x}}_{k}=[x_{1,k},\ldots,x_{U,k}]^{\mathrm{T}}\in\mathbb{C}^{U% \times 1}over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = [ italic_x start_POSTSUBSCRIPT 1 , italic_k end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_U , italic_k end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_U × 1 end_POSTSUPERSCRIPT.

Since each entry of 𝐗dsubscript𝐗d\mathbf{X}_{\mathrm{d}}bold_X start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT is randomly selected from the QAM constellation point set 𝒳𝒳\mathcal{X}caligraphic_X, the prior p(𝐗)𝑝𝐗p(\mathbf{X})italic_p ( bold_X ) can be written as

p(𝐗)=u𝒰k𝒦p(xu,k),𝑝𝐗subscriptproduct𝑢𝒰subscriptproduct𝑘𝒦𝑝subscript𝑥𝑢𝑘\displaystyle p(\mathbf{X})=\prod_{u\in\mathcal{U}}\prod_{k\in\mathcal{K}}p(x_% {u,k}),italic_p ( bold_X ) = ∏ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_k ∈ caligraphic_K end_POSTSUBSCRIPT italic_p ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) , (18)

with p(xu,kd)=1Q𝒳i𝒳δ(xu,kd𝒳i),kd𝒦dformulae-sequence𝑝subscript𝑥𝑢subscript𝑘𝑑1𝑄subscriptsubscript𝒳𝑖𝒳𝛿subscript𝑥𝑢subscript𝑘𝑑subscript𝒳𝑖for-allsubscript𝑘𝑑subscript𝒦𝑑p(x_{u,k_{d}})=\frac{1}{Q}\sum_{\mathcal{X}_{i}\in\mathcal{X}}\delta(x_{u,k_{d% }}-\mathcal{X}_{i}),\ \forall k_{d}\in\mathcal{K}_{d}italic_p ( italic_x start_POSTSUBSCRIPT italic_u , italic_k start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG italic_Q end_ARG ∑ start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ caligraphic_X end_POSTSUBSCRIPT italic_δ ( italic_x start_POSTSUBSCRIPT italic_u , italic_k start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT - caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , ∀ italic_k start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ∈ caligraphic_K start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT and p(xu,kp)=δ(xu,kp[𝐗p]u,kp),kp𝒦pformulae-sequence𝑝subscript𝑥𝑢subscript𝑘𝑝𝛿subscript𝑥𝑢subscript𝑘𝑝subscriptdelimited-[]subscript𝐗p𝑢subscript𝑘𝑝for-allsubscript𝑘𝑝subscript𝒦𝑝p(x_{u,k_{p}})=\delta(x_{u,k_{p}}-[\mathbf{X}_{\mathrm{p}}]_{u,k_{p}}),\ % \forall k_{p}\in\mathcal{K}_{p}italic_p ( italic_x start_POSTSUBSCRIPT italic_u , italic_k start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_δ ( italic_x start_POSTSUBSCRIPT italic_u , italic_k start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT - [ bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_u , italic_k start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , ∀ italic_k start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ∈ caligraphic_K start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT.

Although many conventional methods such as [13, 14] design the i.i.d. sparse prior for the beam-domain channel as p(𝐇)=n𝒩u𝒰p(hn,u)𝑝𝐇subscriptproduct𝑛𝒩subscriptproduct𝑢𝒰𝑝subscript𝑛𝑢p(\mathbf{H})=\prod_{n\in\mathcal{N}}\prod_{u\in\mathcal{U}}p(h_{n,u})italic_p ( bold_H ) = ∏ start_POSTSUBSCRIPT italic_n ∈ caligraphic_N end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_p ( italic_h start_POSTSUBSCRIPT italic_n , italic_u end_POSTSUBSCRIPT ) (e.g., BG prior), this modeling causes the model mismatch due to energy leakage effects in the near-field region. Therefore, we design the sparse prior for the residual channel error 𝐄𝐄\mathbf{E}bold_E instead of 𝐇𝐇\mathbf{H}bold_H as

p(𝐄;𝚯)=n𝒩u𝒰p(en,u;𝚯),𝑝𝐄𝚯subscriptproduct𝑛𝒩subscriptproduct𝑢𝒰𝑝subscript𝑒𝑛𝑢𝚯\displaystyle p(\mathbf{E};\mathbf{\Theta})=\prod_{n\in\mathcal{N}}\prod_{u\in% \mathcal{U}}p(e_{n,u};\mathbf{\Theta}),italic_p ( bold_E ; bold_Θ ) = ∏ start_POSTSUBSCRIPT italic_n ∈ caligraphic_N end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_p ( italic_e start_POSTSUBSCRIPT italic_n , italic_u end_POSTSUBSCRIPT ; bold_Θ ) , (19)

where p(en,u;𝚯)=𝒞𝒩(0,σn,ue)𝑝subscript𝑒𝑛𝑢𝚯𝒞𝒩0subscriptsuperscript𝜎𝑒𝑛𝑢p(e_{n,u};\mathbf{\Theta})=\mathcal{CN}(0,\ \sigma^{e}_{n,u})italic_p ( italic_e start_POSTSUBSCRIPT italic_n , italic_u end_POSTSUBSCRIPT ; bold_Θ ) = caligraphic_C caligraphic_N ( 0 , italic_σ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , italic_u end_POSTSUBSCRIPT ) is Gaussian prior distribution with zero mean and variance σn,uesubscriptsuperscript𝜎𝑒𝑛𝑢\sigma^{e}_{n,u}italic_σ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , italic_u end_POSTSUBSCRIPT, which is widely used for sparse representation in the sparse Bayesian learning (SBL) algorithm [27], where 𝚯{σn,ue}n𝒩,u𝒰𝚯subscriptsubscriptsuperscript𝜎𝑒𝑛𝑢formulae-sequence𝑛𝒩𝑢𝒰\mathbf{\Theta}\triangleq\{\sigma^{e}_{n,u}\}_{n\in\mathcal{N},u\in\mathcal{U}}bold_Θ ≜ { italic_σ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , italic_u end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_n ∈ caligraphic_N , italic_u ∈ caligraphic_U end_POSTSUBSCRIPT is the hyper parameter set to be optimized through the expectation maximization (EM) algorithm [28] as described in Section V-E.

From the likelihood in (17) and priors in (18), (19), the posterior can be written as

p(𝐄,𝐗|𝐘;𝚯)=p(𝐘|𝐄,𝐗)p(𝐗)p(𝐄;𝚯)/p(𝐘;𝚯),𝑝𝐄conditional𝐗𝐘𝚯𝑝conditional𝐘𝐄𝐗𝑝𝐗𝑝𝐄𝚯𝑝𝐘𝚯\displaystyle p(\mathbf{E},\mathbf{X}|\mathbf{Y};\mathbf{\Theta})=p(\mathbf{Y}% |\mathbf{E},\mathbf{X})p(\mathbf{X})p(\mathbf{E};\mathbf{\Theta})/p(\mathbf{Y}% ;\mathbf{\Theta}),italic_p ( bold_E , bold_X | bold_Y ; bold_Θ ) = italic_p ( bold_Y | bold_E , bold_X ) italic_p ( bold_X ) italic_p ( bold_E ; bold_Θ ) / italic_p ( bold_Y ; bold_Θ ) , (20)

where p(𝐘;𝚯)=𝐄,𝐗p(𝐘,𝐄,𝐗;𝚯)𝑝𝐘𝚯subscript𝐄𝐗𝑝𝐘𝐄𝐗𝚯p(\mathbf{Y};\mathbf{\Theta})=\int_{\mathbf{E},\mathbf{X}}p(\mathbf{Y},\mathbf% {E},\mathbf{X};\mathbf{\Theta})italic_p ( bold_Y ; bold_Θ ) = ∫ start_POSTSUBSCRIPT bold_E , bold_X end_POSTSUBSCRIPT italic_p ( bold_Y , bold_E , bold_X ; bold_Θ ) is the marginal likelihood referred to as the evidence for parameter 𝚯𝚯\mathbf{\Theta}bold_Θ. Our objective is to estimate 𝐄𝐄\mathbf{E}bold_E, 𝐗𝐗\mathbf{X}bold_X, and 𝚯𝚯\mathbf{\Theta}bold_Θ through the posterior and the evidence.

The estimator for 𝚯𝚯\mathbf{\Theta}bold_Θ by the type-II maximum likelihood method [29] is given as

𝚯^=argmax𝚯p(𝐘;𝚯).^𝚯𝚯argmax𝑝𝐘𝚯\displaystyle\hat{\mathbf{\Theta}}=\underset{\mathbf{\Theta}}{\mathrm{argmax}}% \ p(\mathbf{Y};\mathbf{\Theta}).over^ start_ARG bold_Θ end_ARG = underbold_Θ start_ARG roman_argmax end_ARG italic_p ( bold_Y ; bold_Θ ) . (21)

However, the calculation of the evidence p(𝐘;𝚯)𝑝𝐘𝚯p(\mathbf{Y};\mathbf{\Theta})italic_p ( bold_Y ; bold_Θ ) is intractable due to the multidimensional integral for 𝐗𝐗\mathbf{X}bold_X and 𝐄𝐄\mathbf{E}bold_E. Hence, we utilize the EM algorithm, which maximizes the evidence lower bound (ELBO) in each iteration, instead of directly maximizing the evidence [28]. Given 𝚯(t)superscript𝚯𝑡\mathbf{\Theta}^{(t)}bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT at the t𝑡titalic_t-th iteration, 𝚯(t+1)superscript𝚯𝑡1\mathbf{\Theta}^{(t+1)}bold_Θ start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT at the (t+1)𝑡1(t+1)( italic_t + 1 )-th iteration can be obtained as the following E-step and M-step:

E-step : (𝚯,𝚯(t))=𝔼p(𝐄,𝐗|𝐘;𝚯(t))[lnp(𝐘,𝐄,𝐗;𝚯)]+𝖼0(t),E-step : 𝚯superscript𝚯𝑡subscript𝔼𝑝𝐄conditional𝐗𝐘superscript𝚯𝑡delimited-[]𝑝𝐘𝐄𝐗𝚯superscriptsubscript𝖼0𝑡\displaystyle\text{E-step : }\mathcal{F}(\mathbf{\Theta},\mathbf{\Theta}^{(t)}% )\!=\!\mathbb{E}_{p(\mathbf{E},\mathbf{X}|\mathbf{Y};\mathbf{\Theta}^{(t)})}\!% \left[\ln p(\mathbf{Y},\mathbf{E},\mathbf{X};\mathbf{\Theta})\right]+\mathsf{c% }_{0}^{(t)}\!,E-step : caligraphic_F ( bold_Θ , bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) = blackboard_E start_POSTSUBSCRIPT italic_p ( bold_E , bold_X | bold_Y ; bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) end_POSTSUBSCRIPT [ roman_ln italic_p ( bold_Y , bold_E , bold_X ; bold_Θ ) ] + sansserif_c start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , (22)
M-step : 𝚯(t+1)=argmax𝚯(𝚯,𝚯(t)),M-step : superscript𝚯𝑡1𝚯argmax𝚯superscript𝚯𝑡\displaystyle\text{M-step : }\mathbf{\Theta}^{(t+1)}=\underset{\mathbf{\Theta}% }{\mathrm{argmax}}\ \mathcal{F}(\mathbf{\Theta},\mathbf{\Theta}^{(t)}),M-step : bold_Θ start_POSTSUPERSCRIPT ( italic_t + 1 ) end_POSTSUPERSCRIPT = underbold_Θ start_ARG roman_argmax end_ARG caligraphic_F ( bold_Θ , bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) , (23)

where (𝚯,𝚯(t))𝚯superscript𝚯𝑡\mathcal{F}(\mathbf{\Theta},\mathbf{\Theta}^{(t)})caligraphic_F ( bold_Θ , bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) is the ELBO with the constant value 𝖼0(t)=𝔼p(𝐄,𝐗|𝐘;𝚯(t))[lnp(𝐄,𝐗|𝐘;𝚯(t))]superscriptsubscript𝖼0𝑡subscript𝔼𝑝𝐄conditional𝐗𝐘superscript𝚯𝑡delimited-[]𝑝𝐄conditional𝐗𝐘superscript𝚯𝑡\mathsf{c}_{0}^{(t)}=\mathbb{E}_{p(\mathbf{E},\mathbf{X}|\mathbf{Y};\mathbf{% \Theta}^{(t)})}\left[\ln p(\mathbf{E},\mathbf{X}|\mathbf{Y};\mathbf{\Theta}^{(% t)})\right]sansserif_c start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = blackboard_E start_POSTSUBSCRIPT italic_p ( bold_E , bold_X | bold_Y ; bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) end_POSTSUBSCRIPT [ roman_ln italic_p ( bold_E , bold_X | bold_Y ; bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) ].

Since E-step requires the calculation of a multidimensional integral that is computationally unreasonable, we approximate the posterior by g(t)(𝐄,𝐗|𝐘)p(𝐄,𝐗|𝐘;𝚯(t))similar-to-or-equalssuperscript𝑔𝑡𝐄conditional𝐗𝐘𝑝𝐄conditional𝐗𝐘superscript𝚯𝑡g^{(t)}(\mathbf{E},\mathbf{X}|\mathbf{Y})\simeq p(\mathbf{E},\mathbf{X}|% \mathbf{Y};\mathbf{\Theta}^{(t)})italic_g start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) ≃ italic_p ( bold_E , bold_X | bold_Y ; bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ), using the EP algorithm. After the E-step, the maximization problem in (23) with the approximate posterior g(t)(𝐄,𝐗|𝐘)superscript𝑔𝑡𝐄conditional𝐗𝐘g^{(t)}(\mathbf{E},\mathbf{X}|\mathbf{Y})italic_g start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) is solved, which is described in detail in Section V-E. The EP procedure continues until it reaches the maximum number of iterations T𝑇Titalic_T. Finally, the last updated parameters at t=T𝑡𝑇t=Titalic_t = italic_T are used as the final estimates as 𝚯^𝚯(T)^𝚯superscript𝚯𝑇\hat{\mathbf{\Theta}}\triangleq\mathbf{\Theta}^{(T)}over^ start_ARG bold_Θ end_ARG ≜ bold_Θ start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT, 𝐄^𝔼g(T)(𝐄,𝐗|𝐘)[𝐄]^𝐄subscript𝔼superscript𝑔𝑇𝐄conditional𝐗𝐘delimited-[]𝐄\hat{\mathbf{E}}\triangleq\mathbb{E}_{g^{(T)}(\mathbf{E},\mathbf{X}|\mathbf{Y}% )}[\mathbf{E}]over^ start_ARG bold_E end_ARG ≜ blackboard_E start_POSTSUBSCRIPT italic_g start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) end_POSTSUBSCRIPT [ bold_E ], and 𝐗^𝔼g(T)(𝐄,𝐗|𝐘)[𝐗]^𝐗subscript𝔼superscript𝑔𝑇𝐄conditional𝐗𝐘delimited-[]𝐗\hat{\mathbf{X}}\triangleq\mathbb{E}_{g^{(T)}(\mathbf{E},\mathbf{X}|\mathbf{Y}% )}[\mathbf{X}]over^ start_ARG bold_X end_ARG ≜ blackboard_E start_POSTSUBSCRIPT italic_g start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) end_POSTSUBSCRIPT [ bold_X ].

In what follows, let us drop the iteration index t𝑡titalic_t for notation simplicity. The approximate posterior g(𝐄,𝐗|𝐘)𝑔𝐄conditional𝐗𝐘g(\mathbf{E},\mathbf{X}|\mathbf{Y})italic_g ( bold_E , bold_X | bold_Y ) is derived by minimizing the KL divergence subject to a Gaussian distribution set 𝚽𝚽\mathbf{\Phi}bold_Φ as

minimizeg𝚽KL(p(𝐄,𝐗|𝐘;𝚯)g(𝐄,𝐗|𝐘)),\displaystyle\underset{g\in\mathbf{\Phi}}{\mathrm{minimize}}\ \mathrm{KL}\left% (p(\mathbf{E},\mathbf{X}|\mathbf{Y};\mathbf{\Theta})\|g(\mathbf{E},\mathbf{X}|% \mathbf{Y})\right),start_UNDERACCENT italic_g ∈ bold_Φ end_UNDERACCENT start_ARG roman_minimize end_ARG roman_KL ( italic_p ( bold_E , bold_X | bold_Y ; bold_Θ ) ∥ italic_g ( bold_E , bold_X | bold_Y ) ) , (24)

where the approximate posterior g(𝐄,𝐗|𝐘)𝑔𝐄conditional𝐗𝐘g(\mathbf{E},\mathbf{X}|\mathbf{Y})italic_g ( bold_E , bold_X | bold_Y ) is designed as

g(𝐄,𝐗|𝐘)𝑔𝐄conditional𝐗𝐘\displaystyle g(\mathbf{E},\mathbf{X}|\mathbf{Y})italic_g ( bold_E , bold_X | bold_Y ) =Zg1Qx(𝐗)Qe(𝐄)Bx(𝐗)Be(𝐄),absentsuperscriptsubscript𝑍𝑔1superscript𝑄𝑥𝐗superscript𝑄𝑒𝐄superscript𝐵𝑥𝐗superscript𝐵𝑒𝐄\displaystyle=Z_{g}^{-1}Q^{x}(\mathbf{X})Q^{e}(\mathbf{E})B^{x}(\mathbf{X})B^{% e}(\mathbf{E}),= italic_Z start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_Q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) italic_Q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) italic_B start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) italic_B start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) , (25)

where Zg=𝐄,𝐗Qx(𝐗)Qe(𝐄)Bx(𝐗)Be(𝐄)subscript𝑍𝑔subscript𝐄𝐗superscript𝑄𝑥𝐗superscript𝑄𝑒𝐄superscript𝐵𝑥𝐗superscript𝐵𝑒𝐄Z_{g}=\int_{\mathbf{E},\mathbf{X}}Q^{x}(\mathbf{X})Q^{e}(\mathbf{E})B^{x}(% \mathbf{X})B^{e}(\mathbf{E})italic_Z start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT = ∫ start_POSTSUBSCRIPT bold_E , bold_X end_POSTSUBSCRIPT italic_Q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) italic_Q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) italic_B start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) italic_B start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) is a normalizing constant, and Qx(𝐗)superscript𝑄𝑥𝐗Q^{x}(\mathbf{X})italic_Q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ), Qe(𝐄)superscript𝑄𝑒𝐄Q^{e}(\mathbf{E})italic_Q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ), Bx(𝐗)superscript𝐵𝑥𝐗B^{x}(\mathbf{X})italic_B start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ), and Be(𝐄)superscript𝐵𝑒𝐄B^{e}(\mathbf{E})italic_B start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) are the approximate factors such that Qx(𝐗)Qe(𝐄)p(𝐘|𝐄,𝐗)similar-to-or-equalssuperscript𝑄𝑥𝐗superscript𝑄𝑒𝐄𝑝conditional𝐘𝐄𝐗Q^{x}(\mathbf{X})Q^{e}(\mathbf{E})\simeq p(\mathbf{Y}|\mathbf{E},\mathbf{X})italic_Q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) italic_Q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) ≃ italic_p ( bold_Y | bold_E , bold_X ), Bx(𝐗)p(𝐗)similar-to-or-equalssuperscript𝐵𝑥𝐗𝑝𝐗B^{x}(\mathbf{X})\simeq p(\mathbf{X})italic_B start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) ≃ italic_p ( bold_X ), and Be(𝐄)p(𝐄;𝚯)similar-to-or-equalssuperscript𝐵𝑒𝐄𝑝𝐄𝚯B^{e}(\mathbf{E})\simeq p(\mathbf{E};\bm{\Theta})italic_B start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) ≃ italic_p ( bold_E ; bold_Θ ) subject to Gaussian distribution set 𝚽𝚽\mathbf{\Phi}bold_Φ.

These approximate factors are designed as Qx(𝐗)=c,u,kqc,u,kx(xu,k)superscript𝑄𝑥𝐗subscriptproduct𝑐𝑢𝑘subscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘Q^{x}(\mathbf{X})=\prod_{c,u,k}q^{x}_{c,u,k}(x_{u,k})italic_Q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) = ∏ start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ), Qe(𝐄)=cnc,u,kqnc,u,ke(enc,u)superscript𝑄𝑒𝐄subscriptproduct𝑐subscriptproductsubscript𝑛𝑐𝑢𝑘subscriptsuperscript𝑞𝑒subscript𝑛𝑐𝑢𝑘subscript𝑒subscript𝑛𝑐𝑢Q^{e}(\mathbf{E})=\prod_{c}\prod_{n_{c},u,k}q^{e}_{n_{c},u,k}(e_{n_{c},u})italic_Q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) = ∏ start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ), Bx(𝐗)=u,kbu,kx(xu,k)superscript𝐵𝑥𝐗subscriptproduct𝑢𝑘subscriptsuperscript𝑏𝑥𝑢𝑘subscript𝑥𝑢𝑘B^{x}(\mathbf{X})=\prod_{u,k}b^{x}_{u,k}(x_{u,k})italic_B start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) = ∏ start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ), Be(𝐄)=cnc,ubnc,ue(enc,u)superscript𝐵𝑒𝐄subscriptproduct𝑐subscriptproductsubscript𝑛𝑐𝑢subscriptsuperscript𝑏𝑒subscript𝑛𝑐𝑢subscript𝑒subscript𝑛𝑐𝑢B^{e}(\mathbf{E})=\prod_{c}\prod_{n_{c},u}b^{e}_{n_{c},u}(e_{n_{c},u})italic_B start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) = ∏ start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT italic_b start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ), where qc,u,kx()subscriptsuperscript𝑞𝑥𝑐𝑢𝑘q^{x}_{c,u,k}(\cdot)italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( ⋅ ), qn,u,ke()subscriptsuperscript𝑞𝑒𝑛𝑢𝑘q^{e}_{n,u,k}(\cdot)italic_q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , italic_u , italic_k end_POSTSUBSCRIPT ( ⋅ ), bu,kx()subscriptsuperscript𝑏𝑥𝑢𝑘b^{x}_{u,k}(\cdot)italic_b start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( ⋅ ), and bu,ke()subscriptsuperscript𝑏𝑒𝑢𝑘b^{e}_{u,k}(\cdot)italic_b start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( ⋅ ) are the parameterized approximate functions defined as

qc,u,kx(xu,k)subscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘\displaystyle q^{x}_{c,u,k}(x_{u,k})italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) exp(|xu,kx^c,u,kq|2/ξc,u,kq,x),absentsuperscriptsubscript𝑥𝑢𝑘subscriptsuperscript^𝑥𝑞𝑐𝑢𝑘2subscriptsuperscript𝜉𝑞𝑥𝑐𝑢𝑘\displaystyle\triangleq\exp\left(-|x_{u,k}-\hat{x}^{q}_{c,u,k}|^{2}/\xi^{q,x}_% {c,u,k}\right),≜ roman_exp ( - | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ) , (26a)
qnc,u,ke(enc,u)subscriptsuperscript𝑞𝑒subscript𝑛𝑐𝑢𝑘subscript𝑒subscript𝑛𝑐𝑢\displaystyle q^{e}_{n_{c},u,k}(e_{n_{c},u})italic_q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) exp(|enc,ue^nc,u,kq|2/ξnc,u,kq,e),absentsuperscriptsubscript𝑒subscript𝑛𝑐𝑢subscriptsuperscript^𝑒𝑞subscript𝑛𝑐𝑢𝑘2subscriptsuperscript𝜉𝑞𝑒subscript𝑛𝑐𝑢𝑘\displaystyle\triangleq\exp\left(-|e_{n_{c},u}-\hat{e}^{q}_{n_{c},u,k}|^{2}/% \xi^{q,e}_{n_{c},u,k}\right),≜ roman_exp ( - | italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT - over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_ξ start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ) , (26b)
bu,kx(xu,k)subscriptsuperscript𝑏𝑥𝑢𝑘subscript𝑥𝑢𝑘\displaystyle b^{x}_{u,k}(x_{u,k})italic_b start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) exp(|xu,kx^u,kb|2/ξu,kb,x),absentsuperscriptsubscript𝑥𝑢𝑘subscriptsuperscript^𝑥𝑏𝑢𝑘2subscriptsuperscript𝜉𝑏𝑥𝑢𝑘\displaystyle\triangleq\exp\left(-|x_{u,k}-\hat{x}^{b}_{u,k}|^{2}/\xi^{b,x}_{u% ,k}\right),≜ roman_exp ( - | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_ξ start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) , (26c)
bnc,ue(enc,u)subscriptsuperscript𝑏𝑒subscript𝑛𝑐𝑢subscript𝑒subscript𝑛𝑐𝑢\displaystyle b^{e}_{n_{c},u}(e_{n_{c},u})italic_b start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) exp(|enc,ue^nc,ub|2/ξnc,ub,e),absentsuperscriptsubscript𝑒subscript𝑛𝑐𝑢subscriptsuperscript^𝑒𝑏subscript𝑛𝑐𝑢2subscriptsuperscript𝜉𝑏𝑒subscript𝑛𝑐𝑢\displaystyle\triangleq\exp\left(-|e_{n_{c},u}-\hat{e}^{b}_{n_{c},u}|^{2}/\xi^% {b,e}_{n_{c},u}\right),≜ roman_exp ( - | italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT - over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_ξ start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) , (26d)

where 𝝅c,u,kq,x[x^c,u,kq,ξc,u,kq,x]Tsubscriptsuperscript𝝅𝑞𝑥𝑐𝑢𝑘superscriptmatrixsubscriptsuperscript^𝑥𝑞𝑐𝑢𝑘subscriptsuperscript𝜉𝑞𝑥𝑐𝑢𝑘T\bm{\pi}^{q,x}_{c,u,k}\!\triangleq\!\begin{bmatrix}\hat{x}^{q}_{c,u,k},\xi^{q,% x}_{c,u,k}\end{bmatrix}^{\mathrm{T}}bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ≜ [ start_ARG start_ROW start_CELL over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT, 𝝅nc,u,kq,e[e^nc,u,kq,ξnc,u,kq,e]Tsubscriptsuperscript𝝅𝑞𝑒subscript𝑛𝑐𝑢𝑘superscriptmatrixsubscriptsuperscript^𝑒𝑞subscript𝑛𝑐𝑢𝑘subscriptsuperscript𝜉𝑞𝑒subscript𝑛𝑐𝑢𝑘T\bm{\pi}^{q,e}_{n_{c},u,k}\!\triangleq\!\begin{bmatrix}\hat{e}^{q}_{n_{c},u,k}% ,\xi^{q,e}_{n_{c},u,k}\end{bmatrix}^{\mathrm{T}}bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ≜ [ start_ARG start_ROW start_CELL over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT, 𝝅u,kb,x[x^u,kb,ξu,kb,x]Tsubscriptsuperscript𝝅𝑏𝑥𝑢𝑘superscriptmatrixsubscriptsuperscript^𝑥𝑏𝑢𝑘subscriptsuperscript𝜉𝑏𝑥𝑢𝑘T\bm{\pi}^{b,x}_{u,k}\!\triangleq\!\begin{bmatrix}\hat{x}^{b}_{u,k},\xi^{b,x}_{% u,k}\end{bmatrix}^{\mathrm{T}}bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ≜ [ start_ARG start_ROW start_CELL over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT, and 𝝅nc,ub,e[e^nc,ub,ξnc,ub,e]Tsubscriptsuperscript𝝅𝑏𝑒subscript𝑛𝑐𝑢superscriptmatrixsubscriptsuperscript^𝑒𝑏subscript𝑛𝑐𝑢subscriptsuperscript𝜉𝑏𝑒subscript𝑛𝑐𝑢T\bm{\pi}^{b,e}_{n_{c},u}\!\triangleq\!\begin{bmatrix}\hat{e}^{b}_{n_{c},u},\xi% ^{b,e}_{n_{c},u}\end{bmatrix}^{\mathrm{T}}bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ≜ [ start_ARG start_ROW start_CELL over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT are unknown parameters to be optimized by minimizing the KL divergence.

Since the approximate posterior g(𝐄,𝐗|𝐘)𝑔𝐄conditional𝐗𝐘g(\mathbf{E},\mathbf{X}|\mathbf{Y})italic_g ( bold_E , bold_X | bold_Y ) in (25) is designed subject to Gaussian distribution set 𝚽𝚽\mathbf{\Phi}bold_Φ, the marginalized approximate posterior g(xu,k|𝐘)𝑔conditionalsubscript𝑥𝑢𝑘𝐘g(x_{u,k}|\mathbf{Y})italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) and g(enc,u|𝐘)𝑔conditionalsubscript𝑒subscript𝑛𝑐𝑢𝐘g(e_{n_{c},u}|\mathbf{Y})italic_g ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | bold_Y ) can be expressed as g(xu,k|𝐘)=𝒞𝒩(x^u,k,ξu,kx)𝑔conditionalsubscript𝑥𝑢𝑘𝐘𝒞𝒩subscript^𝑥𝑢𝑘subscriptsuperscript𝜉𝑥𝑢𝑘g(x_{u,k}|\mathbf{Y})=\mathcal{CN}(\hat{x}_{u,k},\xi^{x}_{u,k})italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) = caligraphic_C caligraphic_N ( over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) and g(enc,u|𝐘)=𝒞𝒩(e^nc,u,ξnc,ue)𝑔conditionalsubscript𝑒subscript𝑛𝑐𝑢𝐘𝒞𝒩subscript^𝑒subscript𝑛𝑐𝑢subscriptsuperscript𝜉𝑒subscript𝑛𝑐𝑢g(e_{n_{c},u}|\mathbf{Y})=\mathcal{CN}(\hat{e}_{n_{c},u},\xi^{e}_{n_{c},u})italic_g ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | bold_Y ) = caligraphic_C caligraphic_N ( over^ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ), where x^u,ksubscript^𝑥𝑢𝑘\hat{x}_{u,k}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT and e^nc,usubscript^𝑒subscript𝑛𝑐𝑢\hat{e}_{n_{c},u}over^ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT are the posterior means, and ξu,kxsubscriptsuperscript𝜉𝑥𝑢𝑘\xi^{x}_{u,k}italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT and ξnc,uesubscriptsuperscript𝜉𝑒subscript𝑛𝑐𝑢\xi^{e}_{n_{c},u}italic_ξ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT are the posterior variances.

Let 𝚷{𝝅c,u,kq,x,𝝅u,kb,x,𝝅nc,u,kq,e,𝝅nc,ub,e}c𝒞,nc𝒩,u𝒰,k𝒦𝚷subscriptsubscriptsuperscript𝝅𝑞𝑥𝑐𝑢𝑘subscriptsuperscript𝝅𝑏𝑥𝑢𝑘subscriptsuperscript𝝅𝑞𝑒subscript𝑛𝑐𝑢𝑘subscriptsuperscript𝝅𝑏𝑒subscript𝑛𝑐𝑢formulae-sequence𝑐𝒞formulae-sequencesubscript𝑛𝑐𝒩formulae-sequence𝑢𝒰𝑘𝒦\mathbf{\Pi}\triangleq\{\bm{\pi}^{q,x}_{c,u,k},\ \bm{\pi}^{b,x}_{u,k},\ \bm{% \pi}^{q,e}_{n_{c},u,k},\ \bm{\pi}^{b,e}_{n_{c},u}\}_{c\in\mathcal{C},n_{c}\in% \mathcal{N},u\in\mathcal{U},k\in\mathcal{K}}bold_Π ≜ { bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT , bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT , bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT , bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_c ∈ caligraphic_C , italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ∈ caligraphic_N , italic_u ∈ caligraphic_U , italic_k ∈ caligraphic_K end_POSTSUBSCRIPT denote an unknown parameter set to be optimized. The optimal unknown parameter set 𝚷^^𝚷\hat{\mathbf{\Pi}}over^ start_ARG bold_Π end_ARG is obtained by minimizing the KL divergence in (24). However, the objective function cannot be expressed in closed-form because KL(p(𝐄,𝐗|𝐘;𝚯)g(𝐄,𝐗|𝐘))\mathrm{KL}\left(p(\mathbf{E},\mathbf{X}|\mathbf{Y};\mathbf{\Theta})\|g(% \mathbf{E},\mathbf{X}|\mathbf{Y})\right)roman_KL ( italic_p ( bold_E , bold_X | bold_Y ; bold_Θ ) ∥ italic_g ( bold_E , bold_X | bold_Y ) ) includes intractable integral operations with respect to the true posterior p(𝐄,𝐗|𝐘;𝚯)𝑝𝐄conditional𝐗𝐘𝚯p(\mathbf{E},\mathbf{X}|\mathbf{Y};\mathbf{\Theta})italic_p ( bold_E , bold_X | bold_Y ; bold_Θ ). To tackle this, we set the target distribution p^(𝐄,𝐗|𝐘)^𝑝𝐄conditional𝐗𝐘\hat{p}(\mathbf{E},\mathbf{X}|\mathbf{Y})over^ start_ARG italic_p end_ARG ( bold_E , bold_X | bold_Y ) instead of the true posterior p(𝐄,𝐗|𝐘;𝚯)𝑝𝐄conditional𝐗𝐘𝚯p(\mathbf{E},\mathbf{X}|\mathbf{Y};\mathbf{\Theta})italic_p ( bold_E , bold_X | bold_Y ; bold_Θ ) into the KL divergence in (24). The target distribution is designed by replacing a part of the true posterior with the approximate functions in (26a)-(26d) as described in the following sections. For the sake of notation convenience for the design of the target distribution in the following section, the approximate distribution lc,kx(𝐄c,𝐱¯k)p(𝐲c,k|𝐄c,𝐱¯k)similar-to-or-equalssubscriptsuperscript𝑙𝑥𝑐𝑘subscript𝐄𝑐subscript¯𝐱𝑘𝑝conditionalsubscript𝐲𝑐𝑘subscript𝐄𝑐subscript¯𝐱𝑘l^{x}_{c,k}(\mathbf{E}_{c},\bar{\mathbf{x}}_{k})\simeq p(\mathbf{y}_{c,k}|% \mathbf{E}_{c},\bar{\mathbf{x}}_{k})italic_l start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ( bold_E start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ≃ italic_p ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | bold_E start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) and lnc,ke(𝐞¯nc,𝐱¯k)p(ync,k|𝐞¯nc,𝐱¯k)similar-to-or-equalssubscriptsuperscript𝑙𝑒subscript𝑛𝑐𝑘subscript¯𝐞subscript𝑛𝑐subscript¯𝐱𝑘𝑝conditionalsubscript𝑦subscript𝑛𝑐𝑘subscript¯𝐞subscript𝑛𝑐subscript¯𝐱𝑘l^{e}_{n_{c},k}(\bar{\mathbf{e}}_{n_{c}},\bar{\mathbf{x}}_{k})\simeq p(y_{n_{c% },k}|\bar{\mathbf{e}}_{n_{c}},\bar{\mathbf{x}}_{k})italic_l start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k end_POSTSUBSCRIPT ( over¯ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ≃ italic_p ( italic_y start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k end_POSTSUBSCRIPT | over¯ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) are expressed using (26a)-(26d) as

lc,kx(𝐄c,𝐱¯k)u𝒰qc,u,kx(xu,k)u𝒰nc𝒩cqnc,u,ke(enc,u),proportional-tosubscriptsuperscript𝑙𝑥𝑐𝑘subscript𝐄𝑐subscript¯𝐱𝑘subscriptproduct𝑢𝒰subscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘subscriptproduct𝑢𝒰subscriptproductsubscript𝑛𝑐subscript𝒩𝑐subscriptsuperscript𝑞𝑒subscript𝑛𝑐𝑢𝑘subscript𝑒subscript𝑛𝑐𝑢\displaystyle l^{x}_{c,k}(\mathbf{E}_{c},\bar{\mathbf{x}}_{k})\!\propto\!\prod% _{u\in\mathcal{U}}q^{x}_{c,u,k}(x_{u,k})\prod_{u\in\mathcal{U}}\prod_{n_{c}\in% \mathcal{N}_{c}}\!q^{e}_{n_{c},u,k}(e_{n_{c},u}),\!\!\!italic_l start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ( bold_E start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ∝ ∏ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ∏ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ∈ caligraphic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) , (27a)
lnc,ke(𝐞¯nc,𝐱¯k)u𝒰{qc,u,kx(xu,k)}1Ncu𝒰qnc,u,ke(enc,u).proportional-tosubscriptsuperscript𝑙𝑒subscript𝑛𝑐𝑘subscript¯𝐞subscript𝑛𝑐subscript¯𝐱𝑘subscriptproduct𝑢𝒰superscriptsubscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘1subscript𝑁𝑐subscriptproduct𝑢𝒰subscriptsuperscript𝑞𝑒subscript𝑛𝑐𝑢𝑘subscript𝑒subscript𝑛𝑐𝑢\displaystyle l^{e}_{n_{c},k}(\bar{\mathbf{e}}_{n_{c}},\bar{\mathbf{x}}_{k})\!% \propto\!\prod_{u\in\mathcal{U}}\!\!\left\{q^{x}_{c,u,k}(x_{u,k})\right\}^{% \frac{1}{N_{c}}}\!\!\prod_{u\in\mathcal{U}}q^{e}_{n_{c},u,k}(e_{n_{c},u}).\!\!\!italic_l start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k end_POSTSUBSCRIPT ( over¯ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ∝ ∏ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT { italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) } start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) . (27b)

To solve the KL minimization problem, the alternating optimization algorithm [30] is utilized, where a target parameter is optimized while the other parameters are fixed. In what follows, the estimation method for {𝝅c,u,kq,x,𝝅u,kb,x}subscriptsuperscript𝝅𝑞𝑥𝑐𝑢𝑘subscriptsuperscript𝝅𝑏𝑥𝑢𝑘\{\bm{\pi}^{q,x}_{c,u,k},\bm{\pi}^{b,x}_{u,k}\}{ bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT , bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT } and {𝝅n,ub,e\{\bm{\pi}^{b,e}_{n,u}{ bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , italic_u end_POSTSUBSCRIPT, 𝝅n,u,kq,e}\bm{\pi}^{q,e}_{n,u,k}\}bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , italic_u , italic_k end_POSTSUBSCRIPT } is described in Section V-C and V-D, respectively.

V-C EP for Data Estimation

V-C1 Update 𝝅c,u,kq,xsubscriptsuperscript𝝅𝑞𝑥𝑐𝑢𝑘\bm{\pi}^{q,x}_{c,u,k}bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT

While the parameter 𝝅c,u,kq,xsubscriptsuperscript𝝅𝑞𝑥𝑐𝑢𝑘\bm{\pi}^{q,x}_{c,u,k}bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT in qc,u,kx(xu,k)subscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘q^{x}_{c,u,k}(x_{u,k})italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) is updated, the other parameters 𝚷{𝝅c,u,kq,x}𝚷subscriptsuperscript𝝅𝑞𝑥𝑐𝑢𝑘\mathbf{\Pi}\setminus\{\bm{\pi}^{q,x}_{c,u,k}\}bold_Π ∖ { bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT } are fixed as the tentative estimated values, that is, the KL minimization problem for 𝝅c,u,kq,xsubscriptsuperscript𝝅𝑞𝑥𝑐𝑢𝑘\bm{\pi}^{q,x}_{c,u,k}bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT is formulated as

minimize𝝅c,u,kq,xKL(p^c,kq,x(𝐄,𝐗|𝐘)g(𝐄,𝐗|𝐘)),\displaystyle\underset{\bm{\pi}^{q,x}_{c,u,k}}{\mathrm{minimize}}\ \mathrm{KL}% \left(\hat{p}_{c,k}^{q,x}(\mathbf{E},\mathbf{X}|\mathbf{Y})\|g(\mathbf{E},% \mathbf{X}|\mathbf{Y})\right),start_UNDERACCENT bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_minimize end_ARG roman_KL ( over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) ∥ italic_g ( bold_E , bold_X | bold_Y ) ) , (28)

where p^c,kq,x(𝐄,𝐗|𝐘)superscriptsubscript^𝑝𝑐𝑘𝑞𝑥𝐄conditional𝐗𝐘\hat{p}_{c,k}^{q,x}(\mathbf{E},\mathbf{X}|\mathbf{Y})over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) is the target distribution for 𝝅c,u,kq,xsubscriptsuperscript𝝅𝑞𝑥𝑐𝑢𝑘\bm{\pi}^{q,x}_{c,u,k}bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT, which is designed using lc,kx(𝐄c,𝐱¯k)subscriptsuperscript𝑙𝑥𝑐𝑘subscript𝐄𝑐subscript¯𝐱𝑘l^{x}_{c,k}(\mathbf{E}_{c},\bar{\mathbf{x}}_{k})italic_l start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ( bold_E start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) in (27a) as

p^c,kq,x(\displaystyle\hat{p}_{c,k}^{q,x}(over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT ( 𝐄,𝐗|𝐘)=Cc,kq,xp(𝐲c,k|𝐄c,𝐱¯k)\displaystyle\mathbf{E},\mathbf{X}|\mathbf{Y})=C_{c,k}^{q,x}\ p(\mathbf{y}_{c,% k}|\mathbf{E}_{c},\bar{\mathbf{x}}_{k})bold_E , bold_X | bold_Y ) = italic_C start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT italic_p ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | bold_E start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT )
(c,k)𝒞×𝒦(c,k)lc,kx(𝐄c,𝐱k)p(𝐲c,k|𝐄c,𝐱k)Bx(𝐗)Be(𝐄)p(𝐗)p(𝐄;𝚯),subscriptproductsuperscript𝑐superscript𝑘𝒞𝒦𝑐𝑘subscriptsubscriptsuperscript𝑙𝑥superscript𝑐superscript𝑘subscript𝐄superscript𝑐subscript𝐱superscript𝑘similar-to-or-equalsabsent𝑝conditionalsubscript𝐲superscript𝑐𝑘subscript𝐄superscript𝑐subscript𝐱superscript𝑘subscriptsuperscript𝐵𝑥𝐗superscript𝐵𝑒𝐄similar-to-or-equalsabsent𝑝𝐗𝑝𝐄𝚯\displaystyle\prod_{(c^{\prime},k^{\prime})\in\mathcal{C}\times\mathcal{K}% \setminus(c,k)}\underbrace{l^{x}_{c^{\prime},k^{\prime}}(\mathbf{E}_{c^{\prime% }},\mathbf{x}_{k^{\prime}})}_{\simeq p(\mathbf{y}_{c^{\prime},k}|\mathbf{E}_{c% ^{\prime}},\mathbf{x}_{k^{\prime}})}\underbrace{B^{x}(\mathbf{X})B^{e}(\mathbf% {E})}_{\simeq p(\mathbf{X})p(\mathbf{E};\mathbf{\Theta})},∏ start_POSTSUBSCRIPT ( italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∈ caligraphic_C × caligraphic_K ∖ ( italic_c , italic_k ) end_POSTSUBSCRIPT under⏟ start_ARG italic_l start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( bold_E start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , bold_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( bold_y start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT | bold_E start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , bold_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT under⏟ start_ARG italic_B start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) italic_B start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( bold_X ) italic_p ( bold_E ; bold_Θ ) end_POSTSUBSCRIPT , (29)

where Cc,kq,xsuperscriptsubscript𝐶𝑐𝑘𝑞𝑥C_{c,k}^{q,x}italic_C start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT is a normalizing constant.

Let c,u,kq,x(𝝅c,u,kq,x)KL(p^c,kq,x(𝐄,𝐗|𝐘)g(𝐄,𝐗|𝐘))\mathcal{L}^{q,x}_{c,u,k}(\bm{\pi}^{q,x}_{c,u,k})\triangleq\mathrm{KL}\left(% \hat{p}_{c,k}^{q,x}(\mathbf{E},\mathbf{X}|\mathbf{Y})\|g(\mathbf{E},\mathbf{X}% |\mathbf{Y})\right)caligraphic_L start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ) ≜ roman_KL ( over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) ∥ italic_g ( bold_E , bold_X | bold_Y ) ) denote the objective function in (28), resorting to

c,u,kq,x=lnZg𝔼p^c,kq,x(xu,k|𝐘)[lnqc,u,kx(xu,k)]+const.subscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑍𝑔subscript𝔼subscriptsuperscript^𝑝𝑞𝑥𝑐𝑘conditionalsubscript𝑥𝑢𝑘𝐘delimited-[]subscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘const\displaystyle\mathcal{L}^{q,x}_{c,u,k}=\!\ln Z_{g}\!-\!\mathbb{E}_{\hat{p}^{q,% x}_{c,k}(x_{u,k}|\mathbf{Y})}\!\left[\ln q^{x}_{c,u,k}(x_{u,k})\right]+\mathrm% {const}.caligraphic_L start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT = roman_ln italic_Z start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT - blackboard_E start_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) end_POSTSUBSCRIPT [ roman_ln italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ] + roman_const . (30)

Since the objective function c,u,kq,x(𝝅c,u,kq,x)subscriptsuperscript𝑞𝑥𝑐𝑢𝑘superscriptsubscript𝝅𝑐𝑢𝑘𝑞𝑥\mathcal{L}^{q,x}_{c,u,k}(\bm{\pi}_{c,u,k}^{q,x})caligraphic_L start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_italic_π start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT ) is convex with respect to 𝝅c,u,kq,xsuperscriptsubscript𝝅𝑐𝑢𝑘𝑞𝑥\bm{\pi}_{c,u,k}^{q,x}bold_italic_π start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT, the necessary and sufficient condition for the global optimal , i.e., c.u,kq,x/𝝅c,u,kq,x=𝟎subscriptsuperscript𝑞𝑥formulae-sequence𝑐𝑢𝑘superscriptsubscript𝝅𝑐𝑢𝑘𝑞𝑥0\partial\mathcal{L}^{q,x}_{c.u,k}/\partial\bm{\pi}_{c,u,k}^{q,x}=\mathbf{0}∂ caligraphic_L start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c . italic_u , italic_k end_POSTSUBSCRIPT / ∂ bold_italic_π start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT = bold_0, is equivalent to

g(xu,k|𝐘)=proj𝚽[p^c,kq,x(xu,k|𝐘)],𝑔conditionalsubscript𝑥𝑢𝑘𝐘subscriptproj𝚽delimited-[]subscriptsuperscript^𝑝𝑞𝑥𝑐𝑘conditionalsubscript𝑥𝑢𝑘𝐘\displaystyle g(x_{u,k}|\mathbf{Y})=\mathrm{proj}_{\mathbf{\Phi}}\left[\hat{p}% ^{q,x}_{c,k}(x_{u,k}|\mathbf{Y})\right],italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) = roman_proj start_POSTSUBSCRIPT bold_Φ end_POSTSUBSCRIPT [ over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) ] , (31)

where proj𝚽[p(x)]𝒞𝒩(𝔼p(x)[x],𝕍p(x)[x])subscriptproj𝚽delimited-[]𝑝𝑥𝒞𝒩subscript𝔼𝑝𝑥delimited-[]𝑥subscript𝕍𝑝𝑥delimited-[]𝑥\mathrm{proj}_{\mathbf{\Phi}}\left[p(x)\right]\triangleq\mathcal{CN}\left(% \mathbb{E}_{p(x)}[x],\ \mathbb{V}_{p(x)}[x]\right)roman_proj start_POSTSUBSCRIPT bold_Φ end_POSTSUBSCRIPT [ italic_p ( italic_x ) ] ≜ caligraphic_C caligraphic_N ( blackboard_E start_POSTSUBSCRIPT italic_p ( italic_x ) end_POSTSUBSCRIPT [ italic_x ] , blackboard_V start_POSTSUBSCRIPT italic_p ( italic_x ) end_POSTSUBSCRIPT [ italic_x ] ) is the projection operator onto Gaussian distribution set 𝚽𝚽\mathbf{\Phi}bold_Φ, which indicates the moment matching, i.e., the first and second moments of distribution p(x)𝑝𝑥p(x)italic_p ( italic_x ) matches those of the target distribution.

The marginalized approximate posterior g(xu,k|𝐘)=𝐄,𝐗xu,kg(𝐄,𝐗|𝐘)𝑔conditionalsubscript𝑥𝑢𝑘𝐘subscript𝐄𝐗subscript𝑥𝑢𝑘𝑔𝐄conditional𝐗𝐘g(x_{u,k}|\mathbf{Y})=\int_{\mathbf{E},\mathbf{X}\setminus x_{u,k}}g(\mathbf{E% },\mathbf{X}|\mathbf{Y})italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) = ∫ start_POSTSUBSCRIPT bold_E , bold_X ∖ italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_g ( bold_E , bold_X | bold_Y ) in (31) is written as

g(xu,k|𝐘)𝑔conditionalsubscript𝑥𝑢𝑘𝐘\displaystyle g(x_{u,k}|\mathbf{Y})italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) qc,u,kx(xu,k)vc,u,kx(xu,k),proportional-toabsentsubscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘subscriptsuperscript𝑣𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘\displaystyle\propto q^{x}_{c,u,k}(x_{u,k})v^{x}_{c,u,k}(x_{u,k}),∝ italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) italic_v start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) , (32)

with vc,u,kx(xu,k)bu,kx(xu,k)c𝒞cqc,u,kx(xu,k)subscriptsuperscript𝑣𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘superscriptsubscript𝑏𝑢𝑘𝑥subscript𝑥𝑢𝑘subscriptproductsuperscript𝑐𝒞𝑐subscriptsuperscript𝑞𝑥superscript𝑐𝑢𝑘subscript𝑥𝑢𝑘v^{x}_{c,u,k}(x_{u,k})\triangleq b_{u,k}^{x}(x_{u,k})\prod_{c^{\prime}\in% \mathcal{C}\setminus c}q^{x}_{c^{\prime},u,k}(x_{u,k})italic_v start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ≜ italic_b start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ∏ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_C ∖ italic_c end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ), which can also be represented as

vc,u,kx(xu,k)subscriptsuperscript𝑣𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘\displaystyle v^{x}_{c,u,k}(x_{u,k})italic_v start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) =Cc,u,kv,xexp(|xu,kx^c,u,kv|2/ξc,u,kv,x),absentsubscriptsuperscript𝐶𝑣𝑥𝑐𝑢𝑘superscriptsubscript𝑥𝑢𝑘subscriptsuperscript^𝑥𝑣𝑐𝑢𝑘2subscriptsuperscript𝜉𝑣𝑥𝑐𝑢𝑘\displaystyle=C^{v,x}_{c,u,k}\exp\left(-|x_{u,k}-\hat{x}^{v}_{c,u,k}|^{2}/\xi^% {v,x}_{c,u,k}\right),= italic_C start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT roman_exp ( - | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_ξ start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ) , (33)

with the normalizing constant Cc,u,kv,xsubscriptsuperscript𝐶𝑣𝑥𝑐𝑢𝑘C^{v,x}_{c,u,k}italic_C start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT.

The marginalized target distribution p^c,kq,x(xu,k|𝐘)=𝐄,𝐗xu,kp^c,kq,x(𝐄,𝐗|𝐘)subscriptsuperscript^𝑝𝑞𝑥𝑐𝑘conditionalsubscript𝑥𝑢𝑘𝐘subscript𝐄𝐗subscript𝑥𝑢𝑘subscriptsuperscript^𝑝𝑞𝑥𝑐𝑘𝐄conditional𝐗𝐘\hat{p}^{q,x}_{c,k}(x_{u,k}|\mathbf{Y})=\int_{\mathbf{E},\mathbf{X}\setminus x% _{u,k}}\hat{p}^{q,x}_{c,k}(\mathbf{E},\mathbf{X}|\mathbf{Y})over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) = ∫ start_POSTSUBSCRIPT bold_E , bold_X ∖ italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ( bold_E , bold_X | bold_Y ) in (31) is written as

p^c,kq,x(xu,k|𝐘)subscriptsuperscript^𝑝𝑞𝑥𝑐𝑘conditionalsubscript𝑥𝑢𝑘𝐘\displaystyle\hat{p}^{q,x}_{c,k}(x_{u,k}|\mathbf{Y})over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) =p¯c,u,kx(𝐲c,k|xu,k)vc,u,kx(xu,k),absentsubscriptsuperscript¯𝑝𝑥𝑐𝑢𝑘conditionalsubscript𝐲𝑐𝑘subscript𝑥𝑢𝑘subscriptsuperscript𝑣𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘\displaystyle=\bar{p}^{x}_{c,u,k}(\mathbf{y}_{c,k}|x_{u,k})v^{x}_{c,u,k}(x_{u,% k}),= over¯ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) italic_v start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) , (34)

where p¯c,u,kx(𝐲c,k|xu,k)subscriptsuperscript¯𝑝𝑥𝑐𝑢𝑘conditionalsubscript𝐲𝑐𝑘subscript𝑥𝑢𝑘\bar{p}^{x}_{c,u,k}(\mathbf{y}_{c,k}|x_{u,k})over¯ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) is the conditional probability distribution defined in (35) in the top of next page along with vnc,u,ke(enc,u)subscriptsuperscript𝑣𝑒subscript𝑛𝑐𝑢𝑘subscript𝑒subscript𝑛𝑐𝑢v^{e}_{n_{c},u,k}(e_{n_{c},u})italic_v start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ), and v~c,u,ke(𝐞c,u)subscriptsuperscript~𝑣𝑒𝑐𝑢𝑘subscript𝐞𝑐𝑢\tilde{v}^{e}_{c,u,k}(\mathbf{e}_{c,u})over~ start_ARG italic_v end_ARG start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_e start_POSTSUBSCRIPT italic_c , italic_u end_POSTSUBSCRIPT ), with C¯c,u,kxsubscriptsuperscript¯𝐶𝑥𝑐𝑢𝑘\bar{C}^{x}_{c,u,k}over¯ start_ARG italic_C end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT being the normalizing constant and

𝐞^c,u,kvsubscriptsuperscript^𝐞𝑣𝑐𝑢𝑘\displaystyle\hat{\mathbf{e}}^{v}_{c,u,k}over^ start_ARG bold_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT =[e^nc(1),u,kv,e^nc(2),u,kv,,e^nc(Nc),u,kv]TNc×1,absentsuperscriptmatrixsubscriptsuperscript^𝑒𝑣subscript𝑛𝑐1𝑢𝑘subscriptsuperscript^𝑒𝑣subscript𝑛𝑐2𝑢𝑘subscriptsuperscript^𝑒𝑣subscript𝑛𝑐subscript𝑁𝑐𝑢𝑘Tsuperscriptsubscript𝑁𝑐1\displaystyle=\begin{bmatrix}\hat{e}^{v}_{n_{c(1)},u,k},\hat{e}^{v}_{n_{c(2)},% u,k},\ldots,\hat{e}^{v}_{n_{c(N_{c})},u,k}\end{bmatrix}^{\mathrm{T}}\in\mathbb% {C}^{N_{c}\times 1},= [ start_ARG start_ROW start_CELL over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c ( 1 ) end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT , over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c ( 2 ) end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT , … , over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c ( italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT ,
𝚵c,u,kv,esubscriptsuperscript𝚵𝑣𝑒𝑐𝑢𝑘\displaystyle\mathbf{\Xi}^{v,e}_{c,u,k}\!\!bold_Ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT =diag(ξnc(1),u,kv,e,ξnc(2),u,kv,e,,ξnc(Nc),u,kv,e)Nc×Nc.absentdiagsubscriptsuperscript𝜉𝑣𝑒subscript𝑛𝑐1𝑢𝑘subscriptsuperscript𝜉𝑣𝑒subscript𝑛𝑐2𝑢𝑘subscriptsuperscript𝜉𝑣𝑒subscript𝑛𝑐subscript𝑁𝑐𝑢𝑘superscriptsubscript𝑁𝑐subscript𝑁𝑐\displaystyle=\!\mathrm{diag}\left(\!\xi^{v,e}_{n_{c(1)},u,k},\xi^{v,e}_{n_{c(% 2)},u,k},\ldots,\xi^{v,e}_{n_{c(N_{c})},u,k}\!\right)\in\mathbb{R}^{N_{c}% \times N_{c}}.= roman_diag ( italic_ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c ( 1 ) end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c ( 2 ) end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT , … , italic_ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c ( italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_POSTSUPERSCRIPT .
p¯c,u,kx(𝐲c,k|xu,k)subscriptsuperscript¯𝑝𝑥𝑐𝑢𝑘conditionalsubscript𝐲𝑐𝑘subscript𝑥𝑢𝑘\displaystyle\bar{p}^{x}_{c,u,k}(\mathbf{y}_{c,k}|x_{u,k})over¯ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) C¯c,u,kx𝐄c,𝐱¯kxu,kp(𝐲c,k|𝐄c,𝐱¯k)u𝒰uvc,u,kx(xu,k)u𝒰nc𝒩cvnc,u,ke(enc,u),absentsubscriptsuperscript¯𝐶𝑥𝑐𝑢𝑘subscriptsubscript𝐄𝑐subscript¯𝐱𝑘subscript𝑥𝑢𝑘𝑝conditionalsubscript𝐲𝑐𝑘subscript𝐄𝑐subscript¯𝐱𝑘subscriptproductsuperscript𝑢𝒰𝑢subscriptsuperscript𝑣𝑥𝑐superscript𝑢𝑘subscript𝑥superscript𝑢𝑘subscriptproductsuperscript𝑢𝒰subscriptproductsubscript𝑛𝑐subscript𝒩𝑐subscriptsuperscript𝑣𝑒subscript𝑛𝑐superscript𝑢𝑘subscript𝑒subscript𝑛𝑐superscript𝑢\displaystyle\triangleq\bar{C}^{x}_{c,u,k}\int_{\mathbf{E}_{c},\bar{\mathbf{x}% }_{k}\setminus x_{u,k}}p(\mathbf{y}_{c,k}|\mathbf{E}_{c},\bar{\mathbf{x}}_{k})% \prod_{u^{\prime}\in\mathcal{U}\setminus u}v^{x}_{c,u^{\prime},k}(x_{u^{\prime% },k})\prod_{u^{\prime}\in\mathcal{U}}\prod_{n_{c}\in\mathcal{N}_{c}}v^{e}_{n_{% c},u^{\prime},k}(e_{n_{c},u^{\prime}}),≜ over¯ start_ARG italic_C end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT bold_E start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_p ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | bold_E start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ∏ start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U ∖ italic_u end_POSTSUBSCRIPT italic_v start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT ) ∏ start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ∈ caligraphic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) , (35)
vnc,u,ke(enc,u)subscriptsuperscript𝑣𝑒subscript𝑛𝑐𝑢𝑘subscript𝑒subscript𝑛𝑐𝑢\displaystyle v^{e}_{n_{c},u,k}(e_{n_{c},u})italic_v start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) bnc,ue(enc,u)k𝒦kqnc,u,ke(enc,u)=Cnc,u,kv,eexp(|enc,ue^nc,u,kv|2/ξnc,u,kv,e),absentsuperscriptsubscript𝑏subscript𝑛𝑐𝑢𝑒subscript𝑒subscript𝑛𝑐𝑢subscriptproductsuperscript𝑘𝒦𝑘subscriptsuperscript𝑞𝑒subscript𝑛𝑐𝑢superscript𝑘subscript𝑒subscript𝑛𝑐𝑢subscriptsuperscript𝐶𝑣𝑒subscript𝑛𝑐𝑢𝑘superscriptsubscript𝑒subscript𝑛𝑐𝑢subscriptsuperscript^𝑒𝑣subscript𝑛𝑐𝑢𝑘2subscriptsuperscript𝜉𝑣𝑒subscript𝑛𝑐𝑢𝑘\displaystyle\triangleq b_{n_{c},u}^{e}(e_{n_{c},u})\prod_{k^{\prime}\in% \mathcal{K}\setminus k}q^{e}_{n_{c},u,k^{\prime}}(e_{n_{c},u})=C^{v,e}_{n_{c},% u,k}\exp\left(-|e_{n_{c},u}-\hat{e}^{v}_{n_{c},u,k}|^{2}/\xi^{v,e}_{n_{c},u,k}% \right),≜ italic_b start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) ∏ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_K ∖ italic_k end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) = italic_C start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT roman_exp ( - | italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT - over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ) , (36)
v~c,u,ke(𝐞c,u)subscriptsuperscript~𝑣𝑒𝑐𝑢𝑘subscript𝐞𝑐𝑢\displaystyle\tilde{v}^{e}_{c,u,k}(\mathbf{e}_{c,u})over~ start_ARG italic_v end_ARG start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_e start_POSTSUBSCRIPT italic_c , italic_u end_POSTSUBSCRIPT ) nc𝒩cvnc,u,ke(enc,u)exp{(𝐞c,u𝐞^c,u,kv)H𝚵c,u,ke,v1(𝐞c,u𝐞^c,u,kv)}.absentsubscriptproductsubscript𝑛𝑐subscript𝒩𝑐subscriptsuperscript𝑣𝑒subscript𝑛𝑐𝑢𝑘subscript𝑒subscript𝑛𝑐𝑢proportional-tosuperscriptsubscript𝐞𝑐𝑢subscriptsuperscript^𝐞𝑣𝑐𝑢𝑘Hsubscriptsuperscript𝚵𝑒𝑣1𝑐𝑢𝑘subscript𝐞𝑐𝑢subscriptsuperscript^𝐞𝑣𝑐𝑢𝑘\displaystyle\triangleq\prod_{n_{c}\in\mathcal{N}_{c}}v^{e}_{n_{c},u,k}(e_{n_{% c},u})\propto\exp\left\{-(\mathbf{e}_{c,u}-\hat{\mathbf{e}}^{v}_{c,u,k})^{% \mathrm{H}}\mathbf{\Xi}^{e,v-1}_{c,u,k}(\mathbf{e}_{c,u}-\hat{\mathbf{e}}^{v}_% {c,u,k})\right\}.≜ ∏ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ∈ caligraphic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_v start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) ∝ roman_exp { - ( bold_e start_POSTSUBSCRIPT italic_c , italic_u end_POSTSUBSCRIPT - over^ start_ARG bold_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT roman_H end_POSTSUPERSCRIPT bold_Ξ start_POSTSUPERSCRIPT italic_e , italic_v - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_e start_POSTSUBSCRIPT italic_c , italic_u end_POSTSUBSCRIPT - over^ start_ARG bold_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ) } . (37)

From the conditional distribution p¯c,u,kx(𝐲c,k|xu,k)subscriptsuperscript¯𝑝𝑥𝑐𝑢𝑘conditionalsubscript𝐲𝑐𝑘subscript𝑥𝑢𝑘\bar{p}^{x}_{c,u,k}(\mathbf{y}_{c,k}|x_{u,k})over¯ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) in (35), the mean 𝐲~c,u,kx𝔼p¯c,u,kx(𝐲c,k|xu,k)[𝐲c,k]subscriptsuperscript~𝐲𝑥𝑐𝑢𝑘subscript𝔼subscriptsuperscript¯𝑝𝑥𝑐𝑢𝑘conditionalsubscript𝐲𝑐𝑘subscript𝑥𝑢𝑘delimited-[]subscript𝐲𝑐𝑘\tilde{\mathbf{y}}^{x}_{c,u,k}\triangleq\mathbb{E}_{\bar{p}^{x}_{c,u,k}(% \mathbf{y}_{c,k}|x_{u,k})}\left[\mathbf{y}_{c,k}\right]over~ start_ARG bold_y end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ≜ blackboard_E start_POSTSUBSCRIPT over¯ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT [ bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ] and covariance 𝛀c,kx𝔼p¯c,u,kx(𝐲c,k|xu,k)[(𝐲c,k𝐲~c,u,kx)(𝐲c,k𝐲~c,u,kx)H]subscriptsuperscript𝛀𝑥𝑐𝑘subscript𝔼subscriptsuperscript¯𝑝𝑥𝑐𝑢𝑘conditionalsubscript𝐲𝑐𝑘subscript𝑥𝑢𝑘delimited-[]subscript𝐲𝑐𝑘subscriptsuperscript~𝐲𝑥𝑐𝑢𝑘superscriptsubscript𝐲𝑐𝑘subscriptsuperscript~𝐲𝑥𝑐𝑢𝑘H\mathbf{\Omega}^{x}_{c,k}\triangleq\mathbb{E}_{\bar{p}^{x}_{c,u,k}(\mathbf{y}_% {c,k}|x_{u,k})}\left[(\mathbf{y}_{c,k}-\tilde{\mathbf{y}}^{x}_{c,u,k})(\mathbf% {y}_{c,k}-\tilde{\mathbf{y}}^{x}_{c,u,k})^{\mathrm{H}}\right]bold_Ω start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ≜ blackboard_E start_POSTSUBSCRIPT over¯ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT [ ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT - over~ start_ARG bold_y end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ) ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT - over~ start_ARG bold_y end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT roman_H end_POSTSUPERSCRIPT ], can be calculated as

𝐲~c,u,kxsubscriptsuperscript~𝐲𝑥𝑐𝑢𝑘\displaystyle\tilde{\mathbf{y}}^{x}_{c,u,k}over~ start_ARG bold_y end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT =𝐲c,ku𝒰u𝐡^c,u,kvx^c,u,kv,absentsubscript𝐲𝑐𝑘subscriptsuperscript𝑢𝒰𝑢subscriptsuperscript^𝐡𝑣𝑐superscript𝑢𝑘subscriptsuperscript^𝑥𝑣𝑐superscript𝑢𝑘\displaystyle=\mathbf{y}_{c,k}-\sum_{u^{\prime}\in\mathcal{U}\setminus u}\ % \hat{\mathbf{h}}^{v}_{c,u^{\prime},k}\hat{x}^{v}_{c,u^{\prime},k},= bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U ∖ italic_u end_POSTSUBSCRIPT over^ start_ARG bold_h end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT , (38a)
𝛀c,kxsubscriptsuperscript𝛀𝑥𝑐𝑘\displaystyle\mathbf{\Omega}^{x}_{c,k}bold_Ω start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT =u𝒰{ξc,u,kc,x𝐡^c,u,kv𝐡^c,u,kvH\displaystyle=\sum_{u^{\prime}\in\mathcal{U}}\Big{\{}\xi^{c,x}_{c,u^{\prime},k% }\hat{\mathbf{h}}^{v}_{c,u^{\prime},k}\hat{\mathbf{h}}^{v\mathrm{H}}_{c,u^{% \prime},k}= ∑ start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U end_POSTSUBSCRIPT { italic_ξ start_POSTSUPERSCRIPT italic_c , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT over^ start_ARG bold_h end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT over^ start_ARG bold_h end_ARG start_POSTSUPERSCRIPT italic_v roman_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT
+(ξc,u,kv,x+|x^c,u,kv|2)𝚵c,u,kv,e}+σ2𝐈Nc,\displaystyle\qquad+(\xi^{v,x}_{c,u^{\prime},k}+|\hat{x}^{v}_{c,u^{\prime},k}|% ^{2})\mathbf{\Xi}^{v,e}_{c,u^{\prime},k}\Big{\}}+\sigma^{2}\mathbf{I}_{N_{c}},+ ( italic_ξ start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT + | over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) bold_Ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT } + italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_POSTSUBSCRIPT , (38b)

with 𝐡^c,u,kv𝐞^c,u,kv+𝐬^c,usubscriptsuperscript^𝐡𝑣𝑐superscript𝑢𝑘subscriptsuperscript^𝐞𝑣𝑐superscript𝑢𝑘subscript^𝐬𝑐superscript𝑢\hat{\mathbf{h}}^{v}_{c,u^{\prime},k}\triangleq\hat{\mathbf{e}}^{v}_{c,u^{% \prime},k}+\hat{\mathbf{s}}_{c,u^{\prime}}over^ start_ARG bold_h end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT ≜ over^ start_ARG bold_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT + over^ start_ARG bold_s end_ARG start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT.

Substituting (32) and (34) into (31), the approximate function qc,u,kx(xu,k)superscriptsubscript𝑞𝑐𝑢𝑘𝑥subscript𝑥𝑢𝑘q_{c,u,k}^{x}(x_{u,k})italic_q start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) can be obtained as

qc,u,kx(xu,k)proj𝚽[p¯c,u,kx(𝐲c,k|xu,k)vc,u,kx(xu,k)]vc,u,kx(xu,k).proportional-tosubscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘subscriptproj𝚽delimited-[]subscriptsuperscript¯𝑝𝑥𝑐𝑢𝑘conditionalsubscript𝐲𝑐𝑘subscript𝑥𝑢𝑘subscriptsuperscript𝑣𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘subscriptsuperscript𝑣𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘\displaystyle q^{x}_{c,u,k}(x_{u,k})\propto\frac{\mathrm{proj}_{\mathbf{\Phi}}% \left[\bar{p}^{x}_{c,u,k}(\mathbf{y}_{c,k}|x_{u,k})v^{x}_{c,u,k}(x_{u,k})% \right]}{v^{x}_{c,u,k}(x_{u,k})}.italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ∝ divide start_ARG roman_proj start_POSTSUBSCRIPT bold_Φ end_POSTSUBSCRIPT [ over¯ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) italic_v start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ] end_ARG start_ARG italic_v start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) end_ARG . (39)

Under large system conditions with CLT, the conditional distribution can be approximated as p¯c,u,kx(𝐲c,k|xu,k)𝒞𝒩(𝐲~c,u,kx,𝛀c,kx)similar-to-or-equalssubscriptsuperscript¯𝑝𝑥𝑐𝑢𝑘conditionalsubscript𝐲𝑐𝑘subscript𝑥𝑢𝑘𝒞𝒩subscriptsuperscript~𝐲𝑥𝑐𝑢𝑘subscriptsuperscript𝛀𝑥𝑐𝑘\bar{p}^{x}_{c,u,k}(\mathbf{y}_{c,k}|x_{u,k})\simeq\mathcal{CN}(\tilde{\mathbf% {y}}^{x}_{c,u,k},\mathbf{\Omega}^{x}_{c,k})over¯ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ≃ caligraphic_C caligraphic_N ( over~ start_ARG bold_y end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT , bold_Ω start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ). Thus, the approximate function can be expressed as qc,u,kx(xu,k)p¯c,u,kx(𝐲c,k|xu,k)proportional-tosubscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘subscriptsuperscript¯𝑝𝑥𝑐𝑢𝑘conditionalsubscript𝐲𝑐𝑘subscript𝑥𝑢𝑘q^{x}_{c,u,k}(x_{u,k})\propto\bar{p}^{x}_{c,u,k}(\mathbf{y}_{c,k}|x_{u,k})italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ∝ over¯ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_y start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) with the mean and variance calculated as

x^c,u,kqsubscriptsuperscript^𝑥𝑞𝑐𝑢𝑘\displaystyle\hat{x}^{q}_{c,u,k}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT =𝐡^c,u,kvH(𝛀c,kx)1𝐲~c,u,kx/γc,u,kx,absentsubscriptsuperscript^𝐡𝑣H𝑐𝑢𝑘superscriptsuperscriptsubscript𝛀𝑐𝑘𝑥1subscriptsuperscript~𝐲𝑥𝑐𝑢𝑘subscriptsuperscript𝛾𝑥𝑐𝑢𝑘\displaystyle=\hat{\mathbf{h}}^{v\mathrm{H}}_{c,u,k}(\mathbf{\Omega}_{c,k}^{x}% )^{-1}\tilde{\mathbf{y}}^{x}_{c,u,k}/\gamma^{x}_{c,u,k},= over^ start_ARG bold_h end_ARG start_POSTSUPERSCRIPT italic_v roman_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_Ω start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over~ start_ARG bold_y end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT / italic_γ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT , (40a)
ξc,u,kq,xsubscriptsuperscript𝜉𝑞𝑥𝑐𝑢𝑘\displaystyle\xi^{q,x}_{c,u,k}italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT =1/γc,u,kxξc,u,kv,x,absent1subscriptsuperscript𝛾𝑥𝑐𝑢𝑘subscriptsuperscript𝜉𝑣𝑥𝑐𝑢𝑘\displaystyle=1/\gamma^{x}_{c,u,k}-\xi^{v,x}_{c,u,k},= 1 / italic_γ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT - italic_ξ start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT , (40b)

with γc,u,kx=𝐡^c,u,kvH(𝛀c,kx)1𝐡^c,u,kvsubscriptsuperscript𝛾𝑥𝑐𝑢𝑘subscriptsuperscript^𝐡𝑣H𝑐𝑢𝑘superscriptsubscriptsuperscript𝛀𝑥𝑐𝑘1subscriptsuperscript^𝐡𝑣𝑐𝑢𝑘\gamma^{x}_{c,u,k}=\hat{\mathbf{h}}^{v\mathrm{H}}_{c,u,k}(\mathbf{\Omega}^{x}_% {c,k})^{-1}\hat{\mathbf{h}}^{v}_{c,u,k}italic_γ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT = over^ start_ARG bold_h end_ARG start_POSTSUPERSCRIPT italic_v roman_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( bold_Ω start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG bold_h end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT. The calculation of 𝐲~c,u,kxsubscriptsuperscript~𝐲𝑥𝑐𝑢𝑘\tilde{\mathbf{y}}^{x}_{c,u,k}over~ start_ARG bold_y end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT in (38a) corresponds to a soft interference cancellation (Soft-IC) [12] using data replicas {x^c,u,kv}u𝒰usubscriptsubscriptsuperscript^𝑥𝑣𝑐superscript𝑢𝑘superscript𝑢𝒰𝑢\{\hat{x}^{v}_{c,u^{\prime},k}\}_{u^{\prime}\in\mathcal{U}\setminus u}{ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U ∖ italic_u end_POSTSUBSCRIPT and channel replicas {𝐡^c,u,kv}u𝒰subscriptsubscriptsuperscript^𝐡𝑣𝑐superscript𝑢𝑘superscript𝑢𝒰\{\hat{\mathbf{h}}^{v}_{c,u^{\prime},k}\}_{u^{\prime}\in\mathcal{U}}{ over^ start_ARG bold_h end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U end_POSTSUBSCRIPT.

Unlike the conventional MRC-based detections [13, 14, 12], the LMMSE-based detection for each sub-array c𝑐citalic_c in (40a) can deal with the correlation between the leaked energy in the beam-domain owing to whitening operation by (𝛀c,kx)1superscriptsuperscriptsubscript𝛀𝑐𝑘𝑥1(\mathbf{\Omega}_{c,k}^{x})^{-1}( bold_Ω start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT.

V-C2 Update 𝝅u,kb,xsubscriptsuperscript𝝅𝑏𝑥𝑢𝑘\bm{\pi}^{b,x}_{u,k}bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT

The KL minimization problem for 𝝅u,kb,xsubscriptsuperscript𝝅𝑏𝑥𝑢𝑘\bm{\pi}^{b,x}_{u,k}bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT is formulated as

minimize𝝅u,kb,xKL(p^u,kb,x(𝐄,𝐗|𝐘)g(𝐄,𝐗|𝐘)),\displaystyle\underset{\bm{\pi}^{b,x}_{u,k}}{\mathrm{minimize}}\ \mathrm{KL}% \left(\hat{p}_{u,k}^{b,x}(\mathbf{E},\mathbf{X}|\mathbf{Y})\|g(\mathbf{E},% \mathbf{X}|\mathbf{Y})\right),start_UNDERACCENT bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_minimize end_ARG roman_KL ( over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) ∥ italic_g ( bold_E , bold_X | bold_Y ) ) , (41)

where p^u,kb,x(𝐄,𝐗|𝐘)superscriptsubscript^𝑝𝑢𝑘𝑏𝑥𝐄conditional𝐗𝐘\hat{p}_{u,k}^{b,x}(\mathbf{E},\mathbf{X}|\mathbf{Y})over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) is the target distribution defined as

p^u,kb,x(𝐄,𝐗|𝐘)=Cu,kb,xp(xu,k)superscriptsubscript^𝑝𝑢𝑘𝑏𝑥𝐄conditional𝐗𝐘superscriptsubscript𝐶𝑢𝑘𝑏𝑥𝑝subscript𝑥𝑢𝑘\displaystyle\hat{p}_{u,k}^{b,x}(\mathbf{E},\mathbf{X}|\mathbf{Y})=\ C_{u,k}^{% b,x}\ p(x_{u,k})over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) = italic_C start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT italic_p ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT )
(u,k)𝒰×𝒦(u,k)bu,kx(xu,k)p(xu,k)Qx(𝐗)Qe(𝐄)p(𝐘|𝐄,𝐗)Be(𝐄)p(𝐄;𝚯)subscriptproductsuperscript𝑢superscript𝑘𝒰𝒦𝑢𝑘subscriptsuperscriptsubscript𝑏superscript𝑢superscript𝑘𝑥subscript𝑥superscript𝑢superscript𝑘similar-to-or-equalsabsent𝑝subscript𝑥superscript𝑢superscript𝑘subscriptsuperscript𝑄𝑥𝐗superscript𝑄𝑒𝐄similar-to-or-equalsabsent𝑝conditional𝐘𝐄𝐗subscriptsuperscript𝐵𝑒𝐄similar-to-or-equalsabsent𝑝𝐄𝚯\displaystyle\prod_{(u^{\prime},k^{\prime})\in\mathcal{U}\times\mathcal{K}% \setminus(u,k)}\underbrace{b_{u^{\prime},k^{\prime}}^{x}(x_{u^{\prime},k^{% \prime}})}_{\simeq p(x_{u^{\prime},k^{\prime}})}\underbrace{Q^{x}(\mathbf{X})Q% ^{e}(\mathbf{E})}_{\simeq p(\mathbf{Y}|\mathbf{E},\mathbf{X})}\underbrace{B^{e% }(\mathbf{E})}_{\simeq p(\mathbf{E};\mathbf{\Theta})}∏ start_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∈ caligraphic_U × caligraphic_K ∖ ( italic_u , italic_k ) end_POSTSUBSCRIPT under⏟ start_ARG italic_b start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( italic_x start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT under⏟ start_ARG italic_Q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) italic_Q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( bold_Y | bold_E , bold_X ) end_POSTSUBSCRIPT under⏟ start_ARG italic_B start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( bold_E ; bold_Θ ) end_POSTSUBSCRIPT (42)

where Cu,kb,xsuperscriptsubscript𝐶𝑢𝑘𝑏𝑥C_{u,k}^{b,x}italic_C start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT is a normalizing constant.

Similar to the derivation of (31), the optimal condition for 𝝅u,kb,xsubscriptsuperscript𝝅𝑏𝑥𝑢𝑘\bm{\pi}^{b,x}_{u,k}bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT is derived as

g(xu,k|𝐘)=proj𝚽[p^u,kb,x(xu,k|𝐘)],𝑔conditionalsubscript𝑥𝑢𝑘𝐘subscriptproj𝚽delimited-[]subscriptsuperscript^𝑝𝑏𝑥𝑢𝑘conditionalsubscript𝑥𝑢𝑘𝐘\displaystyle g(x_{u,k}|\mathbf{Y})=\mathrm{proj}_{\mathbf{\Phi}}\left[\hat{p}% ^{b,x}_{u,k}(x_{u,k}|\mathbf{Y})\right],italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) = roman_proj start_POSTSUBSCRIPT bold_Φ end_POSTSUBSCRIPT [ over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) ] , (43)

where p^u,kb,x(xu,k|𝐘)=𝐄,𝐗xu,kp^u,kb,x(𝐄,𝐗|𝐘)subscriptsuperscript^𝑝𝑏𝑥𝑢𝑘conditionalsubscript𝑥𝑢𝑘𝐘subscript𝐄𝐗subscript𝑥𝑢𝑘subscriptsuperscript^𝑝𝑏𝑥𝑢𝑘𝐄conditional𝐗𝐘\hat{p}^{b,x}_{u,k}(x_{u,k}|\mathbf{Y})=\int_{\mathbf{E},\mathbf{X}\setminus x% _{u,k}}\hat{p}^{b,x}_{u,k}(\mathbf{E},\mathbf{X}|\mathbf{Y})over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) = ∫ start_POSTSUBSCRIPT bold_E , bold_X ∖ italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( bold_E , bold_X | bold_Y ) is the marginalized target distribution calculated as

p^u,kb,x(xu,k|𝐘)subscriptsuperscript^𝑝𝑏𝑥𝑢𝑘conditionalsubscript𝑥𝑢𝑘𝐘\displaystyle\hat{p}^{b,x}_{u,k}(x_{u,k}|\mathbf{Y})over^ start_ARG italic_p end_ARG start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) p(xu,k)c𝒞qc,u,kx(xu,k),proportional-toabsent𝑝subscript𝑥𝑢𝑘subscriptproductsuperscript𝑐𝒞subscriptsuperscript𝑞𝑥superscript𝑐𝑢𝑘subscript𝑥𝑢𝑘\displaystyle\propto\ p(x_{u,k})\prod_{c^{\prime}\in\mathcal{C}}q^{x}_{c^{% \prime},u,k}(x_{u,k}),∝ italic_p ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ∏ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_C end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) , (44)

with the approximate function multiplied over the sub-array direction, c𝒞qc,u,kx(xu,k)subscriptproductsuperscript𝑐𝒞subscriptsuperscript𝑞𝑥superscript𝑐𝑢𝑘subscript𝑥𝑢𝑘\prod_{c^{\prime}\in\mathcal{C}}q^{x}_{c^{\prime},u,k}(x_{u,k})∏ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_C end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ), calculated as

c𝒞qc,u,kx(xu,k)exp(|xu,kx^u,kq|2/ξu,kq,x),proportional-tosubscriptproductsuperscript𝑐𝒞subscriptsuperscript𝑞𝑥superscript𝑐𝑢𝑘subscript𝑥𝑢𝑘superscriptsubscript𝑥𝑢𝑘subscriptsuperscript^𝑥𝑞𝑢𝑘2subscriptsuperscript𝜉𝑞𝑥𝑢𝑘\displaystyle\prod_{c^{\prime}\in\mathcal{C}}q^{x}_{c^{\prime},u,k}(x_{u,k})% \propto\exp\left(-|x_{u,k}-\hat{x}^{q}_{u,k}|^{2}/\xi^{q,x}_{u,k}\right),∏ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_C end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ∝ roman_exp ( - | italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) , (45)

with

x^u,kq=ξu,kq,x(c𝒞x^c,u,kqξc,u,kq,x),ξu,kq,x=(c𝒞1ξc,u,kq,x)1.formulae-sequencesuperscriptsubscript^𝑥𝑢𝑘𝑞superscriptsubscript𝜉𝑢𝑘𝑞𝑥subscriptsuperscript𝑐𝒞superscriptsubscript^𝑥superscript𝑐𝑢𝑘𝑞superscriptsubscript𝜉superscript𝑐𝑢𝑘𝑞𝑥superscriptsubscript𝜉𝑢𝑘𝑞𝑥superscriptsubscriptsuperscript𝑐𝒞1superscriptsubscript𝜉superscript𝑐𝑢𝑘𝑞𝑥1\displaystyle\hat{x}_{u,k}^{q}=\xi_{u,k}^{q,x}\left(\sum_{c^{\prime}\in% \mathcal{C}}\frac{\hat{x}_{c^{\prime},u,k}^{q}}{\xi_{c^{\prime},u,k}^{q,x}}% \right),\ \xi_{u,k}^{q,x}=\left(\sum_{c^{\prime}\in\mathcal{C}}\frac{1}{\xi_{c% ^{\prime},u,k}^{q,x}}\right)^{-1}\!\!\!\!\!.over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT = italic_ξ start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_C end_POSTSUBSCRIPT divide start_ARG over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT end_ARG start_ARG italic_ξ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT end_ARG ) , italic_ξ start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT = ( ∑ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_C end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_ξ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (46)

Note that combining the mean {x^c,u,kq}c𝒞subscriptsubscriptsuperscript^𝑥𝑞𝑐𝑢𝑘𝑐𝒞\{\hat{x}^{q}_{c,u,k}\}_{c\in\mathcal{C}}{ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_c ∈ caligraphic_C end_POSTSUBSCRIPT and variance {ξc,u,kq,x}c𝒞subscriptsubscriptsuperscript𝜉𝑞𝑥𝑐𝑢𝑘𝑐𝒞\{\xi^{q,x}_{c,u,k}\}_{c\in\mathcal{C}}{ italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_c ∈ caligraphic_C end_POSTSUBSCRIPT over the sub-array direction c𝒞𝑐𝒞c\in\mathcal{C}italic_c ∈ caligraphic_C, as written in (46), leads to further improvements for data detection owing to the spatial diversity. Substituting (44) into (43), the approximate posterior g(xu,k|𝐘)𝑔conditionalsubscript𝑥𝑢𝑘𝐘g(x_{u,k}|\mathbf{Y})italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) can be written as

g(xu,k|𝐘)𝑔conditionalsubscript𝑥𝑢𝑘𝐘\displaystyle g(x_{u,k}|\mathbf{Y})italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) proj𝚽[p(xu,k)c𝒞qc,u,kx(xu,k)].proportional-toabsentsubscriptproj𝚽delimited-[]𝑝subscript𝑥𝑢𝑘subscriptproductsuperscript𝑐𝒞subscriptsuperscript𝑞𝑥superscript𝑐𝑢𝑘subscript𝑥𝑢𝑘\displaystyle\propto\mathrm{proj_{\mathbf{\Phi}}}\left[p(x_{u,k})\prod_{c^{% \prime}\in\mathcal{C}}q^{x}_{c^{\prime},u,k}(x_{u,k})\right].∝ roman_proj start_POSTSUBSCRIPT bold_Φ end_POSTSUBSCRIPT [ italic_p ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ∏ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_C end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ] . (47)

The approximate posterior mean x^u,ksubscript^𝑥𝑢𝑘\hat{x}_{u,k}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT and variance ξu,kxsubscriptsuperscript𝜉𝑥𝑢𝑘\xi^{x}_{u,k}italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT of g(xu,k|𝐘)𝑔conditionalsubscript𝑥𝑢𝑘𝐘g(x_{u,k}|\mathbf{Y})italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) can be derived using the MMSE denoiser function η()𝜂\eta(\cdot)italic_η ( ⋅ ) [31], which is designed based on the prior for QAM constellation p(xu,k)𝑝subscript𝑥𝑢𝑘p(x_{u,k})italic_p ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) in (18). Then, the posterior mean and variance is expressed as x^u,k=η(x^u,kq,ξu,kq,x)𝔼g(xu,k|𝐘)[xu,k]subscript^𝑥𝑢𝑘𝜂subscriptsuperscript^𝑥𝑞𝑢𝑘subscriptsuperscript𝜉𝑞𝑥𝑢𝑘subscript𝔼𝑔conditionalsubscript𝑥𝑢𝑘𝐘delimited-[]subscript𝑥𝑢𝑘\hat{x}_{u,k}=\eta(\hat{x}^{q}_{u,k},\xi^{q,x}_{u,k})\triangleq\mathbb{E}_{g(x% _{u,k}|\mathbf{Y})}[x_{u,k}]over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT = italic_η ( over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ≜ blackboard_E start_POSTSUBSCRIPT italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) end_POSTSUBSCRIPT [ italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ] and ξu,kx=ξu,kq,xη(x^u,kq,ξu,kq,x)x^u,kqsubscriptsuperscript𝜉𝑥𝑢𝑘subscriptsuperscript𝜉𝑞𝑥𝑢𝑘𝜂subscriptsuperscript^𝑥𝑞𝑢𝑘subscriptsuperscript𝜉𝑞𝑥𝑢𝑘subscriptsuperscript^𝑥𝑞𝑢𝑘\xi^{x}_{u,k}=\xi^{q,x}_{u,k}\frac{\partial\eta(\hat{x}^{q}_{u,k},\xi^{q,x}_{u% ,k})}{\partial\hat{x}^{q}_{u,k}}italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT = italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT divide start_ARG ∂ italic_η ( over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_ARG, which can be calculated as

x^u,k=Cu,kg,x𝒳q𝒳𝒳qexp(|𝒳qx^u,kq|2/ξu,kq,x),subscript^𝑥𝑢𝑘subscriptsuperscript𝐶𝑔𝑥𝑢𝑘subscriptsubscript𝒳𝑞𝒳subscript𝒳𝑞superscriptsubscript𝒳𝑞subscriptsuperscript^𝑥𝑞𝑢𝑘2subscriptsuperscript𝜉𝑞𝑥𝑢𝑘\displaystyle\!\!\hat{x}_{u,k}=C^{g,x}_{u,k}\sum_{\mathcal{X}_{q}\in\mathcal{X% }}\mathcal{X}_{q}\exp\left(-|\mathcal{X}_{q}-\hat{x}^{q}_{u,k}|^{2}/\xi^{q,x}_% {u,k}\right),over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT = italic_C start_POSTSUPERSCRIPT italic_g , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ∈ caligraphic_X end_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT roman_exp ( - | caligraphic_X start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) , (48a)
ξu,kx=Cu,kg,x𝒳q𝒳|𝒳q|2exp(|𝒳qx^u,kq|2/ξu,kq,x)|x^u,k|2,subscriptsuperscript𝜉𝑥𝑢𝑘subscriptsuperscript𝐶𝑔𝑥𝑢𝑘subscriptsubscript𝒳𝑞𝒳superscriptsubscript𝒳𝑞2superscriptsubscript𝒳𝑞subscriptsuperscript^𝑥𝑞𝑢𝑘2subscriptsuperscript𝜉𝑞𝑥𝑢𝑘superscriptsubscript^𝑥𝑢𝑘2\displaystyle\!\!\xi^{x}_{u,k}\!=\!C^{g,x}_{u,k}\!\!\sum_{\mathcal{X}_{q}\in% \mathcal{X}}\!\!|\mathcal{X}_{q}|^{2}\exp\!\left(\!-|\mathcal{X}_{q}\!-\!\hat{% x}^{q}_{u,k}|^{2}/\xi^{q,x}_{u,k}\!\right)\!-\!|\hat{x}_{u,k}|^{2}\!\!,\!italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT = italic_C start_POSTSUPERSCRIPT italic_g , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ∈ caligraphic_X end_POSTSUBSCRIPT | caligraphic_X start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_exp ( - | caligraphic_X start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) - | over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (48b)

with (Cu,kg,x)1=𝒳q𝒳exp(|𝒳qx^u,kq|2/ξu,kq,x).superscriptsubscriptsuperscript𝐶𝑔𝑥𝑢𝑘1subscriptsubscript𝒳𝑞𝒳superscriptsubscript𝒳𝑞subscriptsuperscript^𝑥𝑞𝑢𝑘2subscriptsuperscript𝜉𝑞𝑥𝑢𝑘(C^{g,x}_{u,k})^{-1}=\sum_{\mathcal{X}_{q}\in\mathcal{X}}\exp\left(-|\mathcal{% X}_{q}-\hat{x}^{q}_{u,k}|^{2}/\xi^{q,x}_{u,k}\right).( italic_C start_POSTSUPERSCRIPT italic_g , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT caligraphic_X start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ∈ caligraphic_X end_POSTSUBSCRIPT roman_exp ( - | caligraphic_X start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) .

From (32), vc,u,kx(xu,k)superscriptsubscript𝑣𝑐𝑢𝑘𝑥subscript𝑥𝑢𝑘v_{c,u,k}^{x}(x_{u,k})italic_v start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) can be updated as

vc,u,kx(xu,k)g(xu,k|𝐘)/qc,u,kx(xu,k),proportional-tosubscriptsuperscript𝑣𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘𝑔conditionalsubscript𝑥𝑢𝑘𝐘subscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘\displaystyle v^{x}_{c,u,k}(x_{u,k})\propto\ g(x_{u,k}|\mathbf{Y})/q^{x}_{c,u,% k}(x_{u,k}),italic_v start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) ∝ italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) / italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) , (49)

from which the associated mean and variance are given by

x^c,u,kv=ξc,u,kv,x(x^u,kξu,kxx^c,u,kqξc,u,kq,x),ξc,u,kv,x=(1ξu,kx1ξc,u,kq,x)1.formulae-sequencesubscriptsuperscript^𝑥𝑣𝑐𝑢𝑘subscriptsuperscript𝜉𝑣𝑥𝑐𝑢𝑘subscript^𝑥𝑢𝑘subscriptsuperscript𝜉𝑥𝑢𝑘subscriptsuperscript^𝑥𝑞𝑐𝑢𝑘subscriptsuperscript𝜉𝑞𝑥𝑐𝑢𝑘subscriptsuperscript𝜉𝑣𝑥𝑐𝑢𝑘superscript1subscriptsuperscript𝜉𝑥𝑢𝑘1subscriptsuperscript𝜉𝑞𝑥𝑐𝑢𝑘1\displaystyle\hat{x}^{v}_{c,u,k}\!=\!\xi^{v,x}_{c,u,k}\left(\frac{\hat{x}_{u,k% }}{\xi^{x}_{u,k}}\!-\!\frac{\hat{x}^{q}_{c,u,k}}{\xi^{q,x}_{c,u,k}}\right)\!,% \ \xi^{v,x}_{c,u,k}\!=\!\left(\!\frac{1}{\xi^{x}_{u,k}}\!-\!\frac{1}{\xi^{q,x}% _{c,u,k}}\!\right)^{-1}\!.over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT = italic_ξ start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( divide start_ARG over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_ARG - divide start_ARG over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT end_ARG ) , italic_ξ start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT = ( divide start_ARG 1 end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_ARG - divide start_ARG 1 end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (50)

As shown in the Soft-IC process in (38a)-(38), x^c,u,kvsubscriptsuperscript^𝑥𝑣𝑐𝑢𝑘\hat{x}^{v}_{c,u,k}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT and ξc,u,kv,xsubscriptsuperscript𝜉𝑣𝑥𝑐𝑢𝑘\xi^{v,x}_{c,u,k}italic_ξ start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT in (50) are used as soft replicas instead of x^u,ksubscript^𝑥𝑢𝑘\hat{x}_{u,k}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT and ξu,kxsubscriptsuperscript𝜉𝑥𝑢𝑘\xi^{x}_{u,k}italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT in (48a)-(48b) in order to suppress the self-noise feedback in the algorithm iterations [32]. In conventional JCDE algorithms [11, 12], the self-feedback suppression is performed before the denoising process in (48a)-(48b) by generating antenna-wise extrinsic values based on BP rules. Hence, the complexity of the denoising process is 𝒪(QNUKd)𝒪𝑄𝑁𝑈subscript𝐾d\mathcal{O}(QNUK_{\mathrm{d}})caligraphic_O ( italic_Q italic_N italic_U italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ). In contrast, the proposed method can reduce the complexity in the denoising process as 𝒪(QUKd)𝒪𝑄𝑈subscript𝐾d\mathcal{O}(QUK_{\mathrm{d}})caligraphic_O ( italic_Q italic_U italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ), since the extrinsic values x^c,u,kvsubscriptsuperscript^𝑥𝑣𝑐𝑢𝑘\hat{x}^{v}_{c,u,k}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT and ξc,u,kv,xsubscriptsuperscript𝜉𝑣𝑥𝑐𝑢𝑘\xi^{v,x}_{c,u,k}italic_ξ start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT are generated after the denoising process in (50).

Finally, from (32), bu,kx(xu,k)subscriptsuperscript𝑏𝑥𝑢𝑘subscript𝑥𝑢𝑘b^{x}_{u,k}(x_{u,k})italic_b start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) can be updated as

bu,kx(xu,k)subscriptsuperscript𝑏𝑥𝑢𝑘subscript𝑥𝑢𝑘\displaystyle b^{x}_{u,k}(x_{u,k})\ italic_b start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) g(xu,k|𝐘u,k)/c𝒞qc,u,kx(xu,k),proportional-toabsent𝑔conditionalsubscript𝑥𝑢𝑘subscript𝐘𝑢𝑘subscriptproductsuperscript𝑐𝒞subscriptsuperscript𝑞𝑥superscript𝑐𝑢𝑘subscript𝑥𝑢𝑘\displaystyle\propto\ g(x_{u,k}|\mathbf{Y}_{u,k})\Big{/}\prod_{c^{\prime}\in% \mathcal{C}}q^{x}_{c^{\prime},u,k}(x_{u,k}),∝ italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) / ∏ start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_C end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) , (51)

with the mean x^u,kbsubscriptsuperscript^𝑥𝑏𝑢𝑘\hat{x}^{b}_{u,k}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT and variance ξu,kb,xsubscriptsuperscript𝜉𝑏𝑥𝑢𝑘\xi^{b,x}_{u,k}italic_ξ start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT of bu,kx(xu,k)subscriptsuperscript𝑏𝑥𝑢𝑘subscript𝑥𝑢𝑘b^{x}_{u,k}(x_{u,k})italic_b start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) being

x^u,kb=ξu,kb,x(x^u,kξu,kxx^u,kqξu,kq,x),subscriptsuperscript^𝑥𝑏𝑢𝑘subscriptsuperscript𝜉𝑏𝑥𝑢𝑘subscript^𝑥𝑢𝑘subscriptsuperscript𝜉𝑥𝑢𝑘subscriptsuperscript^𝑥𝑞𝑢𝑘subscriptsuperscript𝜉𝑞𝑥𝑢𝑘\displaystyle\hat{x}^{b}_{u,k}\!=\!\xi^{b,x}_{u,k}\left(\frac{\hat{x}_{u,k}}{% \xi^{x}_{u,k}}-\frac{\hat{x}^{q}_{u,k}}{\xi^{q,x}_{u,k}}\right),\!\!\!over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT = italic_ξ start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( divide start_ARG over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_ARG - divide start_ARG over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_ARG ) , ξu,kb,x=(1ξu,kx1ξu,kq,x)1.subscriptsuperscript𝜉𝑏𝑥𝑢𝑘superscript1subscriptsuperscript𝜉𝑥𝑢𝑘1subscriptsuperscript𝜉𝑞𝑥𝑢𝑘1\displaystyle\xi^{b,x}_{u,k}\!=\!\left(\frac{1}{\xi^{x}_{u,k}}-\frac{1}{\xi^{q% ,x}_{u,k}}\right)^{-1}\!\!\!\!\!\!.italic_ξ start_POSTSUPERSCRIPT italic_b , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT = ( divide start_ARG 1 end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_ARG - divide start_ARG 1 end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (52)

V-D EP for Residual Channel Error Estimation

V-D1 Update 𝝅nc,u,kq,esubscriptsuperscript𝝅𝑞𝑒subscript𝑛𝑐𝑢𝑘\bm{\pi}^{q,e}_{n_{c},u,k}bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT

For 𝝅nc,u,kq,esubscriptsuperscript𝝅𝑞𝑒subscript𝑛𝑐𝑢𝑘\bm{\pi}^{q,e}_{n_{c},u,k}bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT, we minimize

minimize𝝅nc,u,kq,eKL(p^nc,kq,e(𝐄,𝐗|𝐘)g(𝐄,𝐗|𝐘)),\displaystyle\underset{\bm{\pi}^{q,e}_{n_{c},u,k}}{\mathrm{minimize}}\ \mathrm% {KL}\left(\hat{p}_{n_{c},k}^{q,e}(\mathbf{E},\mathbf{X}|\mathbf{Y})\|g(\mathbf% {E},\mathbf{X}|\mathbf{Y})\right),start_UNDERACCENT bold_italic_π start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_minimize end_ARG roman_KL ( over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) ∥ italic_g ( bold_E , bold_X | bold_Y ) ) , (53)

where p^nc,kq,e(𝐄,𝐗|𝐘)superscriptsubscript^𝑝subscript𝑛𝑐𝑘𝑞𝑒𝐄conditional𝐗𝐘\hat{p}_{n_{c},k}^{q,e}(\mathbf{E},\mathbf{X}|\mathbf{Y})over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) is the target distribution designed as

p^nc,kq,e(𝐄,𝐗|𝐘)=Cnc,kq,ep(ync,k|𝐞¯nc,𝐱¯k)superscriptsubscript^𝑝subscript𝑛𝑐𝑘𝑞𝑒𝐄conditional𝐗𝐘superscriptsubscript𝐶subscript𝑛𝑐𝑘𝑞𝑒𝑝conditionalsubscript𝑦subscript𝑛𝑐𝑘subscript¯𝐞subscript𝑛𝑐subscript¯𝐱𝑘\displaystyle\hat{p}_{n_{c},k}^{q,e}(\mathbf{E},\mathbf{X}|\mathbf{Y})=C_{n_{c% },k}^{q,e}\ p(y_{n_{c},k}|\bar{\mathbf{e}}_{n_{c}},\bar{\mathbf{x}}_{k})over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) = italic_C start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT italic_p ( italic_y start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k end_POSTSUBSCRIPT | over¯ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT end_POSTSUBSCRIPT , over¯ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT )
(nc,k)𝒩×𝒦(nc,k)lnc,ke(𝐞¯nc,𝐱k)p(ync,k|𝐞¯nc,𝐱k)Bx(𝐗)p(𝐗)Be(𝐄)p(𝐄;𝚯),subscriptproductsuperscriptsubscript𝑛𝑐superscript𝑘𝒩𝒦subscript𝑛𝑐𝑘subscriptsubscriptsuperscript𝑙𝑒superscriptsubscript𝑛𝑐superscript𝑘subscript¯𝐞superscriptsubscript𝑛𝑐subscript𝐱superscript𝑘similar-to-or-equalsabsent𝑝conditionalsubscript𝑦superscriptsubscript𝑛𝑐superscript𝑘subscript¯𝐞superscriptsubscript𝑛𝑐subscript𝐱superscript𝑘subscriptsuperscript𝐵𝑥𝐗similar-to-or-equalsabsent𝑝𝐗subscriptsuperscript𝐵𝑒𝐄similar-to-or-equalsabsent𝑝𝐄𝚯\displaystyle\prod_{(n_{c}^{\prime},k^{\prime})\in\mathcal{N}\times\mathcal{K}% \setminus(n_{c},k)}\underbrace{l^{e}_{n_{c}^{\prime},k^{\prime}}(\bar{\mathbf{% e}}_{n_{c}^{\prime}},\mathbf{x}_{k^{\prime}})}_{\simeq p(y_{n_{c}^{\prime},k^{% \prime}}|\bar{\mathbf{e}}_{n_{c}^{\prime}},\mathbf{x}_{k^{\prime}})}% \underbrace{B^{x}(\mathbf{X})}_{\simeq p(\mathbf{X})}\underbrace{B^{e}(\mathbf% {E})}_{\simeq p(\mathbf{E};\mathbf{\Theta})},∏ start_POSTSUBSCRIPT ( italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∈ caligraphic_N × caligraphic_K ∖ ( italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k ) end_POSTSUBSCRIPT under⏟ start_ARG italic_l start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( over¯ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , bold_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( italic_y start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | over¯ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , bold_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT under⏟ start_ARG italic_B start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( bold_X ) end_POSTSUBSCRIPT under⏟ start_ARG italic_B start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( bold_E ; bold_Θ ) end_POSTSUBSCRIPT , (54)

where Cnc,kq,esuperscriptsubscript𝐶subscript𝑛𝑐𝑘𝑞𝑒C_{n_{c},k}^{q,e}italic_C start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT is a normalizing constant.

Through the same procedure as the derivation of qc,u,kx(xu,k)subscriptsuperscript𝑞𝑥𝑐𝑢𝑘subscript𝑥𝑢𝑘q^{x}_{c,u,k}(x_{u,k})italic_q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) in (39), the mean and variance of approximate function qnc,u,ke(enc,u)subscriptsuperscript𝑞𝑒subscript𝑛𝑐𝑢𝑘subscript𝑒subscript𝑛𝑐𝑢q^{e}_{n_{c},u,k}(e_{n_{c},u})italic_q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) are obtained as

e^nc,u,kq=x^nc,u,kwy~nc,u,ke|x^nc,u,kw|2,ξnc,u,kq,e=ϕnc,u,ke|x^nc,u,kw|2,formulae-sequencesubscriptsuperscript^𝑒𝑞subscript𝑛𝑐𝑢𝑘subscriptsuperscript^𝑥𝑤subscript𝑛𝑐𝑢𝑘subscriptsuperscript~𝑦𝑒subscript𝑛𝑐𝑢𝑘superscriptsuperscriptsubscript^𝑥subscript𝑛𝑐𝑢𝑘𝑤2subscriptsuperscript𝜉𝑞𝑒subscript𝑛𝑐𝑢𝑘subscriptsuperscriptitalic-ϕ𝑒subscript𝑛𝑐𝑢𝑘superscriptsuperscriptsubscript^𝑥subscript𝑛𝑐𝑢𝑘𝑤2\displaystyle\hat{e}^{q}_{n_{c},u,k}=\frac{\hat{x}^{w\ast}_{n_{c},u,k}\tilde{y% }^{e}_{n_{c},u,k}}{|\hat{x}_{n_{c},u,k}^{w}|^{2}},\ \xi^{q,e}_{n_{c},u,k}=% \frac{\phi^{e}_{n_{c},u,k}}{|\hat{x}_{n_{c},u,k}^{w}|^{2}},over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT = divide start_ARG over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_w ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT end_ARG start_ARG | over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , italic_ξ start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT = divide start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT end_ARG start_ARG | over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , (55)

with

y~nc,u,kesubscriptsuperscript~𝑦𝑒subscript𝑛𝑐𝑢𝑘\displaystyle\tilde{y}^{e}_{n_{c},u,k}over~ start_ARG italic_y end_ARG start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT =ync,ku𝒰ux^c,u,kwe^nc,u,kvu𝒰x^nc,u,kws^nc,u,absentsubscript𝑦subscript𝑛𝑐𝑘subscriptsuperscript𝑢𝒰𝑢subscriptsuperscript^𝑥𝑤𝑐superscript𝑢𝑘subscriptsuperscript^𝑒𝑣subscript𝑛𝑐superscript𝑢𝑘subscriptsuperscript𝑢𝒰subscriptsuperscript^𝑥𝑤subscript𝑛𝑐superscript𝑢𝑘subscript^𝑠subscript𝑛𝑐superscript𝑢\displaystyle=y_{n_{c},k}\!-\!\sum_{u^{\prime}\in\mathcal{U}\setminus u}\hat{x% }^{w}_{c,u^{\prime},k}\hat{e}^{v}_{n_{c},u^{\prime},k}-\sum_{u^{\prime}\in% \mathcal{U}}\hat{x}^{w}_{n_{c},u^{\prime},k}\hat{s}_{n_{c},u^{\prime}},= italic_y start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_k end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U ∖ italic_u end_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U end_POSTSUBSCRIPT over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , (56a)
ϕnc,u,kesubscriptsuperscriptitalic-ϕ𝑒subscript𝑛𝑐𝑢𝑘\displaystyle\phi^{e}_{n_{c},u,k}italic_ϕ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT =u𝒰(|e^nc,u,kv|2+|s^nc,u|2+ξc,u,kv,e)ξnc,u,kw,xabsentsubscriptsuperscript𝑢𝒰superscriptsubscriptsuperscript^𝑒𝑣subscript𝑛𝑐superscript𝑢𝑘2superscriptsubscript^𝑠subscript𝑛𝑐superscript𝑢2subscriptsuperscript𝜉𝑣𝑒𝑐superscript𝑢𝑘subscriptsuperscript𝜉𝑤𝑥subscript𝑛𝑐superscript𝑢𝑘\displaystyle=\sum_{u^{\prime}\in\mathcal{U}}\left(|\hat{e}^{v}_{n_{c},u^{% \prime},k}|^{2}+|\hat{s}_{n_{c},u^{\prime}}|^{2}+\xi^{v,e}_{c,u^{\prime},k}% \right)\xi^{w,x}_{n_{c},u^{\prime},k}= ∑ start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U end_POSTSUBSCRIPT ( | over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + | over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT ) italic_ξ start_POSTSUPERSCRIPT italic_w , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT
+u𝒰uξnc,u,kv,e|x^c,u,kw|2+σ2,subscriptsuperscript𝑢𝒰𝑢subscriptsuperscript𝜉𝑣𝑒subscript𝑛𝑐superscript𝑢𝑘superscriptsubscriptsuperscript^𝑥𝑤𝑐superscript𝑢𝑘2superscript𝜎2\displaystyle\quad+\sum_{u^{\prime}\in\mathcal{U}\setminus u}\xi^{v,e}_{n_{c},% u^{\prime},k}|\hat{x}^{w}_{c,u^{\prime},k}|^{2}+\sigma^{2},+ ∑ start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U ∖ italic_u end_POSTSUBSCRIPT italic_ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT | over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (56b)
x^c,u,kwsubscriptsuperscript^𝑥𝑤𝑐𝑢𝑘\displaystyle\hat{x}^{w}_{c,u,k}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT =ξc,u,kw,x(x^u,kx(ξu,kx)1x^c,u,kq(Ncξc,u,kq,x)1),absentsubscriptsuperscript𝜉𝑤𝑥𝑐𝑢𝑘subscriptsuperscript^𝑥𝑥𝑢𝑘superscriptsubscriptsuperscript𝜉𝑥𝑢𝑘1subscriptsuperscript^𝑥𝑞𝑐𝑢𝑘superscriptsubscript𝑁𝑐subscriptsuperscript𝜉𝑞𝑥𝑐𝑢𝑘1\displaystyle=\xi^{w,x}_{c,u,k}\left(\hat{x}^{x}_{u,k}(\xi^{x}_{u,k})^{-1}-% \hat{x}^{q}_{c,u,k}(N_{c}\xi^{q,x}_{c,u,k})^{-1}\!\right),\!\!= italic_ξ start_POSTSUPERSCRIPT italic_w , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ( italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ( italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) , (56c)
ξc,u,kw,xsubscriptsuperscript𝜉𝑤𝑥𝑐𝑢𝑘\displaystyle\xi^{w,x}_{c,u,k}italic_ξ start_POSTSUPERSCRIPT italic_w , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT =((ξu,kx)1(Ncξc,u,kq,x)1)1.absentsuperscriptsuperscriptsubscriptsuperscript𝜉𝑥𝑢𝑘1superscriptsubscript𝑁𝑐subscriptsuperscript𝜉𝑞𝑥𝑐𝑢𝑘11\displaystyle=\left((\xi^{x}_{u,k})^{-1}-(N_{c}\xi^{q,x}_{c,u,k})^{-1}\right)^% {-1}.= ( ( italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - ( italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (56d)

V-D2 Update 𝝅nc,ub,esubscriptsuperscript𝝅𝑏𝑒subscript𝑛𝑐𝑢\bm{\pi}^{b,e}_{n_{c},u}bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT

For 𝝅nc,ub,esubscriptsuperscript𝝅𝑏𝑒subscript𝑛𝑐𝑢\bm{\pi}^{b,e}_{n_{c},u}bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT, we have

minimize𝝅nc,ub,eKL(p^nc,ub,e(𝐄,𝐗|𝐘)g(𝐄,𝐗|𝐘)),\displaystyle\underset{\bm{\pi}^{b,e}_{n_{c},u}}{\mathrm{minimize}}\ \mathrm{% KL}\left(\hat{p}_{n_{c},u}^{b,e}(\mathbf{E},\mathbf{X}|\mathbf{Y})\|g(\mathbf{% E},\mathbf{X}|\mathbf{Y})\right),start_UNDERACCENT bold_italic_π start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_minimize end_ARG roman_KL ( over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) ∥ italic_g ( bold_E , bold_X | bold_Y ) ) , (57)

where p^nc,ub,e(𝐄,𝐗|𝐘)superscriptsubscript^𝑝subscript𝑛𝑐𝑢𝑏𝑒𝐄conditional𝐗𝐘\hat{p}_{n_{c},u}^{b,e}(\mathbf{E},\mathbf{X}|\mathbf{Y})over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) is the target distribution designed as

p^nc,ub,e(𝐄,𝐗|𝐘)=Cnc,ub,ep(enc,u;𝚯)superscriptsubscript^𝑝subscript𝑛𝑐𝑢𝑏𝑒𝐄conditional𝐗𝐘superscriptsubscript𝐶subscript𝑛𝑐𝑢𝑏𝑒𝑝subscript𝑒subscript𝑛𝑐𝑢𝚯\displaystyle\hat{p}_{n_{c},u}^{b,e}(\mathbf{E},\mathbf{X}|\mathbf{Y})=C_{n_{c% },u}^{b,e}\ p(e_{n_{c},u};\mathbf{\Theta})over^ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) = italic_C start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT italic_p ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ; bold_Θ )
(nc,u)𝒩×𝒰(nc,u)bnc,ue(enc,u)p(enc,u;𝚯)Qx(𝐗)Qe(𝐄)p(𝐘|𝐄,𝐗)Bx(𝐗)p(𝐗),\displaystyle\!\!\!\!\cdot\!\!\!\!\prod_{(n_{c}^{\prime},u^{\prime})\in% \mathcal{N}\times\mathcal{U}\setminus(n_{c},u)}\underbrace{b_{n_{c}^{\prime},u% ^{\prime}}^{e}(e_{n_{c}^{\prime},u^{\prime}})}_{\simeq p(e_{n_{c},u};\mathbf{% \Theta})}\underbrace{Q^{x}(\mathbf{X})Q^{e}(\mathbf{E})}_{\simeq p(\mathbf{Y}|% \mathbf{E},\mathbf{X})}\underbrace{B^{x}(\mathbf{X})}_{\simeq p(\mathbf{X})},⋅ ∏ start_POSTSUBSCRIPT ( italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∈ caligraphic_N × caligraphic_U ∖ ( italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u ) end_POSTSUBSCRIPT under⏟ start_ARG italic_b start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ; bold_Θ ) end_POSTSUBSCRIPT under⏟ start_ARG italic_Q start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) italic_Q start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( bold_E ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( bold_Y | bold_E , bold_X ) end_POSTSUBSCRIPT under⏟ start_ARG italic_B start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( bold_X ) end_ARG start_POSTSUBSCRIPT ≃ italic_p ( bold_X ) end_POSTSUBSCRIPT , (58)

where Cnc,ub,esuperscriptsubscript𝐶subscript𝑛𝑐𝑢𝑏𝑒C_{n_{c},u}^{b,e}italic_C start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT is a normalizing constant.

Following the same methodology used to derive g(xu,k|𝐘)𝑔conditionalsubscript𝑥𝑢𝑘𝐘g(x_{u,k}|\mathbf{Y})italic_g ( italic_x start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT | bold_Y ) in (47), the approximate posterior g(enc,u|𝐘)𝑔conditionalsubscript𝑒subscript𝑛𝑐𝑢𝐘g(e_{n_{c},u}|\mathbf{Y})italic_g ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | bold_Y ) are derived as

g(enc,u|𝐘)𝑔conditionalsubscript𝑒subscript𝑛𝑐𝑢𝐘\displaystyle g(e_{n_{c},u}|\mathbf{Y})\!italic_g ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | bold_Y ) proj𝚽[p(enc,u;𝚯)k𝒦qnc,u,ke(enc,u)],proportional-toabsentsubscriptproj𝚽delimited-[]𝑝subscript𝑒subscript𝑛𝑐𝑢𝚯subscriptproductsuperscript𝑘𝒦superscriptsubscript𝑞subscript𝑛𝑐𝑢superscript𝑘𝑒subscript𝑒subscript𝑛𝑐𝑢\displaystyle\propto\mathrm{proj}_{\mathbf{\Phi}}\!\left[p(e_{n_{c},u};\mathbf% {\Theta})\prod_{k^{\prime}\in\mathcal{K}}q_{n_{c},u,k^{\prime}}^{e}(e_{n_{c},u% })\right]\!,∝ roman_proj start_POSTSUBSCRIPT bold_Φ end_POSTSUBSCRIPT [ italic_p ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ; bold_Θ ) ∏ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_K end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) ] , (59)

where the mean and variance of g(enc,u|𝐘)𝑔conditionalsubscript𝑒subscript𝑛𝑐𝑢𝐘g(e_{n_{c},u}|\mathbf{Y})italic_g ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | bold_Y ) can be calculated based on the prior distribution p(enc,u;𝚯)𝑝subscript𝑒subscript𝑛𝑐𝑢𝚯p(e_{n_{c},u};\mathbf{\Theta})italic_p ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ; bold_Θ ) in (19) as

e^nc,u=σnc,uee^nc,uqσnc,ue+ξnc,uq,e,ξnc,ue=(1σnc,ue+1ξnc,uq,e)1,formulae-sequencesubscript^𝑒subscript𝑛𝑐𝑢subscriptsuperscript𝜎𝑒subscript𝑛𝑐𝑢subscriptsuperscript^𝑒𝑞subscript𝑛𝑐𝑢subscriptsuperscript𝜎𝑒subscript𝑛𝑐𝑢subscriptsuperscript𝜉𝑞𝑒subscript𝑛𝑐𝑢subscriptsuperscript𝜉𝑒subscript𝑛𝑐𝑢superscript1subscriptsuperscript𝜎𝑒subscript𝑛𝑐𝑢1subscriptsuperscript𝜉𝑞𝑒subscript𝑛𝑐𝑢1\displaystyle\hat{e}_{n_{c},u}=\frac{\sigma^{e}_{n_{c},u}\hat{e}^{q}_{n_{c},u}% }{\sigma^{e}_{n_{c},u}+\xi^{q,e}_{n_{c},u}},\ \xi^{e}_{n_{c},u}=\left(\frac{1}% {\sigma^{e}_{n_{c},u}}+\frac{1}{\xi^{q,e}_{n_{c},u}}\right)^{-1},over^ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT = divide start_ARG italic_σ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT + italic_ξ start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_ARG , italic_ξ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT = ( divide start_ARG 1 end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_ARG + divide start_ARG 1 end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT , (60)

with

e^nc,uq=ξnc,uq,e(k𝒦e^nc,u,kqξnv,u,kq,e),ξnc,uq,e=(k𝒦1ξnc,u,kq,e)1.formulae-sequencesuperscriptsubscript^𝑒subscript𝑛𝑐𝑢𝑞superscriptsubscript𝜉subscript𝑛𝑐𝑢𝑞𝑒subscriptsuperscript𝑘𝒦superscriptsubscript^𝑒subscript𝑛𝑐𝑢superscript𝑘𝑞superscriptsubscript𝜉subscript𝑛𝑣𝑢superscript𝑘𝑞𝑒superscriptsubscript𝜉subscript𝑛𝑐𝑢𝑞𝑒superscriptsubscriptsuperscript𝑘𝒦1superscriptsubscript𝜉subscript𝑛𝑐𝑢superscript𝑘𝑞𝑒1\displaystyle\hat{e}_{n_{c},u}^{q}\!=\!\xi_{n_{c},u}^{q,e}\!\left(\sum_{k^{% \prime}\in\mathcal{K}}\frac{\hat{e}_{n_{c},u,k^{\prime}}^{q}}{\xi_{n_{v},u,k^{% \prime}}^{q,e}}\!\right),\xi_{n_{c},u}^{q,e}\!\!=\!\left(\sum_{k^{\prime}\in% \mathcal{K}}\frac{1}{\xi_{n_{c},u,k^{\prime}}^{q,e}}\!\right)^{-1}\!\!\!\!\!\!% \!\!.over^ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT = italic_ξ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_K end_POSTSUBSCRIPT divide start_ARG over^ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT end_ARG start_ARG italic_ξ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT , italic_u , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT end_ARG ) , italic_ξ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT = ( ∑ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_K end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_ξ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (61)

Similarly, the approximate function vnc,u,ke(enc,u)subscriptsuperscript𝑣𝑒subscript𝑛𝑐𝑢𝑘subscript𝑒subscript𝑛𝑐𝑢v^{e}_{n_{c},u,k}(e_{n_{c},u})italic_v start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) can be derived in the same manner as (49):

vnc,u,ke(enc,u)g(enc,u|𝐘)/qnc,u,ke(enc,u),proportional-tosubscriptsuperscript𝑣𝑒subscript𝑛𝑐𝑢𝑘subscript𝑒subscript𝑛𝑐𝑢𝑔conditionalsubscript𝑒subscript𝑛𝑐𝑢𝐘superscriptsubscript𝑞subscript𝑛𝑐𝑢𝑘𝑒subscript𝑒subscript𝑛𝑐𝑢\displaystyle v^{e}_{n_{c},u,k}(e_{n_{c},u})\propto g(e_{n_{c},u}|\mathbf{Y})/% q_{n_{c},u,k}^{e}(e_{n_{c},u}),italic_v start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) ∝ italic_g ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | bold_Y ) / italic_q start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) , (62)

from which the mean and variance are respectively given by

e^nc,u,kv=ξnc,u,kv,e((ξnc,ue)1e^nc,u(ξnc,u,kq,e)1e^nc,u,kq),subscriptsuperscript^𝑒𝑣subscript𝑛𝑐𝑢𝑘subscriptsuperscript𝜉𝑣𝑒subscript𝑛𝑐𝑢𝑘superscriptsubscriptsuperscript𝜉𝑒subscript𝑛𝑐𝑢1subscript^𝑒subscript𝑛𝑐𝑢superscriptsubscriptsuperscript𝜉𝑞𝑒subscript𝑛𝑐𝑢𝑘1subscriptsuperscript^𝑒𝑞subscript𝑛𝑐𝑢𝑘\displaystyle\!\hat{e}^{v}_{n_{c},u,k}\!=\!\xi^{v,e}_{n_{c},u,k}\left(\!(\xi^{% e}_{n_{c},u})^{-1}\hat{e}_{n_{c},u}\!-\!(\xi^{q,e}_{n_{c},u,k})^{-1}\hat{e}^{q% }_{n_{c},u,k}\!\right),\!over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT = italic_ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ( ( italic_ξ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT - ( italic_ξ start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ) , (63a)
ξnc,u,kv,e=((ξnc,ue)1(ξnc,u,kq,e)1)1.subscriptsuperscript𝜉𝑣𝑒subscript𝑛𝑐𝑢𝑘superscriptsuperscriptsubscriptsuperscript𝜉𝑒subscript𝑛𝑐𝑢1superscriptsubscriptsuperscript𝜉𝑞𝑒subscript𝑛𝑐𝑢𝑘11\displaystyle\!\xi^{v,e}_{n_{c},u,k}=\left((\xi^{e}_{n_{c},u})^{-1}-(\xi^{q,e}% _{n_{c},u,k})^{-1}\right)^{-1}.italic_ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT = ( ( italic_ξ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - ( italic_ξ start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (63b)

Finally, the approximate function bnc,ue(enc,u)subscriptsuperscript𝑏𝑒subscript𝑛𝑐𝑢subscript𝑒subscript𝑛𝑐𝑢b^{e}_{n_{c},u}(e_{n_{c},u})italic_b start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) is obtained in a similar way as the derivation of (51) as

bnc,ue(enc,u)subscriptsuperscript𝑏𝑒subscript𝑛𝑐𝑢subscript𝑒subscript𝑛𝑐𝑢\displaystyle b^{e}_{n_{c},u}(e_{n_{c},u})italic_b start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) g(enc,u|𝐘)/k𝒦qnc,u,ke(enc,u),proportional-toabsent𝑔conditionalsubscript𝑒subscript𝑛𝑐𝑢𝐘subscriptproductsuperscript𝑘𝒦superscriptsubscript𝑞subscript𝑛𝑐𝑢superscript𝑘𝑒subscript𝑒subscript𝑛𝑐𝑢\displaystyle\propto g(e_{n_{c},u}|\mathbf{Y})\Big{/}\prod_{k^{\prime}\in% \mathcal{K}}q_{n_{c},u,k^{\prime}}^{e}(e_{n_{c},u}),∝ italic_g ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | bold_Y ) / ∏ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_K end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) , (64)

with the mean and variance of bnc,ue(enc,u)subscriptsuperscript𝑏𝑒subscript𝑛𝑐𝑢subscript𝑒subscript𝑛𝑐𝑢b^{e}_{n_{c},u}(e_{n_{c},u})italic_b start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) being

e^nc,ub=ξnc,ub,e(e^nc,uξnc,uee^nc,uqξnc,uq,e),ξnc,ub,e=(1ξnc,ue1ξnc,uq,e)1.formulae-sequencesubscriptsuperscript^𝑒𝑏subscript𝑛𝑐𝑢subscriptsuperscript𝜉𝑏𝑒subscript𝑛𝑐𝑢subscript^𝑒subscript𝑛𝑐𝑢subscriptsuperscript𝜉𝑒subscript𝑛𝑐𝑢subscriptsuperscript^𝑒𝑞subscript𝑛𝑐𝑢subscriptsuperscript𝜉𝑞𝑒subscript𝑛𝑐𝑢subscriptsuperscript𝜉𝑏𝑒subscript𝑛𝑐𝑢superscript1subscriptsuperscript𝜉𝑒subscript𝑛𝑐𝑢1subscriptsuperscript𝜉𝑞𝑒subscript𝑛𝑐𝑢1\displaystyle\hat{e}^{b}_{n_{c},u}\!=\!\xi^{b,e}_{n_{c},u}\left(\frac{\hat{e}_% {n_{c},u}}{\xi^{e}_{n_{c},u}}-\frac{\hat{e}^{q}_{n_{c},u}}{\xi^{q,e}_{n_{c},u}% }\right),\ \xi^{b,e}_{n_{c},u}\!=\!\left(\frac{1}{\xi^{e}_{n_{c},u}}-\frac{1}{% \xi^{q,e}_{n_{c},u}}\right)^{-1}\!\!.over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT = italic_ξ start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ( divide start_ARG over^ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_ARG - divide start_ARG over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_ARG ) , italic_ξ start_POSTSUPERSCRIPT italic_b , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT = ( divide start_ARG 1 end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_ARG - divide start_ARG 1 end_ARG start_ARG italic_ξ start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (65)

V-E Expectation Maximization for Hyper Parameter Learning

In this section, we describe the estimation method for hyper parameter set 𝚯𝚯\mathbf{\Theta}bold_Θ via the EM algorithm corresponding to M-step in (23). Using the approximate posterior g(t)(𝐄,𝐗|𝐘)superscript𝑔𝑡𝐄conditional𝐗𝐘g^{(t)}(\mathbf{E},\mathbf{X}|\mathbf{Y})italic_g start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) at the t𝑡titalic_t-th step as descrbed in Section V-C and V-D, the ELBO (𝚯,𝚯(t))𝚯superscript𝚯𝑡\mathcal{F}\left(\mathbf{\Theta},\mathbf{\Theta}^{(t)}\right)caligraphic_F ( bold_Θ , bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) in (23) can be approximated as

(𝚯,\displaystyle\mathcal{F}(\mathbf{\Theta},caligraphic_F ( bold_Θ , 𝚯(t))n𝒩u𝒰{lnσnc,ue+\displaystyle\mathbf{\Theta}^{(t)})\simeq-\sum_{n\in\mathcal{N}}\sum_{u\in% \mathcal{U}}\Big{\{}\ln\sigma^{e}_{n_{c},u}+bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) ≃ - ∑ start_POSTSUBSCRIPT italic_n ∈ caligraphic_N end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT { roman_ln italic_σ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT +
(σnc,ue)1𝔼g(t)(enc,u|𝐘)[|enc,u|2]}+const.\displaystyle(\sigma^{e}_{n_{c},u})^{-1}\mathbb{E}_{g^{(t)}(e_{n_{c},u}|% \mathbf{Y})}\left[|e_{n_{c},u}|^{2}\right]\Big{\}}+\mathrm{const}.( italic_σ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT blackboard_E start_POSTSUBSCRIPT italic_g start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | bold_Y ) end_POSTSUBSCRIPT [ | italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] } + roman_const . (66)

Since the ELBO (𝚯,𝚯(t))𝚯superscript𝚯𝑡\mathcal{F}(\mathbf{\Theta},\mathbf{\Theta}^{(t)})caligraphic_F ( bold_Θ , bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) is concave for (σn,ue)1superscriptsubscriptsuperscript𝜎𝑒𝑛𝑢1(\sigma^{e}_{n,u})^{-1}( italic_σ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , italic_u end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, the maximization problem in (23) can be solved by the first-order necessary and sufficient condition (𝚯,𝚯(t))/(σn,ue)1=𝟎𝚯superscript𝚯𝑡superscriptsuperscriptsubscript𝜎𝑛𝑢𝑒10\partial\mathcal{F}(\mathbf{\Theta},\mathbf{\Theta}^{(t)})/\partial\left(% \sigma_{n,u}^{e}\right)^{-1}=\mathbf{0}∂ caligraphic_F ( bold_Θ , bold_Θ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) / ∂ ( italic_σ start_POSTSUBSCRIPT italic_n , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = bold_0 , which derives the optimal variance σn,ue(t+1)subscriptsuperscript𝜎𝑒𝑡1𝑛𝑢\sigma^{e(t+1)}_{n,u}italic_σ start_POSTSUPERSCRIPT italic_e ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , italic_u end_POSTSUBSCRIPT at the t𝑡titalic_t-th step as

σnc,ue(t+1)subscriptsuperscript𝜎𝑒𝑡1subscript𝑛𝑐𝑢\displaystyle\sigma^{e(t+1)}_{n_{c},u}italic_σ start_POSTSUPERSCRIPT italic_e ( italic_t + 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT =𝔼g(t)(enc,u|𝐘)[|enc,u|2]=|e^nc,u(t)|2+ξnc,ue(t),absentsubscript𝔼superscript𝑔𝑡conditionalsubscript𝑒subscript𝑛𝑐𝑢𝐘delimited-[]superscriptsubscript𝑒subscript𝑛𝑐𝑢2superscriptsubscriptsuperscript^𝑒𝑡subscript𝑛𝑐𝑢2subscriptsuperscript𝜉𝑒𝑡subscript𝑛𝑐𝑢\displaystyle=\mathbb{E}_{g^{(t)}(e_{n_{c},u}|\mathbf{Y})}\left[|e_{n_{c},u}|^% {2}\right]=|\hat{e}^{(t)}_{n_{c},u}|^{2}+\xi^{e(t)}_{n_{c},u},= blackboard_E start_POSTSUBSCRIPT italic_g start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | bold_Y ) end_POSTSUBSCRIPT [ | italic_e start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = | over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_ξ start_POSTSUPERSCRIPT italic_e ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT , (67)

where e^nc,u(t)subscriptsuperscript^𝑒𝑡subscript𝑛𝑐𝑢\hat{e}^{(t)}_{n_{c},u}over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT and ξnc,ue(t)subscriptsuperscript𝜉𝑒𝑡subscript𝑛𝑐𝑢\xi^{e(t)}_{n_{c},u}italic_ξ start_POSTSUPERSCRIPT italic_e ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT are the approximate posterior mean and variance at the t𝑡titalic_t-th step, calculated in (60).

V-F Reinforcement for the Model-Based Estimate

To further improve the convergence performance for the EP algorithm, we update the model-based estimate 𝐒^^𝐒\hat{\mathbf{S}}over^ start_ARG bold_S end_ARG in each iteration. Using the estimated residual channel error 𝐄^(t1)𝔼g(t1)(𝐄,𝐗|𝐘)[𝐄]superscript^𝐄𝑡1subscript𝔼superscript𝑔𝑡1𝐄conditional𝐗𝐘delimited-[]𝐄\hat{\mathbf{E}}^{(t-1)}\triangleq\mathbb{E}_{g^{(t-1)}(\mathbf{E},\mathbf{X}|% \mathbf{Y})}[\mathbf{E}]over^ start_ARG bold_E end_ARG start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT ≜ blackboard_E start_POSTSUBSCRIPT italic_g start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT ( bold_E , bold_X | bold_Y ) end_POSTSUBSCRIPT [ bold_E ] at the (t1)𝑡1(t-1)( italic_t - 1 )-th iteration, the channel estimate for the u𝑢uitalic_u-th UE can be reconstructed as

𝐡^u(t1)=𝐬^u(t1)+𝐞^u(t1).subscriptsuperscript^𝐡𝑡1𝑢subscriptsuperscript^𝐬𝑡1𝑢subscriptsuperscript^𝐞𝑡1𝑢\displaystyle\hat{\mathbf{h}}^{(t-1)}_{u}=\hat{\mathbf{s}}^{(t-1)}_{u}+\hat{% \mathbf{e}}^{(t-1)}_{u}.over^ start_ARG bold_h end_ARG start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = over^ start_ARG bold_s end_ARG start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT + over^ start_ARG bold_e end_ARG start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT . (68)

The model-based estimate at the t𝑡titalic_t-th iteration 𝐬^u(t)superscriptsubscript^𝐬𝑢𝑡\hat{\mathbf{s}}_{u}^{(t)}over^ start_ARG bold_s end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT is updated with the channel estimate at the previous iteration 𝐡^u(t1)superscriptsubscript^𝐡𝑢𝑡1\hat{\mathbf{h}}_{u}^{(t-1)}over^ start_ARG bold_h end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT in (68). To efficiently estimate 𝐬^u(t)superscriptsubscript^𝐬𝑢𝑡\hat{\mathbf{s}}_{u}^{(t)}over^ start_ARG bold_s end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT by leveraging the near-field sparsity, the virtual channel representation with polar grids as described in Section IV-A are utilized. The grids are dynamically designed in the iterations, where the center of the grids is set as the angle and distance estimates at the previous iteration, and the range of grids decreases with the number of iterations. Thus, the angle and distance grids for the u𝑢uitalic_u-th UE and l𝑙litalic_l-th path at the t𝑡titalic_t-th iteration are designed as

θ~u,l,gθ(t)[θ^u,l(t1)σθ(t),θ^u,l(t1)+σθ(t)],superscriptsubscript~𝜃𝑢𝑙subscript𝑔𝜃𝑡superscriptsubscript^𝜃𝑢𝑙𝑡1superscriptsubscript𝜎𝜃𝑡superscriptsubscript^𝜃𝑢𝑙𝑡1superscriptsubscript𝜎𝜃𝑡\displaystyle\tilde{\theta}_{u,l,g_{\theta}}^{(t)}\!\in\!\left[\hat{\theta}_{u% ,l}^{(t-1)}\!-\!\sigma_{\theta}^{(t)},\ \hat{\theta}_{u,l}^{(t-1)}\!+\!\sigma_% {\theta}^{(t)}\right],over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l , italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ∈ [ over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT - italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT + italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ] , (69a)
r~u,l,gr(t)[r^u,l(t1)σr(t),r^u,l(t1)+σr(t)],superscriptsubscript~𝑟𝑢𝑙subscript𝑔𝑟𝑡superscriptsubscript^𝑟𝑢𝑙𝑡1superscriptsubscript𝜎𝑟𝑡superscriptsubscript^𝑟𝑢𝑙𝑡1superscriptsubscript𝜎𝑟𝑡\displaystyle\tilde{r}_{u,l,g_{r}}^{(t)}\!\!\in\!\left[\hat{r}_{u,l}^{(t-1)}\!% -\!\sigma_{r}^{(t)},\ \hat{r}_{u,l}^{(t-1)}\!+\!\sigma_{r}^{(t)}\right],over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l , italic_g start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ∈ [ over^ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT - italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over^ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT + italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ] , (69b)

with gθ{1,,G¯θ}subscript𝑔𝜃1subscript¯𝐺𝜃g_{\theta}\!\in\!\{1,\ldots,\bar{G}_{\theta}\}italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ∈ { 1 , … , over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT }, gr{1,,G¯r}subscript𝑔𝑟1subscript¯𝐺𝑟g_{r}\!\in\!\{1,\ldots,\bar{G}_{r}\}italic_g start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ∈ { 1 , … , over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT }. θ^u,l(t1)superscriptsubscript^𝜃𝑢𝑙𝑡1\hat{\theta}_{u,l}^{(t-1)}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT and r^u,l(t1)superscriptsubscript^𝑟𝑢𝑙𝑡1\hat{r}_{u,l}^{(t-1)}over^ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT are the angle and distance estimates at the (t1)𝑡1(t-1)( italic_t - 1 )-th iteration, respectively, and σθ(t)superscriptsubscript𝜎𝜃𝑡\sigma_{\theta}^{(t)}italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and σr(t)superscriptsubscript𝜎𝑟𝑡\sigma_{r}^{(t)}italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT are, respectively, the range of angle and distance grids, where the initial values θ^u,l(0)superscriptsubscript^𝜃𝑢𝑙0\hat{\theta}_{u,l}^{(0)}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT and r^u,l(0)superscriptsubscript^𝑟𝑢𝑙0\hat{r}_{u,l}^{(0)}over^ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT are determined using the angle and distance estimates obtained by the initial channel estimation as shown in Algorithm 1.

Note that the range of angle and distance grids σθ(t)superscriptsubscript𝜎𝜃𝑡\sigma_{\theta}^{(t)}italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and σr(t)superscriptsubscript𝜎𝑟𝑡\sigma_{r}^{(t)}italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT are respectively designed by a monotonically decreasing function such as σθ(t)=aθexp(t/2)+bθsuperscriptsubscript𝜎𝜃𝑡subscript𝑎𝜃𝑡2subscript𝑏𝜃\sigma_{\theta}^{(t)}=a_{\theta}\exp(-t/2)+b_{\theta}italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = italic_a start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT roman_exp ( - italic_t / 2 ) + italic_b start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT and σr(t)=arexp(t/2)+brsuperscriptsubscript𝜎𝑟𝑡subscript𝑎𝑟𝑡2subscript𝑏𝑟\sigma_{r}^{(t)}=a_{r}\exp(-t/2)+b_{r}italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = italic_a start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT roman_exp ( - italic_t / 2 ) + italic_b start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , where the constant values {aθ,bθ,ar,br}subscript𝑎𝜃subscript𝑏𝜃subscript𝑎𝑟subscript𝑏𝑟\{a_{\theta},b_{\theta},a_{r},b_{r}\}{ italic_a start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , italic_b start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT } are uniquely determined with the desired range σθ(1)superscriptsubscript𝜎𝜃1\sigma_{\theta}^{(1)}italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT, σr(1)superscriptsubscript𝜎𝑟1\sigma_{r}^{(1)}italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT, σθ(T)superscriptsubscript𝜎𝜃𝑇\sigma_{\theta}^{(T)}italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT, and σr(T)superscriptsubscript𝜎𝑟𝑇\sigma_{r}^{(T)}italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT. Accordingly, the sets of angle and distance grids for the u𝑢uitalic_u-th UE are defined as 𝜽~u,l(t){θ~u,l,gθ(t)}gθ=1G¯θsuperscriptsubscript~𝜽𝑢𝑙𝑡superscriptsubscriptsuperscriptsubscript~𝜃𝑢𝑙subscript𝑔𝜃𝑡subscript𝑔𝜃1subscript¯𝐺𝜃\tilde{\bm{\theta}}_{u,l}^{(t)}\triangleq\{\tilde{{\theta}}_{u,l,g_{\theta}}^{% (t)}\}_{g_{\theta}=1}^{\bar{G}_{\theta}}over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ≜ { over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l , italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, 𝜽~u(t){𝜽~u,l(t)}l=1L^usuperscriptsubscript~𝜽𝑢𝑡superscriptsubscriptsuperscriptsubscript~𝜽𝑢𝑙𝑡𝑙1subscript^𝐿𝑢\tilde{\bm{\theta}}_{u}^{(t)}\triangleq\{\tilde{{\bm{\theta}}}_{u,l}^{(t)}\}_{% l=1}^{\hat{L}_{u}}over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ≜ { over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, 𝐫~u,l(t){r~u,l,gr(t)}gr=1G¯rsuperscriptsubscript~𝐫𝑢𝑙𝑡superscriptsubscriptsuperscriptsubscript~𝑟𝑢𝑙subscript𝑔𝑟𝑡subscript𝑔𝑟1subscript¯𝐺𝑟\tilde{\mathbf{r}}_{u,l}^{(t)}\triangleq\{\tilde{{r}}_{u,l,g_{r}}^{(t)}\}_{g_{% r}=1}^{\bar{G}_{r}}over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ≜ { over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l , italic_g start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_g start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, and 𝐫~u(t){𝐫~u,l(t)}l=1L^usuperscriptsubscript~𝐫𝑢𝑡superscriptsubscriptsuperscriptsubscript~𝐫𝑢𝑙𝑡𝑙1subscript^𝐿𝑢\tilde{\mathbf{r}}_{u}^{(t)}\triangleq\{\tilde{{\mathbf{r}}}_{u,l}^{(t)}\}_{l=% 1}^{\hat{L}_{u}}over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ≜ { over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT.

Using the angle and distance grids in (69a)-(69b), the polar-domain dictionary matrix for the u𝑢uitalic_u-th UE is designed as

𝐀~u(𝜽~u(t),𝐫~u(t))=[𝐀~u,1(𝜽~u,1(t),𝐫~u,1(t)),,𝐀~u,L^u(𝜽~u,L^u(t),𝐫~u,L^u(t))],subscript~𝐀𝑢superscriptsubscript~𝜽𝑢𝑡superscriptsubscript~𝐫𝑢𝑡matrixsubscript~𝐀𝑢1superscriptsubscript~𝜽𝑢1𝑡superscriptsubscript~𝐫𝑢1𝑡subscript~𝐀𝑢subscript^𝐿𝑢superscriptsubscript~𝜽𝑢subscript^𝐿𝑢𝑡superscriptsubscript~𝐫𝑢subscript^𝐿𝑢𝑡\displaystyle\tilde{\mathbf{A}}_{u}(\tilde{\bm{\theta}}_{u}^{(t)},\tilde{% \mathbf{r}}_{u}^{(t)})=\begin{bmatrix}\tilde{\mathbf{A}}_{u,1}(\tilde{\bm{% \theta}}_{u,1}^{(t)},\tilde{\mathbf{r}}_{u,1}^{(t)}),\ldots,\tilde{\mathbf{A}}% _{u,\hat{L}_{u}}(\tilde{\bm{\theta}}_{u,\hat{L}_{u}}^{(t)},\tilde{\mathbf{r}}_% {u,\hat{L}_{u}}^{(t)})\end{bmatrix},over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) = [ start_ARG start_ROW start_CELL over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_u , 1 end_POSTSUBSCRIPT ( over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) , … , over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_u , over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u , over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u , over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) end_CELL end_ROW end_ARG ] , (70)

where 𝐀~u,l(𝜽~u,l(t),𝐫~u,l(t))N×G¯θG¯rsubscript~𝐀𝑢𝑙superscriptsubscript~𝜽𝑢𝑙𝑡superscriptsubscript~𝐫𝑢𝑙𝑡superscript𝑁subscript¯𝐺𝜃subscript¯𝐺𝑟\tilde{\mathbf{A}}_{u,l}(\tilde{\bm{\theta}}_{u,l}^{(t)},\tilde{\mathbf{r}}_{u% ,l}^{(t)})\in\mathbb{C}^{N\times\bar{G}_{\theta}\bar{G}_{r}}over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ( over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the virtual array response for the u𝑢uitalic_u-th UE and l𝑙litalic_l-th path defined as

𝐀~u,l(𝜽~u,l(t),𝐫~u,l(t))=[𝐚~(θ~u,l,1(t),r~u,l,1(t)),,𝐚~(θ~u,l,Gθ(t),r~u,l,Gr(t))].subscript~𝐀𝑢𝑙superscriptsubscript~𝜽𝑢𝑙𝑡superscriptsubscript~𝐫𝑢𝑙𝑡matrix~𝐚superscriptsubscript~𝜃𝑢𝑙1𝑡superscriptsubscript~𝑟𝑢𝑙1𝑡~𝐚superscriptsubscript~𝜃𝑢𝑙subscript𝐺𝜃𝑡superscriptsubscript~𝑟𝑢𝑙subscript𝐺𝑟𝑡\displaystyle\tilde{\mathbf{A}}_{u,l}(\tilde{\bm{\theta}}_{u,l}^{(t)},\tilde{% \mathbf{r}}_{u,l}^{(t)})=\begin{bmatrix}\tilde{\mathbf{a}}(\tilde{{\theta}}_{u% ,l,1}^{(t)},\tilde{r}_{u,l,1}^{(t)}),\ldots,\tilde{\mathbf{a}}(\tilde{{\theta}% }_{u,l,G_{\theta}}^{(t)},\tilde{r}_{u,l,G_{r}}^{(t)})\end{bmatrix}.over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ( over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) = [ start_ARG start_ROW start_CELL over~ start_ARG bold_a end_ARG ( over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) , … , over~ start_ARG bold_a end_ARG ( over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l , italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l , italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) end_CELL end_ROW end_ARG ] .

Through the virtual channel representation with the dictionary matrix 𝐀~u(𝜽~u(t),𝐫~u(t))subscript~𝐀𝑢superscriptsubscript~𝜽𝑢𝑡superscriptsubscript~𝐫𝑢𝑡\tilde{\mathbf{A}}_{u}(\tilde{\bm{\theta}}_{u}^{(t)},\tilde{\mathbf{r}}_{u}^{(% t)})over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ), the near-field channel for the u𝑢uitalic_u-th UE can be expressed as

𝐡u=l=1L^𝐀~u,l(𝜽u,l(t),𝐫~u,l(t))𝐳~u,l=𝐀~u(𝜽~u(t),𝐫~u(t))𝐳~u,subscript𝐡𝑢superscriptsubscript𝑙1^𝐿subscript~𝐀𝑢𝑙superscriptsubscript𝜽𝑢𝑙𝑡superscriptsubscript~𝐫𝑢𝑙𝑡subscript~𝐳𝑢𝑙subscript~𝐀𝑢superscriptsubscript~𝜽𝑢𝑡superscriptsubscript~𝐫𝑢𝑡subscript~𝐳𝑢\displaystyle\mathbf{h}_{u}=\sum_{l=1}^{\hat{L}}\tilde{\mathbf{A}}_{u,l}(\bm{% \theta}_{u,l}^{(t)},\tilde{\mathbf{r}}_{u,l}^{(t)})\tilde{\mathbf{z}}_{u,l}=% \tilde{\mathbf{A}}_{u}(\tilde{\bm{\theta}}_{u}^{(t)},\tilde{\mathbf{r}}_{u}^{(% t)})\tilde{\mathbf{z}}_{u},bold_h start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG end_POSTSUPERSCRIPT over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ( bold_italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) over~ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT = over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) over~ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , (71)

where 𝐳~u,lG¯θG¯r×1subscript~𝐳𝑢𝑙superscriptsubscript¯𝐺𝜃subscript¯𝐺𝑟1\tilde{\mathbf{z}}_{u,l}\in\mathbb{C}^{\bar{G}_{\theta}\bar{G}_{r}\times 1}over~ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT is the virtual path gain vector for the l𝑙litalic_l-th path, and 𝐳~u=[𝐳~u,1T,,𝐳~u,L^uT]TG¯θG¯rL^u×1subscript~𝐳𝑢superscriptsuperscriptsubscript~𝐳𝑢1Tsuperscriptsubscript~𝐳𝑢subscript^𝐿𝑢TTsuperscriptsubscript¯𝐺𝜃subscript¯𝐺𝑟subscript^𝐿𝑢1\tilde{\mathbf{z}}_{u}=[\tilde{\mathbf{z}}_{u,1}^{\mathrm{T}},\ldots,\tilde{% \mathbf{z}}_{u,\hat{L}_{u}}^{\mathrm{T}}]^{\mathrm{T}}\in\mathbb{C}^{\bar{G}_{% \theta}\bar{G}_{r}\hat{L}_{u}\times 1}over~ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = [ over~ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT , … , over~ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u , over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT is the virtual path gain vector including all paths.

In light of the near-field model in (71), an update of the model-based estimate 𝐬^u(t)subscriptsuperscript^𝐬𝑡𝑢\hat{\mathbf{s}}^{(t)}_{u}over^ start_ARG bold_s end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT can be obtained by

𝐬^u(t)=𝐀(𝜽^u(t),𝐫^u(t))𝐳^u(t),superscriptsubscript^𝐬𝑢𝑡𝐀superscriptsubscript^𝜽𝑢𝑡superscriptsubscript^𝐫𝑢𝑡superscriptsubscript^𝐳𝑢𝑡\displaystyle\hat{\mathbf{s}}_{u}^{(t)}=\mathbf{A}(\hat{\bm{\theta}}_{u}^{(t)}% ,\hat{\mathbf{r}}_{u}^{(t)})\hat{\mathbf{z}}_{u}^{(t)},over^ start_ARG bold_s end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = bold_A ( over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over^ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) over^ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , (72)

with 𝐳^u(t)superscriptsubscript^𝐳𝑢𝑡\hat{\mathbf{z}}_{u}^{(t)}over^ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT denoting the path gain estimates, 𝐀(𝜽^u(t),𝐫^u(t))𝐀superscriptsubscript^𝜽𝑢𝑡superscriptsubscript^𝐫𝑢𝑡\mathbf{A}(\hat{\bm{\theta}}_{u}^{(t)},\hat{\mathbf{r}}_{u}^{(t)})bold_A ( over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over^ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) being the corresponding array responses, which can be computed by solving

minimize𝐳u~~subscript𝐳𝑢minimize\displaystyle\underset{\tilde{\mathbf{z}_{u}}}{\text{minimize}}start_UNDERACCENT over~ start_ARG bold_z start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_ARG end_UNDERACCENT start_ARG minimize end_ARG 𝐡^u(t1)𝐬^u(t)22superscriptsubscriptnormsuperscriptsubscript^𝐡𝑢𝑡1subscriptsuperscript^𝐬𝑡𝑢22\displaystyle\ \ \left\|\hat{\mathbf{h}}_{u}^{(t-1)}-\hat{\mathbf{s}}^{(t)}_{u% }\right\|_{2}^{2}∥ over^ start_ARG bold_h end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT - over^ start_ARG bold_s end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
subject to 𝐬^u(t)=𝐀~u(𝜽~u(t),𝐫~u(t))𝐳~usubscriptsuperscript^𝐬𝑡𝑢subscript~𝐀𝑢superscriptsubscript~𝜽𝑢𝑡superscriptsubscript~𝐫𝑢𝑡subscript~𝐳𝑢\displaystyle\ \ \ \hat{\mathbf{s}}^{(t)}_{u}=\tilde{\mathbf{A}}_{u}(\tilde{% \bm{\theta}}_{u}^{(t)},\tilde{\mathbf{r}}_{u}^{(t)})\tilde{\mathbf{z}}_{u}over^ start_ARG bold_s end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) over~ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT
𝐳~u,l0=1,l{1,2,,L^u}.formulae-sequencesubscriptnormsubscript~𝐳𝑢𝑙01for-all𝑙12subscript^𝐿𝑢\displaystyle\ \ \left\|\tilde{\mathbf{z}}_{u,l}\right\|_{0}=1,\ \ \forall l% \in\{1,2,\ldots,\hat{L}_{u}\}.∥ over~ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 1 , ∀ italic_l ∈ { 1 , 2 , … , over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT } . (73)

To summarize, the proposed algorithm is encapsulated in Algorithm 2, where a dam** scheme [16] is introduced in line 6, 7, 12, and 18 to enhance convergence performance.

Algorithm 2 Proposed JCDE algorithm
1:Input: 𝐘,𝐗p,𝐇0𝒮,{𝜽^u,𝐫^u}u𝒰,{L^u}u=1U,T,G¯θ,G¯r,𝐘subscript𝐗psubscriptsuperscript𝐇𝒮0subscriptsubscript^𝜽𝑢subscript^𝐫𝑢𝑢𝒰superscriptsubscriptsubscript^𝐿𝑢𝑢1𝑈𝑇subscript¯𝐺𝜃subscript¯𝐺𝑟\mathbf{Y},\ \mathbf{X}_{\mathrm{p}},\ \mathbf{H}^{\mathcal{S}}_{0},\ \{\hat{% \bm{\theta}}_{u},\hat{\mathbf{r}}_{u}\}_{u\in\mathcal{U}},\ \{\hat{L}_{u}\}_{u% =1}^{U},\ T,\ \bar{G}_{\theta},\ \bar{G}_{r},bold_Y , bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT , bold_H start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , { over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over^ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT , { over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_u = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_U end_POSTSUPERSCRIPT , italic_T , over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT , over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , σθ(1),σθ(T),σr(1),σr(T)superscriptsubscript𝜎𝜃1superscriptsubscript𝜎𝜃𝑇superscriptsubscript𝜎𝑟1superscriptsubscript𝜎𝑟𝑇\sigma_{\theta}^{(1)},\ \sigma_{\theta}^{(T)},\ \sigma_{r}^{(1)},\ \sigma_{r}^% {(T)}italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT , italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT , italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT , italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT
2:Output: 𝐗^,𝐇^,{𝜽^u,𝐫^u}u𝒰^𝐗^𝐇subscriptsubscript^𝜽𝑢subscript^𝐫𝑢𝑢𝒰\hat{\mathbf{X}},\ \hat{\mathbf{H}},\ \{\hat{\bm{\theta}}_{u},\ \hat{\mathbf{r% }}_{u}\}_{u\in\mathcal{U}}over^ start_ARG bold_X end_ARG , over^ start_ARG bold_H end_ARG , { over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over^ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT
3:
4:// Initialization
5:𝐒^=𝐃N𝐇0𝒮^𝐒subscript𝐃𝑁superscriptsubscript𝐇0𝒮\hat{\mathbf{S}}=\mathbf{D}_{N}\mathbf{H}_{0}^{\mathcal{S}}over^ start_ARG bold_S end_ARG = bold_D start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT bold_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT from Algorithm 1
6:x^c,u,kdv=x^c,u,kdw=0subscriptsuperscript^𝑥𝑣𝑐𝑢subscript𝑘dsubscriptsuperscript^𝑥𝑤𝑐𝑢subscript𝑘d0\hat{x}^{v}_{c,u,k_{\mathrm{d}}}\!=\hat{x}^{w}_{c,u,k_{\mathrm{d}}}\!=0over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT = over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT = 0ξc,u,kdv,x=ξc,u,kdw,x=Essubscriptsuperscript𝜉𝑣𝑥𝑐𝑢subscript𝑘dsubscriptsuperscript𝜉𝑤𝑥𝑐𝑢subscript𝑘dsubscript𝐸𝑠\xi^{v,x}_{c,u,k_{\mathrm{d}}}\!=\xi^{w,x}_{c,u,k_{\mathrm{d}}}\!=E_{s}italic_ξ start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_ξ start_POSTSUPERSCRIPT italic_w , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_E start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, kd𝒦dfor-allsubscript𝑘dsubscript𝒦d\forall k_{\mathrm{d}}\in\mathcal{K}_{\mathrm{d}}∀ italic_k start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT ∈ caligraphic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT
7:x^c,u,kpv=x^c,u,kpw=[𝐗p]u,kpsubscriptsuperscript^𝑥𝑣𝑐𝑢subscript𝑘psubscriptsuperscript^𝑥𝑤𝑐𝑢subscript𝑘psubscriptdelimited-[]subscript𝐗p𝑢subscript𝑘𝑝\hat{x}^{v}_{c,u,k_{\mathrm{p}}}\!\!\!\!=\hat{x}^{w}_{c,u,k_{\mathrm{p}}}\!\!% \!\!=[\mathbf{X}_{\mathrm{p}}]_{u,k_{p}}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT = over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT = [ bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_u , italic_k start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPTξc,u,kpv,x=ξc,u,kpw,x=0subscriptsuperscript𝜉𝑣𝑥𝑐𝑢subscript𝑘psubscriptsuperscript𝜉𝑤𝑥𝑐𝑢subscript𝑘p0\xi^{v,x}_{c,u,k_{\mathrm{p}}}\!\!\!\!=\xi^{w,x}_{c,u,k_{\mathrm{p}}}\!\!\!\!=0italic_ξ start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_ξ start_POSTSUPERSCRIPT italic_w , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT = 0, kp𝒦pfor-allsubscript𝑘psubscript𝒦p\forall k_{\mathrm{p}}\in\mathcal{K}_{\mathrm{p}}∀ italic_k start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ∈ caligraphic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT
8:e^nc,u,kv=0subscriptsuperscript^𝑒𝑣subscript𝑛𝑐𝑢𝑘0\hat{e}^{v}_{n_{c},u,k}=0over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT = 0ξnc,u,kv,e=|[𝐃N𝐇0𝒮]nc,u|2subscriptsuperscript𝜉𝑣𝑒subscript𝑛𝑐𝑢𝑘superscriptsubscriptdelimited-[]subscript𝐃𝑁superscriptsubscript𝐇0𝒮subscript𝑛𝑐𝑢2\xi^{v,e}_{n_{c},u,k}=|[\mathbf{D}_{N}\mathbf{H}_{0}^{\mathcal{S}}]_{n_{c},u}|% ^{2}italic_ξ start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT = | [ bold_D start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT bold_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_S end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, k𝒦for-all𝑘𝒦\forall k\in\mathcal{K}∀ italic_k ∈ caligraphic_K
9:for t=1,2,,T𝑡12𝑇t=1,2,\ldots,Titalic_t = 1 , 2 , … , italic_T do
10: // EP for data estimation
11:     Calculate 𝐲~c,u,kx,𝛀c,kxsubscriptsuperscript~𝐲𝑥𝑐𝑢𝑘subscriptsuperscript𝛀𝑥𝑐𝑘\tilde{\mathbf{y}}^{x}_{c,u,k},\ \mathbf{\Omega}^{x}_{c,k}over~ start_ARG bold_y end_ARG start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT , bold_Ω start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_k end_POSTSUBSCRIPT from (38a)-(38)
12:     Calculate x^c,u,kq,ξc,u,kq,xsubscriptsuperscript^𝑥𝑞𝑐𝑢𝑘subscriptsuperscript𝜉𝑞𝑥𝑐𝑢𝑘\hat{x}^{q}_{c,u,k},\ \xi^{q,x}_{c,u,k}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT from (40a)-(40b)
13:     Calculate x^u,kq,ξu,kq,xsuperscriptsubscript^𝑥𝑢𝑘𝑞superscriptsubscript𝜉𝑢𝑘𝑞𝑥\hat{x}_{u,k}^{q},\ \xi_{u,k}^{q,x}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_x end_POSTSUPERSCRIPT from (46)
14:     Calculate x^u,k,ξu,kxsubscript^𝑥𝑢𝑘subscriptsuperscript𝜉𝑥𝑢𝑘\hat{x}_{u,k},\ \xi^{x}_{u,k}over^ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u , italic_k end_POSTSUBSCRIPT from (48a)-(48b)
15:     Calculate x^c,u,kv,ξ^c,u,kv,xsubscriptsuperscript^𝑥𝑣𝑐𝑢𝑘subscriptsuperscript^𝜉𝑣𝑥𝑐𝑢𝑘\hat{x}^{v}_{c,u,k},\ \hat{\xi}^{v,x}_{c,u,k}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT , over^ start_ARG italic_ξ end_ARG start_POSTSUPERSCRIPT italic_v , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT from (50) with dam**
16:     Calculate x^c,u,kw,ξ^c,u,kw,xsubscriptsuperscript^𝑥𝑤𝑐𝑢𝑘subscriptsuperscript^𝜉𝑤𝑥𝑐𝑢𝑘\hat{x}^{w}_{c,u,k},\ \hat{\xi}^{w,x}_{c,u,k}over^ start_ARG italic_x end_ARG start_POSTSUPERSCRIPT italic_w end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT , over^ start_ARG italic_ξ end_ARG start_POSTSUPERSCRIPT italic_w , italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_c , italic_u , italic_k end_POSTSUBSCRIPT from (56c) with dam**
17: // EP for residual error estimation
18:     Calculate y~nc,u,ke,ϕnc,u,kesubscriptsuperscript~𝑦𝑒subscript𝑛𝑐𝑢𝑘subscriptsuperscriptitalic-ϕ𝑒subscript𝑛𝑐𝑢𝑘\tilde{y}^{e}_{n_{c},u,k},\ \phi^{e}_{n_{c},u,k}over~ start_ARG italic_y end_ARG start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT , italic_ϕ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT from (56a)-(56)
19:     Calculate e^nc,u,kq,ξnc,u,kq,esubscriptsuperscript^𝑒𝑞subscript𝑛𝑐𝑢𝑘subscriptsuperscript𝜉𝑞𝑒subscript𝑛𝑐𝑢𝑘\hat{e}^{q}_{n_{c},u,k},\ \xi^{q,e}_{n_{c},u,k}over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT from (55)
20:     Calculate e^nc,uq,e,ξnc,uq,esuperscriptsubscript^𝑒subscript𝑛𝑐𝑢𝑞𝑒superscriptsubscript𝜉subscript𝑛𝑐𝑢𝑞𝑒\hat{e}_{n_{c},u}^{q,e},\ \xi_{n_{c},u}^{q,e}over^ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_q , italic_e end_POSTSUPERSCRIPT from (61)
21:     Calculate e^nc,u,ξnc,uesubscript^𝑒subscript𝑛𝑐𝑢subscriptsuperscript𝜉𝑒subscript𝑛𝑐𝑢\hat{e}_{n_{c},u},\ \xi^{e}_{n_{c},u}over^ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT , italic_ξ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT from (60)
22:     Calculate e^nc,u,kv,ξ^nc,u,kv,esubscriptsuperscript^𝑒𝑣subscript𝑛𝑐𝑢𝑘subscriptsuperscript^𝜉𝑣𝑒subscript𝑛𝑐𝑢𝑘\hat{e}^{v}_{n_{c},u,k},\hat{\xi}^{v,e}_{n_{c},u,k}over^ start_ARG italic_e end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT , over^ start_ARG italic_ξ end_ARG start_POSTSUPERSCRIPT italic_v , italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u , italic_k end_POSTSUBSCRIPT from (63a)-(63b) with dam**
23:     Update channel estimate 𝐡^u=𝐬^u+𝐞^usubscript^𝐡𝑢subscript^𝐬𝑢subscript^𝐞𝑢\hat{\mathbf{h}}_{u}=\hat{\mathbf{s}}_{u}+\hat{\mathbf{e}}_{u}over^ start_ARG bold_h end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = over^ start_ARG bold_s end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT + over^ start_ARG bold_e end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT from (68)
24: // EM algorithm for hyper parameter learning
25:     Update σnc,uesubscriptsuperscript𝜎𝑒subscript𝑛𝑐𝑢\sigma^{e}_{n_{c},u}italic_σ start_POSTSUPERSCRIPT italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT , italic_u end_POSTSUBSCRIPT from (67)
26: // Reinforcement for the model-based estimate
27:     Generate the grids θ~u,l,gθ,r~u,l,grsubscript~𝜃𝑢𝑙subscript𝑔𝜃subscript~𝑟𝑢𝑙subscript𝑔𝑟\tilde{\theta}_{u,l,g_{\theta}},\ \tilde{r}_{u,l,g_{r}}over~ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u , italic_l , italic_g start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT end_POSTSUBSCRIPT , over~ start_ARG italic_r end_ARG start_POSTSUBSCRIPT italic_u , italic_l , italic_g start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT from (69a)-(69b)
28:     Design the dictionary 𝐀~u(𝜽~u,𝐫~u)subscript~𝐀𝑢subscript~𝜽𝑢subscript~𝐫𝑢\tilde{\mathbf{A}}_{u}(\tilde{\bm{\theta}}_{u},\tilde{\mathbf{r}}_{u})over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( over~ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over~ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) from (70)
29:     Obtain 𝜽^u,𝐫^u,𝐳^usubscript^𝜽𝑢subscript^𝐫𝑢subscript^𝐳𝑢\hat{\bm{\theta}}_{u},\hat{\mathbf{r}}_{u},\hat{\mathbf{z}}_{u}over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over^ start_ARG bold_r end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT , over^ start_ARG bold_z end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT by solving (V-F)
30:     Calculate 𝐬^usubscript^𝐬𝑢\hat{\mathbf{s}}_{u}over^ start_ARG bold_s end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT from (72) with dam**
31:end for

VI Simulation Results

This section evaluates the performance of the proposed initial channel estimation and subsequent JCDE algorithms under the following setup. The carrier frequency is 100GHz100GHz100\ \mathrm{GHz}100 roman_GHz, the number of BS antennas N𝑁Nitalic_N is 200200200200, the number of UEs U𝑈Uitalic_U is 50505050, the modulation order Q𝑄Qitalic_Q is 64646464-QAM, and the length of pilots Kpsubscript𝐾pK_{\mathrm{p}}italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT and data Kdsubscript𝐾dK_{\mathrm{d}}italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT are 25252525 and 100100100100, respectively. The non-orthogonal pilot 𝐗p50×25subscript𝐗psuperscript5025\mathbf{X}_{\mathrm{p}}\in\mathbb{C}^{50\times 25}bold_X start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT 50 × 25 end_POSTSUPERSCRIPT is designed by the frame design method in [15]. The near-field channel is composed of Lu=3subscript𝐿𝑢3L_{u}=3italic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = 3 paths, i.e., 1111 LoS path and 2222 NLoS paths, with a Rician K𝐾Kitalic_K-factor of 10 dB. The total number of paths is L=50×3=150𝐿503150L\!=\!50\!\times\!3\!=\!150italic_L = 50 × 3 = 150, and the corresponding oversampling quantity used in Algorithm 1 is set to L^=250^𝐿250\hat{L}=250over^ start_ARG italic_L end_ARG = 250. The AoAs and distances are uniformly randomly generated in the range θu,l[60,60]subscript𝜃𝑢𝑙superscript60superscript60\theta_{u,l}\in[-60^{\circ},60^{\circ}]italic_θ start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ∈ [ - 60 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 60 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT ] and ru,l[1,10]subscript𝑟𝑢𝑙110r_{u,l}\in[1,10]italic_r start_POSTSUBSCRIPT italic_u , italic_l end_POSTSUBSCRIPT ∈ [ 1 , 10 ] m, respectively. The polar-domain dictionary 𝐀~(𝜽~,𝐫~)~𝐀~𝜽~𝐫\tilde{\mathbf{A}}(\tilde{\bm{\theta}},\tilde{\mathbf{r}})over~ start_ARG bold_A end_ARG ( over~ start_ARG bold_italic_θ end_ARG , over~ start_ARG bold_r end_ARG ) in (IV-A) is designed with Gr=7subscript𝐺𝑟7G_{r}=7italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = 7, Gθ=395subscript𝐺𝜃395G_{\theta}=395italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT = 395 and desired coherence γd=0.6subscript𝛾d0.6\gamma_{\mathrm{d}}=0.6italic_γ start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT = 0.6 in [10]. The performance is evaluated by the normalized mean-squared error (NMSE) and bit error rate (BER) under various signal-to-noise ratio (SNR). NMSE and SNR are defined as NMSE(𝐇)𝔼[𝐇𝐇^F2/𝐇F2]NMSE𝐇𝔼delimited-[]superscriptsubscriptnorm𝐇^𝐇F2superscriptsubscriptnorm𝐇F2\mathrm{NMSE}(\bm{\mathbf{H}})\triangleq\mathbb{E}\left[{\|\bm{\mathbf{H}}-% \hat{\bm{\mathbf{H}}}\|_{\mathrm{F}}^{2}}/{\|\bm{\mathbf{H}}\|_{\mathrm{F}}^{2% }}\right]roman_NMSE ( bold_H ) ≜ blackboard_E [ ∥ bold_H - over^ start_ARG bold_H end_ARG ∥ start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ∥ bold_H ∥ start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ], and SNR𝔼[𝐇𝐗F2]/𝔼[𝐍F2]SNR𝔼delimited-[]superscriptsubscriptnorm𝐇𝐗F2𝔼delimited-[]superscriptsubscriptnorm𝐍F2\mathrm{SNR}\triangleq\mathbb{E}\left[\|\mathbf{HX}\|_{\mathrm{F}}^{2}\right]/% \mathbb{E}\left[\|\mathbf{N}\|_{\mathrm{F}}^{2}\right]roman_SNR ≜ blackboard_E [ ∥ bold_HX ∥ start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] / blackboard_E [ ∥ bold_N ∥ start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ]. In what follows, the initial channel estimation and JCDE performance are evaluated in Section VI-A and VI-B, respectively.

VI-A Initial Channel Estimation Performance

To evaluate the initial channel estimation performance, the following estimation methods are compared: (a) LS: a classical least squares-based channel estimation, (b) P-SOMP [7]: a near-field channel estimation without considering the non-orthogonality of pilots. (c) 2D-CoSaMP [10]: a near-field channel estimation considering non-orthogonality, and (d) the proposed initial channel estimation method in Algorithm 1.

Fig. 4 shows the NMSE against SNR. The P-SOMP exhibits limited improvement with an increase in SNR due to pilot contamination stemming from non-orthogonal pilots, whereas 2D-CoSaMP demonstrates a performance enhancement compared to P-SOMP. The proposed method surpasses these conventional methods by mitigating noise amplification through the utilization of 2D-OMP in the second stage associated with UE-path pairing, resulting in superior channel estimation. Fig. 4 and Table I show the computational complexity evaluated by floating point operations (FLOPs). As depicted in the figure, the FLOPs of the proposed method are comparable to 2D-CoSaMP, owing to the two-stage procedure separating angle-distance estimation and UE-path pairing.

TABLE I: Computational complexity of initial channel estimation
Algotrithm FLOPs
P-SOMP [7] 𝒪(NU(GrGθ+L^)+L^2U(N+GrGθ))𝒪𝑁𝑈subscript𝐺𝑟subscript𝐺𝜃^𝐿superscript^𝐿2𝑈𝑁subscript𝐺𝑟subscript𝐺𝜃\mathcal{O}\left(NU(G_{r}G_{\theta}+\hat{L})+\hat{L}^{2}U(N+G_{r}G_{\theta})\right)caligraphic_O ( italic_N italic_U ( italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT + over^ start_ARG italic_L end_ARG ) + over^ start_ARG italic_L end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_U ( italic_N + italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ) )
2D-CoSaMP [10] 𝒪(Titer(NU(Kp+GrGθ)+L^NKp+L^3))𝒪subscript𝑇iter𝑁𝑈subscript𝐾psubscript𝐺𝑟subscript𝐺𝜃^𝐿𝑁subscript𝐾psuperscript^𝐿3\mathcal{O}\left(T_{\mathrm{iter}}\left(NU(K_{\mathrm{p}}+G_{r}G_{\theta})+% \hat{L}NK_{\mathrm{p}}+\hat{L}^{3}\right)\right)caligraphic_O ( italic_T start_POSTSUBSCRIPT roman_iter end_POSTSUBSCRIPT ( italic_N italic_U ( italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT + italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ) + over^ start_ARG italic_L end_ARG italic_N italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT + over^ start_ARG italic_L end_ARG start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) )
Proposed
𝒪(NKp(GrGθ+L^+NU)\mathcal{O}\Big{(}NK_{\mathrm{p}}(G_{r}G_{\theta}+\hat{L}+NU)caligraphic_O ( italic_N italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT + over^ start_ARG italic_L end_ARG + italic_N italic_U )
+L^2(GrGθKp+NU+NKp)+L^3Kp+L^4)+\hat{L}^{2}(G_{r}G_{\theta}K_{\mathrm{p}}+NU+NK_{\mathrm{p}})+\hat{L}^{3}K_{% \mathrm{p}}+\hat{L}^{4}\Big{)}+ over^ start_ARG italic_L end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT + italic_N italic_U + italic_N italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT ) + over^ start_ARG italic_L end_ARG start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT + over^ start_ARG italic_L end_ARG start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT )

Note: Titersubscript𝑇iterT_{\mathrm{iter}}italic_T start_POSTSUBSCRIPT roman_iter end_POSTSUBSCRIPT is the iteration number of 2D-CoSaMP, determined by [10].

Refer to caption
(a) NMSE versus SNR with the pilot length Kp=25subscript𝐾p25K_{\mathrm{p}}=25italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT = 25.
Refer to caption
(b) FLOPs versus the pilot length Kpsubscript𝐾pK_{\mathrm{p}}italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT.
Figure 4: NMSE and FLOPs performance.

VI-B JCDE Performance

In this subsection, we evaluate the performance of the proposed JCDE algorithm. As for JCDE algorithm parameters, the dam** factor is set to 0.50.50.50.5, the number of iterations is T=30𝑇30T=30italic_T = 30, the number of grids are G¯θ=5,G¯r=5formulae-sequencesubscript¯𝐺𝜃5subscript¯𝐺𝑟5\bar{G}_{\theta}=5,\bar{G}_{r}=5over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT = 5 , over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = 5, the grid ranges are σθ(1)=5superscriptsubscript𝜎𝜃1superscript5\sigma_{\theta}^{(1)}=5^{\circ}italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT = 5 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT, σθ(T)=0.1superscriptsubscript𝜎𝜃𝑇superscript0.1\sigma_{\theta}^{(T)}=0.1^{\circ}italic_σ start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT = 0.1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT, σr(1)=5msuperscriptsubscript𝜎𝑟15m\sigma_{r}^{(1)}=5\ \mathrm{m}italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT = 5 roman_m, and σr(T)=1msuperscriptsubscript𝜎𝑟𝑇1m\sigma_{r}^{(T)}=1\ \mathrm{m}italic_σ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_T ) end_POSTSUPERSCRIPT = 1 roman_m, respectively. The extremely large array with N=200𝑁200N=200italic_N = 200 antennas is divided into C=4𝐶4C=4italic_C = 4 sub-arrays with Nc=50subscript𝑁𝑐50N_{c}=50italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT = 50 antennas per sub-array. For comparison, AoA-aided BiGaBP [12] are employed as a benchmark, which is a state-of-the-art JCDE algorithm. Besides, we consider an ideal Genie-aided case with perfect knowledge of CSI or data corresponding to the lower bound of the proposed method.

VI-B1 JCDE Performance with Initial Channel Estimation

Refer to caption
(a) BER versus SNR with the pilot length Kp=25subscript𝐾p25K_{\mathrm{p}}=25italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT = 25.
Refer to caption
(b) NMSE versus SNR with the pilot length Kp=25subscript𝐾p25K_{\mathrm{p}}=25italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT = 25.
Figure 5: BER and NMSE performance.

This subsection reveals the NMSE and BER performance of the JCDE algorithms with various initial channel estimation methods, including P-SOMP, 2D-CoSaMP, and the proposed initial channel estimation method. To evaluate the data detection capability of the above initial channel estimation methods, the LMMSE detector is used for data estimation.

Fig. 5 shows the BER and NMSE performance. As shown in the figures, while LMMSE with LS, which cannot take advantage of the near-field model structures, exhibits poor BER performance, LMMSE with the other initial estimation approaches considering the near-field model structure achieve a slight performance improvement. However, there remains high-level error floors due to the non-orthogonal pilots. In contrast, the JCDE algorithms boost BER performance due to utilizing both pilot and consecutive data. In particular, the proposed JCDE algorithm with the proposed initial channel estimation demonstrates a significant performance gain, approaching the lower bound of perfect CSI or perfect data.

Moreover, the proposed JCDE algorithm demonstrates a notable BER performance compared to the state-of-the-art AoA-aided BiGaBP [12]. The performance improvement can be attributed to two primary factors. The first factor is that the proposed algorithm can leverage the near-field model-based estimation described in Section V-F, whereas BiGaBP relies on the far-field assumption. The second factor is that the proposed sub-array-wise LMMSE-based detection in (40a) is capable of addressing the correlation between the leaked energy in the beam-domain, whereas BiGaBP is incapable of doing so because of its MRC. To reveal the aforementioned two factors, in Section VI-B2, we show the convergence analysis with and without near-field model information. Besides, we evaluate in Section VI-B3 the proposed sub-array-wise LMMSE-based detection performance and its complexity across various numbers of sub-arrays C𝐶Citalic_C.

VI-B2 Convergence Analysis

To clarify the advantages gained by leveraging the near-field model structure, we evaluate the proposed JCDE algorithm with and without the model-based estimation process explained in Section V-F. Fig. 6 illustrates the BER and NMSE convergence behavior with respect to the number of algorithmic iterations. In the figure, the red triangle marker corresponds to the proposed JCDE algorithm without the model-based estimate, i.e., 𝐒^(t)=𝟎superscript^𝐒𝑡0\hat{\mathbf{S}}^{(t)}=\mathbf{0}over^ start_ARG bold_S end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = bold_0, where the prior distribution is designed i.i.d. for each element of 𝐇𝐇\mathbf{H}bold_H instead of 𝐄𝐄\mathbf{E}bold_E, akin to [13, 14]. The green square marker corresponds to the proposed JCDE algorithm with the initial model-based estimate but without updating in iterations, i.e., 𝐒^(t)=𝐒^(0)superscript^𝐒𝑡superscript^𝐒0\hat{\mathbf{S}}^{(t)}=\hat{\mathbf{S}}^{(0)}over^ start_ARG bold_S end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = over^ start_ARG bold_S end_ARG start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT. Comparing the red triangle maker and green square marker, we can verify the performance improvement stemming from the use of the near-filed model through the decomposition of 𝐇𝐇\mathbf{H}bold_H into 𝐒^^𝐒\hat{\mathbf{S}}over^ start_ARG bold_S end_ARG and 𝐄𝐄\mathbf{E}bold_E as written in (15). Furthermore, in comparison to the proposed algorithm with adaptive update, it can be seen that the adaptive updating of the model-based estimate 𝐒^(t)superscript^𝐒𝑡\hat{\mathbf{S}}^{(t)}over^ start_ARG bold_S end_ARG start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT enhances the BER and NMSE performance by further exploiting the near-field model.

Refer to caption
(a) BER against the number of algorithmic iterations.
Refer to caption
(b) NMSE against the number of algorithmic iterations.
Figure 6: Convergence behavior in term of BER and NMSE performance with Kp=25,SNR=26dBformulae-sequencesubscript𝐾p25SNR26dBK_{\mathrm{p}}=25,\ \mathrm{SNR}=26\ \mathrm{dB}italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT = 25 , roman_SNR = 26 roman_dB.

VI-B3 Performance Against the Number of Sub-arrays

TABLE II: Computational complexity of JCDE algorithms
Algotrithm FLOPs
BiGaBP [12] 𝒪(UKdNQ+UKN+UL^u2N2+UN2L^u)𝒪𝑈subscript𝐾d𝑁𝑄𝑈𝐾𝑁𝑈superscriptsubscript^𝐿𝑢2superscript𝑁2𝑈superscript𝑁2subscript^𝐿𝑢\mathcal{O}\Big{(}UK_{\mathrm{d}}NQ+UKN+U\hat{L}_{u}^{2}N^{2}+UN^{2}\hat{L}_{u% }\Big{)}caligraphic_O ( italic_U italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT italic_N italic_Q + italic_U italic_K italic_N + italic_U over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_U italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT )
Proposed
𝒪(CUKdNc2+CKdNc3+UKdQ+UKN\mathcal{O}\Big{(}CUK_{\mathrm{d}}N_{c}^{2}+CK_{\mathrm{d}}N_{c}^{3}+UK_{% \mathrm{d}}Q+UKNcaligraphic_O ( italic_C italic_U italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_C italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT + italic_U italic_K start_POSTSUBSCRIPT roman_d end_POSTSUBSCRIPT italic_Q + italic_U italic_K italic_N
        +UNG¯θG¯r+UL^u2N+UL^uG¯θG¯r)+UN\bar{G}_{\theta}\bar{G}_{r}+U\hat{L}_{u}^{2}N+U\hat{L}_{u}\bar{G}_{\theta}% \bar{G}_{r}\Big{)}+ italic_U italic_N over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT + italic_U over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N + italic_U over^ start_ARG italic_L end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT over¯ start_ARG italic_G end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT )
Refer to caption
(a) BER against the number of sub-arrays C𝐶Citalic_C.
Refer to caption
(b) FLOPs against the number of sub-arrays C𝐶Citalic_C.
Figure 7: BER and FLOPs performance with Kp=25,SNR=26dBformulae-sequencesubscript𝐾p25SNR26dBK_{\mathrm{p}}=25,\ \mathrm{SNR}=26\ \mathrm{dB}italic_K start_POSTSUBSCRIPT roman_p end_POSTSUBSCRIPT = 25 , roman_SNR = 26 roman_dB.

To analyze the impact of the number of sub-arrays C𝐶Citalic_C on the performance of the proposed JCDE algorithm employing the sub-array-wise LMMSE-based detection, we offer in Fig. 7 the BER and FLOPs with respect to various numbers of sub-arrays C𝐶Citalic_C, where C=1𝐶1C=1italic_C = 1 corresponds to the full-array LMMSE-based detection and C=N=200𝐶𝑁200C=N=200italic_C = italic_N = 200 corresponds to the MRC-based detection. As depicted in the figure, the BER decreases as the number of sub-arrays increases (i.e., the number of antennas at each sub-array Ncsubscript𝑁cN_{\mathrm{c}}italic_N start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT decreases) because each sub-array fails to effectively whiten the correlation in the beam-domain even in the perfect CSI case. In particular, the MRC-based detection corresponding to C=200𝐶200C=200italic_C = 200 exhibits poor performance. In contrast, an increase in the number of sub-arrays leads to a reduction in FLOPs attributed to the decreased size of the inverse matrix associated with the LMMSE-based detection in (40a). Despite relying on the LMMSE-based detector, the proposed algorithm can achieve lower FLOPs when C>4𝐶4C>4italic_C > 4 compared to BiGaBP, which relies on an MRC-based detector, since the proposed method suppresses self-feedback in (50) after the denoising process as in (48a)-(48b) with FLOPs 𝒪(UKdQ)𝒪𝑈subscript𝐾𝑑𝑄\mathcal{O}(UK_{d}Q)caligraphic_O ( italic_U italic_K start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT italic_Q ), whereas BiGaBP suppresses self-feedback before the denoising process [12, 32] with FLOPs 𝒪(NUKdQ)𝒪𝑁𝑈subscript𝐾𝑑𝑄\mathcal{O}(NUK_{d}Q)caligraphic_O ( italic_N italic_U italic_K start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT italic_Q ) that is dominant complexity throughout the entire process as shown in Table II. From the above results, it is evident that the proposed method outperforms the conventional method in terms of both data detection and complexity.

VII Conclusion

This paper proposed an initial channel estimation algorithm and subsequent JCDE algorithm for multiuser XL-MIMO systems with non-orthogonal pilots. The initial channel estimation is performed by an efficient two-stage compressed sensing algorithm exploiting the polar-domain sparsity. Furthermore, the initial channel estimates are refined by jointly utilizing both non-orthogonal pilots and data via the EP algorithm. To improve channel estimation accuracy, the model-based deterministic approach is integrated into a Bayesian inference framework. In addition, to address the near-field specific correlation in the beam domain, a sub-array-wise LMMSE filter is designed considering the correlation and channel estimation errors for data detection. Computer simulations validated that the proposed method is superior to existing approaches in terms of channel estimation, data detection, and complexity.

References

  • [1] H. Tataria, M. Shafi, A. F. Molisch, M. Dohler, H. Sjöland, and F. Tufvesson, “6G wireless systems: Vision, requirements, challenges, insights, and opportunities,” Proc. IEEE, vol. 109, no. 7, pp. 1166–1199, 2021.
  • [2] E. D. Carvalho, A. Ali, A. Amiri, M. Angjelichinoski, and R. W. Heath, “Non-stationarities in extra-large-scale massive MIMO,” IEEE Wirel. Commun., vol. 27, no. 4, pp. 74–80, 2020.
  • [3] Z. Wang et al., “A tutorial on extremely large-scale MIMO for 6G: Fundamentals, signal processing, and applications,” IEEE Commun. Surveys Tuts., Early Access, 2024.
  • [4] H. Iimori, T. Takahashi, K. Ishibashi, G. T. F. de Abreu, D. González G., and O. Gonsa, “Joint activity and channel estimation for extra-large MIMO systems,” IEEE Trans. Wirel. Commun., vol. 21, no. 9, pp. 7253–7270, 2022.
  • [5] M. Cui, Z. Wu, Y. Lu, X. Wei, and L. Dai, “Near-field MIMO communications for 6G: Fundamentals, challenges, potentials, and future directions,” IEEE Commun. Mag., vol. 61, no. 1, pp. 40–46, 2023.
  • [6] Y. Liu, Z. Wang, J. Xu, C. Ouyang, X. Mu, and R. Schober, “Near-field communications: A tutorial review,” IEEE Open J. Commun. Soc., vol. 4, pp. 1999–2049, 2023.
  • [7] M. Cui and L. Dai, “Channel estimation for extremely large-scale MIMO: Far-field or near-field?” IEEE Trans. Commun., vol. 70, no. 4, pp. 2663–2677, 2022.
  • [8] J. Rodríguez-Fernández, N. González-Prelcic, K. Venugopal, and R. W. Heath, “Frequency-domain compressive channel estimation for frequency-selective hybrid millimeter wave MIMO systems,” IEEE Trans. Wirel. Commun., vol. 17, no. 5, pp. 2946–2960, 2018.
  • [9] C. Hu, L. Dai, T. Mir, Z. Gao, and J. Fang, “Super-resolution channel estimation for mmwave massive mimo with hybrid precoding,” IEEE Trans. Veh. Technol., vol. 67, no. 9, pp. 8954–8958, 2018.
  • [10] X. Xie, Y. Wu, J. An, D. W. K. Ng, C. Xing, and W. Zhang, “Massive unsourced random access for near-field communications,” IEEE Trans. Commun., pp. 1–1, Early Access 2024.
  • [11] K. Ito, T. Takahashi, S. Ibi, and S. Sampei, “Bilinear gaussian belief propagation for massive mimo detection with non-orthogonal pilots,” IEEE Trans. Commun., vol. 72, no. 2, pp. 1045–1061, 2024.
  • [12] K. Ito, T. Takahashi, K. Igarashi, S. Ibi, and S. Sampei, “AoA estimation-aided Bayesian receiver design via bilinear inference for mmWave massive MIMO,” in Proc. IEEE Int. Conf. Commun. (ICC), 2023, pp. 6474–6479.
  • [13] W. Yan and X. Yuan, “Semi-blind channel-and-signal estimation for uplink massive MIMO with channel sparsity,” IEEE Access, vol. 7, pp. 95 008–95 020, 2019.
  • [14] L. Chen and X. Yuan, “Blind multiuser detection in massive MIMO channels with clustered sparsity,” IEEE Wirel. Commun. Lett., vol. 8, no. 4, pp. 1052–1055, 2019.
  • [15] H. Iimori, T. Takahashi, K. Ishibashi, G. T. F. de Abreu, and W. Yu, “Grant-free access via bilinear inference for cell-free MIMO with low-coherence pilots,” IEEE Trans. Wirel. Commun., vol. 20, no. 11, pp. 7694–7710, 2021.
  • [16] J. T. Parker, P. Schniter, and V. Cevher, “Bilinear generalized approximate message passing―part I: Derivation,” IEEE Trans. Signal Process., vol. 62, no. 22, pp. 5839–5853, 2014.
  • [17] Y. Kabashima, “A CDMA multiuser detection algorithm on the basis of belief propagation,” J. Phys. A, Math. Gen., vol. 36, no. 43, 2003.
  • [18] D. Fan et al., “Angle domain channel estimation in hybrid millimeter wave massive MIMO systems,” IEEE Trans. Wirel. Commun., vol. 17, no. 12, pp. 8165–8179, 2018.
  • [19] Y. Fang, J. Wu, and B. Huang, “2D sparse signal recovery via 2D orthogonal matching pursuit,” Sci. China Inf. Sci., vol. 55, pp. 889–897, 2012.
  • [20] T. P. Minka, “Expectation propagation for approximate bayesian inference,” Proc. 17th Conf. Uncertainty Artif, pp. 362–369, 2001.
  • [21] H. Iimori, G. T. F. de Abreu, O. Taghizadeh, R.-A. Stoica, T. Hara, and K. Ishibashi, “Stochastic learning robust beamforming for millimeter-wave systems with path blockage,” IEEE Wirel. Commun. Lett., vol. 9, no. 9, pp. 1557–1561, 2020.
  • [22] Y. Pati, R. Rezaiifar, and P. Krishnaprasad, “Orthogonal matching pursuit: Recursive function approximation with applications to wavelet decomposition,” in Proc. Asilomar Conf. Signals, Syst., Comput, 1993, pp. 40–44 vol.1.
  • [23] B. L. Sturm and M. G. Christensen, “Comparison of orthogonal matching pursuit implementations,” in Proc. 20th Eur. Signal Process. Conf. (EUSIPCO), 2012, pp. 220–224.
  • [24] J. Ma and L. **, “Orthogonal AMP,” IEEE Access, vol. 5, pp. 2020–2033, 2017.
  • [25] S. Rangan, P. Schniter, and A. K. Fletcher, “Vector approximate message passing,” IEEE Trans. Inf. Theory, vol. 65, no. 10, pp. 6664–6684, 2019.
  • [26] H. Wang, A. Kosasih, C.-K. Wen, S. **, and W. Hardjawana, “Expectation propagation detector for extra-large scale massive MIMO,” IEEE Trans. Wirel. Commun., vol. 19, no. 3, pp. 2036–2051, 2020.
  • [27] A. Mishra, A. Rajoriya, A. K. Jagannatham, and G. Ascheid, “Sparse bayesian learning-based channel estimation in millimeter wave hybrid MIMO systems,” in Proc. IEEE Int. Workshop Sig. Process. Ad. Wirel. Commun. (SPAWC), 2017, pp. 1–5.
  • [28] C. M. Bishop, Pattern Recognition and Machine Learning (Information Science and Statistics).   Berlin, Germany: Springer-Verlag, 2006.
  • [29] M. E. Tip**, “Sparse Bayesian learning and the relevance vector machine,” J. Mach. Learn. Res., vol. 1, no. 2, pp. 211–244, 2002.
  • [30] P. Jain, P. Kar et al., “Non-convex optimization for machine learning,” Found. Trends Mach. Learn., vol. 10, no. 3-4, pp. 142–363, 2017.
  • [31] Q. Zou and H. Yang, “A concise tutorial on approximate message passing,” arXiv:2201.07487, 2022.
  • [32] R. Tamaki, K. Ito, T. Takahashi, S. Ibi, and S. Sampei, “Suppression of self-noise feedback in GAMP for highly correlated large MIMO detection,” in Proc. IEEE Int. Conf. Commun. (ICC), 2022, pp. 1300–1305.