RIS-aided MIMO Beamforming: Piece-Wise Near-field Channel Model

Weijian Chen, Zai Yang, Zhiqiang Wei, Derrick Wing Kwan Ng, and Michail Matthaiou, Part of the paper has been submitted to the 2024 ICCC [1]. W. Chen, Z. Yang, and Z. Wei are with the School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, China (e-mails: [email protected], [email protected], [email protected]). (Corresponding author: Zhiqiang Wei). D. W. K. Ng is with the School of Electrical Engineering and Telecommunications, the University of New South Wales, Australia (email: [email protected]). M. Matthaiou is with the Centre for Wireless Innovation (CWI), Queen’s University Belfast, BT3 9DT Belfast, U.K. (e-mail: [email protected]).
Abstract

This paper proposes a joint active and passive beamforming design for reconfigurable intelligent surface (RIS)-aided wireless communication systems, adopting a piece-wise near-field channel model. While a traditional near-field channel model, applied without any approximations, offers higher modeling accuracy than a far-field model, it renders the system design more sensitive to channel estimation errors (CEEs). As a remedy, we propose to adopt a piece-wise near-field channel model that leverages the advantages of the near-field approach while enhancing its robustness against CEEs. Our study analyzes the impact of different channel models, including the traditional near-field, the proposed piece-wise near-field and far-field channel models, on the interference distribution caused by CEEs and model mismatches. Subsequently, by treating the interference as noise, we formulate a joint active and passive beamforming design problem to maximize the spectral efficiency (SE). The formulated problem is then recast as a mean squared error (MSE) minimization problem and a suboptimal algorithm is developed to iteratively update the active and passive beamforming strategies. Simulation results demonstrate that adopting the piece-wise near-field channel model leads to an improved SE compared to both the near-field and far-field models in the presence of CEEs. Furthermore, the proposed piece-wise near-field model achieves a good trade-off between modeling accuracy and system’s degrees of freedom (DoF).

Index Terms:
Beamforming, near-field, piece-wise near-field, reconfigurable intelligent surface.

I Introduction

Reconfigurable intelligent surface (RIS)-aided wireless communications have attracted significant interest due to their excellent ability to mitigate the propagation path loss through passive beamforming and to circumvent potential obstacles via establishing alternative propagation paths [2]. An RIS is a metalic planar array consisting of numerous passive elements that can be independently reconfigured. By customizing the phase shift of each RIS element based on the channel conditions, the received signal strength at the desired user can be enhanced [3], while undesired interference from adjacent cells and other users can be efficiently suppressed [3, 4, 5, 6].

Numerous studies have focused on optimizing both the active beamforming at the transmitter (Tx) and passive beamforming at the RIS for RIS-aided communication systems [7, 8, 9, 10, 11, 12, 13]. For example, Yu et al. [7] proposed two algorithms for single-user scenarios, namely fixed point iteration and manifold optimization, to maximize the achievable rate for a RIS-aided point-to-point (P2P) multiple-input single-output (MISO) communication system. Similarly, Zhang et al. [8] established the fundamental capacity limit through the joint design of the RIS reflection coefficients and the transmit power allocation. On the other hand, in multi-user scenarios, Alwazani et al. [9] studied the joint optimization of transmit power strategy at the Tx and passive beamforming at the RIS to maximize the minimum user signal-to-interference-plus-noise ratio (SINR), accounting for imperfect channel estimation information (CSI). Additionally, Pan et al. [10] explored the weighted sum rate of all users in multi-cell scenarios by jointly designing the active and passive beamforming, subject to individual base station (BS)’s power constraint and the unit modulus constraint for the RIS reflection coefficients according to a weighted minimum mean square error (WMMSE) framework. The affordability and portability of RIS technology have sparked a range of research initiatives. The work of Hu et al. in [11] on multiuser MISO downlink communications utilized an intelligent reflection surface (IRS) capable of reflecting the signal and harvesting energy from signals, enhancing both the communication robustness and security. Also, Wei et al. explored the implementation of IRS in unmanned aerial vehicle (UAV)-based orthogonal frequency division multiple access (OFDMA) communication systems, utilizing the mobility of UAVs and the beamforming capabilities of the IRS to boost the system’s sum-rate [12]. Furthermore, deep learning (DL) has been adopted to design beamforming for RIS-assisted multiuser communications, potentially outperforming traditional model-based techniques [13]. It is worth noting that these studies assumed a far-field channel model, which is only applicable when the RIS is deployed far from both the Tx and users.

In practice, the electromagnetic (EM) radiation field is commonly divided into two regions: the far-field and the near-field. The demarcation between these regions is determined by the Rayleigh distance which is proportional to the square of the array aperture and is inversely proportional to the signal carrier wavelength [14]. Specially, when a receiver (Rx) is located in the far-field region, the EM field propagation is approximately modeled by planar waves with a certain approximate error. However, in the near-field region, near-field propagation becomes dominant and the EM field propagation is accurately modeled by spherical waves. In fact, with emerging high-frequency communications, utilizing large-aperture antenna arrays for potential power gains can also increase the Rayleigh distance up to a hundred meters, such that the far-field assumptions are no longer valid.

In recent years, a novel area of research has emerged with the goal of improving the efficiency and performance of near-field wireless communication systems. For instance, in [14], Cui et al. introduced the fundamental concept of RIS-assisted near-field communications and highlighted several areas for future research. Besides, Wei et al. [15] focused on develo** a near-field codebook for extremely large-scale RIS (XL-RIS) by taking into account the near-field cascaded array steering vector. Specifically, they crafted a hierarchical near-field codebook and introduced a corresponding hierarchical near-field beam training scheme to minimize the beam training overhead. Furthermore, Wang et al. [16] proposed two efficient schemes for optimizing the BS beamforming to improve the RIS beam training performance, leveraging a near-field channel model. Moreover, in [17], Dovelos et al. investigated RIS-aided MIMO systems in terms of power gain and energy efficiency (EE), considering a spherical wave channel model. Specifically, they analyzed the power gains under beamfocusing and beamsteering, concluding that beamfocusing in the radiating near-field is more useful than beamforming adopted in the far-field counterpart scenarios.

In fact, the necessity of adopting a near-field channel model in a RIS-aided wireless communication system is twofold. Firstly, the RIS is usually coated with a large aperture and needs to be deployed close to the Tx or the Rx to enhance its gain [18], [19]. In other words, the Tx or Rx is more likely to be within the near-field region of the RIS. Secondly, when the RIS is located close to the Tx or the Rx, either the Tx-RIS or the RIS-Rx channel is likely dominated by line-of-sight (LoS) propagation paths [20], [21]. Note that assuming a LoS far-field channel in either the Tx-RIS or the RIS-Rx link makes the cascaded channel matrix between the Tx and Rx only exhibits rank-one and, thus, limits the system’s design degrees of freedom (DoF). It is important to note that a practical near-field channel model, as discussed in [22], can effectively uncover the implicit higher ranks of the cascaded channel matrix, potentially leading to an improvement in both the system’s DoF and spectral efficiency (SE). As a result, considering a near-field channel model for RIS-aided wireless communication systems not only improves the modeling accuracy but also reveals more DoF.

Despite of the advantages mentioned above, embracing near-field communications entails significant challenges. One key challenge is that characterizing a near-field channel requires knowledge of all the distances between each pair of transceiver antennas, whereas the far-field channel only depends on the distance between transceivers, the angle of departure (AoD) at the Tx, and the angle of arrival (AoA) at the Rx. Consequently, estimating the near-field channel is generally more challenging due to the larger number of parameters involved. Another challenge is the sensitivity of near-field communications to channel estimation errors (CEEs). For a far-field channel model, aligning the beam direction of the Tx towards the Rx is usually sufficient to achieve the beamforming gain, as long as the Rx is within the beam’s cone. However, for near-field communications, precise beamfocusing is required to accurately focus the transmitted signal towards the Rx’s location. In practical situations with the presence of CEEs, the focus of near-field beamforming may deviate from the intended focal point, which degrades the beamforming gain and system performance [21]. These two challenges motivate us to consider a new channel model that not only leverages the advantages of the near-field channel model but also enhances its robustness against CEEs.

TABLE I: Comparison of Existing Works
References Joint Beamforming MIMO Channel Model CSI Assumption
[7, 9, 11, 12, 13] Far-field Perfect/Imperfect
[8, 10] Far-field Perfect
[14], [15, 16, 17] Near-field Perfect
This paper Piece-wise near-field Imperfect

As compared in Table I, various existing works have overlooked the near-field effect and some recent works on near-field communications assumed the ideal case of perfect CSI for beamforming design. To the best of the authors’ knowledge, our work is the first to compare the performance and robustness of different channel models for RIS-aided communications.

In comparison to the conference version [1], this paper provides a detailed description of the piece-wise near-field model and extensively discusses the rank of the cascaded channel and its advantages. Additionally, it presents a comprehensive algorithmic framework, parameter selection, and convergence description for the the alternating direction penalty method (ADPM) algorithm, addressing the RIS phase constant modulus constraint. Through simulations, this paper also illustrates the impact of the number of antennas at the Tx and the number of RIS reflecting elements on SE, shedding light on their distinct roles in RIS-aided communication systems. The key contributions of this paper are outlined as follows:

  • For the first time, we propose a joint active and passive beamforming design for RIS-aided MIMO wireless communication systems, adopting a piece-wise near-field channel model. This model approximates the near-field channel via dividing a large-aperture RIS into multiple small-aperture sub-surfaces and assuming heterogeneous far-field propagation between each surface to the Tx or Rx. One can imagine that the piece-wise near-field channel model retains the advantages of accurate channel modeling and increased DoF gain of the near-field channel model.

  • For the three different channel models, i.e., the near-field model, the far-field model and the piece-wise near-field model, and by assuming an identical normalized CEE but different model mismatches, we analyze the covariance matrices of the corresponding interference plus noise signals.

  • The joint active and passive beamforming design is formulated as an optimization problem to maximize the achievable SE by treating the interference caused by CEEs and model mismatches as noise. The achievable SE maximization problem is then transformed equivalently to a problem minimizing the mean square error (MSE), assuming a Gaussian CEE distribution.

  • A block coordinate descent (BCD) approach is adopted to alternately optimize the active and passive beamforming strategies. In particular, the ADPM is adopted to address the constant modulus constraint on the RIS reflection coefficient. We propose an iterative suboptimal algorithm with closed-form updating rules in each step to design the active and passive beamforming strategies. Simulation results demonstrate the system design DoF gain and the enhanced robustness against CEE of adopting the piece-wise near-filed channel model compared to the conventional far-field and near-field models.

The rest of this paper is organized as follows: In Section II, we introduce the RIS-aided near-field communication system model adopting the piece-wise near-field channel model. Section III provides the analysis of the interference distribution with different channel models. In Section IV, we formulate the joint active and passive beamfoming design problem. Then, we present the solution in Section V. In Section VI, we present numerical results with discussions. Finally, we conclude with Section VII.

Notations: Lower-case letters are used to represent scalars, while vectors and matrices are denoted by lower-case and upper-case boldface letters, respectively. The set of complex numbers is denoted by \mathbb{C}blackboard_C; \Reroman_ℜ extracts the real part of a complex number. For vector 𝒙𝒙\boldsymbol{x}bold_italic_x, 𝒙jsubscript𝒙𝑗\boldsymbol{x}_{j}bold_italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT denotes the j𝑗jitalic_j-th element of 𝒙𝒙\boldsymbol{x}bold_italic_x and diag(𝒙)diag𝒙\text{diag}\left(\boldsymbol{x}\right)diag ( bold_italic_x ) denotes a diagonal matrix with its diagonal entries given by 𝒙𝒙{\boldsymbol{x}}bold_italic_x. For matrix 𝑨𝑨\boldsymbol{A}bold_italic_A, 𝔼{𝑨}𝔼𝑨\mathbb{E}\{\boldsymbol{A}\}blackboard_E { bold_italic_A }, 𝑨Tsuperscript𝑨𝑇\boldsymbol{A}^{T}bold_italic_A start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, 𝑨Hsuperscript𝑨𝐻\boldsymbol{A}^{H}bold_italic_A start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT, 𝑨Fsubscriptnorm𝑨𝐹\|\boldsymbol{A}\|_{F}∥ bold_italic_A ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT, 𝑨1superscript𝑨1\boldsymbol{A}^{-1}bold_italic_A start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, rank(𝑨)rank𝑨\text{rank}\left(\boldsymbol{A}\right)rank ( bold_italic_A ), and tr(𝑨)tr𝑨\text{tr}\left(\boldsymbol{A}\right)tr ( bold_italic_A ) denote the expectation, matrix transpose, conjugate transpose, Frobenius norm, inverse, rank, and trace of 𝑨𝑨\boldsymbol{A}bold_italic_A, respectively; 𝒞𝒩(𝝁,𝚺)𝒞𝒩𝝁𝚺\mathcal{CN}(\boldsymbol{\mu},\boldsymbol{\Sigma})caligraphic_C caligraphic_N ( bold_italic_μ , bold_Σ ) denotes a circularly symmetric complex Gaussian random vector distribution with mean 𝝁𝝁\boldsymbol{\mu}bold_italic_μ and covariance matrix 𝚺𝚺\boldsymbol{\Sigma}bold_Σ. The matrix 𝑨𝑨\boldsymbol{A}bold_italic_A is said to have a matrix-variate complex Gaussian distribution, which can be written as 𝒞𝒩n,m(𝚷n×m,𝚺r𝚺cT)𝒞subscript𝒩𝑛𝑚subscript𝚷𝑛𝑚tensor-productsubscript𝚺rsubscriptsuperscript𝚺𝑇c\mathcal{CN}_{n,m}(\boldsymbol{\Pi}_{n\times m},\boldsymbol{\Sigma}_{\rm r}% \otimes\boldsymbol{\Sigma}^{T}_{\rm c})caligraphic_C caligraphic_N start_POSTSUBSCRIPT italic_n , italic_m end_POSTSUBSCRIPT ( bold_Π start_POSTSUBSCRIPT italic_n × italic_m end_POSTSUBSCRIPT , bold_Σ start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT ⊗ bold_Σ start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT ), where the n×n𝑛𝑛n\times nitalic_n × italic_n matrix 𝚺rsubscript𝚺r\boldsymbol{\Sigma}_{\rm r}bold_Σ start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT and the m×m𝑚𝑚m\times mitalic_m × italic_m matrix 𝚺csubscript𝚺c\boldsymbol{\Sigma}_{\rm c}bold_Σ start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT are the row and column covariance matrices of 𝑨𝑨\boldsymbol{A}bold_italic_A, respectively [23]. Besides, vec(𝐀)𝒞𝒩(vec(𝚷),𝚺r𝚺cT)similar-tovec𝐀𝒞𝒩vec𝚷tensor-productsubscript𝚺rsubscriptsuperscript𝚺Tc\rm vec(\boldsymbol{A})\sim\mathcal{CN}(\rm vec({\boldsymbol{\Pi}}),% \boldsymbol{\Sigma}_{r}\otimes\boldsymbol{\Sigma}^{T}_{c})roman_vec ( bold_A ) ∼ caligraphic_C caligraphic_N ( roman_vec ( bold_Π ) , bold_Σ start_POSTSUBSCRIPT roman_r end_POSTSUBSCRIPT ⊗ bold_Σ start_POSTSUPERSCRIPT roman_T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT ) where the operation vec(𝐀)vec𝐀\rm vec(\boldsymbol{A})roman_vec ( bold_A ) stacks the columns of the matrix 𝑨𝑨\boldsymbol{A}bold_italic_A into a single vector. The symbols tensor-product\otimes and \circ represent the Kronecker and Hadamard products, respectively. The symbol \angle is used to denote the phase of a complex number.

Refer to caption
(a) Near-field channel model for the Tx-RIS channel.
Refer to caption
(b) Piece-wise near-field channel model for the Tx-RIS channel.
Figure 1: The RIS-aided P2P wireless communication system.

II System Model

II-A System Model

We consider a RIS-aided point-to-point (P2P) multiple-input multiple-output (MIMO) wireless communication system, as illustrated in Fig. 1, where the RIS is deployed close to a Tx to assist the data transmission from the Tx to a Rx. The Tx equipped with NTxsubscript𝑁Tx{N_{\rm Tx}}italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT antennas, transmits Ns(NTx)annotatedsubscript𝑁sabsentsubscript𝑁Tx{N_{\rm s}}\ {(\leq N_{\rm Tx})}italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT ( ≤ italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ) independent data streams to the Rx equipped with NRxsubscript𝑁RxN_{\rm Rx}italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT antennas with the aid of a RIS comprising NR=NRy×NRzsubscript𝑁Rsubscript𝑁subscriptR𝑦subscript𝑁subscriptR𝑧N_{\rm R}=N_{{\rm R}_{y}}\times N_{{\rm R}_{z}}italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT = italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT passive elements, where NRysubscript𝑁subscriptR𝑦N_{{\rm R}_{y}}italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT and NRzsubscript𝑁subscriptR𝑧N_{{\rm R}_{z}}italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT are the number of elements in the horizontal and vertical directions of RIS, respectively. Without loss of generality, we assume that both the transmit and receive antenna arrays are uniform linear arrays (ULAs) with an identical antenna spacing for both arrays at the Tx and Rx as well as the RIS, denoted by d𝑑ditalic_d. We assume that there is no direct link from the Tx to the Rx, which is likely to occur due to blockages as commonly assumed in the literature [24]. Furthermore, the narrowband channels from the Tx to the RIS and from the RIS to the Rx are denoted by 𝑮NR×NTx𝑮superscriptsubscript𝑁Rsubscript𝑁Tx\boldsymbol{G}\in{\mathbb{C}^{N_{\rm R}\times{N_{\rm Tx}}}}bold_italic_G ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUPERSCRIPT and 𝑹NRx×NR𝑹superscriptsubscript𝑁Rxsubscript𝑁R\boldsymbol{R}\in{\mathbb{C}^{N_{\rm Rx}\times{N_{\rm R}}}}bold_italic_R ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, respectively. The phase shift matrix of the RIS is denoted by 𝚽=diag(ϕ1,ϕ2,,ϕNR)NR×NR𝚽diagsubscriptitalic-ϕ1subscriptitalic-ϕ2subscriptitalic-ϕsubscript𝑁Rsuperscriptsubscript𝑁Rsubscript𝑁R\boldsymbol{\Phi}=\text{diag}(\phi_{1},\phi_{2},...,\phi_{N_{\rm R}})\in{% \mathbb{C}^{N_{\rm R}\times{N_{\rm R}}}}bold_Φ = diag ( italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ϕ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_ϕ start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, to perform passive beamforming, where ϕnR=ejζnRsubscriptitalic-ϕsubscript𝑛Rsuperscript𝑒𝑗subscript𝜁subscript𝑛R\phi_{n_{\rm R}}=e^{j{\zeta}_{n_{\rm R}}}italic_ϕ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_j italic_ζ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT and ζnR[0,2π]subscript𝜁subscript𝑛R02𝜋\zeta_{n_{\rm R}}\in[0,2\pi]italic_ζ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ [ 0 , 2 italic_π ] is the phase shift introduced by the nRsubscript𝑛R{n_{\rm R}}italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT-th RIS element, nR{1,,NR}for-allsubscript𝑛R1subscript𝑁R\forall{n_{\rm R}}\in\{1,\ldots,N_{\rm R}\}∀ italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT ∈ { 1 , … , italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT }. We denote the symbols transmitted from the Tx to the Rx as 𝒔Ns×1𝒞𝒩(𝟎,𝐈Ns)𝒔superscriptsubscript𝑁s1similar-to𝒞𝒩0subscript𝐈subscript𝑁s\boldsymbol{s}\in{\mathbb{C}^{N_{\rm s}\times 1}}\sim\mathcal{CN}(\boldsymbol{% 0},\boldsymbol{\rm I}_{N_{\rm s}})bold_italic_s ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT ∼ caligraphic_C caligraphic_N ( bold_0 , bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT ). The received signal 𝒚NRx×1𝒚superscriptsubscript𝑁Rx1\boldsymbol{y}\in{\mathbb{C}^{N_{\rm Rx}\times 1}}bold_italic_y ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT at the Rx is given by

𝒚=𝑹𝚽𝑮𝑾𝒔+𝒏=𝑯𝑾𝒔+𝒏,𝒚𝑹𝚽𝑮𝑾𝒔𝒏𝑯𝑾𝒔𝒏\boldsymbol{y}=\boldsymbol{R}\boldsymbol{\Phi}\boldsymbol{G}\boldsymbol{W}% \boldsymbol{s}+\boldsymbol{n}=\boldsymbol{H}\boldsymbol{W}\boldsymbol{s}+% \boldsymbol{n},bold_italic_y = bold_italic_R bold_Φ bold_italic_G bold_italic_W bold_italic_s + bold_italic_n = bold_italic_H bold_italic_W bold_italic_s + bold_italic_n , (1)

where 𝑾NTx×Ns𝑾superscriptsubscript𝑁Txsubscript𝑁s\boldsymbol{W}\in\mathbb{C}^{N_{\rm Tx}\times{N_{\rm s}}}bold_italic_W ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the precoding matrix (i.e., active beamforming matrix) at the Tx and 𝒏NRx×1𝒞𝒩(𝟎,σ2𝐈NRx)𝒏superscriptsubscript𝑁Rx1similar-to𝒞𝒩0superscript𝜎2subscript𝐈subscript𝑁Rx\boldsymbol{n}\in\mathbb{C}^{N_{\rm Rx}\times 1}\sim\mathcal{CN}(\boldsymbol{0% },\sigma^{2}\boldsymbol{\rm I}_{N_{\rm Rx}})bold_italic_n ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT ∼ caligraphic_C caligraphic_N ( bold_0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) is the additive white Gaussian noise (AWGN) at the Rx with a noise power of σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. For the sake of presentation, we define the cascaded channel between the Tx and Rx as 𝑯=𝑹𝚽𝑮𝑯𝑹𝚽𝑮\boldsymbol{H}=\boldsymbol{R}\boldsymbol{\Phi}\boldsymbol{G}bold_italic_H = bold_italic_R bold_Φ bold_italic_G.

II-B Channel Models

To achieve a large passive beamforming gain, the RIS is deployed close to the Tx as shown in Fig. 1(a) and the number of RIS elements is usually large [14], [15]. Therefore, we consider a near-field channel model for the Tx-RIS channel and a far-field multi-path channel model for the RIS-Rx channel. To avoid confusion, we adopt different channel models, i.e., 𝑮N,𝑮Psubscript𝑮Nsubscript𝑮P\boldsymbol{G}_{\rm N},\boldsymbol{G}_{\rm P}bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT , bold_italic_G start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT, and 𝑮Fsubscript𝑮F\boldsymbol{G}_{\rm F}bold_italic_G start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT, to represent the link between Tx-RIS. In particular, assuming a LoS-dominated propagation between the Tx and RIS, the Tx-RIS channel is modeled as [25]

𝑮N=[α11ej2πλd11α1NTxej2πλd1NTxα21ej2πλd21α2NTxej2πλd2NTxαNR1ej2πλdNR1αNRNTxej2πλdNRNTx],subscript𝑮Nmatrixsubscript𝛼11superscript𝑒𝑗2𝜋𝜆subscript𝑑11subscript𝛼1subscript𝑁Txsuperscript𝑒𝑗2𝜋𝜆subscript𝑑1subscript𝑁Txsubscript𝛼21superscript𝑒𝑗2𝜋𝜆subscript𝑑21subscript𝛼2subscript𝑁Txsuperscript𝑒𝑗2𝜋𝜆subscript𝑑2subscript𝑁Txsubscript𝛼subscript𝑁R1superscript𝑒𝑗2𝜋𝜆subscript𝑑subscript𝑁R1subscript𝛼subscript𝑁Rsubscript𝑁Txsuperscript𝑒𝑗2𝜋𝜆subscript𝑑subscript𝑁Rsubscript𝑁Tx\begin{array}[]{c}\boldsymbol{G}_{\rm N}\end{array}=\begin{bmatrix}\alpha_{11}% e^{-j\frac{2\pi}{\lambda}{d_{11}}}&\cdots&\alpha_{1{N_{\rm Tx}}}e^{-j\frac{2% \pi}{\lambda}{d_{1{N_{\rm Tx}}}}}\\ \alpha_{21}e^{-j\frac{2\pi}{\lambda}{d_{21}}}&\cdots&\alpha_{2{N_{\rm Tx}}}e^{% -j\frac{2\pi}{\lambda}{d_{2{N_{\rm Tx}}}}}\\ \vdots&\ddots&\vdots\\ \alpha_{{N_{\rm R}1}}e^{-j\frac{2\pi}{\lambda}{d_{{N_{\rm R}1}}}}&\cdots&% \alpha_{N_{\rm R}N_{\rm Tx}}e^{-j\frac{2\pi}{\lambda}{d_{{N_{\rm R}}{N_{\rm Tx% }}}}}\end{bmatrix},start_ARRAY start_ROW start_CELL bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY = [ start_ARG start_ROW start_CELL italic_α start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL italic_α start_POSTSUBSCRIPT 1 italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d start_POSTSUBSCRIPT 1 italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_α start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL italic_α start_POSTSUBSCRIPT 2 italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d start_POSTSUBSCRIPT 2 italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_α start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL italic_α start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] , (2)

where αnRnTx=λ2(4πdnRnTx)2subscript𝛼subscript𝑛Rsubscript𝑛Txsuperscript𝜆2superscript4𝜋subscript𝑑subscript𝑛Rsubscript𝑛Tx2\alpha_{n_{\rm R}n_{\rm Tx}}=\frac{\lambda^{2}}{(4\pi d_{n_{\rm R}n_{\rm Tx}})% ^{2}}italic_α start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT = divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ( 4 italic_π italic_d start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG is the associated path coefficient, λ𝜆\lambdaitalic_λ is wavelength of the signal carrier frequency, and dnRnTxsubscript𝑑subscript𝑛Rsubscript𝑛Txd_{{n_{\rm R}}{n_{\rm Tx}}}italic_d start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT is the distance between the nTxsubscript𝑛Tx{n_{\rm Tx}}italic_n start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT-th antenna at the Tx and the nRsubscript𝑛R{n_{\rm R}}italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT-th element at the RIS. In contrast, the Tx-RIS channel has been approximated by a far-field channel model in the literature [26][27], i.e.,

𝑮F=γ𝒂NR(ψRaz,ψRel)𝒂NTxH(θTx),subscript𝑮F𝛾subscript𝒂subscript𝑁Rsubscript𝜓subscriptRazsubscript𝜓subscriptRelsubscriptsuperscript𝒂𝐻subscript𝑁Txsubscript𝜃Tx\boldsymbol{G}_{\rm F}=\gamma\boldsymbol{a}_{N_{\rm R}}(\psi_{{\rm R}_{\rm az}% },\psi_{{\rm R}_{\rm el}})\boldsymbol{a}^{H}_{N_{\rm Tx}}(\theta_{\rm Tx}),bold_italic_G start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT = italic_γ bold_italic_a start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) bold_italic_a start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ) , (3)

where γ=λ2(4πdTR)2ej2πλdTR𝛾superscript𝜆2superscript4𝜋subscript𝑑TR2superscript𝑒𝑗2𝜋𝜆subscript𝑑TR\gamma=\frac{\lambda^{2}}{(4\pi d_{\rm TR})^{2}}e^{-j\frac{2\pi}{\lambda}d_{% \rm TR}}italic_γ = divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ( 4 italic_π italic_d start_POSTSUBSCRIPT roman_TR end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d start_POSTSUBSCRIPT roman_TR end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the path coefficient and dTRsubscript𝑑TRd_{\rm TR}italic_d start_POSTSUBSCRIPT roman_TR end_POSTSUBSCRIPT is the distance between the centers of Tx and RIS. Moreover, 𝒂NTx(θTx)NTx×1subscript𝒂subscript𝑁Txsubscript𝜃Txsuperscriptsubscript𝑁Tx1\boldsymbol{a}_{N_{\rm Tx}}(\theta_{\rm Tx})\in\mathbb{C}^{N_{\rm Tx}\times 1}bold_italic_a start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT and 𝒂NR(ψRaz,ψRel)NR×1subscript𝒂subscript𝑁Rsubscript𝜓subscriptRazsubscript𝜓subscriptRelsuperscriptsubscript𝑁R1\boldsymbol{a}_{N_{\rm R}}(\psi_{{\rm R}_{\rm az}},\psi_{{\rm R}_{\rm el}})\in% \mathbb{C}^{N_{{\rm R}}\times 1}bold_italic_a start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT denote the array response vectors at the Tx and RIS, respectively, which are given by

𝒂NTx(θTx)=[1,,ej2πλ(NTx1)dsinθTx]T,subscript𝒂subscript𝑁Txsubscript𝜃Txsuperscript1superscript𝑒𝑗2𝜋𝜆subscript𝑁Tx1𝑑subscript𝜃Tx𝑇\boldsymbol{a}_{N_{\rm Tx}}(\theta_{\rm Tx})=\left[1,\cdots,e^{-j{\frac{2\pi}{% \lambda}(N_{\rm Tx}-1)d\sin{\theta}_{\rm Tx}}}\right]^{T},bold_italic_a start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ) = [ 1 , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT - 1 ) italic_d roman_sin italic_θ start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (4)

and

𝒂NR(ψRaz,ψRel)=[1,,ej2πλ(NRy1)dsinψRazcosψRel]T[1,,ej2πλ(NRz1)dsinψRazsinψRel]T,subscript𝒂subscript𝑁Rsubscript𝜓subscriptRazsubscript𝜓subscriptReltensor-productsuperscript1superscript𝑒𝑗2𝜋𝜆subscript𝑁subscriptR𝑦1𝑑subscript𝜓subscriptRazsubscript𝜓subscriptRel𝑇superscript1superscript𝑒𝑗2𝜋𝜆subscript𝑁subscriptR𝑧1𝑑subscript𝜓subscriptRazsubscript𝜓subscriptRel𝑇\displaystyle\boldsymbol{a}_{N_{\rm R}}(\psi_{{\rm R}_{\rm az}},\psi_{{\rm R}_% {\rm el}})=\left[1,\cdots,e^{-j{\frac{2\pi}{\lambda}(N_{{\rm R}_{y}}-1)d\sin{% \psi}_{{\rm R}_{\rm az}}\cos{\psi}_{{\rm R}_{\rm el}}}}\right]^{T}\otimes\left% [1,\cdots,e^{-j{\frac{2\pi}{\lambda}(N_{{\rm R}_{z}}-1)d\sin{\psi}_{{\rm R}_{% \rm az}}\sin{\psi}_{{\rm R}_{\rm el}}}}\right]^{T},bold_italic_a start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = [ 1 , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT - 1 ) italic_d roman_sin italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_cos italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ⊗ [ 1 , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT - 1 ) italic_d roman_sin italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_sin italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (5)

respectively, where θTxsubscript𝜃Tx\theta_{\rm Tx}italic_θ start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT is the azimuth AoD from the Tx to the center of the RIS, while ψRaz,ψRelsubscript𝜓subscriptRazsubscript𝜓subscriptRel\psi_{{\rm R}_{\rm az}},\psi_{{\rm R}_{\rm el}}italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_ψ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT are the azimuth and elevation AoAs at the center of the RIS.

On the other hand, based on the Saleh-Valenzuela channel model [28], the RIS-Rx channel matrix is given by

𝑹=l=1LRxβl𝒂NRx(φRxl)𝒃NRH(φRazl,φRell),𝑹superscriptsubscript𝑙1subscript𝐿Rxsubscript𝛽𝑙subscript𝒂subscript𝑁Rxsuperscriptsubscript𝜑Rx𝑙superscriptsubscript𝒃subscript𝑁R𝐻superscriptsubscript𝜑subscriptRaz𝑙superscriptsubscript𝜑subscriptRel𝑙\boldsymbol{R}=\sum_{l=1}^{L_{\rm Rx}}\beta_{l}{\boldsymbol{a}_{N_{\rm Rx}}}({% \varphi}_{{\rm Rx}}^{l})\boldsymbol{b}_{N_{\rm R}}^{H}({\varphi}_{{\rm R}_{\rm az% }}^{l},{\varphi}_{{\rm R}_{\rm el}}^{l}),bold_italic_R = ∑ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT bold_italic_a start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_φ start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) bold_italic_b start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT , italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) , (6)

where βl=λ2(4πdRR)2ej2πλdRRsubscript𝛽𝑙superscript𝜆2superscript4𝜋subscript𝑑RR2superscript𝑒𝑗2𝜋𝜆subscript𝑑RR\beta_{l}=\frac{\lambda^{2}}{(4\pi d_{\rm RR})^{2}}e^{-j\frac{2\pi}{\lambda}d_% {\rm RR}}italic_β start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ( 4 italic_π italic_d start_POSTSUBSCRIPT roman_RR end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d start_POSTSUBSCRIPT roman_RR end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the path coefficient of the l𝑙litalic_l-th path between the RIS and Rx, dRRsubscript𝑑RRd_{\rm RR}italic_d start_POSTSUBSCRIPT roman_RR end_POSTSUBSCRIPT is the distance between the centers of RIS and Rx, and LRxsubscript𝐿RxL_{\rm Rx}italic_L start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT is the total number of paths. The vectors 𝒂NRx(φRxl)NRx×1subscript𝒂subscript𝑁Rxsuperscriptsubscript𝜑Rx𝑙superscriptsubscript𝑁Rx1{\boldsymbol{a}_{N_{\rm Rx}}}({\varphi}_{{\rm Rx}}^{l})\in\mathbb{C}^{N_{{\rm Rx% }}\times 1}bold_italic_a start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_φ start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT and 𝒃NR(φRazl,φRell)NR×1subscript𝒃subscript𝑁Rsuperscriptsubscript𝜑subscriptRaz𝑙superscriptsubscript𝜑subscriptRel𝑙superscriptsubscript𝑁R1{\boldsymbol{b}_{N_{\rm R}}}({\varphi}_{{\rm R}_{\rm az}}^{l},{\varphi}_{{\rm R% }_{\rm el}}^{l})\in\mathbb{C}^{N_{\rm R}\times 1}bold_italic_b start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT , italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT denote the array response vectors at the Rx and RIS, respectively, and they are given by

𝒂NRx(φRxl)=[1,,ej2πλ(NRx1)dsinφRxl]Tsubscript𝒂subscript𝑁Rxsuperscriptsubscript𝜑Rx𝑙superscript1superscript𝑒𝑗2𝜋𝜆subscript𝑁Rx1𝑑superscriptsubscript𝜑Rx𝑙𝑇{\boldsymbol{a}_{N_{\rm Rx}}}({\varphi}_{\rm Rx}^{l})=\left[1,\cdots,e^{-j{% \frac{2\pi}{\lambda}(N_{\rm Rx}-1)d\sin{\varphi}_{\rm Rx}^{l}}}\right]^{T}bold_italic_a start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_φ start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) = [ 1 , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT - 1 ) italic_d roman_sin italic_φ start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT (7)

and

𝒃NR(φRazl,φRell)=[1,,ej2πλ(NRy1)dsinφRazlcosφRell]T[1,,ej2πλ(NRz1)dsinφRazlsinφRell]T,subscript𝒃subscript𝑁Rsuperscriptsubscript𝜑subscriptRaz𝑙superscriptsubscript𝜑subscriptRel𝑙tensor-productsuperscript1superscript𝑒𝑗2𝜋𝜆subscript𝑁subscriptR𝑦1𝑑superscriptsubscript𝜑subscriptRaz𝑙superscriptsubscript𝜑subscriptRel𝑙𝑇superscript1superscript𝑒𝑗2𝜋𝜆subscript𝑁subscriptR𝑧1𝑑superscriptsubscript𝜑subscriptRaz𝑙superscriptsubscript𝜑subscriptRel𝑙𝑇\displaystyle{\boldsymbol{b}_{N_{\rm R}}}({\varphi}_{{\rm R}_{\rm az}}^{l},{% \varphi}_{{\rm R}_{\rm el}}^{l})=\left[1,\cdots,e^{-j{\frac{2\pi}{\lambda}(N_{% {\rm R}_{y}}-1)d\sin{\varphi}_{{\rm R}_{\rm az}}^{l}\cos{\varphi}_{{\rm R}_{% \rm el}}^{l}}}\right]^{T}\otimes\left[1,\cdots,e^{-j{\frac{2\pi}{\lambda}(N_{{% \rm R}_{z}}-1)d\sin{\varphi}_{{\rm R}_{\rm az}}^{l}\sin{\varphi}_{{\rm R}_{\rm el% }}^{l}}}\right]^{T},bold_italic_b start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT , italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) = [ 1 , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT - 1 ) italic_d roman_sin italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT roman_cos italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ⊗ [ 1 , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT - 1 ) italic_d roman_sin italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT roman_sin italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (8)

respectively, where φRxlsuperscriptsubscript𝜑Rx𝑙{\varphi}_{\rm Rx}^{l}italic_φ start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT is the azimuth AoA of the l𝑙litalic_l-th path at the Rx, and φRazlsuperscriptsubscript𝜑subscriptRaz𝑙{\varphi}_{{\rm R}_{\rm az}}^{l}italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT and φRellsuperscriptsubscript𝜑subscriptRel𝑙{\varphi}_{{\rm R}_{\rm el}}^{l}italic_φ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT are the azimuth and elevation AoDs of the l𝑙litalic_l-th path from the RIS to the Rx, respectively.

Comparing (2) and (3), we can observe that the near-field channel model in (2) involves more parameters than that in (3). Indeed, when the Tx is located within the near-field region of the RIS, (2) is a more accurate model to describe the signal propagation between the Tx and RIS, compared to (3). It has been demonstrated in [22], [20], and [29] that rank(𝑮N)>rank(𝑮F)=1ranksubscript𝑮Nranksubscript𝑮F1\text{rank}({\boldsymbol{G}_{\rm N}})>\text{rank}({\boldsymbol{G}_{\rm F}})=1rank ( bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT ) > rank ( bold_italic_G start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT ) = 1 usually holds when the near-field condition is satisfied, i.e., the distance between the Tx and RIS is shorter than the Rayleigh distance. However, for the near-field channel model in (2), beamfocusing is required for passive beamforming design at the RIS, which is typically more sensitive to CEEs than conventional beamsteering for the far-field channel model. Therefore, we advocate the utilization of a piece-wise near-field channel model to approximate the near-field channel in (2). We propose to equally divide the NRysubscript𝑁subscriptR𝑦N_{{\rm R}_{y}}italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT RIS elements in each row of the RIS into K𝐾Kitalic_K subarrays, and divide the NRzsubscript𝑁subscriptR𝑧N_{{\rm R}_{z}}italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT RIS elements in each column of the RIS into K𝐾Kitalic_K subarrays. Without lost of generality, we assume that both NRyKsubscript𝑁subscriptR𝑦𝐾\frac{N_{{\rm R}_{y}}}{K}divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG and NRzKsubscript𝑁subscriptR𝑧𝐾\frac{N_{{\rm R}_{z}}}{K}divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG are integers. As a result, the original RIS is divided into K2superscript𝐾2K^{2}italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT subsurfaces and the Rayleigh distance is reduced by a factor of K2superscript𝐾2K^{2}italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Consequently, we can safely assume that the channel between each subsurface and the Tx follows a far-field channel model. Thus, the proposed piece-wise near-field channel matrix, 𝑮Psubscript𝑮P\boldsymbol{G}_{\rm P}bold_italic_G start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT, is given by

𝑮P=([𝒈1h𝒈Kh][𝒈1v𝒈Kv])𝒂NTxH(θTx),subscript𝑮Ptensor-productmatrixsubscriptsuperscript𝒈h1subscriptsuperscript𝒈h𝐾matrixsubscriptsuperscript𝒈v1subscriptsuperscript𝒈v𝐾superscriptsubscript𝒂subscript𝑁Tx𝐻subscript𝜃Tx\boldsymbol{G}_{\rm P}=\left(\begin{bmatrix}\boldsymbol{g}^{\rm h}_{1}\\ \vdots\\ \boldsymbol{g}^{\rm h}_{K}\\ \end{bmatrix}\otimes\begin{bmatrix}\boldsymbol{g}^{\rm v}_{1}\\ \vdots\\ \boldsymbol{g}^{\rm v}_{K}\\ \end{bmatrix}\right){\boldsymbol{a}_{N_{\rm Tx}}^{H}}({\theta}_{\rm Tx}),bold_italic_G start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT = ( [ start_ARG start_ROW start_CELL bold_italic_g start_POSTSUPERSCRIPT roman_h end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL bold_italic_g start_POSTSUPERSCRIPT roman_h end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] ⊗ [ start_ARG start_ROW start_CELL bold_italic_g start_POSTSUPERSCRIPT roman_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL bold_italic_g start_POSTSUPERSCRIPT roman_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] ) bold_italic_a start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ) , (9)

where 𝒂NTx(θTx)subscript𝒂subscript𝑁Txsubscript𝜃Tx{\boldsymbol{a}_{N_{\rm Tx}}}({\theta}_{\rm Tx})bold_italic_a start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ) was defined in (4), while 𝒈ihNRyK×1subscriptsuperscript𝒈h𝑖superscriptsubscript𝑁subscriptR𝑦𝐾1\boldsymbol{g}^{\rm h}_{i}\in\mathbb{C}^{{\frac{N_{{\rm R}_{y}}}{K}}\times 1}bold_italic_g start_POSTSUPERSCRIPT roman_h end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG × 1 end_POSTSUPERSCRIPT and 𝒈ivNRzK×1subscriptsuperscript𝒈v𝑖superscriptsubscript𝑁subscriptR𝑧𝐾1\boldsymbol{g}^{\rm v}_{i}\in\mathbb{C}^{{\frac{N_{{\rm R}_{z}}}{K}}\times 1}bold_italic_g start_POSTSUPERSCRIPT roman_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG × 1 end_POSTSUPERSCRIPT are defined by

𝒈ih=λ4πriejπλri𝒃NRyK(θRazi,θReli),i=1,,K,formulae-sequencesubscriptsuperscript𝒈h𝑖𝜆4𝜋subscript𝑟𝑖superscript𝑒𝑗𝜋𝜆subscript𝑟𝑖subscript𝒃subscript𝑁subscriptR𝑦𝐾subscriptsuperscript𝜃𝑖subscriptR𝑎𝑧subscriptsuperscript𝜃𝑖subscriptR𝑒𝑙𝑖1𝐾\boldsymbol{g}^{\rm h}_{i}=\frac{\lambda}{4\pi r_{i}}e^{-j{\frac{\pi}{\lambda}% r_{i}}}\boldsymbol{b}_{\frac{N_{{\rm R}_{y}}}{K}}(\theta^{i}_{{\rm R}_{az}},% \theta^{i}_{{\rm R}_{el}}),{i}=1,\ldots,K,bold_italic_g start_POSTSUPERSCRIPT roman_h end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG italic_λ end_ARG start_ARG 4 italic_π italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG italic_π end_ARG start_ARG italic_λ end_ARG italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT bold_italic_b start_POSTSUBSCRIPT divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG end_POSTSUBSCRIPT ( italic_θ start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_a italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_θ start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_e italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , italic_i = 1 , … , italic_K , (10)

and

𝒈iv=λ4πriejπλri𝒃NRzK(θRazi,θReli),i=1,,K,formulae-sequencesubscriptsuperscript𝒈v𝑖𝜆4𝜋subscript𝑟𝑖superscript𝑒𝑗𝜋𝜆subscript𝑟𝑖subscript𝒃subscript𝑁subscriptR𝑧𝐾subscriptsuperscript𝜃𝑖subscriptR𝑎𝑧subscriptsuperscript𝜃𝑖subscriptR𝑒𝑙𝑖1𝐾\boldsymbol{g}^{\rm v}_{i}=\frac{\lambda}{4\pi r_{i}}e^{-j{\frac{\pi}{\lambda}% r_{i}}}\boldsymbol{b}_{\frac{N_{{\rm R}_{z}}}{K}}(\theta^{i}_{{\rm R}_{az}},% \theta^{i}_{{\rm R}_{el}}),{i}=1,\ldots,K,bold_italic_g start_POSTSUPERSCRIPT roman_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG italic_λ end_ARG start_ARG 4 italic_π italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG italic_π end_ARG start_ARG italic_λ end_ARG italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT bold_italic_b start_POSTSUBSCRIPT divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG end_POSTSUBSCRIPT ( italic_θ start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_a italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_θ start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_e italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , italic_i = 1 , … , italic_K , (11)

respectively. In (10) and (11), risubscript𝑟𝑖{r_{i}}italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, θRazisuperscriptsubscript𝜃subscriptRaz𝑖{\theta}_{{\rm R}_{\rm az}}^{i}italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT, and θRelisuperscriptsubscript𝜃subscriptRel𝑖{\theta}_{{\rm R}_{\rm el}}^{i}italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT are the distance, azimuth, and elevation AoAs from the Tx to the center of the i𝑖iitalic_i-th subsurface, respectively. The vectors 𝒃NRyK(θRazi,θReli)subscript𝒃subscript𝑁subscriptR𝑦𝐾superscriptsubscript𝜃subscriptRaz𝑖superscriptsubscript𝜃subscriptRel𝑖\boldsymbol{b}_{\frac{N_{{\rm R}_{y}}}{K}}({\theta}_{{\rm R}_{\rm az}}^{i},{% \theta}_{{\rm R}_{\rm el}}^{i})bold_italic_b start_POSTSUBSCRIPT divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) and 𝒃NRzK(θRazi,θReli)subscript𝒃subscript𝑁subscriptR𝑧𝐾superscriptsubscript𝜃subscriptRaz𝑖superscriptsubscript𝜃subscriptRel𝑖{\boldsymbol{b}_{\frac{N_{{\rm R}_{z}}}{K}}}({\theta}_{{\rm R}_{\rm az}}^{i},{% \theta}_{{\rm R}_{\rm el}}^{i})bold_italic_b start_POSTSUBSCRIPT divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) are the horizontal and vertical array response vectors of the i𝑖iitalic_i-th subsurface which are given by

𝒃NRyK(θRazi,θReli)=[1,,ej2πλ(NRyK1)dsinθRazicosθReli]T,subscript𝒃subscript𝑁subscriptR𝑦𝐾superscriptsubscript𝜃subscriptRaz𝑖superscriptsubscript𝜃subscriptRel𝑖superscript1superscript𝑒𝑗2𝜋𝜆subscript𝑁subscriptR𝑦𝐾1𝑑superscriptsubscript𝜃subscriptRaz𝑖superscriptsubscript𝜃subscriptRel𝑖𝑇{\boldsymbol{b}_{\frac{N_{{\rm R}_{y}}}{K}}}({\theta}_{{\rm R}_{\rm az}}^{i},{% \theta}_{{\rm R}_{\rm el}}^{i})=\left[1,\cdots,e^{-j{\frac{2\pi}{\lambda}({% \frac{N_{{\rm R}_{y}}}{K}}-1)d\sin{\theta}_{{\rm R}_{\rm az}}^{i}\cos{\theta}_% {{\rm R}_{\rm el}}^{i}}}\right]^{T},bold_italic_b start_POSTSUBSCRIPT divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) = [ 1 , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG - 1 ) italic_d roman_sin italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT roman_cos italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (12)

and

𝒃NRzK(θRazi,θReli)=[1,,ej2πλ(NRzK1)dsinθRazisinθReli]T,subscript𝒃subscript𝑁subscriptR𝑧𝐾superscriptsubscript𝜃subscriptRaz𝑖superscriptsubscript𝜃subscriptRel𝑖superscript1superscript𝑒𝑗2𝜋𝜆subscript𝑁subscriptR𝑧𝐾1𝑑superscriptsubscript𝜃subscriptRaz𝑖superscriptsubscript𝜃subscriptRel𝑖𝑇{\boldsymbol{b}_{\frac{N_{{\rm R}_{z}}}{K}}}({\theta}_{{\rm R}_{\rm az}}^{i},{% \theta}_{{\rm R}_{\rm el}}^{i})=\left[1,\cdots,e^{-j{\frac{2\pi}{\lambda}({% \frac{N_{{\rm R}_{z}}}{K}}-1)d\sin{\theta}_{{\rm R}_{\rm az}}^{i}\sin{\theta}_% {{\rm R}_{\rm el}}^{i}}}\right]^{T},bold_italic_b start_POSTSUBSCRIPT divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) = [ 1 , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( divide start_ARG italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG - 1 ) italic_d roman_sin italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_az end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT roman_sin italic_θ start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT roman_el end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (13)

respectively. For illustration, let us take K=2𝐾2K=2italic_K = 2 as an example, which is shown in Fig. 1(b). In this case, the piece-wise near-field channel matrix 𝑮Psubscript𝑮P\boldsymbol{G}_{\rm P}bold_italic_G start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT is given by

𝑮P=([𝒈1h𝒈2h][𝒈1v𝒈2v])𝒂TxH(θTx).subscript𝑮Ptensor-productmatrixsubscriptsuperscript𝒈h1subscriptsuperscript𝒈h2matrixsubscriptsuperscript𝒈v1subscriptsuperscript𝒈v2superscriptsubscript𝒂Tx𝐻subscript𝜃Tx\boldsymbol{G}_{\rm P}=\left(\begin{bmatrix}\boldsymbol{g}^{\rm h}_{1}\\ \boldsymbol{g}^{\rm h}_{2}\\ \end{bmatrix}\otimes\begin{bmatrix}\boldsymbol{g}^{\rm v}_{1}\\ \boldsymbol{g}^{\rm v}_{2}\\ \end{bmatrix}\right){\boldsymbol{a}_{\rm Tx}^{H}}({\theta}_{\rm Tx}).bold_italic_G start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT = ( [ start_ARG start_ROW start_CELL bold_italic_g start_POSTSUPERSCRIPT roman_h end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_italic_g start_POSTSUPERSCRIPT roman_h end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] ⊗ [ start_ARG start_ROW start_CELL bold_italic_g start_POSTSUPERSCRIPT roman_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_italic_g start_POSTSUPERSCRIPT roman_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] ) bold_italic_a start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ) . (14)

If the number of subsurfaces is K2=1superscript𝐾21K^{2}=1italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 1, the piece-wise near-field channel model degenerates to the far-field case 𝑮Fsubscript𝑮F\boldsymbol{G}_{\rm F}bold_italic_G start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT in (3). If K=NRy=NRz𝐾subscript𝑁subscriptR𝑦subscript𝑁subscriptR𝑧K=N_{{\rm R}_{y}}=N_{{\rm R}_{z}}italic_K = italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_N start_POSTSUBSCRIPT roman_R start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT end_POSTSUBSCRIPT, the piece-wise near-field model becomes the accurate traditional near-field channel model 𝑮Nsubscript𝑮N\boldsymbol{G}_{\rm N}bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT in (2). In other words, the piece-wise near-field channel model bridges the near-field and the far-field via fine tuning the number of subsurfaces. Comparing (2), (3), and (9), we can observe that the piece-wise channel model not only requires less number of parameters than the near-field model, but also enjoys a higher modeling accuracy than the far-field model. Indeed, the model presented in (9) is inspired by the model adopted in [30], which assumes the use of a ULA. Our proposed model in (9) considers the more practical uniform planar array (UPA) configuration of RIS, which is a more general model. Moreover, inheriting from the near-field channel model, the piece-wise channel matrix in (9) still avails of a higher rank than the far-field channel matrix in (3), i.e., rank(𝑮P)rank(𝑮F)=1ranksubscript𝑮Pranksubscript𝑮F1\text{rank}(\boldsymbol{G}_{\rm P})\geq\text{rank}(\boldsymbol{G}_{\rm F})=1rank ( bold_italic_G start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT ) ≥ rank ( bold_italic_G start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT ) = 1 [20], and thus it can improve the system’s DoF and performance by exploiting the distance and angle diversity among different subsurfaces. In other words, the robustness stems from the fact that piece beamsteering is less prone to errors in distance and angle when there exist CEEs.

III Analysis of Interference Distribution for Different Channel Models

In this section, we first introduce the CEE models and then analyze the distribution of interference-plus-noise signal for the three given models.

III-A Channel Estimation Error Models

Based on different channel models, one can obtain the estimated channel via traditional training and parameter estimation procedures [31], [32]. Note that when channel modeling is not accurate, the estimated channel suffers from not only CEEs, but also model mismatches. In particular, the actual channel between the Tx and RIS, which should follow a near-field channel model in the considered system, is composed of the estimated channel, the corresponding estimation error, and the model mismatch error. In the following, we represent the channel 𝑮Nsubscript𝑮N\boldsymbol{G}_{\rm N}bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT by different channel models:

𝑮Nsubscript𝑮N\displaystyle\boldsymbol{G}_{\rm N}bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT =𝑮^N+Δ𝑮N+Δ𝑴N,absentsubscript^𝑮NΔsubscript𝑮NΔsubscript𝑴N\displaystyle=\hat{\boldsymbol{G}}_{\rm N}+\Delta{\boldsymbol{G}}_{\rm N}+% \Delta{\boldsymbol{M}}_{\rm N},= over^ start_ARG bold_italic_G end_ARG start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT + roman_Δ bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT + roman_Δ bold_italic_M start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT , [Conventional] (15)
𝑮Nsubscript𝑮N\displaystyle\boldsymbol{G}_{\rm N}bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT =𝑮^P+Δ𝑮P+Δ𝑴P,andabsentsubscript^𝑮PΔsubscript𝑮PΔsubscript𝑴Pand\displaystyle=\hat{\boldsymbol{G}}_{\rm P}+\Delta{\boldsymbol{G}}_{\rm P}+% \Delta{\boldsymbol{M}}_{\rm P},\text{and}= over^ start_ARG bold_italic_G end_ARG start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT + roman_Δ bold_italic_G start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT + roman_Δ bold_italic_M start_POSTSUBSCRIPT roman_P end_POSTSUBSCRIPT , and [Proposed] (16)
𝑮Nsubscript𝑮N\displaystyle\boldsymbol{G}_{\rm N}bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT =𝑮^F+Δ𝑮F+Δ𝑴F.absentsubscript^𝑮FΔsubscript𝑮FΔsubscript𝑴F\displaystyle=\hat{\boldsymbol{G}}_{\rm F}+\Delta{\boldsymbol{G}}_{\rm F}+% \Delta{\boldsymbol{M}}_{\rm F}.= over^ start_ARG bold_italic_G end_ARG start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT + roman_Δ bold_italic_G start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT + roman_Δ bold_italic_M start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT . [Far-field] (17)

For concise notation, we use subscripts {1}={N}1N\{1\}=\{\rm N\}{ 1 } = { roman_N }, {2}={P}2P\{2\}=\{\rm P\}{ 2 } = { roman_P }, and {3}={F}3F\{3\}=\{\rm F\}{ 3 } = { roman_F } to denote the conventional near-field, the proposed piece-wise near-field, and the far-field channel models, respectively, while 𝑮^isubscript^𝑮𝑖\hat{\boldsymbol{G}}_{i}over^ start_ARG bold_italic_G end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, Δ𝑮iΔsubscript𝑮𝑖\Delta{\boldsymbol{G}}_{i}roman_Δ bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, and Δ𝑴i,i=1,2,3formulae-sequenceΔsubscript𝑴𝑖𝑖123\Delta{\boldsymbol{M}}_{i},i=1,2,3roman_Δ bold_italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i = 1 , 2 , 3, represent the estimated channel, the CEEs and the model mismatch error of different channel models, respectively. The left-hand side of (15), (16), and (17) is the ground-truth near-field channel state information 𝑮Nsubscript𝑮N\boldsymbol{G}_{\rm N}bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT. As discussed in (2), (3), and (9), different channel models represent 𝑮Nsubscript𝑮N\boldsymbol{G}_{\rm N}bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT with different channel matrix structures, i.e., 𝑮N=𝑮i+Δ𝑴i,i=1,2,3formulae-sequencesubscript𝑮Nsubscript𝑮𝑖Δsubscript𝑴𝑖𝑖123\boldsymbol{G}_{\rm N}={\boldsymbol{G}}_{i}+\Delta{\boldsymbol{M}}_{i},i=1,2,3bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT = bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + roman_Δ bold_italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i = 1 , 2 , 3. When i=1𝑖1i=1italic_i = 1, this means that we adopt the near-field channel model for channel estimation which is free of model mismatch, i.e., Δ𝑴1=𝟎Δsubscript𝑴10\Delta{\boldsymbol{M}}_{1}=\boldsymbol{0}roman_Δ bold_italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = bold_0. Based on different channel models in (2), (3), and (9), channel estimation introduces additional CEE, i.e., 𝑮i=𝑮^i+Δ𝑮i,i=1,2,3formulae-sequencesubscript𝑮𝑖subscript^𝑮𝑖Δsubscript𝑮𝑖𝑖123\boldsymbol{G}_{i}=\hat{\boldsymbol{G}}_{i}+\Delta{\boldsymbol{G}}_{i},i=1,2,3bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = over^ start_ARG bold_italic_G end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + roman_Δ bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i = 1 , 2 , 3.

Assuming that the model mismatch errors are deterministic and the CEE follows a matrix-variate Gaussian distribution [33], we have

𝑮Nsubscript𝑮N\displaystyle\boldsymbol{G}_{\rm N}bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT =𝑮^i+Δ𝑴i+Δ𝑮i,andabsentsubscript^𝑮𝑖Δsubscript𝑴𝑖Δsubscript𝑮𝑖and\displaystyle=\hat{\boldsymbol{G}}_{i}+\Delta{\boldsymbol{M}}_{i}+\Delta{% \boldsymbol{G}}_{i},\ \text{and}= over^ start_ARG bold_italic_G end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + roman_Δ bold_italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + roman_Δ bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , and (18)
Δ𝑮iΔsubscript𝑮𝑖\displaystyle\Delta{\boldsymbol{G}}_{i}roman_Δ bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT 𝒞𝒩NR,NTx(𝟎,σ𝑮i2𝐈NR𝐈NTx).similar-toabsent𝒞subscript𝒩subscript𝑁Rsubscript𝑁Tx0tensor-productsuperscriptsubscript𝜎subscript𝑮𝑖2subscript𝐈subscript𝑁Rsubscript𝐈subscript𝑁Tx\displaystyle\sim\mathcal{CN}_{N_{\rm R},N_{\rm Tx}}(\boldsymbol{0},\sigma_{{% \boldsymbol{G}}_{i}}^{2}\boldsymbol{\rm I}_{N_{\rm R}}\otimes\boldsymbol{\rm I% }_{N_{\rm Tx}}).∼ caligraphic_C caligraphic_N start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT , italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_0 , italic_σ start_POSTSUBSCRIPT bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) . (19)

Then, the distribution of the overall CSI imperfection Δ𝑮~i=Δ𝑮i+Δ𝑴iΔsubscript~𝑮𝑖Δsubscript𝑮𝑖Δsubscript𝑴𝑖\Delta\widetilde{\boldsymbol{G}}_{i}=\Delta{\boldsymbol{G}}_{i}+\Delta{% \boldsymbol{M}}_{i}roman_Δ over~ start_ARG bold_italic_G end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = roman_Δ bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + roman_Δ bold_italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT follows

Δ𝑮~i𝒞𝒩NR,NTx(Δ𝑴i,σ𝑮i2𝐈NR𝐈NTx).similar-toΔsubscript~𝑮𝑖𝒞subscript𝒩subscript𝑁Rsubscript𝑁TxΔsubscript𝑴𝑖tensor-productsuperscriptsubscript𝜎subscript𝑮𝑖2subscript𝐈subscript𝑁Rsubscript𝐈subscript𝑁Tx\displaystyle\Delta\widetilde{\boldsymbol{G}}_{i}\sim\mathcal{CN}_{N_{\rm R},N% _{\rm Tx}}(\Delta{\boldsymbol{M}}_{i},\sigma_{{\boldsymbol{G}}_{i}}^{2}% \boldsymbol{\rm I}_{N_{\rm R}}\otimes\boldsymbol{\rm I}_{N_{\rm Tx}}).roman_Δ over~ start_ARG bold_italic_G end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∼ caligraphic_C caligraphic_N start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT , italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Δ bold_italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) . (20)

Similarly, the channel between the RIS and Rx is given by

𝑹=𝑹^+Δ𝑹,Δ𝑹𝒞𝒩NRx,NR(𝟎,σ𝑹2𝐈NRx𝐈NR),formulae-sequence𝑹^𝑹Δ𝑹similar-toΔ𝑹𝒞subscript𝒩subscript𝑁Rxsubscript𝑁R0tensor-productsuperscriptsubscript𝜎𝑹2subscript𝐈subscript𝑁Rxsubscript𝐈subscript𝑁R\boldsymbol{R}=\hat{\boldsymbol{R}}+\Delta{\boldsymbol{R}},\Delta{\boldsymbol{% R}}\sim\mathcal{CN}_{N_{\rm Rx},N_{\rm R}}(\boldsymbol{0},\sigma_{{\boldsymbol% {R}}}^{2}\boldsymbol{\rm I}_{N_{\rm Rx}}\otimes\boldsymbol{\rm I}_{N_{\rm R}}),bold_italic_R = over^ start_ARG bold_italic_R end_ARG + roman_Δ bold_italic_R , roman_Δ bold_italic_R ∼ caligraphic_C caligraphic_N start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT , italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_0 , italic_σ start_POSTSUBSCRIPT bold_italic_R end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊗ bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , (21)

where 𝑹^^𝑹\hat{\boldsymbol{R}}over^ start_ARG bold_italic_R end_ARG and Δ𝑹Δ𝑹\Delta{\boldsymbol{R}}roman_Δ bold_italic_R represent the estimated channel and CEE of the RIS-Rx channel, respectively. To facilitate the subsequent analysis and design, we make the assumption that the channels of the Tx-RIS and RIS-Rx links are independently estimated and, thus, Δ𝑮~iΔsubscript~𝑮𝑖\Delta\widetilde{\boldsymbol{G}}_{i}roman_Δ over~ start_ARG bold_italic_G end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and Δ𝑹Δ𝑹\Delta{\boldsymbol{R}}roman_Δ bold_italic_R are independent of each other.111When dealing with a passive RIS, it becomes necessary to reconstruct the individual CSI of all links involving the RIS based on the acquired aggregate CSI, specifically the cascaded Tx-RIS-Rx CSI. This can be achieved by employing the methodology outlined in [34], [35].

III-B Covariance Matrix of the Interference-plus-Noise Signal

For concise notation, let us drop the subscript for now. Based on the assumed CEE model of 𝑮𝑮\boldsymbol{G}bold_italic_G for the three channel models, we analyze the CEE distribution of the cascaded channel 𝑯𝑯\boldsymbol{H}bold_italic_H as below. Assuming that the estimations of the Tx-RIS and RIS-Rx channels are separately executed with their estimated values 𝑮^^𝑮\hat{\boldsymbol{G}}over^ start_ARG bold_italic_G end_ARG and 𝑹^^𝑹\hat{\boldsymbol{R}}over^ start_ARG bold_italic_R end_ARG [27], [35], respectively, the estimated cascaded channel is given by

𝑯^=𝑹^𝚽𝑮^.^𝑯^𝑹𝚽^𝑮\hat{\boldsymbol{H}}=\hat{\boldsymbol{R}}\boldsymbol{\Phi}\hat{\boldsymbol{G}}.over^ start_ARG bold_italic_H end_ARG = over^ start_ARG bold_italic_R end_ARG bold_Φ over^ start_ARG bold_italic_G end_ARG . (22)

Then, we define Δ𝑯Δ𝑯\Delta\boldsymbol{H}roman_Δ bold_italic_H and Δ𝑯MΔsubscript𝑯M\Delta{\boldsymbol{H}}_{\rm M}roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT as the cascaded channel estimation error and the corresponding model mismatch error, respectively, which are given by

Δ𝑯Δ𝑯\displaystyle\Delta\boldsymbol{H}roman_Δ bold_italic_H =Δ𝑹𝚽𝑮^+𝑹^𝚽Δ𝑮+Δ𝑹𝚽Δ𝑮, andabsentΔ𝑹𝚽^𝑮^𝑹𝚽Δ𝑮Δ𝑹𝚽Δ𝑮 and\displaystyle=\Delta\boldsymbol{R}\boldsymbol{\Phi}\hat{\boldsymbol{G}}+\hat{% \boldsymbol{R}}\boldsymbol{\Phi}\Delta{\boldsymbol{G}}+\Delta\boldsymbol{R}% \boldsymbol{\Phi}\Delta{\boldsymbol{G}},\text{ and}= roman_Δ bold_italic_R bold_Φ over^ start_ARG bold_italic_G end_ARG + over^ start_ARG bold_italic_R end_ARG bold_Φ roman_Δ bold_italic_G + roman_Δ bold_italic_R bold_Φ roman_Δ bold_italic_G , and (23)
Δ𝑯MΔsubscript𝑯M\displaystyle\Delta\boldsymbol{H}_{\rm M}roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT =𝑹^𝚽Δ𝑴+Δ𝑹𝚽Δ𝑴,absent^𝑹𝚽Δ𝑴Δ𝑹𝚽Δ𝑴\displaystyle=\hat{\boldsymbol{R}}\boldsymbol{\Phi}\Delta\boldsymbol{M}+\Delta% \boldsymbol{R}\boldsymbol{\Phi}\Delta\boldsymbol{M},= over^ start_ARG bold_italic_R end_ARG bold_Φ roman_Δ bold_italic_M + roman_Δ bold_italic_R bold_Φ roman_Δ bold_italic_M , (24)

respectively. The received signal at the Rx in (1) can be reformulated as

𝒚=(𝑯^+Δ𝑯+Δ𝑯M)𝑾𝒔+𝒏=𝑯^𝑾𝒔+𝒏^,𝒚^𝑯Δ𝑯Δsubscript𝑯M𝑾𝒔𝒏^𝑯𝑾𝒔^𝒏\boldsymbol{y}=(\hat{\boldsymbol{H}}+\Delta\boldsymbol{H}+\Delta\boldsymbol{H}% _{\rm M})\boldsymbol{Ws}+\boldsymbol{n}=\hat{\boldsymbol{H}}\boldsymbol{Ws}+% \hat{\boldsymbol{n}},bold_italic_y = ( over^ start_ARG bold_italic_H end_ARG + roman_Δ bold_italic_H + roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT ) bold_italic_W bold_italic_s + bold_italic_n = over^ start_ARG bold_italic_H end_ARG bold_italic_W bold_italic_s + over^ start_ARG bold_italic_n end_ARG , (25)

with the interference-plus-noise signal 𝒏^=(Δ𝑯+Δ𝑯M)𝑾𝒔+𝒏^𝒏Δ𝑯Δsubscript𝑯M𝑾𝒔𝒏\hat{\boldsymbol{n}}=(\Delta\boldsymbol{H}+\Delta\boldsymbol{H}_{\rm M})% \boldsymbol{Ws}+\boldsymbol{n}over^ start_ARG bold_italic_n end_ARG = ( roman_Δ bold_italic_H + roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT ) bold_italic_W bold_italic_s + bold_italic_n.

For the convenience of analysis, we assume that

Δ𝑹FsubscriptnormΔ𝑹𝐹\displaystyle\|\Delta\boldsymbol{R}\|_{F}∥ roman_Δ bold_italic_R ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT 𝑹^F,much-less-thanabsentsubscriptnorm^𝑹𝐹\displaystyle\ll\|\hat{\boldsymbol{R}}\|_{F},≪ ∥ over^ start_ARG bold_italic_R end_ARG ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT , (26)
Δ𝑮FsubscriptnormΔ𝑮𝐹\displaystyle\|\Delta\boldsymbol{G}\|_{F}∥ roman_Δ bold_italic_G ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT 𝑮^F,andmuch-less-thanabsentsubscriptnorm^𝑮𝐹and\displaystyle\ll\|\hat{\boldsymbol{G}}\|_{F},\text{and}≪ ∥ over^ start_ARG bold_italic_G end_ARG ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT , and (27)
Δ𝑴FsubscriptnormΔ𝑴𝐹\displaystyle\|\Delta\boldsymbol{M}\|_{F}∥ roman_Δ bold_italic_M ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT 𝑮^F,much-less-thanabsentsubscriptnorm^𝑮𝐹\displaystyle\ll\|\hat{\boldsymbol{G}}\|_{F},≪ ∥ over^ start_ARG bold_italic_G end_ARG ∥ start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT , (28)

respectively, which implies a relatively small CEE and model mismatch.222The assumptions are reasonable as the errors are significantly smaller than their estimated values, indicating a high level of confidence in the accuracy of the estimations. Referring to the CEE model in Section III-A, the estimated channel is significantly dominant, justifying this assumption. By discarding the minor terms Δ𝑹𝚽Δ𝑮Δ𝑹𝚽Δ𝑮\Delta\boldsymbol{R}\boldsymbol{\Phi}\Delta{\boldsymbol{G}}roman_Δ bold_italic_R bold_Φ roman_Δ bold_italic_G and Δ𝑹𝚽Δ𝑴Δ𝑹𝚽Δ𝑴\Delta\boldsymbol{R}\boldsymbol{\Phi}\Delta\boldsymbol{M}roman_Δ bold_italic_R bold_Φ roman_Δ bold_italic_M in (23) and (24), Δ𝑯Δ𝑯\Delta\boldsymbol{H}roman_Δ bold_italic_H and Δ𝑯MΔsubscript𝑯M\Delta\boldsymbol{H}_{\rm M}roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT can be approximated as:

Δ𝑯Δ𝑯\displaystyle\Delta\boldsymbol{H}roman_Δ bold_italic_H Δ𝑹𝚽𝑮^+𝑹^𝚽Δ𝑮 andabsentΔ𝑹𝚽^𝑮^𝑹𝚽Δ𝑮 and\displaystyle\approx\Delta\boldsymbol{R}\boldsymbol{\Phi}\hat{\boldsymbol{G}}+% \hat{\boldsymbol{R}}\boldsymbol{\Phi}\Delta{\boldsymbol{G}}\text{ and}≈ roman_Δ bold_italic_R bold_Φ over^ start_ARG bold_italic_G end_ARG + over^ start_ARG bold_italic_R end_ARG bold_Φ roman_Δ bold_italic_G and (29)
Δ𝑯MΔsubscript𝑯M\displaystyle\Delta\boldsymbol{H}_{\rm M}roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT 𝑹^𝚽Δ𝑴,absent^𝑹𝚽Δ𝑴\displaystyle\approx\hat{\boldsymbol{R}}\boldsymbol{\Phi}\Delta\boldsymbol{M},≈ over^ start_ARG bold_italic_R end_ARG bold_Φ roman_Δ bold_italic_M , (30)

respectively. Notice that Δ𝑯MΔsubscript𝑯M\Delta\boldsymbol{H}_{\rm M}roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT in (30) is approximately deterministic when the passive beamforming strategy 𝚽𝚽\boldsymbol{\Phi}bold_Φ and the estimated channel 𝑹^^𝑹\hat{\boldsymbol{R}}over^ start_ARG bold_italic_R end_ARG are given.

The covariance matrix of the interference-plus-noise signal 𝒏^^𝒏\hat{\boldsymbol{n}}over^ start_ARG bold_italic_n end_ARG is given by

𝚺𝒏^subscript𝚺^𝒏\displaystyle\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT =𝔼{𝒏^𝒏^H}absent𝔼^𝒏superscript^𝒏𝐻\displaystyle=\mathbb{E}\{\hat{\boldsymbol{n}}\hat{\boldsymbol{n}}^{\it H}\}= blackboard_E { over^ start_ARG bold_italic_n end_ARG over^ start_ARG bold_italic_n end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT } (31)
=Δ𝑯M𝑾𝑾HΔ𝑯MH+𝔼{Δ𝑯𝑾𝑾HΔ𝑯H}+σ2𝐈NRxabsentΔsubscript𝑯M𝑾superscript𝑾𝐻Δsubscriptsuperscript𝑯𝐻M𝔼Δ𝑯𝑾superscript𝑾𝐻Δsuperscript𝑯𝐻superscript𝜎2subscript𝐈subscript𝑁Rx\displaystyle=\Delta\boldsymbol{H}_{\rm M}{\boldsymbol{W}}{\boldsymbol{W}}^{H}% \Delta\boldsymbol{H}^{H}_{\rm M}+\mathbb{E}\{\Delta\boldsymbol{H}{\boldsymbol{% W}}{\boldsymbol{W}}^{H}\Delta\boldsymbol{H}^{H}\}+\sigma^{2}{\boldsymbol{\rm I% }}_{N_{\rm Rx}}= roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Δ bold_italic_H start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT + blackboard_E { roman_Δ bold_italic_H bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Δ bold_italic_H start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT } + italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT
=Δ𝑯M𝑾𝑾HΔ𝑯MH+𝔼{Δ𝑹𝚽𝑮^𝑾𝑾H𝑮^H𝚽HΔ𝑹H}absentΔsubscript𝑯M𝑾superscript𝑾𝐻Δsubscriptsuperscript𝑯𝐻M𝔼Δ𝑹𝚽^𝑮𝑾superscript𝑾𝐻superscript^𝑮𝐻superscript𝚽𝐻Δsuperscript𝑹𝐻\displaystyle=\Delta\boldsymbol{H}_{\rm M}{\boldsymbol{W}}{\boldsymbol{W}}^{H}% \Delta\boldsymbol{H}^{H}_{\rm M}+\mathbb{E}\{\Delta\boldsymbol{R}\boldsymbol{% \Phi}\hat{\boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\hat{\boldsymbol{% G}}^{H}\boldsymbol{\Phi}^{H}\Delta\boldsymbol{R}^{H}\}= roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Δ bold_italic_H start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT + blackboard_E { roman_Δ bold_italic_R bold_Φ over^ start_ARG bold_italic_G end_ARG bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_G end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Δ bold_italic_R start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT }
+𝔼{𝑹^𝚽Δ𝑮𝑾𝑾HΔ𝑮H𝚽H𝑹^H}+σ2𝐈NRx.𝔼^𝑹𝚽Δ𝑮𝑾superscript𝑾𝐻Δsuperscript𝑮𝐻superscript𝚽𝐻superscript^𝑹𝐻superscript𝜎2subscript𝐈subscript𝑁Rx\displaystyle+\mathbb{E}\{\hat{\boldsymbol{R}}\boldsymbol{\Phi}\Delta{% \boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\Delta{\boldsymbol{G}}^{H}% \boldsymbol{\Phi}^{H}\hat{\boldsymbol{R}}^{H}\}+\sigma^{2}{\boldsymbol{\rm I}}% _{N_{\rm Rx}}.+ blackboard_E { over^ start_ARG bold_italic_R end_ARG bold_Φ roman_Δ bold_italic_G bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Δ bold_italic_G start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_R end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT } + italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT .

Next, we utilize the following lemma to calculate each term of 𝚺𝒏^subscript𝚺^𝒏\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT .

Lemma 1 ([36])

For a matrix 𝐗n×m𝐗superscript𝑛𝑚\boldsymbol{X}\in\mathbb{C}^{n\times m}bold_italic_X ∈ blackboard_C start_POSTSUPERSCRIPT italic_n × italic_m end_POSTSUPERSCRIPT, which obeys the distribution 𝐗𝒞𝒩n,m(𝐗^,𝐑n𝐑m)similar-to𝐗𝒞subscript𝒩𝑛𝑚^𝐗tensor-productsubscript𝐑𝑛subscript𝐑𝑚\boldsymbol{X}\sim{\mathcal{CN}_{n,m}}(\hat{\boldsymbol{X}},\boldsymbol{R}_{n}% \otimes\boldsymbol{R}_{m})bold_italic_X ∼ caligraphic_C caligraphic_N start_POSTSUBSCRIPT italic_n , italic_m end_POSTSUBSCRIPT ( over^ start_ARG bold_italic_X end_ARG , bold_italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊗ bold_italic_R start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ), where 𝐑mm×msubscript𝐑𝑚superscript𝑚𝑚\boldsymbol{R}_{m}\in\mathbb{C}^{m\times m}bold_italic_R start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_m × italic_m end_POSTSUPERSCRIPT and 𝐑nn×nsubscript𝐑𝑛superscript𝑛𝑛\boldsymbol{R}_{n}\in\mathbb{C}^{n\times n}bold_italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_n × italic_n end_POSTSUPERSCRIPT represent the receive and transmit correlation matrices, respectively, and a compatible matrix 𝐙𝐙\boldsymbol{Z}bold_italic_Z, it follows that

𝔼{𝑿𝒁𝑿H}=𝑿^𝒁𝑿^H+tr(𝒁𝑹mT)𝑹n𝔼𝑿𝒁superscript𝑿𝐻^𝑿𝒁superscript^𝑿𝐻tr𝒁superscriptsubscript𝑹𝑚𝑇subscript𝑹𝑛\mathbb{E}\left\{\boldsymbol{X}\boldsymbol{Z}{\boldsymbol{X}}^{H}\right\}=\hat% {\boldsymbol{X}}\boldsymbol{Z}\hat{\boldsymbol{X}}^{H}+\text{tr}(\boldsymbol{Z% }\boldsymbol{R}_{m}^{T}){\boldsymbol{R}_{n}}blackboard_E { bold_italic_X bold_italic_Z bold_italic_X start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT } = over^ start_ARG bold_italic_X end_ARG bold_italic_Z over^ start_ARG bold_italic_X end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT + tr ( bold_italic_Z bold_italic_R start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ) bold_italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.

It follows from Lemma 1 that

𝔼{Δ𝑹𝚽𝑮^𝑾𝑾H𝑮^H𝚽HΔ𝑹H}=σ𝑹2tr(𝑮^𝑾𝑾H𝑮^H)𝐈NRx,𝔼Δ𝑹𝚽^𝑮𝑾superscript𝑾𝐻superscript^𝑮𝐻superscript𝚽𝐻Δsuperscript𝑹𝐻superscriptsubscript𝜎𝑹2tr^𝑮𝑾superscript𝑾𝐻superscript^𝑮𝐻subscript𝐈subscript𝑁Rx\displaystyle\mathbb{E}\left\{\Delta\boldsymbol{R}\boldsymbol{\Phi}\hat{% \boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\hat{\boldsymbol{G}}^{H}% \boldsymbol{\Phi}^{H}\Delta\boldsymbol{R}^{H}\right\}=\sigma_{{\boldsymbol{R}}% }^{2}\text{tr}(\hat{\boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\hat{% \boldsymbol{G}}^{H})\boldsymbol{\rm I}_{N_{\rm Rx}},blackboard_E { roman_Δ bold_italic_R bold_Φ over^ start_ARG bold_italic_G end_ARG bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_G end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Δ bold_italic_R start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT } = italic_σ start_POSTSUBSCRIPT bold_italic_R end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT tr ( over^ start_ARG bold_italic_G end_ARG bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_G end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT , (32)

and

𝔼{𝑹^𝚽Δ𝑮𝑾𝑾HΔ𝑮H𝚽H𝑹^H}=σ𝑮2tr(𝑾𝑾H)𝑹^𝑹^H.𝔼^𝑹𝚽Δ𝑮𝑾superscript𝑾𝐻Δsuperscript𝑮𝐻superscript𝚽𝐻superscript^𝑹𝐻superscriptsubscript𝜎𝑮2tr𝑾superscript𝑾𝐻^𝑹superscript^𝑹𝐻\displaystyle\mathbb{E}\left\{\hat{\boldsymbol{R}}\boldsymbol{\Phi}\Delta{% \boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\Delta{\boldsymbol{G}}^{H}% \boldsymbol{\Phi}^{H}\hat{\boldsymbol{R}}^{H}\right\}=\sigma_{{\boldsymbol{G}}% }^{2}\text{tr}({\boldsymbol{W}}{\boldsymbol{W}}^{H})\hat{\boldsymbol{R}}{\hat{% \boldsymbol{R}}}^{H}.blackboard_E { over^ start_ARG bold_italic_R end_ARG bold_Φ roman_Δ bold_italic_G bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Δ bold_italic_G start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_R end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT } = italic_σ start_POSTSUBSCRIPT bold_italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT tr ( bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) over^ start_ARG bold_italic_R end_ARG over^ start_ARG bold_italic_R end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT . (33)

Inserting (32) and (33) into (31), the covariance matrix of the interference-plus-noise signal in the presence of CEE and model mismatch is given by

𝚺𝒏^subscript𝚺^𝒏\displaystyle\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT =Δ𝑯M𝑾𝑾HΔ𝑯MH+σ𝑹2tr(𝑮^𝑾𝑾H𝑮^H)𝐈NRx+σ𝑮2tr(𝑾𝑾H)𝑹^𝑹^H+σ2𝐈NRx.absentΔsubscript𝑯M𝑾superscript𝑾𝐻Δsubscriptsuperscript𝑯𝐻Msuperscriptsubscript𝜎𝑹2tr^𝑮𝑾superscript𝑾𝐻superscript^𝑮𝐻subscript𝐈subscript𝑁Rxsuperscriptsubscript𝜎𝑮2tr𝑾superscript𝑾𝐻^𝑹superscript^𝑹𝐻superscript𝜎2subscript𝐈subscript𝑁Rx\displaystyle=\Delta\boldsymbol{H}_{\rm M}{\boldsymbol{W}}{\boldsymbol{W}}^{H}% \Delta\boldsymbol{H}^{H}_{\rm M}+\sigma_{{\boldsymbol{R}}}^{2}\text{tr}(\hat{% \boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\hat{\boldsymbol{G}}^{H})% \boldsymbol{\rm I}_{N_{\rm Rx}}+\sigma_{{\boldsymbol{G}}}^{2}\text{tr}({% \boldsymbol{W}}{\boldsymbol{W}}^{H})\hat{\boldsymbol{R}}{\hat{\boldsymbol{R}}}% ^{H}+\sigma^{2}{\boldsymbol{\rm I}}_{N_{\rm Rx}}.= roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT roman_Δ bold_italic_H start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT bold_italic_R end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT tr ( over^ start_ARG bold_italic_G end_ARG bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_G end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT bold_italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT tr ( bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) over^ start_ARG bold_italic_R end_ARG over^ start_ARG bold_italic_R end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT + italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (34)

Substituting the subscript i={1,2,3}𝑖123i=\{1,2,3\}italic_i = { 1 , 2 , 3 }, we can obtain the interference distributions for the three channel models caused by the CEEs and model mismatches, respectively.

Note that for the near-field channel model, i.e., Δ𝑴1=𝟎Δsubscript𝑴10\Delta\boldsymbol{M}_{1}=\boldsymbol{0}roman_Δ bold_italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = bold_0, the covariance matrix 𝚺𝒏^subscript𝚺^𝒏\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT is influenced by the CSI error in the second and third terms and noise in the last term in (34), which is in line with the traditional imperfect CSI schemes that do not account for the model mismatch. On the other side, both the piece-wise near-field model and far-field model introduce additional interference due to the model mismatch in the first term in (34), which results in performance deterioration. Nevertheless, the piece-wise near-field model is a good compromise between the near-field and far-field models in terms of the number of channel model parameters and modeling accuracy. Indeed, the robustness of the proposed model arises from the system’s DoF introduced by the near-field model, as well as the insensitivity of the far-field model to errors in distance and angle.

IV Problem Formulation

Due to the complicated variable coupling in the cascaded CEE when designing 𝑾𝑾\boldsymbol{W}bold_italic_W and 𝚽𝚽\mathbf{\Phi}bold_Φ based on the estimated channel, we treat the interference caused by the model mismatch error and CEE as noise.333This represents a worst case assumption. Then, the achievable SE between the transceivers is given by [37]

(𝑾,𝚽)=log2det(𝐈NRx+𝑯^𝑾𝑾H𝑯^H𝚺𝒏^1).𝑾𝚽subscript2subscript𝐈subscript𝑁Rx^𝑯𝑾superscript𝑾𝐻superscript^𝑯𝐻superscriptsubscript𝚺^𝒏1{\cal R}(\boldsymbol{W},\boldsymbol{\Phi})=\log_{2}\det(\boldsymbol{\rm I}_{N_% {\rm Rx}}+{\hat{\boldsymbol{H}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\hat{% \boldsymbol{H}}^{H}}{\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}^{-1}}).caligraphic_R ( bold_italic_W , bold_Φ ) = roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_det ( bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT end_POSTSUBSCRIPT + over^ start_ARG bold_italic_H end_ARG bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) . (35)

Then, the joint active and passive beamforming design to maximize the achievable SE, under the transmit power constraint at the Tx and the constant modulus constraint for the phase control variable at the RIS, is formulated as the following optimization problem:

max𝑾,𝚽(𝑾,𝚽)s.t.𝑾F2PTx,|ϕnR|=1,nR=1,,NR,subscript𝑾𝚽𝑾𝚽s.t.subscriptsuperscriptnorm𝑾2𝐹subscript𝑃Txmissing-subexpressionformulae-sequencesubscriptbold-italic-ϕsubscript𝑛R1for-allsubscript𝑛R1subscript𝑁R\begin{array}[]{ll}\max\limits_{\boldsymbol{W},\boldsymbol{\Phi}}&{\cal R}(% \boldsymbol{W},\boldsymbol{\Phi})\\ \text{s.t.}&\|\boldsymbol{W}\|^{2}_{F}\leq P_{\rm Tx},\\ &|\boldsymbol{\phi}_{n_{\rm R}}|=1,~{}\forall{n_{\rm R}}=1,\cdots,N_{\rm R},\\ \end{array}start_ARRAY start_ROW start_CELL roman_max start_POSTSUBSCRIPT bold_italic_W , bold_Φ end_POSTSUBSCRIPT end_CELL start_CELL caligraphic_R ( bold_italic_W , bold_Φ ) end_CELL end_ROW start_ROW start_CELL s.t. end_CELL start_CELL ∥ bold_italic_W ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ≤ italic_P start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL | bold_italic_ϕ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT | = 1 , ∀ italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT = 1 , ⋯ , italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT , end_CELL end_ROW end_ARRAY (36)

where ϕ=diag(𝚽)bold-italic-ϕdiag𝚽\boldsymbol{\phi}=\text{diag}({\boldsymbol{\Phi}})bold_italic_ϕ = diag ( bold_Φ ). The constraint 𝑾F2PTxsubscriptsuperscriptnorm𝑾2𝐹subscript𝑃Tx\|\boldsymbol{W}\|^{2}_{F}\leq P_{\rm Tx}∥ bold_italic_W ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ≤ italic_P start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT is introduced to prevent excessive power consumption and the constraint |ϕnR|=1subscriptbold-italic-ϕsubscript𝑛R1|\boldsymbol{\phi}_{n_{\rm R}}|=1| bold_italic_ϕ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT | = 1 is imposed on each diagonal entry of the phase control matrix 𝚽𝚽\boldsymbol{\Phi}bold_Φ to maintain a constant modulus, which implies that the reflection coefficients applied by the RIS elements remain on the unit circle thereby simplifying the hardware implementation of the RIS.

Solving the SE maximization problem is generally challenging, particularly in the presence of CSI imperfections. The difficulty arises from the non-convex nature of the objective function (𝑾,𝚽)𝑾𝚽{\cal R}(\boldsymbol{W},\boldsymbol{\Phi})caligraphic_R ( bold_italic_W , bold_Φ ), the non-convex constant modulus constraint |ϕnR|=1subscriptbold-italic-ϕsubscript𝑛R1|\boldsymbol{\phi}_{n_{\rm R}}|=1| bold_italic_ϕ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT | = 1, and even the intricate variable coupling between 𝑾𝑾\boldsymbol{W}bold_italic_W and 𝚽𝚽\boldsymbol{\Phi}bold_Φ as evident in the covariance matrix of the interference-plus-noise signal 𝚺𝒏^subscript𝚺^𝒏\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT in (34). Consequently, the optimization problem in (36) is intractable and poses significant challenges for joint active and passive beamforming design.

By introducing two auxiliary variables 𝒁NRx×Ns𝒁superscriptsubscript𝑁Rxsubscript𝑁s\boldsymbol{Z}\in\mathbb{C}^{N_{\rm Rx}\times N_{\rm s}}bold_italic_Z ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT and 𝛀Ns×Ns𝟎𝛀superscriptsubscript𝑁ssubscript𝑁ssucceeds-or-equals0\boldsymbol{\Omega}\in\mathbb{C}^{N_{\rm s}\times N_{\rm s}}\succeq\boldsymbol% {0}bold_Ω ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⪰ bold_0, the SE maximization problem in (36) is equivalently transformed to an MSE minimization problem as follows [38]:

(P)min𝑾,𝒁,𝚽,𝛀Psubscript𝑾𝒁𝚽𝛀\displaystyle\left(\mathrm{P}\right)\min\limits_{\boldsymbol{W},\boldsymbol{Z}% ,\boldsymbol{\Phi},\boldsymbol{\Omega}}( roman_P ) roman_min start_POSTSUBSCRIPT bold_italic_W , bold_italic_Z , bold_Φ , bold_Ω end_POSTSUBSCRIPT tr(𝛀𝑱(𝒁,𝑾))logdet(𝛀)Nstr𝛀𝑱𝒁𝑾𝛀subscript𝑁s\displaystyle\text{tr}(\boldsymbol{\Omega}\boldsymbol{J}(\boldsymbol{Z,W}))-% \log\det(\boldsymbol{\Omega})-N_{\rm s}tr ( bold_Ω bold_italic_J ( bold_italic_Z bold_, bold_italic_W ) ) - roman_log roman_det ( bold_Ω ) - italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT (37)
s.t. 𝑾F2PTx,subscriptsuperscriptnorm𝑾2𝐹subscript𝑃Tx\displaystyle\|\boldsymbol{W}\|^{2}_{F}\leq P_{\rm Tx},∥ bold_italic_W ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ≤ italic_P start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ,
|ϕnR|=1,nR=1,,NR,formulae-sequencesubscriptbold-italic-ϕsubscript𝑛R1for-allsubscript𝑛R1subscript𝑁R\displaystyle|\boldsymbol{\phi}_{n_{\rm R}}|=1,~{}\forall{n_{\rm R}}=1,\cdots,% N_{\rm R},| bold_italic_ϕ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT | = 1 , ∀ italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT = 1 , ⋯ , italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT ,

where

𝑱(𝒁,𝑾)𝑱𝒁𝑾\displaystyle\boldsymbol{J}(\boldsymbol{Z,W})bold_italic_J ( bold_italic_Z bold_, bold_italic_W ) =𝔼{(𝒁H𝒚𝒔)(𝒁H𝒚𝒔)H}absent𝔼superscript𝒁𝐻𝒚𝒔superscriptsuperscript𝒁𝐻𝒚𝒔𝐻\displaystyle=\mathbb{E}\left\{(\boldsymbol{Z}^{H}\boldsymbol{y}-\boldsymbol{s% })(\boldsymbol{Z}^{H}\boldsymbol{y}-\boldsymbol{s})^{H}\right\}= blackboard_E { ( bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_y - bold_italic_s ) ( bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_y - bold_italic_s ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT }
=𝒁H(𝑯^𝑾𝑾H𝑯^H+𝚺𝒏^)𝒁𝒁H𝑯^𝑾𝑾H𝑯^H𝒁+𝐈Ns,absentsuperscript𝒁𝐻^𝑯𝑾superscript𝑾𝐻superscript^𝑯𝐻subscript𝚺^𝒏𝒁superscript𝒁𝐻^𝑯𝑾superscript𝑾𝐻superscript^𝑯𝐻𝒁subscript𝐈subscript𝑁s\displaystyle=\boldsymbol{Z}^{H}(\hat{\boldsymbol{H}}\boldsymbol{W}\boldsymbol% {W}^{H}\hat{\boldsymbol{H}}^{H}+\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}})% \boldsymbol{Z}-\boldsymbol{Z}^{H}\hat{\boldsymbol{H}}\boldsymbol{W}-% \boldsymbol{W}^{H}\hat{\boldsymbol{H}}^{H}\boldsymbol{Z}+\boldsymbol{\rm I}_{N% _{\rm s}},= bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( over^ start_ARG bold_italic_H end_ARG bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT + bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT ) bold_italic_Z - bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG bold_italic_W - bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_Z + bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT , (38)

is the MSE matrix function. The proof of the equivalence between the SE maximization problem in (36) and the MSE minimization problem in (37) follows a similar approach as [37, 38]. In contrast to the SE maximization problem in (36), the MSE minimization problem in (37) is convex regarding three variables, i.e., 𝒁𝒁\boldsymbol{Z}bold_italic_Z, 𝛀𝛀\boldsymbol{\Omega}bold_Ω, and 𝑾𝑾\boldsymbol{W}bold_italic_W, when 𝚽𝚽\boldsymbol{\Phi}bold_Φ is given. This observation paves the way for optimizing these four variables alternately via the BCD framework, as elaborated in the following section.

V Proposed Solution

In the following, we introduce an iterative BCD approach to acquire an effective solution to (37). The proposed approach divides (37) into three subproblems that address different variables: 1) Optimize 𝒁𝒁\boldsymbol{Z}bold_italic_Z and 𝛀𝛀\boldsymbol{\Omega}bold_Ω given 𝑾𝑾\boldsymbol{W}bold_italic_W and 𝚽𝚽\boldsymbol{\Phi}bold_Φ; 2) Optimize 𝑾𝑾\boldsymbol{W}bold_italic_W given 𝒁𝒁\boldsymbol{Z}bold_italic_Z, 𝛀𝛀\boldsymbol{\Omega}bold_Ω and 𝚽𝚽\boldsymbol{\Phi}bold_Φ; 3) Optimize 𝚽𝚽\boldsymbol{\Phi}bold_Φ given 𝒁𝒁\boldsymbol{Z}bold_italic_Z, 𝛀𝛀\boldsymbol{\Omega}bold_Ω and 𝑾𝑾\boldsymbol{W}bold_italic_W. The following are the optimization steps for each of these three sub-problems.

V-A Update the Auxiliary Variables Matrices 𝐙𝐙\boldsymbol{Z}bold_italic_Z and 𝛀𝛀\boldsymbol{\Omega}bold_Ω

It is worth noting that when 𝑾𝑾\boldsymbol{W}bold_italic_W and 𝚽𝚽\boldsymbol{\Phi}bold_Φ are given at each iteration, the optimal auxiliary variables 𝒁𝒁\boldsymbol{Z}bold_italic_Z and 𝛀𝛀\boldsymbol{\Omega}bold_Ω to minimize the objective function in (37) are respectively given by

𝒁𝒁\displaystyle{\boldsymbol{Z}}bold_italic_Z =(𝑯^𝑾𝑾H𝑯^H+𝚺𝒏^)1𝑯^𝑾 andabsentsuperscript^𝑯𝑾superscript𝑾𝐻superscript^𝑯𝐻subscript𝚺^𝒏1^𝑯𝑾 and\displaystyle=(\hat{\boldsymbol{H}}\boldsymbol{W}\boldsymbol{W}^{H}\hat{% \boldsymbol{H}}^{H}+\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}})^{-1}\hat{% \boldsymbol{H}}\boldsymbol{W}\text{ and}= ( over^ start_ARG bold_italic_H end_ARG bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT + bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG bold_italic_W and (39)
𝛀𝛀\displaystyle{\boldsymbol{\Omega}}bold_Ω =(𝑱(𝒁,𝑾))1.absentsuperscript𝑱𝒁𝑾1\displaystyle=(\boldsymbol{J}({\boldsymbol{Z}},\boldsymbol{W}))^{-1}.= ( bold_italic_J ( bold_italic_Z , bold_italic_W ) ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (40)

V-B Update the Active Beamforming Matrix 𝐖𝐖\boldsymbol{W}bold_italic_W

For given 𝒁𝒁\boldsymbol{Z}bold_italic_Z, 𝛀𝛀\boldsymbol{\Omega}bold_Ω, and 𝚽𝚽\boldsymbol{\Phi}bold_Φ, the precoding matrix 𝑾𝑾\boldsymbol{W}bold_italic_W can be updated by solving the following problem:

min𝑾subscript𝑾\displaystyle\min\limits_{\boldsymbol{W}}roman_min start_POSTSUBSCRIPT bold_italic_W end_POSTSUBSCRIPT tr(𝛀(𝐈Ns𝒁H𝑯^𝑾)(𝐈Ns𝒁H𝑯^𝑾)H)+tr(𝛀𝒁H𝚺𝒏^𝒁)tr𝛀subscript𝐈subscript𝑁ssuperscript𝒁𝐻^𝑯𝑾superscriptsubscript𝐈subscript𝑁ssuperscript𝒁𝐻^𝑯𝑾𝐻tr𝛀superscript𝒁𝐻subscript𝚺^𝒏𝒁\displaystyle~{}\text{tr}(\boldsymbol{\Omega}(\boldsymbol{\rm I}_{N_{\rm s}}-% \boldsymbol{Z}^{H}{\hat{\boldsymbol{H}}}\boldsymbol{W})(\boldsymbol{\rm I}_{N_% {\rm s}}-\boldsymbol{Z}^{H}{\hat{\boldsymbol{H}}}\boldsymbol{W})^{H})+\text{tr% }(\boldsymbol{\Omega}\boldsymbol{Z}^{H}\boldsymbol{\Sigma}_{\hat{\boldsymbol{n% }}}\boldsymbol{Z})tr ( bold_Ω ( bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG bold_italic_W ) ( bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG bold_italic_W ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) + tr ( bold_Ω bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT bold_italic_Z ) (41)
s.t. 𝑾F2PTx,subscriptsuperscriptnorm𝑾2𝐹subscript𝑃Tx\displaystyle~{}\|\boldsymbol{W}\|^{2}_{F}\leq P_{\rm Tx},∥ bold_italic_W ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ≤ italic_P start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ,

which is a convex optimization problem. Adopting the Lagrangian multiplier approach [39], the optimal active beamformer at the Tx is given by

𝑾𝑾\displaystyle\boldsymbol{W}bold_italic_W =[𝑯^H𝒁𝛀𝒁H𝑯^+σ𝑹2tr(𝛀𝒁H𝒁)𝑮^H𝑮^\displaystyle=[{\hat{\boldsymbol{H}}}^{H}\boldsymbol{Z\Omega}\boldsymbol{Z}^{H% }{\hat{\boldsymbol{H}}}+\sigma_{\boldsymbol{R}}^{2}\text{tr}(\boldsymbol{% \Omega}\boldsymbol{Z}^{H}\boldsymbol{Z}){\hat{\boldsymbol{G}}}^{H}{\hat{% \boldsymbol{G}}}= [ over^ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_Z bold_Ω bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG + italic_σ start_POSTSUBSCRIPT bold_italic_R end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT tr ( bold_Ω bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_Z ) over^ start_ARG bold_italic_G end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_G end_ARG
+σ𝑮2tr(𝛀𝒁H𝑹^𝑹^H𝒁)𝐈NTx+NRσ𝑮2σ𝑹2tr(𝛀𝒁H𝒁)𝐈NTx+η𝐈NTx]1𝑯^H𝒁𝛀,\displaystyle+\sigma_{{\boldsymbol{G}}}^{2}\text{tr}(\boldsymbol{\Omega}% \boldsymbol{Z}^{H}\hat{\boldsymbol{R}}{\hat{\boldsymbol{R}}}^{H}\boldsymbol{Z}% )\boldsymbol{\rm I}_{N_{\rm Tx}}+N_{\rm R}\sigma_{{\boldsymbol{G}}}^{2}\sigma_% {{\boldsymbol{R}}}^{2}\text{tr}(\boldsymbol{\Omega}\boldsymbol{Z}^{H}% \boldsymbol{Z})\boldsymbol{\rm I}_{N_{\rm Tx}}+\eta\boldsymbol{\rm I}_{N_{\rm Tx% }}]^{-1}{\hat{\boldsymbol{H}}}^{H}\boldsymbol{Z\Omega},+ italic_σ start_POSTSUBSCRIPT bold_italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT tr ( bold_Ω bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_R end_ARG over^ start_ARG bold_italic_R end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_Z ) bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT italic_σ start_POSTSUBSCRIPT bold_italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT bold_italic_R end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT tr ( bold_Ω bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_Z ) bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_η bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_Z bold_Ω , (42)

where the optimal Lagrangian multiplier η0𝜂0\eta\geq 0italic_η ≥ 0 can be found by the proposed algorithm in [10]. Note that to maximize the achievable SE, we have 𝑾F2=PTxsubscriptsuperscriptnorm𝑾2𝐹subscript𝑃Tx\|\boldsymbol{W}\|^{2}_{F}=P_{\rm Tx}∥ bold_italic_W ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT at the optimum [35].

V-C Update the Passive Beamforming Matrix 𝚽𝚽\boldsymbol{\Phi}bold_Φ

For given 𝒁𝒁\boldsymbol{Z}bold_italic_Z, 𝛀𝛀\boldsymbol{\Omega}bold_Ω, and 𝑾𝑾\boldsymbol{W}bold_italic_W, the passive beamforming design problem is formulated as

min𝚽f(𝚽)=tr(𝛀(𝐈Ns𝒁H𝑯^𝑾)(𝐈Ns𝒁H𝑯^𝑾)H)+tr(𝛀𝒁H𝚺𝒏^𝒁)s.t.|ϕnR|=1,nR=1,,NR.subscript𝚽𝑓𝚽tr𝛀subscript𝐈subscript𝑁ssuperscript𝒁𝐻^𝑯𝑾superscriptsubscript𝐈subscript𝑁ssuperscript𝒁𝐻^𝑯𝑾𝐻tr𝛀superscript𝒁𝐻subscript𝚺^𝒏𝒁s.t.formulae-sequencesubscriptbold-italic-ϕsubscript𝑛R1for-allsubscript𝑛R1subscript𝑁R\displaystyle\begin{aligned} \min\limits_{\boldsymbol{\Phi}}\ &f(\boldsymbol{% \Phi})=\text{tr}(\boldsymbol{\Omega}(\boldsymbol{\rm I}_{N_{\rm s}}-% \boldsymbol{Z}^{H}{\hat{\boldsymbol{H}}}\boldsymbol{W})(\boldsymbol{\rm I}_{N_% {\rm s}}-\boldsymbol{Z}^{H}{\hat{\boldsymbol{H}}}\boldsymbol{W})^{H})+\ \text{% tr}(\boldsymbol{\Omega}\boldsymbol{Z}^{H}\boldsymbol{\Sigma}_{\hat{\boldsymbol% {n}}}\boldsymbol{Z})\\ \text{s.t.}\ &|\boldsymbol{\phi}_{n_{\rm R}}|=1,~{}\forall{n_{\rm R}}=1,\ldots% ,N_{\rm R}.\\ \end{aligned}start_ROW start_CELL roman_min start_POSTSUBSCRIPT bold_Φ end_POSTSUBSCRIPT end_CELL start_CELL italic_f ( bold_Φ ) = tr ( bold_Ω ( bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG bold_italic_W ) ( bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT - bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_H end_ARG bold_italic_W ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) + tr ( bold_Ω bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT bold_italic_Z ) end_CELL end_ROW start_ROW start_CELL s.t. end_CELL start_CELL | bold_italic_ϕ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT | = 1 , ∀ italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT = 1 , … , italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT . end_CELL end_ROW (43)

Substituting 𝑯^^𝑯\hat{\boldsymbol{H}}over^ start_ARG bold_italic_H end_ARG into (43) and ignoring items that are not related to 𝚽𝚽\boldsymbol{\Phi}bold_Φ, the objective function in (43) can be simplified as

f(𝚽)𝑓𝚽\displaystyle f(\boldsymbol{\Phi})italic_f ( bold_Φ ) =tr(𝚽H𝑹^H𝒁𝛀𝒁H𝑹^𝚽𝑮^𝑾𝑾H𝑮^H)tr(𝑹^H𝒁𝛀𝑾H𝑮^H𝚽H)tr(𝑮^𝑾𝛀𝒁H𝑹^𝚽).absenttrsuperscript𝚽𝐻superscript^𝑹𝐻𝒁𝛀superscript𝒁𝐻^𝑹𝚽^𝑮𝑾superscript𝑾𝐻superscript^𝑮𝐻trsuperscript^𝑹𝐻𝒁𝛀superscript𝑾𝐻superscript^𝑮𝐻superscript𝚽𝐻tr^𝑮𝑾𝛀superscript𝒁𝐻^𝑹𝚽\displaystyle=\text{tr}({\boldsymbol{\Phi}}^{H}{\hat{\boldsymbol{R}}}^{H}{% \boldsymbol{Z\Omega}}\boldsymbol{Z}^{H}{\hat{\boldsymbol{R}}}{\boldsymbol{\Phi% }}{\hat{\boldsymbol{G}}}\boldsymbol{W}\boldsymbol{W}^{H}{\hat{\boldsymbol{G}}}% ^{H})-\text{tr}({\hat{\boldsymbol{R}}}^{H}{\boldsymbol{Z\Omega}}\boldsymbol{W}% ^{H}{\hat{\boldsymbol{G}}}^{H}{\boldsymbol{\Phi}}^{H})-\text{tr}({\hat{% \boldsymbol{G}}}\boldsymbol{W}{\boldsymbol{\Omega}}\boldsymbol{Z}^{H}{\hat{% \boldsymbol{R}}}{\boldsymbol{\Phi}}).= tr ( bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_R end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_Z bold_Ω bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_R end_ARG bold_Φ over^ start_ARG bold_italic_G end_ARG bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_G end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) - tr ( over^ start_ARG bold_italic_R end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_Z bold_Ω bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_G end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Φ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) - tr ( over^ start_ARG bold_italic_G end_ARG bold_italic_W bold_Ω bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_R end_ARG bold_Φ ) . (44)

According to [36, Lemma 10.6], we can further simplify the optimization problem in (43) to a standard quadratic programming (QP) problem, i.e.,

min𝚽f(ϕ)=ϕH𝑨ϕ2{𝒅Tϕ}s.t.|ϕnR|=1,nR=1,,NR,subscript𝚽𝑓bold-italic-ϕsuperscriptbold-italic-ϕ𝐻𝑨bold-italic-ϕ2superscript𝒅𝑇bold-italic-ϕs.t.formulae-sequencesubscriptbold-italic-ϕsubscript𝑛R1for-allsubscript𝑛R1subscript𝑁R\displaystyle\begin{array}[]{ll}\min\limits_{\boldsymbol{\Phi}}&~{}f(% \boldsymbol{\phi})={\boldsymbol{\phi}}^{H}\boldsymbol{A}{\boldsymbol{\phi}}-2% \Re\{\boldsymbol{d}^{T}{\boldsymbol{\phi}}\}\\ \text{s.t.}&|\boldsymbol{\phi}_{n_{\rm R}}|=1,~{}\forall{n_{\rm R}}=1,\ldots,N% _{\rm R},\end{array}start_ARRAY start_ROW start_CELL roman_min start_POSTSUBSCRIPT bold_Φ end_POSTSUBSCRIPT end_CELL start_CELL italic_f ( bold_italic_ϕ ) = bold_italic_ϕ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_A bold_italic_ϕ - 2 roman_ℜ { bold_italic_d start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_ϕ } end_CELL end_ROW start_ROW start_CELL s.t. end_CELL start_CELL | bold_italic_ϕ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT | = 1 , ∀ italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT = 1 , … , italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT , end_CELL end_ROW end_ARRAY (47)

where

𝑨𝑨\displaystyle\boldsymbol{A}bold_italic_A =(𝑹^H𝒁𝛀𝒁H𝑹^)(𝑮^𝑾𝑾H𝑮^H)T,absentsuperscript^𝑹𝐻𝒁𝛀superscript𝒁𝐻^𝑹superscript^𝑮𝑾superscript𝑾𝐻superscript^𝑮𝐻𝑇\displaystyle=({\hat{\boldsymbol{R}}}^{H}{\boldsymbol{Z\Omega}}\boldsymbol{Z}^% {H}{\hat{\boldsymbol{R}}})\circ({\hat{\boldsymbol{G}}}\boldsymbol{W}% \boldsymbol{W}^{H}{\hat{\boldsymbol{G}}}^{H})^{T},= ( over^ start_ARG bold_italic_R end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_Z bold_Ω bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_R end_ARG ) ∘ ( over^ start_ARG bold_italic_G end_ARG bold_italic_W bold_italic_W start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_G end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (48)
𝒅𝒅\displaystyle\boldsymbol{d}bold_italic_d =(𝑫1,1,,𝑫NR,NR)T, and 𝑫=𝑮^𝑾𝛀𝒁H𝑹^.formulae-sequenceabsentsuperscriptsubscript𝑫11subscript𝑫subscript𝑁Rsubscript𝑁R𝑇 and 𝑫^𝑮𝑾𝛀superscript𝒁𝐻^𝑹\displaystyle=(\boldsymbol{D}_{1,1},\dots,\boldsymbol{D}_{N_{\rm R},N_{\rm R}}% )^{T},\text{ and }\boldsymbol{D}={\hat{\boldsymbol{G}}}\boldsymbol{W}{% \boldsymbol{\Omega}}\boldsymbol{Z}^{H}{\hat{\boldsymbol{R}}}.= ( bold_italic_D start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT , … , bold_italic_D start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT , italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , and bold_italic_D = over^ start_ARG bold_italic_G end_ARG bold_italic_W bold_Ω bold_italic_Z start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT over^ start_ARG bold_italic_R end_ARG . (49)

We notice that the constant modulus constraint in equation (47) is generally non-convex and NP-hard, posing a challenge for solving the quadratic optimization problem. In contrast to the traditional alternating direction method of multipliers (ADMM) framework, which employs a fixed penalty factor, our approach is inspired by the method of multipliers and the penalty alternating direction methods discussed in [40], [41] to introduce an alternating direction penalty method (ADPM) algorithm. This algorithm gradually increases the penalty factor during iterations to drive the penalty term toward zero, thereby facilitating the design of the phase of the RIS. More specifically, by introducing an auxiliary variable ϕ0=[ejϑ1,,ejϑNR]NR×1subscriptbold-italic-ϕ0superscript𝑒𝑗subscriptitalic-ϑ1superscript𝑒𝑗subscriptitalic-ϑsubscript𝑁Rsuperscriptsubscript𝑁R1\boldsymbol{\phi}_{0}=\left[e^{j\vartheta_{1}},\ldots,e^{j\vartheta_{N_{\rm R}% }}\right]\in\mathbb{C}^{N_{\rm R}\times 1}bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = [ italic_e start_POSTSUPERSCRIPT italic_j italic_ϑ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , … , italic_e start_POSTSUPERSCRIPT italic_j italic_ϑ start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT, we equivalently recast the problem in (47) as

minϕ,ϕ0subscriptbold-italic-ϕsubscriptbold-italic-ϕ0\displaystyle\min\limits_{\boldsymbol{\phi},\boldsymbol{\phi}_{0}}roman_min start_POSTSUBSCRIPT bold_italic_ϕ , bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ϕH𝑨ϕ2{𝒅Tϕ}superscriptbold-italic-ϕ𝐻𝑨bold-italic-ϕ2superscript𝒅𝑇bold-italic-ϕ\displaystyle~{}{\boldsymbol{\phi}}^{H}\boldsymbol{A}{\boldsymbol{\phi}}-2\Re% \{\boldsymbol{d}^{T}{\boldsymbol{\phi}}\}bold_italic_ϕ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_A bold_italic_ϕ - 2 roman_ℜ { bold_italic_d start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_ϕ } (50)
s.t. ϕ=ϕ0,bold-italic-ϕsubscriptbold-italic-ϕ0\displaystyle\boldsymbol{\phi}=\boldsymbol{\phi}_{0},bold_italic_ϕ = bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ,
|ϕ0nR|=1,ϑnR[0,2π],nR=1,,NR.formulae-sequencesubscriptsubscriptbold-italic-ϕ0subscript𝑛R1formulae-sequencesubscriptitalic-ϑsubscript𝑛R02𝜋for-allsubscript𝑛R1subscript𝑁R\displaystyle|{\boldsymbol{\phi}_{0}}_{n_{\rm R}}|=1,~{}\vartheta_{n_{\rm R}}% \in[0,2\pi],~{}\forall{n_{\rm R}}=1,\ldots,N_{\rm R}.| bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT | = 1 , italic_ϑ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ [ 0 , 2 italic_π ] , ∀ italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT = 1 , … , italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT .

The augmented Lagrangian function of (50) is given by

=f(ϕ)+{𝓾H(ϕϕ0)}+ρ2ϕϕ022,𝑓bold-italic-ϕsuperscript𝓾𝐻bold-italic-ϕsubscriptbold-italic-ϕ0𝜌2superscriptsubscriptnormbold-italic-ϕsubscriptbold-italic-ϕ022\displaystyle{\cal L}={\it f}(\boldsymbol{\phi})+\Re\{{\cal\boldsymbol{u}}^{% \it H}(\boldsymbol{\phi}-\boldsymbol{\phi}_{0})\}+\frac{\rho}{2}||\boldsymbol{% \phi}-\boldsymbol{\phi}_{0}||_{2}^{2},caligraphic_L = italic_f ( bold_italic_ϕ ) + roman_ℜ { bold_caligraphic_u start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( bold_italic_ϕ - bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) } + divide start_ARG italic_ρ end_ARG start_ARG 2 end_ARG | | bold_italic_ϕ - bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (51)

where f(ϕ)=ϕH𝑨ϕ2{𝒅Tϕ}𝑓bold-italic-ϕsuperscriptbold-italic-ϕ𝐻𝑨bold-italic-ϕ2superscript𝒅𝑇bold-italic-ϕ{\it f}(\boldsymbol{\phi})={\boldsymbol{\phi}}^{H}\boldsymbol{A}{\boldsymbol{% \phi}}-2\Re\{\boldsymbol{d}^{T}{\boldsymbol{\phi}}\}italic_f ( bold_italic_ϕ ) = bold_italic_ϕ start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_A bold_italic_ϕ - 2 roman_ℜ { bold_italic_d start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_ϕ }, while 𝓾NR×1𝓾superscriptsubscript𝑁R1{\cal{\boldsymbol{u}}}\in\mathbb{C}^{N_{\rm R}\times 1}bold_caligraphic_u ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT and ρ>0𝜌0\rho>0italic_ρ > 0 are the multiplier vector and the penalty factor, respectively.

In the following, we illustrate how to update ϕ0subscriptbold-italic-ϕ0\boldsymbol{\phi}_{0}bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and ϕbold-italic-ϕ\boldsymbol{\phi}bold_italic_ϕ, and then discuss the selection of ρ(0)superscript𝜌0\rho^{(0)}italic_ρ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT.

V-C1 Update ϕ0subscriptbold-italic-ϕ0\boldsymbol{\phi}_{0}bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT

When we consider the update of ϕ0subscriptbold-italic-ϕ0\boldsymbol{\phi}_{0}bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT given ϕ(t1),𝓾(t1),ρ(t1)superscriptbold-italic-ϕ𝑡1superscript𝓾𝑡1superscript𝜌𝑡1\boldsymbol{\phi}^{(t-1)},{\cal\boldsymbol{u}}^{(t-1)},\rho^{(t-1)}bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT , bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT , italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT in the t𝑡titalic_t-th iteration of ADPM, we omit the constant terms in {\cal L}caligraphic_L that are irrelevant to ϕ0subscriptbold-italic-ϕ0\boldsymbol{\phi}_{0}bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and the optimization problem is given by

minϕ0{(𝓾(t1)ρ(t1)ϕ(t1))Hϕ0}s.t.|ϕ0|=1,ϑnR[0,2π],nR=1,,NR.subscriptsubscriptbold-italic-ϕ0superscriptsuperscript𝓾𝑡1superscript𝜌𝑡1superscriptbold-italic-ϕ𝑡1𝐻subscriptbold-italic-ϕ0s.t.formulae-sequencesubscriptbold-italic-ϕ01formulae-sequencesubscriptitalic-ϑsubscript𝑛R02𝜋subscript𝑛R1subscript𝑁R\begin{array}[]{ll}\min\limits_{\boldsymbol{\phi}_{0}}&\Re\{(-{\cal\boldsymbol% {u}}^{(t-1)}-\rho^{(t-1)}\boldsymbol{\phi}^{(t-1)})^{H}\boldsymbol{\phi}_{0}\}% \\ \text{s.t.}&|\boldsymbol{\phi}_{0}|=1,~{}\vartheta_{n_{\rm R}}\in[0,2\pi],~{}n% _{\rm R}=1,\ldots,N_{\rm R}.\end{array}start_ARRAY start_ROW start_CELL roman_min start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_CELL start_CELL roman_ℜ { ( - bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT - italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_CELL end_ROW start_ROW start_CELL s.t. end_CELL start_CELL | bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | = 1 , italic_ϑ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ [ 0 , 2 italic_π ] , italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT = 1 , … , italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT . end_CELL end_ROW end_ARRAY (52)

The optimal solution of problem (52) is given as

ϑnR={𝜸nR(t1)},subscriptitalic-ϑsubscript𝑛Rsubscriptsuperscript𝜸𝑡1subscript𝑛R\vartheta_{n_{\rm R}}=\angle\{\boldsymbol{\gamma}^{(t-1)}_{n_{\rm R}}\},italic_ϑ start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT = ∠ { bold_italic_γ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT } , (53)

where 𝜸(t1)=𝓾(t1)+ρ(t1)ϕ(t1)NR×1superscript𝜸𝑡1superscript𝓾𝑡1superscript𝜌𝑡1superscriptbold-italic-ϕ𝑡1superscriptsubscript𝑁R1\boldsymbol{\gamma}^{(t-1)}={\cal\boldsymbol{u}}^{(t-1)}+\rho^{(t-1)}% \boldsymbol{\phi}^{(t-1)}\in\mathbb{C}^{N_{\rm R}\times 1}bold_italic_γ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT = bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT + italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT.

V-C2 Update ϕbold-italic-ϕ\boldsymbol{\phi}bold_italic_ϕ

When we consider the update of ϕbold-italic-ϕ\boldsymbol{\phi}bold_italic_ϕ given ϕ0(t),𝓾(t1),ρ(t1)superscriptsubscriptbold-italic-ϕ0𝑡superscript𝓾𝑡1superscript𝜌𝑡1\boldsymbol{\phi}_{0}^{(t)},{\cal\boldsymbol{u}}^{(t-1)},\rho^{(t-1)}bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT , bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT , italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT, the minimization problem is given by

minϕf(ϕ)+{(𝓾(t1)ρ(t1)ϕ0(t))Hϕ}.subscriptbold-italic-ϕ𝑓bold-italic-ϕsuperscriptsuperscript𝓾𝑡1superscript𝜌𝑡1superscriptsubscriptbold-italic-ϕ0𝑡𝐻bold-italic-ϕ\min\limits_{\boldsymbol{\phi}}f(\boldsymbol{\phi})+\Re\{({\cal\boldsymbol{u}}% ^{(t-1)}-\rho^{(t-1)}\boldsymbol{\phi}_{0}^{(t)})^{H}\boldsymbol{\phi}\}.roman_min start_POSTSUBSCRIPT bold_italic_ϕ end_POSTSUBSCRIPT italic_f ( bold_italic_ϕ ) + roman_ℜ { ( bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT - italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_italic_ϕ } . (54)

We can obtain the closed-form optimal solution to the problem in (54) as

ϕ(t)=(2𝑨+ρ(t1)𝐈)1(ρ(t1)ϕ0(t)𝓾(t1)+2𝒅).superscriptbold-italic-ϕ𝑡superscript2𝑨superscript𝜌𝑡1𝐈1superscript𝜌𝑡1superscriptsubscriptbold-italic-ϕ0𝑡superscript𝓾𝑡12𝒅\boldsymbol{\phi}^{(t)}=(2{\boldsymbol{A}}+{\rho^{(t-1)}}\boldsymbol{\rm I})^{% -1}(\rho^{(t-1)}\boldsymbol{\phi}_{0}^{(t)}-{\cal\boldsymbol{u}}^{(t-1)}+2% \boldsymbol{d}).bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ( 2 bold_italic_A + italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT bold_I ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT + 2 bold_italic_d ) . (55)
Algorithm 1 ADPM-based Algorithm for Handling (50)
1:  Initialize: 𝑨,𝒅,𝓾(0),ρ(0)>0,δ1,δ2,ϵ,κformulae-sequence𝑨𝒅superscript𝓾0superscript𝜌00subscript𝛿1subscript𝛿2italic-ϵ𝜅\boldsymbol{A},\boldsymbol{d},{\cal\boldsymbol{u}}^{(0)},\rho^{(0)}>0,\delta_{% 1},\delta_{2},\epsilon,\kappabold_italic_A , bold_italic_d , bold_caligraphic_u start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , italic_ρ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT > 0 , italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_ϵ , italic_κ, where 0<δ1<10subscript𝛿110<\delta_{1}<10 < italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < 1, δ2>1subscript𝛿21\delta_{2}>1italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 1 are close to 1.
2:  while Δe(t)=ϕ(t)ϕ0(t)>ϵΔsuperscript𝑒𝑡normsuperscriptbold-italic-ϕ𝑡superscriptsubscriptbold-italic-ϕ0𝑡italic-ϵ\Delta e^{(t)}=||\boldsymbol{\phi}^{(t)}-\boldsymbol{\phi}_{0}^{(t)}||>\epsilonroman_Δ italic_e start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = | | bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT | | > italic_ϵ do
3:     update: ϕ0(t)superscriptsubscriptbold-italic-ϕ0𝑡\boldsymbol{\phi}_{0}^{(t)}bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and ϕ(t)superscriptbold-italic-ϕ𝑡\boldsymbol{\phi}^{(t)}bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT
{(ϕ0(t))nR}={𝜸nR(t1)},subscriptsubscriptsuperscriptbold-italic-ϕ𝑡0subscript𝑛Rsubscriptsuperscript𝜸𝑡1subscript𝑛R\angle\{{(\boldsymbol{\phi}^{(t)}_{0})}_{n_{\rm R}}\}=\angle\{\boldsymbol{% \gamma}^{(t-1)}_{n_{\rm R}}\},∠ { ( bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT } = ∠ { bold_italic_γ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT } ,
ϕ(t)=(2𝑨+ρ(t1)𝐈NR)1(ρ(t1)ϕ0(t)𝓾(t1)+2𝒅),superscriptbold-italic-ϕ𝑡superscript2𝑨superscript𝜌𝑡1subscript𝐈subscript𝑁R1superscript𝜌𝑡1superscriptsubscriptbold-italic-ϕ0𝑡superscript𝓾𝑡12𝒅\boldsymbol{\phi}^{(t)}=(2{\boldsymbol{A}}+{\rho^{(t-1)}}\boldsymbol{\rm I}_{N% _{\rm R}})^{-1}(\rho^{(t-1)}\boldsymbol{\phi}_{0}^{(t)}-{\cal\boldsymbol{u}}^{% (t-1)}+2\boldsymbol{d}),bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = ( 2 bold_italic_A + italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT + 2 bold_italic_d ) ,
4:     update: 𝓾(t)superscript𝓾𝑡{\cal\boldsymbol{u}}^{(t)}bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and ρ(t)superscript𝜌𝑡\rho^{(t)}italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT
ρ(t)={ρ(t1),Δe(t)δ1Δe(t1),δ2ρ(t1),else.superscript𝜌𝑡casessuperscript𝜌𝑡1Δsuperscript𝑒𝑡subscript𝛿1Δsuperscript𝑒𝑡1subscript𝛿2superscript𝜌𝑡1else\displaystyle\begin{split}\rho^{(t)}=\left\{\begin{array}[]{ll}\rho^{(t-1)},&% \Delta e^{(t)}\leq\delta_{1}\Delta e^{(t-1)},\\ \delta_{2}\rho^{(t-1)},&\text{else}.\end{array}\right.\end{split}start_ROW start_CELL italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = { start_ARRAY start_ROW start_CELL italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT , end_CELL start_CELL roman_Δ italic_e start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ≤ italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT roman_Δ italic_e start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT , end_CELL end_ROW start_ROW start_CELL italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT , end_CELL start_CELL else . end_CELL end_ROW end_ARRAY end_CELL end_ROW
𝓾(t)={𝓾(t1)+ρ(t)(ϕ(t)ϕ0(t)),umax(t)κ,(𝓾(t1)+ρ(t)(ϕ(t)ϕ0(t)))/umax(t),else,superscript𝓾𝑡casessuperscript𝓾𝑡1superscript𝜌𝑡superscriptbold-italic-ϕ𝑡superscriptsubscriptbold-italic-ϕ0𝑡subscriptsuperscript𝑢𝑡𝜅missing-subexpressionsuperscript𝓾𝑡1superscript𝜌𝑡superscriptbold-italic-ϕ𝑡superscriptsubscriptbold-italic-ϕ0𝑡subscriptsuperscript𝑢𝑡elsemissing-subexpression\begin{split}{\cal\boldsymbol{u}}^{(t)}=\left\{\begin{array}[]{ll}{\cal% \boldsymbol{u}}^{(t-1)}+\rho^{(t)}(\boldsymbol{\phi}^{(t)}-\boldsymbol{\phi}_{% 0}^{(t)}),\ u^{(t)}_{\max}\leq\kappa,\\ ({\cal\boldsymbol{u}}^{(t-1)}+\rho^{(t)}(\boldsymbol{\phi}^{(t)}-\boldsymbol{% \phi}_{0}^{(t)}))/u^{(t)}_{\max},\ \text{else},\end{array}\right.\end{split}start_ROW start_CELL bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = { start_ARRAY start_ROW start_CELL bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT + italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) , italic_u start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ≤ italic_κ , end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ( bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT + italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) ) / italic_u start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT , else , end_CELL start_CELL end_CELL end_ROW end_ARRAY end_CELL end_ROW
where umax(t)subscriptsuperscript𝑢𝑡u^{(t)}_{\max}italic_u start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT is the value with the largest modulus in the multiplier vector 𝓾(t)superscript𝓾𝑡{\cal\boldsymbol{u}}^{(t)}bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT.
5:  end while
6:  Output: ϕsuperscriptbold-italic-ϕ\boldsymbol{\phi}^{\ast}bold_italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT

The ADPM-based algorithm for designing the passive beamforming at the RIS is outlined in Algorithm 1, detailing the update rules for ρ𝜌\rhoitalic_ρ and 𝓾𝓾\cal\boldsymbol{u}bold_caligraphic_u. If we substitute the update rules of ρ(t)superscript𝜌𝑡\rho^{(t)}italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT and 𝓾(t)superscript𝓾𝑡{\cal\boldsymbol{u}}^{(t)}bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT in Algorithm 1 with ρ(t)=ρ(t1)superscript𝜌𝑡superscript𝜌𝑡1\rho^{(t)}=\rho^{(t-1)}italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = italic_ρ start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT and 𝓾(t)=𝓾(t1)+ρ(t)(ϕ(t)ϕ0(t))superscript𝓾𝑡superscript𝓾𝑡1superscript𝜌𝑡superscriptbold-italic-ϕ𝑡superscriptsubscriptbold-italic-ϕ0𝑡{\cal\boldsymbol{u}}^{(t)}={\cal\boldsymbol{u}}^{(t-1)}+\rho^{(t)}(\boldsymbol% {\phi}^{(t)}-\boldsymbol{\phi}_{0}^{(t)})bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = bold_caligraphic_u start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT + italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ( bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT - bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ), Algorithm 1 degenerates to the classical ADMM framework. According to the theoretical analysis in [42], adapting the penalty factor ρ(t)superscript𝜌𝑡\rho^{(t)}italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT is crucial with the following principle: increasing it when the primal residual Δe(t)Δsuperscript𝑒𝑡\Delta e^{(t)}roman_Δ italic_e start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT fails to decrease with iterations, helps drive Δe(t)Δsuperscript𝑒𝑡\Delta e^{(t)}roman_Δ italic_e start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT towards zero to locate a feasible point. Otherwise, ρ(t)superscript𝜌𝑡\rho^{(t)}italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT remains unchanged. This strategy aims to enhance the likelihood of the ADPM algorithm discovering a feasible point compared to the ADMM [42], [43].

Moreover, achieving faster convergence can be facilitated by selecting an appropriate initial value for the penalty factor ρ(0)superscript𝜌0\rho^{(0)}italic_ρ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT. In this regard, we employ a method proposed in [40] to determine the initialized penalty factor for the ADPM. Specifically, when problem (50) does not involve any non-convex constraints, an effective initialized penalty factor for ADPM is obtained as [40]:

ρ(0)=λmin(𝑨)λmax(𝑨),superscript𝜌0subscript𝜆𝑨subscript𝜆𝑨\rho^{(0)}=\sqrt{\lambda_{\min}(\boldsymbol{A})\lambda_{\max}(\boldsymbol{A})},italic_ρ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = square-root start_ARG italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( bold_italic_A ) italic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( bold_italic_A ) end_ARG , (56)

where λmin(𝑨)subscript𝜆𝑨\lambda_{\min}(\boldsymbol{A})italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( bold_italic_A ) and λmax(𝑨)subscript𝜆𝑨\lambda_{\max}(\boldsymbol{A})italic_λ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( bold_italic_A ) represent the minimum and maximum eigenvalues of 𝑨𝑨\boldsymbol{A}bold_italic_A, respectively. It should be noted that if the smallest eigenvalue of 𝑨𝑨\boldsymbol{A}bold_italic_A is zero, λmin(𝑨)subscript𝜆𝑨\lambda_{\min}(\boldsymbol{A})italic_λ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( bold_italic_A ) is assigned to its smallest nonzero eigenvalue. For further details, please refer to Theorem 4 in [40].

Remark 1

The proposed ADPM algorithm is guaranteed to converge for arbitrary initialization ϕ(0)superscriptbold-ϕ0\boldsymbol{\phi}^{(0)}bold_italic_ϕ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT and 𝓾(0)superscript𝓾0{\cal\boldsymbol{u}}^{(0)}bold_caligraphic_u start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT provided that ρ(0)>0,0<δ1<1,δ2>1formulae-sequenceformulae-sequencesuperscript𝜌000subscript𝛿11subscript𝛿21\rho^{(0)}>0,0<\delta_{1}<1,\delta_{2}>1italic_ρ start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT > 0 , 0 < italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < 1 , italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 1, and κ𝜅\kappaitalic_κ is a sufficiently large positive number [42]. In particular, for the continuous phase case, if the penalty parameter ρ(t)superscript𝜌𝑡\rho^{(t)}italic_ρ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT is bounded, the limiting point ϕsuperscriptbold-ϕ\boldsymbol{\phi}^{\ast}bold_italic_ϕ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT of the sequence {ϕ(t)}t=0superscriptsubscriptsuperscriptbold-ϕ𝑡𝑡0\{\boldsymbol{\phi}^{(t)}\}_{t=0}^{\infty}{ bold_italic_ϕ start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT obtained via the ADPM algorithm is a Karush-Kuhn-Tucker (KKT) point of (50) [42], [43].

Algorithm 2 Overall Block Coordinate Descent Algorithm for Addressing (37)
1:  Initialize: 𝚽0superscript𝚽0\boldsymbol{\Phi}^{0}bold_Φ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT, 𝑾0superscript𝑾0\boldsymbol{W}^{0}bold_italic_W start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT, 𝒁0superscript𝒁0\boldsymbol{Z}^{0}bold_italic_Z start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT and 𝛀0superscript𝛀0\boldsymbol{\Omega}^{0}bold_Ω start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT, tolerance accuracy ε𝜀\varepsilonitalic_ε, maximum number of iterations rmaxsubscript𝑟r_{\max}italic_r start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, the objective function value of problem (37) P(𝚽0P(\boldsymbol{\Phi}^{0}italic_P ( bold_Φ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT, 𝑾0)\boldsymbol{W}^{0})bold_italic_W start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ).
2:  repeat
3:     Given 𝑾r,𝚽rsuperscript𝑾𝑟superscript𝚽𝑟\boldsymbol{W}^{r},\boldsymbol{\Phi}^{r}bold_italic_W start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT , bold_Φ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT and 𝛀rsuperscript𝛀𝑟\boldsymbol{\Omega}^{r}bold_Ω start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT, compute the auxiliary variable 𝒁rsuperscript𝒁𝑟\boldsymbol{Z}^{r}bold_italic_Z start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT by (39).
4:     Given 𝑾r,𝚽rsuperscript𝑾𝑟superscript𝚽𝑟\boldsymbol{W}^{r},\boldsymbol{\Phi}^{r}bold_italic_W start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT , bold_Φ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT and 𝒁rsuperscript𝒁𝑟\boldsymbol{Z}^{r}bold_italic_Z start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT, compute the auxiliary variable 𝛀rsuperscript𝛀𝑟\boldsymbol{\Omega}^{r}bold_Ω start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT by (40).
5:     Given 𝒁rsuperscript𝒁𝑟\boldsymbol{Z}^{r}bold_italic_Z start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT, 𝛀rsuperscript𝛀𝑟\boldsymbol{\Omega}^{r}bold_Ω start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT and 𝚽rsuperscript𝚽𝑟\boldsymbol{\Phi}^{r}bold_Φ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT, determine η𝜂\etaitalic_η and compute 𝑾r+1superscript𝑾𝑟1\boldsymbol{W}^{r+1}bold_italic_W start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT by (V-B).
6:     Given 𝒁rsuperscript𝒁𝑟\boldsymbol{Z}^{r}bold_italic_Z start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT, 𝛀rsuperscript𝛀𝑟\boldsymbol{\Omega}^{r}bold_Ω start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT and 𝑾r+1superscript𝑾𝑟1\boldsymbol{W}^{r+1}bold_italic_W start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT, compute 𝑨𝑨\boldsymbol{A}bold_italic_A and 𝒅𝒅\boldsymbol{d}bold_italic_d by (48) and (49) .
7:     Update ϕr+1superscriptbold-italic-ϕ𝑟1\boldsymbol{\phi}^{r+1}bold_italic_ϕ start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT by Algorithm 1, and reconstruct 𝚽r+1superscript𝚽𝑟1\boldsymbol{\Phi}^{r+1}bold_Φ start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT.
8:     Set r=r+1𝑟𝑟1r=r+1italic_r = italic_r + 1.
9:  until  r>rmax𝑟subscript𝑟r>r_{\max}italic_r > italic_r start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT or |P(𝑾r+1,𝚽r+1)P(𝑾r,𝚽r)|<ε𝑃superscript𝑾𝑟1superscript𝚽𝑟1𝑃superscript𝑾𝑟superscript𝚽𝑟𝜀|P(\boldsymbol{W}^{r+1},\boldsymbol{\Phi}^{r+1})-P(\boldsymbol{W}^{r},% \boldsymbol{\Phi}^{r})|<\varepsilon| italic_P ( bold_italic_W start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT , bold_Φ start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT ) - italic_P ( bold_italic_W start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT , bold_Φ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ) | < italic_ε
10:  Output: 𝚽,𝑾superscript𝚽superscript𝑾\boldsymbol{\Phi}^{\ast},\boldsymbol{W}^{\ast}bold_Φ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_W start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT.

V-D Overall Algorithm and Complexity Analysis

Now, we provide the detailed description of the overall BCD algorithm for solving (37) in Algorithm 2. Step 1 is used to initialize these variables and to set thresholds. Steps 2 and 3 are used to update the auxiliary variables 𝒁𝒁\boldsymbol{Z}bold_italic_Z and 𝛀𝛀\boldsymbol{\Omega}bold_Ω, step 4 updates the active beamforming matrix at the Tx. Then, steps 5 and 6 update the passive beamforming matrix of the RIS by ADPM proposed in Algorithm 1. Finally, the stop** condition is |P(𝑾r+1,𝚽r+1)P(𝑾r,𝚽r)|<ε𝑃superscript𝑾𝑟1superscript𝚽𝑟1𝑃superscript𝑾𝑟superscript𝚽𝑟𝜀|P(\boldsymbol{W}^{r+1},\boldsymbol{\Phi}^{r+1})-P(\boldsymbol{W}^{r},% \boldsymbol{\Phi}^{r})|<\varepsilon| italic_P ( bold_italic_W start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT , bold_Φ start_POSTSUPERSCRIPT italic_r + 1 end_POSTSUPERSCRIPT ) - italic_P ( bold_italic_W start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT , bold_Φ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ) | < italic_ε. The convergence analysis of the proposed Algorithm 2 can be found in [10], [35].

Note that the proposed problem formulation and algorithmic solution are applicable for all the three channel models in (2), (3), and (9). However, the impact of channel models on the system performance are characterized by the covariance matrices 𝚺𝒏^subscript𝚺^𝒏\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT in the problem formulation (35). A further extension of this work is to leverage the matrix structures of 𝚺𝒏^subscript𝚺^𝒏\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}bold_Σ start_POSTSUBSCRIPT over^ start_ARG bold_italic_n end_ARG end_POSTSUBSCRIPT and 𝑯^^𝑯\hat{\boldsymbol{H}}over^ start_ARG bold_italic_H end_ARG in different channel models to design specific algorithms aimed at exploring the impact of channel models on the system performance for RIS-aided MIMO communications.

We note that the original problem is divided into three sub-problems and addressed iteratively, which requires IOsubscript𝐼OI_{\rm O}italic_I start_POSTSUBSCRIPT roman_O end_POSTSUBSCRIPT iterations. For the update of the two auxiliary variables, {𝒁}𝒁\{\boldsymbol{Z}\}{ bold_italic_Z }, {𝛀}𝛀\{\boldsymbol{\Omega}\}{ bold_Ω }, it requires the computation of order 𝒪(NRx3)𝒪subscriptsuperscript𝑁3Rx\mathcal{O}(N^{3}_{\rm Rx})caligraphic_O ( italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT ) and 𝒪(Ns3)𝒪subscriptsuperscript𝑁3s\mathcal{O}(N^{3}_{\rm s})caligraphic_O ( italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT ), respectively. For the active beamforming problem at the Tx, solving {𝑾}𝑾\{\boldsymbol{W}\}{ bold_italic_W } requires the computation of order 𝒪(IηNTx3)𝒪subscript𝐼𝜂subscriptsuperscript𝑁3Tx\mathcal{O}(I_{\eta}N^{3}_{\rm Tx})caligraphic_O ( italic_I start_POSTSUBSCRIPT italic_η end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ), where Iηsubscript𝐼𝜂I_{\eta}italic_I start_POSTSUBSCRIPT italic_η end_POSTSUBSCRIPT is the number of iterations for searching the dual variable η𝜂\etaitalic_η. Since the number of transmitter antennas NTxsubscript𝑁TxN_{\rm Tx}italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT is usually larger than Nssubscript𝑁sN_{\rm s}italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT and NRxsubscript𝑁RxN_{\rm Rx}italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT, the complexity of the second sub-problem is 𝒪(IWIηNTx3)𝒪subscript𝐼𝑊subscript𝐼𝜂subscriptsuperscript𝑁3Tx\mathcal{O}(I_{W}I_{\eta}N^{3}_{\rm Tx})caligraphic_O ( italic_I start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_η end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT ), where IWsubscript𝐼𝑊I_{W}italic_I start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT is the number of iterations required to converge. For the passive beamforming problem at the RIS, the complexity of the third sub-problem is 𝒪(NR3+IANR2)𝒪subscriptsuperscript𝑁3Rsubscript𝐼𝐴subscriptsuperscript𝑁2R\mathcal{O}(N^{3}_{\rm R}+I_{A}N^{2}_{\rm R})caligraphic_O ( italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT + italic_I start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT ), where IAsubscript𝐼𝐴I_{A}italic_I start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT is the number of iterations required to converge. Based on the above analysis, the computational complexity of Algorithm 2 is 𝒪(IO(NRx3+Ns3+IWIηNTx3+NR3+IANR2))𝒪subscript𝐼Osubscriptsuperscript𝑁3Rxsubscriptsuperscript𝑁3ssubscript𝐼𝑊subscript𝐼𝜂subscriptsuperscript𝑁3Txsubscriptsuperscript𝑁3Rsubscript𝐼𝐴subscriptsuperscript𝑁2R\mathcal{O}(I_{\rm O}(N^{3}_{\rm Rx}+N^{3}_{\rm s}+I_{W}I_{\eta}N^{3}_{\rm Tx}% +N^{3}_{\rm R}+I_{A}N^{2}_{\rm R}))caligraphic_O ( italic_I start_POSTSUBSCRIPT roman_O end_POSTSUBSCRIPT ( italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT + italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT + italic_I start_POSTSUBSCRIPT italic_W end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_η end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT + italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT + italic_I start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT ) ).

VI Numerical Results

In this section, we present simulation results to assess the performance of the three different channel models in the presence of channel estimation error. A 3D Cartesian coordinate system is considered, where the BS, the RIS, and the Rx are located at (10,20,5)10205(10,-20,5)( 10 , - 20 , 5 ) m, (0,0,10)0010(0,0,10)( 0 , 0 , 10 ) m, and (100,50,5)100505(100,50,5)( 100 , 50 , 5 ) m, respectively. The Tx is equipped with NTx=64subscript𝑁Tx64N_{\rm Tx}=64italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT = 64 transmit antennas serving one user equipment (UE) equipped with NRx=8subscript𝑁Rx8N_{\rm Rx}=8italic_N start_POSTSUBSCRIPT roman_Rx end_POSTSUBSCRIPT = 8 receive antennas with the assistance of an RIS. The number of data streams is Ns=64subscript𝑁s64N_{\rm s}=64italic_N start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT = 64. The number of reflection elements is NR=256subscript𝑁R256N_{\rm R}=256italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT = 256. The path loss model utilized is a model tailored for RIS-aided near-field communication, as detailed in [25]. The carrier frequency is 30303030 GHz and the number of Monte Carlo experiments is 50505050. The ground-truth channel from the BS to the RIS follows a near-field channel model while the system design adopts three different channel models: the conventional near-field, the proposed piece-wise near-field and the far-field channel models, resulting in different covariance matrices of the interference-plus-noise signal. Meanwhile, the channel from the RIS to the Rx follows a far-field channel model. The transmit SNR is defined by SNR=10log10(PTx/σ2)SNR10subscript10subscript𝑃Txsuperscript𝜎2\text{SNR}=10\log_{10}(P_{\rm Tx}/\sigma^{2})SNR = 10 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT / italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), where PTxsubscript𝑃TxP_{\rm Tx}italic_P start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT and σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is the power of the transmit signal and noise, respectively. Unless otherwise specified, we set σ2=80superscript𝜎280{\sigma^{2}}=-80italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = - 80 dBm.

The initialization parameters of the ADPM-based algorithm for solving (50) and Algorithm 2 are provided as follows: the multiplier vector 𝒖(0)=𝟎superscript𝒖00{\boldsymbol{u}}^{(0)}=\boldsymbol{0}bold_italic_u start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = bold_0, ϵ=106italic-ϵsuperscript106\epsilon=10^{-6}italic_ϵ = 10 start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT, δ1=0.95subscript𝛿10.95\delta_{1}=0.95italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0.95, δ2=1.05subscript𝛿21.05\delta_{2}=1.05italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 1.05, κ=103𝜅superscript103\kappa=10^{3}italic_κ = 10 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT, ε=103𝜀superscript103\varepsilon=10^{-3}italic_ε = 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT, and rmax=100subscript𝑟100r_{\max}=100italic_r start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = 100. For more detailed parameter settings, please refer to the reference [42]. We assume an identical normalized CEE for different channel models, i.e., σ𝑮i2=τi𝔼{𝑮NΔ𝑴iF2}subscriptsuperscript𝜎2subscript𝑮𝑖subscript𝜏𝑖𝔼subscriptsuperscriptnormsubscript𝑮NΔsubscript𝑴𝑖2𝐹\sigma^{2}_{{\boldsymbol{G}}_{i}}=\tau_{i}\cdot\mathbb{E}\{\|\boldsymbol{G}_{% \rm N}-\Delta\boldsymbol{M}_{i}\|^{2}_{F}\}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_τ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ blackboard_E { ∥ bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT - roman_Δ bold_italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT }, where τisubscript𝜏𝑖\tau_{i}italic_τ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the normalized CEE for the Tx-RIS link and it is given by

τi=𝔼[𝑮NΔ𝑴i𝑮i^F2]𝔼[𝑮NΔ𝑴iF2],i=1,2,3.formulae-sequencesubscript𝜏𝑖𝔼delimited-[]subscriptsuperscriptnormsubscript𝑮NΔsubscript𝑴𝑖^subscript𝑮𝑖2𝐹𝔼delimited-[]subscriptsuperscriptnormsubscript𝑮NΔsubscript𝑴𝑖2𝐹𝑖123\tau_{i}=\frac{\mathbb{E}[||\boldsymbol{G}_{\rm N}-\Delta{\boldsymbol{M}_{i}}-% \hat{\boldsymbol{G}_{i}}||^{2}_{F}]}{\mathbb{E}[||\boldsymbol{G}_{\rm N}-% \Delta{\boldsymbol{M}_{i}}||^{2}_{F}]},i=1,2,3.italic_τ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG blackboard_E [ | | bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT - roman_Δ bold_italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over^ start_ARG bold_italic_G start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ] end_ARG start_ARG blackboard_E [ | | bold_italic_G start_POSTSUBSCRIPT roman_N end_POSTSUBSCRIPT - roman_Δ bold_italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ] end_ARG , italic_i = 1 , 2 , 3 . (57)

Similarly, we can define the normalized CEE τ𝑹subscript𝜏𝑹\tau_{\boldsymbol{R}}italic_τ start_POSTSUBSCRIPT bold_italic_R end_POSTSUBSCRIPT for the RIS-Rx link and it is given by

τ𝑹=𝔼[Δ𝑹F2]𝔼[𝑹F2].subscript𝜏𝑹𝔼delimited-[]subscriptsuperscriptnormΔ𝑹2𝐹𝔼delimited-[]subscriptsuperscriptnorm𝑹2𝐹\tau_{\boldsymbol{R}}=\frac{\mathbb{E}[||\Delta{\boldsymbol{R}}||^{2}_{F}]}{% \mathbb{E}[||\boldsymbol{R}||^{2}_{F}]}.italic_τ start_POSTSUBSCRIPT bold_italic_R end_POSTSUBSCRIPT = divide start_ARG blackboard_E [ | | roman_Δ bold_italic_R | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ] end_ARG start_ARG blackboard_E [ | | bold_italic_R | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ] end_ARG . (58)

VI-A Convergence Validation

Refer to caption
Figure 2: Convergence behavior when dBR=20subscript𝑑BR20d_{\rm BR}=20italic_d start_POSTSUBSCRIPT roman_BR end_POSTSUBSCRIPT = 20 m and SNR = 10 dB.

We investigate the average achievable SEs for different channel models without considering any estimation error in Fig. 2, i.e., τi=0,i=1,2,3formulae-sequencesubscript𝜏𝑖0𝑖123\tau_{i}=0,i=1,2,3italic_τ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0 , italic_i = 1 , 2 , 3, which means that only model mismatch errors are considered. Here, we set the distance along the y-axis from the Tx to the RIS as dBR=20subscript𝑑BR20d_{\rm BR}=20italic_d start_POSTSUBSCRIPT roman_BR end_POSTSUBSCRIPT = 20 m and SNR=10SNR10{\rm SNR}=10roman_SNR = 10 dB. We compare the performance of our proposed algorithm (i.e., Algorithm 2), with that of the algorithm in [8] under the near-field channel model, verifying the effectiveness and convergence of Algorithm 2. When K=1𝐾1K=1italic_K = 1, i.e., the RIS is not partitioned into subsurfaces, the piece-wise near-field channel model degenerates to the conventional far-field channel model. It can be seen that the performance of the piece-wise near-field channel model with multiple subsurface structures is indeed better than that of the traditional far-field model, owing to the reduced model mismatch error as well as the increased DoF. Furthermore, as the number of subsurface increases, adopting the piece-wise near-field channel model gradually approaches the performance of the conventional near-field channel model, where the latter model does not have any model mismatch error, which is consistent with the channel model analysis in Section II-B. It is worth noting that in the extreme case, where the RIS is divided into 256 subsurfaces, the piece-wise near-field channel model evolves into the near-field channel model. Therefore, dividing the RIS into 64 pieces of subsurfaces, i.e., K=8𝐾8K=8italic_K = 8, can achieve the dominant performance gain of the conventional near-field model, while this piece-wise near-field channel model significantly reduces the number of parameters involved.

VI-B SE vs SNR for Different Channel Models in the Presence of CEE

Refer to caption
Figure 3: The achievable SEs versus the SNR when τ=0.2𝜏0.2\tau=0.2italic_τ = 0.2.

Figure 3 presents the average achievable SEs for different channel models at different SNR levels in the presence of CEE with τ=0.2𝜏0.2\tau=0.2italic_τ = 0.2. The slope of the SE curve corresponding to each model represents the multiplexing gain with a steeper model indicating more available DoF. The results depict that the proposed piece-wise near-field channel provides more DoF compared to the traditional far-field channel, potentially leading to an increased achievable SE. Although the DoF provided by the near-field channel model are slightly higher than that of the piece-wise near-field channel model, adopting the piece-wise channel model can achieve a higher achievable SE than that of the conventional near-field model, due to the high sensitivity of the latter model to CEEs. This is because beamsteering is robust against the beam misalignment due to the angle and distance errors, and it exploits more DoF brought by the conventional near-field channel model. Besides, we note that the achievable SEs for all the three channel models are saturated in the high SNR regime due to the presence of CEEs and potential model mismatches.

VI-C SE vs Normalized CEE for Different Channel Models

Refer to caption
Figure 4: The achievable SEs versus the CEE variance when K=8𝐾8K=8italic_K = 8.

In Fig. 4, the average achievable SEs for different channel models are presented under different normalized CEEs for K=8𝐾8K=8italic_K = 8. When the normalized CEE τ𝜏\tauitalic_τ is 0, it corresponds to the scenario without any estimation error in Fig. 2. The results show that as the normalized CEE τ𝜏\tauitalic_τ increases, the performance of the near-field channel model deteriorates significantly, which indicates the high sensitivity of beamfocusing with respect to CEE in the near-field region. We observe that when the normalized CEE τ>0.13𝜏0.13\tau>0.13italic_τ > 0.13, the proposed piece-wise near-field model yields a better performance than the near-field model due to the enhanced robustness inherited from the far-field model. When the normalized CEE is large, the performances of the piece-wise near-field and near-field models are nearly identical as the beamfocusing in both cases is inaccurate. When the normalized CEE τ>0.75𝜏0.75\tau>0.75italic_τ > 0.75, the performance of the far-field channel model is better than that of both the near-field and piece-wise near-field models due to its robustness to CEE. This implies that for different levels of CEE, different channel models can be adopted to improve the system performance. By combining the results from Fig. 3 and Fig. 4, it becomes evident that the piece-wise channel model not only surpasses the DoF associated with the far-field model but also enhances the system robustness against the CEEs when compared to the near-field model.

VI-D SE vs Number of Transmit Antennas for Different Channel Models

Refer to caption
Figure 5: The achievable SEs versus the number of transmit antennas, NTxsubscript𝑁TxN_{\rm{Tx}}italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT, when NR=256subscript𝑁R256N_{\rm R}=256italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT = 256.

Figure 5 demonstrates the impact of the number of transmit antennas at the Tx on the achievable SE. We present the achievable SEs for three models under two scenarios: perfect CSI (τ=0𝜏0\tau=0italic_τ = 0) and imperfect CSI (τ=0.2𝜏0.2\tau=0.2italic_τ = 0.2). When τ=0𝜏0\tau=0italic_τ = 0, a linear scaling in the SE is observed with respect to the number of transmit antennas. In this scenario, the deterministic model mismatch Δ𝑯MΔsubscript𝑯M\Delta\boldsymbol{H}_{\rm M}roman_Δ bold_italic_H start_POSTSUBSCRIPT roman_M end_POSTSUBSCRIPT in (34) is considered as interference, and increasing NTxsubscript𝑁TxN_{\rm Tx}italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT provides more DoF for designing the active beamforming matrix 𝑾𝑾\boldsymbol{W}bold_italic_W to suppress the interference caused by the model mismatch, i.e., the first term in (34), thus approaching interference-free transmission. However, when τ=0.2𝜏0.2\tau=0.2italic_τ = 0.2, the DoF are not sufficient to design the active beamforming matrix to suppress the interference caused by the uncertain CEE, i.e., the second and third term in (34). Moreover, the piece-wise near-field model with imperfect CSI demonstrates a higher SE than the far-field model with perfect CSI, highlighting the DoF advantages introduced by the piece-wise near-field model.

VI-E SE vs Number of Reflecting Elements for Different Channel Models

Refer to caption
Figure 6: The achievable SEs versus the number of reflecting elements NRsubscript𝑁RN_{\rm R}italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT when NTx=64subscript𝑁Tx64N_{\rm{Tx}}=64italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT = 64.

In Fig. 6, we investigate the impact of the number of reflecting elements on the achievable SEs. We present SEs for the three models with varying numbers of reflecting elements under perfect CSI (τ=0𝜏0\tau=0italic_τ = 0) and imperfect CSI (τ=0.2𝜏0.2\tau=0.2italic_τ = 0.2) scenarios. We observe that as the number of reflecting elements increases, the SE of the three models exhibits linear growth when NRsubscript𝑁RN_{\rm R}italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT is small. However, the slope of the SE curves tends to flatten for larger numbers of reflecting elements, e.g., NR448subscript𝑁R448N_{\rm R}\geq 448italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT ≥ 448. This is because the model mismatch increases with the number of reflecting elements as the near-field propagation becomes dominant. Comparing the findings in Fig. 5 and Fig. 6, we observe that while increasing both NTxsubscript𝑁TxN_{\rm Tx}italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT and NRsubscript𝑁RN_{\rm R}italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT enhances the SE of the system, their roles in RIS-aided MIMO communication systems are distinct. On one hand, increasing NTxsubscript𝑁TxN_{\rm Tx}italic_N start_POSTSUBSCRIPT roman_Tx end_POSTSUBSCRIPT at the Tx enhances the spatial DoF, thereby facilitating interference mitigation and enabling higher beamforming gain. On the other hand, as the number of reflecting elements NRsubscript𝑁RN_{\rm R}italic_N start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT increases, the near-field effects become more pronounced, leading to an increase in the model mismatch between the piece-wise near-field channel and the far-field channel models.

VII Conclusions

This paper proposed to adopt a piece-wise near-field channel model for a RIS-aided MIMO system in the presence of CEEs. We considered three channel models (i.e., near-field, piece-wise near-field and far-field) and analyzed the impact of CEEs and model mismatches on the interference distribution. By treating the interference caused by CEEs and model mismatches as noise, we formulated the joint active and passive beamforming design as an optimization problem to maximize the achievable SE taking into account the transmit power constraint for active beamforming matrix and the constant modulus constraint for passive beamforming matrix. The joint beamforming optimization problem was equivalently transformed into an MSE minimization problem, which was then addressed by the proposed algorithm exploit the BCD and ADPM to handle the constant modulus constraint of RIS elements. We revealed that the adopted piece-wise near-field channel model not only improves the DoF gain but also demonstrates enhanced robustness against CEEs, resulting in higher achievable rates compared to the other channel models. A promising extension of this work is considering more reasonable parameters (distance and angle) for the error modeling schemes instead of overall channel estimation error modeling, which could lead to tailored channel estimation schemes and robust resource allocation strategies. Another future research direction is to utilize data-driven deep learning networks to select the number of subsurfaces in the piece-wise near-field channel model to achieve the best trade-off between the modeling accuracy and the robustness against CEEs.

References

  • [1] W. Chen, Z. Yang, Z. Wei, D. W. K. Ng, and M. Matthaiou, “Beamforming design for RIS-Aided MIMO communication: A piece-wise near-field model,” submitted to IEEE ICCC, May 2024.
  • [2] M. Matthaiou, O. Yurduseven, H. Q. Ngo, D. Morales-Jimenez, S. L. Cotton, and V. F. Fusco, “The road to 6G: Ten physical layer challenges for communications engineers,” IEEE Commun. Mag., vol. 59, no. 1, pp. 64–69, Jan. 2021.
  • [3] T. J. Cui, M. Q. Qi, X. Wan, J. Zhao, and Q. Cheng, “Coding metamaterials, digital metamaterials and programmable metamaterials,” Light Sci. Appl., vol. 3, no. 10, pp. e218–e218, Oct. 2014.
  • [4] J. Zhang, E. Björnson, M. Matthaiou, D. W. K. Ng, H. Yang, and D. J. Love, “Prospective multiple antenna technologies for beyond 5G,” IEEE J. Sel. Areas in Commun., vol. 38, no. 8, pp. 1637–1660, Aug. 2020.
  • [5] E. Basar, M. Di Renzo, J. De Rosny, M. Debbah, M. S. Alouini, and R. Zhang, “Wireless communications through reconfigurable intelligent surfaces,” IEEE Access, vol. 7, pp. 116 753–116 773, Aug. 2019.
  • [6] C. Pan et al., “Reconfigurable intelligent surfaces for 6G systems: Principles, applications, and research directions,” IEEE Commun. Mag., vol. 59, no. 6, pp. 14–20, Jun. 2021.
  • [7] X. Yu, D. Xu, and R. Schober, “MISO wireless communication systems via intelligent reflecting surfaces,” in Proc. IEEE ICCC, Aug. 2019, pp. 735–740.
  • [8] S. Zhang and R. Zhang, “Capacity characterization for intelligent reflecting surface aided MIMO communication,” IEEE J. Sel. Areas Commun., vol. 38, no. 8, pp. 1823–1838, Aug. 2020.
  • [9] H. Alwazani, A. Kammoun, A. Chaaban, M. Debbah, and M. S. Alouini, “Intelligent reflecting surface-assisted multi-user MISO communication: Channel estimation and beamforming design,” IEEE Open J. Commun. Soc., vol. 1, pp. 661–680, May 2020.
  • [10] C. Pan, H. Ren, K. Wang, W. Xu, M. Elkashlan, A. Nallanathan, and L. Hanzo, “Multicell MIMO communications relying on intelligent reflecting surfaces,” IEEE Trans. Wireless Commun., vol. 19, no. 8, pp. 5218–5233, Aug. 2020.
  • [11] S. Hu, Z. Wei, Y. Cai, C. Liu, D. W. K. Ng, and J. Yuan, “Robust and secure sum-rate maximization for multiuser MISO downlink systems with self-sustainable IRS,” IEEE Trans. Commun., vol. 69, no. 10, pp. 7032–7049, Jul. 2021.
  • [12] Z. Wei, Y. Cai, Z. Sun, D. W. K. Ng, J. Yuan, M. Zhou, and L. Sun, “Sum-rate maximization for IRS-assisted UAV OFDMA communication systems,” IEEE Trans. Wireless Commun., vol. 20, no. 4, pp. 2530–2550, Dec. 2020.
  • [13] C. Liu, X. Liu, Z. Wei, S. Hu, D. W. K. Ng, and J. Yuan, “Deep learning-empowered predictive beamforming for IRS-assisted multi-user communications,” in Proc. IEEE GLOBECOM, Dec. 2021, pp. 1–6.
  • [14] M. Cui, Z. Wu, Y. Lu, X. Wei, and L. Dai, “Near-field MIMO communications for 6G: Fundamentals, challenges, potentials, and future directions,” IEEE Commun. Mag., vol. 61, no. 1, pp. 40–46, Jan. 2022.
  • [15] X. Wei, L. Dai, Y. Zhao, G. Yu, and X. Duan, “Codebook design and beam training for extremely large-scale RIS: Far-field or near-field?” China Commun., vol. 19, no. 6, pp. 193–204, Jun. 2022.
  • [16] T. Wang, C. You, F. Zhou, and C. Yin, “Base station beamforming design in near-field XL-IRS beam training,” IEEE Commun. Lett., pp. 1–1, 2024.
  • [17] K. Dovelos, S. D. Assimonis, H. Q. Ngo, B. Bellalta, and M. Matthaiou, “Intelligent reflecting surfaces at terahertz bands: Channel modeling and analysis,” in Proc. IEEE ICC, Jul. 2021, pp. 1–6.
  • [18] Q. Tao, J. Wang, and C. Zhong, “Performance analysis of intelligent reflecting surface aided communication systems,” IEEE Commun. Lett., vol. 24, no. 11, pp. 2464–2468, Nov. 2020.
  • [19] J. Li and Y. Hong, “Intelligent reflecting surface aided communication systems: Performance analysis,” in Proc. IEEE PIMRC, Oct. 2021, pp. 519–524.
  • [20] Y. Liu, Z. Wang, J. Xu, C. Ouyang, X. Mu, and R. Schober, “Near-field communications: A tutorial review,” IEEE Open J. Commun. Soc., vol. 4, pp. 1999–2049, Aug. 2023.
  • [21] X. Wei and L. Dai, “Channel estimation for extremely large-scale massive MIMO: Far-field, near-field, or hybrid-field?” IEEE Commun. Lett., vol. 26, no. 1, pp. 177–181, Jan. 2021.
  • [22] Z. Zhou, X. Gao, J. Fang, and Z. Chen, “Spherical wave channel and analysis for large linear array in LoS conditions,” in Proc. IEEE GLOBECOM, Dec. 2015, pp. 1–6.
  • [23] A. K. Gupta and D. K. Nagar, Matrix Variate Distributions.   London U.K.: Chapman and Hall/CRC, 2018.
  • [24] M. Z. Siddiqi and T. Mir, “Reconfigurable intelligent surface-aided wireless communications: An overview,” Intell. and Converged Netw., vol. 3, no. 1, pp. 33–63, Mar. 2022.
  • [25] E. Björnson and L. Sanguinetti, “Power scaling laws and near-field behaviors of massive MIMO and intelligent reflecting surfaces,” IEEE Open J. Commun. Soc., vol. 1, pp. 1306–1324, Jan. 2020.
  • [26] C. Pan et al., “An overview of signal processing techniques for RIS/IRS-aided wireless systems,” IEEE J. Sel. Top. Signal Process., vol. 5, no. 16, pp. 883–917, Aug. 2022.
  • [27] Z. Wang, L. Liu, and S. Cui, “Channel estimation for intelligent reflecting surface assisted multiuser communications: Framework, algorithms, and analysis,” IEEE Trans. Wireless Commun., vol. 19, no. 10, pp. 6607–6620, Oct. 2020.
  • [28] O. El Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially sparse precoding in millimeter wave MIMO systems,” IEEE Trans. Wireless Commun., vol. 13, no. 3, pp. 1499–1513, Mar. 2014.
  • [29] Y. Zhao et al., “6G near-field technologies white paper.”   FuTURE Forum, Nan**g, China, Apr. 2024. doi: 10.12142/FuTURE.202404002.
  • [30] Y. Lu and L. Dai, “Near-field channel estimation in mixed LoS/NLoS environments for extremely large-scale MIMO systems,” IEEE Trans Commun., vol. 71, no. 6, pp. 3694–3707, Jun. 2023.
  • [31] H. Xie, F. Gao, and S. **, “An overview of low-rank channel estimation for massive MIMO systems,” IEEE Access, vol. 4, pp. 7313–7321, Nov. 2016.
  • [32] B. Zheng, C. You, W. Mei, and R. Zhang, “A survey on channel estimation and practical passive beamforming design for intelligent reflecting surface aided wireless communications,” IEEE Commun. Surv. Tutor., vol. 24, no. 2, pp. 1035–1071, Feb. 2022.
  • [33] C. Xing, S. Ma, and Y.-C. Wu, “Robust joint design of linear relay precoder and destination equalizer for dual-hop amplify-and-forward MIMO relay systems,” IEEE Trans. Signal Process., vol. 58, no. 4, pp. 2273–2283, Apr. 2009.
  • [34] K. Ardah, S. Gherekhloo, A. L. de Almeida, and M. Haardt, “TRICE: A channel estimation framework for RIS-aided millimeter-wave MIMO systems,” IEEE Signal Process. Letters, vol. 28, pp. 513–517, Feb. 2021.
  • [35] P. Zeng, D. Qiao, H. Qian, and Q. Wu, “Joint beamforming design for IRS aided multiuser MIMO with imperfect CSI,” IEEE Trans. Veh. Technol., vol. 71, no. 10, pp. 10 729–10 743, Oct. 2022.
  • [36] X. Zhang, Matrix Analysis and Applications.   Cambridge University Press, 2017.
  • [37] Q. Shi, M. Razaviyayn, Z. Luo, and C. He, “An iteratively weighted MMSE approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,” IEEE Trans. Signal Process., vol. 59, no. 9, pp. 4331–4340, Sep. 2011.
  • [38] X. Zhao, S. Lu, Q. Shi, and Z.-Q. Luo, “Rethinking WMMSE: Can its complexity scale linearly with the number of BS antennas?” IEEE Trans. Signal Process., vol. 71, pp. 433–446, Feb. 2023.
  • [39] S. Boyd and L. Vandenberghe, Convex Optimization.   Cambridge University Press, 2004.
  • [40] E. Ghadimi, A. Teixeira, I. Shames, and M. Johansson, “Optimal parameter selection for the alternating direction method of multipliers (ADMM): Quadratic problems,” IEEE Trans. Autom. Control, vol. 60, no. 3, pp. 644–658, Mar. 2014.
  • [41] S. Magnússon, P. C. Weeraddana, M. G. Rabbat, and C. Fischione, “On the convergence of alternating direction lagrangian methods for nonconvex structured optimization problems,” IEEE Trans. Control Netw. Syst., vol. 3, no. 3, pp. 296–309, Sep. 2015.
  • [42] X. Yu, G. Cui, J. Yang, J. Li, and L. Kong, “Quadratic optimization for unimodular sequence design via an ADPM framework,” IEEE Trans. Signal Process., vol. 68, pp. 3619–3634, May. 2020.
  • [43] X. Yu, G. Cui, Z. Zhang, L. Zhou, J. Yang, and L. Kong, “Discrete-phase waveform design to quadratic optimization via an ADPM framework with convergence guarantee,” in Proc. IEEE SAM, Jun. 2020, pp. 1–5.