RIS-aided MIMO Beamforming: Piece-Wise Near-field Channel Model

Weijian Chen, Zai Yang, Zhiqiang Wei, Derrick Wing Kwan Ng, and Michail Matthaiou, Part of the paper has been submitted to the 2024 ICCC [1]. W. Chen, Z. Yang, and Z. Wei are with the School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, China (e-mails: [email protected], [email protected], [email protected]). (Corresponding author: Zhiqiang Wei). D. W. K. Ng is with the School of Electrical Engineering and Telecommunications, the University of New South Wales, Australia (email: [email protected]). M. Matthaiou is with the Centre for Wireless Innovation (CWI), Queen’s University Belfast, BT3 9DT Belfast, U.K. (e-mail: [email protected]).

Abstract

This paper proposes a joint active and passive beamforming design for reconfigurable intelligent surface (RIS)-aided wireless communication systems, adopting a piece-wise near-field channel model. While a traditional near-field channel model, applied without any approximations, offers higher modeling accuracy than a far-field model, it renders the system design more sensitive to channel estimation errors (CEEs). As a remedy, we propose to adopt a piece-wise near-field channel model that leverages the advantages of the near-field approach while enhancing its robustness against CEEs. Our study analyzes the impact of different channel models, including the traditional near-field, the proposed piece-wise near-field and far-field channel models, on the interference distribution caused by CEEs and model mismatches. Subsequently, by treating the interference as noise, we formulate a joint active and passive beamforming design problem to maximize the spectral efficiency (SE). The formulated problem is then recast as a mean squared error (MSE) minimization problem and a suboptimal algorithm is developed to iteratively update the active and passive beamforming strategies. Simulation results demonstrate that adopting the piece-wise near-field channel model leads to an improved SE compared to both the near-field and far-field models in the presence of CEEs. Furthermore, the proposed piece-wise near-field model achieves a good trade-off between modeling accuracy and system’s degrees of freedom (DoF).

Index Terms:

Beamforming, near-field, piece-wise near-field, reconfigurable intelligent surface.

I Introduction

Reconfigurable intelligent surface (RIS)-aided wireless communications have attracted significant interest due to their excellent ability to mitigate the propagation path loss through passive beamforming and to circumvent potential obstacles via establishing alternative propagation paths [2]. An RIS is a metalic planar array consisting of numerous passive elements that can be independently reconfigured. By customizing the phase shift of each RIS element based on the channel conditions, the received signal strength at the desired user can be enhanced [3], while undesired interference from adjacent cells and other users can be efficiently suppressed [3, 4, 5, 6].

Numerous studies have focused on optimizing both the active beamforming at the transmitter (Tx) and passive beamforming at the RIS for RIS-aided communication systems [7, 8, 9, 10, 11, 12, 13]. For example, Yu et al. [7] proposed two algorithms for single-user scenarios, namely fixed point iteration and manifold optimization, to maximize the achievable rate for a RIS-aided point-to-point (P2P) multiple-input single-output (MISO) communication system. Similarly, Zhang et al. [8] established the fundamental capacity limit through the joint design of the RIS reflection coefficients and the transmit power allocation. On the other hand, in multi-user scenarios, Alwazani et al. [9] studied the joint optimization of transmit power strategy at the Tx and passive beamforming at the RIS to maximize the minimum user signal-to-interference-plus-noise ratio (SINR), accounting for imperfect channel estimation information (CSI). Additionally, Pan et al. [10] explored the weighted sum rate of all users in multi-cell scenarios by jointly designing the active and passive beamforming, subject to individual base station (BS)’s power constraint and the unit modulus constraint for the RIS reflection coefficients according to a weighted minimum mean square error (WMMSE) framework. The affordability and portability of RIS technology have sparked a range of research initiatives. The work of Hu et al. in [11] on multiuser MISO downlink communications utilized an intelligent reflection surface (IRS) capable of reflecting the signal and harvesting energy from signals, enhancing both the communication robustness and security. Also, Wei et al. explored the implementation of IRS in unmanned aerial vehicle (UAV)-based orthogonal frequency division multiple access (OFDMA) communication systems, utilizing the mobility of UAVs and the beamforming capabilities of the IRS to boost the system’s sum-rate [12]. Furthermore, deep learning (DL) has been adopted to design beamforming for RIS-assisted multiuser communications, potentially outperforming traditional model-based techniques [13]. It is worth noting that these studies assumed a far-field channel model, which is only applicable when the RIS is deployed far from both the Tx and users.

In practice, the electromagnetic (EM) radiation field is commonly divided into two regions: the far-field and the near-field. The demarcation between these regions is determined by the Rayleigh distance which is proportional to the square of the array aperture and is inversely proportional to the signal carrier wavelength [14]. Specially, when a receiver (Rx) is located in the far-field region, the EM field propagation is approximately modeled by planar waves with a certain approximate error. However, in the near-field region, near-field propagation becomes dominant and the EM field propagation is accurately modeled by spherical waves. In fact, with emerging high-frequency communications, utilizing large-aperture antenna arrays for potential power gains can also increase the Rayleigh distance up to a hundred meters, such that the far-field assumptions are no longer valid.

In recent years, a novel area of research has emerged with the goal of improving the efficiency and performance of near-field wireless communication systems. For instance, in [14], Cui et al. introduced the fundamental concept of RIS-assisted near-field communications and highlighted several areas for future research. Besides, Wei et al. [15] focused on develo** a near-field codebook for extremely large-scale RIS (XL-RIS) by taking into account the near-field cascaded array steering vector. Specifically, they crafted a hierarchical near-field codebook and introduced a corresponding hierarchical near-field beam training scheme to minimize the beam training overhead. Furthermore, Wang et al. [16] proposed two efficient schemes for optimizing the BS beamforming to improve the RIS beam training performance, leveraging a near-field channel model. Moreover, in [17], Dovelos et al. investigated RIS-aided MIMO systems in terms of power gain and energy efficiency (EE), considering a spherical wave channel model. Specifically, they analyzed the power gains under beamfocusing and beamsteering, concluding that beamfocusing in the radiating near-field is more useful than beamforming adopted in the far-field counterpart scenarios.

In fact, the necessity of adopting a near-field channel model in a RIS-aided wireless communication system is twofold. Firstly, the RIS is usually coated with a large aperture and needs to be deployed close to the Tx or the Rx to enhance its gain [18], [19]. In other words, the Tx or Rx is more likely to be within the near-field region of the RIS. Secondly, when the RIS is located close to the Tx or the Rx, either the Tx-RIS or the RIS-Rx channel is likely dominated by line-of-sight (LoS) propagation paths [20], [21]. Note that assuming a LoS far-field channel in either the Tx-RIS or the RIS-Rx link makes the cascaded channel matrix between the Tx and Rx only exhibits rank-one and, thus, limits the system’s design degrees of freedom (DoF). It is important to note that a practical near-field channel model, as discussed in [22], can effectively uncover the implicit higher ranks of the cascaded channel matrix, potentially leading to an improvement in both the system’s DoF and spectral efficiency (SE). As a result, considering a near-field channel model for RIS-aided wireless communication systems not only improves the modeling accuracy but also reveals more DoF.

Despite of the advantages mentioned above, embracing near-field communications entails significant challenges. One key challenge is that characterizing a near-field channel requires knowledge of all the distances between each pair of transceiver antennas, whereas the far-field channel only depends on the distance between transceivers, the angle of departure (AoD) at the Tx, and the angle of arrival (AoA) at the Rx. Consequently, estimating the near-field channel is generally more challenging due to the larger number of parameters involved. Another challenge is the sensitivity of near-field communications to channel estimation errors (CEEs). For a far-field channel model, aligning the beam direction of the Tx towards the Rx is usually sufficient to achieve the beamforming gain, as long as the Rx is within the beam’s cone. However, for near-field communications, precise beamfocusing is required to accurately focus the transmitted signal towards the Rx’s location. In practical situations with the presence of CEEs, the focus of near-field beamforming may deviate from the intended focal point, which degrades the beamforming gain and system performance [21]. These two challenges motivate us to consider a new channel model that not only leverages the advantages of the near-field channel model but also enhances its robustness against CEEs.

TABLE I: Comparison of Existing Works

References	Joint Beamforming	MIMO	Channel Model	CSI Assumption
[7, 9, 11, 12, 13]	✓	✗	Far-field	Perfect/Imperfect
[8, 10]	✓	✓	Far-field	Perfect
[14], [15, 16, 17]	✓	✓	Near-field	Perfect
This paper	✓	✓	Piece-wise near-field	Imperfect

As compared in Table I, various existing works have overlooked the near-field effect and some recent works on near-field communications assumed the ideal case of perfect CSI for beamforming design. To the best of the authors’ knowledge, our work is the first to compare the performance and robustness of different channel models for RIS-aided communications.

In comparison to the conference version [1], this paper provides a detailed description of the piece-wise near-field model and extensively discusses the rank of the cascaded channel and its advantages. Additionally, it presents a comprehensive algorithmic framework, parameter selection, and convergence description for the the alternating direction penalty method (ADPM) algorithm, addressing the RIS phase constant modulus constraint. Through simulations, this paper also illustrates the impact of the number of antennas at the Tx and the number of RIS reflecting elements on SE, shedding light on their distinct roles in RIS-aided communication systems. The key contributions of this paper are outlined as follows:

•

For the first time, we propose a joint active and passive beamforming design for RIS-aided MIMO wireless communication systems, adopting a piece-wise near-field channel model. This model approximates the near-field channel via dividing a large-aperture RIS into multiple small-aperture sub-surfaces and assuming heterogeneous far-field propagation between each surface to the Tx or Rx. One can imagine that the piece-wise near-field channel model retains the advantages of accurate channel modeling and increased DoF gain of the near-field channel model.
•

For the three different channel models, i.e., the near-field model, the far-field model and the piece-wise near-field model, and by assuming an identical normalized CEE but different model mismatches, we analyze the covariance matrices of the corresponding interference plus noise signals.
•

The joint active and passive beamforming design is formulated as an optimization problem to maximize the achievable SE by treating the interference caused by CEEs and model mismatches as noise. The achievable SE maximization problem is then transformed equivalently to a problem minimizing the mean square error (MSE), assuming a Gaussian CEE distribution.
•

A block coordinate descent (BCD) approach is adopted to alternately optimize the active and passive beamforming strategies. In particular, the ADPM is adopted to address the constant modulus constraint on the RIS reflection coefficient. We propose an iterative suboptimal algorithm with closed-form updating rules in each step to design the active and passive beamforming strategies. Simulation results demonstrate the system design DoF gain and the enhanced robustness against CEE of adopting the piece-wise near-filed channel model compared to the conventional far-field and near-field models.

The rest of this paper is organized as follows: In Section II, we introduce the RIS-aided near-field communication system model adopting the piece-wise near-field channel model. Section III provides the analysis of the interference distribution with different channel models. In Section IV, we formulate the joint active and passive beamfoming design problem. Then, we present the solution in Section V. In Section VI, we present numerical results with discussions. Finally, we conclude with Section VII.

Notations: Lower-case letters are used to represent scalars, while vectors and matrices are denoted by lower-case and upper-case boldface letters, respectively. The set of complex numbers is denoted by $\mathbb{C}$ ; $\Re$ extracts the real part of a complex number. For vector $\boldsymbol{x}$ , $\boldsymbol{x}_{j}$ denotes the $j$ -th element of $\boldsymbol{x}$ and $\text{diag}\left(\boldsymbol{x}\right)$ denotes a diagonal matrix with its diagonal entries given by ${\boldsymbol{x}}$ . For matrix $\boldsymbol{A}$ , $\mathbb{E}\{\boldsymbol{A}\}$ , $\boldsymbol{A}^{T}$ , $\boldsymbol{A}^{H}$ , $\|\boldsymbol{A}\|_{F}$ , $\boldsymbol{A}^{-1}$ , $\text{rank}\left(\boldsymbol{A}\right)$ , and $\text{tr}\left(\boldsymbol{A}\right)$ denote the expectation, matrix transpose, conjugate transpose, Frobenius norm, inverse, rank, and trace of $\boldsymbol{A}$ , respectively; $\mathcal{CN}(\boldsymbol{\mu},\boldsymbol{\Sigma})$ denotes a circularly symmetric complex Gaussian random vector distribution with mean $\boldsymbol{\mu}$ and covariance matrix $\boldsymbol{\Sigma}$ . The matrix $\boldsymbol{A}$ is said to have a matrix-variate complex Gaussian distribution, which can be written as $\mathcal{CN}_{n,m}(\boldsymbol{\Pi}_{n\times m},\boldsymbol{\Sigma}_{\rm r}% \otimes\boldsymbol{\Sigma}^{T}_{\rm c})$ , where the $n\times n$ matrix $\boldsymbol{\Sigma}_{\rm r}$ and the $m\times m$ matrix $\boldsymbol{\Sigma}_{\rm c}$ are the row and column covariance matrices of $\boldsymbol{A}$ , respectively [23]. Besides, $\rm vec(\boldsymbol{A})\sim\mathcal{CN}(\rm vec({\boldsymbol{\Pi}}),% \boldsymbol{\Sigma}_{r}\otimes\boldsymbol{\Sigma}^{T}_{c})$ where the operation $\rm vec(\boldsymbol{A})$ stacks the columns of the matrix $\boldsymbol{A}$ into a single vector. The symbols $\otimes$ and $\circ$ represent the Kronecker and Hadamard products, respectively. The symbol $\angle$ is used to denote the phase of a complex number.

Refer to caption — (a) Near-field channel model for the Tx-RIS channel.

II System Model

II-A System Model

We consider a RIS-aided point-to-point (P2P) multiple-input multiple-output (MIMO) wireless communication system, as illustrated in Fig. 1, where the RIS is deployed close to a Tx to assist the data transmission from the Tx to a Rx. The Tx equipped with ${N_{\rm Tx}}$ antennas, transmits ${N_{\rm s}}\ {(\leq N_{\rm Tx})}$ independent data streams to the Rx equipped with $N_{\rm Rx}$ antennas with the aid of a RIS comprising $N_{\rm R}=N_{{\rm R}_{y}}\times N_{{\rm R}_{z}}$ passive elements, where $N_{{\rm R}_{y}}$ and $N_{{\rm R}_{z}}$ are the number of elements in the horizontal and vertical directions of RIS, respectively. Without loss of generality, we assume that both the transmit and receive antenna arrays are uniform linear arrays (ULAs) with an identical antenna spacing for both arrays at the Tx and Rx as well as the RIS, denoted by $d$ . We assume that there is no direct link from the Tx to the Rx, which is likely to occur due to blockages as commonly assumed in the literature [24]. Furthermore, the narrowband channels from the Tx to the RIS and from the RIS to the Rx are denoted by $\boldsymbol{G}\in{\mathbb{C}^{N_{\rm R}\times{N_{\rm Tx}}}}$ and $\boldsymbol{R}\in{\mathbb{C}^{N_{\rm Rx}\times{N_{\rm R}}}}$ , respectively. The phase shift matrix of the RIS is denoted by $\boldsymbol{\Phi}=\text{diag}(\phi_{1},\phi_{2},...,\phi_{N_{\rm R}})\in{% \mathbb{C}^{N_{\rm R}\times{N_{\rm R}}}}$ , to perform passive beamforming, where $\phi_{n_{\rm R}}=e^{j{\zeta}_{n_{\rm R}}}$ and $\zeta_{n_{\rm R}}\in[0,2\pi]$ is the phase shift introduced by the ${n_{\rm R}}$ -th RIS element, $\forall{n_{\rm R}}\in\{1,\ldots,N_{\rm R}\}$ . We denote the symbols transmitted from the Tx to the Rx as $\boldsymbol{s}\in{\mathbb{C}^{N_{\rm s}\times 1}}\sim\mathcal{CN}(\boldsymbol{% 0},\boldsymbol{\rm I}_{N_{\rm s}})$ . The received signal $\boldsymbol{y}\in{\mathbb{C}^{N_{\rm Rx}\times 1}}$ at the Rx is given by

\boldsymbol{y}=\boldsymbol{R}\boldsymbol{\Phi}\boldsymbol{G}\boldsymbol{W}% \boldsymbol{s}+\boldsymbol{n}=\boldsymbol{H}\boldsymbol{W}\boldsymbol{s}+% \boldsymbol{n},

(1)

where $\boldsymbol{W}\in\mathbb{C}^{N_{\rm Tx}\times{N_{\rm s}}}$ is the precoding matrix (i.e., active beamforming matrix) at the Tx and $\boldsymbol{n}\in\mathbb{C}^{N_{\rm Rx}\times 1}\sim\mathcal{CN}(\boldsymbol{0% },\sigma^{2}\boldsymbol{\rm I}_{N_{\rm Rx}})$ is the additive white Gaussian noise (AWGN) at the Rx with a noise power of $\sigma^{2}$ . For the sake of presentation, we define the cascaded channel between the Tx and Rx as $\boldsymbol{H}=\boldsymbol{R}\boldsymbol{\Phi}\boldsymbol{G}$ .

II-B Channel Models

To achieve a large passive beamforming gain, the RIS is deployed close to the Tx as shown in Fig. 1(a) and the number of RIS elements is usually large [14], [15]. Therefore, we consider a near-field channel model for the Tx-RIS channel and a far-field multi-path channel model for the RIS-Rx channel. To avoid confusion, we adopt different channel models, i.e., $\boldsymbol{G}_{\rm N},\boldsymbol{G}_{\rm P}$ , and $\boldsymbol{G}_{\rm F}$ , to represent the link between Tx-RIS. In particular, assuming a LoS-dominated propagation between the Tx and RIS, the Tx-RIS channel is modeled as [25]

\begin{array}[]{c}\boldsymbol{G}_{\rm N}\end{array}=\begin{bmatrix}\alpha_{11}% e^{-j\frac{2\pi}{\lambda}{d_{11}}}&\cdots&\alpha_{1{N_{\rm Tx}}}e^{-j\frac{2% \pi}{\lambda}{d_{1{N_{\rm Tx}}}}}\\ \alpha_{21}e^{-j\frac{2\pi}{\lambda}{d_{21}}}&\cdots&\alpha_{2{N_{\rm Tx}}}e^{% -j\frac{2\pi}{\lambda}{d_{2{N_{\rm Tx}}}}}\\ \vdots&\ddots&\vdots\\ \alpha_{{N_{\rm R}1}}e^{-j\frac{2\pi}{\lambda}{d_{{N_{\rm R}1}}}}&\cdots&% \alpha_{N_{\rm R}N_{\rm Tx}}e^{-j\frac{2\pi}{\lambda}{d_{{N_{\rm R}}{N_{\rm Tx% }}}}}\end{bmatrix},

(2)

where $\alpha_{n_{\rm R}n_{\rm Tx}}=\frac{\lambda^{2}}{(4\pi d_{n_{\rm R}n_{\rm Tx}})% ^{2}}$ is the associated path coefficient, $\lambda$ is wavelength of the signal carrier frequency, and $d_{{n_{\rm R}}{n_{\rm Tx}}}$ is the distance between the ${n_{\rm Tx}}$ -th antenna at the Tx and the ${n_{\rm R}}$ -th element at the RIS. In contrast, the Tx-RIS channel has been approximated by a far-field channel model in the literature [26]–[27], i.e.,

\boldsymbol{G}_{\rm F}=\gamma\boldsymbol{a}_{N_{\rm R}}(\psi_{{\rm R}_{\rm az}% },\psi_{{\rm R}_{\rm el}})\boldsymbol{a}^{H}_{N_{\rm Tx}}(\theta_{\rm Tx}),

(3)

where $\gamma=\frac{\lambda^{2}}{(4\pi d_{\rm TR})^{2}}e^{-j\frac{2\pi}{\lambda}d_{% \rm TR}}$ is the path coefficient and $d_{\rm TR}$ is the distance between the centers of Tx and RIS. Moreover, $\boldsymbol{a}_{N_{\rm Tx}}(\theta_{\rm Tx})\in\mathbb{C}^{N_{\rm Tx}\times 1}$ and $\boldsymbol{a}_{N_{\rm R}}(\psi_{{\rm R}_{\rm az}},\psi_{{\rm R}_{\rm el}})\in% \mathbb{C}^{N_{{\rm R}}\times 1}$ denote the array response vectors at the Tx and RIS, respectively, which are given by

\boldsymbol{a}_{N_{\rm Tx}}(\theta_{\rm Tx})=\left[1,\cdots,e^{-j{\frac{2\pi}{% \lambda}(N_{\rm Tx}-1)d\sin{\theta}_{\rm Tx}}}\right]^{T},

(4)

and

\displaystyle\boldsymbol{a}_{N_{\rm R}}(\psi_{{\rm R}_{\rm az}},\psi_{{\rm R}_% {\rm el}})=\left[1,\cdots,e^{-j{\frac{2\pi}{\lambda}(N_{{\rm R}_{y}}-1)d\sin{% \psi}_{{\rm R}_{\rm az}}\cos{\psi}_{{\rm R}_{\rm el}}}}\right]^{T}\otimes\left% [1,\cdots,e^{-j{\frac{2\pi}{\lambda}(N_{{\rm R}_{z}}-1)d\sin{\psi}_{{\rm R}_{% \rm az}}\sin{\psi}_{{\rm R}_{\rm el}}}}\right]^{T},

(5)

respectively, where $\theta_{\rm Tx}$ is the azimuth AoD from the Tx to the center of the RIS, while $\psi_{{\rm R}_{\rm az}},\psi_{{\rm R}_{\rm el}}$ are the azimuth and elevation AoAs at the center of the RIS.

On the other hand, based on the Saleh-Valenzuela channel model [28], the RIS-Rx channel matrix is given by

\boldsymbol{R}=\sum_{l=1}^{L_{\rm Rx}}\beta_{l}{\boldsymbol{a}_{N_{\rm Rx}}}({% \varphi}_{{\rm Rx}}^{l})\boldsymbol{b}_{N_{\rm R}}^{H}({\varphi}_{{\rm R}_{\rm az% }}^{l},{\varphi}_{{\rm R}_{\rm el}}^{l}),

(6)

where $\beta_{l}=\frac{\lambda^{2}}{(4\pi d_{\rm RR})^{2}}e^{-j\frac{2\pi}{\lambda}d_% {\rm RR}}$ is the path coefficient of the $l$ -th path between the RIS and Rx, $d_{\rm RR}$ is the distance between the centers of RIS and Rx, and $L_{\rm Rx}$ is the total number of paths. The vectors ${\boldsymbol{a}_{N_{\rm Rx}}}({\varphi}_{{\rm Rx}}^{l})\in\mathbb{C}^{N_{{\rm Rx% }}\times 1}$ and ${\boldsymbol{b}_{N_{\rm R}}}({\varphi}_{{\rm R}_{\rm az}}^{l},{\varphi}_{{\rm R% }_{\rm el}}^{l})\in\mathbb{C}^{N_{\rm R}\times 1}$ denote the array response vectors at the Rx and RIS, respectively, and they are given by

{\boldsymbol{a}_{N_{\rm Rx}}}({\varphi}_{\rm Rx}^{l})=\left[1,\cdots,e^{-j{% \frac{2\pi}{\lambda}(N_{\rm Rx}-1)d\sin{\varphi}_{\rm Rx}^{l}}}\right]^{T}

(7)

and

\displaystyle{\boldsymbol{b}_{N_{\rm R}}}({\varphi}_{{\rm R}_{\rm az}}^{l},{% \varphi}_{{\rm R}_{\rm el}}^{l})=\left[1,\cdots,e^{-j{\frac{2\pi}{\lambda}(N_{% {\rm R}_{y}}-1)d\sin{\varphi}_{{\rm R}_{\rm az}}^{l}\cos{\varphi}_{{\rm R}_{% \rm el}}^{l}}}\right]^{T}\otimes\left[1,\cdots,e^{-j{\frac{2\pi}{\lambda}(N_{{% \rm R}_{z}}-1)d\sin{\varphi}_{{\rm R}_{\rm az}}^{l}\sin{\varphi}_{{\rm R}_{\rm el% }}^{l}}}\right]^{T},

(8)

respectively, where ${\varphi}_{\rm Rx}^{l}$ is the azimuth AoA of the $l$ -th path at the Rx, and ${\varphi}_{{\rm R}_{\rm az}}^{l}$ and ${\varphi}_{{\rm R}_{\rm el}}^{l}$ are the azimuth and elevation AoDs of the $l$ -th path from the RIS to the Rx, respectively.

Comparing (2) and (3), we can observe that the near-field channel model in (2) involves more parameters than that in (3). Indeed, when the Tx is located within the near-field region of the RIS, (2) is a more accurate model to describe the signal propagation between the Tx and RIS, compared to (3). It has been demonstrated in [22], [20], and [29] that $\text{rank}({\boldsymbol{G}_{\rm N}})>\text{rank}({\boldsymbol{G}_{\rm F}})=1$ usually holds when the near-field condition is satisfied, i.e., the distance between the Tx and RIS is shorter than the Rayleigh distance. However, for the near-field channel model in (2), beamfocusing is required for passive beamforming design at the RIS, which is typically more sensitive to CEEs than conventional beamsteering for the far-field channel model. Therefore, we advocate the utilization of a piece-wise near-field channel model to approximate the near-field channel in (2). We propose to equally divide the $N_{{\rm R}_{y}}$ RIS elements in each row of the RIS into $K$ subarrays, and divide the $N_{{\rm R}_{z}}$ RIS elements in each column of the RIS into $K$ subarrays. Without lost of generality, we assume that both $\frac{N_{{\rm R}_{y}}}{K}$ and $\frac{N_{{\rm R}_{z}}}{K}$ are integers. As a result, the original RIS is divided into $K^{2}$ subsurfaces and the Rayleigh distance is reduced by a factor of $K^{2}$ . Consequently, we can safely assume that the channel between each subsurface and the Tx follows a far-field channel model. Thus, the proposed piece-wise near-field channel matrix, $\boldsymbol{G}_{\rm P}$ , is given by

\boldsymbol{G}_{\rm P}=\left(\begin{bmatrix}\boldsymbol{g}^{\rm h}_{1}\\ \vdots\\ \boldsymbol{g}^{\rm h}_{K}\\ \end{bmatrix}\otimes\begin{bmatrix}\boldsymbol{g}^{\rm v}_{1}\\ \vdots\\ \boldsymbol{g}^{\rm v}_{K}\\ \end{bmatrix}\right){\boldsymbol{a}_{N_{\rm Tx}}^{H}}({\theta}_{\rm Tx}),

(9)

where ${\boldsymbol{a}_{N_{\rm Tx}}}({\theta}_{\rm Tx})$ was defined in (4), while $\boldsymbol{g}^{\rm h}_{i}\in\mathbb{C}^{{\frac{N_{{\rm R}_{y}}}{K}}\times 1}$ and $\boldsymbol{g}^{\rm v}_{i}\in\mathbb{C}^{{\frac{N_{{\rm R}_{z}}}{K}}\times 1}$ are defined by

\boldsymbol{g}^{\rm h}_{i}=\frac{\lambda}{4\pi r_{i}}e^{-j{\frac{\pi}{\lambda}% r_{i}}}\boldsymbol{b}_{\frac{N_{{\rm R}_{y}}}{K}}(\theta^{i}_{{\rm R}_{az}},% \theta^{i}_{{\rm R}_{el}}),{i}=1,\ldots,K,

(10)

and

\boldsymbol{g}^{\rm v}_{i}=\frac{\lambda}{4\pi r_{i}}e^{-j{\frac{\pi}{\lambda}% r_{i}}}\boldsymbol{b}_{\frac{N_{{\rm R}_{z}}}{K}}(\theta^{i}_{{\rm R}_{az}},% \theta^{i}_{{\rm R}_{el}}),{i}=1,\ldots,K,

(11)

respectively. In (10) and (11), ${r_{i}}$ , ${\theta}_{{\rm R}_{\rm az}}^{i}$ , and ${\theta}_{{\rm R}_{\rm el}}^{i}$ are the distance, azimuth, and elevation AoAs from the Tx to the center of the $i$ -th subsurface, respectively. The vectors $\boldsymbol{b}_{\frac{N_{{\rm R}_{y}}}{K}}({\theta}_{{\rm R}_{\rm az}}^{i},{% \theta}_{{\rm R}_{\rm el}}^{i})$ and ${\boldsymbol{b}_{\frac{N_{{\rm R}_{z}}}{K}}}({\theta}_{{\rm R}_{\rm az}}^{i},{% \theta}_{{\rm R}_{\rm el}}^{i})$ are the horizontal and vertical array response vectors of the $i$ -th subsurface which are given by

{\boldsymbol{b}_{\frac{N_{{\rm R}_{y}}}{K}}}({\theta}_{{\rm R}_{\rm az}}^{i},{% \theta}_{{\rm R}_{\rm el}}^{i})=\left[1,\cdots,e^{-j{\frac{2\pi}{\lambda}({% \frac{N_{{\rm R}_{y}}}{K}}-1)d\sin{\theta}_{{\rm R}_{\rm az}}^{i}\cos{\theta}_% {{\rm R}_{\rm el}}^{i}}}\right]^{T},

(12)

and

{\boldsymbol{b}_{\frac{N_{{\rm R}_{z}}}{K}}}({\theta}_{{\rm R}_{\rm az}}^{i},{% \theta}_{{\rm R}_{\rm el}}^{i})=\left[1,\cdots,e^{-j{\frac{2\pi}{\lambda}({% \frac{N_{{\rm R}_{z}}}{K}}-1)d\sin{\theta}_{{\rm R}_{\rm az}}^{i}\sin{\theta}_% {{\rm R}_{\rm el}}^{i}}}\right]^{T},

(13)

respectively. For illustration, let us take $K=2$ as an example, which is shown in Fig. 1(b). In this case, the piece-wise near-field channel matrix $\boldsymbol{G}_{\rm P}$ is given by

\boldsymbol{G}_{\rm P}=\left(\begin{bmatrix}\boldsymbol{g}^{\rm h}_{1}\\ \boldsymbol{g}^{\rm h}_{2}\\ \end{bmatrix}\otimes\begin{bmatrix}\boldsymbol{g}^{\rm v}_{1}\\ \boldsymbol{g}^{\rm v}_{2}\\ \end{bmatrix}\right){\boldsymbol{a}_{\rm Tx}^{H}}({\theta}_{\rm Tx}).

(14)

If the number of subsurfaces is $K^{2}=1$ , the piece-wise near-field channel model degenerates to the far-field case $\boldsymbol{G}_{\rm F}$ in (3). If $K=N_{{\rm R}_{y}}=N_{{\rm R}_{z}}$ , the piece-wise near-field model becomes the accurate traditional near-field channel model $\boldsymbol{G}_{\rm N}$ in (2). In other words, the piece-wise near-field channel model bridges the near-field and the far-field via fine tuning the number of subsurfaces. Comparing (2), (3), and (9), we can observe that the piece-wise channel model not only requires less number of parameters than the near-field model, but also enjoys a higher modeling accuracy than the far-field model. Indeed, the model presented in (9) is inspired by the model adopted in [30], which assumes the use of a ULA. Our proposed model in (9) considers the more practical uniform planar array (UPA) configuration of RIS, which is a more general model. Moreover, inheriting from the near-field channel model, the piece-wise channel matrix in (9) still avails of a higher rank than the far-field channel matrix in (3), i.e., $\text{rank}(\boldsymbol{G}_{\rm P})\geq\text{rank}(\boldsymbol{G}_{\rm F})=1$ [20], and thus it can improve the system’s DoF and performance by exploiting the distance and angle diversity among different subsurfaces. In other words, the robustness stems from the fact that piece beamsteering is less prone to errors in distance and angle when there exist CEEs.

III Analysis of Interference Distribution for Different Channel Models

In this section, we first introduce the CEE models and then analyze the distribution of interference-plus-noise signal for the three given models.

III-A Channel Estimation Error Models

Based on different channel models, one can obtain the estimated channel via traditional training and parameter estimation procedures [31], [32]. Note that when channel modeling is not accurate, the estimated channel suffers from not only CEEs, but also model mismatches. In particular, the actual channel between the Tx and RIS, which should follow a near-field channel model in the considered system, is composed of the estimated channel, the corresponding estimation error, and the model mismatch error. In the following, we represent the channel $\boldsymbol{G}_{\rm N}$ by different channel models:

$\displaystyle\boldsymbol{G}_{\rm N}$	$\displaystyle=\hat{\boldsymbol{G}}_{\rm N}+\Delta{\boldsymbol{G}}_{\rm N}+% \Delta{\boldsymbol{M}}_{\rm N},$	[Conventional]	(15)
$\displaystyle\boldsymbol{G}_{\rm N}$	$\displaystyle=\hat{\boldsymbol{G}}_{\rm P}+\Delta{\boldsymbol{G}}_{\rm P}+% \Delta{\boldsymbol{M}}_{\rm P},\text{and}$	[Proposed]	(16)
$\displaystyle\boldsymbol{G}_{\rm N}$	$\displaystyle=\hat{\boldsymbol{G}}_{\rm F}+\Delta{\boldsymbol{G}}_{\rm F}+% \Delta{\boldsymbol{M}}_{\rm F}.$	[Far-field]	(17)

For concise notation, we use subscripts $\{1\}=\{\rm N\}$ , $\{2\}=\{\rm P\}$ , and $\{3\}=\{\rm F\}$ to denote the conventional near-field, the proposed piece-wise near-field, and the far-field channel models, respectively, while $\hat{\boldsymbol{G}}_{i}$ , $\Delta{\boldsymbol{G}}_{i}$ , and $\Delta{\boldsymbol{M}}_{i},i=1,2,3$ , represent the estimated channel, the CEEs and the model mismatch error of different channel models, respectively. The left-hand side of (15), (16), and (17) is the ground-truth near-field channel state information $\boldsymbol{G}_{\rm N}$ . As discussed in (2), (3), and (9), different channel models represent $\boldsymbol{G}_{\rm N}$ with different channel matrix structures, i.e., $\boldsymbol{G}_{\rm N}={\boldsymbol{G}}_{i}+\Delta{\boldsymbol{M}}_{i},i=1,2,3$ . When $i=1$ , this means that we adopt the near-field channel model for channel estimation which is free of model mismatch, i.e., $\Delta{\boldsymbol{M}}_{1}=\boldsymbol{0}$ . Based on different channel models in (2), (3), and (9), channel estimation introduces additional CEE, i.e., $\boldsymbol{G}_{i}=\hat{\boldsymbol{G}}_{i}+\Delta{\boldsymbol{G}}_{i},i=1,2,3$ .

Assuming that the model mismatch errors are deterministic and the CEE follows a matrix-variate Gaussian distribution [33], we have

	$\displaystyle\boldsymbol{G}_{\rm N}$	$\displaystyle=\hat{\boldsymbol{G}}_{i}+\Delta{\boldsymbol{M}}_{i}+\Delta{% \boldsymbol{G}}_{i},\ \text{and}$		(18)
	$\displaystyle\Delta{\boldsymbol{G}}_{i}$	$\displaystyle\sim\mathcal{CN}_{N_{\rm R},N_{\rm Tx}}(\boldsymbol{0},\sigma_{{% \boldsymbol{G}}_{i}}^{2}\boldsymbol{\rm I}_{N_{\rm R}}\otimes\boldsymbol{\rm I% }_{N_{\rm Tx}}).$		(19)

Then, the distribution of the overall CSI imperfection $\Delta\widetilde{\boldsymbol{G}}_{i}=\Delta{\boldsymbol{G}}_{i}+\Delta{% \boldsymbol{M}}_{i}$ follows

\displaystyle\Delta\widetilde{\boldsymbol{G}}_{i}\sim\mathcal{CN}_{N_{\rm R},N% _{\rm Tx}}(\Delta{\boldsymbol{M}}_{i},\sigma_{{\boldsymbol{G}}_{i}}^{2}% \boldsymbol{\rm I}_{N_{\rm R}}\otimes\boldsymbol{\rm I}_{N_{\rm Tx}}).

(20)

Similarly, the channel between the RIS and Rx is given by

\boldsymbol{R}=\hat{\boldsymbol{R}}+\Delta{\boldsymbol{R}},\Delta{\boldsymbol{% R}}\sim\mathcal{CN}_{N_{\rm Rx},N_{\rm R}}(\boldsymbol{0},\sigma_{{\boldsymbol% {R}}}^{2}\boldsymbol{\rm I}_{N_{\rm Rx}}\otimes\boldsymbol{\rm I}_{N_{\rm R}}),

(21)

where $\hat{\boldsymbol{R}}$ and $\Delta{\boldsymbol{R}}$ represent the estimated channel and CEE of the RIS-Rx channel, respectively. To facilitate the subsequent analysis and design, we make the assumption that the channels of the Tx-RIS and RIS-Rx links are independently estimated and, thus, $\Delta\widetilde{\boldsymbol{G}}_{i}$ and $\Delta{\boldsymbol{R}}$ are independent of each other.¹¹1When dealing with a passive RIS, it becomes necessary to reconstruct the individual CSI of all links involving the RIS based on the acquired aggregate CSI, specifically the cascaded Tx-RIS-Rx CSI. This can be achieved by employing the methodology outlined in [34], [35].

III-B Covariance Matrix of the Interference-plus-Noise Signal

For concise notation, let us drop the subscript for now. Based on the assumed CEE model of $\boldsymbol{G}$ for the three channel models, we analyze the CEE distribution of the cascaded channel $\boldsymbol{H}$ as below. Assuming that the estimations of the Tx-RIS and RIS-Rx channels are separately executed with their estimated values $\hat{\boldsymbol{G}}$ and $\hat{\boldsymbol{R}}$ [27], [35], respectively, the estimated cascaded channel is given by

\hat{\boldsymbol{H}}=\hat{\boldsymbol{R}}\boldsymbol{\Phi}\hat{\boldsymbol{G}}.

(22)

Then, we define $\Delta\boldsymbol{H}$ and $\Delta{\boldsymbol{H}}_{\rm M}$ as the cascaded channel estimation error and the corresponding model mismatch error, respectively, which are given by

	$\displaystyle\Delta\boldsymbol{H}$	$\displaystyle=\Delta\boldsymbol{R}\boldsymbol{\Phi}\hat{\boldsymbol{G}}+\hat{% \boldsymbol{R}}\boldsymbol{\Phi}\Delta{\boldsymbol{G}}+\Delta\boldsymbol{R}% \boldsymbol{\Phi}\Delta{\boldsymbol{G}},\text{ and}$		(23)
	$\displaystyle\Delta\boldsymbol{H}_{\rm M}$	$\displaystyle=\hat{\boldsymbol{R}}\boldsymbol{\Phi}\Delta\boldsymbol{M}+\Delta% \boldsymbol{R}\boldsymbol{\Phi}\Delta\boldsymbol{M},$		(24)

respectively. The received signal at the Rx in (1) can be reformulated as

\boldsymbol{y}=(\hat{\boldsymbol{H}}+\Delta\boldsymbol{H}+\Delta\boldsymbol{H}% _{\rm M})\boldsymbol{Ws}+\boldsymbol{n}=\hat{\boldsymbol{H}}\boldsymbol{Ws}+% \hat{\boldsymbol{n}},

(25)

with the interference-plus-noise signal $\hat{\boldsymbol{n}}=(\Delta\boldsymbol{H}+\Delta\boldsymbol{H}_{\rm M})% \boldsymbol{Ws}+\boldsymbol{n}$ .

For the convenience of analysis, we assume that

$\displaystyle\\|\Delta\boldsymbol{R}\\|_{F}$	$\displaystyle\ll\\|\hat{\boldsymbol{R}}\\|_{F},$	(26)
$\displaystyle\\|\Delta\boldsymbol{G}\\|_{F}$	$\displaystyle\ll\\|\hat{\boldsymbol{G}}\\|_{F},\text{and}$	(27)
$\displaystyle\\|\Delta\boldsymbol{M}\\|_{F}$	$\displaystyle\ll\\|\hat{\boldsymbol{G}}\\|_{F},$	(28)

respectively, which implies a relatively small CEE and model mismatch.²²2The assumptions are reasonable as the errors are significantly smaller than their estimated values, indicating a high level of confidence in the accuracy of the estimations. Referring to the CEE model in Section III-A, the estimated channel is significantly dominant, justifying this assumption. By discarding the minor terms $\Delta\boldsymbol{R}\boldsymbol{\Phi}\Delta{\boldsymbol{G}}$ and $\Delta\boldsymbol{R}\boldsymbol{\Phi}\Delta\boldsymbol{M}$ in (23) and (24), $\Delta\boldsymbol{H}$ and $\Delta\boldsymbol{H}_{\rm M}$ can be approximated as:

	$\displaystyle\Delta\boldsymbol{H}$	$\displaystyle\approx\Delta\boldsymbol{R}\boldsymbol{\Phi}\hat{\boldsymbol{G}}+% \hat{\boldsymbol{R}}\boldsymbol{\Phi}\Delta{\boldsymbol{G}}\text{ and}$		(29)
	$\displaystyle\Delta\boldsymbol{H}_{\rm M}$	$\displaystyle\approx\hat{\boldsymbol{R}}\boldsymbol{\Phi}\Delta\boldsymbol{M},$		(30)

respectively. Notice that $\Delta\boldsymbol{H}_{\rm M}$ in (30) is approximately deterministic when the passive beamforming strategy $\boldsymbol{\Phi}$ and the estimated channel $\hat{\boldsymbol{R}}$ are given.

The covariance matrix of the interference-plus-noise signal $\hat{\boldsymbol{n}}$ is given by

$\displaystyle\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}$	$\displaystyle=\mathbb{E}\{\hat{\boldsymbol{n}}\hat{\boldsymbol{n}}^{\it H}\}$	(31)
	$\displaystyle=\Delta\boldsymbol{H}_{\rm M}{\boldsymbol{W}}{\boldsymbol{W}}^{H}% \Delta\boldsymbol{H}^{H}_{\rm M}+\mathbb{E}\{\Delta\boldsymbol{H}{\boldsymbol{% W}}{\boldsymbol{W}}^{H}\Delta\boldsymbol{H}^{H}\}+\sigma^{2}{\boldsymbol{\rm I% }}_{N_{\rm Rx}}$
	$\displaystyle=\Delta\boldsymbol{H}_{\rm M}{\boldsymbol{W}}{\boldsymbol{W}}^{H}% \Delta\boldsymbol{H}^{H}_{\rm M}+\mathbb{E}\{\Delta\boldsymbol{R}\boldsymbol{% \Phi}\hat{\boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\hat{\boldsymbol{% G}}^{H}\boldsymbol{\Phi}^{H}\Delta\boldsymbol{R}^{H}\}$
	$\displaystyle+\mathbb{E}\{\hat{\boldsymbol{R}}\boldsymbol{\Phi}\Delta{% \boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\Delta{\boldsymbol{G}}^{H}% \boldsymbol{\Phi}^{H}\hat{\boldsymbol{R}}^{H}\}+\sigma^{2}{\boldsymbol{\rm I}}% _{N_{\rm Rx}}.$

Next, we utilize the following lemma to calculate each term of $\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}$ .

Lemma 1 ([36])

For a matrix $\boldsymbol{X}\in\mathbb{C}^{n\times m}$ , which obeys the distribution $\boldsymbol{X}\sim{\mathcal{CN}_{n,m}}(\hat{\boldsymbol{X}},\boldsymbol{R}_{n}% \otimes\boldsymbol{R}_{m})$ , where $\boldsymbol{R}_{m}\in\mathbb{C}^{m\times m}$ and $\boldsymbol{R}_{n}\in\mathbb{C}^{n\times n}$ represent the receive and transmit correlation matrices, respectively, and a compatible matrix $\boldsymbol{Z}$ , it follows that

$\mathbb{E}\left\{\boldsymbol{X}\boldsymbol{Z}{\boldsymbol{X}}^{H}\right\}=\hat% {\boldsymbol{X}}\boldsymbol{Z}\hat{\boldsymbol{X}}^{H}+\text{tr}(\boldsymbol{Z% }\boldsymbol{R}_{m}^{T}){\boldsymbol{R}_{n}}$ .

It follows from Lemma 1 that

\displaystyle\mathbb{E}\left\{\Delta\boldsymbol{R}\boldsymbol{\Phi}\hat{% \boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\hat{\boldsymbol{G}}^{H}% \boldsymbol{\Phi}^{H}\Delta\boldsymbol{R}^{H}\right\}=\sigma_{{\boldsymbol{R}}% }^{2}\text{tr}(\hat{\boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\hat{% \boldsymbol{G}}^{H})\boldsymbol{\rm I}_{N_{\rm Rx}},

(32)

and

\displaystyle\mathbb{E}\left\{\hat{\boldsymbol{R}}\boldsymbol{\Phi}\Delta{% \boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\Delta{\boldsymbol{G}}^{H}% \boldsymbol{\Phi}^{H}\hat{\boldsymbol{R}}^{H}\right\}=\sigma_{{\boldsymbol{G}}% }^{2}\text{tr}({\boldsymbol{W}}{\boldsymbol{W}}^{H})\hat{\boldsymbol{R}}{\hat{% \boldsymbol{R}}}^{H}.

(33)

Inserting (32) and (33) into (31), the covariance matrix of the interference-plus-noise signal in the presence of CEE and model mismatch is given by

\displaystyle\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}

\displaystyle=\Delta\boldsymbol{H}_{\rm M}{\boldsymbol{W}}{\boldsymbol{W}}^{H}% \Delta\boldsymbol{H}^{H}_{\rm M}+\sigma_{{\boldsymbol{R}}}^{2}\text{tr}(\hat{% \boldsymbol{G}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\hat{\boldsymbol{G}}^{H})% \boldsymbol{\rm I}_{N_{\rm Rx}}+\sigma_{{\boldsymbol{G}}}^{2}\text{tr}({% \boldsymbol{W}}{\boldsymbol{W}}^{H})\hat{\boldsymbol{R}}{\hat{\boldsymbol{R}}}% ^{H}+\sigma^{2}{\boldsymbol{\rm I}}_{N_{\rm Rx}}.

(34)

Substituting the subscript $i=\{1,2,3\}$ , we can obtain the interference distributions for the three channel models caused by the CEEs and model mismatches, respectively.

Note that for the near-field channel model, i.e., $\Delta\boldsymbol{M}_{1}=\boldsymbol{0}$ , the covariance matrix $\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}$ is influenced by the CSI error in the second and third terms and noise in the last term in (34), which is in line with the traditional imperfect CSI schemes that do not account for the model mismatch. On the other side, both the piece-wise near-field model and far-field model introduce additional interference due to the model mismatch in the first term in (34), which results in performance deterioration. Nevertheless, the piece-wise near-field model is a good compromise between the near-field and far-field models in terms of the number of channel model parameters and modeling accuracy. Indeed, the robustness of the proposed model arises from the system’s DoF introduced by the near-field model, as well as the insensitivity of the far-field model to errors in distance and angle.

IV Problem Formulation

Due to the complicated variable coupling in the cascaded CEE when designing $\boldsymbol{W}$ and $\mathbf{\Phi}$ based on the estimated channel, we treat the interference caused by the model mismatch error and CEE as noise.³³3This represents a worst case assumption. Then, the achievable SE between the transceivers is given by [37]

{\cal R}(\boldsymbol{W},\boldsymbol{\Phi})=\log_{2}\det(\boldsymbol{\rm I}_{N_% {\rm Rx}}+{\hat{\boldsymbol{H}}{\boldsymbol{W}}{\boldsymbol{W}}^{H}\hat{% \boldsymbol{H}}^{H}}{\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}^{-1}}).

(35)

Then, the joint active and passive beamforming design to maximize the achievable SE, under the transmit power constraint at the Tx and the constant modulus constraint for the phase control variable at the RIS, is formulated as the following optimization problem:

\begin{array}[]{ll}\max\limits_{\boldsymbol{W},\boldsymbol{\Phi}}&{\cal R}(% \boldsymbol{W},\boldsymbol{\Phi})\\ \text{s.t.}&\|\boldsymbol{W}\|^{2}_{F}\leq P_{\rm Tx},\\ &|\boldsymbol{\phi}_{n_{\rm R}}|=1,~{}\forall{n_{\rm R}}=1,\cdots,N_{\rm R},\\ \end{array}

(36)

where $\boldsymbol{\phi}=\text{diag}({\boldsymbol{\Phi}})$ . The constraint $\|\boldsymbol{W}\|^{2}_{F}\leq P_{\rm Tx}$ is introduced to prevent excessive power consumption and the constraint $|\boldsymbol{\phi}_{n_{\rm R}}|=1$ is imposed on each diagonal entry of the phase control matrix $\boldsymbol{\Phi}$ to maintain a constant modulus, which implies that the reflection coefficients applied by the RIS elements remain on the unit circle thereby simplifying the hardware implementation of the RIS.

Solving the SE maximization problem is generally challenging, particularly in the presence of CSI imperfections. The difficulty arises from the non-convex nature of the objective function ${\cal R}(\boldsymbol{W},\boldsymbol{\Phi})$ , the non-convex constant modulus constraint $|\boldsymbol{\phi}_{n_{\rm R}}|=1$ , and even the intricate variable coupling between $\boldsymbol{W}$ and $\boldsymbol{\Phi}$ as evident in the covariance matrix of the interference-plus-noise signal $\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}$ in (34). Consequently, the optimization problem in (36) is intractable and poses significant challenges for joint active and passive beamforming design.

By introducing two auxiliary variables $\boldsymbol{Z}\in\mathbb{C}^{N_{\rm Rx}\times N_{\rm s}}$ and $\boldsymbol{\Omega}\in\mathbb{C}^{N_{\rm s}\times N_{\rm s}}\succeq\boldsymbol% {0}$ , the SE maximization problem in (36) is equivalently transformed to an MSE minimization problem as follows [38]:

$\displaystyle\left(\mathrm{P}\right)\min\limits_{\boldsymbol{W},\boldsymbol{Z}% ,\boldsymbol{\Phi},\boldsymbol{\Omega}}$	$\displaystyle\text{tr}(\boldsymbol{\Omega}\boldsymbol{J}(\boldsymbol{Z,W}))-% \log\det(\boldsymbol{\Omega})-N_{\rm s}$	(37)
s.t.	$\displaystyle\\|\boldsymbol{W}\\|^{2}_{F}\leq P_{\rm Tx},$
	$\displaystyle\|\boldsymbol{\phi}_{n_{\rm R}}\|=1,~{}\forall{n_{\rm R}}=1,\cdots,% N_{\rm R},$

where

	$\displaystyle\boldsymbol{J}(\boldsymbol{Z,W})$	$\displaystyle=\mathbb{E}\left\{(\boldsymbol{Z}^{H}\boldsymbol{y}-\boldsymbol{s% })(\boldsymbol{Z}^{H}\boldsymbol{y}-\boldsymbol{s})^{H}\right\}$
		$\displaystyle=\boldsymbol{Z}^{H}(\hat{\boldsymbol{H}}\boldsymbol{W}\boldsymbol% {W}^{H}\hat{\boldsymbol{H}}^{H}+\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}})% \boldsymbol{Z}-\boldsymbol{Z}^{H}\hat{\boldsymbol{H}}\boldsymbol{W}-% \boldsymbol{W}^{H}\hat{\boldsymbol{H}}^{H}\boldsymbol{Z}+\boldsymbol{\rm I}_{N% _{\rm s}},$		(38)

is the MSE matrix function. The proof of the equivalence between the SE maximization problem in (36) and the MSE minimization problem in (37) follows a similar approach as [37, 38]. In contrast to the SE maximization problem in (36), the MSE minimization problem in (37) is convex regarding three variables, i.e., $\boldsymbol{Z}$ , $\boldsymbol{\Omega}$ , and $\boldsymbol{W}$ , when $\boldsymbol{\Phi}$ is given. This observation paves the way for optimizing these four variables alternately via the BCD framework, as elaborated in the following section.

V Proposed Solution

In the following, we introduce an iterative BCD approach to acquire an effective solution to (37). The proposed approach divides (37) into three subproblems that address different variables: 1) Optimize $\boldsymbol{Z}$ and $\boldsymbol{\Omega}$ given $\boldsymbol{W}$ and $\boldsymbol{\Phi}$ ; 2) Optimize $\boldsymbol{W}$ given $\boldsymbol{Z}$ , $\boldsymbol{\Omega}$ and $\boldsymbol{\Phi}$ ; 3) Optimize $\boldsymbol{\Phi}$ given $\boldsymbol{Z}$ , $\boldsymbol{\Omega}$ and $\boldsymbol{W}$ . The following are the optimization steps for each of these three sub-problems.

V-A Update the Auxiliary Variables Matrices $\boldsymbol{Z}$ and $\boldsymbol{\Omega}$

It is worth noting that when $\boldsymbol{W}$ and $\boldsymbol{\Phi}$ are given at each iteration, the optimal auxiliary variables $\boldsymbol{Z}$ and $\boldsymbol{\Omega}$ to minimize the objective function in (37) are respectively given by

	$\displaystyle{\boldsymbol{Z}}$	$\displaystyle=(\hat{\boldsymbol{H}}\boldsymbol{W}\boldsymbol{W}^{H}\hat{% \boldsymbol{H}}^{H}+\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}})^{-1}\hat{% \boldsymbol{H}}\boldsymbol{W}\text{ and}$		(39)
	$\displaystyle{\boldsymbol{\Omega}}$	$\displaystyle=(\boldsymbol{J}({\boldsymbol{Z}},\boldsymbol{W}))^{-1}.$		(40)

V-B Update the Active Beamforming Matrix $\boldsymbol{W}$

For given $\boldsymbol{Z}$ , $\boldsymbol{\Omega}$ , and $\boldsymbol{\Phi}$ , the precoding matrix $\boldsymbol{W}$ can be updated by solving the following problem:

	$\displaystyle\min\limits_{\boldsymbol{W}}$	$\displaystyle~{}\text{tr}(\boldsymbol{\Omega}(\boldsymbol{\rm I}_{N_{\rm s}}-% \boldsymbol{Z}^{H}{\hat{\boldsymbol{H}}}\boldsymbol{W})(\boldsymbol{\rm I}_{N_% {\rm s}}-\boldsymbol{Z}^{H}{\hat{\boldsymbol{H}}}\boldsymbol{W})^{H})+\text{tr% }(\boldsymbol{\Omega}\boldsymbol{Z}^{H}\boldsymbol{\Sigma}_{\hat{\boldsymbol{n% }}}\boldsymbol{Z})$		(41)
	s.t.	$\displaystyle~{}\\|\boldsymbol{W}\\|^{2}_{F}\leq P_{\rm Tx},$		(41)

which is a convex optimization problem. Adopting the Lagrangian multiplier approach [39], the optimal active beamformer at the Tx is given by

	$\displaystyle\boldsymbol{W}$	$\displaystyle=[{\hat{\boldsymbol{H}}}^{H}\boldsymbol{Z\Omega}\boldsymbol{Z}^{H% }{\hat{\boldsymbol{H}}}+\sigma_{\boldsymbol{R}}^{2}\text{tr}(\boldsymbol{% \Omega}\boldsymbol{Z}^{H}\boldsymbol{Z}){\hat{\boldsymbol{G}}}^{H}{\hat{% \boldsymbol{G}}}$
		$\displaystyle+\sigma_{{\boldsymbol{G}}}^{2}\text{tr}(\boldsymbol{\Omega}% \boldsymbol{Z}^{H}\hat{\boldsymbol{R}}{\hat{\boldsymbol{R}}}^{H}\boldsymbol{Z}% )\boldsymbol{\rm I}_{N_{\rm Tx}}+N_{\rm R}\sigma_{{\boldsymbol{G}}}^{2}\sigma_% {{\boldsymbol{R}}}^{2}\text{tr}(\boldsymbol{\Omega}\boldsymbol{Z}^{H}% \boldsymbol{Z})\boldsymbol{\rm I}_{N_{\rm Tx}}+\eta\boldsymbol{\rm I}_{N_{\rm Tx% }}]^{-1}{\hat{\boldsymbol{H}}}^{H}\boldsymbol{Z\Omega},$		(42)

where the optimal Lagrangian multiplier $\eta\geq 0$ can be found by the proposed algorithm in [10]. Note that to maximize the achievable SE, we have $\|\boldsymbol{W}\|^{2}_{F}=P_{\rm Tx}$ at the optimum [35].

V-C Update the Passive Beamforming Matrix $\boldsymbol{\Phi}$

For given $\boldsymbol{Z}$ , $\boldsymbol{\Omega}$ , and $\boldsymbol{W}$ , the passive beamforming design problem is formulated as

\displaystyle\begin{aligned} \min\limits_{\boldsymbol{\Phi}}\ &f(\boldsymbol{% \Phi})=\text{tr}(\boldsymbol{\Omega}(\boldsymbol{\rm I}_{N_{\rm s}}-% \boldsymbol{Z}^{H}{\hat{\boldsymbol{H}}}\boldsymbol{W})(\boldsymbol{\rm I}_{N_% {\rm s}}-\boldsymbol{Z}^{H}{\hat{\boldsymbol{H}}}\boldsymbol{W})^{H})+\ \text{% tr}(\boldsymbol{\Omega}\boldsymbol{Z}^{H}\boldsymbol{\Sigma}_{\hat{\boldsymbol% {n}}}\boldsymbol{Z})\\ \text{s.t.}\ &|\boldsymbol{\phi}_{n_{\rm R}}|=1,~{}\forall{n_{\rm R}}=1,\ldots% ,N_{\rm R}.\\ \end{aligned}

(43)

Substituting $\hat{\boldsymbol{H}}$ into (43) and ignoring items that are not related to $\boldsymbol{\Phi}$ , the objective function in (43) can be simplified as

\displaystyle f(\boldsymbol{\Phi})

\displaystyle=\text{tr}({\boldsymbol{\Phi}}^{H}{\hat{\boldsymbol{R}}}^{H}{% \boldsymbol{Z\Omega}}\boldsymbol{Z}^{H}{\hat{\boldsymbol{R}}}{\boldsymbol{\Phi% }}{\hat{\boldsymbol{G}}}\boldsymbol{W}\boldsymbol{W}^{H}{\hat{\boldsymbol{G}}}% ^{H})-\text{tr}({\hat{\boldsymbol{R}}}^{H}{\boldsymbol{Z\Omega}}\boldsymbol{W}% ^{H}{\hat{\boldsymbol{G}}}^{H}{\boldsymbol{\Phi}}^{H})-\text{tr}({\hat{% \boldsymbol{G}}}\boldsymbol{W}{\boldsymbol{\Omega}}\boldsymbol{Z}^{H}{\hat{% \boldsymbol{R}}}{\boldsymbol{\Phi}}).

(44)

According to [36, Lemma 10.6], we can further simplify the optimization problem in (43) to a standard quadratic programming (QP) problem, i.e.,

\displaystyle\begin{array}[]{ll}\min\limits_{\boldsymbol{\Phi}}&~{}f(% \boldsymbol{\phi})={\boldsymbol{\phi}}^{H}\boldsymbol{A}{\boldsymbol{\phi}}-2% \Re\{\boldsymbol{d}^{T}{\boldsymbol{\phi}}\}\\ \text{s.t.}&|\boldsymbol{\phi}_{n_{\rm R}}|=1,~{}\forall{n_{\rm R}}=1,\ldots,N% _{\rm R},\end{array}

(47)

where

	$\displaystyle\boldsymbol{A}$	$\displaystyle=({\hat{\boldsymbol{R}}}^{H}{\boldsymbol{Z\Omega}}\boldsymbol{Z}^% {H}{\hat{\boldsymbol{R}}})\circ({\hat{\boldsymbol{G}}}\boldsymbol{W}% \boldsymbol{W}^{H}{\hat{\boldsymbol{G}}}^{H})^{T},$		(48)
	$\displaystyle\boldsymbol{d}$	$\displaystyle=(\boldsymbol{D}_{1,1},\dots,\boldsymbol{D}_{N_{\rm R},N_{\rm R}}% )^{T},\text{ and }\boldsymbol{D}={\hat{\boldsymbol{G}}}\boldsymbol{W}{% \boldsymbol{\Omega}}\boldsymbol{Z}^{H}{\hat{\boldsymbol{R}}}.$		(49)

We notice that the constant modulus constraint in equation (47) is generally non-convex and NP-hard, posing a challenge for solving the quadratic optimization problem. In contrast to the traditional alternating direction method of multipliers (ADMM) framework, which employs a fixed penalty factor, our approach is inspired by the method of multipliers and the penalty alternating direction methods discussed in [40], [41] to introduce an alternating direction penalty method (ADPM) algorithm. This algorithm gradually increases the penalty factor during iterations to drive the penalty term toward zero, thereby facilitating the design of the phase of the RIS. More specifically, by introducing an auxiliary variable $\boldsymbol{\phi}_{0}=\left[e^{j\vartheta_{1}},\ldots,e^{j\vartheta_{N_{\rm R}% }}\right]\in\mathbb{C}^{N_{\rm R}\times 1}$ , we equivalently recast the problem in (47) as

$\displaystyle\min\limits_{\boldsymbol{\phi},\boldsymbol{\phi}_{0}}$	$\displaystyle~{}{\boldsymbol{\phi}}^{H}\boldsymbol{A}{\boldsymbol{\phi}}-2\Re% \{\boldsymbol{d}^{T}{\boldsymbol{\phi}}\}$	(50)
s.t.	$\displaystyle\boldsymbol{\phi}=\boldsymbol{\phi}_{0},$
	$\displaystyle\|{\boldsymbol{\phi}_{0}}_{n_{\rm R}}\|=1,~{}\vartheta_{n_{\rm R}}% \in[0,2\pi],~{}\forall{n_{\rm R}}=1,\ldots,N_{\rm R}.$

The augmented Lagrangian function of (50) is given by

\displaystyle{\cal L}={\it f}(\boldsymbol{\phi})+\Re\{{\cal\boldsymbol{u}}^{% \it H}(\boldsymbol{\phi}-\boldsymbol{\phi}_{0})\}+\frac{\rho}{2}||\boldsymbol{% \phi}-\boldsymbol{\phi}_{0}||_{2}^{2},

(51)

where ${\it f}(\boldsymbol{\phi})={\boldsymbol{\phi}}^{H}\boldsymbol{A}{\boldsymbol{% \phi}}-2\Re\{\boldsymbol{d}^{T}{\boldsymbol{\phi}}\}$ , while ${\cal{\boldsymbol{u}}}\in\mathbb{C}^{N_{\rm R}\times 1}$ and $\rho>0$ are the multiplier vector and the penalty factor, respectively.

In the following, we illustrate how to update $\boldsymbol{\phi}_{0}$ and $\boldsymbol{\phi}$ , and then discuss the selection of $\rho^{(0)}$ .

V-C1 Update $\boldsymbol{\phi}_{0}$

When we consider the update of $\boldsymbol{\phi}_{0}$ given $\boldsymbol{\phi}^{(t-1)},{\cal\boldsymbol{u}}^{(t-1)},\rho^{(t-1)}$ in the $t$ -th iteration of ADPM, we omit the constant terms in ${\cal L}$ that are irrelevant to $\boldsymbol{\phi}_{0}$ , and the optimization problem is given by

\begin{array}[]{ll}\min\limits_{\boldsymbol{\phi}_{0}}&\Re\{(-{\cal\boldsymbol% {u}}^{(t-1)}-\rho^{(t-1)}\boldsymbol{\phi}^{(t-1)})^{H}\boldsymbol{\phi}_{0}\}% \\ \text{s.t.}&|\boldsymbol{\phi}_{0}|=1,~{}\vartheta_{n_{\rm R}}\in[0,2\pi],~{}n% _{\rm R}=1,\ldots,N_{\rm R}.\end{array}

(52)

The optimal solution of problem (52) is given as

\vartheta_{n_{\rm R}}=\angle\{\boldsymbol{\gamma}^{(t-1)}_{n_{\rm R}}\},

(53)

where $\boldsymbol{\gamma}^{(t-1)}={\cal\boldsymbol{u}}^{(t-1)}+\rho^{(t-1)}% \boldsymbol{\phi}^{(t-1)}\in\mathbb{C}^{N_{\rm R}\times 1}$ .

V-C2 Update $\boldsymbol{\phi}$

When we consider the update of $\boldsymbol{\phi}$ given $\boldsymbol{\phi}_{0}^{(t)},{\cal\boldsymbol{u}}^{(t-1)},\rho^{(t-1)}$ , the minimization problem is given by

\min\limits_{\boldsymbol{\phi}}f(\boldsymbol{\phi})+\Re\{({\cal\boldsymbol{u}}% ^{(t-1)}-\rho^{(t-1)}\boldsymbol{\phi}_{0}^{(t)})^{H}\boldsymbol{\phi}\}.

(54)

We can obtain the closed-form optimal solution to the problem in (54) as

\boldsymbol{\phi}^{(t)}=(2{\boldsymbol{A}}+{\rho^{(t-1)}}\boldsymbol{\rm I})^{% -1}(\rho^{(t-1)}\boldsymbol{\phi}_{0}^{(t)}-{\cal\boldsymbol{u}}^{(t-1)}+2% \boldsymbol{d}).

(55)

Algorithm 1 ADPM-based Algorithm for Handling (50)

1: Initialize:

\boldsymbol{A},\boldsymbol{d},{\cal\boldsymbol{u}}^{(0)},\rho^{(0)}>0,\delta_{% 1},\delta_{2},\epsilon,\kappa

, where

0<\delta_{1}<1

\delta_{2}>1

are close to 1.

2: while

\Delta e^{(t)}=||\boldsymbol{\phi}^{(t)}-\boldsymbol{\phi}_{0}^{(t)}||>\epsilon

3: update:

\boldsymbol{\phi}_{0}^{(t)}

and

\boldsymbol{\phi}^{(t)}

\angle\{{(\boldsymbol{\phi}^{(t)}_{0})}_{n_{\rm R}}\}=\angle\{\boldsymbol{% \gamma}^{(t-1)}_{n_{\rm R}}\},

\boldsymbol{\phi}^{(t)}=(2{\boldsymbol{A}}+{\rho^{(t-1)}}\boldsymbol{\rm I}_{N% _{\rm R}})^{-1}(\rho^{(t-1)}\boldsymbol{\phi}_{0}^{(t)}-{\cal\boldsymbol{u}}^{% (t-1)}+2\boldsymbol{d}),

4: update:

{\cal\boldsymbol{u}}^{(t)}

and

\rho^{(t)}

\displaystyle\begin{split}\rho^{(t)}=\left\{\begin{array}[]{ll}\rho^{(t-1)},&% \Delta e^{(t)}\leq\delta_{1}\Delta e^{(t-1)},\\ \delta_{2}\rho^{(t-1)},&\text{else}.\end{array}\right.\end{split}

\begin{split}{\cal\boldsymbol{u}}^{(t)}=\left\{\begin{array}[]{ll}{\cal% \boldsymbol{u}}^{(t-1)}+\rho^{(t)}(\boldsymbol{\phi}^{(t)}-\boldsymbol{\phi}_{% 0}^{(t)}),\ u^{(t)}_{\max}\leq\kappa,\\ ({\cal\boldsymbol{u}}^{(t-1)}+\rho^{(t)}(\boldsymbol{\phi}^{(t)}-\boldsymbol{% \phi}_{0}^{(t)}))/u^{(t)}_{\max},\ \text{else},\end{array}\right.\end{split}

where

u^{(t)}_{\max}

is the value with the largest modulus in the multiplier vector

{\cal\boldsymbol{u}}^{(t)}

5: end while

6: Output:

\boldsymbol{\phi}^{\ast}

The ADPM-based algorithm for designing the passive beamforming at the RIS is outlined in Algorithm 1, detailing the update rules for $\rho$ and $\cal\boldsymbol{u}$ . If we substitute the update rules of $\rho^{(t)}$ and ${\cal\boldsymbol{u}}^{(t)}$ in Algorithm 1 with $\rho^{(t)}=\rho^{(t-1)}$ and ${\cal\boldsymbol{u}}^{(t)}={\cal\boldsymbol{u}}^{(t-1)}+\rho^{(t)}(\boldsymbol% {\phi}^{(t)}-\boldsymbol{\phi}_{0}^{(t)})$ , Algorithm 1 degenerates to the classical ADMM framework. According to the theoretical analysis in [42], adapting the penalty factor $\rho^{(t)}$ is crucial with the following principle: increasing it when the primal residual $\Delta e^{(t)}$ fails to decrease with iterations, helps drive $\Delta e^{(t)}$ towards zero to locate a feasible point. Otherwise, $\rho^{(t)}$ remains unchanged. This strategy aims to enhance the likelihood of the ADPM algorithm discovering a feasible point compared to the ADMM [42], [43].

Moreover, achieving faster convergence can be facilitated by selecting an appropriate initial value for the penalty factor $\rho^{(0)}$ . In this regard, we employ a method proposed in [40] to determine the initialized penalty factor for the ADPM. Specifically, when problem (50) does not involve any non-convex constraints, an effective initialized penalty factor for ADPM is obtained as [40]:

\rho^{(0)}=\sqrt{\lambda_{\min}(\boldsymbol{A})\lambda_{\max}(\boldsymbol{A})},

(56)

where $\lambda_{\min}(\boldsymbol{A})$ and $\lambda_{\max}(\boldsymbol{A})$ represent the minimum and maximum eigenvalues of $\boldsymbol{A}$ , respectively. It should be noted that if the smallest eigenvalue of $\boldsymbol{A}$ is zero, $\lambda_{\min}(\boldsymbol{A})$ is assigned to its smallest nonzero eigenvalue. For further details, please refer to Theorem 4 in [40].

Remark 1

The proposed ADPM algorithm is guaranteed to converge for arbitrary initialization $\boldsymbol{\phi}^{(0)}$ and ${\cal\boldsymbol{u}}^{(0)}$ provided that $\rho^{(0)}>0,0<\delta_{1}<1,\delta_{2}>1$ , and $\kappa$ is a sufficiently large positive number [42]. In particular, for the continuous phase case, if the penalty parameter $\rho^{(t)}$ is bounded, the limiting point $\boldsymbol{\phi}^{\ast}$ of the sequence $\{\boldsymbol{\phi}^{(t)}\}_{t=0}^{\infty}$ obtained via the ADPM algorithm is a Karush-Kuhn-Tucker (KKT) point of (50) [42], [43].

Algorithm 2 Overall Block Coordinate Descent Algorithm for Addressing (37)

1: Initialize:

\boldsymbol{\Phi}^{0}

\boldsymbol{W}^{0}

\boldsymbol{Z}^{0}

and

\boldsymbol{\Omega}^{0}

, tolerance accuracy

\varepsilon

, maximum number of iterations

r_{\max}

, the objective function value of problem (37)

P(\boldsymbol{\Phi}^{0}

\boldsymbol{W}^{0})

2: repeat

3: Given

\boldsymbol{W}^{r},\boldsymbol{\Phi}^{r}

and

\boldsymbol{\Omega}^{r}

, compute the auxiliary variable

\boldsymbol{Z}^{r}

by (39).

4: Given

\boldsymbol{W}^{r},\boldsymbol{\Phi}^{r}

and

\boldsymbol{Z}^{r}

, compute the auxiliary variable

\boldsymbol{\Omega}^{r}

by (40).

5: Given

\boldsymbol{Z}^{r}

\boldsymbol{\Omega}^{r}

and

\boldsymbol{\Phi}^{r}

, determine

\eta

and compute

\boldsymbol{W}^{r+1}

by (V-B).

6: Given

\boldsymbol{Z}^{r}

\boldsymbol{\Omega}^{r}

and

\boldsymbol{W}^{r+1}

, compute

\boldsymbol{A}

and

\boldsymbol{d}

by (48) and (49) .

7: Update

\boldsymbol{\phi}^{r+1}

by Algorithm 1, and reconstruct

\boldsymbol{\Phi}^{r+1}

8: Set

r=r+1

9: until

r>r_{\max}

|P(\boldsymbol{W}^{r+1},\boldsymbol{\Phi}^{r+1})-P(\boldsymbol{W}^{r},% \boldsymbol{\Phi}^{r})|<\varepsilon

10: Output:

\boldsymbol{\Phi}^{\ast},\boldsymbol{W}^{\ast}

V-D Overall Algorithm and Complexity Analysis

Now, we provide the detailed description of the overall BCD algorithm for solving (37) in Algorithm 2. Step 1 is used to initialize these variables and to set thresholds. Steps 2 and 3 are used to update the auxiliary variables $\boldsymbol{Z}$ and $\boldsymbol{\Omega}$ , step 4 updates the active beamforming matrix at the Tx. Then, steps 5 and 6 update the passive beamforming matrix of the RIS by ADPM proposed in Algorithm 1. Finally, the stop** condition is $|P(\boldsymbol{W}^{r+1},\boldsymbol{\Phi}^{r+1})-P(\boldsymbol{W}^{r},% \boldsymbol{\Phi}^{r})|<\varepsilon$ . The convergence analysis of the proposed Algorithm 2 can be found in [10], [35].

Note that the proposed problem formulation and algorithmic solution are applicable for all the three channel models in (2), (3), and (9). However, the impact of channel models on the system performance are characterized by the covariance matrices $\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}$ in the problem formulation (35). A further extension of this work is to leverage the matrix structures of $\boldsymbol{\Sigma}_{\hat{\boldsymbol{n}}}$ and $\hat{\boldsymbol{H}}$ in different channel models to design specific algorithms aimed at exploring the impact of channel models on the system performance for RIS-aided MIMO communications.

We note that the original problem is divided into three sub-problems and addressed iteratively, which requires $I_{\rm O}$ iterations. For the update of the two auxiliary variables, $\{\boldsymbol{Z}\}$ , $\{\boldsymbol{\Omega}\}$ , it requires the computation of order $\mathcal{O}(N^{3}_{\rm Rx})$ and $\mathcal{O}(N^{3}_{\rm s})$ , respectively. For the active beamforming problem at the Tx, solving $\{\boldsymbol{W}\}$ requires the computation of order $\mathcal{O}(I_{\eta}N^{3}_{\rm Tx})$ , where $I_{\eta}$ is the number of iterations for searching the dual variable $\eta$ . Since the number of transmitter antennas $N_{\rm Tx}$ is usually larger than $N_{\rm s}$ and $N_{\rm Rx}$ , the complexity of the second sub-problem is $\mathcal{O}(I_{W}I_{\eta}N^{3}_{\rm Tx})$ , where $I_{W}$ is the number of iterations required to converge. For the passive beamforming problem at the RIS, the complexity of the third sub-problem is $\mathcal{O}(N^{3}_{\rm R}+I_{A}N^{2}_{\rm R})$ , where $I_{A}$ is the number of iterations required to converge. Based on the above analysis, the computational complexity of Algorithm 2 is $\mathcal{O}(I_{\rm O}(N^{3}_{\rm Rx}+N^{3}_{\rm s}+I_{W}I_{\eta}N^{3}_{\rm Tx}% +N^{3}_{\rm R}+I_{A}N^{2}_{\rm R}))$ .

VI Numerical Results

In this section, we present simulation results to assess the performance of the three different channel models in the presence of channel estimation error. A 3D Cartesian coordinate system is considered, where the BS, the RIS, and the Rx are located at $(10,-20,5)$ m, $(0,0,10)$ m, and $(100,50,5)$ m, respectively. The Tx is equipped with $N_{\rm Tx}=64$ transmit antennas serving one user equipment (UE) equipped with $N_{\rm Rx}=8$ receive antennas with the assistance of an RIS. The number of data streams is $N_{\rm s}=64$ . The number of reflection elements is $N_{\rm R}=256$ . The path loss model utilized is a model tailored for RIS-aided near-field communication, as detailed in [25]. The carrier frequency is $30$ GHz and the number of Monte Carlo experiments is $50$ . The ground-truth channel from the BS to the RIS follows a near-field channel model while the system design adopts three different channel models: the conventional near-field, the proposed piece-wise near-field and the far-field channel models, resulting in different covariance matrices of the interference-plus-noise signal. Meanwhile, the channel from the RIS to the Rx follows a far-field channel model. The transmit SNR is defined by $\text{SNR}=10\log_{10}(P_{\rm Tx}/\sigma^{2})$ , where $P_{\rm Tx}$ and $\sigma^{2}$ is the power of the transmit signal and noise, respectively. Unless otherwise specified, we set ${\sigma^{2}}=-80$ dBm.

The initialization parameters of the ADPM-based algorithm for solving (50) and Algorithm 2 are provided as follows: the multiplier vector ${\boldsymbol{u}}^{(0)}=\boldsymbol{0}$ , $\epsilon=10^{-6}$ , $\delta_{1}=0.95$ , $\delta_{2}=1.05$ , $\kappa=10^{3}$ , $\varepsilon=10^{-3}$ , and $r_{\max}=100$ . For more detailed parameter settings, please refer to the reference [42]. We assume an identical normalized CEE for different channel models, i.e., $\sigma^{2}_{{\boldsymbol{G}}_{i}}=\tau_{i}\cdot\mathbb{E}\{\|\boldsymbol{G}_{% \rm N}-\Delta\boldsymbol{M}_{i}\|^{2}_{F}\}$ , where $\tau_{i}$ is the normalized CEE for the Tx-RIS link and it is given by

\tau_{i}=\frac{\mathbb{E}[||\boldsymbol{G}_{\rm N}-\Delta{\boldsymbol{M}_{i}}-% \hat{\boldsymbol{G}_{i}}||^{2}_{F}]}{\mathbb{E}[||\boldsymbol{G}_{\rm N}-% \Delta{\boldsymbol{M}_{i}}||^{2}_{F}]},i=1,2,3.

(57)

Similarly, we can define the normalized CEE $\tau_{\boldsymbol{R}}$ for the RIS-Rx link and it is given by

\tau_{\boldsymbol{R}}=\frac{\mathbb{E}[||\Delta{\boldsymbol{R}}||^{2}_{F}]}{% \mathbb{E}[||\boldsymbol{R}||^{2}_{F}]}.

(58)

VI-A Convergence Validation

We investigate the average achievable SEs for different channel models without considering any estimation error in Fig. 2, i.e., $\tau_{i}=0,i=1,2,3$ , which means that only model mismatch errors are considered. Here, we set the distance along the y-axis from the Tx to the RIS as $d_{\rm BR}=20$ m and ${\rm SNR}=10$ dB. We compare the performance of our proposed algorithm (i.e., Algorithm 2), with that of the algorithm in [8] under the near-field channel model, verifying the effectiveness and convergence of Algorithm 2. When $K=1$ , i.e., the RIS is not partitioned into subsurfaces, the piece-wise near-field channel model degenerates to the conventional far-field channel model. It can be seen that the performance of the piece-wise near-field channel model with multiple subsurface structures is indeed better than that of the traditional far-field model, owing to the reduced model mismatch error as well as the increased DoF. Furthermore, as the number of subsurface increases, adopting the piece-wise near-field channel model gradually approaches the performance of the conventional near-field channel model, where the latter model does not have any model mismatch error, which is consistent with the channel model analysis in Section II-B. It is worth noting that in the extreme case, where the RIS is divided into 256 subsurfaces, the piece-wise near-field channel model evolves into the near-field channel model. Therefore, dividing the RIS into 64 pieces of subsurfaces, i.e., $K=8$ , can achieve the dominant performance gain of the conventional near-field model, while this piece-wise near-field channel model significantly reduces the number of parameters involved.

VI-B SE vs SNR for Different Channel Models in the Presence of CEE

Figure 3 presents the average achievable SEs for different channel models at different SNR levels in the presence of CEE with $\tau=0.2$ . The slope of the SE curve corresponding to each model represents the multiplexing gain with a steeper model indicating more available DoF. The results depict that the proposed piece-wise near-field channel provides more DoF compared to the traditional far-field channel, potentially leading to an increased achievable SE. Although the DoF provided by the near-field channel model are slightly higher than that of the piece-wise near-field channel model, adopting the piece-wise channel model can achieve a higher achievable SE than that of the conventional near-field model, due to the high sensitivity of the latter model to CEEs. This is because beamsteering is robust against the beam misalignment due to the angle and distance errors, and it exploits more DoF brought by the conventional near-field channel model. Besides, we note that the achievable SEs for all the three channel models are saturated in the high SNR regime due to the presence of CEEs and potential model mismatches.

VI-C SE vs Normalized CEE for Different Channel Models

In Fig. 4, the average achievable SEs for different channel models are presented under different normalized CEEs for $K=8$ . When the normalized CEE $\tau$ is 0, it corresponds to the scenario without any estimation error in Fig. 2. The results show that as the normalized CEE $\tau$ increases, the performance of the near-field channel model deteriorates significantly, which indicates the high sensitivity of beamfocusing with respect to CEE in the near-field region. We observe that when the normalized CEE $\tau>0.13$ , the proposed piece-wise near-field model yields a better performance than the near-field model due to the enhanced robustness inherited from the far-field model. When the normalized CEE is large, the performances of the piece-wise near-field and near-field models are nearly identical as the beamfocusing in both cases is inaccurate. When the normalized CEE $\tau>0.75$ , the performance of the far-field channel model is better than that of both the near-field and piece-wise near-field models due to its robustness to CEE. This implies that for different levels of CEE, different channel models can be adopted to improve the system performance. By combining the results from Fig. 3 and Fig. 4, it becomes evident that the piece-wise channel model not only surpasses the DoF associated with the far-field model but also enhances the system robustness against the CEEs when compared to the near-field model.

VI-D SE vs Number of Transmit Antennas for Different Channel Models

Figure 5 demonstrates the impact of the number of transmit antennas at the Tx on the achievable SE. We present the achievable SEs for three models under two scenarios: perfect CSI ( $\tau=0$ ) and imperfect CSI ( $\tau=0.2$ ). When $\tau=0$ , a linear scaling in the SE is observed with respect to the number of transmit antennas. In this scenario, the deterministic model mismatch $\Delta\boldsymbol{H}_{\rm M}$ in (34) is considered as interference, and increasing $N_{\rm Tx}$ provides more DoF for designing the active beamforming matrix $\boldsymbol{W}$ to suppress the interference caused by the model mismatch, i.e., the first term in (34), thus approaching interference-free transmission. However, when $\tau=0.2$ , the DoF are not sufficient to design the active beamforming matrix to suppress the interference caused by the uncertain CEE, i.e., the second and third term in (34). Moreover, the piece-wise near-field model with imperfect CSI demonstrates a higher SE than the far-field model with perfect CSI, highlighting the DoF advantages introduced by the piece-wise near-field model.

VI-E SE vs Number of Reflecting Elements for Different Channel Models

In Fig. 6, we investigate the impact of the number of reflecting elements on the achievable SEs. We present SEs for the three models with varying numbers of reflecting elements under perfect CSI ( $\tau=0$ ) and imperfect CSI ( $\tau=0.2$ ) scenarios. We observe that as the number of reflecting elements increases, the SE of the three models exhibits linear growth when $N_{\rm R}$ is small. However, the slope of the SE curves tends to flatten for larger numbers of reflecting elements, e.g., $N_{\rm R}\geq 448$ . This is because the model mismatch increases with the number of reflecting elements as the near-field propagation becomes dominant. Comparing the findings in Fig. 5 and Fig. 6, we observe that while increasing both $N_{\rm Tx}$ and $N_{\rm R}$ enhances the SE of the system, their roles in RIS-aided MIMO communication systems are distinct. On one hand, increasing $N_{\rm Tx}$ at the Tx enhances the spatial DoF, thereby facilitating interference mitigation and enabling higher beamforming gain. On the other hand, as the number of reflecting elements $N_{\rm R}$ increases, the near-field effects become more pronounced, leading to an increase in the model mismatch between the piece-wise near-field channel and the far-field channel models.

VII Conclusions

This paper proposed to adopt a piece-wise near-field channel model for a RIS-aided MIMO system in the presence of CEEs. We considered three channel models (i.e., near-field, piece-wise near-field and far-field) and analyzed the impact of CEEs and model mismatches on the interference distribution. By treating the interference caused by CEEs and model mismatches as noise, we formulated the joint active and passive beamforming design as an optimization problem to maximize the achievable SE taking into account the transmit power constraint for active beamforming matrix and the constant modulus constraint for passive beamforming matrix. The joint beamforming optimization problem was equivalently transformed into an MSE minimization problem, which was then addressed by the proposed algorithm exploit the BCD and ADPM to handle the constant modulus constraint of RIS elements. We revealed that the adopted piece-wise near-field channel model not only improves the DoF gain but also demonstrates enhanced robustness against CEEs, resulting in higher achievable rates compared to the other channel models. A promising extension of this work is considering more reasonable parameters (distance and angle) for the error modeling schemes instead of overall channel estimation error modeling, which could lead to tailored channel estimation schemes and robust resource allocation strategies. Another future research direction is to utilize data-driven deep learning networks to select the number of subsurfaces in the piece-wise near-field channel model to achieve the best trade-off between the modeling accuracy and the robustness against CEEs.

References

[1] W. Chen, Z. Yang, Z. Wei, D. W. K. Ng, and M. Matthaiou, “Beamforming design for RIS-Aided MIMO communication: A piece-wise near-field model,” submitted to IEEE ICCC, May 2024.
[2] M. Matthaiou, O. Yurduseven, H. Q. Ngo, D. Morales-Jimenez, S. L. Cotton, and V. F. Fusco, “The road to 6G: Ten physical layer challenges for communications engineers,” IEEE Commun. Mag., vol. 59, no. 1, pp. 64–69, Jan. 2021.
[3] T. J. Cui, M. Q. Qi, X. Wan, J. Zhao, and Q. Cheng, “Coding metamaterials, digital metamaterials and programmable metamaterials,” Light Sci. Appl., vol. 3, no. 10, pp. e218–e218, Oct. 2014.
[4] J. Zhang, E. Björnson, M. Matthaiou, D. W. K. Ng, H. Yang, and D. J. Love, “Prospective multiple antenna technologies for beyond 5G,” IEEE J. Sel. Areas in Commun., vol. 38, no. 8, pp. 1637–1660, Aug. 2020.
[5] E. Basar, M. Di Renzo, J. De Rosny, M. Debbah, M. S. Alouini, and R. Zhang, “Wireless communications through reconfigurable intelligent surfaces,” IEEE Access, vol. 7, pp. 116 753–116 773, Aug. 2019.
[6] C. Pan et al., “Reconfigurable intelligent surfaces for 6G systems: Principles, applications, and research directions,” IEEE Commun. Mag., vol. 59, no. 6, pp. 14–20, Jun. 2021.
[7] X. Yu, D. Xu, and R. Schober, “MISO wireless communication systems via intelligent reflecting surfaces,” in Proc. IEEE ICCC, Aug. 2019, pp. 735–740.
[8] S. Zhang and R. Zhang, “Capacity characterization for intelligent reflecting surface aided MIMO communication,” IEEE J. Sel. Areas Commun., vol. 38, no. 8, pp. 1823–1838, Aug. 2020.
[9] H. Alwazani, A. Kammoun, A. Chaaban, M. Debbah, and M. S. Alouini, “Intelligent reflecting surface-assisted multi-user MISO communication: Channel estimation and beamforming design,” IEEE Open J. Commun. Soc., vol. 1, pp. 661–680, May 2020.
[10] C. Pan, H. Ren, K. Wang, W. Xu, M. Elkashlan, A. Nallanathan, and L. Hanzo, “Multicell MIMO communications relying on intelligent reflecting surfaces,” IEEE Trans. Wireless Commun., vol. 19, no. 8, pp. 5218–5233, Aug. 2020.
[11] S. Hu, Z. Wei, Y. Cai, C. Liu, D. W. K. Ng, and J. Yuan, “Robust and secure sum-rate maximization for multiuser MISO downlink systems with self-sustainable IRS,” IEEE Trans. Commun., vol. 69, no. 10, pp. 7032–7049, Jul. 2021.
[12] Z. Wei, Y. Cai, Z. Sun, D. W. K. Ng, J. Yuan, M. Zhou, and L. Sun, “Sum-rate maximization for IRS-assisted UAV OFDMA communication systems,” IEEE Trans. Wireless Commun., vol. 20, no. 4, pp. 2530–2550, Dec. 2020.
[13] C. Liu, X. Liu, Z. Wei, S. Hu, D. W. K. Ng, and J. Yuan, “Deep learning-empowered predictive beamforming for IRS-assisted multi-user communications,” in Proc. IEEE GLOBECOM, Dec. 2021, pp. 1–6.
[14] M. Cui, Z. Wu, Y. Lu, X. Wei, and L. Dai, “Near-field MIMO communications for 6G: Fundamentals, challenges, potentials, and future directions,” IEEE Commun. Mag., vol. 61, no. 1, pp. 40–46, Jan. 2022.
[15] X. Wei, L. Dai, Y. Zhao, G. Yu, and X. Duan, “Codebook design and beam training for extremely large-scale RIS: Far-field or near-field?” China Commun., vol. 19, no. 6, pp. 193–204, Jun. 2022.
[16] T. Wang, C. You, F. Zhou, and C. Yin, “Base station beamforming design in near-field XL-IRS beam training,” IEEE Commun. Lett., pp. 1–1, 2024.
[17] K. Dovelos, S. D. Assimonis, H. Q. Ngo, B. Bellalta, and M. Matthaiou, “Intelligent reflecting surfaces at terahertz bands: Channel modeling and analysis,” in Proc. IEEE ICC, Jul. 2021, pp. 1–6.
[18] Q. Tao, J. Wang, and C. Zhong, “Performance analysis of intelligent reflecting surface aided communication systems,” IEEE Commun. Lett., vol. 24, no. 11, pp. 2464–2468, Nov. 2020.
[19] J. Li and Y. Hong, “Intelligent reflecting surface aided communication systems: Performance analysis,” in Proc. IEEE PIMRC, Oct. 2021, pp. 519–524.
[20] Y. Liu, Z. Wang, J. Xu, C. Ouyang, X. Mu, and R. Schober, “Near-field communications: A tutorial review,” IEEE Open J. Commun. Soc., vol. 4, pp. 1999–2049, Aug. 2023.
[21] X. Wei and L. Dai, “Channel estimation for extremely large-scale massive MIMO: Far-field, near-field, or hybrid-field?” IEEE Commun. Lett., vol. 26, no. 1, pp. 177–181, Jan. 2021.
[22] Z. Zhou, X. Gao, J. Fang, and Z. Chen, “Spherical wave channel and analysis for large linear array in LoS conditions,” in Proc. IEEE GLOBECOM, Dec. 2015, pp. 1–6.
[23] A. K. Gupta and D. K. Nagar, Matrix Variate Distributions. London U.K.: Chapman and Hall/CRC, 2018.
[24] M. Z. Siddiqi and T. Mir, “Reconfigurable intelligent surface-aided wireless communications: An overview,” Intell. and Converged Netw., vol. 3, no. 1, pp. 33–63, Mar. 2022.
[25] E. Björnson and L. Sanguinetti, “Power scaling laws and near-field behaviors of massive MIMO and intelligent reflecting surfaces,” IEEE Open J. Commun. Soc., vol. 1, pp. 1306–1324, Jan. 2020.
[26] C. Pan et al., “An overview of signal processing techniques for RIS/IRS-aided wireless systems,” IEEE J. Sel. Top. Signal Process., vol. 5, no. 16, pp. 883–917, Aug. 2022.
[27] Z. Wang, L. Liu, and S. Cui, “Channel estimation for intelligent reflecting surface assisted multiuser communications: Framework, algorithms, and analysis,” IEEE Trans. Wireless Commun., vol. 19, no. 10, pp. 6607–6620, Oct. 2020.
[28] O. El Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially sparse precoding in millimeter wave MIMO systems,” IEEE Trans. Wireless Commun., vol. 13, no. 3, pp. 1499–1513, Mar. 2014.
[29] Y. Zhao et al., “6G near-field technologies white paper.” FuTURE Forum, Nan**g, China, Apr. 2024. doi: 10.12142/FuTURE.202404002.
[30] Y. Lu and L. Dai, “Near-field channel estimation in mixed LoS/NLoS environments for extremely large-scale MIMO systems,” IEEE Trans Commun., vol. 71, no. 6, pp. 3694–3707, Jun. 2023.
[31] H. Xie, F. Gao, and S. **, “An overview of low-rank channel estimation for massive MIMO systems,” IEEE Access, vol. 4, pp. 7313–7321, Nov. 2016.
[32] B. Zheng, C. You, W. Mei, and R. Zhang, “A survey on channel estimation and practical passive beamforming design for intelligent reflecting surface aided wireless communications,” IEEE Commun. Surv. Tutor., vol. 24, no. 2, pp. 1035–1071, Feb. 2022.
[33] C. Xing, S. Ma, and Y.-C. Wu, “Robust joint design of linear relay precoder and destination equalizer for dual-hop amplify-and-forward MIMO relay systems,” IEEE Trans. Signal Process., vol. 58, no. 4, pp. 2273–2283, Apr. 2009.
[34] K. Ardah, S. Gherekhloo, A. L. de Almeida, and M. Haardt, “TRICE: A channel estimation framework for RIS-aided millimeter-wave MIMO systems,” IEEE Signal Process. Letters, vol. 28, pp. 513–517, Feb. 2021.
[35] P. Zeng, D. Qiao, H. Qian, and Q. Wu, “Joint beamforming design for IRS aided multiuser MIMO with imperfect CSI,” IEEE Trans. Veh. Technol., vol. 71, no. 10, pp. 10 729–10 743, Oct. 2022.
[36] X. Zhang, Matrix Analysis and Applications. Cambridge University Press, 2017.
[37] Q. Shi, M. Razaviyayn, Z. Luo, and C. He, “An iteratively weighted MMSE approach to distributed sum-utility maximization for a MIMO interfering broadcast channel,” IEEE Trans. Signal Process., vol. 59, no. 9, pp. 4331–4340, Sep. 2011.
[38] X. Zhao, S. Lu, Q. Shi, and Z.-Q. Luo, “Rethinking WMMSE: Can its complexity scale linearly with the number of BS antennas?” IEEE Trans. Signal Process., vol. 71, pp. 433–446, Feb. 2023.
[39] S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge University Press, 2004.
[40] E. Ghadimi, A. Teixeira, I. Shames, and M. Johansson, “Optimal parameter selection for the alternating direction method of multipliers (ADMM): Quadratic problems,” IEEE Trans. Autom. Control, vol. 60, no. 3, pp. 644–658, Mar. 2014.
[41] S. Magnússon, P. C. Weeraddana, M. G. Rabbat, and C. Fischione, “On the convergence of alternating direction lagrangian methods for nonconvex structured optimization problems,” IEEE Trans. Control Netw. Syst., vol. 3, no. 3, pp. 296–309, Sep. 2015.
[42] X. Yu, G. Cui, J. Yang, J. Li, and L. Kong, “Quadratic optimization for unimodular sequence design via an ADPM framework,” IEEE Trans. Signal Process., vol. 68, pp. 3619–3634, May. 2020.
[43] X. Yu, G. Cui, Z. Zhang, L. Zhou, J. Yang, and L. Kong, “Discrete-phase waveform design to quadratic optimization via an ADPM framework with convergence guarantee,” in Proc. IEEE SAM, Jun. 2020, pp. 1–5.

$\displaystyle\\|\Delta\boldsymbol{R}\\|_{F}$	$\displaystyle\ll\\|\hat{\boldsymbol{R}}\\|_{F},$	(26)
$\displaystyle\\|\Delta\boldsymbol{G}\\|_{F}$	$\displaystyle\ll\\|\hat{\boldsymbol{G}}\\|_{F},\text{and}$	(27)
$\displaystyle\\|\Delta\boldsymbol{M}\\|_{F}$	$\displaystyle\ll\\|\hat{\boldsymbol{G}}\\|_{F},$	(28)

RIS-aided MIMO Beamforming: Piece-Wise Near-field Channel Model

Abstract

Index Terms:

I Introduction

II System Model

II-A System Model

II-B Channel Models

III Analysis of Interference Distribution for Different Channel Models

III-A Channel Estimation Error Models

III-B Covariance Matrix of the Interference-plus-Noise Signal

Lemma 1 ([36])

IV Problem Formulation

V Proposed Solution

V-A Update the Auxiliary Variables Matrices 𝐙𝐙\boldsymbol{Z}bold_italic_Z and 𝛀𝛀\boldsymbol{\Omega}bold_Ω

V-B Update the Active Beamforming Matrix 𝐖𝐖\boldsymbol{W}bold_italic_W

V-C Update the Passive Beamforming Matrix 𝚽𝚽\boldsymbol{\Phi}bold_Φ

V-C1 Update ϕ0subscriptbold-italic-ϕ0\boldsymbol{\phi}_{0}bold_italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT

V-C2 Update ϕbold-italic-ϕ\boldsymbol{\phi}bold_italic_ϕ

Remark 1

V-D Overall Algorithm and Complexity Analysis

VI Numerical Results

VI-A Convergence Validation

VI-B SE vs SNR for Different Channel Models in the Presence of CEE

VI-C SE vs Normalized CEE for Different Channel Models

VI-D SE vs Number of Transmit Antennas for Different Channel Models

VI-E SE vs Number of Reflecting Elements for Different Channel Models

VII Conclusions

References

V-A Update the Auxiliary Variables Matrices $\boldsymbol{Z}$ and $\boldsymbol{\Omega}$

V-B Update the Active Beamforming Matrix $\boldsymbol{W}$

V-C Update the Passive Beamforming Matrix $\boldsymbol{\Phi}$

V-C1 Update $\boldsymbol{\phi}_{0}$

V-C2 Update $\boldsymbol{\phi}$