\Newassociation

solutionSolutionsolutionfile \Opensolutionfilesolutionfile[LargeSIMRateDouble]

Achievable Rate Optimization for Large Stacked Intelligent Metasurfaces Based on Statistical CSI

Anastasios Papazafeiropoulos, Pandelis Kourtessis, Symeon Chatzinotas, Dimitra I. Kaklamani, Iakovos S. Venieris A. Papazafeiropoulos is with the Communications and Intelligent Systems Research Group, University of Hertfordshire, Hatfield AL10 9AB, U. K., and with SnT at the University of Luxembourg, Luxembourg. P. Kourtessis is with the Communications and Intelligent Systems Research Group, University of Hertfordshire, Hatfield AL10 9AB, U. K. S. Chatzinotas is with the SnT at the University of Luxembourg, Luxembourg. Dimitra I. Kaklamani is with the Microwave and Fiber Optics Laboratory, and Iakovos S. Venieris is with the Intelligent Communications and Broadband Networks Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens, Zografou, 15780 Athens, Greece. Corresponding author’s email: [email protected].
Abstract

Stacked intelligent metasurface (SIM) is an emerging design that consists of multiple layers of metasurfaces. A SIM enables holographic multiple-input multiple-output (HMIMO) precoding in the wave domain, which results in the reduction of energy consumption and hardware cost. On the ground of multiuser beamforming, this letter focuses on the downlink achievable rate and its maximization. Contrary to previous works on multiuser SIM, we consider statistical channel state information (CSI) as opposed to instantaneous CSI to overcome challenges such as large overhead. Also, we examine the performance of large surfaces. We apply an alternating optimization (AO) algorithm regarding the phases of the SIM and the allocated transmit power. Simulations illustrate the performance of the considered large SIM-assisted design as well as the comparison between different CSI considerations.

Index Terms:
Reconfigurable intelligent surface (RIS), stacked intelligent metasurfaces (SIM), 6G networks.

I Introduction

The technology of reconfigurable intelligent surfaces (RISs) has recently emerged to increase coverage and enhance spectral and energy efficiencies in various communication environments [1, 2]. In general terms, an RIS includes a surface that includes a large number of elements, which are nearly passive and have low cost. The purpose of these elements is to adjust the phases of the incident electromagnetic (EM) waves by using a smart controller, and hence, shape the propagation environment dynamically [3, 4, 5].

However, most existing works on RIS assume single-layer metasurface structure [3, 4, 6], which imposes a constraint on the adjustment of the beam patterns. Also, the single-layer structures of RISs do not have the capability of inter-user interference suppression as shown in [6]. These observations led the authors in [7, 8] to propose a stacked intelligent metasurface (SIM), which consists of an array of programmable metasurfaces similar to artificial neural networks (ANNs). Among the processing capabilities of a SIM, we that the forward propagation takes place at the speed of light.

On this ground, in [7], authors proposed a SIM-based design for the transceiver of point-to-point multiple-input multiple-output (MIMO) communication systems, where the combining and the precoding take place as the EM waves propagate along the SIM. In [8], we observe the integration of a SIM to the transmitter, i.e., the base station (BS) towards enabling beamforming in the EM domain based on instantaneous channel state information (CSI). Contrary to [7] and [8], in [9] and [10], we proposed more general hybrid digital wave designs, where all element parameters are optimised simultaneously through more efficient algorithms.

In this work, we focus on a SIM-enabled multiuser architecture operating solely in the wave domain. Note that [10] assumes a hybrid digital wave design, and [11] focuses on satellite communication systems. Also, contrary to previous works [7, 8, 9], we consider a SIM that consists of large metasurfaces, since we apply the use-and-then-forget (UatF) bound [12]. Most importantly, we obtain the downlink rate and perform its optimization regarding the phase shifts and transmit power in terms of statistical CSI. Notably, this approach enables the optimization at every several coherence intervals rather than optimizing at each interval. Hence, we achieve lower overhead, which is one of the main challenges in SIM-assisted systems.

Notation: Matrices and vectors are represented by boldface upper and lower case symbols, respectively. The notations ()𝖳superscript𝖳(\cdot)^{\scriptscriptstyle\mathsf{T}}( ⋅ ) start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT, ()𝖧superscript𝖧(\cdot)^{\scriptscriptstyle\mathsf{H}}( ⋅ ) start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT, and tr()trace\tr\!\left({\cdot}\right)roman_tr ( ⋅ ) denote the transpose, Hermitian transpose, and trace operators, respectively. Also, the symbol 𝔼[]𝔼delimited-[]\mathbb{E}\left[\cdot\right]blackboard_E [ ⋅ ] denotes the expectation operator. The floor function x𝑥\lfloor x\rfloor⌊ italic_x ⌋ gives as output the greatest integer less than or equal to x𝑥xitalic_x. The notation diag()(𝐀)diag𝐀\text{diag}\left(\right)\left({\mathbf{A}}\right)diag ( ) ( bold_A ) represents a vector with elements equal to the diagonal elements of 𝐀𝐀{\mathbf{A}}bold_A. The notation 𝐛𝒞𝒩(𝟎,𝚺)similar-to𝐛𝒞𝒩0𝚺{\mathbf{b}}\sim{\cal C}{\cal N}{({\mathbf{0}},\mathbf{\Sigma})}bold_b ∼ caligraphic_C caligraphic_N ( bold_0 , bold_Σ ) represents a circularly symmetric complex Gaussian vector with zero mean and a covariance matrix 𝚺𝚺\mathbf{\Sigma}bold_Σ.

II System Model

We consider a SIM-aided MIMO communication system as depicted in Fig. 1. In particular, a BS, which includes Ntsubscript𝑁𝑡N_{t}italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT antennas, communicates with K𝐾Kitalic_K single-antenna user equipments (UEs) through a SIM performing wave-based processing. The SIM is implemented by L𝐿Litalic_L metasurfaces, where each one has a large number of N𝑁Nitalic_N meta-atoms. Let 𝒦={1,,K}𝒦1𝐾\mathcal{K}=\{1,\ldots,K\}caligraphic_K = { 1 , … , italic_K }, ={1,,L}1𝐿\mathcal{L}=\{1,\ldots,L\}caligraphic_L = { 1 , … , italic_L }, and 𝒩={1,,N}𝒩1𝑁\mathcal{N}=\{1,\ldots,N\}caligraphic_N = { 1 , … , italic_N } denote the sets of UEs, metasurfaces, and meta-atoms, respectively. Note that an intelligent controller adjusts the shifts of the phases of the electromagnetic (EM) waves that im**e on the metasurface layers.

Refer to caption
Figure 1: A SIM-aided MIMO system.

On this basis, let θnl[0,2π),n𝒩,lformulae-sequencesuperscriptsubscript𝜃𝑛𝑙02𝜋formulae-sequence𝑛𝒩𝑙\theta_{n}^{l}\in[0,2\pi),n\in\mathcal{N},l\in\mathcal{L}italic_θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ∈ [ 0 , 2 italic_π ) , italic_n ∈ caligraphic_N , italic_l ∈ caligraphic_L be the phase shift by the n𝑛nitalic_nth meta-atom on the surface layer l𝑙litalic_l. Also, we denote ϕnl=ejθnlsuperscriptsubscriptitalic-ϕ𝑛𝑙superscript𝑒𝑗superscriptsubscript𝜃𝑛𝑙\phi_{n}^{l}=e^{j\theta_{n}^{l}}italic_ϕ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT = italic_e start_POSTSUPERSCRIPT italic_j italic_θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT, and 𝚽l=diag()(ϕl)N×Nsubscript𝚽𝑙diagsuperscriptbold-italic-ϕ𝑙superscript𝑁𝑁{\bm{\Phi}}_{l}=\text{diag}\left(\right)({\bm{\phi}}^{l})\in\mathbb{C}^{N% \times N}bold_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = diag ( ) ( bold_italic_ϕ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT, where ϕl=[ϕ1l,,ϕNl]𝖳N×1superscriptbold-italic-ϕ𝑙superscriptsubscriptsuperscriptitalic-ϕ𝑙1subscriptsuperscriptitalic-ϕ𝑙𝑁𝖳superscript𝑁1{\bm{\phi}}^{l}=[\phi^{l}_{1},\dots,\phi^{l}_{N}]^{{\scriptscriptstyle\mathsf{% T}}}\in\mathbb{C}^{{\color[rgb]{0,0,0}N}\times 1}bold_italic_ϕ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT = [ italic_ϕ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_ϕ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × 1 end_POSTSUPERSCRIPT.111Herein, we consider phase shifts, which are continuously-adjustable and their modulus equals to 1111 to evaluate large SIM-aided MIMO communications. Practical issues such as the consideration of discrete phase shifts [13] is the topic of future work. In addition, 𝐖lN×N,l/{1}formulae-sequencesuperscript𝐖𝑙superscript𝑁𝑁𝑙1{\mathbf{W}}^{l}\in\mathbb{C}^{N\times N},l\in\mathcal{L}/\{1\}bold_W start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT , italic_l ∈ caligraphic_L / { 1 } denotes the coefficient matrix between layer (l1)𝑙1(l-1)( italic_l - 1 ) and layer l𝑙litalic_l. In particular, its entries from meta-atom n~~𝑛\tilde{n}over~ start_ARG italic_n end_ARG on layer (l1)𝑙1(l-1)( italic_l - 1 ) to meta-atom n𝑛nitalic_n on layer l,l𝑙for-all𝑙l,\forall l\in\mathcal{L}italic_l , ∀ italic_l ∈ caligraphic_L are given by

wn,n~l=Atcosxn,n~lrn,n~l(12πrn,n~lj1λ)ej2πrn,n~l/λ,superscriptsubscript𝑤𝑛~𝑛𝑙subscript𝐴𝑡𝑐𝑜𝑠superscriptsubscript𝑥𝑛~𝑛𝑙superscriptsubscript𝑟𝑛~𝑛𝑙12𝜋subscriptsuperscript𝑟𝑙𝑛~𝑛𝑗1𝜆superscript𝑒𝑗2𝜋superscriptsubscript𝑟𝑛~𝑛𝑙𝜆\displaystyle w_{n,\tilde{n}}^{l}=\frac{A_{t}cosx_{n,\tilde{n}}^{l}}{r_{n,% \tilde{n}}^{l}}\left(\frac{1}{2\pi r^{l}_{n,\tilde{n}}}-j\frac{1}{\lambda}% \right)e^{j2\pi r_{n,\tilde{n}}^{l}/\lambda},italic_w start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT = divide start_ARG italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_c italic_o italic_s italic_x start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT end_ARG start_ARG italic_r start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT end_ARG ( divide start_ARG 1 end_ARG start_ARG 2 italic_π italic_r start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT end_ARG - italic_j divide start_ARG 1 end_ARG start_ARG italic_λ end_ARG ) italic_e start_POSTSUPERSCRIPT italic_j 2 italic_π italic_r start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT / italic_λ end_POSTSUPERSCRIPT , (1)

where Atsubscript𝐴𝑡A_{t}italic_A start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the area of each meta-atom at the SIM, xn,n~lsuperscriptsubscript𝑥𝑛~𝑛𝑙x_{n,\tilde{n}}^{l}italic_x start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT denotes the angle between the normal direction of the transmit metasurface layer (l1)𝑙1(l-1)( italic_l - 1 ) and the propagation direction, rn,n~lsuperscriptsubscript𝑟𝑛~𝑛𝑙r_{n,\tilde{n}}^{l}italic_r start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT, is the respective transmission distance. Moreover, let 𝐰k1N×1subscriptsuperscript𝐰1𝑘superscript𝑁1{\mathbf{w}}^{1}_{k}\in\mathbb{C}^{N\times 1}bold_w start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × 1 end_POSTSUPERSCRIPT express the coefficient from the transmit antenna array. Thus, the impact of the SIM can be expressed as

𝐆=𝚽L𝐖L𝚽2𝐖2𝚽1N×N.𝐆subscript𝚽𝐿superscript𝐖𝐿subscript𝚽2superscript𝐖2subscript𝚽1superscript𝑁𝑁\displaystyle{\mathbf{G}}={\bm{\Phi}}_{L}{\mathbf{W}}^{L}\cdots{\bm{\Phi}}_{2}% {\mathbf{W}}^{2}{\bm{\Phi}}_{1}\in\mathbb{C}^{N\times N}.bold_G = bold_Φ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ⋯ bold_Φ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_Φ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT . (2)

Let 𝐡kN×1,k𝒦formulae-sequencesubscript𝐡𝑘superscript𝑁1for-all𝑘𝒦{\mathbf{h}}_{k}\in\mathbb{C}^{N\times 1},\forall k\in\mathcal{K}bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × 1 end_POSTSUPERSCRIPT , ∀ italic_k ∈ caligraphic_K express the channel between the last layer and UE k𝑘kitalic_k that is described by the correlated Rician fading distribution as

𝐡k=βk(κk1+κk𝐡k,LoS+11+κk𝐡k,NLoS)k𝒦.subscript𝐡𝑘subscript𝛽𝑘subscript𝜅𝑘1subscript𝜅𝑘subscript𝐡𝑘LoS11subscript𝜅𝑘subscript𝐡𝑘NLoSfor-all𝑘𝒦\displaystyle{\mathbf{h}}_{k}=\sqrt{\beta_{k}}\left(\sqrt{\frac{\kappa_{k}}{1+% \kappa_{k}}}{\mathbf{h}}_{k,\mathrm{LoS}}+\sqrt{\frac{1}{1+\kappa_{k}}}{% \mathbf{h}}_{k,\mathrm{NLoS}}\right)~{}~{}\forall k\in\mathcal{K}.bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = square-root start_ARG italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ( square-root start_ARG divide start_ARG italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG end_ARG bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT + square-root start_ARG divide start_ARG 1 end_ARG start_ARG 1 + italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG end_ARG bold_h start_POSTSUBSCRIPT italic_k , roman_NLoS end_POSTSUBSCRIPT ) ∀ italic_k ∈ caligraphic_K . (3)

In (3), κksubscript𝜅𝑘\kappa_{k}italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the Rician factor, βksubscript𝛽𝑘\beta_{k}italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the channel gain, 𝐡k,NLoSN×1subscript𝐡𝑘NLoSsuperscript𝑁1{\mathbf{h}}_{k,\mathrm{NLoS}}\in\mathbb{C}^{N\times 1}bold_h start_POSTSUBSCRIPT italic_k , roman_NLoS end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × 1 end_POSTSUPERSCRIPT is the LoS component, and 𝐡k,NLoS𝒞𝒩(𝟎,𝐑)N×1similar-tosubscript𝐡𝑘NLoS𝒞𝒩0𝐑superscript𝑁1{\mathbf{h}}_{k,\mathrm{NLoS}}\sim\mathcal{CN}({\mathbf{0}},{\mathbf{R}})\in% \mathbb{C}^{N\times 1}bold_h start_POSTSUBSCRIPT italic_k , roman_NLoS end_POSTSUBSCRIPT ∼ caligraphic_C caligraphic_N ( bold_0 , bold_R ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × 1 end_POSTSUPERSCRIPT is the NLoS component with 𝐑N×N𝐑superscript𝑁𝑁{\mathbf{R}}\in\mathbb{C}^{N\times N}bold_R ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT representing the spatial correlation of each surface. This correlation is obtained n𝒩,n~𝒩formulae-sequencefor-all𝑛𝒩~𝑛𝒩\forall n\in\mathcal{N},\tilde{n}\in\mathcal{N}∀ italic_n ∈ caligraphic_N , over~ start_ARG italic_n end_ARG ∈ caligraphic_N as [14]

[𝐑SIM]n~,nsubscriptdelimited-[]subscript𝐑SIM~𝑛𝑛\displaystyle[{\mathbf{R}}_{\mathrm{SIM}}]_{\tilde{n},n}[ bold_R start_POSTSUBSCRIPT roman_SIM end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG , italic_n end_POSTSUBSCRIPT =sinc(2𝐮n𝐮n~/λ),n,n~=1,,Nformulae-sequenceabsentsinc2normsubscript𝐮𝑛subscript𝐮~𝑛𝜆𝑛~𝑛1𝑁\displaystyle=\mathrm{sinc}(2\|{\mathbf{u}}_{n}-{\mathbf{u}}_{\tilde{n}}\|/% \lambda),n,\tilde{n}=1,\ldots,N= roman_sinc ( 2 ∥ bold_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - bold_u start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT ∥ / italic_λ ) , italic_n , over~ start_ARG italic_n end_ARG = 1 , … , italic_N (4)

where 𝐮n=[0,i(n)dH,j(n)dV]𝖳subscript𝐮𝑛superscript0𝑖𝑛subscript𝑑H𝑗𝑛subscript𝑑V𝖳{\mathbf{u}}_{n}=[0,i(n)d_{\mathrm{H}},j(n)d_{\mathrm{V}}]^{{% \scriptscriptstyle\mathsf{T}}}bold_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = [ 0 , italic_i ( italic_n ) italic_d start_POSTSUBSCRIPT roman_H end_POSTSUBSCRIPT , italic_j ( italic_n ) italic_d start_POSTSUBSCRIPT roman_V end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT with i(n)=mod(n1,Nx)i(n)=\mod(n-1,N_{x})italic_i ( italic_n ) = roman_mod ( italic_n - 1 , italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) and j(n)=(n1)/Nx𝑗𝑛𝑛1subscript𝑁𝑥j(n)=\lfloor(n-1)/N_{x}\rflooritalic_j ( italic_n ) = ⌊ ( italic_n - 1 ) / italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ⌋ being the horizontal and vertical indices of element n𝑛nitalic_n, respectively. Nxsubscript𝑁𝑥N_{x}italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT and Nysubscript𝑁𝑦N_{y}italic_N start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT are the elements per row and column, while dHsubscript𝑑Hd_{\mathrm{H}}italic_d start_POSTSUBSCRIPT roman_H end_POSTSUBSCRIPT and dVsubscript𝑑Vd_{\mathrm{V}}italic_d start_POSTSUBSCRIPT roman_V end_POSTSUBSCRIPT denote the horizontal width and the vertical height.

III Downlink Data Transmission

During the downlink transmission and based on wave-based beamforming [8], the received signal at the k𝑘kitalic_k-th UE is written as

yk=𝐡k𝖧𝐆i=1K𝐰i1pixi+nk,k𝒦formulae-sequencesubscript𝑦𝑘superscriptsubscript𝐡𝑘𝖧𝐆superscriptsubscript𝑖1𝐾superscriptsubscript𝐰𝑖1subscript𝑝𝑖subscript𝑥𝑖subscript𝑛𝑘for-all𝑘𝒦\displaystyle y_{k}={\mathbf{h}}_{k}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{% G}}\sum_{i=1}^{K}{\mathbf{w}}_{i}^{1}\sqrt{p}_{i}x_{i}+n_{k},~{}~{}~{}\forall k% \in\mathcal{K}italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_G ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT square-root start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∀ italic_k ∈ caligraphic_K (5)

where xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the information symbol intended for the k𝑘kitalic_k-th UE, which has a zero mean and unit variance. Also, pisubscript𝑝𝑖p_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the power corresponding to the k𝑘kitalic_k-th UE with i=1KpiPTsuperscriptsubscript𝑖1𝐾subscript𝑝𝑖subscript𝑃T\sum_{i=1}^{K}p_{i}\leq P_{\mathrm{T}}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_P start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT, where PTsubscript𝑃TP_{\mathrm{T}}italic_P start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT is the total transmit power at the BS. Also nk𝒞𝒩(0,σk2)similar-tosubscript𝑛𝑘𝒞𝒩0superscriptsubscript𝜎𝑘2n_{k}\sim\mathcal{CN}(0,\sigma_{k}^{2})italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∼ caligraphic_C caligraphic_N ( 0 , italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) denotes the additive white Gaussian noise (AWGN) with σk2superscriptsubscript𝜎𝑘2\sigma_{k}^{2}italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT expressing its variance at UE k𝑘kitalic_k.

The downlink achievable SE of UE k𝑘kitalic_k is given by

SE=k=1Klog2(1+γk),SEsuperscriptsubscript𝑘1𝐾subscript21subscript𝛾𝑘\displaystyle\mathrm{SE}=\sum_{k=1}^{K}\log_{2}\left(1+\gamma_{k}\right)\!,roman_SE = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , (6)

where γksubscript𝛾𝑘\gamma_{k}italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denotes the downlink signal-to-interference-plus-noise ratio (SINR), which is written according to the UaTF bounding technique [12] as

γk=pk|𝔼{𝐡k𝖧𝐆𝐰k1}|2i=1Kpi𝔼{|𝐡k𝖧𝐆𝐰i1|2}pk|𝔼{𝐡k𝖧𝐆𝐰k1}|2+σk2,subscript𝛾𝑘subscript𝑝𝑘superscript𝔼superscriptsubscript𝐡𝑘𝖧superscriptsubscript𝐆𝐰𝑘12superscriptsubscript𝑖1𝐾subscript𝑝𝑖𝔼superscriptsuperscriptsubscript𝐡𝑘𝖧superscriptsubscript𝐆𝐰𝑖12subscript𝑝𝑘superscript𝔼superscriptsubscript𝐡𝑘𝖧superscriptsubscript𝐆𝐰𝑘12superscriptsubscript𝜎𝑘2\displaystyle\gamma_{k}=\frac{p_{k}|\mathbb{E}\{{\mathbf{h}}_{k}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{k}^{1}\}|^{2}}{\sum_{i% =1}^{K}p_{i}\mathbb{E}\{|{\mathbf{h}}_{k}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{G}}{\mathbf{w}}_{i}^{1}|^{2}\}-p_{k}|\mathbb{E}\{{\mathbf{h}}_{k}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{k}^{1}\}|^{2}+\sigma_{% k}^{2}},italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | blackboard_E { bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT } | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT blackboard_E { | bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT } - italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | blackboard_E { bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT } | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , (7)

where is assumed that UE k𝑘kitalic_k has knowledge of the average effective channel.

Proposition 1

The achievable SINR of UE k𝑘kitalic_k for a given SIM during the downlink transmission is provided by (8).

γk=pkκk|𝐡k,LoS𝖧𝐆𝐰k1|2i=1Kpitr(𝐆𝐰i1𝐰i1𝖧𝐆𝖧𝐑)+ikKpiκk𝐡k,LoS𝖧𝐆𝐰i1𝐰i1𝖧𝐆𝖧𝐡k,LoS+σk2(1+κk)βk.subscript𝛾𝑘subscript𝑝𝑘subscript𝜅𝑘superscriptsuperscriptsubscript𝐡𝑘LoS𝖧superscriptsubscript𝐆𝐰𝑘12superscriptsubscript𝑖1𝐾subscript𝑝𝑖tracesuperscriptsubscript𝐆𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscript𝐆𝖧𝐑superscriptsubscript𝑖𝑘𝐾subscript𝑝𝑖subscript𝜅𝑘superscriptsubscript𝐡𝑘LoS𝖧superscriptsubscript𝐆𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscript𝐆𝖧subscript𝐡𝑘LoSsuperscriptsubscript𝜎𝑘21subscript𝜅𝑘subscript𝛽𝑘\displaystyle\gamma_{k}=\frac{p_{k}\kappa_{k}|{\mathbf{h}}_{k,\mathrm{LoS}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{k}^{1}|^{2}}{\sum_{i=1% }^{K}p_{i}\tr({\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}^{1^{{% \scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{R}})+\sum_{i\neq k}^{K}p_{i}\kappa_{k}{\mathbf{h}}_{k,\mathrm{LoS}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}% ^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{% H}}}{\mathbf{h}}_{k,\mathrm{LoS}}+\frac{\sigma_{k}^{2}(1+\kappa_{k})}{\beta_{k% }}}.italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_tr ( start_ARG bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_R end_ARG ) + ∑ start_POSTSUBSCRIPT italic_i ≠ italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT + divide start_ARG italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 + italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG end_ARG . (8)
Proof:

The numerator becomes

|𝔼{𝐡k𝖧𝐆𝐰k1}|2=βkκk1+κk|𝐡k,LoS𝖧𝐆𝐰k1|2.superscript𝔼superscriptsubscript𝐡𝑘𝖧superscriptsubscript𝐆𝐰𝑘12subscript𝛽𝑘subscript𝜅𝑘1subscript𝜅𝑘superscriptsuperscriptsubscript𝐡𝑘LoS𝖧superscriptsubscript𝐆𝐰𝑘12\displaystyle|\mathbb{E}\{{\mathbf{h}}_{k}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{G}}{\mathbf{w}}_{k}^{1}\}|^{2}=\beta_{k}\frac{\kappa_{k}}{1+\kappa_{k}% }|{\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{% \mathbf{w}}_{k}^{1}|^{2}.| blackboard_E { bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT } | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG | bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (9)

Regarding the denominator of (7), the first term is written as

𝔼{|𝐡k𝖧𝐆𝐰i1|2}=tr(𝐡k𝖧𝐆𝐰i1𝐰i1𝖧𝐆𝖧𝐡k)𝔼superscriptsuperscriptsubscript𝐡𝑘𝖧superscriptsubscript𝐆𝐰𝑖12tracesuperscriptsubscript𝐡𝑘𝖧superscriptsubscript𝐆𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscript𝐆𝖧subscript𝐡𝑘\displaystyle\mathbb{E}\{|{\mathbf{h}}_{k}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{G}}{\mathbf{w}}_{i}^{1}|^{2}\}=\tr({\mathbf{h}}_{k}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}% ^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{% H}}}{\mathbf{h}}_{k})blackboard_E { | bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT } = roman_tr ( start_ARG bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) (10)
=βk11+κktr(𝐆𝐰i1𝐰i1𝖧𝐆𝖧𝐑)absentsubscript𝛽𝑘11subscript𝜅𝑘tracesuperscriptsubscript𝐆𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscript𝐆𝖧𝐑\displaystyle=\beta_{k}\frac{1}{1+\kappa_{k}}\tr({\mathbf{G}}{\mathbf{w}}_{i}^% {1}{\mathbf{w}}_{i}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{R}})= italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 1 + italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG roman_tr ( start_ARG bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_R end_ARG )
+βkκk1+κk𝐡k,LoS𝖧𝐆𝐰i1𝐰i1𝖧𝐆𝖧𝐡k,LoS,subscript𝛽𝑘subscript𝜅𝑘1subscript𝜅𝑘superscriptsubscript𝐡𝑘LoS𝖧superscriptsubscript𝐆𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscript𝐆𝖧subscript𝐡𝑘LoS\displaystyle+\beta_{k}\frac{\kappa_{k}}{1+\kappa_{k}}{\mathbf{h}}_{k,\mathrm{% LoS}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf% {w}}_{i}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{h}}_{k,\mathrm{LoS}},+ italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT , (11)

where, in (10), we have applied that 𝐱𝖧𝐲=tr(𝐲𝐱𝖧)superscript𝐱𝖧𝐲tracesuperscript𝐲𝐱𝖧{\mathbf{x}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{y}}=\tr({\mathbf{y}}{% \mathbf{x}}^{{\scriptscriptstyle\mathsf{H}}})bold_x start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_y = roman_tr ( start_ARG bold_yx start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_ARG ) for any vectors 𝐱𝐱{\mathbf{x}}bold_x, 𝐲𝐲{\mathbf{y}}bold_y. By substituting (9) and (11) in (7), we obtain the achievable SINR. ∎

IV Problem Formulation and Optimization

The maximization of the sum SE regarding the phase shifts of each surface and the allocated power is of great importance.

IV-A Problem Formulation

The maximization problem is formulated as

(𝒫)𝒫\displaystyle(\mathcal{P})~{}~{}( caligraphic_P ) maxϕl,𝐩f(ϕl,𝐩)=k=1Klog2(1+DkIk)subscriptsubscriptbold-italic-ϕ𝑙𝐩𝑓subscriptbold-italic-ϕ𝑙𝐩superscriptsubscript𝑘1𝐾subscript21subscript𝐷𝑘subscript𝐼𝑘\displaystyle\max_{{\bm{\phi}}_{l},{\mathbf{p}}}\;f({\bm{\phi}}_{l},{\mathbf{p% }})=\sum_{k=1}^{K}\log_{2}\left(1+\frac{D_{k}}{I_{k}}\right)roman_max start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , bold_p end_POSTSUBSCRIPT italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , bold_p ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) (12a)
s.t𝐆=𝚽L𝐖L𝚽2𝐖2𝚽1,formulae-sequencest𝐆subscript𝚽𝐿superscript𝐖𝐿subscript𝚽2superscript𝐖2subscript𝚽1\displaystyle~{}\mathrm{s.t}~{}~{}~{}{\mathbf{G}}={\bm{\Phi}}_{L}{\mathbf{W}}^% {L}\cdots{\bm{\Phi}}_{2}{\mathbf{W}}^{2}{\bm{\Phi}}_{1},roman_s . roman_t bold_G = bold_Φ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ⋯ bold_Φ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_Φ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , (12b)
|ϕnl|=1,n𝒩,l,formulae-sequencesubscriptsuperscriptitalic-ϕ𝑙𝑛1formulae-sequence𝑛𝒩𝑙\displaystyle\;\quad\;\;\;\;\;\!\!~{}\!|\phi^{l}_{n}|=1,n\in\mathcal{N},l\in% \mathcal{L},| italic_ϕ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | = 1 , italic_n ∈ caligraphic_N , italic_l ∈ caligraphic_L , (12c)
i=1Kpi=PTsuperscriptsubscript𝑖1𝐾subscript𝑝𝑖subscript𝑃T\displaystyle\;\quad\;\;\;\;\;\!\!~{}\!\sum_{i=1}^{K}p_{i}=P_{\mathrm{T}}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT (12d)
pk0,k𝒦,formulae-sequencesubscript𝑝𝑘0for-all𝑘𝒦\displaystyle\;\quad\;\;\;\;\;\!\!~{}\!p_{k}\geq 0,\forall k\in\mathcal{K},italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ 0 , ∀ italic_k ∈ caligraphic_K , (12e)

where Dksubscript𝐷𝑘D_{k}italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and Iksubscript𝐼𝑘I_{k}italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT are the numerator and denominator of γksubscript𝛾𝑘\gamma_{k}italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT obtained in Proposition 1. Also, we have defined the vector 𝐩=[p1,,pK]𝖳𝐩superscriptsubscript𝑝1subscript𝑝𝐾𝖳{\mathbf{p}}=[p_{1},\ldots,p_{K}]^{{\scriptscriptstyle\mathsf{T}}}bold_p = [ italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_p start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT. Note that the constraint (12c) expresses that each RIS element provides only a phase shift while (12d) corresponds to the maximum power constraint.

The non-convexity optimization problem (𝒫)𝒫(\mathcal{P})( caligraphic_P ) and its dependence on the unit-modulus constraint with respect to ϕlsubscriptbold-italic-ϕ𝑙{\bm{\phi}}_{l}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT make the solution challenging. For this reason, we resort to alternating optimization (AO). According to this technique, ϕlsubscriptbold-italic-ϕ𝑙{\bm{\phi}}_{l}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT and 𝐩𝐩{\mathbf{p}}bold_p will be optimized individually in an iterative manner. Specifically, first, we find the optimum ϕlsubscriptbold-italic-ϕ𝑙{\bm{\phi}}_{l}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT for a fixed 𝐩𝐩{\mathbf{p}}bold_p. During the next step, we solve for 𝐩𝐩{\mathbf{p}}bold_p with ϕlsubscriptbold-italic-ϕ𝑙{\bm{\phi}}_{l}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT fixed. The objective converges to its optimum value by iterating this process, which leads to the increase of f(ϕl,𝐩)𝑓subscriptbold-italic-ϕ𝑙𝐩f({\bm{\phi}}_{l},{\mathbf{p}})italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , bold_p ) after each step until a specific point because of the upper-bound coming from the power constraint (12d).

IV-B SIM Optimization

Until now, ϕlsubscriptbold-italic-ϕ𝑙{\bm{\phi}}_{l}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT was assumed fixed. However, to exploit each metasurface towards wave-based beamforming while maximizing (6), the optimization of each ϕlsubscriptbold-italic-ϕ𝑙{\bm{\phi}}_{l}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT has to take place. Its presence is observed inside the matrix 𝐆𝐆{\mathbf{G}}bold_G, appearing in Dksubscript𝐷𝑘D_{k}italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and Iksubscript𝐼𝑘I_{k}italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Hence, the maximization problem regarding ϕlsubscriptbold-italic-ϕ𝑙{\bm{\phi}}_{l}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT is described as

(𝒫1)𝒫1\displaystyle(\mathcal{P}1)~{}~{}( caligraphic_P 1 ) maxϕlf(ϕl)subscriptsubscriptbold-italic-ϕ𝑙𝑓subscriptbold-italic-ϕ𝑙\displaystyle\max_{{\bm{\phi}}_{l}}\;f({\bm{\phi}}_{l})roman_max start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) (12ma)
s.t𝐆=𝚽L𝐖L𝚽2𝐖2𝚽1,formulae-sequencest𝐆subscript𝚽𝐿superscript𝐖𝐿subscript𝚽2superscript𝐖2subscript𝚽1\displaystyle~{}\mathrm{s.t}~{}~{}~{}{\mathbf{G}}={\bm{\Phi}}_{L}{\mathbf{W}}^% {L}\cdots{\bm{\Phi}}_{2}{\mathbf{W}}^{2}{\bm{\Phi}}_{1},roman_s . roman_t bold_G = bold_Φ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ⋯ bold_Φ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_Φ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , (12mb)
|ϕnl|=1,n𝒩,l,formulae-sequencesubscriptsuperscriptitalic-ϕ𝑙𝑛1formulae-sequence𝑛𝒩𝑙\displaystyle\;\quad\;\;\;\;\;\!\!~{}\!|\phi^{l}_{n}|=1,n\in\mathcal{N},l\in% \mathcal{L},| italic_ϕ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | = 1 , italic_n ∈ caligraphic_N , italic_l ∈ caligraphic_L , (12mc)

where the maximization problem (𝒫1)𝒫1(\mathcal{P}1)( caligraphic_P 1 ) is non-convex regarding ϕlsubscriptbold-italic-ϕ𝑙{\bm{\phi}}_{l}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT, and it obeys to a unit-modulus constraint with respect to ϕnlsubscriptsuperscriptitalic-ϕ𝑙𝑛\phi^{l}_{n}italic_ϕ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. Application of the projected gradient ascent algorithm until convergence while taking into account the unit-modulus constraint results in a locally optimal solution to (𝒫1)𝒫1(\mathcal{P}1)( caligraphic_P 1 ).

The proposed algorithm suggests starting from ϕl0superscriptsubscriptbold-italic-ϕ𝑙0{\bm{\phi}}_{l}^{0}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT, and then shifting along the gradient of f(ϕl)𝑓subscriptbold-italic-ϕ𝑙f({\bm{\phi}}_{l})italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ). The new point ϕlsubscriptbold-italic-ϕ𝑙{\bm{\phi}}_{l}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT is projected onto ΦlsubscriptΦ𝑙\Phi_{l}roman_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT to hold the new points in the feasible set. Specifically, the unit-modulus constraint means that ϕnlsubscriptsuperscriptitalic-ϕ𝑙𝑛\phi^{l}_{n}italic_ϕ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT has to be found inside the unit circle. PΦl()subscript𝑃subscriptΦ𝑙P_{\Phi_{l}}(\cdot)italic_P start_POSTSUBSCRIPT roman_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ⋅ ) is the projection onto ΦlsubscriptΦ𝑙\Phi_{l}roman_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT. Hence, we have

u¯l,n={ul,n|ul,n|ul,n0ejϕnl,ϕnl[0,2π]ul,n=0,n=1,,N,formulae-sequencesubscript¯𝑢𝑙𝑛casessubscript𝑢𝑙𝑛subscript𝑢𝑙𝑛subscript𝑢𝑙𝑛0superscript𝑒𝑗subscriptsuperscriptitalic-ϕ𝑙𝑛subscriptsuperscriptitalic-ϕ𝑙𝑛02𝜋subscript𝑢𝑙𝑛0𝑛1𝑁\displaystyle\bar{u}_{l,n}=\left\{\begin{array}[]{ll}\frac{u_{l,n}}{|u_{l,n}|}% &u_{l,n}\neq 0\\ e^{j\phi^{l}_{n}},\phi^{l}_{n}\in[0,2\pi]&u_{l,n}=0\\ \end{array},n=1,\ldots,N,\right.over¯ start_ARG italic_u end_ARG start_POSTSUBSCRIPT italic_l , italic_n end_POSTSUBSCRIPT = { start_ARRAY start_ROW start_CELL divide start_ARG italic_u start_POSTSUBSCRIPT italic_l , italic_n end_POSTSUBSCRIPT end_ARG start_ARG | italic_u start_POSTSUBSCRIPT italic_l , italic_n end_POSTSUBSCRIPT | end_ARG end_CELL start_CELL italic_u start_POSTSUBSCRIPT italic_l , italic_n end_POSTSUBSCRIPT ≠ 0 end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_j italic_ϕ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_ϕ start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ [ 0 , 2 italic_π ] end_CELL start_CELL italic_u start_POSTSUBSCRIPT italic_l , italic_n end_POSTSUBSCRIPT = 0 end_CELL end_ROW end_ARRAY , italic_n = 1 , … , italic_N , (12mp)

where the vector 𝐮¯lsubscript¯𝐮𝑙\bar{{\mathbf{u}}}_{l}over¯ start_ARG bold_u end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT of PΦl(𝐮l)subscript𝑃subscriptΦ𝑙subscript𝐮𝑙P_{\Phi_{l}}({\mathbf{u}}_{l})italic_P start_POSTSUBSCRIPT roman_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_u start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) is a given point.

The algorithm is described by the following iteration

ϕli+1superscriptsubscriptbold-italic-ϕ𝑙𝑖1\displaystyle{\bm{\phi}}_{l}^{i+1}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT =PΦl(ϕli+μiϕlf(ϕli)).absentsubscript𝑃subscriptΦ𝑙superscriptsubscriptbold-italic-ϕ𝑙𝑖subscript𝜇𝑖subscriptsubscriptbold-italic-ϕ𝑙𝑓superscriptsubscriptbold-italic-ϕ𝑙𝑖\displaystyle=P_{\Phi_{l}}({\bm{\phi}}_{l}^{i}+\mu_{i}\nabla_{{\bm{\phi}}_{l}}% f({\bm{\phi}}_{l}^{i})).= italic_P start_POSTSUBSCRIPT roman_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∇ start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) ) . (12mq)

The Armijo-Goldstein backtracking line search method provides the step size, which is μi=Liκmisubscript𝜇𝑖subscript𝐿𝑖superscript𝜅subscript𝑚𝑖\mu_{i}=L_{i}\kappa^{m_{i}}italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_κ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, where κ(0,1)𝜅01\kappa\in(0,1)italic_κ ∈ ( 0 , 1 ) and Li>0subscript𝐿𝑖0L_{i}>0italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0. Note that misubscript𝑚𝑖m_{i}italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the smallest positive integer that satisfies

f(ϕli+1)QLiκmi(ϕli;ϕli+1),𝑓superscriptsubscriptbold-italic-ϕ𝑙𝑖1subscript𝑄subscript𝐿𝑖superscript𝜅subscript𝑚𝑖superscriptsubscriptbold-italic-ϕ𝑙𝑖superscriptsubscriptbold-italic-ϕ𝑙𝑖1\displaystyle f({\bm{\phi}}_{l}^{i+1})\geq Q_{L_{i}\kappa^{m_{i}}}({\bm{\phi}}% _{l}^{i};{\bm{\phi}}_{l}^{i+1}),italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT ) ≥ italic_Q start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_κ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ; bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT ) , (12mr)

where

Qμ(ϕl;𝐱)=f(ϕl)+ϕlf(ϕl),𝐱ϕl1μ𝐱ϕl22subscript𝑄𝜇subscriptbold-italic-ϕ𝑙𝐱𝑓subscriptbold-italic-ϕ𝑙subscriptsubscriptbold-italic-ϕ𝑙𝑓subscriptbold-italic-ϕ𝑙𝐱subscriptbold-italic-ϕ𝑙1𝜇subscriptsuperscriptnorm𝐱subscriptbold-italic-ϕ𝑙22\displaystyle\!\!Q_{\mu}({\bm{\phi}}_{l};{\mathbf{x}})\!=\!f({\bm{\phi}}_{l})% \!+\!\langle\nabla_{{\bm{\phi}}_{l}}f({\bm{\phi}}_{l}),{\mathbf{x}}\!-\!{\bm{% \phi}}_{l}\rangle\!-\!\frac{1}{\mu}\|{\mathbf{x}}-{\bm{\phi}}_{l}\|^{2}_{2}italic_Q start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ; bold_x ) = italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) + ⟨ ∇ start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) , bold_x - bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ⟩ - divide start_ARG 1 end_ARG start_ARG italic_μ end_ARG ∥ bold_x - bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (12ms)

is the quadratic approximation of f(ϕl)𝑓subscriptbold-italic-ϕ𝑙f({\bm{\phi}}_{l})italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ).

Proposition 2

The gradient of f(ϕl)𝑓subscriptbold-ϕ𝑙f({\bm{\phi}}_{l})italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) regarding ϕlsuperscriptsubscriptbold-ϕ𝑙{\bm{\phi}}_{l}^{*}bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is obtained in closed-form as

ϕlf(ϕl)subscriptsubscriptbold-italic-ϕ𝑙𝑓subscriptbold-italic-ϕ𝑙\displaystyle\nabla_{{\bm{\phi}}_{l}}f({\bm{\phi}}_{l})∇ start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) =1log2(e)k=1KIkϕlDkDkϕlIk(1+γk)Ik,absent1subscript2𝑒superscriptsubscript𝑘1𝐾subscript𝐼𝑘subscriptsubscriptbold-italic-ϕ𝑙subscript𝐷𝑘subscript𝐷𝑘subscriptsubscriptbold-italic-ϕ𝑙subscript𝐼𝑘1subscript𝛾𝑘subscript𝐼𝑘\displaystyle=\frac{1}{\log_{2}(e)}\sum_{k=1}^{K}\frac{{I}_{k}\nabla_{{\bm{% \phi}}_{l}}D_{k}-D_{k}\nabla_{{\bm{\phi}}_{l}}{I}_{k}}{(1+\gamma_{k}){I}_{k}},= divide start_ARG 1 end_ARG start_ARG roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_e ) end_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT divide start_ARG italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∇ start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∇ start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG ( 1 + italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) italic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG , (12mt)

where

with 𝐀l=𝚽L𝐖L𝚽l+1𝐖l+1subscript𝐀𝑙subscript𝚽𝐿superscript𝐖𝐿subscript𝚽𝑙1superscript𝐖𝑙1{\mathbf{A}}_{l}={\bm{\Phi}}_{L}{\mathbf{W}}^{L}\cdots{\bm{\Phi}}_{l+1}{% \mathbf{W}}^{l+1}bold_A start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = bold_Φ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ⋯ bold_Φ start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT italic_l + 1 end_POSTSUPERSCRIPT, and 𝐂l=𝐖l𝚽l1𝐖l1𝚽1subscript𝐂𝑙superscript𝐖𝑙subscript𝚽𝑙1superscript𝐖𝑙1subscript𝚽1{\mathbf{C}}_{l}={\mathbf{W}}^{l}{\bm{\Phi}}_{l-1}{\mathbf{W}}^{l-1}\cdots{\bm% {\Phi}}_{1}bold_C start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = bold_W start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT bold_Φ start_POSTSUBSCRIPT italic_l - 1 end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT italic_l - 1 end_POSTSUPERSCRIPT ⋯ bold_Φ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.

Proof:

Please see Appendix A. ∎

The SIM optimization design, based on the gradient ascent, appears a significant advantage because the gradient ascent is obtained in a closed form. It has low computational complexity because it consists of simple matrix operations. Specifically, the complexity of (12ma) for large SIMs is 𝒪(NtN2+LN2+KN3)𝒪subscript𝑁𝑡superscript𝑁2𝐿superscript𝑁2𝐾superscript𝑁3\mathcal{O}(N_{t}N^{2}+LN^{2}+KN^{3})caligraphic_O ( italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_L italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_K italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ), and the complexity of (12mt) is similar. In other words, the number of meta-atoms of each surface has a higher impact.

IV-C Power Optimization

Given a fixed 𝚽lsubscript𝚽𝑙{\bm{\Phi}}_{l}bold_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT, we focus on the optimization with respect to 𝐩𝐩{\mathbf{p}}bold_p. Specifically, we have

(𝒫2)𝒫2\displaystyle(\mathcal{P}2)~{}~{}( caligraphic_P 2 ) max𝐩f(𝐩)subscript𝐩𝑓𝐩\displaystyle\max_{{\mathbf{p}}}\;f({\mathbf{p}})roman_max start_POSTSUBSCRIPT bold_p end_POSTSUBSCRIPT italic_f ( bold_p ) (12mua)
i=1Kpi=PT,pk0,k𝒦.formulae-sequencesuperscriptsubscript𝑖1𝐾subscript𝑝𝑖subscript𝑃Tformulae-sequencesubscript𝑝𝑘0for-all𝑘𝒦\displaystyle\;\quad\;\;\;\;\;\!\!~{}\!\sum_{i=1}^{K}p_{i}=P_{\mathrm{T}},~{}p% _{k}\geq 0,\forall k\in\mathcal{K}.∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ 0 , ∀ italic_k ∈ caligraphic_K . (12mub)

The nonconvexity of problem (𝒫2)𝒫2(\mathcal{P}2)( caligraphic_P 2 ) leads us to obtain a solution which is locally optimal. For this reason, we apply a weighted minimum mean square error (MMSE) reformulation of the sum SE. To this end, we denote 𝐜=[c1,,cK]𝖳𝐜superscriptsubscript𝑐1subscript𝑐𝐾𝖳{\mathbf{c}}\!=\![c_{1},\ldots,c_{K}]^{{\scriptscriptstyle\mathsf{T}}}bold_c = [ italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_c start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT. Then, the SINR γksubscript𝛾𝑘\gamma_{k}italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT can be written in terms of the vector 𝐩𝐩{\mathbf{p}}bold_p as

γk=pkqk𝐜𝖳𝐩+uk2,subscript𝛾𝑘subscript𝑝𝑘subscript𝑞𝑘superscript𝐜𝖳𝐩superscriptsubscript𝑢𝑘2\displaystyle\gamma_{k}=\frac{p_{k}q_{k}}{{\mathbf{c}}^{{\scriptscriptstyle% \mathsf{T}}}{\mathbf{p}}+u_{k}^{2}},italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG bold_c start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT bold_p + italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , (12mv)

where

qk=κk|𝐡k,LoS𝖧𝐆𝐰k1|2,ck=βk11+κktr(𝐆𝐰k1𝐰k1𝖧𝐆𝖧𝐑),formulae-sequencesubscript𝑞𝑘subscript𝜅𝑘superscriptsuperscriptsubscript𝐡𝑘LoS𝖧superscriptsubscript𝐆𝐰𝑘12subscript𝑐𝑘subscript𝛽𝑘11subscript𝜅𝑘tracesuperscriptsubscript𝐆𝐰𝑘1superscriptsubscript𝐰𝑘superscript1𝖧superscript𝐆𝖧𝐑\displaystyle q_{k}=\kappa_{k}|{\mathbf{h}}_{k,\mathrm{LoS}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{k}^{1}|^{2},c_{k}=% \beta_{k}\frac{1}{1+\kappa_{k}}\tr({\mathbf{G}}{\mathbf{w}}_{k}^{1}{\mathbf{w}% }_{k}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{R}}),italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 1 + italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG roman_tr ( start_ARG bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_R end_ARG ) ,
tk2=σk2(1+κk)βk,ci=βk11+κktr(𝐆𝐰i1𝐰i1𝖧𝐆𝖧𝐑)formulae-sequencesuperscriptsubscript𝑡𝑘2superscriptsubscript𝜎𝑘21subscript𝜅𝑘subscript𝛽𝑘subscript𝑐𝑖subscript𝛽𝑘11subscript𝜅𝑘tracesuperscriptsubscript𝐆𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscript𝐆𝖧𝐑\displaystyle t_{k}^{2}=\frac{\sigma_{k}^{2}(1+\kappa_{k})}{\beta_{k}},c_{i}=% \beta_{k}\frac{1}{1+\kappa_{k}}\tr({\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}% }_{i}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{R}})italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 + italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG , italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 1 + italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG roman_tr ( start_ARG bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_R end_ARG )
+βkκk1+κk𝐡k,LoS𝖧𝐆𝐰i1𝐰i1𝖧𝐆𝖧𝐡k,LoS,ik.subscript𝛽𝑘subscript𝜅𝑘1subscript𝜅𝑘superscriptsubscript𝐡𝑘LoS𝖧superscriptsubscript𝐆𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscript𝐆𝖧subscript𝐡𝑘LoSfor-all𝑖𝑘\displaystyle+\beta_{k}\frac{\kappa_{k}}{1+\kappa_{k}}{\mathbf{h}}_{k,\mathrm{% LoS}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf% {w}}_{i}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{h}}_{k,\mathrm{LoS}},~{}\forall i\neq k.+ italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT divide start_ARG italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG 1 + italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT , ∀ italic_i ≠ italic_k . (12mw)

Now, we consider the single-input and single-output (SISO) channel model that comes from the SINR in (12mv), which is given by

y~k=pkqksk+i=1Kpicisi+nk,subscript~𝑦𝑘subscript𝑝𝑘subscript𝑞𝑘subscript𝑠𝑘superscriptsubscript𝑖1𝐾subscript𝑝𝑖subscript𝑐𝑖subscript𝑠𝑖subscript𝑛𝑘\displaystyle\tilde{y}_{k}=\sqrt{p_{k}q_{k}}s_{k}+\sum_{i=1}^{K}\sqrt{p_{i}c_{% i}}s_{i}+n_{k},over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = square-root start_ARG italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT square-root start_ARG italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , (12mx)

where nk𝒞𝒩(0,uk2)similar-tosubscript𝑛𝑘𝒞𝒩0superscriptsubscript𝑢𝑘2n_{k}\sim{\cal C}{\cal N}\left(0,u_{k}^{2}\right)italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∼ caligraphic_C caligraphic_N ( 0 , italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) while sisubscript𝑠𝑖s_{i}\in\mathbb{C}italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_C is the data signal with unit variance, and y~ksubscript~𝑦𝑘\tilde{y}_{k}over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the received signal.

Then, the receiver estimates sksubscript𝑠𝑘s_{k}italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, i.e., s^k=vky~ksubscript^𝑠𝑘superscriptsubscript𝑣𝑘subscript~𝑦𝑘\hat{s}_{k}=v_{k}^{*}\tilde{y}_{k}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT with vksubscript𝑣𝑘v_{k}italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT being a receiver coefficient. The corresponding mean square error ek(𝐩,vk)=[|s^ksk|2]subscript𝑒𝑘𝐩subscript𝑣𝑘delimited-[]superscriptsubscript^𝑠𝑘subscript𝑠𝑘2e_{k}({\mathbf{p}},v_{k})=[|\hat{s}_{k}-s_{k}|^{2}]italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_p , italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = [ | over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] becomes

ek(𝐩,vk)=vkg2(pkqk+𝐜k𝖳𝐩+uk2)2vkqkpk+1.subscript𝑒𝑘𝐩subscript𝑣𝑘superscriptsubscript𝑣𝑘𝑔2subscript𝑝𝑘subscript𝑞𝑘superscriptsubscript𝐜𝑘𝖳𝐩superscriptsubscript𝑢𝑘22subscript𝑣𝑘subscript𝑞𝑘subscript𝑝𝑘1\displaystyle e_{k}({\mathbf{p}},v_{k})=v_{kg}^{2}\left(p_{k}q_{k}+{\mathbf{c}% }_{k}^{{\scriptscriptstyle\mathsf{T}}}{\mathbf{p}}+u_{k}^{2}\right)-2v_{k}% \sqrt{q_{k}p_{k}}+1.italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_p , italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = italic_v start_POSTSUBSCRIPT italic_k italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + bold_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT bold_p + italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) - 2 italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT square-root start_ARG italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG + 1 . (12my)

For a given 𝐩𝐩{\mathbf{p}}bold_p, vksubscript𝑣𝑘v_{k}italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is obtained by the minimization of the MSE as

vk=pkqkpkqk+i=1Kpici+uk2.subscript𝑣𝑘subscript𝑝𝑘subscript𝑞𝑘subscript𝑝𝑘subscript𝑞𝑘superscriptsubscript𝑖1𝐾subscript𝑝𝑖subscript𝑐𝑖superscriptsubscript𝑢𝑘2\displaystyle v_{k}=\frac{\sqrt{p_{k}q_{k}}}{p_{k}q_{k}+\sum_{i=1}^{K}p_{i}c_{% i}+u_{k}^{2}}.italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG square-root start_ARG italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . (12mz)

Inserting vksubscript𝑣𝑘v_{k}italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT into (12my), eksubscript𝑒𝑘e_{k}italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT becomes 1/(1+γk)11subscript𝛾𝑘1/\left(1+\gamma_{k}\right)1 / ( 1 + italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ). Based on the weighted MMSE method, let the auxiliary weight dk0subscript𝑑𝑘0d_{k}\geq 0italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ 0 for the MSE eksubscript𝑒𝑘e_{k}italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and consider the problem

(𝒫2.1)min𝐩0,{vk,dk0:k=1,,K}Kk=1Kdkek(𝐩,𝐯k)ln(dk)s.ti=1KpiPT.formulae-sequence𝒫2.1subscript𝐩0conditional-setsubscript𝑣𝑘subscript𝑑𝑘0𝑘1𝐾𝐾superscriptsubscript𝑘1𝐾subscript𝑑𝑘subscript𝑒𝑘𝐩subscript𝐯𝑘subscript𝑑𝑘stsuperscriptsubscript𝑖1𝐾subscript𝑝𝑖subscript𝑃T\displaystyle\begin{split}(\mathcal{P}2.1)\min_{\begin{subarray}{c}{\mathbf{p}% }\geq 0,\\ \{v_{k},d_{k}\geq 0:k=1,\ldots,K\}\end{subarray}}&{K}\sum_{k=1}^{K}d_{k}e_{k}(% {\mathbf{p}},{\mathbf{v}}_{k})-\ln(d_{k})\\ \mathrm{s.t}~{}~{}\;\!&\sum_{i=1}^{K}p_{i}\leq P_{\mathrm{T}}.\end{split}start_ROW start_CELL ( caligraphic_P 2.1 ) roman_min start_POSTSUBSCRIPT start_ARG start_ROW start_CELL bold_p ≥ 0 , end_CELL end_ROW start_ROW start_CELL { italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ 0 : italic_k = 1 , … , italic_K } end_CELL end_ROW end_ARG end_POSTSUBSCRIPT end_CELL start_CELL italic_K ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_p , bold_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) - roman_ln ( start_ARG italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ) end_CELL end_ROW start_ROW start_CELL roman_s . roman_t end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_P start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT . end_CELL end_ROW (12maa)

Problems (𝒫2)𝒫2(\mathcal{P}2)( caligraphic_P 2 ) and (𝒫2.1)𝒫2.1(\mathcal{P}2.1)( caligraphic_P 2.1 ) are equivalent, and thus, are subject to the same solution. The solution of (𝒫2.1)𝒫2.1(\mathcal{P}2.1)( caligraphic_P 2.1 ) can be provided in closed form as

pi=min(PT,qkdk2vk2(qkdkvk2+i=1Kdivi2ci)2).subscript𝑝𝑖subscript𝑃Tsubscript𝑞𝑘superscriptsubscript𝑑𝑘2superscriptsubscript𝑣𝑘2superscriptsubscript𝑞𝑘subscript𝑑𝑘superscriptsubscript𝑣𝑘2superscriptsubscript𝑖1𝐾subscript𝑑𝑖superscriptsubscript𝑣𝑖2subscript𝑐𝑖2\displaystyle p_{i}=\min\left(P_{\mathrm{T}},\frac{q_{k}d_{k}^{2}v_{k}^{2}}{% \left(q_{k}d_{k}v_{k}^{2}+\sum_{i=1}^{K}d_{i}v_{i}^{2}c_{i}\right)^{2}}\right).italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = roman_min ( italic_P start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT , divide start_ARG italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_q start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) . (12mab)

The power allocation presents a similar complexity to the SIM optimization design since it consists of similar matrix operations to (𝒫1)𝒫1(\mathcal{P}1)( caligraphic_P 1 ), i.e., its complexity is 𝒪(NtN2+LN2+KN3)𝒪subscript𝑁𝑡superscript𝑁2𝐿superscript𝑁2𝐾superscript𝑁3\mathcal{O}(N_{t}N^{2}+LN^{2}+KN^{3})caligraphic_O ( italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_L italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_K italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ).

Remark 1

Both algorithms, corresponding to Problems (𝒫1)𝒫1(\mathcal{P}1)( caligraphic_P 1 ) and (𝒫2)𝒫2(\mathcal{P}2)( caligraphic_P 2 ), have low computation complexity and converge quickly. Note that the achievement of a local optimum, obtained from this optimization, will make different initializations result in different solutions, which will be studied below.

V Numerical Results

In this section, we present and evaluate the performance of the achievable sum SE of large SIM-assisted multiuser communications with statistical CSI by showing both analytical results and Monte Carlo simulations. For the setup, we assume that the SIM is parallel to the xy𝑥𝑦x-yitalic_x - italic_y plane and centered along the zlimit-from𝑧z-italic_z -axis at a height HBS=10msubscript𝐻BS10mH_{\mathrm{BS}}=10~{}\mathrm{m}italic_H start_POSTSUBSCRIPT roman_BS end_POSTSUBSCRIPT = 10 roman_m. The spacing between adjacent meta-atoms is assumed to be λ/2𝜆2\lambda/2italic_λ / 2, and the size of each meta-atom is λ/2×λ/2𝜆2𝜆2\lambda/2\times\lambda/2italic_λ / 2 × italic_λ / 2. The thickness of the SIM is TSIM=5λsubscript𝑇SIM5𝜆T_{\mathrm{SIM}}=5\lambdaitalic_T start_POSTSUBSCRIPT roman_SIM end_POSTSUBSCRIPT = 5 italic_λ, while the spacing is dSIM=TSIM/Lsubscript𝑑SIMsubscript𝑇SIM𝐿d_{\mathrm{SIM}}=T_{\mathrm{SIM}}/Litalic_d start_POSTSUBSCRIPT roman_SIM end_POSTSUBSCRIPT = italic_T start_POSTSUBSCRIPT roman_SIM end_POSTSUBSCRIPT / italic_L. Moreover, the locations of the users are randomly distributed at a distance between 60m60m60\mathrm{m}60 roman_m and 80m80m80\mathrm{m}80 roman_m.

The distance between the n~limit-from~𝑛\tilde{n}-over~ start_ARG italic_n end_ARG -th meta-atom of the (l1)limit-from𝑙1(l-1)-( italic_l - 1 ) -st metasurface and the nlimit-from𝑛{n}-italic_n -th meta-atom of the llimit-from𝑙l-italic_l -st metasurface is given by dn,n~l=dSIM2+dn,n~2superscriptsubscript𝑑𝑛~𝑛𝑙superscriptsubscript𝑑SIM2superscriptsubscript𝑑𝑛~𝑛2d_{n,\tilde{n}}^{l}=\sqrt{d_{\mathrm{SIM}}^{2}+d_{n,\tilde{n}}^{2}}italic_d start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT = square-root start_ARG italic_d start_POSTSUBSCRIPT roman_SIM end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_d start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG, where

dn,n~=λ2|nn~|/Nx2+[mod(|nn~|,Nx)]2.subscript𝑑𝑛~𝑛𝜆2superscript𝑛~𝑛subscript𝑁𝑥2superscriptdelimited-[]mod𝑛~𝑛subscript𝑁𝑥2\displaystyle\!\!d_{n,\tilde{n}}\!=\!\frac{\lambda}{2}\sqrt{\lfloor|n-\tilde{n% }|/N_{x}\rfloor^{2}\!+\![\mathrm{mod}(|n-\tilde{n}|,N_{x})]^{2}}.italic_d start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT = divide start_ARG italic_λ end_ARG start_ARG 2 end_ARG square-root start_ARG ⌊ | italic_n - over~ start_ARG italic_n end_ARG | / italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ⌋ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + [ roman_mod ( | italic_n - over~ start_ARG italic_n end_ARG | , italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . (12mac)

The transmission distance between the m𝑚mitalic_m-th antenna and the n~~𝑛\tilde{n}over~ start_ARG italic_n end_ARG-th meta-atom on the first metasurface layer is provided by (12mad). Note that we have cosxn,n~l=dSIM/dn,n~l,lformulae-sequencesuperscriptsubscript𝑥𝑛~𝑛𝑙subscript𝑑SIMsuperscriptsubscript𝑑𝑛~𝑛𝑙for-all𝑙\cos x_{n,\tilde{n}}^{l}=d_{\mathrm{SIM}}/d_{n,\tilde{n}}^{l},\forall l\in% \mathcal{L}roman_cos italic_x start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT = italic_d start_POSTSUBSCRIPT roman_SIM end_POSTSUBSCRIPT / italic_d start_POSTSUBSCRIPT italic_n , over~ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT , ∀ italic_l ∈ caligraphic_L.

dn~,m1=dSIM2+[(mod(n~1,Nx)Nx12)λ2(mNt+12)λ2]2+(n~/NxNy+12)2λ24.superscriptsubscript𝑑~𝑛𝑚1superscriptsubscript𝑑SIM2superscriptdelimited-[]mod~𝑛1subscript𝑁𝑥subscript𝑁𝑥12𝜆2𝑚subscript𝑁𝑡12𝜆22superscript~𝑛subscript𝑁𝑥subscript𝑁𝑦122subscript𝜆24\displaystyle{\small d_{\tilde{n},m}^{1}\!=\!\sqrt{\!d_{\mathrm{SIM}}^{2}\!+\!% \Big{[}\!\Big{(}\!\mathrm{mod}(\tilde{n}\!-\!1,N_{x})\!-\!\frac{N_{x}\!-\!1}{2% }\!\Big{)}\frac{\lambda}{2}\!-\!\Big{(}m\!-\!\frac{N_{t}\!+\!1}{2}\Big{)}\frac% {\lambda}{2}\Big{]}^{2}\!+\!\Big{(}\!\lceil\tilde{n}/N_{x}\rceil\!-\!\frac{N_{% y}\!+\!1}{2}\Big{)}^{2}\frac{\lambda_{2}}{4}}}.italic_d start_POSTSUBSCRIPT over~ start_ARG italic_n end_ARG , italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT = square-root start_ARG italic_d start_POSTSUBSCRIPT roman_SIM end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + [ ( roman_mod ( over~ start_ARG italic_n end_ARG - 1 , italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ) - divide start_ARG italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT - 1 end_ARG start_ARG 2 end_ARG ) divide start_ARG italic_λ end_ARG start_ARG 2 end_ARG - ( italic_m - divide start_ARG italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + 1 end_ARG start_ARG 2 end_ARG ) divide start_ARG italic_λ end_ARG start_ARG 2 end_ARG ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ( ⌈ over~ start_ARG italic_n end_ARG / italic_N start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ⌉ - divide start_ARG italic_N start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT + 1 end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG 4 end_ARG end_ARG . (12mad)

The path loss is given by

β~k=C0(dk/d^)α,subscript~𝛽𝑘subscript𝐶0superscriptsubscript𝑑𝑘^𝑑𝛼\displaystyle\tilde{\beta}_{k}=C_{0}(d_{k}/\hat{d})^{-\alpha},over~ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT / over^ start_ARG italic_d end_ARG ) start_POSTSUPERSCRIPT - italic_α end_POSTSUPERSCRIPT , (12mae)

where C0=(λ2/4πd^)subscript𝐶0subscript𝜆24𝜋^𝑑C_{0}=(\lambda_{2}/4\pi\hat{d})italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT / 4 italic_π over^ start_ARG italic_d end_ARG ) is the free space path loss at the reference distance of d^=1m^𝑑1m\hat{d}=1~{}\mathrm{m}over^ start_ARG italic_d end_ARG = 1 roman_m, and α=2.5𝛼2.5\alpha=2.5italic_α = 2.5 is the path-loss exponent. The correlation matrix 𝐑SIMsubscript𝐑SIM{\mathbf{R}}_{\mathrm{SIM}}bold_R start_POSTSUBSCRIPT roman_SIM end_POSTSUBSCRIPT is obtained according to (4). The carrier frequency and the system bandwidth are 2GHz2GHz2~{}\mathrm{GHz}2 roman_GHz and 20MHz20MHz20~{}\mathrm{MHz}20 roman_MHz, respectively. Moreover, we assume Nt=8subscript𝑁𝑡8N_{t}=8italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 8, K=8𝐾8K=8italic_K = 8, N=200𝑁200N=200italic_N = 200, and L=4𝐿4L=4italic_L = 4.

In Fig. 2, we depict the achievable sum SE versus the number of elements N𝑁Nitalic_N of each surface while varying the number of surfaces L𝐿Litalic_L. First, it is shown that the downlink sum SE increases with N𝑁Nitalic_N for different L𝐿Litalic_L. Moreover, an increase in the number of surfaces results in an increase in the sum SE. In addition, for the sake of comparison, we present the performance in the case of instantaneous CSI for L=4𝐿4L=4italic_L = 4 [8], which performs better than the case of statistical CSI since the latter is obtained based on a lower bound that is optimized at every several coherence intervals instead of at each coherence interval. However, the statistical CSI modeling allows to save significant overhead. Moreover, we show the effect of the size of each surface element. We observe that as the size of each surface element decreases, the correlation decreases, and the sum SE increases. Notably, Monte Carlo (MC) simulations corroborate the analytical results.

Refer to caption
Figure 2: Achievable sum SE of the large SIM-aided MIMO architecture with respect to the number of meta-atoms N𝑁Nitalic_N.

Fig. 3 shows the sum SE versus the number of layers L𝐿Litalic_L of the SIM for the number of UEs K𝐾Kitalic_K. When Nt=K=8subscript𝑁𝑡𝐾8N_{t}=K=8italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_K = 8, we observe that the sum SE improves until L=6𝐿6L=6italic_L = 6 because the SIM is able to mitigate the inter-user interference in the EM wave domain. In particular, a significant improvement is observed compared to the single-layer SIM. Again, we illustrate the comparison between the cases corresponding to instantaneous and statistical CSI, where the latter exhibits worse performance for the benefit of lower overhead.

Refer to caption
Figure 3: Achievable sum SE of the large SIM-aided MIMO architecture with respect to the number of meta-surfaces L𝐿Litalic_L.

In Fig. 4, we depict the convergence of the proposed algorithm. The algorithm terminates when the difference of the objective between the two last iterations is less than 105superscript10510^{-5}10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT or the number of iterations is larger than 130130130130. It is shown that the algorithm converges to its maximum as the number of iterations increases. Since Problem (𝒫)𝒫(\mathcal{P})( caligraphic_P ) is non-convex, the algorithm does not converge to a globally optimal solution. This means that the algorithm may converge to different points starting from different initial points. For this reason, we select the best solutions after executing the algorithm from different initial points. Herein, we depict the sum SE versus the iteration count for 5555 different randomly generated initial points, and we observe that all points result in the same SE. Generally, the selection of 5555 randomly generated initial points allows a good trade-off between performance and complexity.

Refer to caption
Figure 4: Achievable sum SE of the large SIM-aided MIMO architecture with respect to the number of iterations for 5555 different randomly generated initial points.

VI Conclusion

This paper provided the achievable downlink SE of large SIM-aided multiuser communications over Ricean fading channels. In particular, we deduced a new, tractable expression for the downlink SE for large-metasurfaces as a function of large-scale statistics while the downlink precoding takes place in the wave domain, in order to lower the computational burden and the processing latency. The results were used to pursue an optimization based on statistical CSI that achieves lower overhead with respect to instantaneous CSI. Specifically, we proposed an AO algorithm that solves the optimization regarding the phase shifts of each surface of the SIM and the allocated power, which contributes to a reduction in processing latency due to lower overhead.

Appendix A Proof of Proposition 2

The proof starts with the derivation of ϕlDksubscriptsubscriptbold-italic-ϕ𝑙subscript𝐷𝑘\nabla_{{\bm{\phi}}_{l}}D_{k}∇ start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. To this end, we focus on the differential of |𝐡k,LoS𝖧𝐆𝐰k1|2superscriptsuperscriptsubscript𝐡𝑘LoS𝖧superscriptsubscript𝐆𝐰𝑘12|{\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{% \mathbf{w}}_{k}^{1}|^{2}| bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. We have

d(|𝐡k,LoS𝖧𝐆𝐰k1|2)=𝐡k,LoS𝖧d(𝐆)𝐰k1𝐰k1𝖧𝐆𝖧𝐡k,LoS𝑑superscriptsuperscriptsubscript𝐡𝑘LoS𝖧superscriptsubscript𝐆𝐰𝑘12superscriptsubscript𝐡𝑘LoS𝖧𝑑𝐆superscriptsubscript𝐰𝑘1superscriptsubscript𝐰𝑘superscript1𝖧superscript𝐆𝖧subscript𝐡𝑘LoS\displaystyle d(|{\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{H}}% }{\mathbf{G}}{\mathbf{w}}_{k}^{1}|^{2})={\mathbf{h}}_{k,\mathrm{LoS}}^{{% \scriptscriptstyle\mathsf{H}}}d({\mathbf{G}}){\mathbf{w}}_{k}^{1}{\mathbf{w}}_% {k}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{h}}_{k,\mathrm{LoS}}italic_d ( | bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) = bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT italic_d ( bold_G ) bold_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT
+𝐡k,LoS𝖧𝐆𝐰k1𝐰k1𝖧d(𝐆𝖧)𝐡k,LoSsuperscriptsubscript𝐡𝑘LoS𝖧superscriptsubscript𝐆𝐰𝑘1superscriptsubscript𝐰𝑘superscript1𝖧𝑑superscript𝐆𝖧subscript𝐡𝑘LoS\displaystyle+{\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{G}}{\mathbf{w}}_{k}^{1}{\mathbf{w}}_{k}^{1^{{\scriptscriptstyle\mathsf% {H}}}}d({\mathbf{G}}^{{\scriptscriptstyle\mathsf{H}}}){\mathbf{h}}_{k,\mathrm{% LoS}}+ bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT italic_d ( bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT ) bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT (12maf)
=tr(𝐡k,LoS𝖧𝐀ld(𝚽l)𝐂l𝐰k1𝐰k1𝖧𝐆𝖧𝐡k,LoSmissing)absenttracesuperscriptsubscript𝐡𝑘LoS𝖧subscript𝐀𝑙𝑑subscript𝚽𝑙subscript𝐂𝑙superscriptsubscript𝐰𝑘1superscriptsubscript𝐰𝑘superscript1𝖧superscript𝐆𝖧subscript𝐡𝑘LoSmissing\displaystyle=\tr\big({\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{A}}_{l}d({\bm{\Phi}}_{l}){\mathbf{C}}_{l}{\mathbf{w}}_{k}% ^{1}{\mathbf{w}}_{k}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{h}}_{k,\mathrm{LoS}}\big{missing})= roman_tr ( start_ARG bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_A start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT italic_d ( bold_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) bold_C start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT roman_missing end_ARG )
+tr(𝐀l𝖧𝐡k,LoS𝐡k,LoS𝖧𝐆𝐰k1𝐰k1𝖧𝐂l𝖧d(𝚽l𝖧)).tracesuperscriptsubscript𝐀𝑙𝖧subscript𝐡𝑘LoSsuperscriptsubscript𝐡𝑘LoS𝖧superscriptsubscript𝐆𝐰𝑘1superscriptsubscript𝐰𝑘superscript1𝖧superscriptsubscript𝐂𝑙𝖧𝑑superscriptsubscript𝚽𝑙𝖧\displaystyle+\tr({\mathbf{A}}_{l}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{h}% }_{k,\mathrm{LoS}}{\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{H}% }}{\mathbf{G}}{\mathbf{w}}_{k}^{1}{\mathbf{w}}_{k}^{1^{{\scriptscriptstyle% \mathsf{H}}}}{\mathbf{C}}_{l}^{{\scriptscriptstyle\mathsf{H}}}d({\bm{\Phi}}_{l% }^{{\scriptscriptstyle\mathsf{H}}})).+ roman_tr ( start_ARG bold_A start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_Gw start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_C start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT italic_d ( bold_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT ) end_ARG ) . (12mag)

In (12mag), we have substituted d(𝐆)=𝐀ld(𝚽l)𝐂l𝑑𝐆subscript𝐀𝑙𝑑subscript𝚽𝑙subscript𝐂𝑙d({\mathbf{G}})={\mathbf{A}}_{l}d({\bm{\Phi}}_{l}){\mathbf{C}}_{l}italic_d ( bold_G ) = bold_A start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT italic_d ( bold_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) bold_C start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT, where 𝐀l=𝚽L𝐖L𝚽l+1𝐖l+1subscript𝐀𝑙subscript𝚽𝐿superscript𝐖𝐿subscript𝚽𝑙1superscript𝐖𝑙1{\mathbf{A}}_{l}={\bm{\Phi}}_{L}{\mathbf{W}}^{L}\cdots{\bm{\Phi}}_{l+1}{% \mathbf{W}}^{l+1}bold_A start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = bold_Φ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT ⋯ bold_Φ start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT italic_l + 1 end_POSTSUPERSCRIPT, and 𝐂l=𝐖l𝚽l1𝐖l1𝚽1subscript𝐂𝑙superscript𝐖𝑙subscript𝚽𝑙1superscript𝐖𝑙1subscript𝚽1{\mathbf{C}}_{l}={\mathbf{W}}^{l}{\bm{\Phi}}_{l-1}{\mathbf{W}}^{l-1}\cdots{\bm% {\Phi}}_{1}bold_C start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = bold_W start_POSTSUPERSCRIPT italic_l end_POSTSUPERSCRIPT bold_Φ start_POSTSUBSCRIPT italic_l - 1 end_POSTSUBSCRIPT bold_W start_POSTSUPERSCRIPT italic_l - 1 end_POSTSUPERSCRIPT ⋯ bold_Φ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Having obtained the differential, we have

ϕlDksubscriptsubscriptbold-italic-ϕ𝑙subscript𝐷𝑘\displaystyle\nabla_{{\bm{\phi}}_{l}}D_{k}∇ start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =ϕlDkabsentsuperscriptsubscriptbold-italic-ϕ𝑙subscript𝐷𝑘\displaystyle=\frac{\partial}{\partial{\bm{\phi}}_{l}^{*}}D_{k}= divide start_ARG ∂ end_ARG start_ARG ∂ bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG italic_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (12mah)
=pkκkdiag()(𝐂l𝐰k1𝐰k1𝖳𝐆𝖳𝐡k,LoS𝐡k,LoS𝖳𝐀l)absentsubscript𝑝𝑘subscript𝜅𝑘diagsuperscriptsubscript𝐂𝑙superscriptsubscript𝐰𝑘superscript1superscriptsubscript𝐰𝑘superscript1𝖳superscript𝐆𝖳superscriptsubscript𝐡𝑘LoSsuperscriptsubscript𝐡𝑘LoS𝖳superscriptsubscript𝐀𝑙\displaystyle=p_{k}\kappa_{k}\text{diag}\left(\right)({\mathbf{C}}_{l}^{*}{% \mathbf{w}}_{k}^{1^{*}}{\mathbf{w}}_{k}^{1^{{\scriptscriptstyle\mathsf{T}}}}{% \mathbf{G}}^{{\scriptscriptstyle\mathsf{T}}}{\mathbf{h}}_{k,\mathrm{LoS}}^{*}{% \mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{T}}}{\mathbf{A}}_{l}^% {*})= italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT diag ( ) ( bold_C start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT bold_h start_POSTSUBSCRIPT italic_k , roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT bold_A start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )

The first term in the denominator is written as

d(tr(𝐆𝐰i1𝐰i1𝖧𝐆𝖧𝐑missing))=tr(𝐀ld(𝚽l)𝐂l𝐰i1𝐰i1𝖧𝐆𝖧𝐑missing)𝑑tracesuperscriptsubscript𝐆𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscript𝐆𝖧𝐑missingtracesubscript𝐀𝑙𝑑subscript𝚽𝑙subscript𝐂𝑙superscriptsubscript𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscript𝐆𝖧𝐑missing\displaystyle d\big{(}\tr\big({\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}% ^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{% H}}}{\mathbf{R}}\big{missing})\big{)}=\tr\big({\mathbf{A}}_{l}d({\bm{\Phi}}_{l% }){\mathbf{C}}_{l}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}^{1^{{\scriptscriptstyle% \mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{R}}\big{% missing})italic_d ( roman_tr ( start_ARG bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_R roman_missing end_ARG ) ) = roman_tr ( start_ARG bold_A start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT italic_d ( bold_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT ) bold_C start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_R roman_missing end_ARG )
+tr(𝐀l𝖧𝐑𝐆𝐰i1𝐰i1𝖧𝐂l𝖧d(𝚽l𝖧)missing).tracesuperscriptsubscript𝐀𝑙𝖧superscriptsubscript𝐑𝐆𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscriptsubscript𝐂𝑙𝖧𝑑superscriptsubscript𝚽𝑙𝖧missing\displaystyle+\tr\big({\mathbf{A}}_{l}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{R}}{\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}^{1^{{% \scriptscriptstyle\mathsf{H}}}}{\mathbf{C}}_{l}^{{\scriptscriptstyle\mathsf{H}% }}d({\bm{\Phi}}_{l}^{{\scriptscriptstyle\mathsf{H}}})\big{missing}).+ roman_tr ( start_ARG bold_A start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_RGw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_C start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT italic_d ( bold_Φ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT ) roman_missing end_ARG ) . (12mai)

Thus, we have

ϕltr(𝐆𝐰i1𝐰i1𝖧𝐆𝖧𝐑missing)=diag()(𝐂l𝐰i1𝐰i1𝖳𝐆𝖳𝐑𝐀l)subscriptsubscriptbold-italic-ϕ𝑙tracesuperscriptsubscript𝐆𝐰𝑖1superscriptsubscript𝐰𝑖superscript1𝖧superscript𝐆𝖧𝐑missingdiagsuperscriptsubscript𝐂𝑙superscriptsubscript𝐰𝑖superscript1superscriptsubscript𝐰𝑖superscript1𝖳superscript𝐆𝖳superscriptsubscript𝐑𝐀𝑙\displaystyle\!\nabla_{{\bm{\phi}}_{l}}\!\tr\big({\mathbf{G}}{\mathbf{w}}_{i}^% {1}{\mathbf{w}}_{i}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{R}}\big{missing})\!=\!\text{diag}\left(% \right)({\mathbf{C}}_{l}^{*}{\mathbf{w}}_{i}^{1^{*}}{\mathbf{w}}_{i}^{1^{{% \scriptscriptstyle\mathsf{T}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{T}}}{% \mathbf{R}}{\mathbf{A}}_{l}^{*})∇ start_POSTSUBSCRIPT bold_italic_ϕ start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_tr ( start_ARG bold_Gw start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_H end_POSTSUPERSCRIPT bold_R roman_missing end_ARG ) = diag ( ) ( bold_C start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT sansserif_T end_POSTSUPERSCRIPT bold_RA start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )

The second term in the denominator is similar to the numerator. Hence, the derivation is similar.

References

  • [1] M. Di Renzo et al., “Smart radio environments empowered by reconfigurable intelligent surfaces: How it works, state of research, and the road ahead,” IEEE J. Sel. Areas Commun., vol. 38, no. 11, pp. 2450–2525, 2020.
  • [2] Y. Liu et al., “Reconfigurable intelligent surfaces: Principles and opportunities,” IEEE Commun. Surveys & Tuts, vol. 23, no. 3, pp. 1546–1577, 2021.
  • [3] C. Huang et al., “Reconfigurable intelligent surfaces for energy efficiency in wireless communication,” IEEE Transa. Wireless Commun., vol. 18, no. 8, pp. 4157–4170, 2019.
  • [4] A. Papazafeiropoulos et al., “Intelligent reflecting surface-assisted MU-MISO systems with imperfect hardware: Channel estimation and beamforming design,” IEEE Trans. Wireless Commun., vol. 21, no. 3, pp. 2077–2092, 2021.
  • [5] ——, “Achievable rate of a STAR-RIS assisted massive MIMO system under spatially-correlated channels,” IEEE Trans. Wireless Commun., pp. 1–1, 2023.
  • [6] H. Guo et al., “Weighted sum-rate maximization for reconfigurable intelligent surface aided wireless networks,” IEEE Trans. Wireless Commun., vol. 19, no. 5, pp. 3064–3076, 2020.
  • [7] J. An et al., “Stacked intelligent metasurfaces for efficient holographic MIMO communications in 6G,” IEEE J. Sel. Areas Commun., 2023.
  • [8] ——, “Stacked intelligent metasurfaces for multiuser downlink beamforming in the wave domain,” arXiv preprint arXiv:2309.02687, 2023.
  • [9] A. Papazafeiropoulos et al., “Achievable rate optimization for stacked intelligent metasurface-assisted holographic MIMO communications,” arXiv preprint arXiv:2402.16415.
  • [10] A. Papazafeiropoulos, P. Kourtessis, and S. Chatzinotas, “Performance of double-stacked intelligent metasurface-assisted multiuser massive MIMO communications in the wave domain,” arXiv preprint arXiv:2402.16405, 2024.
  • [11] S. Lin et al., “Stacked intelligent metasurface enabled LEO satellite communications relying on statistical CSI,” IEEE Wireless Commun. Let., 2024.
  • [12] E. Björnson et al., “Massive MIMO networks: Spectral, energy, and hardware efficiency,” Foundations and Trends® in Signal Processing, vol. 11, no. 3-4, pp. 154–655, 2017.
  • [13] C. Liu et al., “A programmable diffractive deep neural network based on a digital-coding metasurface array,” Nature Electronics, vol. 5, no. 2, pp. 113–122, 2022.
  • [14] E. Björnson and L. Sanguinetti, “Rayleigh fading modeling and channel hardening for reconfigurable intelligent surfaces,” IEEE Wireless Commun. Lett., vol. 10, no. 4, pp. 830–834, 2021.
\Closesolutionfile

solutionfile