\Newassociation

solutionSolutionsolutionfile \Opensolutionfilesolutionfile[LargeSIMRateDouble]

Achievable Rate Optimization for Large Stacked Intelligent Metasurfaces Based on Statistical CSI

Anastasios Papazafeiropoulos, Pandelis Kourtessis, Symeon Chatzinotas, Dimitra I. Kaklamani, Iakovos S. Venieris A. Papazafeiropoulos is with the Communications and Intelligent Systems Research Group, University of Hertfordshire, Hatfield AL10 9AB, U. K., and with SnT at the University of Luxembourg, Luxembourg. P. Kourtessis is with the Communications and Intelligent Systems Research Group, University of Hertfordshire, Hatfield AL10 9AB, U. K. S. Chatzinotas is with the SnT at the University of Luxembourg, Luxembourg. Dimitra I. Kaklamani is with the Microwave and Fiber Optics Laboratory, and Iakovos S. Venieris is with the Intelligent Communications and Broadband Networks Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens, Zografou, 15780 Athens, Greece. Corresponding author’s email: [email protected].

Abstract

Stacked intelligent metasurface (SIM) is an emerging design that consists of multiple layers of metasurfaces. A SIM enables holographic multiple-input multiple-output (HMIMO) precoding in the wave domain, which results in the reduction of energy consumption and hardware cost. On the ground of multiuser beamforming, this letter focuses on the downlink achievable rate and its maximization. Contrary to previous works on multiuser SIM, we consider statistical channel state information (CSI) as opposed to instantaneous CSI to overcome challenges such as large overhead. Also, we examine the performance of large surfaces. We apply an alternating optimization (AO) algorithm regarding the phases of the SIM and the allocated transmit power. Simulations illustrate the performance of the considered large SIM-assisted design as well as the comparison between different CSI considerations.

Index Terms:

Reconfigurable intelligent surface (RIS), stacked intelligent metasurfaces (SIM), 6G networks.

I Introduction

The technology of reconfigurable intelligent surfaces (RISs) has recently emerged to increase coverage and enhance spectral and energy efficiencies in various communication environments [1, 2]. In general terms, an RIS includes a surface that includes a large number of elements, which are nearly passive and have low cost. The purpose of these elements is to adjust the phases of the incident electromagnetic (EM) waves by using a smart controller, and hence, shape the propagation environment dynamically [3, 4, 5].

However, most existing works on RIS assume single-layer metasurface structure [3, 4, 6], which imposes a constraint on the adjustment of the beam patterns. Also, the single-layer structures of RISs do not have the capability of inter-user interference suppression as shown in [6]. These observations led the authors in [7, 8] to propose a stacked intelligent metasurface (SIM), which consists of an array of programmable metasurfaces similar to artificial neural networks (ANNs). Among the processing capabilities of a SIM, we that the forward propagation takes place at the speed of light.

On this ground, in [7], authors proposed a SIM-based design for the transceiver of point-to-point multiple-input multiple-output (MIMO) communication systems, where the combining and the precoding take place as the EM waves propagate along the SIM. In [8], we observe the integration of a SIM to the transmitter, i.e., the base station (BS) towards enabling beamforming in the EM domain based on instantaneous channel state information (CSI). Contrary to [7] and [8], in [9] and [10], we proposed more general hybrid digital wave designs, where all element parameters are optimised simultaneously through more efficient algorithms.

In this work, we focus on a SIM-enabled multiuser architecture operating solely in the wave domain. Note that [10] assumes a hybrid digital wave design, and [11] focuses on satellite communication systems. Also, contrary to previous works [7, 8, 9], we consider a SIM that consists of large metasurfaces, since we apply the use-and-then-forget (UatF) bound [12]. Most importantly, we obtain the downlink rate and perform its optimization regarding the phase shifts and transmit power in terms of statistical CSI. Notably, this approach enables the optimization at every several coherence intervals rather than optimizing at each interval. Hence, we achieve lower overhead, which is one of the main challenges in SIM-assisted systems.

Notation: Matrices and vectors are represented by boldface upper and lower case symbols, respectively. The notations $(\cdot)^{\scriptscriptstyle\mathsf{T}}$ , $(\cdot)^{\scriptscriptstyle\mathsf{H}}$ , and $\tr\!\left({\cdot}\right)$ denote the transpose, Hermitian transpose, and trace operators, respectively. Also, the symbol $\mathbb{E}\left[\cdot\right]$ denotes the expectation operator. The floor function $\lfloor x\rfloor$ gives as output the greatest integer less than or equal to $x$ . The notation $\text{diag}\left(\right)\left({\mathbf{A}}\right)$ represents a vector with elements equal to the diagonal elements of ${\mathbf{A}}$ . The notation ${\mathbf{b}}\sim{\cal C}{\cal N}{({\mathbf{0}},\mathbf{\Sigma})}$ represents a circularly symmetric complex Gaussian vector with zero mean and a covariance matrix $\mathbf{\Sigma}$ .

II System Model

We consider a SIM-aided MIMO communication system as depicted in Fig. 1. In particular, a BS, which includes $N_{t}$ antennas, communicates with $K$ single-antenna user equipments (UEs) through a SIM performing wave-based processing. The SIM is implemented by $L$ metasurfaces, where each one has a large number of $N$ meta-atoms. Let $\mathcal{K}=\{1,\ldots,K\}$ , $\mathcal{L}=\{1,\ldots,L\}$ , and $\mathcal{N}=\{1,\ldots,N\}$ denote the sets of UEs, metasurfaces, and meta-atoms, respectively. Note that an intelligent controller adjusts the shifts of the phases of the electromagnetic (EM) waves that im**e on the metasurface layers.

Refer to caption — Figure 1: A SIM-aided MIMO system.

On this basis, let $\theta_{n}^{l}\in[0,2\pi),n\in\mathcal{N},l\in\mathcal{L}$ be the phase shift by the $n$ th meta-atom on the surface layer $l$ . Also, we denote $\phi_{n}^{l}=e^{j\theta_{n}^{l}}$ , and ${\bm{\Phi}}_{l}=\text{diag}\left(\right)({\bm{\phi}}^{l})\in\mathbb{C}^{N% \times N}$ , where ${\bm{\phi}}^{l}=[\phi^{l}_{1},\dots,\phi^{l}_{N}]^{{\scriptscriptstyle\mathsf{% T}}}\in\mathbb{C}^{{\color[rgb]{0,0,0}N}\times 1}$ .¹¹1Herein, we consider phase shifts, which are continuously-adjustable and their modulus equals to $1$ to evaluate large SIM-aided MIMO communications. Practical issues such as the consideration of discrete phase shifts [13] is the topic of future work. In addition, ${\mathbf{W}}^{l}\in\mathbb{C}^{N\times N},l\in\mathcal{L}/\{1\}$ denotes the coefficient matrix between layer $(l-1)$ and layer $l$ . In particular, its entries from meta-atom $\tilde{n}$ on layer $(l-1)$ to meta-atom $n$ on layer $l,\forall l\in\mathcal{L}$ are given by

\displaystyle w_{n,\tilde{n}}^{l}=\frac{A_{t}cosx_{n,\tilde{n}}^{l}}{r_{n,% \tilde{n}}^{l}}\left(\frac{1}{2\pi r^{l}_{n,\tilde{n}}}-j\frac{1}{\lambda}% \right)e^{j2\pi r_{n,\tilde{n}}^{l}/\lambda},

(1)

where $A_{t}$ is the area of each meta-atom at the SIM, $x_{n,\tilde{n}}^{l}$ denotes the angle between the normal direction of the transmit metasurface layer $(l-1)$ and the propagation direction, $r_{n,\tilde{n}}^{l}$ , is the respective transmission distance. Moreover, let ${\mathbf{w}}^{1}_{k}\in\mathbb{C}^{N\times 1}$ express the coefficient from the transmit antenna array. Thus, the impact of the SIM can be expressed as

\displaystyle{\mathbf{G}}={\bm{\Phi}}_{L}{\mathbf{W}}^{L}\cdots{\bm{\Phi}}_{2}% {\mathbf{W}}^{2}{\bm{\Phi}}_{1}\in\mathbb{C}^{N\times N}.

(2)

Let ${\mathbf{h}}_{k}\in\mathbb{C}^{N\times 1},\forall k\in\mathcal{K}$ express the channel between the last layer and UE $k$ that is described by the correlated Rician fading distribution as

\displaystyle{\mathbf{h}}_{k}=\sqrt{\beta_{k}}\left(\sqrt{\frac{\kappa_{k}}{1+% \kappa_{k}}}{\mathbf{h}}_{k,\mathrm{LoS}}+\sqrt{\frac{1}{1+\kappa_{k}}}{% \mathbf{h}}_{k,\mathrm{NLoS}}\right)~{}~{}\forall k\in\mathcal{K}.

(3)

In (3), $\kappa_{k}$ is the Rician factor, $\beta_{k}$ is the channel gain, ${\mathbf{h}}_{k,\mathrm{NLoS}}\in\mathbb{C}^{N\times 1}$ is the LoS component, and ${\mathbf{h}}_{k,\mathrm{NLoS}}\sim\mathcal{CN}({\mathbf{0}},{\mathbf{R}})\in% \mathbb{C}^{N\times 1}$ is the NLoS component with ${\mathbf{R}}\in\mathbb{C}^{N\times N}$ representing the spatial correlation of each surface. This correlation is obtained $\forall n\in\mathcal{N},\tilde{n}\in\mathcal{N}$ as [14]

\displaystyle[{\mathbf{R}}_{\mathrm{SIM}}]_{\tilde{n},n}

\displaystyle=\mathrm{sinc}(2\|{\mathbf{u}}_{n}-{\mathbf{u}}_{\tilde{n}}\|/% \lambda),n,\tilde{n}=1,\ldots,N

(4)

where ${\mathbf{u}}_{n}=[0,i(n)d_{\mathrm{H}},j(n)d_{\mathrm{V}}]^{{% \scriptscriptstyle\mathsf{T}}}$ with $i(n)=\mod(n-1,N_{x})$ and $j(n)=\lfloor(n-1)/N_{x}\rfloor$ being the horizontal and vertical indices of element $n$ , respectively. $N_{x}$ and $N_{y}$ are the elements per row and column, while $d_{\mathrm{H}}$ and $d_{\mathrm{V}}$ denote the horizontal width and the vertical height.

III Downlink Data Transmission

During the downlink transmission and based on wave-based beamforming [8], the received signal at the $k$ -th UE is written as

\displaystyle y_{k}={\mathbf{h}}_{k}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{% G}}\sum_{i=1}^{K}{\mathbf{w}}_{i}^{1}\sqrt{p}_{i}x_{i}+n_{k},~{}~{}~{}\forall k% \in\mathcal{K}

(5)

where $x_{i}$ is the information symbol intended for the $k$ -th UE, which has a zero mean and unit variance. Also, $p_{i}$ is the power corresponding to the $k$ -th UE with $\sum_{i=1}^{K}p_{i}\leq P_{\mathrm{T}}$ , where $P_{\mathrm{T}}$ is the total transmit power at the BS. Also $n_{k}\sim\mathcal{CN}(0,\sigma_{k}^{2})$ denotes the additive white Gaussian noise (AWGN) with $\sigma_{k}^{2}$ expressing its variance at UE $k$ .

The downlink achievable SE of UE $k$ is given by

\displaystyle\mathrm{SE}=\sum_{k=1}^{K}\log_{2}\left(1+\gamma_{k}\right)\!,

(6)

where $\gamma_{k}$ denotes the downlink signal-to-interference-plus-noise ratio (SINR), which is written according to the UaTF bounding technique [12] as

\displaystyle\gamma_{k}=\frac{p_{k}|\mathbb{E}\{{\mathbf{h}}_{k}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{k}^{1}\}|^{2}}{\sum_{i% =1}^{K}p_{i}\mathbb{E}\{|{\mathbf{h}}_{k}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{G}}{\mathbf{w}}_{i}^{1}|^{2}\}-p_{k}|\mathbb{E}\{{\mathbf{h}}_{k}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{k}^{1}\}|^{2}+\sigma_{% k}^{2}},

(7)

where is assumed that UE $k$ has knowledge of the average effective channel.

Proposition 1

The achievable SINR of UE $k$ for a given SIM during the downlink transmission is provided by (8).

\displaystyle\gamma_{k}=\frac{p_{k}\kappa_{k}|{\mathbf{h}}_{k,\mathrm{LoS}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{k}^{1}|^{2}}{\sum_{i=1% }^{K}p_{i}\tr({\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}^{1^{{% \scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{R}})+\sum_{i\neq k}^{K}p_{i}\kappa_{k}{\mathbf{h}}_{k,\mathrm{LoS}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}% ^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{% H}}}{\mathbf{h}}_{k,\mathrm{LoS}}+\frac{\sigma_{k}^{2}(1+\kappa_{k})}{\beta_{k% }}}.

(8)

Proof:

The numerator becomes

\displaystyle|\mathbb{E}\{{\mathbf{h}}_{k}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{G}}{\mathbf{w}}_{k}^{1}\}|^{2}=\beta_{k}\frac{\kappa_{k}}{1+\kappa_{k}% }|{\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{% \mathbf{w}}_{k}^{1}|^{2}.

(9)

Regarding the denominator of (7), the first term is written as

	$\displaystyle\mathbb{E}\{\|{\mathbf{h}}_{k}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{G}}{\mathbf{w}}_{i}^{1}\|^{2}\}=\tr({\mathbf{h}}_{k}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}% ^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{% H}}}{\mathbf{h}}_{k})$		(10)
	$\displaystyle=\beta_{k}\frac{1}{1+\kappa_{k}}\tr({\mathbf{G}}{\mathbf{w}}_{i}^% {1}{\mathbf{w}}_{i}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{R}})$
	$\displaystyle+\beta_{k}\frac{\kappa_{k}}{1+\kappa_{k}}{\mathbf{h}}_{k,\mathrm{% LoS}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf% {w}}_{i}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{h}}_{k,\mathrm{LoS}},$		(11)

where, in (10), we have applied that ${\mathbf{x}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{y}}=\tr({\mathbf{y}}{% \mathbf{x}}^{{\scriptscriptstyle\mathsf{H}}})$ for any vectors ${\mathbf{x}}$ , ${\mathbf{y}}$ . By substituting (9) and (11) in (7), we obtain the achievable SINR. ∎

IV Problem Formulation and Optimization

The maximization of the sum SE regarding the phase shifts of each surface and the allocated power is of great importance.

IV-A Problem Formulation

The maximization problem is formulated as


$\displaystyle(\mathcal{P})~{}~{}$	$\displaystyle\max_{{\bm{\phi}}_{l},{\mathbf{p}}}\;f({\bm{\phi}}_{l},{\mathbf{p% }})=\sum_{k=1}^{K}\log_{2}\left(1+\frac{D_{k}}{I_{k}}\right)$	(12a)
	$\displaystyle~{}\mathrm{s.t}~{}~{}~{}{\mathbf{G}}={\bm{\Phi}}_{L}{\mathbf{W}}^% {L}\cdots{\bm{\Phi}}_{2}{\mathbf{W}}^{2}{\bm{\Phi}}_{1},$	(12b)
	$\displaystyle\;\quad\;\;\;\;\;\!\!~{}\!\|\phi^{l}_{n}\|=1,n\in\mathcal{N},l\in% \mathcal{L},$	(12c)
	$\displaystyle\;\quad\;\;\;\;\;\!\!~{}\!\sum_{i=1}^{K}p_{i}=P_{\mathrm{T}}$	(12d)
	$\displaystyle\;\quad\;\;\;\;\;\!\!~{}\!p_{k}\geq 0,\forall k\in\mathcal{K},$	(12e)

where $D_{k}$ and $I_{k}$ are the numerator and denominator of $\gamma_{k}$ obtained in Proposition 1. Also, we have defined the vector ${\mathbf{p}}=[p_{1},\ldots,p_{K}]^{{\scriptscriptstyle\mathsf{T}}}$ . Note that the constraint (12c) expresses that each RIS element provides only a phase shift while (12d) corresponds to the maximum power constraint.

The non-convexity optimization problem $(\mathcal{P})$ and its dependence on the unit-modulus constraint with respect to ${\bm{\phi}}_{l}$ make the solution challenging. For this reason, we resort to alternating optimization (AO). According to this technique, ${\bm{\phi}}_{l}$ and ${\mathbf{p}}$ will be optimized individually in an iterative manner. Specifically, first, we find the optimum ${\bm{\phi}}_{l}$ for a fixed ${\mathbf{p}}$ . During the next step, we solve for ${\mathbf{p}}$ with ${\bm{\phi}}_{l}$ fixed. The objective converges to its optimum value by iterating this process, which leads to the increase of $f({\bm{\phi}}_{l},{\mathbf{p}})$ after each step until a specific point because of the upper-bound coming from the power constraint (12d).

IV-B SIM Optimization

Until now, ${\bm{\phi}}_{l}$ was assumed fixed. However, to exploit each metasurface towards wave-based beamforming while maximizing (6), the optimization of each ${\bm{\phi}}_{l}$ has to take place. Its presence is observed inside the matrix ${\mathbf{G}}$ , appearing in $D_{k}$ and $I_{k}$ . Hence, the maximization problem regarding ${\bm{\phi}}_{l}$ is described as


$\displaystyle(\mathcal{P}1)~{}~{}$	$\displaystyle\max_{{\bm{\phi}}_{l}}\;f({\bm{\phi}}_{l})$	(12ma)
	$\displaystyle~{}\mathrm{s.t}~{}~{}~{}{\mathbf{G}}={\bm{\Phi}}_{L}{\mathbf{W}}^% {L}\cdots{\bm{\Phi}}_{2}{\mathbf{W}}^{2}{\bm{\Phi}}_{1},$	(12mb)
	$\displaystyle\;\quad\;\;\;\;\;\!\!~{}\!\|\phi^{l}_{n}\|=1,n\in\mathcal{N},l\in% \mathcal{L},$	(12mc)

where the maximization problem $(\mathcal{P}1)$ is non-convex regarding ${\bm{\phi}}_{l}$ , and it obeys to a unit-modulus constraint with respect to $\phi^{l}_{n}$ . Application of the projected gradient ascent algorithm until convergence while taking into account the unit-modulus constraint results in a locally optimal solution to $(\mathcal{P}1)$ .

The proposed algorithm suggests starting from ${\bm{\phi}}_{l}^{0}$ , and then shifting along the gradient of $f({\bm{\phi}}_{l})$ . The new point ${\bm{\phi}}_{l}$ is projected onto $\Phi_{l}$ to hold the new points in the feasible set. Specifically, the unit-modulus constraint means that $\phi^{l}_{n}$ has to be found inside the unit circle. $P_{\Phi_{l}}(\cdot)$ is the projection onto $\Phi_{l}$ . Hence, we have

\displaystyle\bar{u}_{l,n}=\left\{\begin{array}[]{ll}\frac{u_{l,n}}{|u_{l,n}|}% &u_{l,n}\neq 0\\ e^{j\phi^{l}_{n}},\phi^{l}_{n}\in[0,2\pi]&u_{l,n}=0\\ \end{array},n=1,\ldots,N,\right.

(12mp)

where the vector $\bar{{\mathbf{u}}}_{l}$ of $P_{\Phi_{l}}({\mathbf{u}}_{l})$ is a given point.

The algorithm is described by the following iteration

\displaystyle{\bm{\phi}}_{l}^{i+1}

\displaystyle=P_{\Phi_{l}}({\bm{\phi}}_{l}^{i}+\mu_{i}\nabla_{{\bm{\phi}}_{l}}% f({\bm{\phi}}_{l}^{i})).

(12mq)

The Armijo-Goldstein backtracking line search method provides the step size, which is $\mu_{i}=L_{i}\kappa^{m_{i}}$ , where $\kappa\in(0,1)$ and $L_{i}>0$ . Note that $m_{i}$ is the smallest positive integer that satisfies

\displaystyle f({\bm{\phi}}_{l}^{i+1})\geq Q_{L_{i}\kappa^{m_{i}}}({\bm{\phi}}% _{l}^{i};{\bm{\phi}}_{l}^{i+1}),

(12mr)

where

\displaystyle\!\!Q_{\mu}({\bm{\phi}}_{l};{\mathbf{x}})\!=\!f({\bm{\phi}}_{l})% \!+\!\langle\nabla_{{\bm{\phi}}_{l}}f({\bm{\phi}}_{l}),{\mathbf{x}}\!-\!{\bm{% \phi}}_{l}\rangle\!-\!\frac{1}{\mu}\|{\mathbf{x}}-{\bm{\phi}}_{l}\|^{2}_{2}

(12ms)

is the quadratic approximation of $f({\bm{\phi}}_{l})$ .

Proposition 2

The gradient of $f({\bm{\phi}}_{l})$ regarding ${\bm{\phi}}_{l}^{*}$ is obtained in closed-form as

\displaystyle\nabla_{{\bm{\phi}}_{l}}f({\bm{\phi}}_{l})

\displaystyle=\frac{1}{\log_{2}(e)}\sum_{k=1}^{K}\frac{{I}_{k}\nabla_{{\bm{% \phi}}_{l}}D_{k}-D_{k}\nabla_{{\bm{\phi}}_{l}}{I}_{k}}{(1+\gamma_{k}){I}_{k}},

(12mt)

where

with ${\mathbf{A}}_{l}={\bm{\Phi}}_{L}{\mathbf{W}}^{L}\cdots{\bm{\Phi}}_{l+1}{% \mathbf{W}}^{l+1}$ , and ${\mathbf{C}}_{l}={\mathbf{W}}^{l}{\bm{\Phi}}_{l-1}{\mathbf{W}}^{l-1}\cdots{\bm% {\Phi}}_{1}$ .

Proof:

Please see Appendix A. ∎

The SIM optimization design, based on the gradient ascent, appears a significant advantage because the gradient ascent is obtained in a closed form. It has low computational complexity because it consists of simple matrix operations. Specifically, the complexity of (12ma) for large SIMs is $\mathcal{O}(N_{t}N^{2}+LN^{2}+KN^{3})$ , and the complexity of (12mt) is similar. In other words, the number of meta-atoms of each surface has a higher impact.

IV-C Power Optimization

Given a fixed ${\bm{\Phi}}_{l}$ , we focus on the optimization with respect to ${\mathbf{p}}$ . Specifically, we have


$\displaystyle(\mathcal{P}2)~{}~{}$	$\displaystyle\max_{{\mathbf{p}}}\;f({\mathbf{p}})$	(12mua)
	$\displaystyle\;\quad\;\;\;\;\;\!\!~{}\!\sum_{i=1}^{K}p_{i}=P_{\mathrm{T}},~{}p% _{k}\geq 0,\forall k\in\mathcal{K}.$	(12mub)

The nonconvexity of problem $(\mathcal{P}2)$ leads us to obtain a solution which is locally optimal. For this reason, we apply a weighted minimum mean square error (MMSE) reformulation of the sum SE. To this end, we denote ${\mathbf{c}}\!=\![c_{1},\ldots,c_{K}]^{{\scriptscriptstyle\mathsf{T}}}$ . Then, the SINR $\gamma_{k}$ can be written in terms of the vector ${\mathbf{p}}$ as

\displaystyle\gamma_{k}=\frac{p_{k}q_{k}}{{\mathbf{c}}^{{\scriptscriptstyle% \mathsf{T}}}{\mathbf{p}}+u_{k}^{2}},

(12mv)

where

	$\displaystyle q_{k}=\kappa_{k}\|{\mathbf{h}}_{k,\mathrm{LoS}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{k}^{1}\|^{2},c_{k}=% \beta_{k}\frac{1}{1+\kappa_{k}}\tr({\mathbf{G}}{\mathbf{w}}_{k}^{1}{\mathbf{w}% }_{k}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{R}}),$
	$\displaystyle t_{k}^{2}=\frac{\sigma_{k}^{2}(1+\kappa_{k})}{\beta_{k}},c_{i}=% \beta_{k}\frac{1}{1+\kappa_{k}}\tr({\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}% }_{i}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{R}})$
	$\displaystyle+\beta_{k}\frac{\kappa_{k}}{1+\kappa_{k}}{\mathbf{h}}_{k,\mathrm{% LoS}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf% {w}}_{i}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{h}}_{k,\mathrm{LoS}},~{}\forall i\neq k.$		(12mw)

Now, we consider the single-input and single-output (SISO) channel model that comes from the SINR in (12mv), which is given by

\displaystyle\tilde{y}_{k}=\sqrt{p_{k}q_{k}}s_{k}+\sum_{i=1}^{K}\sqrt{p_{i}c_{% i}}s_{i}+n_{k},

(12mx)

where $n_{k}\sim{\cal C}{\cal N}\left(0,u_{k}^{2}\right)$ while $s_{i}\in\mathbb{C}$ is the data signal with unit variance, and $\tilde{y}_{k}$ is the received signal.

Then, the receiver estimates $s_{k}$ , i.e., $\hat{s}_{k}=v_{k}^{*}\tilde{y}_{k}$ with $v_{k}$ being a receiver coefficient. The corresponding mean square error $e_{k}({\mathbf{p}},v_{k})=[|\hat{s}_{k}-s_{k}|^{2}]$ becomes

\displaystyle e_{k}({\mathbf{p}},v_{k})=v_{kg}^{2}\left(p_{k}q_{k}+{\mathbf{c}% }_{k}^{{\scriptscriptstyle\mathsf{T}}}{\mathbf{p}}+u_{k}^{2}\right)-2v_{k}% \sqrt{q_{k}p_{k}}+1.

(12my)

For a given ${\mathbf{p}}$ , $v_{k}$ is obtained by the minimization of the MSE as

\displaystyle v_{k}=\frac{\sqrt{p_{k}q_{k}}}{p_{k}q_{k}+\sum_{i=1}^{K}p_{i}c_{% i}+u_{k}^{2}}.

(12mz)

Inserting $v_{k}$ into (12my), $e_{k}$ becomes $1/\left(1+\gamma_{k}\right)$ . Based on the weighted MMSE method, let the auxiliary weight $d_{k}\geq 0$ for the MSE $e_{k}$ and consider the problem

\displaystyle\begin{split}(\mathcal{P}2.1)\min_{\begin{subarray}{c}{\mathbf{p}% }\geq 0,\\ \{v_{k},d_{k}\geq 0:k=1,\ldots,K\}\end{subarray}}&{K}\sum_{k=1}^{K}d_{k}e_{k}(% {\mathbf{p}},{\mathbf{v}}_{k})-\ln(d_{k})\\ \mathrm{s.t}~{}~{}\;\!&\sum_{i=1}^{K}p_{i}\leq P_{\mathrm{T}}.\end{split}

(12maa)

Problems $(\mathcal{P}2)$ and $(\mathcal{P}2.1)$ are equivalent, and thus, are subject to the same solution. The solution of $(\mathcal{P}2.1)$ can be provided in closed form as

\displaystyle p_{i}=\min\left(P_{\mathrm{T}},\frac{q_{k}d_{k}^{2}v_{k}^{2}}{% \left(q_{k}d_{k}v_{k}^{2}+\sum_{i=1}^{K}d_{i}v_{i}^{2}c_{i}\right)^{2}}\right).

(12mab)

The power allocation presents a similar complexity to the SIM optimization design since it consists of similar matrix operations to $(\mathcal{P}1)$ , i.e., its complexity is $\mathcal{O}(N_{t}N^{2}+LN^{2}+KN^{3})$ .

Remark 1

Both algorithms, corresponding to Problems $(\mathcal{P}1)$ and $(\mathcal{P}2)$ , have low computation complexity and converge quickly. Note that the achievement of a local optimum, obtained from this optimization, will make different initializations result in different solutions, which will be studied below.

V Numerical Results

In this section, we present and evaluate the performance of the achievable sum SE of large SIM-assisted multiuser communications with statistical CSI by showing both analytical results and Monte Carlo simulations. For the setup, we assume that the SIM is parallel to the $x-y$ plane and centered along the $z-$ axis at a height $H_{\mathrm{BS}}=10~{}\mathrm{m}$ . The spacing between adjacent meta-atoms is assumed to be $\lambda/2$ , and the size of each meta-atom is $\lambda/2\times\lambda/2$ . The thickness of the SIM is $T_{\mathrm{SIM}}=5\lambda$ , while the spacing is $d_{\mathrm{SIM}}=T_{\mathrm{SIM}}/L$ . Moreover, the locations of the users are randomly distributed at a distance between $60\mathrm{m}$ and $80\mathrm{m}$ .

The distance between the $\tilde{n}-$ th meta-atom of the $(l-1)-$ st metasurface and the ${n}-$ th meta-atom of the $l-$ st metasurface is given by $d_{n,\tilde{n}}^{l}=\sqrt{d_{\mathrm{SIM}}^{2}+d_{n,\tilde{n}}^{2}}$ , where

\displaystyle\!\!d_{n,\tilde{n}}\!=\!\frac{\lambda}{2}\sqrt{\lfloor|n-\tilde{n% }|/N_{x}\rfloor^{2}\!+\![\mathrm{mod}(|n-\tilde{n}|,N_{x})]^{2}}.

(12mac)

The transmission distance between the $m$ -th antenna and the $\tilde{n}$ -th meta-atom on the first metasurface layer is provided by (12mad). Note that we have $\cos x_{n,\tilde{n}}^{l}=d_{\mathrm{SIM}}/d_{n,\tilde{n}}^{l},\forall l\in% \mathcal{L}$ .

\displaystyle{\small d_{\tilde{n},m}^{1}\!=\!\sqrt{\!d_{\mathrm{SIM}}^{2}\!+\!% \Big{[}\!\Big{(}\!\mathrm{mod}(\tilde{n}\!-\!1,N_{x})\!-\!\frac{N_{x}\!-\!1}{2% }\!\Big{)}\frac{\lambda}{2}\!-\!\Big{(}m\!-\!\frac{N_{t}\!+\!1}{2}\Big{)}\frac% {\lambda}{2}\Big{]}^{2}\!+\!\Big{(}\!\lceil\tilde{n}/N_{x}\rceil\!-\!\frac{N_{% y}\!+\!1}{2}\Big{)}^{2}\frac{\lambda_{2}}{4}}}.

(12mad)

The path loss is given by

\displaystyle\tilde{\beta}_{k}=C_{0}(d_{k}/\hat{d})^{-\alpha},

(12mae)

where $C_{0}=(\lambda_{2}/4\pi\hat{d})$ is the free space path loss at the reference distance of $\hat{d}=1~{}\mathrm{m}$ , and $\alpha=2.5$ is the path-loss exponent. The correlation matrix ${\mathbf{R}}_{\mathrm{SIM}}$ is obtained according to (4). The carrier frequency and the system bandwidth are $2~{}\mathrm{GHz}$ and $20~{}\mathrm{MHz}$ , respectively. Moreover, we assume $N_{t}=8$ , $K=8$ , $N=200$ , and $L=4$ .

In Fig. 2, we depict the achievable sum SE versus the number of elements $N$ of each surface while varying the number of surfaces $L$ . First, it is shown that the downlink sum SE increases with $N$ for different $L$ . Moreover, an increase in the number of surfaces results in an increase in the sum SE. In addition, for the sake of comparison, we present the performance in the case of instantaneous CSI for $L=4$ [8], which performs better than the case of statistical CSI since the latter is obtained based on a lower bound that is optimized at every several coherence intervals instead of at each coherence interval. However, the statistical CSI modeling allows to save significant overhead. Moreover, we show the effect of the size of each surface element. We observe that as the size of each surface element decreases, the correlation decreases, and the sum SE increases. Notably, Monte Carlo (MC) simulations corroborate the analytical results.

Fig. 3 shows the sum SE versus the number of layers $L$ of the SIM for the number of UEs $K$ . When $N_{t}=K=8$ , we observe that the sum SE improves until $L=6$ because the SIM is able to mitigate the inter-user interference in the EM wave domain. In particular, a significant improvement is observed compared to the single-layer SIM. Again, we illustrate the comparison between the cases corresponding to instantaneous and statistical CSI, where the latter exhibits worse performance for the benefit of lower overhead.

In Fig. 4, we depict the convergence of the proposed algorithm. The algorithm terminates when the difference of the objective between the two last iterations is less than $10^{-5}$ or the number of iterations is larger than $130$ . It is shown that the algorithm converges to its maximum as the number of iterations increases. Since Problem $(\mathcal{P})$ is non-convex, the algorithm does not converge to a globally optimal solution. This means that the algorithm may converge to different points starting from different initial points. For this reason, we select the best solutions after executing the algorithm from different initial points. Herein, we depict the sum SE versus the iteration count for $5$ different randomly generated initial points, and we observe that all points result in the same SE. Generally, the selection of $5$ randomly generated initial points allows a good trade-off between performance and complexity.

VI Conclusion

This paper provided the achievable downlink SE of large SIM-aided multiuser communications over Ricean fading channels. In particular, we deduced a new, tractable expression for the downlink SE for large-metasurfaces as a function of large-scale statistics while the downlink precoding takes place in the wave domain, in order to lower the computational burden and the processing latency. The results were used to pursue an optimization based on statistical CSI that achieves lower overhead with respect to instantaneous CSI. Specifically, we proposed an AO algorithm that solves the optimization regarding the phase shifts of each surface of the SIM and the allocated power, which contributes to a reduction in processing latency due to lower overhead.

Appendix A Proof of Proposition 2

The proof starts with the derivation of $\nabla_{{\bm{\phi}}_{l}}D_{k}$ . To this end, we focus on the differential of $|{\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{G}}{% \mathbf{w}}_{k}^{1}|^{2}$ . We have

	$\displaystyle d(\|{\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{H}}% }{\mathbf{G}}{\mathbf{w}}_{k}^{1}\|^{2})={\mathbf{h}}_{k,\mathrm{LoS}}^{{% \scriptscriptstyle\mathsf{H}}}d({\mathbf{G}}){\mathbf{w}}_{k}^{1}{\mathbf{w}}_% {k}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{h}}_{k,\mathrm{LoS}}$
	$\displaystyle+{\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{G}}{\mathbf{w}}_{k}^{1}{\mathbf{w}}_{k}^{1^{{\scriptscriptstyle\mathsf% {H}}}}d({\mathbf{G}}^{{\scriptscriptstyle\mathsf{H}}}){\mathbf{h}}_{k,\mathrm{% LoS}}$		(12maf)
	$\displaystyle=\tr\big({\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle% \mathsf{H}}}{\mathbf{A}}_{l}d({\bm{\Phi}}_{l}){\mathbf{C}}_{l}{\mathbf{w}}_{k}% ^{1}{\mathbf{w}}_{k}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{h}}_{k,\mathrm{LoS}}\big{missing})$
	$\displaystyle+\tr({\mathbf{A}}_{l}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{h}% }_{k,\mathrm{LoS}}{\mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{H}% }}{\mathbf{G}}{\mathbf{w}}_{k}^{1}{\mathbf{w}}_{k}^{1^{{\scriptscriptstyle% \mathsf{H}}}}{\mathbf{C}}_{l}^{{\scriptscriptstyle\mathsf{H}}}d({\bm{\Phi}}_{l% }^{{\scriptscriptstyle\mathsf{H}}})).$		(12mag)

In (12mag), we have substituted $d({\mathbf{G}})={\mathbf{A}}_{l}d({\bm{\Phi}}_{l}){\mathbf{C}}_{l}$ , where ${\mathbf{A}}_{l}={\bm{\Phi}}_{L}{\mathbf{W}}^{L}\cdots{\bm{\Phi}}_{l+1}{% \mathbf{W}}^{l+1}$ , and ${\mathbf{C}}_{l}={\mathbf{W}}^{l}{\bm{\Phi}}_{l-1}{\mathbf{W}}^{l-1}\cdots{\bm% {\Phi}}_{1}$ . Having obtained the differential, we have

	$\displaystyle\nabla_{{\bm{\phi}}_{l}}D_{k}$	$\displaystyle=\frac{\partial}{\partial{\bm{\phi}}_{l}^{*}}D_{k}$		(12mah)
		$\displaystyle=p_{k}\kappa_{k}\text{diag}\left(\right)({\mathbf{C}}_{l}^{}{% \mathbf{w}}_{k}^{1^{}}{\mathbf{w}}_{k}^{1^{{\scriptscriptstyle\mathsf{T}}}}{% \mathbf{G}}^{{\scriptscriptstyle\mathsf{T}}}{\mathbf{h}}_{k,\mathrm{LoS}}^{}{% \mathbf{h}}_{k,\mathrm{LoS}}^{{\scriptscriptstyle\mathsf{T}}}{\mathbf{A}}_{l}^% {})$

The first term in the denominator is written as

	$\displaystyle d\big{(}\tr\big({\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}% ^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{% H}}}{\mathbf{R}}\big{missing})\big{)}=\tr\big({\mathbf{A}}_{l}d({\bm{\Phi}}_{l% }){\mathbf{C}}_{l}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}^{1^{{\scriptscriptstyle% \mathsf{H}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{H}}}{\mathbf{R}}\big{% missing})$
	$\displaystyle+\tr\big({\mathbf{A}}_{l}^{{\scriptscriptstyle\mathsf{H}}}{% \mathbf{R}}{\mathbf{G}}{\mathbf{w}}_{i}^{1}{\mathbf{w}}_{i}^{1^{{% \scriptscriptstyle\mathsf{H}}}}{\mathbf{C}}_{l}^{{\scriptscriptstyle\mathsf{H}% }}d({\bm{\Phi}}_{l}^{{\scriptscriptstyle\mathsf{H}}})\big{missing}).$		(12mai)

Thus, we have

\displaystyle\!\nabla_{{\bm{\phi}}_{l}}\!\tr\big({\mathbf{G}}{\mathbf{w}}_{i}^% {1}{\mathbf{w}}_{i}^{1^{{\scriptscriptstyle\mathsf{H}}}}{\mathbf{G}}^{{% \scriptscriptstyle\mathsf{H}}}{\mathbf{R}}\big{missing})\!=\!\text{diag}\left(% \right)({\mathbf{C}}_{l}^{*}{\mathbf{w}}_{i}^{1^{*}}{\mathbf{w}}_{i}^{1^{{% \scriptscriptstyle\mathsf{T}}}}{\mathbf{G}}^{{\scriptscriptstyle\mathsf{T}}}{% \mathbf{R}}{\mathbf{A}}_{l}^{*})

The second term in the denominator is similar to the numerator. Hence, the derivation is similar.

References

[1] M. Di Renzo et al., “Smart radio environments empowered by reconfigurable intelligent surfaces: How it works, state of research, and the road ahead,” IEEE J. Sel. Areas Commun., vol. 38, no. 11, pp. 2450–2525, 2020.
[2] Y. Liu et al., “Reconfigurable intelligent surfaces: Principles and opportunities,” IEEE Commun. Surveys & Tuts, vol. 23, no. 3, pp. 1546–1577, 2021.
[3] C. Huang et al., “Reconfigurable intelligent surfaces for energy efficiency in wireless communication,” IEEE Transa. Wireless Commun., vol. 18, no. 8, pp. 4157–4170, 2019.
[4] A. Papazafeiropoulos et al., “Intelligent reflecting surface-assisted MU-MISO systems with imperfect hardware: Channel estimation and beamforming design,” IEEE Trans. Wireless Commun., vol. 21, no. 3, pp. 2077–2092, 2021.
[5] ——, “Achievable rate of a STAR-RIS assisted massive MIMO system under spatially-correlated channels,” IEEE Trans. Wireless Commun., pp. 1–1, 2023.
[6] H. Guo et al., “Weighted sum-rate maximization for reconfigurable intelligent surface aided wireless networks,” IEEE Trans. Wireless Commun., vol. 19, no. 5, pp. 3064–3076, 2020.
[7] J. An et al., “Stacked intelligent metasurfaces for efficient holographic MIMO communications in 6G,” IEEE J. Sel. Areas Commun., 2023.
[8] ——, “Stacked intelligent metasurfaces for multiuser downlink beamforming in the wave domain,” arXiv preprint arXiv:2309.02687, 2023.
[9] A. Papazafeiropoulos et al., “Achievable rate optimization for stacked intelligent metasurface-assisted holographic MIMO communications,” arXiv preprint arXiv:2402.16415.
[10] A. Papazafeiropoulos, P. Kourtessis, and S. Chatzinotas, “Performance of double-stacked intelligent metasurface-assisted multiuser massive MIMO communications in the wave domain,” arXiv preprint arXiv:2402.16405, 2024.
[11] S. Lin et al., “Stacked intelligent metasurface enabled LEO satellite communications relying on statistical CSI,” IEEE Wireless Commun. Let., 2024.
[12] E. Björnson et al., “Massive MIMO networks: Spectral, energy, and hardware efficiency,” Foundations and Trends® in Signal Processing, vol. 11, no. 3-4, pp. 154–655, 2017.
[13] C. Liu et al., “A programmable diffractive deep neural network based on a digital-coding metasurface array,” Nature Electronics, vol. 5, no. 2, pp. 113–122, 2022.
[14] E. Björnson and L. Sanguinetti, “Rayleigh fading modeling and channel hardening for reconfigurable intelligent surfaces,” IEEE Wireless Commun. Lett., vol. 10, no. 4, pp. 830–834, 2021.

\Closesolutionfile

solutionfile