Achievable Rate Optimization for Large Stacked Intelligent Metasurfaces Based on Statistical CSI
Anastasios Papazafeiropoulos, Pandelis Kourtessis, Symeon Chatzinotas, Dimitra I. Kaklamani, Iakovos S. Venieris
A. Papazafeiropoulos is with the Communications and Intelligent Systems Research Group, University of Hertfordshire, Hatfield AL10 9AB, U. K., and with SnT at the University of Luxembourg, Luxembourg. P. Kourtessis is with the Communications and Intelligent Systems Research Group, University of Hertfordshire, Hatfield AL10 9AB, U. K. S. Chatzinotas is with the SnT at the University of Luxembourg, Luxembourg. Dimitra I. Kaklamani is with the Microwave and Fiber Optics Laboratory, and Iakovos S. Venieris is with the Intelligent Communications and Broadband Networks Laboratory, School of Electrical and Computer Engineering, National Technical University of Athens, Zografou, 15780 Athens, Greece. Corresponding author’s email: [email protected].
Abstract
Stacked intelligent metasurface (SIM) is an emerging design that consists of multiple layers of metasurfaces. A SIM enables holographic
multiple-input multiple-output (HMIMO) precoding in the wave domain, which results in the reduction of energy consumption and hardware cost. On the ground of multiuser beamforming, this letter focuses on the downlink achievable rate and its maximization. Contrary to previous works on multiuser SIM, we consider statistical channel state information (CSI) as opposed to instantaneous CSI to overcome challenges such as large overhead. Also, we examine the performance of large surfaces. We apply an alternating optimization (AO) algorithm regarding the phases of the SIM and the allocated transmit power. Simulations illustrate the performance of the considered large SIM-assisted design as well as the comparison between different CSI considerations.
The technology of reconfigurable intelligent surfaces (RISs) has recently emerged to increase coverage and enhance spectral and energy efficiencies in various communication environments [1, 2]. In general terms, an RIS includes a surface that includes a large number of elements, which are nearly passive and have low cost. The purpose of these elements is to adjust the phases of the incident electromagnetic (EM) waves by using a smart controller, and hence, shape the propagation environment dynamically [3, 4, 5].
However, most existing works on RIS assume single-layer metasurface structure [3, 4, 6], which imposes a constraint on the adjustment of the beam patterns. Also, the single-layer structures of RISs do not have the capability of inter-user interference suppression as shown in [6]. These observations led the authors in [7, 8] to propose a stacked intelligent metasurface (SIM), which consists of an array of programmable metasurfaces similar to artificial neural networks (ANNs). Among the processing capabilities of a SIM, we that the forward propagation takes place at the speed of light.
On this ground, in [7], authors proposed a SIM-based design for the transceiver of point-to-point multiple-input multiple-output (MIMO) communication systems, where the combining and the precoding take place as the EM waves propagate along the SIM. In [8], we observe the integration of a SIM to the transmitter, i.e., the base station (BS) towards enabling beamforming in the EM domain based on instantaneous channel state information (CSI). Contrary to [7] and [8], in [9] and [10], we proposed more general hybrid digital wave designs, where all element parameters are optimised simultaneously through more efficient algorithms.
In this work, we focus on a SIM-enabled multiuser architecture operating solely in the wave domain. Note that [10] assumes a hybrid digital wave design, and [11] focuses on satellite communication systems. Also, contrary to previous works [7, 8, 9], we consider a SIM that consists of large metasurfaces, since we apply the use-and-then-forget (UatF) bound [12]. Most importantly, we obtain the downlink rate and perform its optimization regarding the phase shifts and transmit power in terms of statistical CSI. Notably, this approach enables the optimization at every several coherence intervals rather than optimizing at each interval. Hence, we achieve lower overhead, which is one of the main challenges in SIM-assisted systems.
Notation: Matrices and vectors are represented by boldface upper and lower case symbols, respectively. The notations , , and denote the transpose, Hermitian transpose, and trace operators, respectively. Also, the symbol denotes the expectation operator. The floor function gives as output the greatest integer less than or equal to . The notation represents a vector with elements equal to the diagonal elements of . The notation represents a circularly symmetric complex Gaussian vector with zero mean and a covariance matrix .
II System Model
We consider a SIM-aided MIMO communication system as depicted in Fig. 1. In particular, a BS, which includes antennas, communicates with single-antenna user equipments (UEs) through a SIM performing wave-based processing. The SIM is implemented by metasurfaces, where each one has a large number of meta-atoms. Let , , and denote the sets of UEs, metasurfaces, and meta-atoms, respectively. Note that an intelligent controller adjusts the shifts of the phases of the electromagnetic (EM) waves that im**e on the metasurface layers.
On this basis, let be the phase shift by the th meta-atom on the surface layer . Also, we denote , and , where .111Herein, we consider phase shifts, which are continuously-adjustable and their modulus equals to to evaluate large SIM-aided MIMO communications. Practical issues such as the consideration of discrete phase shifts [13] is the topic of future work. In addition, denotes the coefficient matrix between layer and layer . In particular, its entries from meta-atom on layer to meta-atom on layer are given by
(1)
where is the area of each meta-atom at the SIM, denotes the angle between the normal direction of the transmit metasurface layer and the propagation direction, , is the respective transmission distance. Moreover, let express the coefficient from the transmit antenna array. Thus, the impact of the SIM can be expressed as
(2)
Let express the channel between the last layer and UE that is described by the correlated Rician fading distribution as
(3)
In (3), is the Rician factor, is the channel gain, is the LoS component, and is the NLoS component with representing the spatial correlation of each surface. This correlation is obtained as [14]
(4)
where with and being the horizontal and vertical indices of element , respectively. and are the elements per row and column, while and denote the horizontal width and the vertical height.
III Downlink Data Transmission
During the downlink transmission and based on wave-based beamforming [8], the received signal at the -th UE is written as
(5)
where is
the information symbol intended for the -th UE, which has
a zero mean and unit variance. Also, is the power corresponding to the -th UE with , where is the total transmit power at the BS. Also denotes the additive white Gaussian noise (AWGN) with expressing its variance at UE .
The downlink achievable SE of UE is given by
(6)
where denotes the downlink signal-to-interference-plus-noise ratio (SINR), which is written according to the UaTF bounding technique [12] as
(7)
where is assumed that UE has knowledge of the average effective channel.
Proposition 1
The achievable SINR of UE for a given SIM during the downlink transmission is provided by (8).
(8)
Proof:
The numerator becomes
(9)
Regarding the denominator of (7), the first term is written as
(10)
(11)
where, in (10), we have applied that for any vectors , .
By substituting (9) and (11) in (7), we obtain the achievable SINR.
∎
IV Problem Formulation and Optimization
The maximization of the sum SE regarding the phase shifts of each surface and the allocated power is of great importance.
IV-AProblem Formulation
The maximization problem is formulated as
(12a)
(12b)
(12c)
(12d)
(12e)
where and are the numerator and denominator of obtained in Proposition 1. Also, we have defined the vector . Note that
the constraint (12c) expresses that each RIS element provides only a phase shift while (12d) corresponds to the maximum power constraint.
The non-convexity optimization problem and its dependence on the unit-modulus constraint with respect to make the solution challenging. For this reason, we resort to alternating optimization (AO). According to this technique,
and will be optimized individually in an iterative manner. Specifically, first, we find the optimum for a fixed . During the next step, we solve for with fixed. The objective converges to its optimum value by iterating this process, which leads to the increase of after each step until a specific point because of the upper-bound coming from the power constraint (12d).
IV-BSIM Optimization
Until now, was assumed fixed. However, to exploit each metasurface towards wave-based beamforming while maximizing (6), the optimization of each has to take place. Its presence is observed inside the matrix , appearing in and . Hence, the maximization problem regarding is described as
(12ma)
(12mb)
(12mc)
where the maximization problem is non-convex regarding , and it obeys to a unit-modulus constraint with respect to . Application of the projected gradient ascent algorithm until convergence while taking into account the unit-modulus constraint results in a locally optimal solution to .
The proposed algorithm suggests starting from , and then shifting along the gradient of . The new point is projected onto to hold the new points in the feasible set. Specifically, the unit-modulus constraint means that has to be found inside the unit circle. is the projection onto . Hence, we have
(12mp)
where the vector of is a given point.
The algorithm is described by the following iteration
(12mq)
The Armijo-Goldstein backtracking line search method provides the step size, which is , where and . Note that is the
smallest positive integer that satisfies
(12mr)
where
(12ms)
is the quadratic approximation of .
Proposition 2
The gradient of regarding is obtained in closed-form as
The SIM optimization design, based on the gradient ascent, appears a significant advantage because the gradient ascent is obtained in a closed form. It has low computational complexity because it consists of simple matrix operations. Specifically, the complexity of (12ma) for large SIMs is , and the complexity of (12mt) is similar. In other words, the number of meta-atoms of each surface has a higher impact.
IV-CPower Optimization
Given a fixed , we focus on the optimization with respect to . Specifically, we have
(12mua)
(12mub)
The nonconvexity of problem leads us to obtain a solution which is locally optimal. For this reason, we apply a weighted minimum mean square error (MMSE) reformulation of the sum SE. To this end, we denote . Then, the SINR can be written in terms of the vector as
(12mv)
where
(12mw)
Now, we consider the single-input and single-output (SISO) channel model that comes from the SINR in (12mv), which is given by
(12mx)
where while is the data signal with unit variance, and is the received signal.
Then, the receiver estimates , i.e., with being a receiver coefficient. The corresponding mean square error becomes
(12my)
For a given , is obtained by the minimization of the MSE as
(12mz)
Inserting into (12my), becomes . Based on the weighted MMSE method, let the auxiliary weight for the MSE and consider the problem
(12maa)
Problems and are equivalent, and thus, are subject to the same solution. The solution of can be provided in closed form as
(12mab)
The power allocation presents a similar complexity to the SIM optimization design since it consists of similar matrix operations to , i.e., its complexity is .
Remark 1
Both algorithms, corresponding to Problems and , have low computation complexity and converge quickly. Note that the achievement of a local optimum, obtained from this optimization, will make different initializations result in different solutions, which will be studied below.
VNumerical Results
In this section, we present and evaluate the performance of the achievable sum SE of large SIM-assisted multiuser communications with statistical CSI by showing both analytical results and Monte Carlo simulations. For the setup, we assume that the SIM is parallel to the plane and centered along the axis at a height . The spacing between adjacent meta-atoms is assumed to be , and the size of each meta-atom is . The thickness of the SIM is , while the spacing is . Moreover, the locations of the users are randomly distributed at a distance between and .
The distance between the th meta-atom of the st metasurface and the th meta-atom of the st metasurface is given by , where
(12mac)
The transmission distance between the -th antenna and the -th meta-atom on the first metasurface layer is provided by (12mad). Note that we have .
(12mad)
The path loss is given by
(12mae)
where is the free space path loss at the reference distance of , and is the path-loss exponent. The correlation matrix is obtained according to (4). The carrier frequency and the system bandwidth are and , respectively. Moreover, we assume , , , and .
In Fig. 2, we depict the achievable sum SE versus the number of elements of each surface while varying the number of surfaces . First, it is shown that the downlink sum SE increases with for different . Moreover, an increase in the number of surfaces results in an increase in the sum SE. In addition, for the sake of comparison, we present the performance in the case of instantaneous CSI for [8], which performs better than the case of statistical CSI since the latter is obtained based on a lower bound that is optimized at every several coherence intervals instead of at each coherence interval. However, the statistical CSI modeling allows to save significant overhead. Moreover, we show the
effect of the size of each surface element. We observe that as the size of each surface element decreases, the correlation decreases, and the sum SE increases. Notably, Monte Carlo (MC) simulations corroborate the analytical results.
Fig. 3 shows the sum SE versus the number of layers of the SIM for the number of UEs . When , we observe that the sum SE improves until because the SIM is able to mitigate the inter-user interference in the EM wave domain. In particular, a significant improvement is observed compared to the single-layer SIM. Again, we illustrate the comparison between the cases corresponding to instantaneous and statistical CSI, where the latter exhibits worse performance for the benefit of lower overhead.
In Fig. 4, we depict the convergence of the proposed algorithm. The algorithm terminates when the difference of the objective between the two last iterations is less than or the number of iterations is larger than . It is shown that the algorithm converges to its maximum as the number of iterations increases. Since Problem is non-convex, the algorithm does not converge to a globally optimal solution. This means that the algorithm may converge to different points starting from different initial points. For this reason, we select the best solutions after executing the algorithm from different initial points. Herein, we depict the sum SE versus the iteration count for different randomly generated initial points, and we observe that all points result in the same SE. Generally, the selection of randomly generated initial points allows a good trade-off between performance and complexity.
VIConclusion
This paper provided the achievable downlink SE of large SIM-aided multiuser communications over Ricean fading channels. In particular, we deduced a new, tractable expression for the downlink SE for large-metasurfaces as a function of large-scale statistics while the downlink precoding takes place in the wave domain, in order to lower the computational burden and the processing latency. The results were used to pursue an optimization based on statistical CSI that achieves lower overhead with respect to instantaneous CSI. Specifically, we proposed an AO algorithm that solves the optimization regarding the phase shifts of each surface of the SIM and the allocated power, which contributes to a reduction in processing latency due to lower overhead.
The proof starts with the derivation of . To this end, we focus on the differential of . We have
(12maf)
(12mag)
In (12mag), we have substituted , where , and . Having obtained the differential, we have
(12mah)
The first term in the denominator is written as
(12mai)
Thus, we have
The second term in the denominator is similar to the numerator. Hence, the derivation is similar.
References
[1]
M. Di Renzo et al., “Smart radio environments empowered by
reconfigurable intelligent surfaces: How it works, state of research, and
the road ahead,” IEEE J. Sel. Areas Commun., vol. 38, no. 11, pp.
2450–2525, 2020.
[2]
Y. Liu et al., “Reconfigurable intelligent surfaces: Principles and
opportunities,” IEEE Commun. Surveys & Tuts, vol. 23, no. 3, pp.
1546–1577, 2021.
[3]
C. Huang et al., “Reconfigurable intelligent surfaces for energy
efficiency in wireless communication,” IEEE Transa. Wireless Commun.,
vol. 18, no. 8, pp. 4157–4170, 2019.
[4]
A. Papazafeiropoulos et al., “Intelligent reflecting surface-assisted
MU-MISO systems with imperfect hardware: Channel estimation and
beamforming design,” IEEE Trans. Wireless Commun., vol. 21, no. 3,
pp. 2077–2092, 2021.
[5]
——, “Achievable rate of a STAR-RIS assisted massive MIMO system under
spatially-correlated channels,” IEEE Trans. Wireless Commun., pp.
1–1, 2023.
[6]
H. Guo et al., “Weighted sum-rate maximization for reconfigurable
intelligent surface aided wireless networks,” IEEE Trans. Wireless
Commun., vol. 19, no. 5, pp. 3064–3076, 2020.
[7]
J. An et al., “Stacked intelligent metasurfaces for efficient
holographic MIMO communications in 6G,” IEEE J. Sel. Areas
Commun., 2023.
[8]
——, “Stacked intelligent metasurfaces for multiuser downlink beamforming
in the wave domain,” arXiv preprint arXiv:2309.02687, 2023.
[9]
A. Papazafeiropoulos et al., “Achievable rate optimization for stacked
intelligent metasurface-assisted holographic MIMO communications,”
arXiv preprint arXiv:2402.16415.
[10]
A. Papazafeiropoulos, P. Kourtessis, and S. Chatzinotas, “Performance of
double-stacked intelligent metasurface-assisted multiuser massive MIMO
communications in the wave domain,” arXiv preprint arXiv:2402.16405,
2024.
[11]
S. Lin et al., “Stacked intelligent metasurface enabled LEO satellite
communications relying on statistical CSI,” IEEE Wireless Commun.
Let., 2024.
[12]
E. Björnson et al., “Massive MIMO networks: Spectral, energy, and
hardware efficiency,” Foundations and Trends® in
Signal Processing, vol. 11, no. 3-4, pp. 154–655, 2017.
[13]
C. Liu et al., “A programmable diffractive deep neural network based on
a digital-coding metasurface array,” Nature Electronics, vol. 5,
no. 2, pp. 113–122, 2022.
[14]
E. Björnson and L. Sanguinetti, “Rayleigh fading modeling and channel
hardening for reconfigurable intelligent surfaces,” IEEE Wireless
Commun. Lett., vol. 10, no. 4, pp. 830–834, 2021.