License: arXiv.org perpetual non-exclusive license
arXiv:2312.15832v1 [cs.IT] 26 Dec 2023

Tomlinson-Harashima Cluster-Based Precoders for Cell-Free MU-MIMO Networks

André R. Flores11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPT, Rodrigo C. de Lamare1,212{}^{1,2}start_FLOATSUPERSCRIPT 1 , 2 end_FLOATSUPERSCRIPT and Kumar Vijay Mishra33{}^{3}start_FLOATSUPERSCRIPT 3 end_FLOATSUPERSCRIPT
11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPTCentre for Telecommunications Studies, Pontifical Catholic University of Rio de Janeiro, Brazil
22{}^{2}start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPTDepartment of Electronic Engineering, University of York, United Kingdom
33{}^{3}start_FLOATSUPERSCRIPT 3 end_FLOATSUPERSCRIPTUnited States DEVCOM Army Research Laboratory, Adelphi, MD 20783 USA
Emails: {andre.flores, delamare}@cetuc.puc-rio.br, [email protected]
Abstract

Cell-free (CF) multiple-input multiple-output (MIMO) systems generally employ linear precoding techniques to mitigate the effects of multiuser interference. However, the power loss, efficiency, and precoding accuracy of linear precoders are usually improved by replacing them with nonlinear precoders that employ perturbation and modulo operation. In this work, we propose nonlinear user-centric precoders for CF MIMO, wherein different clusters of access points (APs) serve different users in CF multiple-antenna networks. Each cluster of APs is selected based on large-scale fading coefficients. The clustering procedure results in a sparse nonlinear precoder. We further devise a reduced-dimension nonlinear precoder, where clusters of users are created to reduce the complexity of the nonlinear precoder, the amount of required signaling, and the number of users. Numerical experiments show that the proposed nonlinear techniques for CF systems lead to an enhanced performance when compared to their linear counterparts.

Index Terms:
Cell-free wireless networks, multiple-antenna systems, multiuser interference, nonlinear precoding, Tomlinson-Harashima precoding.

I Introduction

Coordinated base stations (BSs) have been deployed worldwide to establish cellular network services. However, wireless applications are evolving constantly with an increasing demand for more resources [1, 2, 3, 4]. For high throughput and quality-of-service required for future networks, it is desired to further densify BSs. However, this approach is impractical. As an alternative, cell-free (CF) multiple-input multiple output (MIMO) systems have emerged as a potential solution to improve the performance and satisfy throughout requirements of future wireless networks [5, 6].

Compared to conventional BS-based networks, CF MU-MIMO systems employ multiple APs distributed geographically over the area of interest. A central processing unit (CPU), which may be located at the cloud server, coordinates the APs. The distributed deployment of CF networks yields higher coverage than the BSs with collocated antennas [7]. In addition, CF multiuser MIMO (MU-MIMO) has been shown to provide increased throughput per user [8, 9] as well as better performance in terms of energy efficiency [10, 11, 12].

Further, CF MU-MIMO employs the same time-frequency resources to provide service to multiple users as BS-based systems. To avoid the multiuser interference (MUI) in the downlink, a precoder is often implemented at the transmitter. Prior works on CF MU-MIMO have focused on linear precoding techniques such as matched filter (MF), zero-forcing (ZF) [13], and minimum mean-square error (MMSE) [14] techniques. However, it is well-known that nonlinear precoders [15, 16, 17] have the potential to outperform their linear counterparts [18, 19, 20, 21, 22, 23].

State-of-the-art in CF MU-MIMO systems has proposed network-wide (NW) precoders [24, 13, 14, 25] but these techniques entail a very high signaling load. Moreover, NW approaches demand high computational complexity because they require the inversion of a matrix whose size increases with the number of APs and users. To mitigate this problem, NW precoders that employ APs and user clusterization have been proposed [26] for lower computational complexity and signaling load. For instance, in [20] the number of APs is curtailed to reduce the signaling load. In [27], scalable MMSE combiners and precoders are developed. Very recently, a regularized ZF precoder based on subsets of user was proposed in [28] to judiciously use the available resources.

Unlike previous works [29, 20, 23], we propose nonlinear precoding techniques for CF MU-MIMO systems. The proposed techniques are based on the well-established Tomlinson-Harashima precoder (THP) [30], which may be interpreted as the transmit analog of the successive interference cancellation (SIC) employed at the receiver [31]. Essentially, THP employs a nonlinear modulo operation that reduces the power penalty associated with the linear precoders thereby enhancing the overall performance. Additionally, a cluster-based approach is devised based on a user selection matrix, resulting in a user-centric nonlinear precoder and addressing the gap in nonlinear structures for cluster-based precoders in CF networks. The resulting precoder is sparse and its complexity is reduced by employing clusters of users, thereby reducing the amount of signaling and the computational load. Our numerical experiments show that the TH precoding techniques outperform their linear counterparts.

The rest of this paper is organized as follows. In the next section, we describe the ssytem model of the CF MU-MIMO communications. We derive the proposed cluster-based nonlinear precoding techniques in Section III. We introduce the metric to evaluate the performance of the proposed precoders in Section IV. We validate our model and methods via numerical experiments in Section V. We conclude in Section VI.

II System Model

Consider the downlink of a CF MIMO system, where N𝑁Nitalic_N geographically distributed APs serve K𝐾Kitalic_K users equipped with a single omnidirecitonal antenna. A central processing unit (CPU) located at the cloud server is connected to the APs. The data are transmitted over a flat-fading channel 𝐆𝐂N×K𝐆superscript𝐂𝑁𝐾\mathbf{G}\in\mathbf{C}^{N\times K}bold_G ∈ bold_C start_POSTSUPERSCRIPT italic_N × italic_K end_POSTSUPERSCRIPT. The (n,k)𝑛𝑘(n,k)( italic_n , italic_k )-th element of matrix 𝐆𝐆\mathbf{G}bold_G is the channel coefficient between the n𝑛nitalic_n-th AP and k𝑘kitalic_k-th user, i.e., gn,k=ζn,khn,ksubscript𝑔𝑛𝑘subscript𝜁𝑛𝑘subscript𝑛𝑘g_{n,k}=\sqrt{\zeta_{n,k}}h_{n,k}italic_g start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT = square-root start_ARG italic_ζ start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT end_ARG italic_h start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT, where ζn,ksubscript𝜁𝑛𝑘\zeta_{n,k}italic_ζ start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT is the large-scale fading coefficient that models the path loss and shadowing effects, and hn,ksubscript𝑛𝑘h_{n,k}italic_h start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT represents the small-scale fading coefficient. The coefficients hn,ksubscript𝑛𝑘h_{n,k}italic_h start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT are modeled as independently and identically distributed (i.i.d.) random variables with complex Gaussian distribution 𝒞𝒩(0,1)𝒞𝒩01\mathcal{CN}\left(0,1\right)caligraphic_C caligraphic_N ( 0 , 1 ).

Denote the transmit signal by 𝐱N𝐱superscript𝑁\mathbf{x}\in\mathbb{C}^{N}bold_x ∈ blackboard_C start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT, which obeys the transmit power constraint 𝔼[𝐱2]Pt𝔼delimited-[]superscriptdelimited-∥∥𝐱2subscript𝑃𝑡\mathbb{E}\left[\lVert\mathbf{x}\rVert^{2}\right]\leq P_{t}blackboard_E [ ∥ bold_x ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ≤ italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, where 𝔼[]𝔼delimited-[]\mathbb{E}[\cdot]blackboard_E [ ⋅ ] denotes the statistical expectation. Then, the K×1𝐾1K\times 1italic_K × 1 received signal vector is

𝐲=𝐆T𝐱+𝐧,𝐲superscript𝐆T𝐱𝐧\mathbf{y}=\mathbf{G}^{\text{T}}\mathbf{x}+\mathbf{n},bold_y = bold_G start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_x + bold_n , (1)

where ()Tsuperscript𝑇(\cdot)^{T}( ⋅ ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT is the conjugate transpose and 𝐧K𝐧superscript𝐾\mathbf{n}\in\mathbb{C}^{K}bold_n ∈ blackboard_C start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT is the additive white Gaussian noise (AWGN) that follows the distribution 𝒞𝒩(𝟎,σn2𝐈)𝒞𝒩0superscriptsubscript𝜎𝑛2𝐈\mathcal{CN}\left(\mathbf{0},\sigma_{n}^{2}\mathbf{I}\right)caligraphic_C caligraphic_N ( bold_0 , italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I ).

The system employs the time division duplexing (TDD) protocol and therefore the channels can be estimated employing the channel reciprocity property and pilot training [32]. After receiving the pilots, the CPU computes the channel estimate 𝐆^T=[𝐠^1,𝐠^2,,𝐠^k]TK×Nsuperscript^𝐆Tsuperscriptsubscript^𝐠1subscript^𝐠2subscript^𝐠𝑘Tsuperscript𝐾𝑁\mathbf{\hat{G}}^{\text{T}}=\left[\mathbf{\hat{g}}_{1},\mathbf{\hat{g}}_{2},% \cdots,\mathbf{\hat{g}}_{k}\right]^{\text{T}}\in\mathbb{C}^{K\times N}over^ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT = [ over^ start_ARG bold_g end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , over^ start_ARG bold_g end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , over^ start_ARG bold_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_K × italic_N end_POSTSUPERSCRIPT, whose (n,k)𝑛𝑘(n,k)( italic_n , italic_k )-th element is

g^n,k=ζn,k(1σe2hn,k+σeh~n,k),subscript^𝑔𝑛𝑘subscript𝜁𝑛𝑘1superscriptsubscript𝜎𝑒2subscript𝑛𝑘subscript𝜎𝑒subscript~𝑛𝑘\hat{g}_{n,k}=\sqrt{\zeta_{n,k}}\left(\sqrt{1-\sigma_{e}^{2}}h_{n,k}+\sigma_{e% }\tilde{h}_{n,k}\right),over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT = square-root start_ARG italic_ζ start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT end_ARG ( square-root start_ARG 1 - italic_σ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_h start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT + italic_σ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT over~ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT ) , (2)

where g^n,ksubscript^𝑔𝑛𝑘\hat{g}_{n,k}over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT is the channel estimate between the n𝑛nitalic_n-th AP and the k𝑘kitalic_k-th user; h~n,ksubscript~𝑛𝑘\tilde{h}_{n,k}over~ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT are i.i.d complex Gaussian random variables that follow the distribution 𝒞𝒩(0,1)𝒞𝒩01\mathcal{CN}\left(0,1\right)caligraphic_C caligraphic_N ( 0 , 1 ) (independent from hn,ksubscript𝑛𝑘h_{n,k}italic_h start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT) and model the errors in the channel estimates; and σesubscript𝜎𝑒\sigma_{e}italic_σ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT represents the quality of the channel state information (CSI). The error affecting the channel estimate g^n,ksubscript^𝑔𝑛𝑘\hat{g}_{n,k}over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT is g~n,k=σeζn,kh~n,ksubscript~𝑔𝑛𝑘subscript𝜎𝑒subscript𝜁𝑛𝑘subscript~𝑛𝑘\tilde{g}_{n,k}=\sigma_{e}\sqrt{\zeta_{n,k}}\tilde{h}_{n,k}over~ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT = italic_σ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT square-root start_ARG italic_ζ start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT end_ARG over~ start_ARG italic_h end_ARG start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT.

III Proposed Cluster-Based Nonlinear Precoders

To enhance the performance of the system while reducing the signaling load and computational complexity of NW precoders, we propose cluster-based nonlinear precoders. To this end, we form clusters of APs and users. These clusters are defined based on the large-scale channel coefficients given by ζn,ksubscript𝜁𝑛𝑘\zeta_{n,k}italic_ζ start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT. Since only small subsets of APs transmit the most relevant signals for reception, the contribution of the remaining APs is not significant and the transmission over such APs is avoidable. The upshot of this technique is that we discard the APs whose processing is cost-ineffective to reduce the signaling load.

III-A AP selection

The signaling load is brought down by taking into account that each user is served only by a reduced cluster of APs. Consider the pre-fixed scalar L𝐿Litalic_L that denotes the number of APs that are going to be selected. Then, for the k𝑘kitalic_k-th user, the L𝐿Litalic_L APs with the largest large-scale fading coefficient are selected and gathered in the set 𝒜ksubscript𝒜𝑘\mathcal{A}_{k}caligraphic_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. In this sense, we employ the equivalent channel estimate 𝐆¯T=[𝐠¯1,𝐠¯2,,𝐠¯k]TK×Nsuperscript¯𝐆Tsuperscriptsubscript¯𝐠1subscript¯𝐠2subscript¯𝐠𝑘Tsuperscript𝐾𝑁\bar{\mathbf{G}}^{\text{T}}=\left[\mathbf{\bar{g}}_{1},\mathbf{\bar{g}}_{2},% \cdots,\mathbf{\bar{g}}_{k}\right]^{\text{T}}\in\mathbb{C}^{K\times N}over¯ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT = [ over¯ start_ARG bold_g end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , over¯ start_ARG bold_g end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG bold_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_K × italic_N end_POSTSUPERSCRIPT, which is a sparse matrix with the (n,k)𝑛𝑘(n,k)( italic_n , italic_k )-th element as

g¯n,k={g^n,k,n𝒜k,0,otherwise.subscript¯𝑔𝑛𝑘casessubscript^𝑔𝑛𝑘𝑛subscript𝒜𝑘0otherwise.\bar{g}_{n,k}=\begin{cases}\hat{g}_{n,k},&n\in\mathcal{A}_{k},\\ 0,&\text{otherwise.}\end{cases}over¯ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT = { start_ROW start_CELL over^ start_ARG italic_g end_ARG start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT , end_CELL start_CELL italic_n ∈ caligraphic_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , end_CELL end_ROW start_ROW start_CELL 0 , end_CELL start_CELL otherwise. end_CELL end_ROW (3)

III-B Sparse TH precoder

Using (3), we compute a sparse TH precoder (TH-SP), which defines how the symbols are transmitted by the selected APs. The conventional THP employs three different filters [33]: feedback filter 𝐁K×K𝐁superscript𝐾𝐾\mathbf{B}\in\mathbb{C}^{K\times K}bold_B ∈ blackboard_C start_POSTSUPERSCRIPT italic_K × italic_K end_POSTSUPERSCRIPT, feedforward filter 𝐅N×K𝐅superscript𝑁𝐾\mathbf{F}\in\mathbb{C}^{N\times K}bold_F ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_K end_POSTSUPERSCRIPT, and a scaling matrix 𝐂K×K𝐂superscript𝐾𝐾\mathbf{C}\in\mathbb{C}^{K\times K}bold_C ∈ blackboard_C start_POSTSUPERSCRIPT italic_K × italic_K end_POSTSUPERSCRIPT [15]. The feedback filter 𝐁𝐁\mathbf{B}bold_B deals with the multiuser interference (MUI) by successively subtracting the interference of previous symbols from the current symbol and, therefore, is a matrix with a lower triangular structure. The feedforward filter 𝐅𝐅\mathbf{F}bold_F enforces the spatial causality. The scaling matrix 𝐂𝐂\mathbf{C}bold_C assigns a weight to each stream and is, therefore, a diagonal matrix. Depending on the position of matrix 𝐂𝐂\mathbf{C}bold_C, two different THP structures have been suggested: the centralized THP (cTHP) implements the scaling matrix at the transmitter side (at the central processing unit), whereas the decentralized THP (dTHP) considers that 𝐂𝐂\mathbf{C}bold_C is included at the receivers.

Our proposed (TH-SP) attempts to completely remove the MUI. We implement it by applying an LQ decomposition on the equivalent channel estimate 𝐆¯Tsuperscript¯𝐆T\bar{\mathbf{G}}^{\text{T}}over¯ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT, i.e., 𝐆¯T=𝐋¯𝐐¯superscript¯𝐆T¯𝐋¯𝐐\bar{\mathbf{G}}^{\text{T}}=\bar{\mathbf{L}}\bar{\mathbf{Q}}over¯ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT = over¯ start_ARG bold_L end_ARG over¯ start_ARG bold_Q end_ARG, where 𝐋¯K×K¯𝐋superscript𝐾𝐾\bar{\mathbf{L}}\in\mathbb{C}^{K\times K}over¯ start_ARG bold_L end_ARG ∈ blackboard_C start_POSTSUPERSCRIPT italic_K × italic_K end_POSTSUPERSCRIPT and 𝐐¯K×N¯𝐐superscript𝐾𝑁\bar{\mathbf{Q}}\in\mathbb{C}^{K\times N}over¯ start_ARG bold_Q end_ARG ∈ blackboard_C start_POSTSUPERSCRIPT italic_K × italic_N end_POSTSUPERSCRIPT. Denote the (n,k)𝑛𝑘(n,k)( italic_n , italic_k )-th element of the matrix 𝐋¯¯𝐋\bar{\mathbf{L}}over¯ start_ARG bold_L end_ARG by l^n,ksubscript^𝑙𝑛𝑘\hat{l}_{n,k}over^ start_ARG italic_l end_ARG start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT. Then, the respective three THP filters are

𝐅𝐅\displaystyle\mathbf{F}bold_F =𝐐¯H,absentsuperscript¯𝐐𝐻\displaystyle=\bar{\mathbf{Q}}^{H},= over¯ start_ARG bold_Q end_ARG start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT , (4)
𝐂𝐂\displaystyle\mathbf{C}bold_C =diag(l¯1,1,l¯2,2,,l¯N,N),absentdiagsubscript¯𝑙11subscript¯𝑙22subscript¯𝑙𝑁𝑁\displaystyle=\text{diag}\left(\bar{l}_{1,1},\bar{l}_{2,2},\cdots,\bar{l}_{N,N% }\right),= diag ( over¯ start_ARG italic_l end_ARG start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT , over¯ start_ARG italic_l end_ARG start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_l end_ARG start_POSTSUBSCRIPT italic_N , italic_N end_POSTSUBSCRIPT ) , (5)
𝐁(c)superscript𝐁c\displaystyle\mathbf{B}^{\left(\text{c}\right)}bold_B start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT =𝐋¯𝐂,absent¯𝐋𝐂\displaystyle=\bar{\mathbf{L}}\mathbf{C},= over¯ start_ARG bold_L end_ARG bold_C , (6)
𝐁(d)superscript𝐁d\displaystyle\quad\mathbf{B}^{\left(\text{d}\right)}bold_B start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT =𝐂𝐋¯,absent𝐂¯𝐋\displaystyle=\mathbf{C}\bar{\mathbf{L}},= bold_C over¯ start_ARG bold_L end_ARG , (7)

where 𝐁(c)superscript𝐁c\mathbf{B}^{\left(\text{c}\right)}bold_B start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT and 𝐁(c)superscript𝐁c\mathbf{B}^{\left(\text{c}\right)}bold_B start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT denote the feedback filters for the centralized and decentralized architectures, respectively.

Denote the coefficients of the feedback filter by bn,ksubscript𝑏𝑛𝑘b_{n,k}italic_b start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT and the symbols after feedback processing by s˘ksubscript˘𝑠𝑘\breve{s}_{k}over˘ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Then, the feedback filter subtracts the interference from previous symbols as

s˘k=ski=1k1bk,is˘i.subscript˘𝑠𝑘subscript𝑠𝑘superscriptsubscript𝑖1𝑘1subscript𝑏𝑘𝑖subscript˘𝑠𝑖\breve{s}_{k}=s_{k}-\sum_{i=1}^{k-1}b_{k,i}\breve{s}_{i}.over˘ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT italic_b start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT over˘ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT . (8)

The feedback filter amplifies the power of the transmitted signal. Therefore, a modulo operation is introduced to reduce the power of the transmitted signal as

(s˘k)=s˘kRe(s˘k)λ+12λjIm(s˘k)λ+12λ,subscript˘𝑠𝑘subscript˘𝑠𝑘Resubscript˘𝑠𝑘𝜆12𝜆𝑗Imsubscript˘𝑠𝑘𝜆12𝜆\mathcal{M}\left(\breve{s}_{k}\right)=\breve{s}_{k}-\left\lfloor\frac{\text{Re% }\left(\breve{s}_{k}\right)}{\lambda}+\frac{1}{2}\right\rfloor\lambda-j\left% \lfloor\frac{\text{Im}\left(\breve{s}_{k}\right)}{\lambda}+\frac{1}{2}\right% \rfloor\lambda,caligraphic_M ( over˘ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = over˘ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - ⌊ divide start_ARG Re ( over˘ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_λ end_ARG + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ⌋ italic_λ - italic_j ⌊ divide start_ARG Im ( over˘ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) end_ARG start_ARG italic_λ end_ARG + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ⌋ italic_λ , (9)

where Re()Re\text{Re}(\cdot)Re ( ⋅ ) (Im()Im\text{Im}(\cdot)Im ( ⋅ )) is the real (imaginary) part of its complex argument and the parameter λ𝜆\lambdaitalic_λ depends on the modulation alphabet and the power allocation scheme. Some common values of λ𝜆\lambdaitalic_λ when employing symbols with unit variance are λ=22𝜆22\lambda=2\sqrt{2}italic_λ = 2 square-root start_ARG 2 end_ARG and λ=410/5𝜆4105\lambda=4\sqrt{10}/5italic_λ = 4 square-root start_ARG 10 end_ARG / 5 for QPSK and 16-QAM, respectively.

Unlike linear precoders [4, 34, 35, 36, 37, 38, 39, 40, 41, 42], THP introduces power and modulo losses in the system. The former comes from the energy difference between the original constellation and the transmitted symbols after precoding. The latter is caused by the modulo operation. Both losses can be neglected for analysis purposes and for moderate and large modulation sizes [15, 17].

The modulo operation is modeled as the addition of a perturbation vector 𝐝K×1𝐝superscript𝐾1\mathbf{d}\in\mathbb{C}^{K\times 1}bold_d ∈ blackboard_C start_POSTSUPERSCRIPT italic_K × 1 end_POSTSUPERSCRIPT to the transmitted symbols 𝐬𝐬\mathbf{s}bold_s. On the other hand, the feedback processing is implemented through the inversion of the matrix 𝐁𝐁\mathbf{B}bold_B. Thus, the vector of symbols after feedback processing 𝐬˘K×1˘𝐬superscript𝐾1\breve{\mathbf{s}}\in\mathbb{C}^{K\times 1}over˘ start_ARG bold_s end_ARG ∈ blackboard_C start_POSTSUPERSCRIPT italic_K × 1 end_POSTSUPERSCRIPT is

𝐬˘=˘𝐬absent\displaystyle\breve{\mathbf{s}}=over˘ start_ARG bold_s end_ARG = 𝐁1(𝐬+𝐝)superscript𝐁1𝐬𝐝\displaystyle\mathbf{B}^{-1}\left(\mathbf{s}+\mathbf{d}\right)bold_B start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( bold_s + bold_d )
=\displaystyle== 𝐁1𝐯.superscript𝐁1𝐯\displaystyle\mathbf{B}^{-1}\mathbf{v}.bold_B start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_v . (10)

Therefore, the receive signal vectors for the centralized and decentralized structures are, respectively,

𝐲(c)=1β(c)(𝐆Tβ(c)𝐅𝐂𝐬˘+𝐧),superscript𝐲c1superscript𝛽csuperscript𝐆Tsuperscript𝛽c𝐅𝐂˘𝐬𝐧\mathbf{y}^{\left(\text{c}\right)}=\frac{1}{\beta^{\left(\text{c}\right)}}% \left(\mathbf{G}^{\text{T}}\beta^{\left(\text{c}\right)}\mathbf{F}\mathbf{C}% \breve{\mathbf{s}}+\mathbf{n}\right),bold_y start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_β start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT end_ARG ( bold_G start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT italic_β start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT bold_FC over˘ start_ARG bold_s end_ARG + bold_n ) , (11)

and

𝐲(d)=1β(d)𝐂(𝐆Tβ(d)𝐅𝐬˘+𝐧),superscript𝐲d1superscript𝛽d𝐂superscript𝐆Tsuperscript𝛽d𝐅˘𝐬𝐧\mathbf{y}^{\left(\text{d}\right)}=\frac{1}{\beta^{\left(\text{d}\right)}}% \mathbf{C}\left(\mathbf{G}^{\text{T}}\beta^{\left(\text{d}\right)}\mathbf{F}% \breve{\mathbf{s}}+\mathbf{n}\right),bold_y start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_β start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT end_ARG bold_C ( bold_G start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT italic_β start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT bold_F over˘ start_ARG bold_s end_ARG + bold_n ) , (12)

where the parameters β(c)superscript𝛽c\beta^{\left(\text{c}\right)}italic_β start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT (β(d)superscript𝛽d\beta^{\left(\text{d}\right)}italic_β start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT) represent scaling factor of the centralized (decentralized) structure introduced to fulfill the transmit power constraint and defined as

β(c)PtK,superscript𝛽csubscript𝑃𝑡𝐾\displaystyle\beta^{\left(\text{c}\right)}\approx\sqrt{\frac{P_{t}}{K}},italic_β start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT ≈ square-root start_ARG divide start_ARG italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_K end_ARG end_ARG , β(d)Ptk=1K(1/l¯k,k2).superscript𝛽dsubscript𝑃𝑡superscriptsubscript𝑘1𝐾1superscriptsubscript¯𝑙𝑘𝑘2\displaystyle\beta^{\left(\text{d}\right)}\approx\sqrt{\frac{P_{t}}{\sum% \limits_{k=1}^{K}\left(1/\bar{l}_{k,k}^{2}\right)}}.italic_β start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT ≈ square-root start_ARG divide start_ARG italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( 1 / over¯ start_ARG italic_l end_ARG start_POSTSUBSCRIPT italic_k , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG end_ARG . (13)

Using the channel relation 𝐆T=1τ(𝐆¯T𝐆~T)superscript𝐆T1𝜏superscript¯𝐆Tsuperscript~𝐆T\mathbf{G}^{\text{T}}=\frac{1}{\tau}\left(\bar{\mathbf{G}}^{\text{T}}-\tilde{% \mathbf{G}}^{\text{T}}\right)bold_G start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_τ end_ARG ( over¯ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT - over~ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT ), where τ=1+σe2𝜏1superscriptsubscript𝜎𝑒2\tau=\sqrt{1+\sigma_{e}^{2}}italic_τ = square-root start_ARG 1 + italic_σ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG, and substituting (III-B) in (11) and (12) yield, respectively,

𝐲(c)=superscript𝐲cabsent\displaystyle\mathbf{y}^{\left(\text{c}\right)}=bold_y start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT = 1τ𝐯1τ𝐆~T𝐅𝐂𝐁(c)1𝐯+1β(c)𝐧,1𝜏𝐯1𝜏superscript~𝐆Tsuperscript𝐅𝐂𝐁superscriptc1𝐯1superscript𝛽c𝐧\displaystyle\frac{1}{\tau}\mathbf{v}-\frac{1}{\tau}\tilde{\mathbf{G}}^{\text{% T}}\mathbf{F}\mathbf{C}\mathbf{B}^{\left(\text{c}\right)^{-1}}\mathbf{v}+\frac% {1}{\beta^{\left(\text{c}\right)}}\mathbf{n},divide start_ARG 1 end_ARG start_ARG italic_τ end_ARG bold_v - divide start_ARG 1 end_ARG start_ARG italic_τ end_ARG over~ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_FCB start_POSTSUPERSCRIPT ( c ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_v + divide start_ARG 1 end_ARG start_ARG italic_β start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT end_ARG bold_n , (14)
𝐲(d)=superscript𝐲dabsent\displaystyle\mathbf{y}^{\left(\text{d}\right)}=bold_y start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT = 1τ𝐯1τ𝐂𝐆~T𝐅𝐁(d)1𝐯+1β(d)𝐂𝐧,1𝜏𝐯1𝜏𝐂superscript~𝐆Tsuperscript𝐅𝐁superscriptd1𝐯1superscript𝛽d𝐂𝐧\displaystyle\frac{1}{\tau}\mathbf{v}-\frac{1}{\tau}\mathbf{C}\tilde{\mathbf{G% }}^{\text{T}}\mathbf{F}\mathbf{B}^{\left(\text{d}\right)^{-1}}\mathbf{v}+\frac% {1}{\beta^{\left(\text{d}\right)}}\mathbf{C}\mathbf{n},divide start_ARG 1 end_ARG start_ARG italic_τ end_ARG bold_v - divide start_ARG 1 end_ARG start_ARG italic_τ end_ARG bold_C over~ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_FB start_POSTSUPERSCRIPT ( d ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT bold_v + divide start_ARG 1 end_ARG start_ARG italic_β start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT end_ARG bold_Cn , (15)

It follows that the received signal at user k𝑘kitalic_k is

yk(c)=subscriptsuperscript𝑦c𝑘absent\displaystyle y^{\left(\text{c}\right)}_{k}=italic_y start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 1τvk1τ𝐠~kTi=1Kvi𝐩i(c)+1β(c)nk,1𝜏subscript𝑣𝑘1𝜏subscriptsuperscript~𝐠T𝑘superscriptsubscript𝑖1𝐾subscript𝑣𝑖superscriptsubscript𝐩𝑖c1superscript𝛽csubscript𝑛𝑘\displaystyle\frac{1}{\tau}v_{k}-\frac{1}{\tau}\tilde{\mathbf{g}}^{\text{T}}_{% k}\sum\limits_{i=1}^{K}v_{i}\mathbf{p}_{i}^{\left(\text{c}\right)}+\frac{1}{% \beta^{\left(\text{c}\right)}}n_{k},divide start_ARG 1 end_ARG start_ARG italic_τ end_ARG italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_τ end_ARG over~ start_ARG bold_g end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_β start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT end_ARG italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , (16)
yi(d)=subscriptsuperscript𝑦d𝑖absent\displaystyle y^{\left(\text{d}\right)}_{i}=italic_y start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1τvkck,kτ𝐠~kTi=1Kvi𝐩i(d)+ck,kβ(d)nk,1𝜏subscript𝑣𝑘subscript𝑐𝑘𝑘𝜏subscriptsuperscript~𝐠T𝑘superscriptsubscript𝑖1𝐾subscript𝑣𝑖superscriptsubscript𝐩𝑖dsubscript𝑐𝑘𝑘superscript𝛽dsubscript𝑛𝑘\displaystyle\frac{1}{\tau}v_{k}-\frac{c_{k,k}}{\tau}\tilde{\mathbf{g}}^{\text% {T}}_{k}\sum\limits_{i=1}^{K}v_{i}\mathbf{p}_{i}^{\left(\text{d}\right)}+\frac% {c_{k,k}}{\beta^{\left(\text{d}\right)}}n_{k},divide start_ARG 1 end_ARG start_ARG italic_τ end_ARG italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - divide start_ARG italic_c start_POSTSUBSCRIPT italic_k , italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_τ end_ARG over~ start_ARG bold_g end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT + divide start_ARG italic_c start_POSTSUBSCRIPT italic_k , italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_β start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT end_ARG italic_n start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , (17)

where, to simplify the notations, we have substituted the matrices 𝐏(c)=𝐅𝐂𝐁(c)1N×Ksuperscript𝐏csuperscript𝐅𝐂𝐁superscriptc1superscript𝑁𝐾\mathbf{P}^{\left(\text{c}\right)}=\mathbf{F}\mathbf{C}\mathbf{B}^{\left(\text% {c}\right)^{-1}}\in\mathbb{C}^{N\times K}bold_P start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT = bold_FCB start_POSTSUPERSCRIPT ( c ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_K end_POSTSUPERSCRIPT and 𝐏(d)=𝐅𝐁(d)1N×Ksuperscript𝐏dsuperscript𝐅𝐁superscriptd1superscript𝑁𝐾\mathbf{P}^{\left(\text{d}\right)}=\mathbf{F}\mathbf{B}^{\left(\text{d}\right)% ^{-1}}\in\mathbb{C}^{N\times K}bold_P start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT = bold_FB start_POSTSUPERSCRIPT ( d ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_K end_POSTSUPERSCRIPT, whose i𝑖iitalic_i-th columns are 𝐩i(c)subscriptsuperscript𝐩c𝑖\mathbf{p}^{\left(\text{c}\right)}_{i}bold_p start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and 𝐩i(d)subscriptsuperscript𝐩d𝑖\mathbf{p}^{\left(\text{d}\right)}_{i}bold_p start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, respectively.

III-C Cluster-based TH precoders

Denote the K𝐾Kitalic_K clusters of usersby 𝒫ksubscript𝒫𝑘\mathcal{P}_{k}caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, k=1,,K𝑘1𝐾k=1,\cdots,Kitalic_k = 1 , ⋯ , italic_K. While the user k𝑘kitalic_k is always included in 𝒫ksubscript𝒫𝑘\mathcal{P}_{k}caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, the user i𝑖iitalic_i, ik𝑖𝑘i\neq kitalic_i ≠ italic_k is included in 𝒫ksubscript𝒫𝑘\mathcal{P}_{k}caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT if at least Nasubscript𝑁𝑎N_{a}italic_N start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT antennas provide service to user i𝑖iitalic_i and all other users in 𝒫ksubscript𝒫𝑘\mathcal{P}_{k}caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Then, define the user selection matrix 𝐔k|𝒫k|×Ksubscript𝐔𝑘superscriptsubscript𝒫𝑘𝐾\mathbf{U}_{k}\in\mathbb{R}^{\lvert\mathcal{P}_{k}\rvert\times K}bold_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | × italic_K end_POSTSUPERSCRIPT, where |𝒫k|subscript𝒫𝑘\lvert\mathcal{P}_{k}\rvert| caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | is the cardinality of the set 𝒫ksubscript𝒫𝑘\mathcal{P}_{k}caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and the j𝑗jitalic_j-th row of 𝐔ksubscript𝐔𝑘\mathbf{U}_{k}bold_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is 𝐮j,ksubscript𝐮𝑗𝑘\mathbf{u}_{j,k}bold_u start_POSTSUBSCRIPT italic_j , italic_k end_POSTSUBSCRIPT. In particular, 𝐮1,ksubscript𝐮1𝑘\mathbf{u}_{1,k}bold_u start_POSTSUBSCRIPT 1 , italic_k end_POSTSUBSCRIPT contains zeros in all positions except in the l𝑙litalic_l-th, where l𝑙litalic_l is the j𝑗jitalic_j-th lowest index in 𝒫ksubscript𝒫𝑘\mathcal{P}_{k}caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Similarly, the second row 𝐮2,ksubscript𝐮2𝑘\mathbf{u}_{2,k}bold_u start_POSTSUBSCRIPT 2 , italic_k end_POSTSUBSCRIPT contains a one at the j𝑗jitalic_j-th position, where j𝑗jitalic_j is the second lowest index in 𝒫ksubscript𝒫𝑘\mathcal{P}_{k}caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and all other coefficients are equal to zero. The subsequent rows of 𝐔ksubscript𝐔𝑘\mathbf{U}_{k}bold_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT are defined similarly.

The reduced channel matrix is 𝐆¯kT=𝐔k𝐆¯T|𝒫k|×Nsubscriptsuperscript¯𝐆T𝑘subscript𝐔𝑘superscript¯𝐆Tsuperscriptsubscript𝒫𝑘𝑁\bar{\mathbf{G}}^{\text{T}}_{k}=\mathbf{U}_{k}\bar{\mathbf{G}}^{\text{T}}\in% \mathbb{C}^{\lvert\mathcal{P}_{k}\rvert\times N}over¯ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = bold_U start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over¯ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | × italic_N end_POSTSUPERSCRIPT, which is used to compute the TH precoder with reduced dimensions (THP-RD). Applying an LQ decomposition over the reduced channel matrix, i.e. 𝐆¯kT=𝐋¯k𝐐¯ksuperscriptsubscript¯𝐆𝑘Tsubscript¯𝐋𝑘subscript¯𝐐𝑘\bar{\mathbf{G}}_{k}^{\text{T}}=\bar{\mathbf{L}}_{k}\bar{\mathbf{Q}}_{k}over¯ start_ARG bold_G end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT = over¯ start_ARG bold_L end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over¯ start_ARG bold_Q end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, where 𝐋¯k|𝒫k|×|𝒫k|subscript¯𝐋𝑘superscriptsubscript𝒫𝑘subscript𝒫𝑘\bar{\mathbf{L}}_{k}\in\mathbb{C}^{\lvert\mathcal{P}_{k}\rvert\times\lvert% \mathcal{P}_{k}\rvert}over¯ start_ARG bold_L end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | × | caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | end_POSTSUPERSCRIPT and 𝐐¯k|𝒫k|×Nsubscript¯𝐐𝑘superscriptsubscript𝒫𝑘𝑁\bar{\mathbf{Q}}_{k}\in\mathbb{C}^{\lvert\mathcal{P}_{k}\rvert\times N}over¯ start_ARG bold_Q end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | × italic_N end_POSTSUPERSCRIPT, produces the three THP filters as

𝐅ksubscript𝐅𝑘\displaystyle\mathbf{F}_{k}bold_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =𝐐¯kH,absentsuperscriptsubscript¯𝐐𝑘𝐻\displaystyle=\bar{\mathbf{Q}}_{k}^{H},= over¯ start_ARG bold_Q end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT , (18)
𝐂ksubscript𝐂𝑘\displaystyle\mathbf{C}_{k}bold_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =diag(l¯1,1,l¯2,2,,l¯|𝒫k|,|𝒫k|),absentdiagsubscript¯𝑙11subscript¯𝑙22subscript¯𝑙subscript𝒫𝑘subscript𝒫𝑘\displaystyle=\text{diag}\left(\bar{l}_{1,1},\bar{l}_{2,2},\cdots,\bar{l}_{% \lvert\mathcal{P}_{k}\rvert,\lvert\mathcal{P}_{k}\rvert}\right),= diag ( over¯ start_ARG italic_l end_ARG start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT , over¯ start_ARG italic_l end_ARG start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_l end_ARG start_POSTSUBSCRIPT | caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | , | caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | end_POSTSUBSCRIPT ) , (19)
𝐁k(c)superscriptsubscript𝐁𝑘c\displaystyle\mathbf{B}_{k}^{\left(\text{c}\right)}bold_B start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT =𝐋¯k𝐂k,absentsubscript¯𝐋𝑘subscript𝐂𝑘\displaystyle=\bar{\mathbf{L}}_{k}\mathbf{C}_{k},= over¯ start_ARG bold_L end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , (20)
𝐁k(d)superscriptsubscript𝐁𝑘d\displaystyle\mathbf{B}_{k}^{\left(\text{d}\right)}bold_B start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT =𝐂k𝐋¯k.absentsubscript𝐂𝑘subscript¯𝐋𝑘\displaystyle=\mathbf{C}_{k}\bar{\mathbf{L}}_{k}.= bold_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT over¯ start_ARG bold_L end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . (21)

The set 𝒫ksubscript𝒫𝑘\mathcal{P}_{k}caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is associated to 𝐆¯kTsubscriptsuperscript¯𝐆T𝑘\bar{\mathbf{G}}^{\text{T}}_{k}over¯ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and to the decoding of the information of user k𝑘kitalic_k but the channel matrix 𝐆¯kTsubscriptsuperscript¯𝐆T𝑘\bar{\mathbf{G}}^{\text{T}}_{k}over¯ start_ARG bold_G end_ARG start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT has reduced dimensions. Therefore, we need an index map** to find the correct precoder. Denote this index by q𝑞qitalic_q such that 𝐮q,ksubscript𝐮𝑞𝑘\mathbf{u}_{q,k}bold_u start_POSTSUBSCRIPT italic_q , italic_k end_POSTSUBSCRIPT contains a one in its k𝑘kitalic_k-th entry. It follows that the q𝑞qitalic_q-th column should be employed in the precoders denoted by 𝐏(cTHP-RD)=[𝐩1(c)𝐩k(c)𝐩K(c)]superscript𝐏cTHP-RDdelimited-[]superscriptsubscript𝐩1superscriptcsuperscriptsubscript𝐩𝑘superscriptcsuperscriptsubscript𝐩𝐾superscriptc\mathbf{P}^{\left(\text{cTHP-RD}\right)}=[\mathbf{p}_{1}^{\left(\text{c}\right% )^{\prime}}\ldots\mathbf{p}_{k}^{\left(\text{c}\right)^{\prime}}\ldots\mathbf{% p}_{K}^{\left(\text{c}\right)^{\prime}}]bold_P start_POSTSUPERSCRIPT ( cTHP-RD ) end_POSTSUPERSCRIPT = [ bold_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT … bold_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT … bold_p start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ] and 𝐏(dTHP-RD)=[𝐩1(d)𝐩k(d)𝐩K(d)]superscript𝐏dTHP-RDdelimited-[]superscriptsubscript𝐩1superscriptdsuperscriptsubscript𝐩𝑘superscriptdsuperscriptsubscript𝐩𝐾superscriptd\mathbf{P}^{\left(\text{dTHP-RD}\right)}=[\mathbf{p}_{1}^{\left(\text{d}\right% )^{\prime}}\ldots\mathbf{p}_{k}^{\left(\text{d}\right)^{\prime}}\ldots\mathbf{% p}_{K}^{\left(\text{d}\right)^{\prime}}]bold_P start_POSTSUPERSCRIPT ( dTHP-RD ) end_POSTSUPERSCRIPT = [ bold_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT … bold_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT … bold_p start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ] for the cTHP and dTHP structures, respectively. Then, the k𝑘kitalic_k-th columns of the respective precoding matrices are

𝐩k(c)=subscriptsuperscript𝐩superscript𝑐𝑘absent\displaystyle\mathbf{p}^{\left(c\right)^{\prime}}_{k}=bold_p start_POSTSUPERSCRIPT ( italic_c ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = [𝐅k𝐂k𝐁k(c)1]q,subscriptdelimited-[]subscript𝐅𝑘subscript𝐂𝑘subscriptsuperscript𝐁superscript𝑐1𝑘𝑞\displaystyle\left[\mathbf{F}_{k}\mathbf{C}_{k}\mathbf{B}^{\left(c\right)^{-1}% }_{k}\right]_{q},[ bold_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_C start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_B start_POSTSUPERSCRIPT ( italic_c ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT , (22)
𝐩k(d)=subscriptsuperscript𝐩superscript𝑑𝑘absent\displaystyle\mathbf{p}^{\left(d\right)^{\prime}}_{k}=bold_p start_POSTSUPERSCRIPT ( italic_d ) start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = [𝐅k𝐁k(d)1]q.subscriptdelimited-[]subscript𝐅𝑘subscriptsuperscript𝐁superscript𝑑1𝑘𝑞\displaystyle\left[\mathbf{F}_{k}\mathbf{B}^{\left(d\right)^{-1}}_{k}\right]_{% q}.[ bold_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_B start_POSTSUPERSCRIPT ( italic_d ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT . (23)

IV Sum-rate performance

To evaluate the proposed nonlinear schemes, we employ the ergodic sum-rate (ESR) defined as

Sr=𝔼[k=1KR¯k],subscript𝑆𝑟𝔼delimited-[]superscriptsubscript𝑘1𝐾subscript¯𝑅𝑘S_{r}=\mathbb{E}\left[\sum_{k=1}^{K}\bar{R}_{k}\right],italic_S start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = blackboard_E [ ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT over¯ start_ARG italic_R end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] , (24)

where R¯k=𝔼[Rk|𝐆^]subscript¯𝑅𝑘𝔼delimited-[]conditionalsubscript𝑅𝑘^𝐆\bar{R}_{k}=\mathbb{E}\left[R_{k}|\hat{\mathbf{G}}\right]over¯ start_ARG italic_R end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = blackboard_E [ italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | over^ start_ARG bold_G end_ARG ] is the average rate and Rksubscript𝑅𝑘R_{k}italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the instantaneous rate of the k𝑘kitalic_k-th user. The rate R¯ksubscript¯𝑅𝑘\bar{R}_{k}over¯ start_ARG italic_R end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT averages out the effects of the imperfect CSIT because the instantaneous rates are not achievable. Considering Gaussian codebooks, the instantaneous rate is

Rk=log2(1+γk),subscript𝑅𝑘subscript21subscript𝛾𝑘R_{k}=\log_{2}\left(1+\gamma_{k}\right),italic_R start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , (25)

where γksubscript𝛾𝑘\gamma_{k}italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the signal-to-interference-plus-noise ratio (SINR) at user k𝑘kitalic_k.

Denote the SINR for the centralized and decentralized structures by γk(c)superscriptsubscript𝛾𝑘c\gamma_{k}^{\left(\text{c}\right)}italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT and γk(d)superscriptsubscript𝛾𝑘d\gamma_{k}^{\left(\text{d}\right)}italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT, respectively. Then, depending on the specific THP structure used, we employ γk(c)superscriptsubscript𝛾𝑘c\gamma_{k}^{\left(\text{c}\right)}italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT or γk(d)superscriptsubscript𝛾𝑘d\gamma_{k}^{\left(\text{d}\right)}italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT in (25) to obtain the instantaneous rate. To compute the SINR, we obtain the mean powers of the received signal at user k𝑘kitalic_k for centralized and decentralized structures as, respectively,

𝔼[|yk(c)|2]=𝔼delimited-[]superscriptsubscriptsuperscript𝑦c𝑘2absent\displaystyle\mathbb{E}\left[\lvert y^{\left(\text{c}\right)}_{k}\rvert^{2}% \right]=blackboard_E [ | italic_y start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = 1τ2+1τ2i=1ikK|𝐠~kT𝐩i(c)|2+1β(c)σn2+1τ2dg(c),1superscript𝜏21superscript𝜏2superscriptsubscript𝑖1𝑖𝑘𝐾superscriptsuperscriptsubscript~𝐠𝑘Tsuperscriptsubscript𝐩𝑖c21superscript𝛽csuperscriptsubscript𝜎𝑛21superscript𝜏2superscriptsubscript𝑑𝑔c\displaystyle\frac{1}{\tau^{2}}+\frac{1}{\tau^{2}}\sum\limits_{\begin{subarray% }{c}i=1\\ i\neq k\end{subarray}}^{K}\lvert\tilde{\mathbf{g}}_{k}^{\text{T}}\mathbf{p}_{i% }^{\left(\text{c}\right)}\rvert^{2}+\frac{1}{\beta^{\left(\text{c}\right)}}% \sigma_{n}^{2}+\frac{1}{\tau^{2}}d_{g}^{\left(\text{c}\right)},divide start_ARG 1 end_ARG start_ARG italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + divide start_ARG 1 end_ARG start_ARG italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_i = 1 end_CELL end_ROW start_ROW start_CELL italic_i ≠ italic_k end_CELL end_ROW end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT | over~ start_ARG bold_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_β start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT end_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_d start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT , (28)

and

𝔼[|yk(d)|2]=𝔼delimited-[]superscriptsubscriptsuperscript𝑦d𝑘2absent\displaystyle\mathbb{E}\left[\lvert y^{\left(\text{d}\right)}_{k}\rvert^{2}% \right]=blackboard_E [ | italic_y start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = 1τ2+ck,k2τ2i=1ikK|𝐠~kT𝐩i(d)|2+ck,k2β(d)σn2+1τ2dg(d),1superscript𝜏2superscriptsubscript𝑐𝑘𝑘2superscript𝜏2superscriptsubscript𝑖1𝑖𝑘𝐾superscriptsuperscriptsubscript~𝐠𝑘Tsuperscriptsubscript𝐩𝑖d2superscriptsubscript𝑐𝑘𝑘2superscript𝛽dsuperscriptsubscript𝜎𝑛21superscript𝜏2superscriptsubscript𝑑𝑔d\displaystyle\frac{1}{\tau^{2}}+\frac{c_{k,k}^{2}}{\tau^{2}}\sum\limits_{% \begin{subarray}{c}i=1\\ i\neq k\end{subarray}}^{K}\lvert\tilde{\mathbf{g}}_{k}^{\text{T}}\mathbf{p}_{i% }^{\left(\text{d}\right)}\rvert^{2}+\frac{c_{k,k}^{2}}{\beta^{\left(\text{d}% \right)}}\sigma_{n}^{2}+\frac{1}{\tau^{2}}d_{g}^{\left(\text{d}\right)},divide start_ARG 1 end_ARG start_ARG italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG + divide start_ARG italic_c start_POSTSUBSCRIPT italic_k , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_i = 1 end_CELL end_ROW start_ROW start_CELL italic_i ≠ italic_k end_CELL end_ROW end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT | over~ start_ARG bold_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_c start_POSTSUBSCRIPT italic_k , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_β start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT end_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_d start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT , (31)

where dg(c)=|𝐠~k𝐩k(c)|22Re(𝐠~k𝐩k(c))superscriptsubscript𝑑𝑔csuperscriptsubscript~𝐠𝑘subscriptsuperscript𝐩c𝑘22Resubscript~𝐠𝑘subscriptsuperscript𝐩c𝑘d_{g}^{\left(\text{c}\right)}=\lvert\tilde{\mathbf{g}}_{k}\mathbf{p}^{\left(% \text{c}\right)}_{k}\rvert^{2}-2\text{Re}{\left(\tilde{\mathbf{g}}_{k}\mathbf{% p}^{\left(\text{c}\right)}_{k}\right)}italic_d start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT = | over~ start_ARG bold_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_p start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 2 Re ( over~ start_ARG bold_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_p start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) and dg(d)=ck,k2|𝐠~k𝐩k(d)|22Re(ck,k𝐠~k𝐩k(d))superscriptsubscript𝑑𝑔dsuperscriptsubscript𝑐𝑘𝑘2superscriptsubscript~𝐠𝑘subscriptsuperscript𝐩d𝑘22Resubscript𝑐𝑘𝑘subscript~𝐠𝑘subscriptsuperscript𝐩d𝑘d_{g}^{\left(\text{d}\right)}=c_{k,k}^{2}\lvert\tilde{\mathbf{g}}_{k}\mathbf{p% }^{\left(\text{d}\right)}_{k}\rvert^{2}-2\text{Re}{\left(c_{k,k}\tilde{\mathbf% {g}}_{k}\mathbf{p}^{\left(\text{d}\right)}_{k}\right)}italic_d start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT = italic_c start_POSTSUBSCRIPT italic_k , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | over~ start_ARG bold_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_p start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 2 Re ( italic_c start_POSTSUBSCRIPT italic_k , italic_k end_POSTSUBSCRIPT over~ start_ARG bold_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_p start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ). This yields

γk(c)=superscriptsubscript𝛾𝑘cabsent\displaystyle\gamma_{k}^{\left(\text{c}\right)}=italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT = 1dg(c)+i=1ikK|𝐠~kT𝐩i|2+τ2β(c)2σn2,1superscriptsubscript𝑑𝑔csuperscriptsubscript𝑖1𝑖𝑘𝐾superscriptsuperscriptsubscript~𝐠𝑘Tsubscript𝐩𝑖2superscript𝜏2superscript𝛽superscript𝑐2superscriptsubscript𝜎𝑛2\displaystyle\frac{1}{d_{g}^{\left(\text{c}\right)}+\sum\limits_{\begin{% subarray}{c}i=1\\ i\neq k\end{subarray}}^{K}\left\lvert\tilde{\mathbf{g}}_{k}^{\textrm{T}}% \mathbf{p}_{i}\right\rvert^{2}+\frac{\tau^{2}}{\beta^{\left(c\right)^{2}}}% \sigma_{n}^{2}},divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( c ) end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_i = 1 end_CELL end_ROW start_ROW start_CELL italic_i ≠ italic_k end_CELL end_ROW end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT | over~ start_ARG bold_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_β start_POSTSUPERSCRIPT ( italic_c ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT end_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , (34)

and

γk(d)=superscriptsubscript𝛾𝑘dabsent\displaystyle\gamma_{k}^{\left(\text{d}\right)}=italic_γ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT = 1dg(d)+ck,k2i=1ikK|𝐠~kT𝐩i(d)|2+ck,k2τ2β(d)σn2.1superscriptsubscript𝑑𝑔dsuperscriptsubscript𝑐𝑘𝑘2superscriptsubscript𝑖1𝑖𝑘𝐾superscriptsuperscriptsubscript~𝐠𝑘Tsuperscriptsubscript𝐩𝑖d2superscriptsubscript𝑐𝑘𝑘2superscript𝜏2superscript𝛽dsuperscriptsubscript𝜎𝑛2\displaystyle\frac{1}{d_{g}^{\left(\text{d}\right)}+c_{k,k}^{2}\sum\limits_{% \begin{subarray}{c}i=1\\ i\neq k\end{subarray}}^{K}\lvert\tilde{\mathbf{g}}_{k}^{\text{T}}\mathbf{p}_{i% }^{\left(\text{d}\right)}\rvert^{2}+\frac{c_{k,k}^{2}\tau^{2}}{\beta^{\left(% \text{d}\right)}}\sigma_{n}^{2}}.divide start_ARG 1 end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT + italic_c start_POSTSUBSCRIPT italic_k , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL italic_i = 1 end_CELL end_ROW start_ROW start_CELL italic_i ≠ italic_k end_CELL end_ROW end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT | over~ start_ARG bold_g end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_c start_POSTSUBSCRIPT italic_k , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_τ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_β start_POSTSUPERSCRIPT ( d ) end_POSTSUPERSCRIPT end_ARG italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG . (37)

V Numerical Experiments

We assess the performance of the proposed TH precoders via numerical experiments. Throughout the experiments, the large scale fading coefficients are set to

ζk,n=Pk,n10σ(s)zk,n10,subscript𝜁𝑘𝑛subscript𝑃𝑘𝑛superscript10superscript𝜎ssubscript𝑧𝑘𝑛10\zeta_{k,n}=P_{k,n}\cdot 10^{\frac{\sigma^{\left(\textrm{s}\right)}z_{k,n}}{10% }},italic_ζ start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT ⋅ 10 start_POSTSUPERSCRIPT divide start_ARG italic_σ start_POSTSUPERSCRIPT ( s ) end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT end_ARG start_ARG 10 end_ARG end_POSTSUPERSCRIPT , (38)

where Pk,nsubscript𝑃𝑘𝑛P_{k,n}italic_P start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT is the path loss and the scalar 10σ(s)zk,n10superscript10superscript𝜎ssubscript𝑧𝑘𝑛1010^{\frac{\sigma^{\left(\textrm{s}\right)}z_{k,n}}{10}}10 start_POSTSUPERSCRIPT divide start_ARG italic_σ start_POSTSUPERSCRIPT ( s ) end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT end_ARG start_ARG 10 end_ARG end_POSTSUPERSCRIPT include the shadowing effect with standard deviation σ(s)=8superscript𝜎s8\sigma^{\left(\textrm{s}\right)}=8italic_σ start_POSTSUPERSCRIPT ( s ) end_POSTSUPERSCRIPT = 8. The random variable zk,nsubscript𝑧𝑘𝑛z_{k,n}italic_z start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT follows Gaussian distribution with zero mean and unit variance. The path loss was calculated using a three-slope model as

Pk,n={L35log10(dk,n),dk,n>d1L15log10(d1)20log10(dk,n),d0<dk,nd1L15log10(d1)20log10(d0),otherwise,subscript𝑃𝑘𝑛cases𝐿35subscript10subscript𝑑𝑘𝑛subscript𝑑𝑘𝑛subscript𝑑1𝐿15subscript10subscript𝑑120subscript10subscript𝑑𝑘𝑛subscript𝑑0subscript𝑑𝑘𝑛subscript𝑑1𝐿15subscript10subscript𝑑120subscript10subscript𝑑0otherwise,\displaystyle P_{k,n}=\begin{cases}-L-35\log_{10}\left(d_{k,n}\right),&\text{$% d_{k,n}>d_{1}$}\\ -L-15\log_{10}\left(d_{1}\right)-20\log_{10}\left(d_{k,n}\right),&\text{$d_{0}% <d_{k,n}\leq d_{1}$}\\ -L-15\log_{10}\left(d_{1}\right)-20\log_{10}\left(d_{0}\right),&\text{% otherwise,}\end{cases}italic_P start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT = { start_ROW start_CELL - italic_L - 35 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT ) , end_CELL start_CELL italic_d start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT > italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL - italic_L - 15 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - 20 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT ) , end_CELL start_CELL italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_d start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT ≤ italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL - italic_L - 15 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) - 20 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , end_CELL start_CELL otherwise, end_CELL end_ROW (39)

where dk,nsubscript𝑑𝑘𝑛d_{k,n}italic_d start_POSTSUBSCRIPT italic_k , italic_n end_POSTSUBSCRIPT is the distance between the n𝑛nitalic_n-th AP and the k𝑘kitalic_k-th user, d1=50subscript𝑑150d_{1}=50italic_d start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 50 m, d0=10subscript𝑑010d_{0}=10italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 10 m, and the attenuation L𝐿Litalic_L is

L=𝐿absent\displaystyle L=italic_L = 46.3+33.9log10(f)13.82log10(hAP)46.333.9subscript10𝑓13.82subscript10subscriptAP\displaystyle 46.3+33.9\log_{10}\left(f\right)-13.82\log_{10}\left(h_{\textrm{% AP}}\right)46.3 + 33.9 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_f ) - 13.82 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_h start_POSTSUBSCRIPT AP end_POSTSUBSCRIPT )
(1.1log10(f)0.7)hu+(1.56log10(f)0.8),1.1subscript10𝑓0.7subscript𝑢1.56subscript10𝑓0.8\displaystyle-\left(1.1\log_{10}\left(f\right)-0.7\right)h_{u}+\left(1.56\log_% {10}\left(f\right)-0.8\right),- ( 1.1 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_f ) - 0.7 ) italic_h start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT + ( 1.56 roman_log start_POSTSUBSCRIPT 10 end_POSTSUBSCRIPT ( italic_f ) - 0.8 ) , (40)

where hAP=15subscriptAP15h_{\textrm{AP}}=15italic_h start_POSTSUBSCRIPT AP end_POSTSUBSCRIPT = 15 m and hu=1.65subscript𝑢1.65h_{u}=1.65italic_h start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = 1.65 m are the positions of the APs and UEs above the ground, respectively. We consider a frequency of f=1900𝑓1900f=1900italic_f = 1900 MHz. The noise variance is

σn2=TokBBNf,superscriptsubscript𝜎𝑛2subscript𝑇𝑜subscript𝑘𝐵𝐵subscript𝑁𝑓\sigma_{n}^{2}=T_{o}k_{B}BN_{f},italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_T start_POSTSUBSCRIPT italic_o end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT italic_B italic_N start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , (41)

where To=290subscript𝑇𝑜290T_{o}=290italic_T start_POSTSUBSCRIPT italic_o end_POSTSUBSCRIPT = 290 K is the noise temperature, kB=1.381×1023subscript𝑘𝐵1.381superscript1023k_{B}=1.381\times 10^{-23}italic_k start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT = 1.381 × 10 start_POSTSUPERSCRIPT - 23 end_POSTSUPERSCRIPT J/K is the Boltzmann constant, B=50𝐵50B=50italic_B = 50 MHz is the bandwidth and Nf=10subscript𝑁𝑓10N_{f}=10italic_N start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = 10 dB is the noise figure. The signal-to-noise ratio (SNR) is

SNR=PtTr(𝐆T𝐆*)NKσn2,SNRsubscript𝑃𝑡Trsuperscript𝐆Tsuperscript𝐆𝑁𝐾superscriptsubscript𝜎𝑛2\text{SNR}=\frac{P_{t}\textrm{Tr}\left(\mathbf{G}^{\text{T}}\mathbf{G}^{*}% \right)}{NK\sigma_{n}^{2}},SNR = divide start_ARG italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT Tr ( bold_G start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_N italic_K italic_σ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , (42)

where Tr()Tr\textrm{Tr}(\cdot)Tr ( ⋅ ) is the trace of its matrix argument.

For all experiments, we have 128128128128 APs randomly distributed over a square with side equal to 20202020 km. The APs serve a total of 24242424 users, which are geographically distributed. We considered a total of 10,000 channel realizations to compute the ESR. Specifically, we employed 100100100100 channel estimates and, for each channel estimate, we considered 100100100100 error matrices. It follows that the average rate was computed with 100100100100 error matrices.

We first compare the ESR of the proposed precoders with their linear counterparts. We consider that the error in the channel coefficient estimate has a variance of 0.010.010.010.01. Fig. 1 shows the sum-rate performance of the proposed nonlinear precoders against their conventional linear counterparts. The dTHP with sparse channel estimate or “dTHP-SP” performs the best, even better than the linear ZF precoder that employs all the APs (ZF-NW).

Refer to caption
Figure 1: Sum-rate performance of various precoders versus SNR. Here, N=128𝑁128N=128italic_N = 128, K=24𝐾24K=24italic_K = 24, |𝒜k|=24subscript𝒜𝑘24|\mathcal{A}_{k}|=24| caligraphic_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | = 24, |𝒫k|=10subscript𝒫𝑘10|\mathcal{P}_{k}|=10| caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | = 10, σe2=0.01superscriptsubscript𝜎𝑒20.01\sigma_{e}^{2}=0.01italic_σ start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 0.01.
Refer to caption
Figure 2: Sum-rate performance of precoders versus CSIT quality. Here, N=128𝑁128N=128italic_N = 128, K=24𝐾24K=24italic_K = 24, |𝒜k|=24subscript𝒜𝑘24|\mathcal{A}_{k}|=24| caligraphic_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | = 24, |𝒫k|=10subscript𝒫𝑘10|\mathcal{P}_{k}|=10| caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | = 10, SNR=15dBSNR15dB\textrm{SNR}=15~{}\textrm{dB}SNR = 15 dB.

In the second experiment, we assessed the sum-rate performance at SNR=15absent15=15= 15 dB with respect to CSIT quality (Fig. 2). The proposed dTHP with reduced dimensions or “dTHP-RD” outperforms the ZF-RD precoder. The RD precoding techniques have reduced computational complexity than the corresponding SP precoders. We observe that our proposed nonlinear cluster-based precoders generally yield better ESR than their linear counterparts.

VI Summary

We proposed clustered nonlinear precoders based on the noninear THP algorithm. Our proposed THP-SP reduces the signaling load and the THP-RD additionally lowers the computational complexity at the expense of performance. Note that the reduction of the computational complexity is critical for practical applications. Numerical experiments showed that the proposed cluster-based nonlinear precoders yield better performance and robustness against CSIT uncertainties than the conventional linear precoders.

References

  • [1] H. Tataria, M. Shafi, A. F. Molisch, M. Dohler, H. Sjöland, and F. Tufvesson, “6G wireless systems: Vision, requirements, challenges, insights, and opportunities,” Proceedings of the IEEE, vol. 109, no. 7, pp. 1166–1199, 2021.
  • [2] M. Giordani, M. Polese, M. Mezzavilla, S. Rangan, and M. Zorzi, “Toward 6G networks: Use cases and technologies,” IEEE Communications Magazine, vol. 58, no. 3, pp. 55–61, 2020.
  • [3] R. C. de Lamare, “Massive MIMO systems: Signal processing challenges and future trends,” URSI Radio Science Bulletin, vol. 2013, no. 347, pp. 8–20, 2013.
  • [4] W. Zhang, H. Ren, C. Pan, M. Chen, R. C. de Lamare, B. Du, and J. Dai, “Large-scale antenna systems with ul/dl hardware mismatch: Achievable rates analysis and calibration,” IEEE Transactions on Communications, vol. 63, no. 4, pp. 1216–1229, 2015.
  • [5] H. A. Ammar, R. Adve, S. Shahbazpanahi, G. Boudreau, and K. V. Srinivas, “User-centric cell-free massive MIMO networks: A survey of opportunities, challenges and solutions,” IEEE Communications Surveys & Tutorials, vol. 24, no. 1, pp. 611–652, 2022.
  • [6] S. Elhoushy, M. Ibrahim, and W. Hamouda, “Cell-free massive MIMO: A survey,” IEEE Communications Surveys & Tutorials, vol. 24, no. 1, pp. 492–523, 2022.
  • [7] M. Attarifar, A. Abbasfar, and A. Lozano, “Subset MMSE receivers for cell-free networks,” IEEE Transactions on Wireless Communications, vol. 19, no. 6, pp. 4183–4194, 2020.
  • [8] H. Yang and T. L. Marzetta, “Energy efficiency of massive MIMO: Cell-free vs. cellular,” in IEEE Vehicular Technology Conference - Spring, 2018.
  • [9] S. Elhoushy and W. Hamouda, “Towards high data rates in dynamic environments using hybrid cell-free massive MIMO/small-cell system,” IEEE Wireless Communications Letters, vol. 10, no. 2, pp. 201–205, 2021.
  • [10] H. Q. Ngo, L.-N. Tran, T. Q. Duong, M. Matthaiou, and E. G. Larsson, “On the total energy efficiency of cell-free massive MIMO,” IEEE Transactions on Green Communications and Networking, vol. 2, no. 1, pp. 25–39, 2018.
  • [11] J. Zhang, S. Chen, Y. Lin, J. Zheng, B. Ai, and L. Hanzo, “Cell-free massive MIMO: A new next-generation paradigm,” IEEE Access, vol. 7, pp. 99 878–99 888, 2019.
  • [12] S.-N. **, D.-W. Yue, and H. H. Nguyen, “Spectral and energy efficiency in cell-free massive MIMO systems over correlated Rician fading,” IEEE Systems Journal, vol. 15, no. 2, pp. 2822–2833, 2021.
  • [13] E. Nayebi, A. Ashikhmin, T. L. Marzetta, H. Yang, and B. D. Rao, “Precoding and power optimization in cell-free massive MIMO systems,” IEEE Transactions on Wireless Communications, vol. 16, no. 7, pp. 4445–4459, 2017.
  • [14] E. Björnson and L. Sanguinetti, “Making cell-free massive MIMO competitive with MMSE processing and centralized implementation,” IEEE Transactions on Wireless Communications, vol. 19, no. 1, pp. 77–90, 2020.
  • [15] K. Zu, R. C. de Lamare, and M. Haardt, “Multi-branch Tomlinson-Harashima precoding design for MU-MIMO systems: Theory and algorithms,” IEEE Transactions on Communications, vol. 62, no. 3, pp. 939–951, 2014.
  • [16] L. Zhang, Y. Cai, R. C. de Lamare, and M. Zhao, “Robust multibranch Tomlinson–Harashima precoding design in amplify-and-forward MIMO relay systems,” IEEE Transactions on Communications, vol. 62, no. 10, pp. 3476–3490, 2014.
  • [17] A. R. Flores, R. C. De Lamare, and B. Clerckx, “Tomlinson-Harashima precoded rate-splitting with stream combiners for MU-MIMO systems,” IEEE Transactions on Communications, vol. 69, no. 6, pp. 3833–3845, 2021.
  • [18] M. Joham, W. Utschick, and J. Nossek, “Linear transmit processing in MIMO communications systems,” IEEE Transactions on Signal Processing, vol. 53, no. 8, pp. 2700–2712, 2005.
  • [19] H. Ruan and R. C. de Lamare, “Distributed robust beamforming based on low-rank and cross-correlation techniques: Design and analysis,” IEEE Transactions on Signal Processing, vol. 67, no. 24, pp. 6411–6423, 2019.
  • [20] V. M. Palhares, R. C. de Lamare, A. R. Flores, and L. T. Landau, “Iterative AP selection, MMSE precoding and power allocation in cell-free massive MIMO systems,” IET Communications, vol. 14, no. 22, pp. 3996–4006, 2020.
  • [21] A. R. Flores, R. C. de Lamare, and B. Clerckx, “Linear precoding and stream combining for rate splitting in multiuser MIMO systems,” IEEE Communications Letters, vol. 24, no. 4, pp. 890–894, 2020.
  • [22] S. Mashdour, R. C. de Lamare, and J. P. S. H. Lima, “Enhanced subset greedy multiuser scheduling in clustered cell-free massive mimo systems,” IEEE Communications Letters, vol. 27, no. 2, pp. 610–614, 2023.
  • [23] A. R. Flores, R. C. de Lamare, and K. V. Mishra, “Clustered cell-free multi-user multiple-antenna systems with rate-splitting: Precoder design and power allocation,” IEEE Transactions on Communications, vol. 71, no. 10, pp. 5920–5934, 2023.
  • [24] H. Q. Ngo, A. Ashikhmin, H. Yang, E. G. Larsson, and T. L. Marzetta, “Cell-free massive MIMO versus small cells,” IEEE Transactions on Wireless Communications, vol. 16, no. 3, pp. 1834–1850, 2017.
  • [25] L. D. Nguyen, T. Q. Duong, H. Q. Ngo, and K. Tourki, “Energy efficiency in cell-free massive MIMO with zero-forcing precoding design,” IEEE Communications Letters, vol. 21, no. 8, pp. 1871–1874, 2017.
  • [26] S. Buzzi, C. D’Andrea, A. Zappone, and C. D’Elia, “User-centric 5G cellular networks: Resource allocation and comparison with the cell-free massive MIMO approach,” IEEE Transactions on Wireless Communications, vol. 19, no. 2, pp. 1250–1264, 2020.
  • [27] E. Björnson and L. Sanguinetti, “Scalable cell-free massive MIMO systems,” IEEE Transactions on Communications, vol. 68, no. 7, pp. 4247–4261, 2020.
  • [28] M. M. Mojahedian and A. Lozano, “Subset regularized zero-forcing precoders for cell-free C-RANs,” in European Signal Processing Conference, 2021, pp. 915–919.
  • [29] M. A. Albreem, A. H. Al Habbash, A. M. Abu-Hudrouss, and S. S. Ikki, “Overview of precoding techniques for massive MIMO,” IEEE Access, vol. 9, pp. 60 764–60 801, 2021.
  • [30] R. Fischer, C. Windpassinger, A. Lampe, and J. Huber, “Tomlinson-Harashima precoding in space-time transmission for low-rate backward channel,” in International Zurich Seminar on Broadband Communications Access - Transmission - Networking, 2002, pp. 7–7.
  • [31] R. C. De Lamare and R. Sampaio-Neto, “Minimum mean-squared error iterative successive parallel arbitrated decision feedback detectors for ds-cdma systems,” IEEE Transactions on Communications, vol. 56, no. 5, pp. 778–789, 2008.
  • [32] M. Vu and A. Paulraj, “MIMO wireless linear precoding,” IEEE Signal Processing Magazine, vol. 24, no. 5, pp. 86–105, 2007.
  • [33] Y. Chen, “Low complexity precoding schemes for massive MIMO systems,” Ph.D. dissertation, Newcastle University, 2019.
  • [34] K. Zu, R. C. de Lamare, and M. Haardt, “Generalized design of low-complexity block diagonalization type precoding algorithms for multiuser mimo systems,” IEEE Transactions on Communications, vol. 61, no. 10, pp. 4232–4242, 2013.
  • [35] Y. Cai, R. C. d. Lamare, and R. Fa, “Switched interleaving techniques with limited feedback for interference mitigation in ds-cdma systems,” IEEE Transactions on Communications, vol. 59, no. 7, pp. 1946–1956, 2011.
  • [36] Y. Cai, R. C. de Lamare, and D. Le Ruyet, “Transmit processing techniques based on switched interleaving and limited feedback for interference mitigation in multiantenna mc-cdma systems,” IEEE Transactions on Vehicular Technology, vol. 60, no. 4, pp. 1559–1570, 2011.
  • [37] W. Zhang, R. C. de Lamare, C. Pan, M. Chen, J. Dai, B. Wu, and X. Bao, “Widely linear precoding for large-scale mimo with iqi: Algorithms and performance analysis,” IEEE Transactions on Wireless Communications, vol. 16, no. 5, pp. 3298–3312, 2017.
  • [38] Y. Cai, R. C. de Lamare, L.-L. Yang, and M. Zhao, “Robust mmse precoding based on switched relaying and side information for multiuser mimo relay systems,” IEEE Transactions on Vehicular Technology, vol. 64, no. 12, pp. 5677–5687, 2015.
  • [39] S. F. B. Pinto and R. C. de Lamare, “Block diagonalization precoding and power allocation for multiple-antenna systems with coarsely quantized signals,” IEEE Transactions on Communications, vol. 69, no. 10, pp. 6793–6807, 2021.
  • [40] A. R. Flores, R. C. de Lamare, and B. Clerckx, “Linear precoding and stream combining for rate splitting in multiuser mimo systems,” IEEE Communications Letters, vol. 24, no. 4, pp. 890–894, 2020.
  • [41] L. T. N. Landau and R. C. de Lamare, “Branch-and-bound precoding for multiuser mimo systems with 1-bit quantization,” IEEE Wireless Communications Letters, vol. 6, no. 6, pp. 770–773, 2017.
  • [42] D. M. V. Melo, L. T. N. Landau, R. C. de Lamare, P. F. Neuhaus, and G. P. Fettweis, “Zero-crossing precoding techniques for channels with 1-bit temporal oversampling adcs,” IEEE Transactions on Wireless Communications, vol. 22, no. 8, pp. 5321–5336, 2023.