License: CC BY-NC-ND 4.0
arXiv:2312.15921v1 [cs.IT] 26 Dec 2023

Hybrid Precoder Design for Angle-of-Departure Estimation with Limited-Resolution Phase Shifters

Hui** Huang, Member, IEEE, Musa Furkan Keskin, Member, IEEE, Henk Wymeersch, Fellow, IEEE,
Xuesong Cai, Senior Member, IEEE, Linlong Wu, Member, IEEE, Johan Thunberg,
Fredrik Tufvesson, Fellow, IEEE
This paper is supported by the Vinnova B5GPOS Project under Grant 2022-01640.H. Huang, M. F. Keskin, and H. Wymeersch are with Department of Electrical Engineering, Chalmers University of Technology, 41296 Gothenburg, Sweden (e-mail: {hui**; furkan; henkw}@chalmers.se).X. Cai, J. Thunberg, and F. Tufvesson are with Department of Electrical and Information Technology, Lund University, 22100 Lund, Sweden (e-mail: {xuesong.cai; johan.thunberg; fredrik.tufvesson}@eit.lth.se).L. Wu is with the Interdisciplinary Centre for Security, Reliability and Trust (SnT), University of Luxembourg, 1855 Luxembourg, Luxembourg (e-mail: [email protected]).
Abstract

Hybrid analog-digital beamforming stands out as a key enabler for future communication systems with a massive number of antennas. In this paper, we investigate the hybrid precoder design problem for angle-of-departure (AoD) estimation, where we take into account the practical constraint on the limited resolution of phase shifters. Our goal is to design a radio-frequency (RF) precoder and a base-band (BB) precoder to estimate AoD of the user with a high accuracy. To this end, we propose a two-step strategy where we first obtain the fully digital precoder that minimizes the angle error bound, and then the resulting digital precoder is decomposed into an RF precoder and a BB precoder, based on the alternating optimization and the alternating direction method of multipliers. Besides, we derive the quantization error upper bound and analyse the convergence behavior of the proposed algorithm. Numerical results demonstrate the superior performance of the proposed method over state-of-the-art baselines.

Index Terms:
Hybrid beamforming, hybrid precoder, phase shifter, angle-of-departure estimation, alternating optimization, alternating direction method of multipliers.

I Introduction

Millimeter wave (mmWave) and terahertz (THz) band have been proven to play an important role in future wireless systems, because they can provide ultra-high data rates [1, 2, 3, 4, 5, 6, 7, 8]. However, high carrier frequencies result in severe path loss. Large-scale antenna systems, which are equipped with hundreds or even thousands of antennas, have emerged as a crucial technology for addressing this problem [9, 10, 11, 12, 13].

It is not feasible for large-scale antenna systems to employ fully digital beamforming at mmWave/THz, since fully digital beamforming requires as many radio-frequency (RF) chains (including digital-to-analog converters, mixers, etc.) as the antennas, leading to prohibitive hardware costs and power consumption [14]. On the contrary, hybrid beamforming where only a small number of RF chains are needed is a promising solution to handle this problem [15]. The RF chains are connected to antennas via phase shifters with a finite number of quantized phases [16, 17, 18, 19, 20, 21, 22].

Numerous works have been devoted to hybrid beamformer (precoder and/or combiner) design with practical constraints [23, 24, 25, 18, 26, 27, 28, 29, 30, 31, 32, 19, 33, 20, 21, 22, 34, 35]. Among them, the following four methods attract much attention. (i) The authors in [18] proposed a hybrid beamforming algorithm with 1-bit resolution phase shifter, which is based on alternating optimization framework and the Babai algorithm [36] (termed as “Alt-Babai”). (ii) An iterative hybrid transceiver design approach using alternating optimization and coordinate descent method (CDM) was developed in [19] (termed as “Alt-CDM”). (iii) [24] exploited the spatial structure of mmWave channels and proposed a method for optimal unconstrained precoders and combiners, which employs sparse representation and orthogonal matching pursuit (termed as “Spa-OMP”). (iv) Another hybrid precoding method was presented in [29], which is on the basis of the manifold optimization [37, 38, 39] (termed as “ManiOpt”). All the above-mentioned hybrid beamforming design methods are from the communications perspective. In contrast, much less work has focused on hybrid beamforming design for channel parameters (such as angles, delays, Dopplers, etc.) estimation and positioning. Although optimal beamforming design for positioning has been investigated in e.g., [40, 41, 42, 43], these works investigated fully digital beamforming rather than hybrid beamforming. Note that existing hybrid beamformer design methods in [23, 24, 25, 18, 26, 27, 28, 29, 30, 31, 32, 19, 33, 20, 21, 22, 34, 35] can be applied for positioning. However, these methods do not guarantee a good performance in positioning (since they are proposed for the purpose of communications). Therefore, there is lack of dedicated hybrid beamforming design for the purpose of channel parameters estimation and positioning.

To fill the research gap, in this paper we delve into the intricate problem of hybrid precoder design for angle-of-departure (AoD) estimation, accounting for practical limitation on the finite resolution of phase shifters. Our objective is to derive a solution comprising an RF precoder and a base-band (BB) precoder that not only adheres to the practical constraint but also facilitates precise user AoD estimation. To achieve this goal, we present a two-step approach. We first find a fully digital precoder that minimizes the angle error bound, which is the theoretical lower bound on AoD estimation. Then, we decompose the resulting digital precoder into an RF precoder and a BB precoder, by using alternating optimization framework and the alternating direction method of multipliers (ADMM). The numerical results show that the proposed method outperforms existing state-of-the-art approaches while incurring less complexity. The main contributions of this work are listed as follows:

  • The problem of hybrid beamformer design under practical constraints has not yet been considered for positioning (specifically AoD estimation). We consider such a problem and develop an efficient algorithm to obtain the RF precoder and BB precoder.

  • In the existing literature of hybrid beamforming with limited-resolution phase shifters, e.g., [23, 24, 25, 18, 26, 27, 28, 29, 30, 31, 32, 19, 33, 20, 21, 22, 34, 35], no theoretical results are available regarding the quantization error bound. In this paper, we derive such an error bound.

  • We provide convergence analyses of the proposed algorithm. Our analyses differ from the related works in [44, 45, 46, 47, 48, 49] since our algorithm involves a quantization operation, which is not the case in the related works.

  • The convergence analyses presented in this paper go beyond our previous works in [47, 48, 49], as the former additionally reveal that the point sequence produced by the proposed algorithm is a Cauchy sequence and it converges to a fixed point after a finite number of iterations.

The remainder of this paper is organized as follows. The system model is described in Section II. Section III presents the proposed method for hybrid precoder design for AoD estimation. Section IV analyzes the quantization error bound and convergence behavior of the proposed algorithm. Various numerical examples are provided in Section V to demonstrate the effectiveness of the proposed approach, followed by conclusions in Section VI.

II System Model

We consider a mmWave downlink positioning scenario as in [50], shown in Fig. 1, where the base station (BS) consists of a BB precoder, an RF precoder, and a uniform linear array (ULA) of NTxsubscript𝑁TxN_{\text{Tx}}italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT antennas; while the user equipment (UE) consists of a single antenna. The RF precoder is implemented by limited-resolution phase shifters.

The BS transmits M𝑀Mitalic_M pilot symbols sequentially with identical power, denoted as sm,m=1,2,,Mformulae-sequencesubscript𝑠𝑚𝑚12𝑀s_{m},m=1,2,\cdots,Mitalic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_m = 1 , 2 , ⋯ , italic_M. Employing a two-timescale hybrid precoding approach [51, 52, 53], we adopt a transmission model in which the analog RF precoder is optimized at a slower time scale compared to the digital BB precoder. This prevents high hardware costs (attributed to rapid adaptation of the analog precoder) and reduces computational complexity, along with minimizing signaling overhead [51]. In particular, each symbol is first precoded by a dedicated BB precoder vector, 𝐟BB,mNRFsubscript𝐟BB𝑚superscriptsubscript𝑁RF{\bf f}_{\text{BB},m}\in\mathbb{C}^{N_{\text{RF}}}bold_f start_POSTSUBSCRIPT BB , italic_m end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, and then precoded by an RF precoder constant for all symbols, 𝐅RFNTx×NRFsubscript𝐅RFsuperscriptsubscript𝑁Txsubscript𝑁RF{\bf F}_{\text{RF}}\in\mathbb{C}^{N_{\text{Tx}}\times N_{\text{RF}}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, where NRFNTxsubscript𝑁RFsubscript𝑁TxN_{\text{RF}}\leq N_{\text{Tx}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT ≤ italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT denotes the number of RF chains. Considering highly directional mmWave transmissions, we assume a line-of-sight (LOS)-only channel111The LOS path is resolvable from the non-line-of-sight paths due to channel sparsity, large number of antennas, and large bandwidth in the mmWave/THz wireless communication systems [5].. Thus, corresponding to the transmitted signal smsubscript𝑠𝑚s_{m}italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT, the received signal at the single-antenna UE can be modeled as

ym=β𝐚T(θ)𝐅RF𝐟BB,msm+nm,m=1,2,,M,formulae-sequencesubscript𝑦𝑚𝛽superscript𝐚T𝜃subscript𝐅RFsubscript𝐟BB𝑚subscript𝑠𝑚subscript𝑛𝑚𝑚12𝑀\displaystyle y_{m}=\beta{\bf a}^{\textrm{T}}(\theta){\bf F}_{\text{RF}}{\bf f% }_{\text{BB},m}s_{m}+n_{m},~{}m=1,2,\cdots,M,italic_y start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_β bold_a start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT ( italic_θ ) bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_f start_POSTSUBSCRIPT BB , italic_m end_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_m = 1 , 2 , ⋯ , italic_M , (1)

where β𝛽\beta\in\mathbb{C}italic_β ∈ blackboard_C is the complex amplitude of the path, θ𝜃\thetaitalic_θ is the AoD, nmsubscript𝑛𝑚n_{m}italic_n start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT is the complex additive white Gaussian noise with zero mean and variance σn2superscriptsubscript𝜎n2\sigma_{\text{n}}^{2}italic_σ start_POSTSUBSCRIPT n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, and the steering vector is

𝐚(θ)=[1,eȷ2πdλsinθ,,eȷ2πdλ(NTx1)sinθ]T,𝐚𝜃superscript1superscript𝑒italic-ȷ2𝜋𝑑𝜆𝜃superscript𝑒italic-ȷ2𝜋𝑑𝜆subscript𝑁Tx1𝜃T\displaystyle{\bf a}(\theta)=\left[1,e^{-\jmath\frac{2\pi d}{\lambda}\sin{% \theta}},\cdots,e^{-\jmath\frac{2\pi d}{\lambda}(N_{\text{Tx}}-1)\sin{\theta}}% \right]^{\textrm{T}},bold_a ( italic_θ ) = [ 1 , italic_e start_POSTSUPERSCRIPT - italic_ȷ divide start_ARG 2 italic_π italic_d end_ARG start_ARG italic_λ end_ARG roman_sin italic_θ end_POSTSUPERSCRIPT , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_ȷ divide start_ARG 2 italic_π italic_d end_ARG start_ARG italic_λ end_ARG ( italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT - 1 ) roman_sin italic_θ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT , (2)

with ȷ=1italic-ȷ1\jmath=\sqrt{-1}italic_ȷ = square-root start_ARG - 1 end_ARG, d𝑑ditalic_d being the element spacing of the ULA, and λ𝜆\lambdaitalic_λ denoting the transmit signal wavelength.

The signal model (1) can be written in vector form as

𝐲=β𝐒(𝐅RF𝐅BB)T𝐚(θ)+𝐧,𝐲𝛽𝐒superscriptsubscript𝐅RFsubscript𝐅BBT𝐚𝜃𝐧\displaystyle{\bf y}=\beta{\bf S}({\bf F}_{\text{RF}}{\bf F}_{\text{BB}})^{% \textrm{T}}{\bf a}(\theta)+{\bf n},bold_y = italic_β bold_S ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_a ( italic_θ ) + bold_n , (3)

where 𝐲=[y1,y2,,yM]TM𝐲superscriptsubscript𝑦1subscript𝑦2subscript𝑦𝑀Tsuperscript𝑀{\bf y}\!=\![{y}_{1},{y}_{2},\cdots,{y}_{M}]^{\textrm{T}}\in\mathbb{C}^{M}bold_y = [ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_y start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT, 𝐧=[n1,n2,,nM]TM𝐧superscriptsubscript𝑛1subscript𝑛2subscript𝑛𝑀Tsuperscript𝑀{\bf n}\!=\![{n}_{1},{n}_{2},\cdots,{n}_{M}]^{\textrm{T}}\in\mathbb{C}^{M}bold_n = [ italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_n start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT, 𝐅BB=[𝐟BB,1,𝐟BB,2,,𝐟BB,M]NRF×Msubscript𝐅BBsubscript𝐟BB1subscript𝐟BB2subscript𝐟BB𝑀superscriptsubscript𝑁RF𝑀{\bf F}_{\text{BB}}=[{\bf f}_{\text{BB},1},{\bf f}_{\text{BB},2},\cdots,{\bf f% }_{\text{BB},M}]\in\mathbb{C}^{N_{\text{RF}}\times M}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT = [ bold_f start_POSTSUBSCRIPT BB , 1 end_POSTSUBSCRIPT , bold_f start_POSTSUBSCRIPT BB , 2 end_POSTSUBSCRIPT , ⋯ , bold_f start_POSTSUBSCRIPT BB , italic_M end_POSTSUBSCRIPT ] ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT × italic_M end_POSTSUPERSCRIPT, and 𝐒=diag{[s1,s2,,sM]}𝐒diagsubscript𝑠1subscript𝑠2subscript𝑠𝑀{\bf S}=\textrm{diag}\{[s_{1},s_{2},\cdots,s_{M}]\}bold_S = diag { [ italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ] } with diag{}diag\textrm{diag}\{\cdot\}diag { ⋅ } representing the diagonal matrix operator. In addition, 𝐧𝒞𝒩(𝟎,σn2𝐈)similar-to𝐧𝒞𝒩0superscriptsubscript𝜎n2𝐈{\bf n}\sim\mathcal{CN}({\bf 0},\sigma_{\text{n}}^{2}{\bf I})bold_n ∼ caligraphic_C caligraphic_N ( bold_0 , italic_σ start_POSTSUBSCRIPT n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I ) with known noise power σn2superscriptsubscript𝜎n2\sigma_{\text{n}}^{2}italic_σ start_POSTSUBSCRIPT n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Our goal is to design an RF precoder and a BB precoder such that the accuracy of estimation of AoD is maximized, under the BS transmit power constraint and the hardware constraint on the limited resolution of phase shifters.

Refer to caption

Figure 1: Illustration of a mmWave downlink positioning scenario, where the single-antenna UE receives the pilot signals transmitted from the BS (consisting of a BB precoder, an RF precoder, and a ULA of multiple antennas).

III Proposed Method

III-A CRB-Based Performance Metric

Define 𝐲~β𝐒(𝐅RF𝐅BB)T𝐚(θ)~𝐲𝛽𝐒superscriptsubscript𝐅RFsubscript𝐅BBT𝐚𝜃{\bf{\tilde{y}}}\triangleq\beta{\bf S}({\bf F}_{\text{RF}}{\bf F}_{\text{BB}})% ^{\textrm{T}}{\bf a}(\theta)over~ start_ARG bold_y end_ARG ≜ italic_β bold_S ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_a ( italic_θ ). Then, the Fisher information matrix (FIM) 𝐉(𝐅RF,𝐅BB;𝐱)3×3𝐉subscript𝐅RFsubscript𝐅BB𝐱superscript33{\bf J}({\bf F}_{\text{RF}},{\bf F}_{\text{BB}};{\bf x})\in\mathbb{R}^{3\times 3}bold_J ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ; bold_x ) ∈ blackboard_R start_POSTSUPERSCRIPT 3 × 3 end_POSTSUPERSCRIPT can be computed by using the Slepian-Bangs formula [54] as

[𝐉]ij=2σn2{(𝐲~[𝐱]i)H(𝐲~[𝐱]j)},subscriptdelimited-[]𝐉𝑖𝑗2superscriptsubscript𝜎n2superscript~𝐲subscriptdelimited-[]𝐱𝑖H~𝐲subscriptdelimited-[]𝐱𝑗\displaystyle[{\bf J}]_{ij}=\frac{2}{\sigma_{\text{n}}^{2}}\Re\left\{\left(% \frac{\partial{\bf{\tilde{y}}}}{\partial[{\bf x}]_{i}}\right)^{\textrm{H}}% \left(\frac{\partial{\bf{\tilde{y}}}}{\partial[{\bf x}]_{j}}\right)\right\},[ bold_J ] start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = divide start_ARG 2 end_ARG start_ARG italic_σ start_POSTSUBSCRIPT n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG roman_ℜ { ( divide start_ARG ∂ over~ start_ARG bold_y end_ARG end_ARG start_ARG ∂ [ bold_x ] start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ( divide start_ARG ∂ over~ start_ARG bold_y end_ARG end_ARG start_ARG ∂ [ bold_x ] start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG ) } , (4)

where 𝐱=[θ,βR,βI]T𝐱superscript𝜃subscript𝛽Rsubscript𝛽IT{\bf x}=[\theta,\beta_{\text{R}},\beta_{\text{I}}]^{\textrm{T}}bold_x = [ italic_θ , italic_β start_POSTSUBSCRIPT R end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT I end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT contains all the unknown parameters, [𝐉]ijsubscriptdelimited-[]𝐉𝑖𝑗[{\bf J}]_{ij}[ bold_J ] start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT is the entry of 𝐉𝐉{\bf J}bold_J in the i𝑖iitalic_i-th row and j𝑗jitalic_j-th column, and [𝐱]isubscriptdelimited-[]𝐱𝑖[{\bf x}]_{i}[ bold_x ] start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the i𝑖iitalic_i-th entry of 𝐱𝐱{\bf x}bold_x. In addition, βRsubscript𝛽R\beta_{\text{R}}italic_β start_POSTSUBSCRIPT R end_POSTSUBSCRIPT and βIsubscript𝛽I\beta_{\text{I}}italic_β start_POSTSUBSCRIPT I end_POSTSUBSCRIPT denote the real and imaginary parts of β𝛽\betaitalic_β, respectively. The derivative of 𝐲~~𝐲{\bf{\tilde{y}}}over~ start_ARG bold_y end_ARG with respect to (w.r.t.) [𝐱]isubscriptdelimited-[]𝐱𝑖[{\bf x}]_{i}[ bold_x ] start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is calculated as in Appendix A. The corresponding Cramér-Rao bound (CRB) matrix is defined as

𝐂=𝐉1.𝐂superscript𝐉1\displaystyle{\bf C}={\bf J}^{-1}.bold_C = bold_J start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT . (5)

To quantify the AoD estimation accuracy, we adopt the angle error bound (AEB) as our performance metric, computed as (6) displayed at the top of the next page, where σssubscript𝜎s\sigma_{\text{s}}italic_σ start_POSTSUBSCRIPT s end_POSTSUBSCRIPT is the signal power, 𝐃diag{0,1,,NTx1}𝐃diag01subscript𝑁Tx1{\bf D}\triangleq\text{diag}\{0,1,\cdots,N_{\text{Tx}-1}\}bold_D ≜ diag { 0 , 1 , ⋯ , italic_N start_POSTSUBSCRIPT Tx - 1 end_POSTSUBSCRIPT }, 𝐅=𝐅RF𝐅BB𝐅subscript𝐅RFsubscript𝐅BB{\bf F}={\bf F}_{\text{RF}}{\bf F}_{\text{BB}}bold_F = bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT, and we have employed the block matrix inversion lemma [55] as detailed in Appendix B.

AEB(𝐅RF,𝐅BB;𝐱)=[𝐂]11=σnσsλ22πd𝐚H(θ)𝐅*𝐅T𝐚(θ)𝐚H(θ)𝐅*𝐅T[|β|2𝐚(θ)𝐚H(θ)𝐃𝐅*𝐅T𝐃𝐃𝐚(θ)𝐚H(θ)𝐃𝐅*𝐅T]𝐚(θ)AEBsubscript𝐅RFsubscript𝐅BB𝐱subscriptdelimited-[]𝐂11subscript𝜎nsubscript𝜎s𝜆22𝜋𝑑superscript𝐚H𝜃superscript𝐅superscript𝐅T𝐚𝜃superscript𝐚H𝜃superscript𝐅superscript𝐅Tdelimited-[]superscript𝛽2𝐚𝜃superscript𝐚H𝜃superscript𝐃𝐅superscript𝐅T𝐃𝐃𝐚𝜃superscript𝐚H𝜃superscript𝐃𝐅superscript𝐅T𝐚𝜃\displaystyle\text{AEB}({\bf F}_{\text{RF}},{\bf F}_{\text{BB}};{\bf x})=\sqrt% {[{\bf C}]_{11}}=\frac{\sigma_{\text{n}}}{\sigma_{\text{s}}}\frac{\lambda}{2% \sqrt{2}\pi d}\sqrt{\frac{{\bf a}^{\textrm{H}}(\theta){\bf F}^{*}{\bf F}^{% \textrm{T}}{\bf a}(\theta)}{{\bf a}^{\textrm{H}}(\theta){\bf F}^{*}{\bf F}^{% \textrm{T}}\big{[}|\beta|^{2}{\bf a}(\theta){\bf a}^{\textrm{H}}(\theta){\bf D% }{\bf F}^{*}{\bf F}^{\textrm{T}}{\bf D}-{\bf D}{\bf a}(\theta){\bf a}^{\textrm% {H}}(\theta){\bf D}{\bf F}^{*}{\bf F}^{\textrm{T}}\big{]}{\bf a}(\theta)}}AEB ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ; bold_x ) = square-root start_ARG [ bold_C ] start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT end_ARG = divide start_ARG italic_σ start_POSTSUBSCRIPT n end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT s end_POSTSUBSCRIPT end_ARG divide start_ARG italic_λ end_ARG start_ARG 2 square-root start_ARG 2 end_ARG italic_π italic_d end_ARG square-root start_ARG divide start_ARG bold_a start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ( italic_θ ) bold_F start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT bold_F start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_a ( italic_θ ) end_ARG start_ARG bold_a start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ( italic_θ ) bold_F start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT bold_F start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT [ | italic_β | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_a ( italic_θ ) bold_a start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ( italic_θ ) bold_DF start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT bold_F start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_D - bold_Da ( italic_θ ) bold_a start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ( italic_θ ) bold_DF start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT bold_F start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT ] bold_a ( italic_θ ) end_ARG end_ARG (6)

III-B Problem Formulation for Optimal Precoder Design

The AEB depends on the unknown parameters in 𝐱𝐱{\bf x}bold_x. We assume that 𝐱𝐱{\bf x}bold_x belongs to an uncertainty set 𝒳𝒳\mathcal{X}caligraphic_X that can be, e.g., determined via some tracking algorithms [56, 40, 41]. For any 𝐱𝒳𝐱𝒳{\bf x}\in\mathcal{X}bold_x ∈ caligraphic_X, the AEB is only a function of 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT and 𝐅BBsubscript𝐅BB{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT. The optimal precoder design problem can be formulated as

min𝐅RF,𝐅BBsubscriptsubscript𝐅RFsubscript𝐅BB\displaystyle\min_{{\bf F}_{\text{RF}},{\bf F}_{\text{BB}}}roman_min start_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT end_POSTSUBSCRIPT AEB(𝐅RF,𝐅BB;𝐱)AEBsubscript𝐅RFsubscript𝐅BB𝐱\displaystyle~{}\text{AEB}({\bf F}_{\text{RF}},{\bf F}_{\text{BB}};{\bf x})AEB ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ; bold_x ) (7a)
s.t. 𝐅RF𝐅BBF2=P,superscriptsubscriptnormsubscript𝐅RFsubscript𝐅BBF2𝑃\displaystyle~{}\|{\bf F}_{\text{RF}}{\bf F}_{\text{BB}}\|_{\text{F}}^{2}=P,∥ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_P , (7b)
[𝐅RF]ij,1iNTxand1jNRF,formulae-sequencesubscriptdelimited-[]subscript𝐅RF𝑖𝑗1𝑖subscript𝑁Txand1𝑗subscript𝑁RF\displaystyle~{}\left[{\bf F}_{\text{RF}}\right]_{ij}\in\mathcal{F},~{}~{}1% \leq i\leq N_{\text{Tx}}~{}\text{and}~{}1\leq j\leq N_{\text{RF}},[ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ∈ caligraphic_F , 1 ≤ italic_i ≤ italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT and 1 ≤ italic_j ≤ italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , (7c)

where P𝑃Pitalic_P stands for the total transmit power of the BS antennas, and \mathcal{F}caligraphic_F denotes the set for limited resolution of the phase shifters, which is defined as:

{1NTxeȷ2πb/2B|b=0,1,,2B1},conditional-set1subscript𝑁Txsuperscript𝑒italic-ȷ2𝜋𝑏superscript2𝐵𝑏01superscript2𝐵1\displaystyle\mathcal{F}\triangleq\left\{\frac{1}{\sqrt{N_{\text{Tx}}}}e^{% \jmath 2\pi b/2^{B}}\bigg{|}b=0,1,\cdots,2^{B}\!-\!1\right\},caligraphic_F ≜ { divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT end_ARG end_ARG italic_e start_POSTSUPERSCRIPT italic_ȷ 2 italic_π italic_b / 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT | italic_b = 0 , 1 , ⋯ , 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT - 1 } , (8)

with B𝐵Bitalic_B representing the total number of quantization bits of the phase shifters.

III-C Two-Step Strategy for Solving Problem (III-B)

It is difficult to directly solve Problem (III-B) w.r.t. 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT and 𝐅BBsubscript𝐅BB{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT, due to the complicated structure222The denominator of AEB(𝐅RF,𝐅BB;𝐱)AEBsubscript𝐅RFsubscript𝐅BB𝐱\text{AEB}({\bf F}_{\text{RF}},{\bf F}_{\text{BB}};{\bf x})AEB ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ; bold_x ) contains quartic terms w.r.t. 𝐅=𝐅RF𝐅BB𝐅subscript𝐅RFsubscript𝐅BB{\bf F}={\bf F}_{\text{RF}}{\bf F}_{\text{BB}}bold_F = bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT. of AEB(𝐅RF,𝐅BB;𝐱)AEBsubscript𝐅RFsubscript𝐅BB𝐱\text{AEB}({\bf F}_{\text{RF}},{\bf F}_{\text{BB}};{\bf x})AEB ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ; bold_x ) and the discrete-phase nature of the entries of 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT. We provide a strategy for solving Problem (III-B) via the following two steps:

  • Step 1: Finding the optimal fully digital precoder 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT as a solution to Problem (III-B).

  • Step 2: Finding a decomposition of 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT to obtain the best approximation 𝐅opt𝐅RF𝐅BBsubscript𝐅optsubscript𝐅RFsubscript𝐅BB{\bf F}_{\text{opt}}\approx{\bf F}_{\text{RF}}{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT ≈ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT in the least-squares (LS) sense.

We now elaborate on these two steps.

Step 1: Based on the fact that the unknown variables 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT and 𝐅BBsubscript𝐅BB{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT appear as a product (i.e., 𝐅RF𝐅BBsubscript𝐅RFsubscript𝐅BB{\bf F}_{\text{RF}}{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT) in both the objective function (7a) and the constraint (7b), for any 𝐱𝒳𝐱𝒳{\bf x}\in\mathcal{X}bold_x ∈ caligraphic_X, we consider the following optimization problem:

min𝐅AEB(𝐅;𝐱)s.t.𝐅F2=P,subscript𝐅AEB𝐅𝐱s.t.superscriptsubscriptnorm𝐅F2𝑃\displaystyle\min_{{\bf F}}~{}\text{AEB}({\bf F};{\bf x})\quad\text{s.t.}~{}\|% {\bf F}\|_{\text{F}}^{2}=P,roman_min start_POSTSUBSCRIPT bold_F end_POSTSUBSCRIPT AEB ( bold_F ; bold_x ) s.t. ∥ bold_F ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_P , (9)

where 𝐅=𝐅RF𝐅BBNTx×M𝐅subscript𝐅RFsubscript𝐅BBsuperscriptsubscript𝑁Tx𝑀{\bf F}={\bf F}_{\text{RF}}{\bf F}_{\text{BB}}\in\mathbb{C}^{N_{\text{Tx}}% \times M}bold_F = bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT × italic_M end_POSTSUPERSCRIPT and we drop the constraint (7c) temporarily. This corresponds to a fully digital precoder optimization [41]. We define 𝐙𝐅𝐅H𝐙superscript𝐅𝐅H{\bf Z}\triangleq{\bf F}{\bf F}^{\textrm{H}}bold_Z ≜ bold_FF start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT, and relax Problem (9) by removing the constraint rank(𝐙)=Mrank𝐙𝑀\text{rank}({\bf Z})=Mrank ( bold_Z ) = italic_M, as

min𝐙,uus.t.[𝐉(𝐙;𝐱)𝐞1𝐞1Tu]𝟎,tr(𝐙)=P,𝐙𝟎,formulae-sequencesucceeds-or-equalssubscript𝐙𝑢𝑢s.t.delimited-[]𝐉𝐙𝐱subscript𝐞1superscriptsubscript𝐞1T𝑢0formulae-sequencetr𝐙𝑃succeeds-or-equals𝐙0\displaystyle\min_{{\bf Z},u}~{}\!u~{}~{}\text{s.t.}\left[\begin{array}[]{cc}{% \bf J}({\bf Z};{\bf x})&{\bf e}_{1}\\ {\bf e}_{1}^{\textrm{T}}&u\end{array}\right]\succeq{\bf 0},~{}\text{tr}({\bf Z% })=P,~{}{\bf Z}\succeq{\bf 0},roman_min start_POSTSUBSCRIPT bold_Z , italic_u end_POSTSUBSCRIPT italic_u s.t. [ start_ARRAY start_ROW start_CELL bold_J ( bold_Z ; bold_x ) end_CELL start_CELL bold_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT end_CELL start_CELL italic_u end_CELL end_ROW end_ARRAY ] ⪰ bold_0 , tr ( bold_Z ) = italic_P , bold_Z ⪰ bold_0 , (12)

where 𝐞1=[1,0,0]Tsubscript𝐞1superscript100T{\bf e}_{1}=[1,0,0]^{\text{T}}bold_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = [ 1 , 0 , 0 ] start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT, tr()tr\text{tr}(\cdot)tr ( ⋅ ) is the trace of a matrix, and 𝐙𝟎succeeds-or-equals𝐙0{\bf Z}\succeq{\bf 0}bold_Z ⪰ bold_0 means that 𝐙𝐙{\bf Z}bold_Z is positive semidefinite. Taking into account the uncertainty of 𝐱𝐱{\bf x}bold_x, i.e., 𝐱𝒳𝐱𝒳{\bf x}\in\mathcal{X}bold_x ∈ caligraphic_X, and by discretizing 𝒳𝒳{\mathcal{X}}caligraphic_X into a uniform grid of G𝐺Gitalic_G points {𝐱g}g=1Gsuperscriptsubscriptsubscript𝐱𝑔𝑔1𝐺\{{\bf x}_{g}\}_{g=1}^{G}{ bold_x start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_g = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_G end_POSTSUPERSCRIPT, a robust design for the above problem can be given as

min𝐙,{ug}subscript𝐙subscript𝑢𝑔\displaystyle\min_{{\bf Z},\{u_{g}\}}roman_min start_POSTSUBSCRIPT bold_Z , { italic_u start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT } end_POSTSUBSCRIPT max𝐱𝒳ugsubscript𝐱𝒳subscript𝑢𝑔\displaystyle~{}\!\max_{{\bf x}\in\mathcal{X}}~{}\!u_{g}roman_max start_POSTSUBSCRIPT bold_x ∈ caligraphic_X end_POSTSUBSCRIPT italic_u start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT (13a)
s.t. [𝐉(𝐙;𝐱)𝐞1𝐞1Tug]𝟎,g=1,2,,G,formulae-sequencesucceeds-or-equalsdelimited-[]𝐉𝐙𝐱subscript𝐞1superscriptsubscript𝐞1Tsubscript𝑢𝑔0𝑔12𝐺\displaystyle\left[\begin{array}[]{cc}{\bf J}({\bf Z};{\bf x})&{\bf e}_{1}\\ {\bf e}_{1}^{\textrm{T}}&u_{g}\end{array}\right]\succeq{\bf 0},~{}\!g=1,2,% \cdots,G,[ start_ARRAY start_ROW start_CELL bold_J ( bold_Z ; bold_x ) end_CELL start_CELL bold_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT end_CELL start_CELL italic_u start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] ⪰ bold_0 , italic_g = 1 , 2 , ⋯ , italic_G , (13d)
tr(𝐙)=P,𝐙𝟎.formulae-sequencetr𝐙𝑃succeeds-or-equals𝐙0\displaystyle~{}\!\text{tr}({\bf Z})=P,~{}{\bf Z}\succeq{\bf 0}.tr ( bold_Z ) = italic_P , bold_Z ⪰ bold_0 . (13e)

Problem (III-C) can be further formulated as

min𝐙,{ug},tsubscript𝐙subscript𝑢𝑔𝑡\displaystyle\min_{{\bf Z},\{u_{g}\},t}roman_min start_POSTSUBSCRIPT bold_Z , { italic_u start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT } , italic_t end_POSTSUBSCRIPT t𝑡\displaystyle~{}\!titalic_t (14a)
s.t. [𝐉(𝐙;𝐱)𝐞1𝐞1Tug]𝟎,g=1,2,,G,formulae-sequencesucceeds-or-equalsdelimited-[]𝐉𝐙𝐱subscript𝐞1superscriptsubscript𝐞1Tsubscript𝑢𝑔0𝑔12𝐺\displaystyle\left[\begin{array}[]{cc}{\bf J}({\bf Z};{\bf x})&{\bf e}_{1}\\ {\bf e}_{1}^{\textrm{T}}&u_{g}\end{array}\right]\succeq{\bf 0},~{}\!g=1,2,% \cdots,G,[ start_ARRAY start_ROW start_CELL bold_J ( bold_Z ; bold_x ) end_CELL start_CELL bold_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT end_CELL start_CELL italic_u start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] ⪰ bold_0 , italic_g = 1 , 2 , ⋯ , italic_G , (14d)
ugt,g=1,2,,G,formulae-sequencesubscript𝑢𝑔𝑡𝑔12𝐺\displaystyle~{}\!u_{g}\leq t,~{}\!g=1,2,\cdots,G,italic_u start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ≤ italic_t , italic_g = 1 , 2 , ⋯ , italic_G , (14e)
tr(𝐙)=P,𝐙𝟎.formulae-sequencetr𝐙𝑃succeeds-or-equals𝐙0\displaystyle~{}\!\text{tr}({\bf Z})=P,~{}{\bf Z}\succeq{\bf 0}.tr ( bold_Z ) = italic_P , bold_Z ⪰ bold_0 . (14f)

It is shown in [41] that a codebook-based approach can be applied to decrease the complexity while achieving a optimal design. Specifically, a predefined codebook consists of directional and derivative beams [41], that is, 𝐅(pre)=[𝐅(direc),𝐅(deriv)]superscript𝐅presuperscript𝐅direcsuperscript𝐅deriv{\bf F}^{(\text{pre})}=[{\bf F}^{(\text{direc})},{\bf F}^{(\text{deriv})}]bold_F start_POSTSUPERSCRIPT ( pre ) end_POSTSUPERSCRIPT = [ bold_F start_POSTSUPERSCRIPT ( direc ) end_POSTSUPERSCRIPT , bold_F start_POSTSUPERSCRIPT ( deriv ) end_POSTSUPERSCRIPT ], where 𝐅(direc)=[𝐚(θ1),𝐚(θ2),,𝐚(θG)]superscript𝐅direc𝐚subscript𝜃1𝐚subscript𝜃2𝐚subscript𝜃𝐺{\bf F}^{(\text{direc})}=[{\bf a}(\theta_{1}),{\bf a}(\theta_{2}),\cdots,{\bf a% }(\theta_{G})]bold_F start_POSTSUPERSCRIPT ( direc ) end_POSTSUPERSCRIPT = [ bold_a ( italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , bold_a ( italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , ⋯ , bold_a ( italic_θ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ] and 𝐅(deriv)=[𝐚˙(θ1),𝐚˙(θ2),,𝐚˙(θG)]superscript𝐅deriv˙𝐚subscript𝜃1˙𝐚subscript𝜃2˙𝐚subscript𝜃𝐺{\bf F}^{(\text{deriv})}=[{\bf{\dot{a}}}(\theta_{1}),{\bf{\dot{a}}}(\theta_{2}% ),\cdots,{\bf{\dot{a}}}(\theta_{G})]bold_F start_POSTSUPERSCRIPT ( deriv ) end_POSTSUPERSCRIPT = [ over˙ start_ARG bold_a end_ARG ( italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , over˙ start_ARG bold_a end_ARG ( italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , ⋯ , over˙ start_ARG bold_a end_ARG ( italic_θ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ], with 𝐚˙(θ)=𝐚(θ)θ˙𝐚𝜃𝐚𝜃𝜃{\bf{\dot{a}}}(\theta)=\frac{\partial{\bf a}(\theta)}{\partial\theta}over˙ start_ARG bold_a end_ARG ( italic_θ ) = divide start_ARG ∂ bold_a ( italic_θ ) end_ARG start_ARG ∂ italic_θ end_ARG and G=M/2𝐺𝑀2G=M/2italic_G = italic_M / 2. With the predefine codebook, we consider the optimal beam power allocation problem in 𝐪=[q1,q2,,qM]T𝐪superscriptsubscript𝑞1subscript𝑞2subscript𝑞𝑀T{\bf q}=[q_{1},q_{2},\cdots,q_{M}]^{\textrm{T}}bold_q = [ italic_q start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_q start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_q start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT [41]:

min𝐪,{ug},tsubscript𝐪subscript𝑢𝑔𝑡\displaystyle\min_{{\bf q},\{u_{g}\},t}roman_min start_POSTSUBSCRIPT bold_q , { italic_u start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT } , italic_t end_POSTSUBSCRIPT t𝑡\displaystyle~{}\!titalic_t (15a)
s.t. [𝐉(𝐙;𝐱)𝐞1𝐞1Tug]𝟎,g=1,2,,G,formulae-sequencesucceeds-or-equalsdelimited-[]𝐉𝐙𝐱subscript𝐞1superscriptsubscript𝐞1Tsubscript𝑢𝑔0𝑔12𝐺\displaystyle\left[\begin{array}[]{cc}{\bf J}({\bf Z};{\bf x})&{\bf e}_{1}\\ {\bf e}_{1}^{\textrm{T}}&u_{g}\end{array}\right]\succeq{\bf 0},~{}\!g=1,2,% \cdots,G,[ start_ARRAY start_ROW start_CELL bold_J ( bold_Z ; bold_x ) end_CELL start_CELL bold_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT end_CELL start_CELL italic_u start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] ⪰ bold_0 , italic_g = 1 , 2 , ⋯ , italic_G , (15d)
ugt,g=1,2,,G,formulae-sequencesubscript𝑢𝑔𝑡𝑔12𝐺\displaystyle~{}\!u_{g}\leq t,~{}\!g=1,2,\cdots,G,italic_u start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ≤ italic_t , italic_g = 1 , 2 , ⋯ , italic_G , (15e)
𝐙=𝐅(pre)diag(𝐪)(𝐅(pre))H,𝐙superscript𝐅prediag𝐪superscriptsuperscript𝐅preH\displaystyle~{}\!{\bf Z}={\bf F}^{(\text{pre})}\text{diag}({\bf q})({\bf F}^{% (\text{pre})})^{\textrm{H}},bold_Z = bold_F start_POSTSUPERSCRIPT ( pre ) end_POSTSUPERSCRIPT diag ( bold_q ) ( bold_F start_POSTSUPERSCRIPT ( pre ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT , (15f)
tr(𝐙)=P,𝐙𝟎,formulae-sequencetr𝐙𝑃succeeds-or-equals𝐙0\displaystyle~{}\!\text{tr}({\bf Z})=P,~{}{\bf Z}\succeq{\bf 0},tr ( bold_Z ) = italic_P , bold_Z ⪰ bold_0 , (15g)

which yields the optimal fully digital precoder as

𝐅opt=𝐅(pre)diag([q1,q2,,qM]).subscript𝐅optsuperscript𝐅prediagsubscript𝑞1subscript𝑞2subscript𝑞𝑀\displaystyle{\bf F}_{\text{opt}}={\bf F}^{(\text{pre})}{\text{diag}}([\sqrt{q% _{1}},\sqrt{q_{2}},\cdots,\sqrt{q}_{M}]).bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT = bold_F start_POSTSUPERSCRIPT ( pre ) end_POSTSUPERSCRIPT diag ( [ square-root start_ARG italic_q start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , square-root start_ARG italic_q start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG , ⋯ , square-root start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ] ) . (16)

Step 2: We decompose 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT into two matrices, i.e., 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT and 𝐅BBsubscript𝐅BB{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT, by taking into account the constraints (7b) and (7c):

min𝐅RF,𝐅BB12𝐅opt𝐅RF𝐅BBF2s.t.(7b)and(7c).subscriptsubscript𝐅RFsubscript𝐅BB12superscriptsubscriptnormsubscript𝐅optsubscript𝐅RFsubscript𝐅BBF2s.t.(7b)and(7c)\displaystyle\min_{{\bf F}_{\text{RF}},{\bf F}_{\text{BB}}}~{}\frac{1}{2}\|{% \bf F}_{\text{opt}}-{\bf F}_{\text{RF}}{\bf F}_{\text{BB}}\|_{\text{F}}^{2}% \quad\text{s.t.}~{}\text{(\ref{constraint1_OptPrecoder_problem})}~{}\text{and}% ~{}\text{(\ref{constraint2_OptPrecoder_problem})}.roman_min start_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT s.t. ( ) and ( ) . (17)

In what follows, we propose an alternating optimization approach for solving Problem (17). To be specific, we first solve 𝐅BBsubscript𝐅BB{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT with a fixed 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT, as

min𝐅BB12𝐅opt𝐅RF𝐅BBF2s.t.(7b).subscriptsubscript𝐅BB12superscriptsubscriptnormsubscript𝐅optsubscript𝐅RFsubscript𝐅BBF2s.t.(7b)\displaystyle\min_{{\bf F}_{\text{BB}}}~{}\frac{1}{2}\|{\bf F}_{\text{opt}}-{% \bf F}_{\text{RF}}{\bf F}_{\text{BB}}\|_{\text{F}}^{2}\quad\text{s.t.}~{}\text% {(\ref{constraint1_OptPrecoder_problem})}.roman_min start_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT s.t. ( ) . (18)

It has a LS closed-form solution as

𝐅BB=P𝐅RF𝐅RF𝐅optF𝐅RF𝐅opt,subscript𝐅BB𝑃subscriptnormsubscript𝐅RFsuperscriptsubscript𝐅RFsubscript𝐅optFsuperscriptsubscript𝐅RFsubscript𝐅opt\displaystyle{\bf F}_{\text{BB}}=\frac{\sqrt{P}}{\|{\bf F}_{\text{RF}}{\bf F}_% {\text{RF}}^{\dagger}{\bf F}_{\text{opt}}\|_{\text{F}}}{\bf F}_{\text{RF}}^{% \dagger}{\bf F}_{\text{opt}},bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT = divide start_ARG square-root start_ARG italic_P end_ARG end_ARG start_ARG ∥ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT end_ARG bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT , (19)

where 𝐅RF=(𝐅RFH𝐅RF)1𝐅RFHsuperscriptsubscript𝐅RFsuperscriptsuperscriptsubscript𝐅RFHsubscript𝐅RF1superscriptsubscript𝐅RFH{\bf F}_{\text{RF}}^{\dagger}=({\bf F}_{\text{RF}}^{\textrm{H}}{\bf F}_{\text{% RF}})^{-1}{\bf F}_{\text{RF}}^{\textrm{H}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT = ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT. Then, we solve 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT with the obtained 𝐅BBsubscript𝐅BB{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT in (19), as

min𝐅RF12𝐅opt𝐅RF𝐅BBF2s.t.(7c).subscriptsubscript𝐅RF12superscriptsubscriptnormsubscript𝐅optsubscript𝐅RFsubscript𝐅BBF2s.t.(7c)\displaystyle\min_{{\bf F}_{\text{RF}}}~{}\frac{1}{2}\|{\bf F}_{\text{opt}}-{% \bf F}_{\text{RF}}{\bf F}_{\text{BB}}\|_{\text{F}}^{2}\quad\text{s.t.}~{}\text% {(\ref{constraint2_OptPrecoder_problem})}.roman_min start_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT s.t. ( ) . (20)

We develop an algorithm based on the ADMM [45] to solve the above problem. To this end, we introduce an auxiliary variable 𝐅~RFNTx×NRFsubscript~𝐅RFsuperscriptsubscript𝑁Txsubscript𝑁RF{\bf{\tilde{F}}}_{\text{RF}}\in\mathbb{C}^{N_{\text{Tx}}\times N_{\text{RF}}}over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, and Problem (20) can be equivalently expressed as

min𝐅RF,𝐅~RF12𝐅opt𝐅~RF𝐅BBF2s.t.(7c)and𝐅~RF=𝐅RF.subscriptsubscript𝐅RFsubscript~𝐅RF12superscriptsubscriptnormsubscript𝐅optsubscript~𝐅RFsubscript𝐅BBF2s.t.(7c)andsubscript~𝐅RFsubscript𝐅RF\displaystyle\min_{{\bf F}_{\text{RF}},{\bf{\tilde{F}}}_{\text{RF}}}~{}\!\!% \frac{1}{2}\|{\bf F}_{\text{opt}}-{\bf{\tilde{F}}}_{\text{RF}}{\bf F}_{\text{% BB}}\|_{\text{F}}^{2}\quad\text{s.t.}~{}\text{(\ref{constraint2_OptPrecoder_% problem})}~{}\text{and}~{}{\bf{\tilde{F}}}_{\text{RF}}={\bf F}_{\text{RF}}.roman_min start_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT s.t. ( ) and over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT . (21)

The corresponding scaled-form augmented Lagrangian function is given as [45]

(𝐅~RF,𝐅RF,\displaystyle\mathcal{L}\Big{(}{\bf{\tilde{F}}}_{\text{RF}},{\bf F}_{\text{RF}},caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , 𝐔)=12𝐅opt𝐅~RF𝐅BBF2\displaystyle~{}\!{\bf U}\Big{)}=\frac{1}{2}\|{\bf F}_{\text{opt}}-{\bf{\tilde% {F}}}_{\text{RF}}{\bf F}_{\text{BB}}\|_{\text{F}}^{2}bold_U ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
+ρ2(𝐅~RF𝐅RF+𝐔F2𝐔F2),𝜌2superscriptsubscriptnormsubscript~𝐅RFsubscript𝐅RF𝐔F2superscriptsubscriptnorm𝐔F2\displaystyle+\frac{\rho}{2}\left(\|{\bf{\tilde{F}}}_{\text{RF}}-{\bf F}_{% \text{RF}}+{\bf U}\|_{\text{F}}^{2}-\|{\bf U}\|_{\text{F}}^{2}\right),+ divide start_ARG italic_ρ end_ARG start_ARG 2 end_ARG ( ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT + bold_U ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ bold_U ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) , (22)

where 𝐔NTx×NRF𝐔superscriptsubscript𝑁Txsubscript𝑁RF{\bf U}\in\mathbb{C}^{N_{\text{Tx}}\times N_{\text{RF}}}bold_U ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the scaled dual variable and ρ>0𝜌0\rho>0italic_ρ > 0 is the augmented Lagrangian parameter. Parameter ρ𝜌\rhoitalic_ρ can be set based on the proposed convergence analyses in Section IV-B. The primal, auxiliary, and dual variables are updated as:

𝐅RF(k+1)superscriptsubscript𝐅RF𝑘1\displaystyle{\bf F}_{\text{RF}}^{(\!k\!+\!1\!)}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT =argmin[𝐅RF]i,j(𝐅~RF(k),𝐅RF,𝐔(k))absentsubscriptargminsubscriptdelimited-[]subscript𝐅RF𝑖𝑗superscriptsubscript~𝐅RF𝑘subscript𝐅RFsuperscript𝐔𝑘\displaystyle=\operatorname*{arg\,min}_{\left[{\bf F}_{\text{RF}}\right]_{i,j}% \in\mathcal{F}}~{}\mathcal{L}({\bf{\tilde{F}}}_{\text{RF}}^{(\!k\!)},{\bf F}_{% \text{RF}},{\bf U}^{(\!k\!)})= start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT [ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT ∈ caligraphic_F end_POSTSUBSCRIPT caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT )
=1NTxeȷ𝒬((𝐅~RF(k)+𝐔(k))),absent1subscript𝑁Txsuperscript𝑒italic-ȷ𝒬superscriptsubscript~𝐅RF𝑘superscript𝐔𝑘\displaystyle=\frac{1}{\sqrt{N_{\text{Tx}}}}e^{\jmath\mathcal{Q}\left(\angle({% \bf{\tilde{F}}}_{\text{RF}}^{(\!k\!)}+{\bf U}^{(\!k\!)})\right)},= divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT end_ARG end_ARG italic_e start_POSTSUPERSCRIPT italic_ȷ caligraphic_Q ( ∠ ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT + bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) ) end_POSTSUPERSCRIPT , (23a)
𝐅~RF(k+1)superscriptsubscript~𝐅RF𝑘1\displaystyle{\bf{\tilde{F}}}_{\text{RF}}^{(\!k\!+\!1\!)}over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT =argmin𝐅~RF(𝐅~RF,𝐅RF(k+1),𝐔(k))absentsubscriptargminsubscript~𝐅RFsubscript~𝐅RFsuperscriptsubscript𝐅RF𝑘1superscript𝐔𝑘\displaystyle=\operatorname*{arg\,min}_{{\bf{\tilde{F}}}_{\text{RF}}}~{}% \mathcal{L}({\bf{\tilde{F}}}_{\text{RF}},{\bf F}_{\text{RF}}^{(\!k\!+\!1\!)},{% \bf U}^{(\!k\!)})= start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUBSCRIPT caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT )
=[𝐅opt𝐅BBH+ρ(𝐅RF(k+1)𝐔(k))](𝐅BB𝐅BBH+ρ𝐈)1,absentdelimited-[]subscript𝐅optsuperscriptsubscript𝐅BBH𝜌superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘superscriptsubscript𝐅BBsuperscriptsubscript𝐅BBH𝜌𝐈1\displaystyle=[{\bf F}_{\text{opt}}{\bf F}_{\text{BB}}^{\textrm{H}}\!+\!\rho({% \bf F}_{\text{RF}}^{(\!k\!+\!1\!)}\!\!-\!{\bf U}^{(\!k\!)}\!)]({\bf F}_{\text{% BB}}{\bf F}_{\text{BB}}^{\textrm{H}}\!+\!\rho{\bf I})^{-1},= [ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT + italic_ρ ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) ] ( bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT + italic_ρ bold_I ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT , (23b)
𝐔(k+1)superscript𝐔𝑘1\displaystyle{\bf U}^{(\!k\!+\!1\!)}bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT =𝐔(k)+𝐅~RF(k+1)𝐅RF(k+1).absentsuperscript𝐔𝑘superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1\displaystyle={\bf U}^{(\!k\!)}+{\bf{\tilde{F}}}_{\text{RF}}^{(\!k\!+\!1\!)}-{% \bf F}_{\text{RF}}^{(\!k\!+\!1\!)}.= bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT + over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT . (23c)

In (III-C), \angle\cdot∠ ⋅ denotes the angle of its argument in an element-wise manner, and 𝒬()𝒬{\mathcal{Q}}(\cdot)caligraphic_Q ( ⋅ ) stands for the quantization function rounding its argument to the available phases of the phase shifters ((\bigl{(}(i.e., 2π2B×{0,1,,2B1}2𝜋superscript2𝐵01superscript2𝐵1\frac{2\pi}{2^{B}}\!\times\!\left\{0,1,\cdots,2^{B}\!-\!1\right\}divide start_ARG 2 italic_π end_ARG start_ARG 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT end_ARG × { 0 , 1 , ⋯ , 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT - 1 }))\bigr{)}).

The proposed algorithm for solving Problem (17) is referred to as AltOpt-LS-ADMM, and summarized in Algorithm 1, where superscript (i)superscript𝑖\cdot^{(\!i\!)}⋅ start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT denotes the corresponding variable at the i𝑖iitalic_i-th outer iteration, superscript (k)superscript𝑘\cdot^{(\!k\!)}⋅ start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT denotes the corresponding variable at the k𝑘kitalic_k-th inner (i.e., ADMM) iteration, and Imaxsubscript𝐼maxI_{\text{max}}italic_I start_POSTSUBSCRIPT max end_POSTSUBSCRIPT and kmaxsubscript𝑘maxk_{\text{max}}italic_k start_POSTSUBSCRIPT max end_POSTSUBSCRIPT are the maximal numbers of the outer and the inner loops, respectively. Besides, 𝐅RF(init)superscriptsubscript𝐅RFinit{\bf F}_{\text{RF}}^{(\text{init})}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( init ) end_POSTSUPERSCRIPT and 𝐅~RF(init)superscriptsubscript~𝐅RFinit{\bf{\tilde{F}}}_{\text{RF}}^{(\text{init})}over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( init ) end_POSTSUPERSCRIPT are obtained by randomly selecting from the feasible set (8), the update of ρ𝜌\rhoitalic_ρ in Line 3 comes from (26) in Section IV-B, and 𝐎𝐎{\bf O}bold_O in Line 4 is an all-zeros matrix.

Algorithm 1 AltOpt-LS-ADMM for solving Problem (17)

Input    ​: 𝐅optNTx×Msubscript𝐅optsuperscriptsubscript𝑁Tx𝑀{\bf F}_{\text{opt}}\in\mathbb{C}^{N_{\text{Tx}}\times M}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT × italic_M end_POSTSUPERSCRIPT, Imaxsubscript𝐼maxI_{\text{max}}italic_I start_POSTSUBSCRIPT max end_POSTSUBSCRIPT, kmaxsubscript𝑘maxk_{\text{max}}italic_k start_POSTSUBSCRIPT max end_POSTSUBSCRIPT
Output  ​: 𝐅RFNTx×NRFsubscript𝐅RFsuperscriptsubscript𝑁Txsubscript𝑁RF{\bf F}_{\text{RF}}\in\mathbb{C}^{N_{\text{Tx}}\times N_{\text{RF}}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, 𝐅BBNRF×Msubscript𝐅BBsuperscriptsubscript𝑁RF𝑀{\bf F}_{\text{BB}}\in\mathbb{C}^{N_{\text{RF}}\times M}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT × italic_M end_POSTSUPERSCRIPT
Initialize: 𝐅RF(0)=𝐅RF(init)superscriptsubscript𝐅RF0superscriptsubscript𝐅RFinit{\bf F}_{\text{RF}}^{(\!0\!)}={\bf F}_{\text{RF}}^{(\text{init})}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( init ) end_POSTSUPERSCRIPT, i=0𝑖0i=0italic_i = 0

1:while i<Imax𝑖subscript𝐼maxi<I_{\text{max}}italic_i < italic_I start_POSTSUBSCRIPT max end_POSTSUBSCRIPT do
2:    𝐅BB(i+1)=P𝐅RF(i)𝐅RF(i)𝐅optF𝐅RF(i)𝐅optsuperscriptsubscript𝐅BB𝑖1𝑃subscriptnormsuperscriptsubscript𝐅RF𝑖superscriptsubscript𝐅RF𝑖subscript𝐅optFsuperscriptsubscript𝐅RF𝑖subscript𝐅opt{\bf F}_{\text{BB}}^{(\!i\!+\!1\!)}=\frac{\sqrt{P}}{\|{\bf F}_{\text{RF}}^{(\!% i\!)}{\bf F}_{\text{RF}}^{(\!i\!)\dagger}{\bf F}_{\text{opt}}\|_{\text{F}}}{% \bf F}_{\text{RF}}^{(\!i\!)\dagger}{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT = divide start_ARG square-root start_ARG italic_P end_ARG end_ARG start_ARG ∥ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) † end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT end_ARG bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) † end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT
3:    ρ=max{2𝐅BB(i+1)(𝐅BB(i+1))HF,𝐅BB(i+1)F2}𝜌maxconditional-set2evaluated-atsuperscriptsubscript𝐅BB𝑖1superscriptsuperscriptsubscript𝐅BB𝑖1HFsuperscriptsubscriptnormsuperscriptsubscript𝐅BB𝑖1F2\rho=\mathrm{max}\left\{\sqrt{2}\|{\bf F}_{\text{{BB}}}^{(\!i+1\!)}({\bf F}_{% \text{{BB}}}^{(\!i+1\!)})^{\textrm{{H}}}\|_{\text{{F}}}~{}\!,\|{\bf F}_{\text{% {BB}}}^{(\!i+1\!)}\|_{\text{{F}}}^{2}\right\}italic_ρ = roman_max { square-root start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT ( bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT , ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT }
4:    𝐅~RF(0)=𝐅~RF(init)superscriptsubscript~𝐅RF0superscriptsubscript~𝐅RFinit{\bf{\tilde{F}}}_{\text{RF}}^{(\!0\!)}\!=\!{\bf{\tilde{F}}}_{\text{RF}}^{(% \text{init})}over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( init ) end_POSTSUPERSCRIPT​, ​ 𝐔(0)=𝐎superscript𝐔0𝐎{\bf U}^{(\!0\!)}\!=\!{\bf O}bold_U start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = bold_O, ​ k=0𝑘0k\!=\!0italic_k = 0 \triangleright ADMM initialization
5:     repeat
6:       𝐅RF(k+1)superscriptsubscript𝐅RF𝑘1absent{\bf F}_{\text{RF}}^{(\!k+1\!)}\leftarrowbold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ← (III-C)
7:       𝐅~RF(k+1)superscriptsubscript~𝐅RF𝑘1absent{\bf{\tilde{F}}}_{\text{RF}}^{(\!k+1\!)}\leftarrowover~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ← (III-C)
8:       𝐔(k+1)superscript𝐔𝑘1absent{\bf U}^{(\!k+1\!)}\leftarrowbold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ← (23c)
9:       ​ k=k+1𝑘𝑘1k=k+1italic_k = italic_k + 1
10:     until k=kmax𝑘subscript𝑘maxk=k_{\text{max}}italic_k = italic_k start_POSTSUBSCRIPT max end_POSTSUBSCRIPT
11:    𝐅RF(i+1)=1NTxeȷ𝒬(𝐅~RF(k))superscriptsubscript𝐅RF𝑖11subscript𝑁Txsuperscript𝑒italic-ȷ𝒬superscriptsubscript~𝐅RF𝑘{\bf F}_{\text{RF}}^{(\!i+1\!)}=\frac{1}{\sqrt{N}_{\text{Tx}}}e^{\jmath% \mathcal{Q}(\angle{\bf{\tilde{F}}}_{\text{RF}}^{(\!k\!)})}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_N end_ARG start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT end_ARG italic_e start_POSTSUPERSCRIPT italic_ȷ caligraphic_Q ( ∠ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) end_POSTSUPERSCRIPT
12:    i=i+1𝑖𝑖1i=i+1italic_i = italic_i + 1
13:end while
14:𝐅RF=𝐅RF(i)subscript𝐅RFsuperscriptsubscript𝐅RF𝑖{\bf F}_{\text{RF}}={\bf F}_{\text{RF}}^{(\!i\!)}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT and 𝐅BB=𝐅BB(i)subscript𝐅BBsuperscriptsubscript𝐅BB𝑖{\bf F}_{\text{BB}}={\bf F}_{\text{BB}}^{(\!i\!)}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT = bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT
Remark.

It is worth mentioning that the proposed two-step strategy can find approximate (but not exact) solutions to the original optimal precoder design problem, i.e., Problem (III-B).

III-D Computational Complexity Analysis

The computational cost of the proposed AltOpt-LS-ADMM algorithm mainly comes from the pseudo-inverse operation and the multiplication operation in Line 2 and the inverse operation and the multiplication operation in Line 7, which incur the complexities 𝒪(NRF2NTx)𝒪superscriptsubscript𝑁RF2subscript𝑁Tx\mathcal{O}(N_{\text{RF}}^{2}N_{\text{Tx}})caligraphic_O ( italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT ), 𝒪(NRFNTxM)𝒪subscript𝑁RFsubscript𝑁Tx𝑀\mathcal{O}(N_{\text{RF}}N_{\text{Tx}}M)caligraphic_O ( italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT italic_M ), 𝒪(NRF3)𝒪superscriptsubscript𝑁RF3\mathcal{O}(N_{\text{RF}}^{3})caligraphic_O ( italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ), and 𝒪(NRFNTxM)𝒪subscript𝑁RFsubscript𝑁Tx𝑀\mathcal{O}(N_{\text{RF}}N_{\text{Tx}}M)caligraphic_O ( italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT italic_M ), respectively. Since we can compute the inverse operation and the multiplication operation in Line 7 outside the ADMM iteration and then use their results for all inner iterations, the total computational cost of the proposed AltOpt-LS-ADMM algorithm is 𝒪(kmax(NRF2NTx+2NRFNTxM))𝒪subscript𝑘maxsuperscriptsubscript𝑁RF2subscript𝑁Tx2subscript𝑁RFsubscript𝑁Tx𝑀\mathcal{O}(k_{\text{max}}(N_{\text{RF}}^{2}N_{\text{Tx}}+2N_{\text{RF}}N_{% \text{Tx}}M))caligraphic_O ( italic_k start_POSTSUBSCRIPT max end_POSTSUBSCRIPT ( italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT + 2 italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT italic_M ) ).

IV Analysis of Error Bounds and Convergence

IV-A Analysis of Quantization Error Bound

In this subsection, we analyse the quantization error bound in the proposed ADMM algorithm (i.e., inner iteration of Algorithm 1) resulting from the quantization operation in Line 6 of Algorithm 1. We first denote 𝐅^RFsubscript^𝐅RF{\bf{\widehat{F}}}_{\text{RF}}over^ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT and 𝐅RFsuperscriptsubscript𝐅RF{\bf F}_{\text{RF}}^{\star}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT as the RF precoder with (i.e., B<𝐵B<\inftyitalic_B < ∞) and without (i.e., B=𝐵B=\inftyitalic_B = ∞) quantization, respectively. Then, the relation of these two matrices is given as

𝐅^RF=𝚽𝐅RF,subscript^𝐅RFdirect-product𝚽superscriptsubscript𝐅RF\displaystyle{\bf{\widehat{F}}}_{\text{RF}}={\bf\Phi}\odot{\bf F}_{\text{RF}}^% {\star},over^ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = bold_Φ ⊙ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT , (24)

where direct-product\odot denotes the element-wise product and 𝚽NTx×NRF𝚽superscriptsubscript𝑁Txsubscript𝑁RF{\bf\Phi}\in\mathbb{C}^{N_{\text{Tx}}\times N_{\text{RF}}}bold_Φ ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT × italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUPERSCRIPT is the quantization error matrix. Moreover, the elements of 𝚽𝚽{\bf\Phi}bold_Φ can be formulated as [𝚽]ij=eȷϕijsubscriptdelimited-[]𝚽𝑖𝑗superscript𝑒italic-ȷsubscriptitalic-ϕ𝑖𝑗[{\bf\Phi}]_{ij}\!=\!e^{\jmath\phi_{ij}}[ bold_Φ ] start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT italic_ȷ italic_ϕ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, where 0|ϕij|π/2B0subscriptitalic-ϕ𝑖𝑗𝜋superscript2𝐵0\!\leq\!|\phi_{ij}|\!\leq\!\pi/2^{B}0 ≤ | italic_ϕ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT | ≤ italic_π / 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT for all 1iNTx1𝑖subscript𝑁Tx1\leq i\leq N_{\text{Tx}}1 ≤ italic_i ≤ italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT and 1jNRF1𝑗subscript𝑁RF1\leq j\leq N_{\text{RF}}1 ≤ italic_j ≤ italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT. Therefore, the quantization error can be calculated as

𝐅opt𝐅^RF𝐅BBF𝐅opt𝐅RF𝐅BBFsubscriptnormsubscript𝐅optsubscript^𝐅RFsubscript𝐅BBFsubscriptnormsubscript𝐅optsuperscriptsubscript𝐅RFsubscript𝐅BBF\displaystyle\|{\bf F}_{\text{opt}}-{\bf{\widehat{F}}}_{\text{RF}}{\bf F}_{% \text{BB}}\|_{\text{F}}-\|{\bf F}_{\text{opt}}-{\bf F}_{\text{RF}}^{\star}{\bf F% }_{\text{BB}}\|_{\text{F}}∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over^ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT - ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT
=\displaystyle== 𝐅opt(𝚽𝐅RF)𝐅BBF𝐅opt𝐅RF𝐅BBFsubscriptnormsubscript𝐅optdirect-product𝚽superscriptsubscript𝐅RFsubscript𝐅BBFsubscriptnormsubscript𝐅optsuperscriptsubscript𝐅RFsubscript𝐅BBF\displaystyle~{}\!\|{\bf F}_{\text{opt}}-({\bf\Phi}\odot{\bf F}_{\text{RF}}^{% \star}){\bf F}_{\text{BB}}\|_{\text{F}}-\|{\bf F}_{\text{opt}}-{\bf F}_{\text{% RF}}^{\star}{\bf F}_{\text{BB}}\|_{\text{F}}∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - ( bold_Φ ⊙ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT - ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT
\displaystyle\leq (𝐅RF(𝚽𝐅RF))𝐅BBFsubscriptnormsuperscriptsubscript𝐅RFdirect-product𝚽superscriptsubscript𝐅RFsubscript𝐅BBF\displaystyle~{}\!\|({\bf F}_{\text{RF}}^{\star}-({\bf\Phi}\odot{\bf F}_{\text% {RF}}^{\star})){\bf F}_{\text{BB}}\|_{\text{F}}∥ ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT - ( bold_Φ ⊙ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ) ) bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT (25a)
=\displaystyle== [(𝟏𝚽)𝐅RF]𝐅BBFsubscriptnormdelimited-[]direct-product1𝚽superscriptsubscript𝐅RFsubscript𝐅BBF\displaystyle~{}\!\|[({\bf 1}-{\bf\Phi})\odot{\bf F}_{\text{RF}}^{\star}]{\bf F% }_{\text{BB}}\|_{\text{F}}∥ [ ( bold_1 - bold_Φ ) ⊙ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ] bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT
\displaystyle\leq (𝟏𝚽)𝐅RFF𝐅BBFsubscriptnormdirect-product1𝚽superscriptsubscript𝐅RFFsubscriptnormsubscript𝐅BBF\displaystyle~{}\!\|({\bf 1}-{\bf\Phi})\odot{\bf F}_{\text{RF}}^{\star}\|_{% \text{F}}\|{\bf F}_{\text{BB}}\|_{\text{F}}∥ ( bold_1 - bold_Φ ) ⊙ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT (25b)
\displaystyle\leq 𝟏𝚽F𝐅RFF𝐅BBFsubscriptnorm1𝚽Fsubscriptnormsuperscriptsubscript𝐅RFFsubscriptnormsubscript𝐅BBF\displaystyle~{}\!\|{\bf 1}-{\bf\Phi}\|_{\text{F}}\|{\bf F}_{\text{RF}}^{\star% }\|_{\text{F}}\|{\bf F}_{\text{BB}}\|_{\text{F}}∥ bold_1 - bold_Φ ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ∥ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT (25c)
\displaystyle\leq |1eȷπ/2B|NTxNRF𝐅RFF𝐅BBF,1superscript𝑒italic-ȷ𝜋superscript2𝐵subscript𝑁Txsubscript𝑁RFsubscriptnormsuperscriptsubscript𝐅RFFsubscriptnormsubscript𝐅BBF\displaystyle~{}\!\left|1-e^{\jmath{\pi}/{2^{B}}}\right|\sqrt{N_{\text{Tx}}N_{% \text{RF}}}\|{\bf F}_{\text{RF}}^{\star}\|_{\text{F}}\|{\bf F}_{\text{BB}}\|_{% \text{F}},| 1 - italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π / 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT | square-root start_ARG italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_ARG ∥ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT , (25d)

where 𝟏1{\bf 1}bold_1 is the all-ones matrix of appropriate size, in (25a) we used the triangle inequality, in (25b) we employed the fact that 𝐌𝐍F𝐌F𝐍Fsubscriptnorm𝐌𝐍Fsubscriptnorm𝐌Fsubscriptnorm𝐍F\|{\bf M}{\bf N}\|_{\text{F}}\leq\|{\bf M}\|_{\text{F}}\|{\bf N}\|_{\text{F}}∥ bold_MN ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ≤ ∥ bold_M ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ∥ bold_N ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT holds for any matrices 𝐌𝐌{\bf M}bold_M and 𝐍𝐍{\bf N}bold_N of appropriate sizes, in (25c) we utilized the Cauchy-Schwarz inequality, and in (25d) we used the following inequality:

𝟏𝚽F=subscriptnorm1𝚽Fabsent\displaystyle\|{\bf 1}-{\bf\Phi}\|_{\text{F}}=∥ bold_1 - bold_Φ ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT = i=1NTxj=1NRF|1eȷϕij|2superscriptsubscript𝑖1subscript𝑁Txsuperscriptsubscript𝑗1subscript𝑁RFsuperscript1superscript𝑒italic-ȷsubscriptitalic-ϕ𝑖𝑗2\displaystyle~{}\!\sqrt{\sum_{i=1}^{N_{\text{Tx}}}\sum_{j=1}^{N_{\text{RF}}}% \left|1-e^{\jmath\phi_{ij}}\right|^{2}}square-root start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_e start_POSTSUPERSCRIPT italic_ȷ italic_ϕ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG
\displaystyle\leq i=1NTxj=1NRF|1eȷπ/2B|2superscriptsubscript𝑖1subscript𝑁Txsuperscriptsubscript𝑗1subscript𝑁RFsuperscript1superscript𝑒italic-ȷ𝜋superscript2𝐵2\displaystyle~{}\!\sqrt{\sum_{i=1}^{N_{\text{Tx}}}\sum_{j=1}^{N_{\text{RF}}}% \left|1-e^{\jmath\pi/2^{B}}\right|^{2}}square-root start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | 1 - italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π / 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG
=\displaystyle== |1eȷπ/2B|NTxNRF.1superscript𝑒italic-ȷ𝜋superscript2𝐵subscript𝑁Txsubscript𝑁RF\displaystyle~{}\!\left|1-e^{\jmath{\pi}/{2^{B}}}\right|\sqrt{N_{\text{Tx}}N_{% \text{RF}}}~{}\!.| 1 - italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π / 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT | square-root start_ARG italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_ARG .

Note that NTxNRF𝐅RFF𝐅BBFsubscript𝑁Txsubscript𝑁RFsubscriptnormsuperscriptsubscript𝐅RFFsubscriptnormsubscript𝐅BBF\sqrt{N_{\text{Tx}}N_{\text{RF}}}\|{\bf F}_{\text{RF}}^{\star}\|_{\text{F}}\|{% \bf F}_{\text{BB}}\|_{\text{F}}square-root start_ARG italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_ARG ∥ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT in (25d) is a constant w.r.t. the number of quantization bits of phase shifters. For notational simplicity, we define CNTxNRF𝐅RFF𝐅BBF𝐶subscript𝑁Txsubscript𝑁RFsubscriptnormsuperscriptsubscript𝐅RFFsubscriptnormsubscript𝐅BBFC\triangleq\sqrt{N_{\text{Tx}}N_{\text{RF}}}\|{\bf F}_{\text{RF}}^{\star}\|_{% \text{F}}\|{\bf F}_{\text{BB}}\|_{\text{F}}italic_C ≜ square-root start_ARG italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_ARG ∥ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT, and rewrite the quantization error as

𝐅opt𝐅^RF𝐅BBF𝐅opt𝐅RF𝐅BBFC|1eȷπ/2B|.subscriptnormsubscript𝐅optsubscript^𝐅RFsubscript𝐅BBFsubscriptnormsubscript𝐅optsuperscriptsubscript𝐅RFsubscript𝐅BBF𝐶1superscript𝑒italic-ȷ𝜋superscript2𝐵\displaystyle\|{\bf F}_{\text{opt}}-{\bf{\widehat{F}}}_{\text{RF}}{\bf F}_{% \text{BB}}\|_{\text{F}}-\|{\bf F}_{\text{opt}}-{\bf F}_{\text{RF}}^{\star}{\bf F% }_{\text{BB}}\|_{\text{F}}\leq C\left|1-e^{\jmath{\pi}/{2^{B}}}\right|.∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over^ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT - ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ≤ italic_C | 1 - italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π / 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT | .

The values of |1eȷπ/2B|1superscript𝑒italic-ȷ𝜋superscript2𝐵\left|1-e^{\jmath{\pi}/{2^{B}}}\right|| 1 - italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π / 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT | with different numbers of quantization bits are presented in Table I. It is seen from Table I that when B5𝐵5B\geq 5italic_B ≥ 5, the quantization upper bound decreases by more than 10 times compared to B=1𝐵1B=1italic_B = 1, suggesting that B=5𝐵5B=5italic_B = 5 can be sufficient to approach the performance of infinite-resolution phase shifters. In order to illustrate the impact of quantization bits B𝐵Bitalic_B on the decomposition error upper bound (DecpUB), we define

DecpUB𝐅opt𝐅RF𝐅BBF+C|1eȷπ/2B|,DecpUBsubscriptnormsubscript𝐅optsuperscriptsubscript𝐅RFsubscript𝐅BBF𝐶1superscript𝑒italic-ȷ𝜋superscript2𝐵\displaystyle{\text{DecpUB}}\triangleq\|{\bf F}_{\text{opt}}-{\bf F}_{\text{RF% }}^{\star}{\bf F}_{\text{BB}}\|_{\text{F}}+C\left|1-e^{\jmath{\pi}/{2^{B}}}% \right|,DecpUB ≜ ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⋆ end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT + italic_C | 1 - italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π / 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT | ,

and then plot it w.r.t. the number of quantization bits in Fig. 2, where the simulation parameters are NTx=16subscript𝑁Tx16N_{\text{Tx}}=16italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT = 16, M=20𝑀20M=20italic_M = 20, and P=10𝑃10P=10italic_P = 10 dBm. It can be observed from Fig. 2 that when B3𝐵3B\leq 3italic_B ≤ 3 the DecpUB decrease sharply, and when B5𝐵5B\geq 5italic_B ≥ 5 the slopes of the DecpUB are approximately equal to 0. This leads to the same conclusion as the one drawn from Table I, that is, B=5𝐵5B=5italic_B = 5 is sufficient to approach the performance of infinite-resolution phase shifters. This will be further verified through simulations in Section V.

IV-B Convergence Analysis

In this subsection, we analyse the convergence behaviors of the proposed ADMM algorithm (i.e., the inner iteration of Algorithm 1) and AltOpt-LS-ADMM algorithm (i.e., the outer iteration of Algorithm 1), which are stated in the following two theorems, respectively.

Theorem 1.

The augmented Lagrangian function value sequence {(𝐅~RF(k),𝐅RF(k),𝐔(k))|k=0,1,2,}conditional-setsuperscriptsubscriptnormal-~𝐅RF𝑘superscriptsubscript𝐅RF𝑘superscript𝐔𝑘𝑘012normal-⋯\left\{\mathcal{L}\left({\bf{\tilde{F}}}_{\text{\emph{RF}}}^{(k)},{\bf F}_{% \text{\emph{RF}}}^{(k)},{\bf U}^{(k)}\right)\Big{|}k=0,1,2,\cdots\right\}{ caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) | italic_k = 0 , 1 , 2 , ⋯ } produced by the proposed ADMM algorithm converges if

ρmax{2𝐅BB𝐅BBHF,𝐅BBF2}.𝜌maxconditional-set2evaluated-atsubscript𝐅BBsuperscriptsubscript𝐅BBHFsuperscriptsubscriptnormsubscript𝐅BBF2\displaystyle\rho\geq\mathrm{max}\left\{\sqrt{2}\|{\bf F}_{\text{\emph{BB}}}{% \bf F}_{\text{\emph{BB}}}^{\textrm{\emph{H}}}\|_{\text{\emph{F}}},\|{\bf F}_{% \text{\emph{BB}}}\|_{\text{\emph{F}}}^{2}\right\}.italic_ρ ≥ roman_max { square-root start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT , ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT } . (26)

Furthermore, as knormal-→𝑘k\to\inftyitalic_k → ∞, we have 𝐅RF(k+1)=𝐅RF(k)superscriptsubscript𝐅RF𝑘1superscriptsubscript𝐅RF𝑘{\bf F}_{\text{\emph{RF}}}^{(k+1)}={\bf F}_{\text{\emph{RF}}}^{(k)}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT = bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT, 𝐅~RF(k+1)=𝐅~RF(k)superscriptsubscriptnormal-~𝐅RF𝑘1superscriptsubscriptnormal-~𝐅RF𝑘{\bf{\tilde{F}}}_{\text{\emph{RF}}}^{(k+1)}={\bf{\tilde{F}}}_{\text{\emph{RF}}% }^{(k)}over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT = over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT, 𝐔(k+1)=𝐔(k)superscript𝐔𝑘1superscript𝐔𝑘{\bf U}^{(k+1)}={\bf U}^{(k)}bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT = bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT, and 𝐅RF(k)=𝐅~RF(k)superscriptsubscript𝐅RF𝑘superscriptsubscriptnormal-~𝐅RF𝑘{\bf F}_{\text{\emph{RF}}}^{(k)}={\bf{\tilde{F}}}_{\text{\emph{RF}}}^{(k)}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT = over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT; and the point sequence {(𝐅~RF(k),𝐅RF(k),𝐔(k))}superscriptsubscriptnormal-~𝐅RF𝑘superscriptsubscript𝐅RF𝑘superscript𝐔𝑘\left\{\left({\bf{\tilde{F}}}_{\text{\emph{RF}}}^{(k)},{\bf F}_{\text{\emph{RF% }}}^{(k)},{\bf U}^{(k)}\right)\right\}{ ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) } is a Cauchy sequence and it converges to a fixed point after a finite number of iterations.

Proof.

See Appendix C. ∎

Theorem 2.

If (26) holds, the sequence {𝐅opt𝐅RF(i)𝐅BB(i)F}subscriptnormsubscript𝐅optsuperscriptsubscript𝐅RF𝑖superscriptsubscript𝐅BB𝑖F\left\{\|{\bf F}_{\text{\emph{opt}}}-{\bf F}_{\text{\emph{RF}}}^{(i)}{\bf F}_{% \text{\emph{BB}}}^{(i)}\|_{\text{\emph{F}}}\right\}{ ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT } generated by the proposed AltOpt-LS-ADMM algorithm converges.

Proof.

See Appendix D. ∎

Theorem 1 asserts that as long as the augmented Lagrangian parameter ρ𝜌\rhoitalic_ρ is large enough (see (26)), the proposed ADMM algorithm is convergent. Additionally, Theorem 2 establishes that the proposed AltOpt-LS-ADMM can generate a convergent sequence of cost function values, as long as (26) holds.

TABLE I: Quantization error bound with different B𝐵Bitalic_B
B𝐵Bitalic_B 1 2 3 4 5 6
|1eȷπ/2B|1superscript𝑒italic-ȷ𝜋superscript2𝐵\left|1-e^{\jmath{\pi}/{2^{B}}}\right|| 1 - italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π / 2 start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT | 1.4142 0.7654 0.3902 0.1960 0.0981 0.0491

Refer to caption

Figure 2: Decomposition error upper bound versus number of quantization bits of phase shifter.

V Numerical Results

V-A Scenario, Performance Metric, and Benchmark

In this section, we conduct simulations to verify the performance of the proposed AltOpt-LS-ADMM algorithm. Two scenarios are considered as follows:

  • Scenario I: We randomly generate a digital precoder 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT, and our performance metric is the decomposition error (DecpErr), defined as: 𝐅opt𝐅RF𝐅BBF/𝐅optFsubscriptnormsubscript𝐅optsubscript𝐅RFsubscript𝐅BBFsubscriptnormsubscript𝐅optF{\|{\bf F}_{\text{opt}}-{\bf F}_{\text{RF}}{\bf F}_{\text{BB}}\|_{\text{F}}}/{% \|{\bf F}_{\text{opt}}\|_{\text{F}}}∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT / ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT. The simulation parameters are summarized in Table II.

  • Scenario II: We obtain an optimal digital precoder 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT as introduced in Step 1 in Section III-C, and our performance metric is the AEB in (6). The simulation parameters are summarized in Table III.

TABLE II: Simulation Parameters in Scenario I
Parameter Scenario I
Imaxsubscript𝐼maxI_{\text{max}}italic_I start_POSTSUBSCRIPT max end_POSTSUBSCRIPT 10101010
kmaxsubscript𝑘maxk_{\text{max}}italic_k start_POSTSUBSCRIPT max end_POSTSUBSCRIPT 50505050
BS Transmit Power, P𝑃Pitalic_P 10101010 dBm
Pilot Length, M𝑀Mitalic_M 20202020
NTxsubscript𝑁TxN_{\text{Tx}}italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT 16161616
NRFsubscript𝑁RFN_{\text{RF}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT 1,2,,1612161,2,\cdots,161 , 2 , ⋯ , 16 (Figs. 3 and 4);
1,2,4,7,10,13,1612471013161,2,4,7,10,13,161 , 2 , 4 , 7 , 10 , 13 , 16 (Fig. 5);
8888 (Fig. 6)
B𝐵Bitalic_B 1,2,3,4,5,123451,2,3,4,5,\infty1 , 2 , 3 , 4 , 5 , ∞ (Fig. 3);
2 (Fig. 4);
1,2,,1612161,2,\cdots,161 , 2 , ⋯ , 16 (Figs. 5 and 6)
TABLE III: Simulation Parameters in Scenario II
Parameter Scenario II
Imaxsubscript𝐼maxI_{\text{max}}italic_I start_POSTSUBSCRIPT max end_POSTSUBSCRIPT 10101010
kmaxsubscript𝑘maxk_{\text{max}}italic_k start_POSTSUBSCRIPT max end_POSTSUBSCRIPT 50505050
BS Transmit Power, P𝑃Pitalic_P 10101010 dBm
Signal-to-Noise Ratio (SNR) 10101010 dB
Pilot Length, M𝑀Mitalic_M 20202020
NTxsubscript𝑁TxN_{\text{Tx}}italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT 4,6,,3046304,6,\cdots,304 , 6 , ⋯ , 30 (Figs. 7 and 11);
20202020 (Figs. 8, 9, and 10)
NRFsubscript𝑁RFN_{\text{RF}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT 1,2,,2012201,2,\cdots,201 , 2 , ⋯ , 20 (Figs. 8 and 11);
4444 (Figs. 7, 9, and 10)
B𝐵Bitalic_B 1,2,,1612161,2,\cdots,161 , 2 , ⋯ , 16 (Fig. 9);
\infty (Figs. 7, 8, and 10)
AoD 80,75,,80superscript80superscript75superscript80-80^{\circ},-75^{\circ},\cdots,80^{\circ}- 80 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , - 75 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , ⋯ , 80 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT (Fig. 10);
0superscript00^{\circ}0 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT (Figs. 7, 8, and 9)

We compare the proposed method with the following methods:

  • Alt-Babai [18]: alternating optimization + the Babai algorithm

  • Alt-CDM [19]: alternating optimization + coordinate descent method

  • Spa-OMP [24]: spatially sparse representation + orthogonal matching pursuit

  • ManiOpt [29]: manifold optimization (where we utilize the Manopt function [57] for implementation)

Note that ManiOpt in [29] utilizes infinite-resolution (i.e., B=𝐵B=\inftyitalic_B = ∞) phase shifters, which is adopted as a benchmark in this work. Also note that the ManiOpt is initialized with random value or with the output of the proposed AltOpt-LS-ADMM method, labelled as “ManiOpt (random initialization)” and “ManiOpt (proposed initialization)”, respectively.

V-B Results and Discussion of Scenario I

V-B1 DecpErr as a Function of NRFsubscript𝑁RFN_{\text{\emph{RF}}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT

We randomly generate an 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT and decompose it into 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT and 𝐅BBsubscript𝐅BB{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT by using the proposed AltOpt-LS-ADMM algorithm. The DecpErrs are averaged over 500 Monte-Carlo trials, and the results w.r.t. the number of RF chains, NRFsubscript𝑁RFN_{\text{RF}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT, are plotted in Fig. 3. It can be seen that: (i) when NRFsubscript𝑁RFN_{\text{RF}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT or B𝐵Bitalic_B increases, the DecpErr decreases; (ii) when B=5𝐵5B=5italic_B = 5, its performance approaches the one with B=𝐵B=\inftyitalic_B = ∞ (i.e., the infinite resolution phase shifter); (iii) when NRF=NTx=16subscript𝑁RFsubscript𝑁Tx16N_{\text{RF}}=N_{\text{Tx}}=16italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT = 16, the DecpErrs are always 0. This is because when NRF=NTxsubscript𝑁RFsubscript𝑁TxN_{\text{RF}}=N_{\text{Tx}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT, 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT is a square matrix and invertible, and thus there always exists a matrix 𝐅BB=𝐅RF1𝐅optsubscript𝐅BBsuperscriptsubscript𝐅RF1subscript𝐅opt{\bf F}_{\text{BB}}={\bf F}_{\text{RF}}^{-1}{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT = bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT such that 𝐅opt=𝐅RF𝐅BBsubscript𝐅optsubscript𝐅RFsubscript𝐅BB{\bf F}_{\text{opt}}={\bf F}_{\text{RF}}{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT = bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT.

Refer to caption

Figure 3: Decomposition error versus number of RF chains with different bits of the phase shifter, by the proposed method.

Next, the DecpErrs of different methods are displayed in Fig. 4, with B=2𝐵2B=2italic_B = 2. We see that ManiOpt with random initialization has the worst performance and ManiOpt with the proposed method as initialization achieves the highest performance. Besides, when NRF7subscript𝑁RF7N_{\text{RF}}\geq 7italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT ≥ 7, the decomposition error of the proposed method is larger than that of ManiOpt with proposed initilization, and smaller than those of Alt-Babai, Alt-CDM, Spa-OMP, and ManiOpt with random initialization. Note that ManiOpt with proposed initialization attains the best decomposition performance at a cost of higher computational complexity, which will be verified in Fig. 11.

Refer to caption

Figure 4: Decomposition error versus number of RF chains with B=2𝐵2B=2italic_B = 2 bits, among different algorithms.

V-B2 DecpErr as a Function of B𝐵Bitalic_B

The DecpErr of the proposed algorithm w.r.t. the number of quantization bits, B𝐵Bitalic_B, is shown in Fig. 5. We observe that when NRFsubscript𝑁RFN_{\text{RF}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT increases, the decomposition error decreases, as expected. For NRF<16subscript𝑁RF16N_{\text{RF}}<16italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT < 16, when B𝐵Bitalic_B increases from 1 until 5, the decomposition error decreases; when B5𝐵5B\geq 5italic_B ≥ 5, the decomposition error remains nearly unchanged. Therefore, taking into account the outcomes presented in Fig. 3, we can infer that B=5𝐵5B=5italic_B = 5 bits prove to be sufficient in achieving near-optimal hybrid precoding performance (i.e., reaching a performance level very close to that obtained by digital precoding). Besides, it is seen from Fig. 5 that, the decomposition error is 0 when NRF=NTx=16subscript𝑁RFsubscript𝑁Tx16N_{\text{RF}}=N_{\text{Tx}}=16italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT = 16, which has been explained in the first example.

Refer to caption

Figure 5: Decomposition error versus number of bits of the phase shifter with different numbers of RF chains, by the proposed method.

Next, the DecpErrs of different algorithms are depicted in Fig. 6, with NRF=8subscript𝑁RF8N_{\text{RF}}=8italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = 8, which verifies the better performance of the proposed method against Alt-Babai, Alt-CDM, Spa-OMP, and ManiOpt with random initialization, especially when number of bits B2𝐵2B\geq 2italic_B ≥ 2. Note that the ManiOpt with random initialization has a horizontal line because it uses B=𝐵B=\inftyitalic_B = ∞ quantization bits; while ManiOpt with proposed initialization does not has a horizontal line because it is sensitive to its initialization (its initialization, i.e., the proposed method, has better performance as B𝐵Bitalic_B increases from 1 until 5).

Refer to caption

Figure 6: Decomposition error versus number of bits of the phase shifter with NRF=8subscript𝑁RF8N_{\text{RF}}=8italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = 8 RF chains, among different algorithms.

V-C Results and Discussion of Scenario II

V-C1 AEB as a Function of NTxsubscript𝑁TxN_{\text{\emph{Tx}}}italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT

We now evaluate the AEB performance of the proposed algorithm and the benchmark methods. The optimal digital precoder 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT can be obtained by the method333Since our main focus is the decomposition step, i.e., Step 2, in Section III-C, in the following simulations, 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT is obtained heuristically by assigning 0.10.10.10.1 of power to derivative beams and the rest to directional beams. as introduced in Section III-C. Then we decompose this 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT into 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT and 𝐅BBsubscript𝐅BB{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT by using different algorithms. The AEBs w.r.t. the number of Tx antennas, NTxsubscript𝑁TxN_{\text{Tx}}italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT, achieved by the different methods are shown in Fig. 7, where the curve labelled as “Optimal (fully digital)” is the result by using 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT directly without decomposition. We can see that the proposed algorithm attains lower AEB than those of Alt-Babai, Alt-CDM, Spa-OMP, and ManiOpt with random initialization. The ManiOpt with proposed initialization outperforms the others and is much closer to the “Optimal (fully digital)” curve.

Refer to caption

Figure 7: AEB versus number of transmit antennas with NRF=4subscript𝑁RF4N_{\text{RF}}\!=\!4italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = 4, B=𝐵B=\inftyitalic_B = ∞, and SNR = 10 dB.

V-C2 AEB as a Function of NRFsubscript𝑁RFN_{\text{\emph{RF}}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT

The results of AEB w.r.t. the number of RF chains, NRFsubscript𝑁RFN_{\text{RF}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT, are presented in Fig. 8. It can be observed from Fig. 8 that the AEB of the proposed algorithm is smaller than those of Alt-Babai, Alt-CDM, Spa-OMP, and ManiOpt with random initilization, while ManiOpt with proposed initialization has the best performance.

Refer to caption

Figure 8: AEB versus number of RF chains with NTx=20subscript𝑁Tx20N_{\text{Tx}}=20italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT = 20, B=𝐵B=\inftyitalic_B = ∞, and SNR = 10 dB.

V-C3 AEB as a Function of B𝐵Bitalic_B

We compare AEB versus different quantization bits of phase shifters, and the results are displayed in Fig. 9. We have similar findings as in Fig. 6.

Refer to caption

Figure 9: AEB versus number of bits of the phase shifters with NTx=20subscript𝑁Tx20N_{\text{Tx}}=20italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT = 20, NRF=4subscript𝑁RF4N_{\text{RF}}=4italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = 4, and SNR = 10 dB.

V-C4 AEB as a Function of AoD

The predefined codebook for 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT is set around 0superscript00^{\circ}0 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT, while the true AoD changes from 80superscript80-80^{\circ}- 80 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT to 80superscript8080^{\circ}80 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT. The results of AEB versus AoD are plotted in Fig. 10. We see that when AoD is 0superscript00^{\circ}0 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT (matching with our predefined codebook), all the curves reach their lowest AEB. Besides, the ManiOpt with proposed initialization outperforms others, followed by the proposed AltOpt-LS-ADMM algorithm. In addition, the proposed algorithm for hybrid precoder design, both when used independently and as an initialization for ManiOpt, exhibits AoD estimation performance very close to that achieved via a fully digital array, which underscores its effectiveness over a broad range of AoD values.

Refer to caption

Figure 10: AEB versus AoD with NTx=20subscript𝑁Tx20N_{\text{Tx}}=20italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT = 20, NRF=4subscript𝑁RF4N_{\text{RF}}=4italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT = 4, B=𝐵B=\inftyitalic_B = ∞, and SNR = 10 dB.

V-C5 CPU Runtime as a Function of NTxsubscript𝑁TxN_{\text{\emph{Tx}}}italic_N start_POSTSUBSCRIPT Tx end_POSTSUBSCRIPT or NRFsubscript𝑁RFN_{\text{\emph{RF}}}italic_N start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT

We compare the computational complexity of different algorithms. The central processing unit (CPU) runtime versus number of transmit antennas is drawn in Fig. 11 (left). On the other hand, the CPU runtime versus number of RF chains is drawn in Fig. 11 (right). From Fig. 11, it can be seen that ManiOpt with random initialization and ManiOpt with proposed initialization have almost the same CPU runtime. The Optimal method has the least CPU runtime since it does not need to perform the decomposition operation on 𝐅optsubscript𝐅opt{\bf F}_{\text{opt}}bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT. Besides, the proposed algorithm consumes less CPU runtime than Alt-Babai, Alt-CDM, Spa-OMP, and ManiOpt.

Refer to caption

Figure 11: CPU runtime versus number of transmit antennas (left) and number of RF chains (right).

VI Conclusion

In this paper, we have investigated the hybrid precoder design problem for angle-of-departure (AoD) estimation, where we took into account practical limitation on the finite resolution of phase shifters. Our aim was to devise a radio-frequency (RF) precoder and a base-band (BB) precoder that could simultaneously adhere to the practical constraint and achieve highly precise AoD estimation. To accomplish this goal, we developed a two-step approach. Firstly, we derived a fully digital precoder that minimizes the angle error bound by using a predefined codebook. Then, we decomposed this digital precoder into an RF precoder and a BB precoder, employing the alternating optimization framework and alternating direction method of multipliers. We also analysed the quantization error bound, and provided convergence analyses of the proposed algorithm. Numerical results demonstrated the exceptional performance of the proposed method with low complexity, leading to the following key conclusions:

  • Number of Bits Sufficient for AoD Estimation: 5 bits are sufficient to achieve almost the same decomposition and AoD estimation performance as the case with infinite-resolution phase shifters.

  • Number of RF Chains Sufficient for AoD Estimation: For a 20-element transmit array, 4 RF chains are sufficient to attain the same AoD estimation performance as the fully-digital architecture.

  • High-Quality Initialization: The proposed algorithm can provide high-quality initialization that boosts the performance of manifold optimization compared to random initialization.

  • Covering Wide Range of AoDs: The proposed algorithm attains near-optimal (in the sense of achieving the fully-digital performance) AoD performance over a broad range of AoD values ranging from -80 to 80 degrees.

Appendix A Calculation of the FIM

The derivatives of 𝐲~=β𝐒(𝐅RF𝐅BB)T𝐚(θ)~𝐲𝛽𝐒superscriptsubscript𝐅RFsubscript𝐅BBT𝐚𝜃{\bf{\tilde{y}}}=\beta{\bf S}({\bf F}_{\text{RF}}{\bf F}_{\text{BB}})^{\textrm% {T}}{\bf a}(\theta)over~ start_ARG bold_y end_ARG = italic_β bold_S ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_a ( italic_θ ) w.r.t. [𝐱]isubscriptdelimited-[]𝐱𝑖[{\bf x}]_{i}[ bold_x ] start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i=1,2,3𝑖123i=1,2,3italic_i = 1 , 2 , 3, are given as follows

𝐲~[𝐱]1~𝐲subscriptdelimited-[]𝐱1\displaystyle\frac{\partial{\bf{\tilde{y}}}}{\partial[{\bf x}]_{1}}divide start_ARG ∂ over~ start_ARG bold_y end_ARG end_ARG start_ARG ∂ [ bold_x ] start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG =𝐲~sinθ=ȷ2βπdλ𝐒(𝐅RF𝐅BB)T𝐃𝐚(θ),absent~𝐲𝜃italic-ȷ2𝛽𝜋𝑑𝜆𝐒superscriptsubscript𝐅RFsubscript𝐅BBT𝐃𝐚𝜃\displaystyle=\frac{\partial{\bf{\tilde{y}}}}{\partial\sin{\theta}}=-\jmath% \frac{2\beta\pi d}{\lambda}{\bf S}({\bf F}_{\text{RF}}{\bf F}_{\text{BB}})^{% \textrm{T}}{\bf D}{\bf a}(\theta),= divide start_ARG ∂ over~ start_ARG bold_y end_ARG end_ARG start_ARG ∂ roman_sin italic_θ end_ARG = - italic_ȷ divide start_ARG 2 italic_β italic_π italic_d end_ARG start_ARG italic_λ end_ARG bold_S ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_Da ( italic_θ ) ,
𝐲~[𝐱]2~𝐲subscriptdelimited-[]𝐱2\displaystyle\frac{\partial{\bf{\tilde{y}}}}{\partial[{\bf x}]_{2}}divide start_ARG ∂ over~ start_ARG bold_y end_ARG end_ARG start_ARG ∂ [ bold_x ] start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG =𝐲~βR=𝐒(𝐅RF𝐅BB)T𝐚(θ),absent~𝐲subscript𝛽R𝐒superscriptsubscript𝐅RFsubscript𝐅BBT𝐚𝜃\displaystyle=\frac{\partial{\bf{\tilde{y}}}}{\partial\beta_{\text{R}}}={\bf S% }({\bf F}_{\text{RF}}{\bf F}_{\text{BB}})^{\textrm{T}}{\bf a}(\theta),= divide start_ARG ∂ over~ start_ARG bold_y end_ARG end_ARG start_ARG ∂ italic_β start_POSTSUBSCRIPT R end_POSTSUBSCRIPT end_ARG = bold_S ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_a ( italic_θ ) ,
𝐲~[𝐱]3~𝐲subscriptdelimited-[]𝐱3\displaystyle\frac{\partial{\bf{\tilde{y}}}}{\partial[{\bf x}]_{3}}divide start_ARG ∂ over~ start_ARG bold_y end_ARG end_ARG start_ARG ∂ [ bold_x ] start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG =𝐲~βI=ȷ𝐒(𝐅RF𝐅BB)T𝐚(θ),absent~𝐲subscript𝛽Iitalic-ȷ𝐒superscriptsubscript𝐅RFsubscript𝐅BBT𝐚𝜃\displaystyle=\frac{\partial{\bf{\tilde{y}}}}{\partial\beta_{\text{I}}}=\jmath% {\bf S}({\bf F}_{\text{RF}}{\bf F}_{\text{BB}})^{\textrm{T}}{\bf a}(\theta),= divide start_ARG ∂ over~ start_ARG bold_y end_ARG end_ARG start_ARG ∂ italic_β start_POSTSUBSCRIPT I end_POSTSUBSCRIPT end_ARG = italic_ȷ bold_S ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_a ( italic_θ ) ,

where 𝐃diag{0,1,,NTx1}𝐃diag01subscript𝑁Tx1{\bf D}\triangleq\text{diag}\{0,1,\cdots,N_{\text{Tx}-1}\}bold_D ≜ diag { 0 , 1 , ⋯ , italic_N start_POSTSUBSCRIPT Tx - 1 end_POSTSUBSCRIPT }. According to (4), we have

[𝐉]11subscriptdelimited-[]𝐉11\displaystyle[{\bf J}]_{11}[ bold_J ] start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT =8|β|2π2d2σn2λ2𝐚H(θ)𝐃𝐅*𝐒H𝐒𝐅T𝐃𝐚(θ),absent8superscript𝛽2superscript𝜋2superscript𝑑2superscriptsubscript𝜎n2superscript𝜆2superscript𝐚H𝜃superscript𝐃𝐅superscript𝐒Hsuperscript𝐒𝐅T𝐃𝐚𝜃\displaystyle=\frac{8|\beta|^{2}\pi^{2}d^{2}}{\sigma_{\text{n}}^{2}\lambda^{2}% }{\bf a}^{\textrm{H}}(\theta){\bf D}{\bf F}^{*}{\bf S}^{\textrm{H}}{\bf S}{\bf F% }^{\textrm{T}}{\bf D}{\bf a}(\theta),= divide start_ARG 8 | italic_β | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG bold_a start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ( italic_θ ) bold_DF start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT bold_S start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT bold_SF start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_Da ( italic_θ ) ,
[𝐉]12subscriptdelimited-[]𝐉12\displaystyle[{\bf J}]_{12}[ bold_J ] start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT =[𝐉]21=4πdσn2λ×{ȷβ*𝐚H(θ)𝐃𝐅*𝐒H𝐒𝐅T𝐚(θ)},absentsubscriptdelimited-[]𝐉214𝜋𝑑superscriptsubscript𝜎n2𝜆italic-ȷsuperscript𝛽superscript𝐚H𝜃superscript𝐃𝐅superscript𝐒Hsuperscript𝐒𝐅T𝐚𝜃\displaystyle=[{\bf J}]_{21}=\frac{4\pi d}{\sigma_{\text{n}}^{2}\lambda}\times% \Re\left\{\jmath\beta^{*}{\bf a}^{\textrm{H}}(\theta){\bf D}{\bf F}^{*}{\bf S}% ^{\textrm{H}}{\bf S}{\bf F}^{\textrm{T}}{\bf a}(\theta)\right\},= [ bold_J ] start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT = divide start_ARG 4 italic_π italic_d end_ARG start_ARG italic_σ start_POSTSUBSCRIPT n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_λ end_ARG × roman_ℜ { italic_ȷ italic_β start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT bold_a start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ( italic_θ ) bold_DF start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT bold_S start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT bold_SF start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_a ( italic_θ ) } ,
[𝐉]13subscriptdelimited-[]𝐉13\displaystyle[{\bf J}]_{13}[ bold_J ] start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT =[𝐉]31=4πdσn2λ×{β*𝐚H(θ)𝐃𝐅*𝐒H𝐒𝐅T𝐚(θ)},absentsubscriptdelimited-[]𝐉314𝜋𝑑superscriptsubscript𝜎n2𝜆superscript𝛽superscript𝐚H𝜃superscript𝐃𝐅superscript𝐒Hsuperscript𝐒𝐅T𝐚𝜃\displaystyle=[{\bf J}]_{31}=-\frac{4\pi d}{\sigma_{\text{n}}^{2}\lambda}% \times\Re\left\{\beta^{*}{\bf a}^{\textrm{H}}(\theta){\bf D}{\bf F}^{*}{\bf S}% ^{\textrm{H}}{\bf S}{\bf F}^{\textrm{T}}{\bf a}(\theta)\right\},= [ bold_J ] start_POSTSUBSCRIPT 31 end_POSTSUBSCRIPT = - divide start_ARG 4 italic_π italic_d end_ARG start_ARG italic_σ start_POSTSUBSCRIPT n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_λ end_ARG × roman_ℜ { italic_β start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT bold_a start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ( italic_θ ) bold_DF start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT bold_S start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT bold_SF start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_a ( italic_θ ) } ,
[𝐉]22subscriptdelimited-[]𝐉22\displaystyle[{\bf J}]_{22}[ bold_J ] start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT =[𝐉]33=2σn2𝐚H(θ)𝐅*𝐒H𝐒𝐅T𝐚(θ),absentsubscriptdelimited-[]𝐉332superscriptsubscript𝜎n2superscript𝐚H𝜃superscript𝐅superscript𝐒Hsuperscript𝐒𝐅T𝐚𝜃\displaystyle=[{\bf J}]_{33}=\frac{2}{\sigma_{\text{n}}^{2}}{\bf a}^{\textrm{H% }}(\theta){\bf F}^{*}{\bf S}^{\textrm{H}}{\bf S}{\bf F}^{\textrm{T}}{\bf a}(% \theta),= [ bold_J ] start_POSTSUBSCRIPT 33 end_POSTSUBSCRIPT = divide start_ARG 2 end_ARG start_ARG italic_σ start_POSTSUBSCRIPT n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG bold_a start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ( italic_θ ) bold_F start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT bold_S start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT bold_SF start_POSTSUPERSCRIPT T end_POSTSUPERSCRIPT bold_a ( italic_θ ) ,
[𝐉]23subscriptdelimited-[]𝐉23\displaystyle[{\bf J}]_{23}[ bold_J ] start_POSTSUBSCRIPT 23 end_POSTSUBSCRIPT =[𝐉]32=0.absentsubscriptdelimited-[]𝐉320\displaystyle=[{\bf J}]_{32}=0.= [ bold_J ] start_POSTSUBSCRIPT 32 end_POSTSUBSCRIPT = 0 .

Appendix B Calculation of the AEB

Based on the block matrix inversion lemma [55], we have

AEB(𝐅RF,𝐅BB;𝐱)AEBsubscript𝐅RFsubscript𝐅BB𝐱\displaystyle\text{AEB}({\bf F}_{\text{RF}},{\bf F}_{\text{BB}};{\bf x})AEB ( bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ; bold_x )
=\displaystyle== ([𝐉]11[[𝐉]12,[𝐉]13][[𝐉]2200[𝐉]33]1[[𝐉]21[𝐉]31])12superscriptsubscriptdelimited-[]𝐉11subscriptdelimited-[]𝐉12subscriptdelimited-[]𝐉13superscriptdelimited-[]subscriptdelimited-[]𝐉2200subscriptdelimited-[]𝐉331delimited-[]subscriptdelimited-[]𝐉21subscriptdelimited-[]𝐉3112\displaystyle~{}\!\left(\![{\bf J}]_{11}-\bigg{[}[{\bf J}]_{12},[{\bf J}]_{13}% \bigg{]}\!\!\left[\!\!\!\begin{array}[]{cc}[{\bf J}]_{22}&0\\ 0&[{\bf J}]_{33}\end{array}\!\!\!\right]^{-1}\!\left[\!\!\begin{array}[]{c}% \left[{\bf J}\right]_{21}\\ \left[{\bf J}\right]_{31}\end{array}\!\!\right]\!\right)^{-\frac{1}{2}}( [ bold_J ] start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - [ [ bold_J ] start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT , [ bold_J ] start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT ] [ start_ARRAY start_ROW start_CELL [ bold_J ] start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL [ bold_J ] start_POSTSUBSCRIPT 33 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT [ start_ARRAY start_ROW start_CELL [ bold_J ] start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL [ bold_J ] start_POSTSUBSCRIPT 31 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] ) start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT
=\displaystyle== ([𝐉]11([𝐉]12)2+([𝐉]13)2[𝐉]22)12.superscriptsubscriptdelimited-[]𝐉11superscriptsubscriptdelimited-[]𝐉122superscriptsubscriptdelimited-[]𝐉132subscriptdelimited-[]𝐉2212\displaystyle~{}\!\left([{\bf J}]_{11}-\frac{(\left[{\bf J}\right]_{12})^{2}+(% [{\bf J}]_{13})^{2}}{[{\bf J}]_{22}}\right)^{-\frac{1}{2}}.( [ bold_J ] start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT - divide start_ARG ( [ bold_J ] start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ( [ bold_J ] start_POSTSUBSCRIPT 13 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG [ bold_J ] start_POSTSUBSCRIPT 22 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT .

Substituting the results in Appendix A and 𝐒H𝐒=σs2𝐈Msuperscript𝐒H𝐒superscriptsubscript𝜎s2subscript𝐈𝑀{\bf S}^{\textrm{H}}{\bf S}=\sigma_{\text{s}}^{2}{\bf I}_{M}bold_S start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT bold_S = italic_σ start_POSTSUBSCRIPT s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT into the above equation yields (6).

Appendix C Proof of Theorem 1

To show the convergence of {(𝐅~RF(k),𝐅RF(k),𝐔(k))}superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘superscript𝐔𝑘\left\{\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k)},{\bf F}_{\text{RF% }}^{(k)},{\bf U}^{(k)}\right)\right\}{ caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) }, we first provide the following two lemmas:

Lemma 1.

The proposed AltOpt-LS-ADMM algorithm, i.e., Algorithm 1, produces a monotonically decreasing sequence {(k)|k=0,1,2,}conditional-setsuperscript𝑘𝑘012normal-⋯\{\mathcal{L}^{(k)}|k=0,1,2,\cdots\}{ caligraphic_L start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT | italic_k = 0 , 1 , 2 , ⋯ }, where (k)(𝐅~RF(k),𝐅RF(k),𝐔(k))normal-≜superscript𝑘superscriptsubscriptnormal-~𝐅RF𝑘superscriptsubscript𝐅RF𝑘superscript𝐔𝑘\mathcal{L}^{(k)}\triangleq\mathcal{L}\left({\bf{\tilde{F}}}_{\text{\emph{RF}}% }^{(k)},{\bf F}_{\text{\emph{RF}}}^{(k)},{\bf U}^{(k)}\right)caligraphic_L start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ≜ caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ), provided that the augmented Lagrangian parameter ρ𝜌\rhoitalic_ρ satisfies

ρ2𝐅BB𝐅BBHF.𝜌2subscriptnormsubscript𝐅BBsuperscriptsubscript𝐅BBHF\displaystyle\rho\geq\sqrt{2}\|{\bf F}_{\text{\emph{BB}}}{\bf F}_{\text{\emph{% BB}}}^{\textrm{\emph{H}}}\|_{\text{\emph{F}}}.italic_ρ ≥ square-root start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT . (30)
Lemma 2.

The function (𝐅~RF,𝐅RF,𝐔)subscriptnormal-~𝐅RFsubscript𝐅RF𝐔\mathcal{L}\left({\bf{\tilde{F}}}_{\text{\emph{RF}}},{\bf F}_{\text{\emph{RF}}% },{\bf U}\right)caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_U ) defined in (III-C) is bounded from below by 0 during the iteration process (III-C), provided that the augmented Lagrangian parameter ρ𝜌\rhoitalic_ρ satisfies

ρ𝐅BBF2.𝜌superscriptsubscriptnormsubscript𝐅BBF2\displaystyle\rho\geq\|{\bf F}_{\text{\emph{BB}}}\|_{\text{\emph{F}}}^{2}.italic_ρ ≥ ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (31)

The proofs of Lemmas 1 and 2 are relegated to Appendix E and Appendix F, respectively. These two lemmas straightforwardly implies that the sequence {(𝐅~RF(k),𝐅RF(k),𝐔(k))}superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘superscript𝐔𝑘\left\{\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k)},{\bf F}_{\text{RF% }}^{(k)},{\bf U}^{(k)}\right)\right\}{ caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) } is convergent. Therefore, when the augmented Lagrangian parameter ρ𝜌\rhoitalic_ρ satisfies (26), we have

(k+1)(k)=0superscript𝑘1superscript𝑘0\displaystyle\mathcal{L}^{(k+1)}-\mathcal{L}^{(k)}=0caligraphic_L start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - caligraphic_L start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT = 0 (32)

as k𝑘k\to\inftyitalic_k → ∞. On the other hand, it is showed that if ρ𝐅BBHF2𝜌superscriptsubscriptnormsuperscriptsubscript𝐅BBHF2\rho\geq\|{\bf F}_{\text{BB}}^{\textrm{H}}\|_{\text{F}}^{2}italic_ρ ≥ ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT,

(k+1)(k)(i)×𝐅~RF(k+1)𝐅~RF(k)F20,superscript𝑘1superscript𝑘isuperscriptsubscriptnormsuperscriptsubscript~𝐅RF𝑘1superscriptsubscript~𝐅RF𝑘F20\displaystyle\mathcal{L}^{(k+1)}-\mathcal{L}^{(k)}\leq(\text{i})\times\left\|{% \bf{\tilde{F}}}_{\text{RF}}^{(k+1)}-{\bf{\tilde{F}}}_{\text{RF}}^{(k)}\right\|% _{\text{F}}^{2}\leq 0,caligraphic_L start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - caligraphic_L start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ≤ ( i ) × ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ 0 , (33)

where the term (i)i(\text{i})( i ) is defined in Appendix E. Combining (32) and (33) leads to

𝐅~RF(k+1)=𝐅~RF(k).superscriptsubscript~𝐅RF𝑘1superscriptsubscript~𝐅RF𝑘\displaystyle{\bf{\tilde{F}}}_{\text{RF}}^{(k+1)}={\bf{\tilde{F}}}_{\text{RF}}% ^{(k)}.over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT = over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT . (34)

The above equation together with

𝐔=1ρ(𝐅opt𝐅~RF𝐅BB)𝐅BBH𝐔1𝜌subscript𝐅optsubscript~𝐅RFsubscript𝐅BBsuperscriptsubscript𝐅BBH\displaystyle{\bf U}=\frac{1}{\rho}\left({\bf F}_{\text{opt}}-{\bf{\tilde{F}}}% _{\text{RF}}{\bf F}_{\text{BB}}\right){\bf F}_{\text{BB}}^{\textrm{H}}bold_U = divide start_ARG 1 end_ARG start_ARG italic_ρ end_ARG ( bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ) bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT (35)

(which is the result of combining (III-C) and (23c)) yields

𝐔(k+1)=𝐔(k).superscript𝐔𝑘1superscript𝐔𝑘\displaystyle{\bf U}^{(k+1)}={\bf U}^{(k)}.bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT = bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT . (36)

Since 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT is calculated based on 𝐅~RFsubscript~𝐅RF{\bf{\tilde{F}}}_{\text{RF}}over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT and 𝐔𝐔{\bf U}bold_U (see Line 6 in Algorithm 1), (34) and (36) yields

𝐅RF(k+1)=𝐅RF(k).superscriptsubscript𝐅RF𝑘1superscriptsubscript𝐅RF𝑘\displaystyle{\bf F}_{\text{RF}}^{(k+1)}={\bf F}_{\text{RF}}^{(k)}.bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT = bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT . (37)

Further, according to Line 8 in Algorithm 1, we have

𝐅RF(k)=𝐅~RF(k).superscriptsubscript𝐅RF𝑘superscriptsubscript~𝐅RF𝑘\displaystyle{\bf F}_{\text{RF}}^{(k)}={\bf{\tilde{F}}}_{\text{RF}}^{(k)}.bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT = over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT . (38)

On the other hand, from (34) we have 𝐅~RF(k+1)𝐅~RF(k)F0subscriptnormsuperscriptsubscript~𝐅RF𝑘1superscriptsubscript~𝐅RF𝑘F0\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k+1)}\!-\!{\bf{\tilde{F}}}_{\text{RF}}^{% (k)}\!\right\|_{\text{F}}\to 0∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT → 0 as k𝑘k\to\inftyitalic_k → ∞. This leads to the fact that: for any positive ϵitalic-ϵ\epsilonitalic_ϵ, there always exists an integer T𝑇Titalic_T (large enough), such that

𝐅~RF(k1)𝐅~RF(k2)Fsubscriptnormsuperscriptsubscript~𝐅RFsubscript𝑘1superscriptsubscript~𝐅RFsubscript𝑘2F\displaystyle\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k_{1})}\!-\!{\bf{\tilde{F}}% }_{\text{RF}}^{(k_{2})}\!\right\|_{\text{F}}∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT
=\displaystyle== 𝐅~RF(k1)𝐅~RF(k1+1)+𝐅~RF(k1+1)𝐅~RF(k1+2)++𝐅~RF(k21)𝐅~RF(k2)Fsubscriptnormsuperscriptsubscript~𝐅RFsubscript𝑘1superscriptsubscript~𝐅RFsubscript𝑘11superscriptsubscript~𝐅RFsubscript𝑘11superscriptsubscript~𝐅RFsubscript𝑘12superscriptsubscript~𝐅RFsubscript𝑘21superscriptsubscript~𝐅RFsubscript𝑘2F\displaystyle\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k_{1})}\!\!-\!{\bf{\tilde{F% }}}_{\text{RF}}^{(k_{1}+1)}\!+\!{\bf{\tilde{F}}}_{\text{RF}}^{(k_{1}+1)}\!\!-% \!{\bf{\tilde{F}}}_{\text{RF}}^{(k_{1}+2)}\!+\!\cdots\!+\!{\bf{\tilde{F}}}_{% \text{RF}}^{(k_{2}-1)}\!\!-\!{\bf{\tilde{F}}}_{\text{RF}}^{(k_{2})}\!\right\|_% {\text{F}}∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 ) end_POSTSUPERSCRIPT + over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 ) end_POSTSUPERSCRIPT + ⋯ + over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT
\displaystyle\leq 𝐅~RF(k1)𝐅~RF(k1+1)F+𝐅~RF(k1+1)𝐅~RF(k1+2)F+subscriptnormsuperscriptsubscript~𝐅RFsubscript𝑘1superscriptsubscript~𝐅RFsubscript𝑘11Fsubscriptnormsuperscriptsubscript~𝐅RFsubscript𝑘11superscriptsubscript~𝐅RFsubscript𝑘12F\displaystyle\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k_{1})}\!\!-\!{\bf{\tilde{F% }}}_{\text{RF}}^{(k_{1}+1)}\right\|_{\text{F}}+\left\|{\bf{\tilde{F}}}_{\text{% RF}}^{(k_{1}+1)}\!\!-\!{\bf{\tilde{F}}}_{\text{RF}}^{(k_{1}+2)}\right\|_{\text% {F}}+\cdots∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT + ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT + ⋯
+𝐅~RF(k21)𝐅~RF(k2)Fsubscriptnormsuperscriptsubscript~𝐅RFsubscript𝑘21superscriptsubscript~𝐅RFsubscript𝑘2F\displaystyle\cdots+\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k_{2}-1)}\!\!-\!{\bf% {\tilde{F}}}_{\text{RF}}^{(k_{2})}\right\|_{\text{F}}⋯ + ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT
\displaystyle\leq ϵitalic-ϵ\displaystyle~{}\!\epsilonitalic_ϵ

holds for all k1subscript𝑘1k_{1}italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, k2subscript𝑘2k_{2}italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT Tabsent𝑇\geq T≥ italic_T (without loss of generality we assume k2>k1subscript𝑘2subscript𝑘1k_{2}>k_{1}italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT in the above inequalities). This indicates that sequence {𝐅~RF(k)}superscriptsubscript~𝐅RF𝑘\left\{\!{\bf{\tilde{F}}}_{\text{RF}}^{(k)}\!\right\}{ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT } is a Cauchy sequence, and thus it converges to a fixed point after a finite number (i.e., T𝑇Titalic_T) of iterations [58]. Similarly, both sequences {𝐔(k)}superscript𝐔𝑘\left\{\!{\bf{U}}^{(k)}\!\right\}{ bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT } and {𝐅RF(k)}superscriptsubscript𝐅RF𝑘\left\{\!{\bf{F}}_{\text{RF}}^{(k)}\!\right\}{ bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT } are Cauchy sequences and they converge to fixed points after T𝑇Titalic_T iterations, thanks to (35) and Line 6 in Algorithm 1. This completes the proof of Theorem 1.

Appendix D Proof of Theorem 2

Since the proposed AltOpt-LS-ADMM algorithm, i.e., Algorithm 1, has unique optimal solutions for both 𝐅BBsubscript𝐅BB{\bf F}_{\text{BB}}bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT (see (19)) and 𝐅RFsubscript𝐅RF{\bf F}_{\text{RF}}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT (see Theorem 1) at each iteration, we have

𝐅opt𝐅RF(i+1)𝐅BB(i+1)Fsubscriptnormsubscript𝐅optsuperscriptsubscript𝐅RF𝑖1superscriptsubscript𝐅BB𝑖1Fabsent\displaystyle\|{\bf F}_{\text{opt}}-{\bf F}_{\text{RF}}^{(i+1)}{\bf F}_{\text{% BB}}^{(i+1)}\|_{\text{F}}\leq∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ≤ 𝐅opt𝐅RF(i)𝐅BB(i+1)Fsubscriptnormsubscript𝐅optsuperscriptsubscript𝐅RF𝑖superscriptsubscript𝐅BB𝑖1F\displaystyle~{}\!\|{\bf F}_{\text{opt}}-{\bf F}_{\text{RF}}^{(i)}{\bf F}_{% \text{BB}}^{(i+1)}\|_{\text{F}}∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT
\displaystyle\leq 𝐅opt𝐅RF(i)𝐅BB(i)F,subscriptnormsubscript𝐅optsuperscriptsubscript𝐅RF𝑖superscriptsubscript𝐅BB𝑖F\displaystyle~{}\!\|{\bf F}_{\text{opt}}-{\bf F}_{\text{RF}}^{(i)}{\bf F}_{% \text{BB}}^{(i)}\|_{\text{F}},∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ,

which shows that sequence {𝐅opt𝐅RF(i)𝐅BB(i)F}subscriptnormsubscript𝐅optsuperscriptsubscript𝐅RF𝑖superscriptsubscript𝐅BB𝑖F\left\{\|{\bf F}_{\text{opt}}-{\bf F}_{\text{RF}}^{(i)}{\bf F}_{\text{BB}}^{(i% )}\|_{\text{F}}\right\}{ ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT } is monotonically decreasing. On the other hand, it is straightforward to see that 𝐅opt𝐅RF(i)𝐅BB(i)Fsubscriptnormsubscript𝐅optsuperscriptsubscript𝐅RF𝑖superscriptsubscript𝐅BB𝑖F\|{\bf F}_{\text{opt}}-{\bf F}_{\text{RF}}^{(i)}{\bf F}_{\text{BB}}^{(i)}\|_{% \text{F}}∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT is bounded from below by 0. This indicates that sequence {𝐅opt𝐅RF(i)𝐅BB(i)F}subscriptnormsubscript𝐅optsuperscriptsubscript𝐅RF𝑖superscriptsubscript𝐅BB𝑖F\left\{\|{\bf F}_{\text{opt}}-{\bf F}_{\text{RF}}^{(i)}{\bf F}_{\text{BB}}^{(i% )}\|_{\text{F}}\right\}{ ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT } generated by the proposed algorithm converges. This completes the proof of Theorem 2.

Appendix E Proof of Lemma 1

The difference between the augmented Lagrangian function values at two successive iterations is calculated as

(𝐅~RF(k+1),𝐅RF(k+1),𝐔(k+1))(𝐅~RF(k),𝐅RF(k),𝐔(k))superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘1superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘superscript𝐔𝑘\displaystyle\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k+1)},{\bf F}_{% \text{RF}}^{(k+1)},{\bf U}^{(k+1)}\right)-\mathcal{L}\!\left({\bf{\tilde{F}}}_% {\text{RF}}^{(k)},{\bf F}_{\text{RF}}^{(k)},{\bf U}^{(k)}\right)caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ) - caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT )
=\displaystyle== [(𝐅~RF(k+1),𝐅RF(k+1),𝐔(k+1))(𝐅~RF(k+1),𝐅RF(k+1),𝐔(k))]delimited-[]superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘1superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘\displaystyle\left[\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k+1)},{% \bf F}_{\text{RF}}^{(k+1)},{\bf U}^{(k+1)}\right)\!-\!\mathcal{L}\!\left({\bf{% \tilde{F}}}_{\text{RF}}^{(k+1)},{\bf F}_{\text{RF}}^{(k+1)},{\bf U}^{(k)}% \right)\right][ caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ) - caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) ]
+[(𝐅~RF(k+1),𝐅RF(k+1),𝐔(k))(𝐅~RF(k),𝐅RF(k+1),𝐔(k))]delimited-[]superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘\displaystyle+\left[\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k+1)},{% \bf F}_{\text{RF}}^{(k+1)},{\bf U}^{(k)}\right)\!-\!\mathcal{L}\!\left({\bf{% \tilde{F}}}_{\text{RF}}^{(k)},{\bf F}_{\text{RF}}^{(k+1)},{\bf U}^{(k)}\right)\right]+ [ caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) - caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) ]
+[(𝐅~RF(k),𝐅RF(k+1),𝐔(k))(𝐅~RF(k),𝐅RF(k),𝐔(k))].delimited-[]superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘superscript𝐔𝑘\displaystyle+\left[\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k)},{\bf F% }_{\text{RF}}^{(k+1)},{\bf U}^{(k)}\right)\!-\!\mathcal{L}\!\left({\bf{\tilde{% F}}}_{\text{RF}}^{(k)},{\bf F}_{\text{RF}}^{(k)},{\bf U}^{(k)}\right)\right].+ [ caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) - caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) ] . (39)

The three terms in the above three square brackets are respectively calculated as follows. The first term is bounded as

(𝐅~RF(k+1),𝐅RF(k+1),𝐔(k+1))(𝐅~RF(k+1),𝐅RF(k+1),𝐔(k))superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘1superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘\displaystyle\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k+1)},{\bf F}_{% \text{RF}}^{(k+1)},{\bf U}^{(k+1)}\right)-\mathcal{L}\!\left({\bf{\tilde{F}}}_% {\text{RF}}^{(k+1)},{\bf F}_{\text{RF}}^{(k+1)},{\bf U}^{(k)}\right)caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ) - caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT )
=\displaystyle== ρ2(𝐅~RF(k+1)𝐅RF(k+1)+𝐔(k+1)F2𝐔(k+1)F2)𝜌2superscriptsubscriptnormsuperscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘1F2superscriptsubscriptnormsuperscript𝐔𝑘1F2\displaystyle~{}\!\frac{\rho}{2}\left(\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k+% 1)}-{\bf F}_{\text{RF}}^{(k+1)}+{\bf U}^{(k+1)}\right\|_{\text{F}}^{2}-\left\|% {\bf U}^{(k+1)}\right\|_{\text{F}}^{2}\right)divide start_ARG italic_ρ end_ARG start_ARG 2 end_ARG ( ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT + bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT )
ρ2(𝐅~RF(k+1)𝐅RF(k+1)+𝐔(k)F2𝐔(k)F2)𝜌2superscriptsubscriptnormsuperscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘F2superscriptsubscriptnormsuperscript𝐔𝑘F2\displaystyle-\frac{\rho}{2}\left(\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k+1)}-% {\bf F}_{\text{RF}}^{(k+1)}+{\bf U}^{(k)}\right\|_{\text{F}}^{2}-\left\|{\bf U% }^{(k)}\right\|_{\text{F}}^{2}\right)- divide start_ARG italic_ρ end_ARG start_ARG 2 end_ARG ( ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT + bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - ∥ bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) (40a)
=\displaystyle== ρ2(2𝐔(k+1)𝐔(k)F22𝐔(k+1)F2+𝐔(k)F2)𝜌2superscriptsubscriptnorm2superscript𝐔𝑘1superscript𝐔𝑘F22superscriptsubscriptnormsuperscript𝐔𝑘1F2superscriptsubscriptnormsuperscript𝐔𝑘F2\displaystyle~{}\!\frac{\rho}{2}\left(\left\|2{\bf U}^{(k+1)}\!-\!{\bf U}^{(k)% }\right\|_{\text{F}}^{2}\!-\!2\left\|{\bf U}^{(k+1)}\right\|_{\text{F}}^{2}\!+% \!\left\|{\bf U}^{(k)}\right\|_{\text{F}}^{2}\right)divide start_ARG italic_ρ end_ARG start_ARG 2 end_ARG ( ∥ 2 bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 2 ∥ bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∥ bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) (40b)
=\displaystyle== ρ𝐔(k+1)𝐔(k)F2𝜌superscriptsubscriptnormsuperscript𝐔𝑘1superscript𝐔𝑘F2\displaystyle~{}\!\rho\left\|{\bf U}^{(k+1)}-{\bf U}^{(k)}\right\|_{\text{F}}^% {2}italic_ρ ∥ bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
=\displaystyle== 1ρ(𝐅opt𝐅~RF(k+1)𝐅BB)𝐅BBH(𝐅opt𝐅~RF(k)𝐅BB)𝐅BBHF21𝜌superscriptsubscriptnormsubscript𝐅optsuperscriptsubscript~𝐅RF𝑘1subscript𝐅BBsuperscriptsubscript𝐅BBHsubscript𝐅optsuperscriptsubscript~𝐅RF𝑘subscript𝐅BBsuperscriptsubscript𝐅BBHF2\displaystyle~{}\!\frac{1}{\rho}\left\|\!\left(\!{\bf F}_{\text{opt}}\!-\!{\bf% {\tilde{F}}}_{\text{RF}}^{(k+1)}{\bf F}_{\text{BB}}\!\!\right)\!\!{\bf F}_{% \text{BB}}^{\textrm{H}}\!-\!\left(\!{\bf F}_{\text{opt}}\!-\!{\bf{\tilde{F}}}_% {\text{RF}}^{(k)}{\bf F}_{\text{BB}}\!\!\right)\!\!{\bf F}_{\text{BB}}^{% \textrm{H}}\right\|_{\text{F}}^{2}divide start_ARG 1 end_ARG start_ARG italic_ρ end_ARG ∥ ( bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ) bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT - ( bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ) bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (40c)
=\displaystyle== 1ρ(𝐅~RF(k)𝐅~RF(k+1))𝐅BB𝐅BBHF21𝜌superscriptsubscriptnormsuperscriptsubscript~𝐅RF𝑘superscriptsubscript~𝐅RF𝑘1subscript𝐅BBsuperscriptsubscript𝐅BBHF2\displaystyle~{}\!\frac{1}{\rho}\left\|\left({\bf{\tilde{F}}}_{\text{RF}}^{(k)% }-{\bf{\tilde{F}}}_{\text{RF}}^{(k+1)}\right){\bf F}_{\text{BB}}{\bf F}_{\text% {BB}}^{\textrm{H}}\right\|_{\text{F}}^{2}divide start_ARG 1 end_ARG start_ARG italic_ρ end_ARG ∥ ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ) bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
\displaystyle\leq 1ρ𝐅BB𝐅BBHF2𝐅~RF(k+1)𝐅~RF(k)F2,1𝜌superscriptsubscriptnormsubscript𝐅BBsuperscriptsubscript𝐅BBHF2superscriptsubscriptnormsuperscriptsubscript~𝐅RF𝑘1superscriptsubscript~𝐅RF𝑘F2\displaystyle~{}\!\frac{1}{\rho}\left\|{\bf F}_{\text{BB}}{\bf F}_{\text{BB}}^% {\textrm{H}}\right\|_{\text{F}}^{2}\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k+1)}% -{\bf{\tilde{F}}}_{\text{RF}}^{(k)}\right\|_{\text{F}}^{2},divide start_ARG 1 end_ARG start_ARG italic_ρ end_ARG ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (40d)

where in (40a) we used the definition of (𝐅~RF,𝐅RF,𝐔)subscript~𝐅RFsubscript𝐅RF𝐔\mathcal{L}\Big{(}{\bf{\tilde{F}}}_{\text{RF}},{\bf F}_{\text{RF}},{\bf U}\Big% {)}caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_U ); in (40b) we employed 𝐅~RF(k+1)𝐅RF(k+1)=𝐔(k+1)𝐔(k)superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘1superscript𝐔𝑘{\bf{\tilde{F}}}_{\text{RF}}^{(k+1)}-{\bf F}_{\text{RF}}^{(k+1)}={\bf U}^{(k+1% )}-{\bf U}^{(k)}over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT = bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT (due to (23c)); in (40c) we utilized (35); in (40d) we used the fact that 𝐌𝐍F𝐌F𝐍Fsubscriptnorm𝐌𝐍Fsubscriptnorm𝐌Fsubscriptnorm𝐍F\|{\bf M}{\bf N}\|_{\text{F}}\leq\|{\bf M}\|_{\text{F}}\|{\bf N}\|_{\text{F}}∥ bold_MN ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ≤ ∥ bold_M ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT ∥ bold_N ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT holds for any matrices 𝐌𝐌{\bf M}bold_M and 𝐍𝐍{\bf N}bold_N of appropriate sizes. The second term is bounded as

(𝐅~RF(k+1),𝐅RF(k+1),𝐔(k))(𝐅~RF(k),𝐅RF(k+1),𝐔(k))superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘\displaystyle\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k+1)},{\bf F}_{% \text{RF}}^{(k+1)},{\bf U}^{(k)}\right)-\mathcal{L}\!\left({\bf{\tilde{F}}}_{% \text{RF}}^{(k)},{\bf F}_{\text{RF}}^{(k+1)},{\bf U}^{(k)}\right)caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) - caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT )
\displaystyle\leq {𝐅~RF(𝐅~RF(k+1),𝐅RF(k+1),𝐔(k)),𝐅~RF(k+1)𝐅~RF(k)}subscriptsubscript~𝐅RFsuperscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘superscriptsubscript~𝐅RF𝑘1superscriptsubscript~𝐅RF𝑘\displaystyle~{}\!\Re\!\left\{\!\Big{\langle}\nabla_{{\bf{\tilde{F}}}_{\text{% RF}}}\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k+1)},{\bf F}_{\text{RF% }}^{(k+1)},{\bf U}^{(k)}\right),{\bf{\tilde{F}}}_{\text{RF}}^{(k+1)}\!-\!{\bf{% \tilde{F}}}_{\text{RF}}^{(k)}\Big{\rangle}\!\right\}roman_ℜ { ⟨ ∇ start_POSTSUBSCRIPT over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUBSCRIPT caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) , over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ⟩ }
γ2𝐅~RF(k+1)𝐅~RF(k)F2𝛾2superscriptsubscriptnormsuperscriptsubscript~𝐅RF𝑘1superscriptsubscript~𝐅RF𝑘F2\displaystyle-\frac{\gamma}{2}\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k+1)}-{\bf% {\tilde{F}}}_{\text{RF}}^{(k)}\right\|_{\text{F}}^{2}- divide start_ARG italic_γ end_ARG start_ARG 2 end_ARG ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (41a)
=\displaystyle== λmin(𝐅BB𝐅BBH)+ρ2𝐅~RF(k+1)𝐅~RF(k)F2,subscript𝜆minsubscript𝐅BBsuperscriptsubscript𝐅BBH𝜌2superscriptsubscriptnormsuperscriptsubscript~𝐅RF𝑘1superscriptsubscript~𝐅RF𝑘F2\displaystyle~{}\!-\frac{\lambda_{\textrm{min}}({\bf F}_{\text{BB}}{\bf F}_{% \text{BB}}^{\textrm{H}})+\rho}{2}\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k+1)}-{% \bf{\tilde{F}}}_{\text{RF}}^{(k)}\right\|_{\text{F}}^{2},- divide start_ARG italic_λ start_POSTSUBSCRIPT min end_POSTSUBSCRIPT ( bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ) + italic_ρ end_ARG start_ARG 2 end_ARG ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (41b)

where in (41a) we utilized the strongly convexity of the Lagrangian function (𝐅~RF,𝐅RF,𝐔)subscript~𝐅RFsubscript𝐅RF𝐔\mathcal{L}\Big{(}{\bf{\tilde{F}}}_{\text{RF}},{\bf F}_{\text{RF}},{\bf U}\Big% {)}caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_U ) w.r.t. 𝐅~RFsubscript~𝐅RF{\bf{\tilde{F}}}_{\text{RF}}over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT with parameter γ>0𝛾0\gamma>0italic_γ > 0 [59]; in (41b) we adopted the optimality condition of (III-C) and γ=λmin(𝐅BB𝐅BBH)+ρ𝛾subscript𝜆minsubscript𝐅BBsuperscriptsubscript𝐅BBH𝜌\gamma=\lambda_{\textrm{min}}({\bf F}_{\text{BB}}{\bf F}_{\text{BB}}^{\textrm{% H}})+\rhoitalic_γ = italic_λ start_POSTSUBSCRIPT min end_POSTSUBSCRIPT ( bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ) + italic_ρ with λmin()subscript𝜆min\lambda_{\textrm{min}}(\cdot)italic_λ start_POSTSUBSCRIPT min end_POSTSUBSCRIPT ( ⋅ ) being the minimal eigenvalue of its argument (which is due to the facts that (𝐅~RF,𝐅RF,𝐔)subscript~𝐅RFsubscript𝐅RF𝐔\mathcal{L}\Big{(}{\bf{\tilde{F}}}_{\text{RF}},{\bf F}_{\text{RF}},{\bf U}\Big% {)}caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_U ) is twice continuously differentiable w.r.t. 𝐅~RFsubscript~𝐅RF{\bf{\tilde{F}}}_{\text{RF}}over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT, and its strong convexity parameter γ𝛾\gammaitalic_γ satisfies 𝐅~RF2=𝐅BB𝐅BBH+ρ𝐈γ𝐈superscriptsubscriptsubscript~𝐅RF2subscript𝐅BBsuperscriptsubscript𝐅BBH𝜌𝐈succeeds-or-equals𝛾𝐈\nabla_{{\bf{\tilde{F}}}_{\text{RF}}}^{2}\mathcal{L}={\bf F}_{\text{BB}}{\bf F% }_{\text{BB}}^{\textrm{H}}+\rho{\bf I}\succeq\gamma{\bf I}∇ start_POSTSUBSCRIPT over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT caligraphic_L = bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT + italic_ρ bold_I ⪰ italic_γ bold_I for all 𝐅~RFsubscript~𝐅RF{\bf{\tilde{F}}}_{\text{RF}}over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT [59]). Finally, the third term is bounded as

(𝐅~RF(k),𝐅RF(k+1),𝐔(k))(𝐅~RF(k),𝐅RF(k),𝐔(k))0,superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘superscript𝐔𝑘0\displaystyle\mathcal{L}\!\left(\!{\bf{\tilde{F}}}_{\text{RF}}^{(k)},{\bf F}_{% \text{RF}}^{(k+1)},{\bf U}^{(k)}\!\right)\!-\!\mathcal{L}\!\left(\!{\bf{\tilde% {F}}}_{\text{RF}}^{(k)},{\bf F}_{\text{RF}}^{(k)},{\bf U}^{(k)}\!\right)\leq 0,caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) - caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) ≤ 0 , (42)

where we employed the fact that 𝐅RF(k+1)superscriptsubscript𝐅RF𝑘1{\bf F}_{\text{RF}}^{(k+1)}bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT is the minimum of (𝐅~RF(k),𝐅RF,𝐔(k))superscriptsubscript~𝐅RF𝑘subscript𝐅RFsuperscript𝐔𝑘\mathcal{L}\Big{(}{\bf{\tilde{F}}}_{\text{RF}}^{(k)},{\bf F}_{\text{RF}},{\bf U% }^{(k)}\Big{)}caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) according to (III-C).

Substituting the results of (E), (E), and (42) in (39) yields

(𝐅~RF(k+1),𝐅RF(k+1),𝐔(k+1))(𝐅~RF(k),𝐅RF(k),𝐔(k))superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘1superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘superscript𝐔𝑘\displaystyle\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k+1)},{\bf F}_{% \text{RF}}^{(k+1)},{\bf U}^{(k+1)}\right)-\mathcal{L}\!\left({\bf{\tilde{F}}}_% {\text{RF}}^{(k)},{\bf F}_{\text{RF}}^{(k)},{\bf U}^{(k)}\right)caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ) - caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT )
\displaystyle\leq (1ρ𝐅BB𝐅BBHF2λmin(𝐅BB𝐅BBH)+ρ2)(i)𝐅~RF(k+1)𝐅~RF(k)F2.subscript1𝜌superscriptsubscriptnormsubscript𝐅BBsuperscriptsubscript𝐅BBHF2subscript𝜆minsubscript𝐅BBsuperscriptsubscript𝐅BBH𝜌2(i)superscriptsubscriptnormsuperscriptsubscript~𝐅RF𝑘1superscriptsubscript~𝐅RF𝑘F2\displaystyle~{}\!\underbrace{\left(\frac{1}{\rho}\left\|{\bf F}_{\text{BB}}{% \bf F}_{\text{BB}}^{\textrm{H}}\right\|_{\text{F}}^{2}\!-\!\frac{\lambda_{% \textrm{min}}({\bf F}_{\text{BB}}{\bf F}_{\text{BB}}^{\textrm{H}})\!+\!\rho}{2% }\right)}_{\text{(i)}}\left\|{\bf{\tilde{F}}}_{\text{RF}}^{(k+1)}\!-\!{\bf{% \tilde{F}}}_{\text{RF}}^{(k)}\right\|_{\text{F}}^{2}.under⏟ start_ARG ( divide start_ARG 1 end_ARG start_ARG italic_ρ end_ARG ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - divide start_ARG italic_λ start_POSTSUBSCRIPT min end_POSTSUBSCRIPT ( bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ) + italic_ρ end_ARG start_ARG 2 end_ARG ) end_ARG start_POSTSUBSCRIPT (i) end_POSTSUBSCRIPT ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .

If ρ2𝐅BB𝐅BBHF𝜌2subscriptnormsubscript𝐅BBsuperscriptsubscript𝐅BBHF\rho\geq\sqrt{2}\|{\bf F}_{\text{BB}}{\bf F}_{\text{BB}}^{\textrm{H}}\|_{\text% {F}}italic_ρ ≥ square-root start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT, the term (i) satisfies: (i)0i0(\text{i})\leq 0( i ) ≤ 0, and thus

(𝐅~RF(k+1),𝐅RF(k+1),𝐔(k+1))(𝐅~RF(k),𝐅RF(k),𝐔(k))0.superscriptsubscript~𝐅RF𝑘1superscriptsubscript𝐅RF𝑘1superscript𝐔𝑘1superscriptsubscript~𝐅RF𝑘superscriptsubscript𝐅RF𝑘superscript𝐔𝑘0\displaystyle\mathcal{L}\!\left({\bf{\tilde{F}}}_{\text{RF}}^{(k+1)},{\bf F}_{% \text{RF}}^{(k+1)},{\bf U}^{(k+1)}\right)-\mathcal{L}\!\left({\bf{\tilde{F}}}_% {\text{RF}}^{(k)},{\bf F}_{\text{RF}}^{(k)},{\bf U}^{(k)}\right)\leq 0.caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k + 1 ) end_POSTSUPERSCRIPT ) - caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT , bold_U start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ) ≤ 0 .

This completes the proof of Lemma 1.

Appendix F Proof of Lemma 2

By using (35), we have

(𝐅~RF,𝐅RF,𝐔)subscript~𝐅RFsubscript𝐅RF𝐔\displaystyle\mathcal{L}\Big{(}{\bf{\tilde{F}}}_{\text{RF}},{\bf F}_{\text{RF}% },{\bf U}\Big{)}caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_U )
=\displaystyle== 12𝐅opt𝐅~RF𝐅BBF2+ρ2𝐅~RF𝐅RF+𝐔F212superscriptsubscriptnormsubscript𝐅optsubscript~𝐅RFsubscript𝐅BBF2𝜌2superscriptsubscriptnormsubscript~𝐅RFsubscript𝐅RF𝐔F2\displaystyle~{}\!\frac{1}{2}\|{\bf F}_{\text{opt}}\!-\!{\bf{\tilde{F}}}_{% \text{RF}}{\bf F}_{\text{BB}}\|_{\text{F}}^{2}\!+\!\frac{\rho}{2}\|{\bf{\tilde% {F}}}_{\text{RF}}\!-\!{\bf F}_{\text{RF}}\!+\!{\bf U}\|_{\text{F}}^{2}divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_ρ end_ARG start_ARG 2 end_ARG ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT + bold_U ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
ρ21ρ(𝐅opt𝐅~RF𝐅BB)𝐅BBHF2𝜌2superscriptsubscriptnorm1𝜌subscript𝐅optsubscript~𝐅RFsubscript𝐅BBsuperscriptsubscript𝐅BBHF2\displaystyle-\frac{\rho}{2}\left\|\frac{1}{\rho}\left({\bf F}_{\text{opt}}-{% \bf{\tilde{F}}}_{\text{RF}}{\bf F}_{\text{BB}}\right){\bf F}_{\text{BB}}^{% \textrm{H}}\right\|_{\text{F}}^{2}- divide start_ARG italic_ρ end_ARG start_ARG 2 end_ARG ∥ divide start_ARG 1 end_ARG start_ARG italic_ρ end_ARG ( bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ) bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT start_POSTSUPERSCRIPT H end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
\displaystyle\geq 12𝐅opt𝐅~RF𝐅BBF2+ρ2𝐅~RF𝐅RF+𝐔F212superscriptsubscriptnormsubscript𝐅optsubscript~𝐅RFsubscript𝐅BBF2𝜌2superscriptsubscriptnormsubscript~𝐅RFsubscript𝐅RF𝐔F2\displaystyle~{}\!\frac{1}{2}\|{\bf F}_{\text{opt}}\!-\!{\bf{\tilde{F}}}_{% \text{RF}}{\bf F}_{\text{BB}}\|_{\text{F}}^{2}\!+\!\frac{\rho}{2}\|{\bf{\tilde% {F}}}_{\text{RF}}\!-\!{\bf F}_{\text{RF}}\!+\!{\bf U}\|_{\text{F}}^{2}divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_ρ end_ARG start_ARG 2 end_ARG ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT + bold_U ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
12ρ𝐅opt𝐅~RF𝐅BBF2𝐅BBF212𝜌superscriptsubscriptnormsubscript𝐅optsubscript~𝐅RFsubscript𝐅BBF2superscriptsubscriptnormsubscript𝐅BBF2\displaystyle-\frac{1}{2\rho}\left\|{\bf F}_{\text{opt}}-{\bf{\tilde{F}}}_{% \text{RF}}{\bf F}_{\text{BB}}\right\|_{\text{F}}^{2}\left\|{\bf F}_{\text{BB}}% \right\|_{\text{F}}^{2}- divide start_ARG 1 end_ARG start_ARG 2 italic_ρ end_ARG ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
=\displaystyle== 12(11ρ𝐅BBF2)𝐅opt𝐅~RF𝐅BBF21211𝜌superscriptsubscriptnormsubscript𝐅BBF2superscriptsubscriptnormsubscript𝐅optsubscript~𝐅RFsubscript𝐅BBF2\displaystyle~{}\!\frac{1}{2}\left(1-\frac{1}{\rho}\left\|{\bf F}_{\text{BB}}% \right\|_{\text{F}}^{2}\right)\|{\bf F}_{\text{opt}}\!-\!{\bf{\tilde{F}}}_{% \text{RF}}{\bf F}_{\text{BB}}\|_{\text{F}}^{2}divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( 1 - divide start_ARG 1 end_ARG start_ARG italic_ρ end_ARG ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ∥ bold_F start_POSTSUBSCRIPT opt end_POSTSUBSCRIPT - over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
+ρ2𝐅~RF𝐅RF+𝐔F2.𝜌2superscriptsubscriptnormsubscript~𝐅RFsubscript𝐅RF𝐔F2\displaystyle~{}\!+\frac{\rho}{2}\|{\bf{\tilde{F}}}_{\text{RF}}-{\bf F}_{\text% {RF}}+{\bf U}\|_{\text{F}}^{2}.+ divide start_ARG italic_ρ end_ARG start_ARG 2 end_ARG ∥ over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT - bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT + bold_U ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .

If ρ𝐅BBF2𝜌superscriptsubscriptnormsubscript𝐅BBF2\rho\geq\|{\bf F}_{\text{BB}}\|_{\text{F}}^{2}italic_ρ ≥ ∥ bold_F start_POSTSUBSCRIPT BB end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT F end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, then (𝐅~RF,𝐅RF,𝐔)0subscript~𝐅RFsubscript𝐅RF𝐔0\mathcal{L}\Big{(}{\bf{\tilde{F}}}_{\text{RF}},{\bf F}_{\text{RF}},{\bf U}\Big% {)}\geq 0caligraphic_L ( over~ start_ARG bold_F end_ARG start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_F start_POSTSUBSCRIPT RF end_POSTSUBSCRIPT , bold_U ) ≥ 0, which completes the proof of Lemma 2.

References

  • [1] R. W. Heath, N. González-Prelcic, S. Rangan, W. Roh, and A. M. Sayeed, “An overview of signal processing techniques for millimeter wave MIMO systems,” IEEE J. Sel. Top. Signal Process., vol. 10, no. 3, pp. 436–453, April 2016.
  • [2] Y. Ge, O. Kaltiokallio, H. Kim, J. Talvitie, S. Kim, L. Svensson, M. Valkama, and H. Wymeersch, “MmWave map** and SLAM for 5G and beyond,” in Integrated Sensing and Communications, F. Liu, C. Masouros, and Y. C. Eldar, Eds.   Singapore: Springer Nature Singapore, 2023, pp. 445–475.
  • [3] S. Weng, F. Jiang, and H. Wymeersch, “Wideband mmWave massive MIMO channel estimation and localization,” IEEE Wireless Commun. Lett., vol. 12, no. 8, pp. 1314–1318, August 2023.
  • [4] M. A. Nazari, G. Seco-Granados, P. Johannisson, and H. Wymeersch, “mmWave 6D radio localization with a snapshot observation from a single BS,” IEEE Trans. Veh. Technol., vol. 72, no. 7, pp. 8914–8928, July 2023.
  • [5] H. Chen, H. Sarieddeen, T. Ballal, H. Wymeersch, M.-S. Alouini, and T. Y. Al-Naffouri, “A tutorial on terahertz-band localization for 6G communication systems,” IEEE Commun. Surv. Tutor., vol. 24, no. 3, pp. 1780–1815, May 2022.
  • [6] Z. Chen, X. Ma, B. Zhang, Y. Zhang, Z. Niu, N. Kuang, W. Chen, L. Li, and S. Li, “A survey on terahertz communications,” China Commun., vol. 16, no. 2, pp. 1–35, February 2019.
  • [7] V. Petrov, T. Kurner, and I. Hosako, “IEEE 802.15.3d: First standardization efforts for sub-terahertz band communications toward 6G,” IEEE Commun. Mag., vol. 58, no. 11, pp. 28–33, November 2020.
  • [8] X. Cai, X. Cheng, and F. Tufvesson, “Toward 6G with terahertz communications: Understanding the propagation channels,” arXiv, February 2023. [Online]. Available: https://arxiv.longhoe.net/abs/2209.07864
  • [9] C. Studer and G. Durisi, “Quantized massive MU-MIMO-OFDM uplink,” IEEE Trans. Commun., vol. 64, no. 6, pp. 2387–2399, June 2016.
  • [10] S. Jacobsson, G. Durisi, M. Coldrey, T. Goldstein, and C. Studer, “Quantized precoding for massive MU-MIMO,” IEEE Trans. Commun., vol. 65, no. 11, pp. 4670–4684, November 2017.
  • [11] E. Björnson, L. Sanguinetti, H. Wymeersch, J. Hoydis, and T. L. Marzetta, “Massive MIMO is a reality—What is next?: Five promising research directions for antenna arrays,” Digit. Signal Process., vol. 94, pp. 3–20, November 2019.
  • [12] M. F. Keskin, C. Marcus, O. Eriksson, A. Alvarado, J. Widmer, and H. Wymeersch, “Integrated sensing and communications with MIMO-OTFS,” 2023. [Online]. Available: https://arxiv.longhoe.net/abs/2306.06361
  • [13] T. Wei, Z. Cheng, and B. Liao, “Transmit beampattern synthesis for MIMO radar with one-bit digital-to-analog converters,” Signal Process., vol. 188, p. 108228, November 2021.
  • [14] Z. Gao, L. Dai, D. Mi, Z. Wang, M. A. Imran, and M. Z. Shakir, “MmWave massive-MIMO-based wireless backhaul for the 5G ultra-dense network,” IEEE Wirel. Commun., vol. 22, no. 5, pp. 13–21, October 2015.
  • [15] A. F. Molisch, V. V. Ratnam, S. Han, Z. Li, S. L. H. Nguyen, L. Li, and K. Haneda, “Hybrid beamforming for massive MIMO: A survey,” IEEE Commun. Mag., vol. 55, no. 9, pp. 134–141, December 2017.
  • [16] R. Rajamäki, S. P. Chepuri, and V. Koivunen, “Hybrid beamforming for active sensing using sparse arrays,” IEEE Trans. Signal Process., vol. 68, pp. 6402–6417, October 2020.
  • [17] H. Li, M. Li, and Q. Liu, “Hybrid beamforming with dynamic subarrays and low-resolution PSs for mmWave MU-MISO systems,” IEEE Trans. Commun., vol. 68, no. 1, pp. 602–614, January 2020.
  • [18] S. Lyu, Z. Wang, Z. Gao, H. He, and L. Hanzo, “Lattice-based mmWave hybrid beamforming,” IEEE Trans. Commun., vol. 69, no. 7, pp. 4907–4920, July 2021.
  • [19] J.-C. Chen, “Hybrid beamforming with discrete phase shifters for millimeter-wave massive MIMO systems,” IEEE Trans. Veh. Technol., vol. 66, no. 8, pp. 7604–7608, August 2017.
  • [20] Z. Wang, M. Li, Q. Liu, and A. L. Swindlehurst, “Hybrid precoder and combiner design with low-resolution phase shifters in mmWave MIMO systems,” IEEE J. Sel. Top. Signal Process., vol. 12, no. 2, pp. 256–269, May 2018.
  • [21] A. N. Uwaechia, N. M. Mahyuddin, M. F. Ain, N. M. Abdul Latiff, and N. F. Za’bah, “On the spectral-efficiency of low-complexity and reso-lution hybrid precoding and combining transceivers for mmWave MIMO systems,” IEEE Access, vol. 7, pp. 109 259–109 277, August 2019.
  • [22] H. Li, M. Li, Q. Liu, and A. L. Swindlehurst, “Dynamic hybrid beamforming with low-resolution PSs for wideband mmWave MIMO-OFDM systems,” IEEE J. Sel. Areas Commun., vol. 38, no. 9, pp. 2168–2181, September 2020.
  • [23] X. Zhang, A. Molisch, and S.-Y. Kung, “Variable-phase-shift-based RF-baseband codesign for MIMO antenna selection,” IEEE Trans. Signal Process., vol. 53, no. 11, pp. 4091–4103, November 2005.
  • [24] O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially sparse precoding in millimeter wave MIMO systems,” IEEE Trans. Wirel. Commun., vol. 13, no. 3, pp. 1499–1513, March 2014.
  • [25] C.-J. Wang, C.-K. Wen, S. **, and S.-H. Tsai, “Finite-alphabet precoding for massive MU-MIMO with low-resolution DACs,” IEEE Trans. Wirel. Commun., vol. 17, no. 7, pp. 4706–4720, July 2018.
  • [26] F. Dong, W. Wang, and Z. Wei, “Low-complexity hybrid precoding for multi-user mmWave systems with low-resolution phase shifters,” IEEE Trans. Veh. Technol., vol. 68, no. 10, pp. 9774–9784, October 2019.
  • [27] R. Rajamäki, S. P. Chepuri, and V. Koivunen, “Hybrid beamforming for active sensing using sparse arrays,” IEEE Trans. Signal Process., vol. 68, pp. 6402–6417, October 2020.
  • [28] E. Zhang and C. Huang, “On achieving optimal rate of digital precoder by RF-baseband codesign for MIMO systems,” in Proc. IEEE Veh. Technol. Conf., Vancouver, Canada, September 2014, pp. 1–5.
  • [29] X. Yu, J.-C. Shen, J. Zhang, and K. B. Letaief, “Alternating minimization algorithms for hybrid precoding in millimeter wave MIMO systems,” IEEE J. Sel. Top. Signal Process., vol. 10, no. 3, pp. 485–500, April 2016.
  • [30] X. Gao, L. Dai, S. Han, C.-L. I, and R. W. Heath, “Energy-efficient hybrid analog and digital precoding for mmWave MIMO systems with large antenna arrays,” IEEE J. Sel. Areas Commun., vol. 34, no. 4, pp. 998–1009, April 2016.
  • [31] F. Sohrabi and W. Yu, “Hybrid beamforming with finite-resolution phase shifters for large-scale MIMO systems,” in Proc. IEEE Workshop Signal Process. Adv. Wirel. Commun., Stockholm, Sweden, June 2015, pp. 136–140.
  • [32] F. Sohrabi and W. Yu, “Hybrid digital and analog beamforming design for large-scale antenna arrays,” IEEE J. Sel. Top. Signal Process., vol. 10, no. 3, pp. 501–513, April 2016.
  • [33] Y.-P. Lin, “On the quantization of phase shifters for hybrid precoding systems,” IEEE Trans. Signal Process., vol. 65, no. 9, pp. 2237–2246, May 2017.
  • [34] Z. Cheng, L. Wu, B. Wang, M. R. B. Shankar, and B. Ottersten, “Double-phase-shifter based hybrid beamforming for mmWave DFRC in the presence of extended target and clutters,” IEEE Trans. Wirel. Commun., vol. 22, no. 6, pp. 3671–3686, June 2023.
  • [35] Z. Cheng, L. Wu, B. Wang, J. Xie, and H. Li, “Relative entropy-based constant-envelope beamforming for target detection in large-scale MIMO radar with low-resoultion ADCs,” IEEE Trans. Veh. Technol., vol. 72, no. 8, pp. 10 090–10 106, August 2023.
  • [36] L. Babai, “On lovász’ lattice reduction and the nearest lattice point problem,” Combinatorica, vol. 6, pp. 1–13, March 1986.
  • [37] N. Boumal, An Introduction to Optimization on Smooth Manifolds.   Cambridge University Press, 2023.
  • [38] P.-A. Absil, R. Mahony, and R. Sepulchre, Optimization Algorithms on Matrix Manifolds.   USA: Princeton University Press, 2008.
  • [39] J. Hu, X. Liu, Z.-W. Wen, and Y.-X. Yuan, “A brief introduction to manifold optimization,” J. Oper. Res. Soc. China, vol. 8, pp. 199–248, April 2020.
  • [40] N. Garcia, H. Wymeersch, and D. T. M. Slock, “Optimal precoders for tracking the AoD and AoA of a mmWave path,” IEEE Trans. Signal Process., vol. 66, no. 21, pp. 5718–5729, November 2018.
  • [41] M. F. Keskin, F. Jiang, F. Munier, G. Seco-Granados, and H. Wymeersch, “Optimal spatial signal design for mmWave positioning under imperfect synchronization,” IEEE Trans. Veh. Technol., vol. 71, no. 5, pp. 5558–5563, May 2022.
  • [42] J. Li, L. Xu, P. Stoica, K. W. Forsythe, and D. W. Bliss, “Range compression and waveform optimization for MIMO radar: A Cramér–Rao bound based study,” IEEE Trans. Signal Process., vol. 56, no. 1, pp. 218–232, January 2008.
  • [43] M. Deng, Z. Cheng, L. Wu, B. Shankar, and Z. He, “One-bit ADCs/DACs based MIMO radar: Performance analysis and joint design,” IEEE Trans. Signal Process., vol. 70, pp. 2609–2624, May 2022.
  • [44] M. Hong, Z.-Q. Luo, and M. Razaviyayn, “Convergence analysis of alternating direction method of multipliers for a family of nonconvex problems,” SIAM J. Optim., vol. 26, no. 1, pp. 337–364, January 2016.
  • [45] S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein, “Distributed optimization and statistical learning via the alternating direction method of multipliers,” Found. Trends Mach. Learn., vol. 3, no. 1, pp. 1–122, January 2011.
  • [46] K. Huang and N. D. Sidiropoulos, “Consensus-ADMM for general quadratically constrained quadratic programming,” IEEE Trans. Signal Process., vol. 64, no. 20, pp. 5297–5310, October 2016.
  • [47] H. Huang, H. C. So, and A. M. Zoubir, “Sparse array beamformer design via ADMM,” IEEE Trans. Signal Process., vol. 71, pp. 3357–3372, September 2023.
  • [48] H. Huang, H. C. So, and A. M. Zoubir, “Convergence analysis of consensus-ADMM for general QCQP,” Signal Process., vol. 208, p. 108991, July 2023.
  • [49] L. Wu, X. Cheng, H. Huang, D. Ciuonzo, B. Shankar, and B. Ottersten, “Constant-modulus waveform design with polarization-adaptive power allocation in polarimetric radar,” IEEE Trans. Signal Process., vol. 71, pp. 2146–2161, June 2023.
  • [50] A. Fascista, A. Coluccia, H. Wymeersch, and G. Seco-Granados, “Millimeter-wave downlink positioning with a single-antenna receiver,” IEEE Trans. Wirel. Commun., vol. 18, no. 9, pp. 4479–4490, September 2019.
  • [51] Q. Hu, Y. Cai, K. Kang, G. Yu, J. Hoydis, and Y. C. Eldar, “Two-timescale end-to-end learning for channel acquisition and hybrid precoding,” IEEE J. Sel. Areas Commun., vol. 40, no. 1, pp. 163–181, January 2022.
  • [52] A. Liu, V. K. N. Lau, and M.-J. Zhao, “Stochastic successive convex optimization for two-timescale hybrid precoding in massive MIMO,” IEEE J. Sel. Top. Signal Process., vol. 12, no. 3, pp. 432–444, June 2018.
  • [53] A. Alkhateeb, O. El Ayach, G. Leus, and R. W. Heath, “Channel estimation and hybrid precoding for millimeter wave cellular systems,” IEEE J. Sel. Top. Signal Process., vol. 8, no. 5, pp. 831–846, October 2014.
  • [54] S. Kay, Fundamentals of Statistical Signal Processing, Volume 3, ser. Fundamentals of statistical signal processing.   Prentice Hall PTR, 2017.
  • [55] H. Anton, Elementary Linear Algebra.   Wiley, 1994.
  • [56] N. Garcia, H. Wymeersch, E. G. Ström, and D. Slock, “Location-aided mm-wave channel estimation for vehicular communication,” in Proc. IEEE Workshop Signal Process. Adv. Wirel. Commun., Edinburgh, UK, July 2016, pp. 1–5.
  • [57] N. Boumal, B. Mishra, P.-A. Absil, and R. Sepulchre, “Manopt, a Matlab toolbox for optimization on manifolds,” J. Mach. Learn. Res., vol. 15, no. 42, pp. 1455–1459, 2014.
  • [58] W. Rudin, “Chapter 3 - Numerical sequences and series,” in Principles of mathematical analysis.   McGraw - Hill Book C., 1986, pp. 47–78.
  • [59] E. K. Ryu and S. P. Boyd, “A primer on monotone operator methods,” Appl. Comput. Math., vol. 15, no. 1, pp. 3–43, 2016.