Co-learning-aided Multi-modal-deep-learning Framework of Passive DOA Estimators for a Heterogeneous Hybrid Massive MIMO Receiver

Jiatong Bai, Feng Shu, Fuhui Zhou, Qinghe Zheng, Bo Xu, Baihua Shi, Yiwen Chen, Weibin Zhang, Xianpeng Wang Corresponding author: Feng ShuJiatong Bai, Xianpeng Wang, Bo Xu and Yiwen Chen is with the School of Information and Communication Engineering, Hainan University, Haikou, 570228, China. (e-mail: [email protected]; [email protected]; [email protected]; [email protected] ).Feng Shu is with the School of Information and Communication Engineering and Collaborative Innovation Center of Information Technology, Hainan University, Haikou 570228, China, and also with the School of Electronic and Optical Engineering, Nan**g University of Science and Technology, Nan**g 210094, China. (e-mail: [email protected]).Fuhui Zhou is with the College of Electronic and Information Engineering Nan**g University of Aeronautics and Astronautics, Nan**g 210094, China. (e-mail: [email protected]).Qinghe Zheng is with the School of Intelligent Engineering, Shandong Management University, **an 250357, China. (e-mail: [email protected]).Weibin Zhang and Baihua Shi is with the Nan**g University of Science and Technology, Jiangsu, Nan**g, 210094, China. (e-mail: [email protected]; [email protected]).
Abstract

Due to its excellent performance in rate and resolution, fully-digital (FD) massive multiple-input multiple-output (MIMO) antenna arrays has been widely applied in data transmission and direction of arrival (DOA) measurements, etc. But it confronts with two main challenges: high computational complexity and circuit cost. The two problems may be addressed well by hybrid analog-digital (HAD) structure. But there exists the problem of phase ambiguity for HAD, which leads to its low-efficiency or high-latency. Does exist there such a MIMO structure of owning low-cost, low-complexity and high time efficiency at the same time. To satisfy the three properties, a novel heterogeneous hybrid MIMO receiver structure of integrating FD and heterogeneous HAD (H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD-FD) is proposed and corresponding multi-modal (MD)-learning framework is developed. The framework includes three major stages: 1) generate the candidate sets via root multiple signal classification (Root-MUSIC) or deep learning (DL); 2) infer the class of true solutions from candidate sets using machine learning (ML) methods; 3) fuse the two-part true solutions to achieve a better DOA estimation. The above process form two methods named MD-Root-MUSIC and MDDL. To improve DOA estimation accuracy and reduce the clustering complexity, a co-learning-aided MD framework is proposed to form two enhanced methods named CoMDDL and CoMD-RootMUSIC. Moreover, the Cramer-Rao lower bound (CRLB) for the proposed H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD-FD structure is also derived. Experimental results demonstrate that our proposed four methods could approach the CRLB for signal-to-noise ratio (SNR) > 0 dB and the proposed CoMDDL and MDDL perform better than CoMD-RootMUSIC and MD-RootMUSIC, particularly in the extremely low SNR region.

Index Terms:
DOA, massive MIMO, heterogeneous hybrid, CRLB, multi-modal learning

I Introduction

Direction of arrival (DOA) estimation is pivotal in various domains, including target localization[1], radio propagation measurement[2], and wireless communications[3]. Its integration with massive multiple input multiple output (MIMO) techniques yields ultra-high-accuracy the desirable signal direction for numerous wireless communication techniques[4, 5, 6] such as beamforming and directional modulation. Additionally, integrated sensing and communication is emerging as a burgeoning research domain, DOA estimation based on massive MIMO plays a core role within ISAC, applicable in realms[7, 8, 9], like 6G, human-machine interaction, and the Internet of Things.

The existing DOA estimation methods can be roughly categorized into the classical spatial spectral-based techniques, parametric-based estimation methods, the sparsity sensing-based techniques and deep learning (DL)-based methods. To the best of our knowledge, the most applied techniques are subspace-based conventional DOA measurement method. The multiple signal classification (MUSIC) estimation method in [10] partitioned the total column space of the covariance matrix into signal and noise subspaces , thus achieving accurate source localization. However, MUSIC algorithms were computationally burdensome due to the fact that spatial spectral peak searches. To address this issue, the rotational invariance technique (ESPRIT) method for signal parameters estimation was introduced [11] to directly obtain DOA estimation from closed-form solutions. The MUSIC and ESPRIT algorithms obtained the subspace by the array covariance matrix’ eigenvalue decomposition, leading to a significant increase in computational complexity for large-scale antenna array. The root multiple signal classification (Root-MUSIC) algorithm [12, 13] transformed the spectral search into a problem of finding high-order polynomials roots, significantly enhancing computational efficiency. Based on above classical algorithms, several improved algorithms[14, 15, 16, 17] had been proposed to achieve superior DOA estimation. Furthermore, motivated by the potent feature extraction and precise prediction capabilities of deep learning, several scholars had investigated the DL-based DOA measurement methods[18, 19, 20]. For instance, DOA estimation performance was improved in low signal-to-noise ratio (SNR) by convolutional neural networks (CNN) in [19].

Massive MIMO-based DOA estimation systems can obtain ultra-high angular resolution and estimation accuracy, but a sharp increase in computational complexity and circuit cost also occurs. The hybrid analog and digital (HAD) structure converted signals from multi antennas into baseband signals through a single analog-to-digital converter (ADC) and radio frequency (RF) chain, thereby efficiently improving the utilization of RF chains and ADCs while reducing circuit costs[21, 22, 23]. However, due to the fact that the HAD structure treats each sub-array as a virtual antenna, the fact that the spacing between virtual antennas is equal to the number subarray antennas multiplied by the half-wavelength leads to phase ambiguity and generates pseudo-solutions. In [24], a low-complexity HAD structure and three maximum received power-based methods were proposed to achieve high-precision and multi-time-slot DOA estimation, where the Root-MUSIC-HDAPA algorithm realized a hybrid Cramér-Rao Lower Bound (CRLB) with lower complexity. However, this method required M+1𝑀1M+1italic_M + 1 (M𝑀Mitalic_M represented the number of subarray antennas) time-slots to eliminate phase ambiguity. To minimize estimation delay, the enhanced fast FHAD-Root-MUSIC algorithm in [25] achieved two time-slots phase ambiguity elimination by partitioning multi subgroups and performing analog beamforming for each subgroup. Additionally, in [26] and [27], rapid DOA estimation algorithms were proposed under four different HAD sub-connection structures to achieve single time-slot phase ambiguity elimination with a large performance loss.

Based on the above analysis, the DOA estimation method using HAD sub-connection structures consists of two steps: firstly, a set of candidate solution sets was obtained by a conventional methods like Root-MUISC. Then the pseudo-solutions in the candidate solution sets were eliminated to find the true DOA estimation value. In summary, the existing DOA estimation methods with phase ambiguity elimination capability generally require at least two time-slots, leading to low time-efficiency issues. Whereas, this paper will explore the MDL framework of DOA measurement in the heterogeneous hybrid massive MIMO system, focusing on eliminating phase ambiguity with high time-efficiency, high-performance, low cost and complexity. The major contributions of our research are outlined as follows:

  1. 1.

    To address the issue of the high latency or low time-efficiency of DOA estimation of the conventional HAD, a novel heterogeneous hybrid MIMO receiver structure, called H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD-FD, of integrating a FD subarray with a H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD structure is developed. The H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD part comprises H𝐻Hitalic_H groups, where different groups have the equal number of subarrays but varying amounts of antennas. Also, within each group, every subarray maintains an identical number of antennas. In the co-learning scenario, the FD subarray will provide a good starting sample point of true solution class and will accelerate the process of the following clustering. Moreover, the corresponding CRLB for proposed H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD-FD structure is also derived.

  2. 2.

    Based on the above structure, a multi-modal-learning framework is proposed to achieve a low-latency DOA estimation. Its primary stages are as follows: 1) all sub-array groups in H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD produce the set of candidate solutions using Root-MUSIC and FD subarry estimate the DOA value using Root-MUSIC or CNN-based methods, where CNN is designed to be a five-layer neural network; 2) infer the class of true solutions from candidate sets using ML methods; 3) fuse the two-part solutions to output an final DOA estimation. The above process forms two methods for DOA measurement without ambiguity as follows: MD-Root-MUSIC and MDDL. The corresponding fusion coefficients are computed by CRLB. Simulation results indicate that our proposed MDDL performs better than MD-Root-MUSIC.

  3. 3.

    To further enhance the DOA estimation accuracy, a co-learning-aided multi-modal (Co-MD) framework is proposed, Compared with the previous MD framework, a relation between FD and H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD modals is established. The FD modal will provide a prior knowledge to the H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD modal. This relation will not only improve performance but also reduce the clustering complexity. Similar to the the previous MD framework, the two new corresponding methods are as follows: CoMDDL and CoMD-RootMUSIC, where MDDL is short for MD deep learning (DL). Simulation results illustrate that our proposed CoMDDL and MDDL outperform CoMD-RootMUSIC and MD-RooMUSIC, respectively.

The remainder of this research is organized as follows. The system model of an developed heterogeneous MIMO structure with FD and H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD is described in Section II. In Section III, a MD-learning (MDL) for DOA estimator based on FDH2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD structure is proposed. In Section IV, the clustering and fusion methods for our proposed frameworks are proposed. Moreover, section V analyzes the performance of the proposal structure and approaches. Section VI presents the experimental results, with conclusions provided in Section VII

Notations: In this paper, uppercase letters and lowercase letters in bold typeface (i.e., 𝐀𝐀\mathbf{A}bold_A, 𝐚𝐚\mathbf{a}bold_a) denote matrices and vectors, respectively. Signs \|\cdot\|∥ ⋅ ∥, |||\cdot|| ⋅ |, ()Hsuperscript𝐻(\cdot)^{H}( ⋅ ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT, ()Tsuperscript𝑇(\cdot)^{T}( ⋅ ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, {}\Re\{\cdot\}roman_ℜ { ⋅ }, {}\Im\{\cdot\}roman_ℑ { ⋅ }, {}\angle\{\cdot\}∠ { ⋅ } and 𝐝𝐢𝐚𝐠()𝐝𝐢𝐚𝐠\mathbf{diag}(\cdot)bold_diag ( ⋅ ) denote norm, modulus, conjugate transpose, transpose, real part operations, imaginary part operations, phase component and diagonal, respectively. Besides, []delimited-[]\mathbb{C}[\cdot]blackboard_C [ ⋅ ] and 𝔼[]𝔼delimited-[]\mathbb{E}[\cdot]blackboard_E [ ⋅ ] denote the complex-valued matrix and the expectation operator, respectively. 𝐓𝐫()𝐓𝐫\mathbf{Tr}(\cdot)bold_Tr ( ⋅ ) is the matrix trace.

II System model

Refer to caption
Figure 1: Proposed heterogeneous hybird array with FD and H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD, where H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD has H𝐻Hitalic_H groups and group hhitalic_h has Khsubscript𝐾K_{h}italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT subarrays with each subarray having Mhsubscript𝑀M_{h}italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT antennas, h{1,2,,H}12𝐻h\in\{1,2,\cdots,H\}italic_h ∈ { 1 , 2 , ⋯ , italic_H }

Figure 1 presents the proposed heterogeneous hybrid structure, which involves a fully-digital (FD) and a H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD array. A far-field narrowband signal x(t)ej2πfct𝑥𝑡superscript𝑒𝑗2𝜋subscript𝑓𝑐𝑡x(t)e^{j2\pi f_{c}t}italic_x ( italic_t ) italic_e start_POSTSUPERSCRIPT italic_j 2 italic_π italic_f start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT italic_t end_POSTSUPERSCRIPT, where x(t)𝑥𝑡x(t)italic_x ( italic_t ) and fcsubscript𝑓𝑐f_{c}italic_f start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT are the baseband signal and the carrier frequency, respectively. Considering the FD with M𝑀Mitalic_M antennas, the output signal is represented as

𝐲FD(t)=𝐚FD(θ0)x(t)+𝐰(t),subscript𝐲𝐹𝐷𝑡subscript𝐚𝐹𝐷subscript𝜃0𝑥𝑡𝐰𝑡\displaystyle\mathbf{y}_{FD}(t)=\mathbf{a}_{FD}(\theta_{0})x(t)+\mathbf{w}(t),bold_y start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ( italic_t ) = bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_x ( italic_t ) + bold_w ( italic_t ) , (1)

where 𝐰(t)𝒞𝒩(0,σw2𝐈m)similar-to𝐰𝑡𝒞𝒩0subscriptsuperscript𝜎2𝑤𝐈𝑚\mathbf{w}(t)\sim\mathcal{C}\mathcal{N}(0,\sigma^{2}_{w}\mathbf{I}m)bold_w ( italic_t ) ∼ caligraphic_C caligraphic_N ( 0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT bold_I italic_m ) is the additive white Gaussian noise (AWGN) vector, 𝐚FDsubscript𝐚𝐹𝐷\mathbf{a}_{FD}bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT denotes the array manifold vector expressed as

𝐚FD(θ0)=[1,ej2πλdsinθ0,,ej2πλ(M1)dsinθ0]T,subscript𝐚𝐹𝐷subscript𝜃0superscript1superscript𝑒𝑗2𝜋𝜆𝑑subscript𝜃0superscript𝑒𝑗2𝜋𝜆𝑀1𝑑subscript𝜃0𝑇\displaystyle\mathbf{a}_{FD}\left(\theta_{0}\right)=\left[1,e^{j\frac{2\pi}{% \lambda}d\sin\theta_{0}},\cdots,e^{j\frac{2\pi}{\lambda}(M-1)d\sin\theta_{0}}% \right]^{T},bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = [ 1 , italic_e start_POSTSUPERSCRIPT italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , ⋯ , italic_e start_POSTSUPERSCRIPT italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_M - 1 ) italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (2)

The H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD array is divided into H𝐻Hitalic_H groups, group hhitalic_h has Khsubscript𝐾K_{h}italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT subarrays with each subarray containing Mhsubscript𝑀M_{h}italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT antennas, i.e.,

M=h=1HNh=h=1HKhMh,𝑀superscriptsubscript1𝐻subscript𝑁superscriptsubscript1𝐻subscript𝐾subscript𝑀\displaystyle M=\sum_{h=1}^{H}N_{h}=\sum_{h=1}^{H}K_{h}M_{h},italic_M = ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , (3)

In this heterogeneous array, M1M2MHsubscript𝑀1subscript𝑀2subscript𝑀𝐻M_{1}\neq M_{2}\neq\cdots\neq M_{H}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≠ italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≠ ⋯ ≠ italic_M start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT, and each group is a homogeneous structure. It is particularly noted that it is preferred to choose the values of M1,M2,,MHsubscript𝑀1subscript𝑀2subscript𝑀𝐻M_{1},M_{2},\cdots,M_{H}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_M start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT to be primes.

Considering ψh,k,msubscript𝜓𝑘𝑚\psi_{h,k,m}italic_ψ start_POSTSUBSCRIPT italic_h , italic_k , italic_m end_POSTSUBSCRIPT represents the phase corresponding to analog beamforming , then the output of the k𝑘kitalic_kth subarray is

yh,k(t)=1Mhm=1Mhx(t)ej2πfc(tτh,k,m)jψh,k,m+wh,k(t)subscript𝑦𝑘𝑡1subscript𝑀superscriptsubscript𝑚1subscript𝑀𝑥𝑡superscript𝑒𝑗2𝜋subscript𝑓𝑐𝑡subscript𝜏𝑘𝑚𝑗subscript𝜓𝑘𝑚subscript𝑤𝑘𝑡\displaystyle y_{h,k}(t)=\frac{1}{\sqrt{M_{h}}}\sum_{m=1}^{M_{h}}x(t)e^{j2\pi f% _{c}(t-\tau_{h,k,m})-j\psi_{h,k,m}}+w_{h,k}(t)italic_y start_POSTSUBSCRIPT italic_h , italic_k end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_x ( italic_t ) italic_e start_POSTSUPERSCRIPT italic_j 2 italic_π italic_f start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ( italic_t - italic_τ start_POSTSUBSCRIPT italic_h , italic_k , italic_m end_POSTSUBSCRIPT ) - italic_j italic_ψ start_POSTSUBSCRIPT italic_h , italic_k , italic_m end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + italic_w start_POSTSUBSCRIPT italic_h , italic_k end_POSTSUBSCRIPT ( italic_t ) (4)

where wh,k(t)𝒞𝒩(0,σw2)similar-tosubscript𝑤𝑘𝑡𝒞𝒩0subscriptsuperscript𝜎2𝑤w_{h,k}(t)\sim\mathcal{C}\mathcal{N}(0,\sigma^{2}_{w})italic_w start_POSTSUBSCRIPT italic_h , italic_k end_POSTSUBSCRIPT ( italic_t ) ∼ caligraphic_C caligraphic_N ( 0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ) is the AWGN vector. τh,k,msubscript𝜏𝑘𝑚\tau_{h,k,m}italic_τ start_POSTSUBSCRIPT italic_h , italic_k , italic_m end_POSTSUBSCRIPT are the propagation delays established by the direction of the signal source with respect to the array expressed as

τh,k,m=τ0(km1)dsinθ0c,subscript𝜏𝑘𝑚subscript𝜏0𝑘𝑚1𝑑subscript𝜃0𝑐\displaystyle\tau_{h,k,m}=\tau_{0}-\frac{(km-1)d\sin\theta_{0}}{c},italic_τ start_POSTSUBSCRIPT italic_h , italic_k , italic_m end_POSTSUBSCRIPT = italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - divide start_ARG ( italic_k italic_m - 1 ) italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_c end_ARG , (5)

where τ0subscript𝜏0\tau_{0}italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT denotes the propagation delay from the radiating source to the reference point of the antenna array, and c𝑐citalic_c is the speed of light.

Stacking the outputs yh,k(t)subscript𝑦𝑘𝑡y_{h,k}(t)italic_y start_POSTSUBSCRIPT italic_h , italic_k end_POSTSUBSCRIPT ( italic_t ) of all Khsubscript𝐾K_{h}italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT subarray and the recieved signal vector of hhitalic_hth group is defined as

𝐲h(t)=𝚼A,hH𝐚h(θ0)x(t)+𝐰(t),subscript𝐲𝑡superscriptsubscript𝚼𝐴𝐻subscript𝐚subscript𝜃0𝑥𝑡𝐰𝑡\displaystyle\mathbf{y}_{h}(t)=\mathbf{\Upsilon}_{A,h}^{H}\mathbf{a}_{h}(% \theta_{0})x(t)+\mathbf{w}(t),bold_y start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_t ) = bold_Υ start_POSTSUBSCRIPT italic_A , italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_x ( italic_t ) + bold_w ( italic_t ) , (6)

where 𝐰(t)=[w1(t),w2(t),,wKh(t)]TKh×1𝐰𝑡superscriptsubscript𝑤1𝑡subscript𝑤2𝑡subscript𝑤subscript𝐾𝑡𝑇superscriptsubscript𝐾1\mathbf{w}(t)=\left[w_{1}(t),w_{2}(t),\ldots,w_{K_{h}}(t)\right]^{T}\in\mathbb% {C}^{K_{h}\times 1}bold_w ( italic_t ) = [ italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) , italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t ) , … , italic_w start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_t ) ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT is the AWGN vector, 𝐚h(θ0)Nh×1subscript𝐚subscript𝜃0superscriptsubscript𝑁1\mathbf{a}_{h}(\theta_{0})\in\mathbb{C}^{N_{h}\times 1}bold_a start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ∈ blackboard_C start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT × 1 end_POSTSUPERSCRIPT is the array manifold vector expressed as

𝐚h(θ0)=[1,ej2πλdsinθ0,,ej2πλ(Nh1)dsinθ0]T,subscript𝐚subscript𝜃0superscript1superscript𝑒𝑗2𝜋𝜆𝑑subscript𝜃0superscript𝑒𝑗2𝜋𝜆subscript𝑁1𝑑subscript𝜃0𝑇\displaystyle\mathbf{a}_{h}\left(\theta_{0}\right)=\left[1,e^{j\frac{2\pi}{% \lambda}d\sin\theta_{0}},\cdots,e^{j\frac{2\pi}{\lambda}(N_{h}-1)d\sin\theta_{% 0}}\right]^{T},bold_a start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = [ 1 , italic_e start_POSTSUPERSCRIPT italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , ⋯ , italic_e start_POSTSUPERSCRIPT italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_N start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - 1 ) italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (7)

and 𝚼A,hsubscript𝚼𝐴\mathbf{\Upsilon}_{A,h}bold_Υ start_POSTSUBSCRIPT italic_A , italic_h end_POSTSUBSCRIPT denotes a block diagonal matrix

𝚼𝐀,𝐡=[γA,h,1000γA,h,2000γA,h,K],subscript𝚼𝐀𝐡delimited-[]subscript𝛾𝐴100missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression0subscript𝛾𝐴20missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression00subscript𝛾𝐴𝐾missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression\displaystyle{\mathbf{\Upsilon_{A,h}}}=\left[{\begin{array}[]{*{20}{c}}{{% \gamma_{A,h,1}}}&0&\cdots&0\\ 0&{{\gamma_{A,h,2}}}&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ 0&0&\cdots&{{\gamma_{A,h,K}}}\end{array}}\right],bold_Υ start_POSTSUBSCRIPT bold_A , bold_h end_POSTSUBSCRIPT = [ start_ARRAY start_ROW start_CELL italic_γ start_POSTSUBSCRIPT italic_A , italic_h , 1 end_POSTSUBSCRIPT end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL italic_γ start_POSTSUBSCRIPT italic_A , italic_h , 2 end_POSTSUBSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL italic_γ start_POSTSUBSCRIPT italic_A , italic_h , italic_K end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ] , (12)

where γA,h,ksubscript𝛾𝐴𝑘\mathbf{\gamma}_{A,h,k}italic_γ start_POSTSUBSCRIPT italic_A , italic_h , italic_k end_POSTSUBSCRIPT is the k𝑘kitalic_k-th block diagonal element defined by

γA,h,k=1Mh[ejψh,k,1,ejψh,k,2,,ejψh,k,Mh]T,subscript𝛾𝐴𝑘1subscript𝑀superscriptsuperscript𝑒𝑗subscript𝜓𝑘1superscript𝑒𝑗subscript𝜓𝑘2superscript𝑒𝑗subscript𝜓𝑘subscript𝑀𝑇\displaystyle\mathbf{\gamma}_{A,h,k}=\frac{1}{\sqrt{M_{h}}}\left[e^{j\psi_{h,k% ,1}},e^{j\psi_{h,k,2}},\cdots,e^{j\psi_{h,k,M_{h}}}\right]^{T},italic_γ start_POSTSUBSCRIPT italic_A , italic_h , italic_k end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG end_ARG [ italic_e start_POSTSUPERSCRIPT italic_j italic_ψ start_POSTSUBSCRIPT italic_h , italic_k , 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , italic_e start_POSTSUPERSCRIPT italic_j italic_ψ start_POSTSUBSCRIPT italic_h , italic_k , 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , ⋯ , italic_e start_POSTSUPERSCRIPT italic_j italic_ψ start_POSTSUBSCRIPT italic_h , italic_k , italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (13)

Passing through analog-to-digital converter (ADC), the (6) becomes

𝐲h(n)=𝚼A,hH𝐚h(θ0)x(n)+𝐰(n),subscript𝐲𝑛superscriptsubscript𝚼𝐴𝐻subscript𝐚subscript𝜃0𝑥𝑛𝐰𝑛\displaystyle\mathbf{y}_{h}(n)=\mathbf{\Upsilon}_{A,h}^{H}\mathbf{a}_{h}(% \theta_{0})x(n)+\mathbf{w}(n),bold_y start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_n ) = bold_Υ start_POSTSUBSCRIPT italic_A , italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_x ( italic_n ) + bold_w ( italic_n ) , (14)

where n=1,2,,L𝑛12𝐿n=1,2,\cdots,Litalic_n = 1 , 2 , ⋯ , italic_L, L𝐿Litalic_L is the number of snapshots.

III Proposed Multi-modal-learning frameworks for DOA estimators based on heterogeneous hybrid structure

In this section, a MDL for DOA estimator based on heterogeneous hybrid structure are proposed to accelerate the elimination of pseudo-solutions, form a class of true solutions, and make a fusion of all true solutions to achieve an enhancement on DOA estimation.

III-A Proposed MDL frameworks

Refer to caption

Figure 2: Multi-modal framework for estimating DOA.

Refer to caption

Figure 3: Co-learning-aided multi-modal framework for estimating DOA.

As shown in Figure 2, the proposed MDL framework consists of three main steps: 1) all sub-array groups in H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD form the set of candidate solutions using Root-MUSIC and FD subarray directly generates the coarse estimated value of true solution using Root-MUSIC or CNN-based method. 2) infer the class of true solutions from candidate sets of H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD using ML methods. 3) fuse the two-part solutions to output an enhanced DOA estimation. The above process forms two methods for DOA estimation named MD-Root-MUSIC and MDDL, shown in Figure 2(a) and (b), respectively. Then, a Co-MDL framework is proposed, shown in Figure 3, to further enhance the DOA estimation accuracy. Compared with the MDL framework, a relation between FD and H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD modals is established by utilizing the FD subarray to provide a better starting sample point of the true solution class. The two new corresponding methods is CoMDDL and CoMD-RootMUSIC, shown in Figure 3(a) and (b), respectively.

III-B Candidate solutions of H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD

Based on the above analysis, each subarray can be treated as a virtual antenna in group hhitalic_hth. Assume that the analog beamforming vector γA,h,k=1Mh[1,,1]Tsubscript𝛾𝐴𝑘1subscript𝑀superscript11𝑇\mathbf{\gamma}_{A,h,k}=\frac{1}{{\sqrt{{M_{h}}}}}{\left[{1,\cdots,1}\right]^{% T}}italic_γ start_POSTSUBSCRIPT italic_A , italic_h , italic_k end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG end_ARG [ 1 , ⋯ , 1 ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, the output vector of Khsubscript𝐾K_{h}italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT subarrays is

𝐲h(n)subscript𝐲𝑛\displaystyle\mathbf{y}_{h}(n)bold_y start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_n ) =[y1(n),y2(n),,yKh(n)]Tabsentsuperscriptsubscript𝑦1𝑛subscript𝑦2𝑛subscript𝑦subscript𝐾𝑛𝑇\displaystyle=[y_{1}(n),y_{2}(n),\cdots,y_{K_{h}}(n)]^{T}= [ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_n ) , italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_n ) , ⋯ , italic_y start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT (15)
=1Mh𝐚Mh(θ0)bh(θ0)x(n)+𝐰(n),absent1subscript𝑀subscript𝐚subscript𝑀subscript𝜃0subscript𝑏subscript𝜃0𝑥𝑛𝐰𝑛\displaystyle=\frac{1}{{\sqrt{{M_{h}}}}}\mathbf{a}_{M_{h}}(\theta_{0})b_{h}(% \theta_{0})x(n)+\mathbf{w}(n),= divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG end_ARG bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_x ( italic_n ) + bold_w ( italic_n ) ,

where 𝐰(n)=[w1(n),,wKh(n)]T𝐰𝑛superscriptsubscript𝑤1𝑛subscript𝑤subscript𝐾𝑛𝑇\mathbf{w}(n)=[w_{1}(n),\cdots,w_{K_{h}}(n)]^{T}bold_w ( italic_n ) = [ italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_n ) , ⋯ , italic_w start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_n ) ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT, and 𝐚Mhsubscript𝐚subscript𝑀\mathbf{a}_{M_{h}}bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT is regarded as the array manifold per sub-array, which can be formed by

𝐚Mh(θ0)=[1,ej2πλMhdsinθ0,,ej2πλ(Kh1)Mhdsinθ0]T,subscript𝐚subscript𝑀subscript𝜃0superscript1superscript𝑒𝑗2𝜋𝜆subscript𝑀𝑑subscript𝜃0superscript𝑒𝑗2𝜋𝜆subscript𝐾1subscript𝑀𝑑subscript𝜃0𝑇\displaystyle\mathbf{a}_{M_{h}}\left(\theta_{0}\right)=\left[1,e^{j\frac{2\pi}% {\lambda}M_{h}d\sin\theta_{0}},\cdots,e^{j\frac{2\pi}{\lambda}(K_{h}-1)M_{h}d% \sin\theta_{0}}\right]^{T},bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = [ 1 , italic_e start_POSTSUPERSCRIPT italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , ⋯ , italic_e start_POSTSUPERSCRIPT italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - 1 ) italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , (16)

and bh(θ0)subscript𝑏subscript𝜃0b_{h}(\theta_{0})italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) is defined as

bh(θ0)=m=1Mhej2πλ(m1)dsinθ0=1ej2πλMhdsinθ01ej2πλdsinθ0subscript𝑏subscript𝜃0superscriptsubscript𝑚1subscript𝑀superscript𝑒𝑗2𝜋𝜆𝑚1𝑑subscript𝜃01superscript𝑒𝑗2𝜋𝜆subscript𝑀𝑑subscript𝜃01superscript𝑒𝑗2𝜋𝜆𝑑subscript𝜃0\displaystyle b_{h}\left(\theta_{0}\right)=\sum_{m=1}^{M_{h}}e^{j\frac{2\pi}{% \lambda}(m-1)d\sin\theta_{0}}=\frac{1-e^{j\frac{2\pi}{\lambda}M_{h}d\sin\theta% _{0}}}{1-e^{j\frac{2\pi}{\lambda}d\sin\theta_{0}}}italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_m - 1 ) italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT = divide start_ARG 1 - italic_e start_POSTSUPERSCRIPT italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_e start_POSTSUPERSCRIPT italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG (17)

The covariance matrix of the output vector 𝐲𝐡(𝐧)subscript𝐲𝐡𝐧\mathbf{y_{h}(n)}bold_y start_POSTSUBSCRIPT bold_h end_POSTSUBSCRIPT ( bold_n ) is

𝐑hsubscript𝐑\displaystyle\mathbf{R}_{h}bold_R start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT =𝔼[𝐲𝐡(n)𝐲𝐡(n)H]absent𝔼delimited-[]subscript𝐲𝐡𝑛subscript𝐲𝐡superscript𝑛𝐻\displaystyle=\mathbb{E}\left[\mathbf{y_{h}}(n)\mathbf{y_{h}}(n)^{H}\right]= blackboard_E [ bold_y start_POSTSUBSCRIPT bold_h end_POSTSUBSCRIPT ( italic_n ) bold_y start_POSTSUBSCRIPT bold_h end_POSTSUBSCRIPT ( italic_n ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ] (18)
=bh(θ0)𝐚Mh(θ0)𝐑ss(bh(θ0)𝐚MhH(θ0)+𝐑ww\displaystyle=b_{h}(\theta_{0})\mathbf{a}_{M_{h}}(\theta_{0})\mathbf{R}_{ss}(b% _{h}(\theta_{0})\mathbf{a}_{M_{h}}^{H}\left(\theta_{0}\right)+\mathbf{R}_{ww}= italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_R start_POSTSUBSCRIPT italic_s italic_s end_POSTSUBSCRIPT ( italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) + bold_R start_POSTSUBSCRIPT italic_w italic_w end_POSTSUBSCRIPT
=1Mhσs2bh(θ0)2𝐚Mh(θ0)𝐚MhH(θ0)+σw2𝐈,absent1subscript𝑀superscriptsubscript𝜎𝑠2superscriptnormsubscript𝑏subscript𝜃02subscript𝐚subscript𝑀subscript𝜃0superscriptsubscript𝐚subscript𝑀𝐻subscript𝜃0superscriptsubscript𝜎𝑤2𝐈\displaystyle=\frac{1}{M_{h}}\sigma_{s}^{2}\left\|b_{h}\left(\theta_{0}\right)% \right\|^{2}\mathbf{a}_{M_{h}}\left(\theta_{0}\right)\mathbf{a}_{M_{h}}^{H}% \left(\theta_{0}\right)+\sigma_{w}^{2}\mathbf{I},= divide start_ARG 1 end_ARG start_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG italic_σ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) + italic_σ start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I ,

where σs2superscriptsubscript𝜎𝑠2\sigma_{s}^{2}italic_σ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is the SNR of the receive signal and σs2=𝐑ss=𝔼[|s(n)|2]superscriptsubscript𝜎𝑠2subscript𝐑𝑠𝑠𝔼delimited-[]superscript𝑠𝑛2\sigma_{s}^{2}=\mathbf{R}_{ss}=\mathbb{E}\left[{\left|{s\left(n\right)}\right|% ^{2}}\right]italic_σ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = bold_R start_POSTSUBSCRIPT italic_s italic_s end_POSTSUBSCRIPT = blackboard_E [ | italic_s ( italic_n ) | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ]. Besides, the eigenvalue decomposition (EVD) of 𝐑hsubscript𝐑\mathbf{R}_{h}bold_R start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT is performed as

𝐑h=[𝐔S𝐔N]Σ[𝐔S𝐔N]H,subscript𝐑delimited-[]subscript𝐔𝑆subscript𝐔𝑁Σsuperscriptdelimited-[]subscript𝐔𝑆subscript𝐔𝑁𝐻\displaystyle\mathbf{R}_{h}=[\mathbf{U}_{S}\mathbf{U}_{N}]\Sigma[\mathbf{U}_{S% }\mathbf{U}_{N}]^{H},bold_R start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = [ bold_U start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT bold_U start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] roman_Σ [ bold_U start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT bold_U start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT , (19)

where 𝐔Nsubscript𝐔𝑁\mathbf{U}_{N}bold_U start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT is Kh×(Kh1)subscript𝐾subscript𝐾1{K_{h}}\times({K_{h}}-1)italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT × ( italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - 1 ) noise subspace, 𝐔Ssubscript𝐔𝑆\mathbf{U}_{S}bold_U start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT is Kh×1subscript𝐾1{K_{h}}\times 1italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT × 1 signal subspace. Σ=diag(σs2+σw2,σw2,,σw2)Σ𝑑𝑖𝑎𝑔superscriptsubscript𝜎𝑠2superscriptsubscript𝜎𝑤2superscriptsubscript𝜎𝑤2superscriptsubscript𝜎𝑤2\Sigma=diag(\sigma_{s}^{2}+\sigma_{w}^{2},\sigma_{w}^{2},\cdots,\sigma_{w}^{2})roman_Σ = italic_d italic_i italic_a italic_g ( italic_σ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_σ start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_σ start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ⋯ , italic_σ start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) is the Kh×Khsubscript𝐾subscript𝐾K_{h}\times K_{h}italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT × italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT diagonal matrix.

Then, to compute so-called “MUSIC spectrum” of the virtual antenna array

PH2AD(θ)=1bh(θ)2𝐚MhH(θ)𝐔N𝐔NH𝐚Mh,subscript𝑃superscriptH2AD𝜃1superscriptnormsubscript𝑏𝜃2normsuperscriptsubscript𝐚subscript𝑀𝐻𝜃subscript𝐔𝑁superscriptsubscript𝐔𝑁𝐻subscript𝐚subscript𝑀\displaystyle P_{\rm{H}^{2}AD}(\theta)=\frac{1}{\left\|b_{h}(\theta)\right\|^{% 2}\left\|\mathbf{a}_{M_{h}}^{H}(\theta)\mathbf{U}_{N}\mathbf{U}_{N}^{H}\mathbf% {a}_{M_{h}}\right\|},italic_P start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD end_POSTSUBSCRIPT ( italic_θ ) = divide start_ARG 1 end_ARG start_ARG ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_θ ) bold_U start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT bold_U start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ end_ARG , (20)

The direction corresponding to the maximum value of PH2AD(θ)subscript𝑃superscriptH2AD𝜃P_{\rm{H}^{2}AD}(\theta)italic_P start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD end_POSTSUBSCRIPT ( italic_θ ) is the final DOA estimation. Furthermore, this paper applies the Root-MUSIC [28] to obtain the optimal emitter direction owing to its remarkable asymptotic solution and low complexity.

The polynomial equation can be defined as

f(θ)𝑓𝜃\displaystyle f(\theta)italic_f ( italic_θ ) =bhH(θ)𝐚MhH(θ)𝐔N𝐔NH𝐚Mh(θ)bh(θ)f(z)absentsuperscriptsubscript𝑏𝐻𝜃superscriptsubscript𝐚subscript𝑀𝐻𝜃subscript𝐔𝑁superscriptsubscript𝐔𝑁𝐻subscript𝐚subscript𝑀𝜃subscript𝑏𝜃𝑓𝑧\displaystyle=b_{h}^{H}(\theta)\mathbf{a}_{M_{h}}^{H}(\theta)\mathbf{U}_{N}% \mathbf{U}_{N}^{H}\mathbf{a}_{M_{h}}(\theta)b_{h}(\theta)\triangleq f(z)= italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_θ ) bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_θ ) bold_U start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT bold_U start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_θ ) italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ ) ≜ italic_f ( italic_z ) (21)
=2z1z2z1Mhz1Mhi=1Khj=1Khz(i1)𝐐ijzj1absent2superscript𝑧1𝑧2superscript𝑧1subscript𝑀superscript𝑧1subscript𝑀superscriptsubscript𝑖1subscript𝐾superscriptsubscript𝑗1subscript𝐾superscript𝑧𝑖1subscript𝐐𝑖𝑗superscript𝑧𝑗1\displaystyle=\frac{2-z^{-1}-z}{2-z^{-\frac{1}{M_{h}}}-z^{\frac{1}{M_{h}}}}% \sum_{i=1}^{K_{h}}\sum_{j=1}^{K_{h}}z^{-(i-1)}\mathbf{Q}_{ij}z^{j-1}= divide start_ARG 2 - italic_z start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT - italic_z end_ARG start_ARG 2 - italic_z start_POSTSUPERSCRIPT - divide start_ARG 1 end_ARG start_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT - italic_z start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_z start_POSTSUPERSCRIPT - ( italic_i - 1 ) end_POSTSUPERSCRIPT bold_Q start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT
f(φ)=0,absent𝑓𝜑0\displaystyle\triangleq f(\varphi)=0,≜ italic_f ( italic_φ ) = 0 ,

where 𝐐=𝐔N𝐔NH𝐐subscript𝐔𝑁superscriptsubscript𝐔𝑁𝐻\mathbf{Q}=\mathbf{U}_{N}\mathbf{U}_{N}^{H}bold_Q = bold_U start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT bold_U start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT and 𝐐ijsubscript𝐐𝑖𝑗\mathbf{Q}_{ij}bold_Q start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT represents the element located in the i𝑖iitalic_ith row and j𝑗jitalic_jth column of matrix 𝐐𝐐\mathbf{Q}bold_Q.

z=ejφh,𝑧superscript𝑒𝑗subscript𝜑\displaystyle z=e^{j\varphi_{h}},italic_z = italic_e start_POSTSUPERSCRIPT italic_j italic_φ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , (22)

and

φh=2πλMhdsinθ0,subscript𝜑2𝜋𝜆subscript𝑀𝑑subscript𝜃0\displaystyle\varphi_{h}=\frac{2\pi}{\lambda}M_{h}d\sin\theta_{0},italic_φ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , (23)

The above equation (21) has 2Kh22subscript𝐾22K_{h}-22 italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - 2 roots, that is, ZRM={zi,i{1,2,,2Kh2}}subscript𝑍𝑅𝑀subscript𝑧𝑖𝑖122subscript𝐾2{Z}_{RM}=\left\{{z}_{i},i\in\{1,2,\cdots,2K_{h}-2\}\right\}italic_Z start_POSTSUBSCRIPT italic_R italic_M end_POSTSUBSCRIPT = { italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i ∈ { 1 , 2 , ⋯ , 2 italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - 2 } }. Then, using equation (22) to obtain the set of DOA estimates

Θ^RM={θ^i,i{1,2,,2Kh2}},subscript^Θ𝑅𝑀subscript^𝜃𝑖𝑖122subscript𝐾2\displaystyle\hat{\Theta}_{RM}=\left\{\hat{\theta}_{i},i\in\{1,2,\cdots,2K_{h}% -2\}\right\},over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_R italic_M end_POSTSUBSCRIPT = { over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i ∈ { 1 , 2 , ⋯ , 2 italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - 2 } } , (24)

where

θ^i=arcsin(λargzi2πMhd)subscript^𝜃𝑖𝜆subscript𝑧𝑖2𝜋subscript𝑀𝑑\displaystyle\hat{\theta}_{i}=\arcsin\left(\frac{\lambda\arg z_{i}}{2\pi M_{h}% d}\right)over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = roman_arcsin ( divide start_ARG italic_λ roman_arg italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_π italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_d end_ARG ) (25)

Then, the DOA estimation θ^hsubscript^𝜃\hat{\theta}_{h}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT of hhitalic_hth array group can select the nearest root to the unit circle

φ^h=2πλMhdsinθh^subscript^𝜑2𝜋𝜆subscript𝑀𝑑𝑠𝑖𝑛^subscript𝜃\displaystyle\hat{\varphi}_{h}=\frac{2\pi}{\lambda}M_{h}dsin\hat{\theta_{h}}over^ start_ARG italic_φ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_d italic_s italic_i italic_n over^ start_ARG italic_θ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG (26)

Observing the function f(φ)𝑓𝜑f(\varphi)italic_f ( italic_φ ) has a period of 2π2𝜋2\pi2 italic_π about φ𝜑\varphiitalic_φ, thus the presence of phase ambiguity that requires elimination,

f(φ^h)=f(φ^h+2πj)𝑓subscript^𝜑𝑓subscript^𝜑2𝜋𝑗\displaystyle f(\hat{\varphi}_{h})=f(\hat{\varphi}_{h}+2\pi j)italic_f ( over^ start_ARG italic_φ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ) = italic_f ( over^ start_ARG italic_φ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT + 2 italic_π italic_j ) (27)

which the hhitalic_h-th group yields feasible solutions including Mqsubscript𝑀𝑞M_{q}italic_M start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT values as follows

Θ^h={θ^h,jh,jh{1,2,,Mh}}subscript^Θsubscript^𝜃subscript𝑗subscript𝑗12subscript𝑀\displaystyle\hat{\Theta}_{h}=\left\{\hat{\theta}_{h,j_{h}},j_{h}\in\{1,2,% \cdots,M_{h}\}\right\}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = { over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_h , italic_j start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_j start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ∈ { 1 , 2 , ⋯ , italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT } } (28)

where

θ^h,jh=arcsin(λ(arg(ejφ^h)+2πj)2πMhd).subscript^𝜃subscript𝑗𝜆superscript𝑒𝑗subscript^𝜑2𝜋𝑗2𝜋subscript𝑀𝑑\displaystyle\hat{\theta}_{h,j_{h}}=\arcsin\left(\frac{\lambda\left(\arg(e^{j% \hat{\varphi}_{h}})+2\pi j\right)}{2\pi M_{h}d}\right).over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_h , italic_j start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT = roman_arcsin ( divide start_ARG italic_λ ( roman_arg ( italic_e start_POSTSUPERSCRIPT italic_j over^ start_ARG italic_φ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) + 2 italic_π italic_j ) end_ARG start_ARG 2 italic_π italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_d end_ARG ) . (29)

Combing all Q𝑄Qitalic_Q groups gives

{2πλM1dsinθ1,j1=φ^1+2πj2πλM2dsinθ2,j2=φ^2+2πj2πλMHdsinθH,jH=φ^H+2πjcases2𝜋𝜆subscript𝑀1𝑑𝑠𝑖𝑛subscript𝜃1subscript𝑗1subscript^𝜑12𝜋𝑗missing-subexpression2𝜋𝜆subscript𝑀2𝑑𝑠𝑖𝑛subscript𝜃2subscript𝑗2subscript^𝜑22𝜋𝑗missing-subexpressionmissing-subexpression2𝜋𝜆subscript𝑀𝐻𝑑𝑠𝑖𝑛subscript𝜃𝐻subscript𝑗𝐻subscript^𝜑𝐻2𝜋𝑗missing-subexpression\displaystyle\begin{split}\left\{\begin{array}[]{ll}\frac{2\pi}{\lambda}M_{1}% dsin\theta_{1,j_{1}}=\hat{\varphi}_{1}+2\pi j\\ \frac{2\pi}{\lambda}M_{2}dsin\theta_{2,j_{2}}=\hat{\varphi}_{2}+2\pi j\\ \quad\quad\quad\vdots\quad\quad\quad\quad\quad\quad\quad\vdots\\ \frac{2\pi}{\lambda}M_{H}dsin\theta_{H,j_{H}}=\hat{\varphi}_{H}+2\pi j\end{% array}\right.\end{split}start_ROW start_CELL { start_ARRAY start_ROW start_CELL divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_d italic_s italic_i italic_n italic_θ start_POSTSUBSCRIPT 1 , italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = over^ start_ARG italic_φ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 2 italic_π italic_j end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_d italic_s italic_i italic_n italic_θ start_POSTSUBSCRIPT 2 , italic_j start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = over^ start_ARG italic_φ end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + 2 italic_π italic_j end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ⋮ ⋮ end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_M start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT italic_d italic_s italic_i italic_n italic_θ start_POSTSUBSCRIPT italic_H , italic_j start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT end_POSTSUBSCRIPT = over^ start_ARG italic_φ end_ARG start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT + 2 italic_π italic_j end_CELL start_CELL end_CELL end_ROW end_ARRAY end_CELL end_ROW (30)

where jh{1,2,,Mh}subscript𝑗12subscript𝑀j_{h}\in\{1,2,\cdots,M_{h}\}italic_j start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ∈ { 1 , 2 , ⋯ , italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT }. Furthermore, forming the candidate solution set expressed as

Θ^={Θ^1,Θ^2,Θ^H}^Θsubscript^Θ1subscript^Θ2subscript^Θ𝐻\displaystyle\hat{\Theta}=\left\{\hat{\Theta}_{1},\hat{\Theta}_{2}\cdots,\hat{% \Theta}_{H}\right\}over^ start_ARG roman_Θ end_ARG = { over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⋯ , over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT } (31)

Based on the above discussion, the total candidate set Θ^^Θ\hat{\Theta}over^ start_ARG roman_Θ end_ARG includes h=1HMhsuperscriptsubscript1𝐻subscript𝑀\sum\limits_{h=1}^{H}{{M_{{}_{h}}}}∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT start_FLOATSUBSCRIPT italic_h end_FLOATSUBSCRIPT end_POSTSUBSCRIPT solutions, where each candidate set Θ^hsubscript^Θ{\hat{\Theta}_{h}}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT has a true angle and Mh1subscript𝑀1M_{{}_{h}}-1italic_M start_POSTSUBSCRIPT start_FLOATSUBSCRIPT italic_h end_FLOATSUBSCRIPT end_POSTSUBSCRIPT - 1 pseudo-solutions, thus finding the true angle is a challenging task. For the hhitalic_hth group, the true angle and pseudo solutions can be represented as follows

θ^t,h=θ0+εhsubscript^𝜃𝑡subscript𝜃0subscript𝜀\displaystyle\hat{\theta}_{t,h}=\theta_{0}+\varepsilon_{h}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_t , italic_h end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT (32)
θ^l,h,m=θ0+εh+Γh,m,m=1,2,,Mh1,formulae-sequencesubscript^𝜃𝑙𝑚subscript𝜃0subscript𝜀subscriptΓ𝑚𝑚12subscript𝑀1\displaystyle\hat{\theta}_{l,h,m}=\theta_{0}+\varepsilon_{h}+\Gamma_{h,m},~{}m% =1,2,\cdots,M_{h}-1,over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_l , italic_h , italic_m end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT + roman_Γ start_POSTSUBSCRIPT italic_h , italic_m end_POSTSUBSCRIPT , italic_m = 1 , 2 , ⋯ , italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - 1 , (33)

where εhsubscript𝜀\varepsilon_{h}italic_ε start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT is the estimated error and Γh,msubscriptΓ𝑚\Gamma_{h,m}roman_Γ start_POSTSUBSCRIPT italic_h , italic_m end_POSTSUBSCRIPT is a constant decided by Mhsubscript𝑀M_{h}italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT and m𝑚mitalic_m. When there is no noise, εh0subscript𝜀0\varepsilon_{h}\rightarrow 0italic_ε start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT → 0, the intersection of all candidate angle sets, there is only one value, θ0subscript𝜃0\theta_{0}italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. The true and false solutions can be formed by

{θ^t,1θ^t,2θ^t,Qθ0θ^l,1,mθ^l,2,mθ^l,Q,mθ0casessubscript^𝜃𝑡1subscript^𝜃𝑡2subscript^𝜃𝑡𝑄subscript𝜃0subscript^𝜃𝑙1𝑚subscript^𝜃𝑙2𝑚subscript^𝜃𝑙𝑄𝑚subscript𝜃0\displaystyle\left\{\begin{array}[]{l}{\hat{\theta}_{t,1}}\approx{\hat{\theta}% _{t,2}}\approx\cdots\approx{\hat{\theta}_{t,Q}}\approx{\theta_{0}}\\ {\hat{\theta}_{l,1,m}}\neq{\hat{\theta}_{l,2,m}}\neq\cdots\neq{\hat{\theta}_{l% ,Q,m}}\neq{\theta_{0}}\end{array}\right.{ start_ARRAY start_ROW start_CELL over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_t , 1 end_POSTSUBSCRIPT ≈ over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_t , 2 end_POSTSUBSCRIPT ≈ ⋯ ≈ over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_t , italic_Q end_POSTSUBSCRIPT ≈ italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_l , 1 , italic_m end_POSTSUBSCRIPT ≠ over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_l , 2 , italic_m end_POSTSUBSCRIPT ≠ ⋯ ≠ over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_l , italic_Q , italic_m end_POSTSUBSCRIPT ≠ italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY (36)

Via equations (33) and (36), pseudo solutions are unequal to each other even noise εh0subscript𝜀0\varepsilon_{h}\rightarrow 0italic_ε start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT → 0, that is θ^l,h,m=θ0+Γh,msubscript^𝜃𝑙𝑚subscript𝜃0subscriptΓ𝑚\hat{\theta}_{l,h,m}=\theta_{0}+\Gamma_{h,m}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_l , italic_h , italic_m end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + roman_Γ start_POSTSUBSCRIPT italic_h , italic_m end_POSTSUBSCRIPT, due to the fact that KHsubscript𝐾𝐻K_{H}italic_K start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT sun-array groups contains different number of antennas. Also, the distance between true angles tends to zero. This means the distances among true solutions is far smaller than the distances among pseudo solutions. Hence, in what follows, the minimum distance will be used as a metric of clustering.

Therefore, the set of true solutions can be defined as

Θ^t={θ^t,1,θ^t,2,,θ^t,H}subscript^Θ𝑡subscript^𝜃𝑡1subscript^𝜃𝑡2subscript^𝜃𝑡𝐻\hat{\Theta}_{t}=\left\{\hat{\theta}_{t,1},\hat{\theta}_{t,2},\cdots,\hat{% \theta}_{t,H}\right\}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = { over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_t , 1 end_POSTSUBSCRIPT , over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_t , 2 end_POSTSUBSCRIPT , ⋯ , over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_t , italic_H end_POSTSUBSCRIPT } (37)

where θ^t,hsubscript^𝜃𝑡\hat{\theta}_{t,h}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_t , italic_h end_POSTSUBSCRIPT is the inferred true angle of the hhitalic_h array group.

III-C RootMUSIC DOA estimation of FD subarray

The FD subarray estimates the initial coarse DOA value via state-of-the-art methods like Root-MUSIC and deep learning. The covariance matrix and EVD of equation (1) are expressed as

𝐑FDsubscript𝐑𝐹𝐷\displaystyle\mathbf{R}_{FD}bold_R start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT =σs2𝐚FD(θ0)𝐚FDH(θ0)+𝐈absentsuperscriptsubscript𝜎𝑠2subscript𝐚𝐹𝐷subscript𝜃0superscriptsubscript𝐚𝐹𝐷𝐻subscript𝜃0𝐈\displaystyle=\sigma_{s}^{2}\mathbf{a}_{FD}(\theta_{0})\mathbf{a}_{FD}^{H}(% \theta_{0})+\mathbf{I}= italic_σ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) + bold_I (38)
=[𝐄S𝐄N]Σ[𝐄S𝐄N]H,absentdelimited-[]subscript𝐄𝑆subscript𝐄𝑁Σsuperscriptdelimited-[]subscript𝐄𝑆subscript𝐄𝑁𝐻\displaystyle=[\mathbf{E}_{S}\mathbf{E}_{N}]\Sigma[\mathbf{E}_{S}\mathbf{E}_{N% }]^{H},= [ bold_E start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT bold_E start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] roman_Σ [ bold_E start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT bold_E start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ,

where σs2superscriptsubscript𝜎𝑠2\sigma_{s}^{2}italic_σ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT denotes the SNR of the receive signal and Σ=diag{[σs2,1,,1]}Σ𝑑𝑖𝑎𝑔superscriptsubscript𝜎𝑠211\Sigma=diag\left\{{\left[{\sigma_{s}^{2},1,\cdots,1}\right]}\right\}roman_Σ = italic_d italic_i italic_a italic_g { [ italic_σ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , 1 , ⋯ , 1 ] }.

Then the corresponding spatial spectrum function is obtained as follows

PFD(θ)=1𝐚FDH(θ)𝐄N2,subscript𝑃𝐹𝐷𝜃1superscriptnormsuperscriptsubscript𝐚𝐹𝐷𝐻𝜃subscript𝐄𝑁2\displaystyle P_{FD}(\theta)=\frac{1}{\left\|\mathbf{a}_{FD}^{H}(\theta)% \mathbf{E}_{N}\right\|^{2}},italic_P start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ( italic_θ ) = divide start_ARG 1 end_ARG start_ARG ∥ bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_θ ) bold_E start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , (39)

Constructing the 2(M1)2𝑀12(M-1)2 ( italic_M - 1 ) polynomial equation through Root-MUSIC, and thus the nearest root to the unit circle is DOA estimation θ^FDsubscript^𝜃𝐹𝐷\hat{\theta}_{FD}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT, that is, initial mean of the class of true solution.

θ^FD=arcsin(λ2πdargzFD)subscript^𝜃𝐹𝐷𝜆2𝜋𝑑subscript𝑧𝐹𝐷\displaystyle\hat{\theta}_{FD}=\arcsin\left(\frac{\lambda}{2\pi d}\arg z_{FD}\right)over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT = roman_arcsin ( divide start_ARG italic_λ end_ARG start_ARG 2 italic_π italic_d end_ARG roman_arg italic_z start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ) (40)

III-D Proposed CNN-based DOA measurement for FD subarray

This section proposes a CNN model to form initial coarse DOA value for FD subarray, providing a better starting sample point of true solution class compared to RootMUSIC. Specifically, the feature is extracted by the convolution layer and then the DOA estimation is inferred by the fully connected layer (FC).

III-D1 Datasets and its Label

Considering that the DL model mainly involves real-valued computations, the final preprocessing procedure of the input data is to convert them to real description while retaining the imaginary part. Thus, the CNN model’ input data 𝐈M×M×3𝐈superscript𝑀𝑀3\mathbf{I}\in{\mathbb{R}^{M\times M\times 3}}bold_I ∈ blackboard_R start_POSTSUPERSCRIPT italic_M × italic_M × 3 end_POSTSUPERSCRIPT is a real-valued matrix, expressed as

𝐈={𝐈:,:,1={𝐑FD}𝐈:,:,2={𝐑FD}𝐈:,:,3={𝐑FD}𝐈casessubscript𝐈::1subscript𝐑𝐹𝐷subscript𝐈::2subscript𝐑𝐹𝐷subscript𝐈::3subscript𝐑𝐹𝐷\mathbf{I}=\left\{\begin{array}[]{l}{\mathbf{I}_{:,:,1}}=\Re\{{\mathbf{R}_{FD}% }\}\\ {\mathbf{I}_{:,:,2}}=\Im\{{\mathbf{R}_{FD}}\}\\ {\mathbf{I}_{:,:,3}}=\angle\{{\mathbf{R}_{FD}}\}\end{array}\right.bold_I = { start_ARRAY start_ROW start_CELL bold_I start_POSTSUBSCRIPT : , : , 1 end_POSTSUBSCRIPT = roman_ℜ { bold_R start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT } end_CELL end_ROW start_ROW start_CELL bold_I start_POSTSUBSCRIPT : , : , 2 end_POSTSUBSCRIPT = roman_ℑ { bold_R start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT } end_CELL end_ROW start_ROW start_CELL bold_I start_POSTSUBSCRIPT : , : , 3 end_POSTSUBSCRIPT = ∠ { bold_R start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT } end_CELL end_ROW end_ARRAY (41)

Assuming S𝑆Sitalic_S denotes the number of training sets, input samples of CNN can be defined as 𝐈={𝐈1,𝐈2,,𝐈S}superscript𝐈subscript𝐈1subscript𝐈2subscript𝐈𝑆\mathbf{I^{\prime}}=\{\mathbf{I}_{1},\mathbf{I}_{2},\cdots,\mathbf{I}_{S}\}bold_I start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = { bold_I start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_I start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , bold_I start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT }. The angle of the training data is chosen from the p𝑝pitalic_p grid points of [θ,θ]𝜃𝜃[-\theta,\theta][ - italic_θ , italic_θ ], where θ[1,90]𝜃superscript1superscript90\theta\in[1^{\circ},90^{\circ}]italic_θ ∈ [ 1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 90 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT ] and p𝑝pitalic_p is interval cell. Thus, the desired grid is defined as Δ={θ,,p,0,p,,θ}Δ𝜃𝑝superscript0𝑝𝜃\Delta=\{-\theta,\cdots,-p,0^{\circ},p,\cdots,\theta\}roman_Δ = { - italic_θ , ⋯ , - italic_p , 0 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , italic_p , ⋯ , italic_θ } and the binary vectors corresponding to the training angles are taken as labels. For example, Δ={45,,1,0,1,,45}Δsuperscript45superscript1superscript0superscript1superscript45\Delta=\{-45^{\circ},\cdots,-1^{\circ},0^{\circ},1^{\circ},\cdots,45^{\circ}\}roman_Δ = { - 45 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , ⋯ , - 1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 0 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , ⋯ , 45 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT } having |Δ|=91Δ91\left|{\Delta}\right|=91| roman_Δ | = 91 grid points when θ=45𝜃superscript45\theta=45^{\circ}italic_θ = 45 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT and p=1𝑝superscript1p=1^{\circ}italic_p = 1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT. Besides, the angle 45superscript45-45^{\circ}- 45 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT is converted to a binary vector ν1=[1,0,,0]T91×1subscript𝜈1superscript100𝑇superscript911\nu_{1}=[1,0,\cdots,0]^{T}\in\mathbb{R}^{91\times 1}italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = [ 1 , 0 , ⋯ , 0 ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT 91 × 1 end_POSTSUPERSCRIPT. Based on the above analysis, the training dataset of CNN is

𝐈={𝐈1,𝐈2,,𝐈S}superscript𝐈subscript𝐈1subscript𝐈2subscript𝐈𝑆\mathbf{I^{\prime}}=\{\mathbf{I}_{1},\mathbf{I}_{2},\cdots,\mathbf{I}_{S}\}bold_I start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = { bold_I start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_I start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , bold_I start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT } (42)

And the label set corresponding to the training dataset is

𝐕={ν1,ν2,,νS}𝐕subscript𝜈1subscript𝜈2subscript𝜈𝑆\mathbf{V}=\{\nu_{1},\nu_{2},\cdots,\nu_{S}\}bold_V = { italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ν start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_ν start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT } (43)

Therefore, the training sets is denoted by

𝐗={𝐈,𝐕}={(𝐈1,ν1),(𝐈2,ν2),,(𝐈S,νS)},𝐗superscript𝐈𝐕subscript𝐈1subscript𝜈1subscript𝐈2subscript𝜈2subscript𝐈𝑆subscript𝜈𝑆{\mathbf{X}}=\{\mathbf{I^{\prime}},\mathbf{V}\}=\{(\mathbf{I}_{1},\nu_{1}),(% \mathbf{I}_{2},\nu_{2}),\cdots,(\mathbf{I}_{S},\nu_{S})\},bold_X = { bold_I start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , bold_V } = { ( bold_I start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , ( bold_I start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_ν start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) , ⋯ , ( bold_I start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT , italic_ν start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) } , (44)

where (𝐈s,νs)subscript𝐈𝑠subscript𝜈𝑠(\mathbf{I}_{s},\nu_{s})( bold_I start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , italic_ν start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) is the s𝑠sitalic_s-th training sample.

Moreover, the training phase utilizes the covariance matrix, while the testing process adopts the sample covariance matrix defined by

𝐑¯FD=1Ll=1L𝐲FD(t)𝐲FDH(t)subscript¯𝐑𝐹𝐷1𝐿superscriptsubscript𝑙1𝐿subscript𝐲𝐹𝐷𝑡superscriptsubscript𝐲𝐹𝐷𝐻𝑡\overline{\mathbf{R}}_{FD}=\frac{1}{L}\sum\limits_{l=1}^{L}{\mathbf{y}_{FD}(t)% }\mathbf{y}_{{}_{FD}}^{H}(t)over¯ start_ARG bold_R end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_L end_ARG ∑ start_POSTSUBSCRIPT italic_l = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT bold_y start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ( italic_t ) bold_y start_POSTSUBSCRIPT start_FLOATSUBSCRIPT italic_F italic_D end_FLOATSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_t ) (45)

III-D2 CNN Structure

In addition to the impact of training dataset, the performance of CNN largely depends on the network structure. The objective of CNN-based DOA estimation is f:M×M×3θ^FD:𝑓superscript𝑀𝑀3subscript^𝜃𝐹𝐷f:{\mathbb{R}^{M\times M\times 3}}\to\hat{\theta}_{FD}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_M × italic_M × 3 end_POSTSUPERSCRIPT → over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT. We introduce a CNN network including 2D convolutional layers and FC layers, illustrated in Figure.4, to predict DOA estimation for FD Array. To this end, initial mean of the class of true solution is generated to accelerate the inference of true solution sets for H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD Array. The map** of input to output by proposed CNN can be defined as

ν=f(𝐈)=fF4(fF1(fC5(fC1(𝐈))))𝜈𝑓𝐈subscript𝑓subscript𝐹4subscript𝑓subscript𝐹1subscript𝑓subscript𝐶5subscript𝑓subscript𝐶1𝐈\nu=f\left(\mathbf{I}\right)={f_{{F_{4}}}}\left({\cdots{f_{{F_{1}}}}\left({{f_% {{C_{5}}}}\left({\cdots{f_{{C_{1}}}}\left(\mathbf{I}\right)}\right)}\right)}\right)italic_ν = italic_f ( bold_I ) = italic_f start_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ⋯ italic_f start_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_f start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ⋯ italic_f start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_I ) ) ) ) (46)

where fCi(i=1,,5f_{{C_{i}}}(i=1,\cdots,5italic_f start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_i = 1 , ⋯ , 5) is i𝑖iitalic_i-th convolutional layer, including a 2D convolutional layer with 256256256256 filters, a batch normalization (BN) layer and a ReLU layer. And ReLU can be denoted as

ReLU(x)=max(0,x)ReLU𝑥0𝑥{\rm{ReLU}}\left(x\right)=\max\left({0,x}\right)roman_ReLU ( italic_x ) = roman_max ( 0 , italic_x ) (47)

Then, expanding the output of fC5subscript𝑓subscript𝐶5f_{{C_{5}}}italic_f start_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_POSTSUBSCRIPT to a vector via a flatten layer. fFj(j=1,,4f_{{F_{j}}}(j=1,\cdots,4italic_f start_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_j = 1 , ⋯ , 4) is the j𝑗jitalic_j-th FC layer, involving a dense layer, a ReLU layer and a dropout layer. The dropout layer can inhibit over-fitting and enhance the regularization properties during the training phase of the CNN.

III-D3 Loss Function

The training datasets and the set of label in Eqs. (42) and (43) are used to train the CNN model. Furthermore, we add the L2subscript𝐿2L_{2}italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT regularization term to mitigate the over-fitting and improve the model generalization. Therefore, the loss function is defined as

L(θ^FD)=12ν¯(θ^FD)22+μW2,𝐿subscript^𝜃𝐹𝐷12superscriptsubscriptnorm¯𝜈subscript^𝜃𝐹𝐷22𝜇superscriptnorm𝑊2L\left({{{\hat{\theta}}_{FD}}}\right)=\frac{1}{2}\left\|{\overline{\nu}\left({% {{\hat{\theta}}_{FD}}}\right)}\right\|_{{}^{2}}^{2}+\mu{{\left\|W\right\|}^{2}},italic_L ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ over¯ start_ARG italic_ν end_ARG ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ) ∥ start_POSTSUBSCRIPT start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_μ ∥ italic_W ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (48)
ν¯(θ^FD)=ν(θ^FD)𝜈(θ^FD),¯𝜈subscript^𝜃𝐹𝐷𝜈subscript^𝜃𝐹𝐷superscript𝜈subscript^𝜃𝐹𝐷\overline{\nu}\left({{{\hat{\theta}}_{FD}}}\right)=\nu\left({{{\hat{\theta}}_{% FD}}}\right)-\mathop{\nu}\limits^{\wedge}\left({{{\hat{\theta}}_{FD}}}\right),over¯ start_ARG italic_ν end_ARG ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ) = italic_ν ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ) - italic_ν start_POSTSUPERSCRIPT ∧ end_POSTSUPERSCRIPT ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ) , (49)

where 𝜈(θ^FD)superscript𝜈subscript^𝜃𝐹𝐷\mathop{\nu}\limits^{\wedge}\left({{{\hat{\theta}}_{FD}}}\right)italic_ν start_POSTSUPERSCRIPT ∧ end_POSTSUPERSCRIPT ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ) and ν(θ^FD)𝜈subscript^𝜃𝐹𝐷\nu\left({{{\hat{\theta}}_{FD}}}\right)italic_ν ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ) represent the true label and the predicted value, respectively. μ𝜇\muitalic_μ denotes the weight attenuation coefficient, set to 0.00050.00050.00050.0005. W𝑊Witalic_W is the weighting parameter of the FC layer. The loss function L𝐿Litalic_L is minimized to train the proposed model.

Refer to caption
Figure 4: The poposed CNN–based DOA estimation for FD Array.

III-D4 Training Details

The Adam optimizer is used to train the proposed CNN network with a learning rate of 0.00010.00010.00010.0001, which can avoid overfitting the optimal solution. Training batch_size is set to 32323232 and epochs is 30303030 in the entire operation of training.The L2subscript𝐿2L_{2}italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT regularization coefficient is 0.00050.00050.00050.0005. Dropout layers with rate 0.30.30.30.3 in the FC layers. Signal im**ing directions are Δ={90,,1,0,1,,90}Δsuperscript90superscript1superscript0superscript1superscript90\Delta=\{-90^{\circ},\cdots,-1^{\circ},0^{\circ},1^{\circ},\cdots,90^{\circ}\}roman_Δ = { - 90 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , ⋯ , - 1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 0 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , ⋯ , 90 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT } and Δ={45,,1,0,1,,45}Δsuperscript45superscript1superscript0superscript1superscript45\Delta=\{-45^{\circ},\cdots,-1^{\circ},0^{\circ},1^{\circ},\cdots,45^{\circ}\}roman_Δ = { - 45 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , ⋯ , - 1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 0 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 1 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , ⋯ , 45 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT } in training and testing processes, respectively. Figure.5 shows the training and validation loss of the CNN model without over-fitting occurring.

Refer to caption
Figure 5: The training and validation loss of the CNN.

IV Proposed clustering and fusion methods for our proposed frameworks

In this section, we will focus on how to infer the class of true solutions from the candidate sets and fuse two-part solutions to form an enhanced DOA estimation. To make a high-accuracy classification, several high-performance methods: CoMDDL, CoMD-RootMUSIC, MDDL and MD-RooMUSIC are depicted as follows.

IV-A Improved Hierarchical Clustering-aided MDDL and MD-RooMUSIC

Refer to caption

Figure 6: The implementation of the hierarchical clustering for candidate angle set

Due to the fact that true angles of all group arrays in H2ADsuperscriptH2AD\rm{H}^{2}ADroman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD will be close to each other, the extraction of true angles could be transferred into a cluster problem.

The hierarchical clustering is able to calculate the distance between different clusters and generate a hierarchy of clusters. There are two strategies in hierarchical clustering: agglomerative and divisive. Agglomerative, so called "bottom-up" approach, is more suitable in this application. This method starts by treating all H𝐻Hitalic_H points as H𝐻Hitalic_H clusters. Then, calculate all distances between two different clusters and merge the two closest clusters into a new cluster. After that, repeat the above step until only one cluster is left.

In our problem, all H𝐻Hitalic_H point is 1-D points and we are trying to find five nearest points. Thus, Euclidean distance is selected and the mean value of a cluster is chosen for computing distance between clusters. As shown in Figure 6, we draw a dendrogram for the proposed heterogeneous HAD structure with M1=2subscript𝑀12M_{1}=2italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 2, M2=3subscript𝑀23M_{2}=3italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 3 and M3=5subscript𝑀35M_{3}=5italic_M start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 5. The 3333 true solutions are cluster 6, 7 and 8, respectively. It is obvious that the true solutions were merged into one cluster with a very low distance. Thus, the method could be terminated when the first cluster having H𝐻Hitalic_H is born. Then, selected points could also be combined by (52) (60) and (61). The whole method can be called as Improved-HS and shown in Algorithm 1.

Algorithm 1 Improved-HS-aided MD.
  Input:𝐲(n),n=1,2,,L.formulae-sequence𝐲𝑛𝑛12𝐿~{}\mathbf{y}(n),~{}n=1,2,\cdots,L.bold_y ( italic_n ) , italic_n = 1 , 2 , ⋯ , italic_L .
        Initialization:  divide 𝐲(n)𝐲𝑛\mathbf{y}(n)bold_y ( italic_n ) into 𝐲FD(n)subscript𝐲𝐹𝐷𝑛\mathbf{y}_{FD}(n)bold_y start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ( italic_n ) and 𝐲h(n)subscript𝐲𝑛\mathbf{y}_{h}(n)bold_y start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_n ), h=1,2,,H12𝐻h=1,2,\cdots,Hitalic_h = 1 , 2 , ⋯ , italic_H. Calculate the CRLBh𝐶𝑅𝐿subscript𝐵CRLB_{h}italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT and CRLBFD𝐶𝑅𝐿subscript𝐵𝐹𝐷CRLB_{FD}italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT.
        for h=1,2,,H12𝐻h=1,2,\cdots,Hitalic_h = 1 , 2 , ⋯ , italic_H do,
            perform the root-MUSIC method for 𝐲h(n)subscript𝐲𝑛\mathbf{y}_{h}(n)bold_y start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_n ), and obtain the solution set, Θ^hsubscript^Θ\hat{\Theta}_{h}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT.
        end for
        treat H𝐻Hitalic_H solutions as H𝐻Hitalic_H clusters
        repeat
        Calculate the θ^FDsubscript^𝜃𝐹𝐷\hat{\theta}_{FD}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT by the root-MUSIC method for 𝐲FD(n)subscript𝐲𝐹𝐷𝑛\mathbf{y}_{FD}(n)bold_y start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ( italic_n ).
            Compute the distance between different clusters and combine the nearest two clusters as a new cluster
        Until there is one cluster having H𝐻Hitalic_H solutions
        The cluster having H𝐻Hitalic_H solutions is regarded as Θ^tsubscript^Θ𝑡\hat{\Theta}_{t}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT
        Integrate the Θ^tsubscript^Θ𝑡\hat{\Theta}_{t}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT,(60) and (61) into (52) to get θ^^𝜃\hat{\theta}over^ start_ARG italic_θ end_ARG.
  Output: θ^^𝜃\hat{\theta}over^ start_ARG italic_θ end_ARG

IV-B Proposed CoMDDL and CoMD-RootMUSIC methods

Based on the proposed CoMD framework in Figure 3, to eliminate pseudo-solutions for each group by transferring the estimation value θ^FDsubscript^𝜃𝐹𝐷\hat{\theta}_{FD}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT of the FD sunarray into the feasible solution sets Θ^hsubscript^Θ\hat{\Theta}_{h}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT from the hhitalic_h-th group of H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD part, which accelerates the true optimal angle of the hhitalic_h-th group.

θ^H2AD,h=argminθ^h,jhΘ^hθ^FDθ^h,jh2subscript^𝜃superscriptH2ADhsubscript^𝜃subscript𝑗subscript^Θsuperscriptnormsubscript^𝜃𝐹𝐷subscript^𝜃subscript𝑗2\displaystyle\hat{\theta}_{\rm{H}^{2}AD,h}=\underset{\hat{\theta}_{h,j_{h}}\in% \hat{\Theta}_{h}}{\arg\min}\left\|\hat{\theta}_{FD}-\hat{\theta}_{h,j_{h}}% \right\|^{2}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD , roman_h end_POSTSUBSCRIPT = start_UNDERACCENT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_h , italic_j start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_UNDERACCENT start_ARG roman_arg roman_min end_ARG ∥ over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT - over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_h , italic_j start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (50)

Based on (32)-(37), the set of true solutions is

Θ^t={θ^H2AD,1,θ^H2AD,2,,θ^H2AD,H}superscriptsubscript^Θ𝑡subscript^𝜃superscriptH2AD1subscript^𝜃superscriptH2AD2subscript^𝜃superscriptH2ADH\hat{\Theta}_{t}^{\prime}=\left\{\hat{\theta}_{\rm{H}^{2}AD,1},\hat{\theta}_{% \rm{H}^{2}AD,2},\cdots,\hat{\theta}_{\rm{H}^{2}AD,H}\right\}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = { over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD , 1 end_POSTSUBSCRIPT , over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD , 2 end_POSTSUBSCRIPT , ⋯ , over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD , roman_H end_POSTSUBSCRIPT } (51)

By fusing the two-part true solution to obtain the optimal DOA estimate

θ^=wFDθ^FD+h=1Hwhθ^H2AD,h^𝜃subscript𝑤𝐹𝐷subscript^𝜃𝐹𝐷superscriptsubscript1𝐻subscript𝑤subscript^𝜃superscriptH2ADh\displaystyle\hat{\theta}=w_{FD}\hat{\theta}_{FD}+\sum_{h=1}^{H}w_{h}\hat{% \theta}_{\rm{H}^{2}AD,h}over^ start_ARG italic_θ end_ARG = italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD , roman_h end_POSTSUBSCRIPT (52)

The mean square error (MSE) of the θ^^𝜃\hat{\theta}over^ start_ARG italic_θ end_ARG is

𝐌𝐒𝐄(θ^)=𝔼[(θ^θ0)2]𝐌𝐒𝐄^𝜃𝔼delimited-[]superscript^𝜃subscript𝜃02\displaystyle\mathbf{MSE}(\hat{\theta})=\mathbb{E}\left[\left(\hat{\theta}-% \theta_{0}\right)^{2}\right]bold_MSE ( over^ start_ARG italic_θ end_ARG ) = blackboard_E [ ( over^ start_ARG italic_θ end_ARG - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] (53)
=h=1Hwh2𝔼[(θ^H2AD,hθ0)2]+wFD2𝔼[(θ^FDθ0)2]absentsuperscriptsubscript1𝐻superscriptsubscript𝑤2𝔼delimited-[]superscriptsubscript^𝜃superscriptH2ADhsubscript𝜃02superscriptsubscript𝑤𝐹𝐷2𝔼delimited-[]superscriptsubscript^𝜃𝐹𝐷subscript𝜃02\displaystyle=\sum_{h=1}^{H}w_{h}^{2}\mathbb{E}\left[\left(\hat{\theta}_{\rm{H% }^{2}AD,h}-\theta_{0}\right)^{2}\right]+w_{FD}^{2}\mathbb{E}\left[\left(\hat{% \theta}_{FD}-\theta_{0}\right)^{2}\right]= ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT blackboard_E [ ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD , roman_h end_POSTSUBSCRIPT - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] + italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT blackboard_E [ ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ]
=h=1Hwh2𝐌𝐒𝐄(θ^H2AD,h)+wFD2𝐌𝐒𝐄(θ^FD)absentsuperscriptsubscript1𝐻superscriptsubscript𝑤2𝐌𝐒𝐄subscript^𝜃superscriptH2ADsuperscriptsubscript𝑤𝐹𝐷2𝐌𝐒𝐄subscript^𝜃𝐹𝐷\displaystyle=\sum_{h=1}^{H}w_{h}^{2}\mathbf{MSE}(\hat{\theta}_{\rm{H}^{2}AD},% h)+w_{FD}^{2}\mathbf{MSE}(\hat{\theta}_{FD})= ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_MSE ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD end_POSTSUBSCRIPT , italic_h ) + italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_MSE ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT )
h=1Hwh2CRLBh+wFD2CRLBFD,absentsuperscriptsubscript1𝐻superscriptsubscript𝑤2𝐶𝑅𝐿subscript𝐵superscriptsubscript𝑤𝐹𝐷2𝐶𝑅𝐿subscript𝐵𝐹𝐷\displaystyle\geq\sum_{h=1}^{H}w_{h}^{2}CRLB_{h}+w_{FD}^{2}CRLB_{FD},≥ ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT + italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ,

Thus

minwh,wFDh=1Hwh2CRLBh+wFD2CRLBFDsubscriptsubscript𝑤subscript𝑤𝐹𝐷superscriptsubscript1𝐻superscriptsubscript𝑤2𝐶𝑅𝐿subscript𝐵superscriptsubscript𝑤𝐹𝐷2𝐶𝑅𝐿subscript𝐵𝐹𝐷\displaystyle\min_{w_{h},w_{FD}}~{}~{}~{}\sum_{h=1}^{H}w_{h}^{2}CRLB_{h}+w_{FD% }^{2}CRLB_{FD}roman_min start_POSTSUBSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT + italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT
s.t.wFD+h=1Hwh=1,\displaystyle s.t.~{}~{}~{}~{}~{}~{}~{}~{}~{}w_{FD}+\sum_{h=1}^{H}w_{h}=1,italic_s . italic_t . italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = 1 , (54)

According to [29] and [24], the CRLBFD𝐶𝑅𝐿subscript𝐵𝐹𝐷CRLB_{FD}italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT and is given by

CRLBFD=λ28Lπ2𝐒𝐍𝐑cos2θ0d22𝐶𝑅𝐿subscript𝐵𝐹𝐷superscript𝜆28𝐿superscript𝜋2𝐒𝐍𝐑superscript2subscript𝜃0superscriptsuperscript𝑑2\displaystyle CRLB_{FD}=\frac{{{\lambda^{2}}}}{{8L{\pi^{2}}\mathbf{SNR}{{\cos}% ^{2}}{\theta_{0}}\mathop{{d^{2}}}\limits^{-}}}italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT = divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 8 italic_L italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_SNR roman_cos start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_BIGOP italic_d start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_BIGOP start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT end_ARG (55)

Referring to [24], the Fisher information matrix of the hhitalic_hth H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD group array can be expressed by (56), where

𝐅h=8π2𝐒𝐍𝐑2cos2θ0λ2MhΞ[112bh(θ0)4Mh2Kh2(Kh21)d2+MhKhΞ(bh(θ0)μ2+Kh{bh2(θ0)μ})],subscript𝐅8superscript𝜋2superscript𝐒𝐍𝐑2superscript2subscript𝜃0superscript𝜆2subscript𝑀Ξdelimited-[]112superscriptnormsubscript𝑏subscript𝜃04superscriptsubscript𝑀2superscriptsubscript𝐾2superscriptsubscript𝐾21superscript𝑑2subscript𝑀subscript𝐾Ξsuperscriptnormsubscript𝑏subscript𝜃0𝜇2subscript𝐾superscriptsubscript𝑏2subscript𝜃0𝜇\displaystyle\mathbf{F}_{h}=\frac{{8{\pi^{2}}\mathbf{SNR}^{2}}{{\cos}^{2}}{% \theta_{0}}}{{{\lambda^{2}}{M_{h}}\Xi}}\left[{\frac{1}{{12}}{{\left\|{{b_{h}}(% {\theta_{0}})}\right\|}^{4}}M_{h}^{2}K_{h}^{2}\left({K_{h}^{2}-1}\right){d^{2}% }+\frac{{{M_{h}}{K_{h}}}}{\Xi}\left({{{\left\|{{b_{h}}({\theta_{0}})\mu}\right% \|}^{2}}+{K_{h}}\Re\left\{{{b_{h}}^{2}({\theta_{0}})\mu}\right\}}\right)}% \right],bold_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = divide start_ARG 8 italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_SNR start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_cos start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT roman_Ξ end_ARG [ divide start_ARG 1 end_ARG start_ARG 12 end_ARG ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ∥ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 1 ) italic_d start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG start_ARG roman_Ξ end_ARG ( ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_μ ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT roman_ℜ { italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_μ } ) ] , (56)
μ=m=1Mh(m1)dej2πλ(m1)dsinθ0𝜇superscriptsubscript𝑚1subscript𝑀𝑚1𝑑superscript𝑒𝑗2𝜋𝜆𝑚1𝑑subscript𝜃0\displaystyle\mu=\sum_{m=1}^{M_{h}}\left(m-1\right)de^{-j\frac{2\pi}{\lambda}(% m-1)d\sin{\theta_{0}}}italic_μ = ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_m - 1 ) italic_d italic_e start_POSTSUPERSCRIPT - italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG ( italic_m - 1 ) italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT (57)
Ξ=Mh+Kh𝐒𝐍𝐑bh(θ0)2Ξsubscript𝑀subscript𝐾𝐒𝐍𝐑superscriptnormsubscript𝑏subscript𝜃02\displaystyle\Xi={M_{h}}+{K_{h}}\mathbf{SNR}{\left\|{{b_{h}}({\theta_{0}})}% \right\|^{2}}roman_Ξ = italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT bold_SNR ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (58)

The closed-form expression of the CRLBh𝐶𝑅𝐿subscript𝐵CRLB_{h}italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT is expressed by (59).

CRLBh𝐶𝑅𝐿subscript𝐵\displaystyle CRLB_{h}italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT =1N𝐅h1==λ2MhΞ8Lπ2𝐒𝐍𝐑cos2θ0[bh(θ0)4Mh2Kh2(Kh21)d212Ξ+MhKhΞ(bh(θ0)μ2+Kh{bh2(θ0)μ})]\displaystyle=\frac{1}{N}\mathbf{F}_{h}^{-1}==\frac{\lambda^{2}{M_{h}}\Xi}{{8L% {\pi^{2}}\mathbf{SNR}{{\cos}^{2}}{\theta_{0}}}\left[{\frac{{{\left\|{{b_{h}}({% \theta_{0}})}\right\|}^{4}}M_{h}^{2}K_{h}^{2}\left({K_{h}^{2}-1}\right){d^{2}}% }{{12\Xi}}+\frac{{M_{h}{K_{h}}}}{\Xi}\left({{{\left\|{{b_{h}}({\theta_{0}})\mu% }\right\|}^{2}}+{K_{h}}\Re\left\{{{b_{h}}^{2}({\theta_{0}})\mu}\right\}}\right% )}\right]}= divide start_ARG 1 end_ARG start_ARG italic_N end_ARG bold_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = = divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT roman_Ξ end_ARG start_ARG 8 italic_L italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_SNR roman_cos start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT [ divide start_ARG ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ∥ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 1 ) italic_d start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 12 roman_Ξ end_ARG + divide start_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG start_ARG roman_Ξ end_ARG ( ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_μ ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT roman_ℜ { italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_μ } ) ] end_ARG (59)

Theorem 1: The closed-form expression of wFDsubscript𝑤𝐹𝐷w_{FD}italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT and whsubscript𝑤w_{h}italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT is respectively expressed by

wFD=CRLBFD1CRLBFD1+h=1HCRLBh1subscript𝑤𝐹𝐷𝐶𝑅𝐿superscriptsubscript𝐵𝐹𝐷1𝐶𝑅𝐿superscriptsubscript𝐵𝐹𝐷1superscriptsubscript1𝐻𝐶𝑅𝐿superscriptsubscript𝐵1\displaystyle w_{FD}=\frac{CRLB_{FD}^{-1}}{CRLB_{FD}^{-1}+\sum_{h=1}^{H}CRLB_{% h}^{-1}}italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT = divide start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG (60)
wh=CRLBh1CRLBFD1+h=1HCRLBh1,subscript𝑤𝐶𝑅𝐿superscriptsubscript𝐵1𝐶𝑅𝐿superscriptsubscript𝐵𝐹𝐷1superscriptsubscript1𝐻𝐶𝑅𝐿superscriptsubscript𝐵1\displaystyle w_{h}=\frac{CRLB_{h}^{-1}}{CRLB_{FD}^{-1}+\sum_{h=1}^{H}CRLB_{h}% ^{-1}},italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = divide start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG , (61)

Proof: See Appendix A.

Based on the above analysis, the proposed CoMD framework contains three steps: 1) form the candidate sets for H2ADsuperscriptH2AD\rm{H}^{2}ADroman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD and initial coarse DOA value for FD subarray, respectively. 2) infer the true solution for each group by transferring the coarse DOA estimation θ^FDsubscript^𝜃𝐹𝐷\hat{\theta}_{FD}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT to H2ADsuperscriptH2AD\rm{H}^{2}ADroman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD. 3) fuse two-part true solution. The overall algorithm is described in Algorithm 2.

Algorithm 2 Proposed CoMD.
  
  Input:𝐲(n),n=1,2,,L.formulae-sequence𝐲𝑛𝑛12𝐿~{}\mathbf{y}(n),~{}n=1,2,\cdots,L.bold_y ( italic_n ) , italic_n = 1 , 2 , ⋯ , italic_L .
        Initialization:  divide 𝐲(n)𝐲𝑛\mathbf{y}(n)bold_y ( italic_n ) into 𝐲FD(n)subscript𝐲𝐹𝐷𝑛\mathbf{y}_{FD}(n)bold_y start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ( italic_n ) and 𝐲h(n)subscript𝐲𝑛\mathbf{y}_{h}(n)bold_y start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_n ), h=1,2,,H12𝐻h=1,2,\cdots,Hitalic_h = 1 , 2 , ⋯ , italic_H. Calculate the CRLBh𝐶𝑅𝐿subscript𝐵CRLB_{h}italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT and CRLBFD𝐶𝑅𝐿subscript𝐵𝐹𝐷CRLB_{FD}italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT.
        for h=1,2,,H12𝐻h=1,2,\cdots,Hitalic_h = 1 , 2 , ⋯ , italic_H do,
            perform the root-MUSIC method for 𝐲h(n)subscript𝐲𝑛\mathbf{y}_{h}(n)bold_y start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_n ), and obtain the solution set, Θ^hsubscript^Θ\hat{\Theta}_{h}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT.
        end for
        treat H𝐻Hitalic_H solutions as H𝐻Hitalic_H clusters
        repeat
        Calculate the θ^FDsubscript^𝜃𝐹𝐷\hat{\theta}_{FD}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT by the root-MUSIC or CNN-based method for 𝐲FD(n)subscript𝐲𝐹𝐷𝑛\mathbf{y}_{FD}(n)bold_y start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT ( italic_n ).
        Solve the problem (IV-B) to obtain the set Θ^tsuperscriptsubscript^Θ𝑡\hat{\Theta}_{t}^{\prime}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT.
        Integrate the Θ^tsuperscriptsubscript^Θ𝑡\hat{\Theta}_{t}^{\prime}over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT,(60) and (61) into (52) to get θ^^𝜃\hat{\theta}over^ start_ARG italic_θ end_ARG.
  Output: θ^^𝜃\hat{\theta}over^ start_ARG italic_θ end_ARG

V Theoretical Analysis

This section provides the analysis of the theoretical characterization for the H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD structure and the complexity of the proposed approaches.

V-A CRLB

The CRLB provides a lower bound of the variance for an unbiased DOA method. Therefore, the deriving process of CRLB for the proposed FDH2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD structure is described in theorem 1.

Theorem 2: The CRLB for the proposed FDH2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD structure is expressed by

σθ021L𝐅𝐲1superscriptsubscript𝜎subscript𝜃021𝐿superscriptsubscript𝐅𝐲1\displaystyle\sigma_{\theta_{0}}^{2}\geq\frac{1}{L}\mathbf{F}_{\mathbf{y}}^{-1}italic_σ start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ divide start_ARG 1 end_ARG start_ARG italic_L end_ARG bold_F start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT (62)

where

𝐅𝐲=𝐅FD+𝐅H2AD=𝐅FD+h=1H𝐅hsubscript𝐅𝐲subscript𝐅𝐹𝐷subscript𝐅superscriptH2𝐴𝐷subscript𝐅𝐹𝐷superscriptsubscript1𝐻subscript𝐅\displaystyle\mathbf{F}_{\mathbf{y}}=\mathbf{F}_{FD}+\mathbf{F}_{\mathbf{\rm{H% }^{2}}AD}=\mathbf{F}_{FD}+\sum_{h=1}^{H}\mathbf{F}_{h}bold_F start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT = bold_F start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT + bold_F start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT = bold_F start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT (63)

Proof: See Appendix B.

V-B MSE

This subsection presents the MSE of the proposed multi-modal-learning-based framework. Assuming that the DOA estimation of FD and H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD array can realize the CRLB. Also, we could obtain all correct solutions. According to (LABEL:MAEA)-(APPENDIX A: Proof of Theorem 1), the MSE of the proposed framework is given by

𝐌𝐒𝐄=wFD2CRLBFD+h=1Hwh2CRLBh𝐌𝐒𝐄superscriptsubscript𝑤𝐹𝐷2𝐶𝑅𝐿subscript𝐵𝐹𝐷superscriptsubscript1𝐻superscriptsubscript𝑤2𝐶𝑅𝐿subscript𝐵\displaystyle\mathbf{MSE}=w_{FD}^{2}CRLB_{FD}+\sum_{h=1}^{H}w_{h}^{2}CRLB_{h}bold_MSE = italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT (64)
=(CRLBFD1CRLBFD1+h=1HCRLBh1)2CRLBFD+absentlimit-fromsuperscript𝐶𝑅𝐿superscriptsubscript𝐵𝐹𝐷1𝐶𝑅𝐿superscriptsubscript𝐵𝐹𝐷1superscriptsubscript1𝐻𝐶𝑅𝐿superscriptsubscript𝐵12𝐶𝑅𝐿subscript𝐵𝐹𝐷\displaystyle=\left(\frac{CRLB_{FD}^{-1}}{CRLB_{FD}^{-1}+\sum_{h=1}^{H}CRLB_{h% }^{-1}}\right)^{2}CRLB_{FD}+= ( divide start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT +
h=1H(CRLBh1CRLBFD1+h=1HCRLBh1)2CRLBhsuperscriptsubscript1𝐻superscript𝐶𝑅𝐿superscriptsubscript𝐵1limit-from𝐶𝑅𝐿superscriptsubscript𝐵𝐹𝐷1superscriptsubscript1𝐻𝐶𝑅𝐿superscriptsubscript𝐵12𝐶𝑅𝐿subscript𝐵\displaystyle\quad\sum_{h=1}^{H}\left(\frac{CRLB_{h}^{-1}}{CRLB_{FD}^{-1}+% \quad\quad\sum_{h=1}^{H}CRLB_{h}^{-1}}\right)^{2}CRLB_{h}∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( divide start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT
=1CRLBFD1+h=1HCRLBh1absent1𝐶𝑅𝐿superscriptsubscript𝐵𝐹𝐷1superscriptsubscript1𝐻𝐶𝑅𝐿superscriptsubscript𝐵1\displaystyle=\frac{1}{CRLB_{FD}^{-1}+\sum_{h=1}^{H}CRLB_{h}^{-1}}= divide start_ARG 1 end_ARG start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG

where

CRLBFD=1L𝐅FD𝐶𝑅𝐿subscript𝐵𝐹𝐷1𝐿subscript𝐅𝐹𝐷CRLB_{FD}=\frac{1}{L\cdot\mathbf{F}_{FD}}italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_L ⋅ bold_F start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT end_ARG (65)
CRLBh=1L𝐅h,𝐶𝑅𝐿subscript𝐵1𝐿subscript𝐅CRLB_{h}=\frac{1}{L\cdot\mathbf{F}_{h}},italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_L ⋅ bold_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG , (66)

then

𝐌𝐒𝐄=1L(𝐅FD+h=1H𝐅h)1=1L(𝐅FD+𝐅H2AD)1,𝐌𝐒𝐄1𝐿superscriptsubscript𝐅𝐹𝐷superscriptsubscript1𝐻subscript𝐅11𝐿superscriptsubscript𝐅𝐹𝐷subscript𝐅superscriptH2𝐴𝐷1\mathbf{MSE}=\frac{1}{L}\left(\mathbf{F}_{FD}+\sum_{h=1}^{H}\mathbf{F}_{h}% \right)^{-1}=\frac{1}{L}\left(\mathbf{F}_{FD}+\mathbf{F}_{\mathbf{\rm{H}^{2}}% AD}\right)^{-1},bold_MSE = divide start_ARG 1 end_ARG start_ARG italic_L end_ARG ( bold_F start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_L end_ARG ( bold_F start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT + bold_F start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT , (67)

Obviously, the MSE is identical to (62). Hence, the multi-modal-learning-based framework is consistent with the corresponding CRLB.

VI Simulation Results

In this section, experimental results are presented to assess the performance of our proposed DOA estimator and the vital parameters of simulation settings will be given in Table I. Furthermore, we use the root-mean-squared error (RMSE) to represent the performance and the hybrid CRLB as a baseline, which is calculated as

RMSE=1UuU(θ^uθ0)2𝑅𝑀𝑆𝐸1𝑈superscriptsubscript𝑢𝑈superscriptsubscript^𝜃𝑢subscript𝜃02\displaystyle RMSE=\sqrt{\frac{1}{U}\sum_{u}^{U}(\hat{\theta}_{u}-\theta_{0})^% {2}}italic_R italic_M italic_S italic_E = square-root start_ARG divide start_ARG 1 end_ARG start_ARG italic_U end_ARG ∑ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_U end_POSTSUPERSCRIPT ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (68)

where U𝑈Uitalic_U denotes the number of Monte Carlo experiments.

TABLE I: SYSTEM DATA SET PARAMETERS
Parameters Values
Number of groups in H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD: H𝐻Hitalic_H 3
Number of subarrays in each group: K1,K2,K3subscript𝐾1subscript𝐾2subscript𝐾3K_{1},K_{2},K_{3}italic_K start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_K start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT 16, 16, 16
Number of antennas in each group: M1,M2,M3subscript𝑀1subscript𝑀2subscript𝑀3M_{1},M_{2},M_{3}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_M start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT 7, 11, 13
Number of antennas of FD array 128
Transmitter direction: θ0subscript𝜃0\theta_{0}italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 41
Monte Carlo experiments: U 5000
Number of snapshots:: L𝐿Litalic_L 100
Conv_1: kernel size, stride 3×3@25633@2563\times 3@2563 × 3 @ 256, 2
Conv_2-Conv_5: kernel size, stride 2×2@25622@2562\times 2@2562 × 2 @ 256, 1
Neurons in FC_1, FC_2, FC_3 1024, 512, 256

Figure.7 illustrates the curves of our proposed DOA methods in RMSE versus SNR, with the corresponding CRLB as the performance baseline. From Figure.7, the proposed four methods could approach the corresponding CRLB when SNR > 0 dB. In particular, as SNR tends to the extremely SNR region, the performance gains over MDDL and MD-RootMUSIC achieved by CoMDDL and CoMD-RootMUSIC grow gradually. At SNR=-10dB, the learned precision of the latter is ten times the former.

Refer to caption
Figure 7: RMSE versus SNR of the proposed method.

Figure.8 plots the curve of RMSE versus the number of snapshots L𝐿Litalic_L under SNR{0,10,20}absent01020\in\{0,10,20\}∈ { 0 , 10 , 20 }dB. From Figure.8, as the number of snapshots L𝐿Litalic_L increases, the RMSE performance gradually improves. Moreover, at SNR=0absent0=0= 0, the performance can approach the corresponding CRLB when L𝐿Litalic_L reaches 150. Meanwhile, the desired DOA estimation can be achieved at low-number snapshots as an increase in SNR.

Refer to caption
Figure 8: RMSE versus number of snapshots with SNR{0,10,20}absent01020\in\{0,10,20\}∈ { 0 , 10 , 20 } dB.

Figure.9 depicts RMSE versus the number of subarrays, where M1=7𝑀17M1=7italic_M 1 = 7, M2=11𝑀211M2=11italic_M 2 = 11, M3=13𝑀313M3=13italic_M 3 = 13, K1=K2=K3𝐾1𝐾2𝐾3K1=K2=K3italic_K 1 = italic_K 2 = italic_K 3 ({16:8:80}(\in\{16:8:80\}( ∈ { 16 : 8 : 80 }. It can be observed from the figure that the corresponding CRLB at SNR{0,10,20}absent01020\in\{0,10,20\}∈ { 0 , 10 , 20 }dB can be realized at almost all points of RMSE as Khsubscript𝐾K_{h}italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT increases. Combining the experimental results in Figure.7 and Figure.8 , the proposed DOA estimator can realize a superior performance in medium and high SNR.

Refer to caption
Figure 9: RMSE versus number of subarrys with SNR{0,10,20}absent01020\in\{0,10,20\}∈ { 0 , 10 , 20 } dB.

Figure.10 plots the curves between RMSE and the ratio of FD structure antennas to the total antenna under SNR{0,10,20}absent01020\in\{0,10,20\}∈ { 0 , 10 , 20 }dB. From Figure.10, the DOA estimation performance gradually improves as the proportion of the FD antennas increases, and the RMSE performance can reach the CRLB when the proportion is up to 4/314314/314 / 31. This implies different scenes should choose the suitable FD antennas to achieve DOA estimation.

Refer to caption
Figure 10: RMSE versus proportion of FD.

VII Conclusions

In this paper, a novel universal H2ADsuperscriptH2AD\rm{H}^{2}ADroman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD-FD MIMO receiver structure of integrating FD subarray with heterogeneous HAD structure is designed. It effectively solves the issue of high latency or low time-efficiency in DOA estimation encountered with conventional HAD structure. The primary advantage of this structure lies in its ability to rapidly eliminate phase ambiguity within a single time-slot, thereby enhancing the time-efficiency of DOA measurements. Based on this structure, two three-step MDL frameworks, MD and CoMD, for measuring DOA without ambiguity is proposed. The CoMD framework establishes a relation between FD and H2ADsuperscriptH2AD\rm{H}^{2}ADroman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD modals by leveraging a prior knowledge of FD, thereby further improving DOA performance and reducing the clustering complexity. Moreover, CNN designed to be a five-layer neural network can obtain more accurate starting sample point of true solution class. Simulation and analysis results demonstrate that the proposed four methods could approach the corresponding CRLB. In particular, the proposed CoMDDL and MDDL exhibit more excellent DOA performance than MD-RootMUSIC and CoMD-RootMUSIC in the extremely low-SNR scenarios. Furthermore, the structure, frameworks and methodologies presented in this paper strike a favorable balance in terms of circuit cost, time-efficiency, phase ambiguity elimination and accuracy. Leveraging these advantages, the proposed H2ADsuperscriptH2AD\rm{H}^{2}ADroman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_AD-FD-based DOA measurements hold promise for the development of green wireless communication technologies in the future.

APPENDIX A: Proof of Theorem 1

Based on (IV-B) and the Lagrangian theorem, we can define L(wFD,wh,ζ)𝐿subscript𝑤𝐹𝐷subscript𝑤𝜁L(w_{FD},w_{h},\zeta)italic_L ( italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_ζ ), as

L(wFD,wh,ζ)=h=1Hwh2CRLBh+wFD2CRLBFD𝐿subscript𝑤𝐹𝐷subscript𝑤𝜁superscriptsubscript1𝐻superscriptsubscript𝑤2𝐶𝑅𝐿subscript𝐵superscriptsubscript𝑤𝐹𝐷2𝐶𝑅𝐿subscript𝐵𝐹𝐷\displaystyle L(w_{FD},w_{h},\zeta)=\sum_{h=1}^{H}w_{h}^{2}CRLB_{h}+w_{FD}^{2}% CRLB_{FD}italic_L ( italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_ζ ) = ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT + italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT (69)
ζ(wFD+h=1Hwh1)𝜁subscript𝑤𝐹𝐷superscriptsubscript1𝐻subscript𝑤1\displaystyle\quad\quad\quad\quad\quad\quad-\zeta(w_{FD}+\sum_{h=1}^{H}w_{h}-1)- italic_ζ ( italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - 1 )

The partial derivatives of (LABEL:lagFunc) are expressed as

L(wFD,wh,ζ)wFD=2wFDCRLBFDζ𝐿subscript𝑤𝐹𝐷subscript𝑤𝜁subscript𝑤𝐹𝐷2subscript𝑤𝐹𝐷𝐶𝑅𝐿subscript𝐵𝐹𝐷𝜁\frac{\partial L(w_{FD},w_{h},\zeta)}{\partial w_{FD}}=2w_{FD}CRLB_{FD}-\zetadivide start_ARG ∂ italic_L ( italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_ζ ) end_ARG start_ARG ∂ italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT end_ARG = 2 italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT - italic_ζ (70)
L(wFD,wh,ζ)wh=2whCRLBhζ𝐿subscript𝑤𝐹𝐷subscript𝑤𝜁subscript𝑤2subscript𝑤𝐶𝑅𝐿subscript𝐵𝜁\frac{\partial L(w_{FD},w_{h},\zeta)}{\partial w_{h}}=2w_{h}CRLB_{h}-\zetadivide start_ARG ∂ italic_L ( italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_ζ ) end_ARG start_ARG ∂ italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG = 2 italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - italic_ζ (71)
L(wFD,wh,ζ)ζ=wFDh=1Hwh+1𝐿subscript𝑤𝐹𝐷subscript𝑤𝜁𝜁subscript𝑤𝐹𝐷superscriptsubscript1𝐻subscript𝑤1\frac{\partial L(w_{FD},w_{h},\zeta)}{\partial\zeta}=-w_{FD}-\sum_{h=1}^{H}w_{% h}+1divide start_ARG ∂ italic_L ( italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_ζ ) end_ARG start_ARG ∂ italic_ζ end_ARG = - italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT + 1 (72)

Then, we can set

{L(wFD,wh,ζ)wFD=0L(wFD,wh,ζ)wh=0L(wFD,wh,ζ)ζ=0cases𝐿subscript𝑤𝐹𝐷subscript𝑤𝜁subscript𝑤𝐹𝐷0missing-subexpression𝐿subscript𝑤𝐹𝐷subscript𝑤𝜁subscript𝑤0missing-subexpression𝐿subscript𝑤𝐹𝐷subscript𝑤𝜁𝜁0missing-subexpression\begin{split}\left\{\begin{array}[]{ll}\frac{\partial L(w_{FD},w_{h},\zeta)}{% \partial w_{FD}}=0\\ \frac{\partial L(w_{FD},w_{h},\zeta)}{\partial w_{h}}=0\\ \frac{\partial L(w_{FD},w_{h},\zeta)}{\partial\zeta}=0\end{array}\right.\end{split}start_ROW start_CELL { start_ARRAY start_ROW start_CELL divide start_ARG ∂ italic_L ( italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_ζ ) end_ARG start_ARG ∂ italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT end_ARG = 0 end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL divide start_ARG ∂ italic_L ( italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_ζ ) end_ARG start_ARG ∂ italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG = 0 end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL divide start_ARG ∂ italic_L ( italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT , italic_ζ ) end_ARG start_ARG ∂ italic_ζ end_ARG = 0 end_CELL start_CELL end_CELL end_ROW end_ARRAY end_CELL end_ROW (73)

which yields

wFD=CRLBFD1CRLBFD1+h=1HCRLBh1subscript𝑤𝐹𝐷𝐶𝑅𝐿superscriptsubscript𝐵𝐹𝐷1𝐶𝑅𝐿superscriptsubscript𝐵𝐹𝐷1superscriptsubscript1𝐻𝐶𝑅𝐿superscriptsubscript𝐵1\displaystyle w_{FD}=\frac{CRLB_{FD}^{-1}}{CRLB_{FD}^{-1}+\sum_{h=1}^{H}CRLB_{% h}^{-1}}italic_w start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT = divide start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG
wh=CRLBh1CRLBFD1+h=1HCRLBh1,subscript𝑤𝐶𝑅𝐿superscriptsubscript𝐵1𝐶𝑅𝐿superscriptsubscript𝐵𝐹𝐷1superscriptsubscript1𝐻𝐶𝑅𝐿superscriptsubscript𝐵1\displaystyle w_{h}=\frac{CRLB_{h}^{-1}}{CRLB_{FD}^{-1}+\sum_{h=1}^{H}CRLB_{h}% ^{-1}},italic_w start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = divide start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_C italic_R italic_L italic_B start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG , (74)

Therefore, the proof of Theorem 1 is completed.

APPENDIX B: CRLB for the H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD-FD structure

In this section, the CRLB for the H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD-FD structure is derived. According to [24], the Fisher information matrix for H2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD-FD structure can be given by

𝐅𝐲=𝐓𝐫{𝐑𝐲1𝐑𝐲θ𝐑𝐲1𝐑𝐲θ}subscript𝐅𝐲𝐓𝐫superscriptsubscript𝐑𝐲1subscript𝐑𝐲𝜃superscriptsubscript𝐑𝐲1subscript𝐑𝐲𝜃\mathbf{F}_{\mathbf{y}}=\mathbf{Tr}\left\{\mathbf{R}_{\mathbf{y}}^{-1}\frac{% \partial\mathbf{R}_{\mathbf{y}}}{\partial\theta}\mathbf{R}_{\mathbf{y}}^{-1}% \frac{\partial\mathbf{R}_{\mathbf{y}}}{\partial\theta}\right\}bold_F start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT = bold_Tr { bold_R start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG bold_R start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG } (75)

where

𝐲𝐲\displaystyle\mathbf{y}bold_y =[𝐲FD𝐲H2AD]=Δ𝐆AHas+𝐰absentdelimited-[]subscript𝐲𝐹𝐷subscript𝐲superscriptH2𝐴𝐷superscriptΔsuperscriptsubscript𝐆𝐴𝐻𝑎𝑠𝐰\displaystyle=\left[\begin{array}[]{l}{\mathbf{y}_{FD}}\\ {\mathbf{y}_{{\rm{H}^{2}}AD}}\end{array}\right]\mathop{=}\limits^{\Delta}% \mathbf{G}_{A}^{H}as+\mathbf{w}= [ start_ARRAY start_ROW start_CELL bold_y start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_y start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] = start_POSTSUPERSCRIPT roman_Δ end_POSTSUPERSCRIPT bold_G start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT italic_a italic_s + bold_w (78)
=[100𝚼AH][𝐚FD𝐚H2AD]s+[𝐰1𝐰2]absentdelimited-[]10missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression0superscriptsubscript𝚼𝐴𝐻missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressiondelimited-[]subscript𝐚𝐹𝐷subscript𝐚superscriptH2𝐴𝐷𝑠delimited-[]subscript𝐰1subscript𝐰2\displaystyle=\left[\begin{array}[]{*{20}{c}}1&0\\ 0&{\mathbf{\Upsilon}_{A}^{H}}\end{array}\right]\left[\begin{array}[]{l}{% \mathbf{a}_{FD}}\\ {\mathbf{a}_{{\rm{H}^{2}}AD}}\end{array}\right]s+\left[\begin{array}[]{l}{% \mathbf{w}_{1}}\\ {\mathbf{w}_{2}}\end{array}\right]= [ start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL bold_Υ start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ] [ start_ARRAY start_ROW start_CELL bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_a start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] italic_s + [ start_ARRAY start_ROW start_CELL bold_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] (85)
𝐑𝐲=𝐒𝐍𝐑𝐆AH𝐚𝐚H𝐆A+𝐈subscript𝐑𝐲superscriptsubscript𝐒𝐍𝐑𝐆𝐴𝐻superscript𝐚𝐚𝐻subscript𝐆𝐴𝐈\mathbf{R}_{\mathbf{y}}=\mathbf{SNR}\mathbf{G}_{A}^{H}\mathbf{a}\mathbf{a}^{H}% \mathbf{G}_{A}+\mathbf{I}bold_R start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT = bold_SNRG start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_aa start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_G start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT + bold_I (86)

Therefore, 𝐑𝐲subscript𝐑𝐲\mathbf{R}_{\mathbf{y}}bold_R start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT can be expressed by (93)

𝐑𝐲subscript𝐑𝐲\displaystyle\mathbf{R}_{\mathbf{y}}bold_R start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT =𝐒𝐍𝐑[10𝟎𝚼AH][𝐚FD𝐚H2AD][𝐚FDH,𝐚H2ADH][10𝟎𝚼A]+𝐈absent𝐒𝐍𝐑delimited-[]10missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression0superscriptsubscript𝚼𝐴𝐻missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressiondelimited-[]subscript𝐚𝐹𝐷subscript𝐚superscriptH2𝐴𝐷superscriptsubscript𝐚𝐹𝐷𝐻superscriptsubscript𝐚superscriptH2𝐴𝐷𝐻delimited-[]10missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression0subscript𝚼𝐴missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression𝐈\displaystyle=\mathbf{SNR}\left[\begin{array}[]{*{20}{c}}1&0\\ \mathbf{0}&{\mathbf{\Upsilon}_{A}^{H}}\end{array}\right]\left[\begin{array}[]{% l}{\mathbf{a}_{FD}}\\ {\mathbf{a}_{{\rm{H}^{2}}AD}}\end{array}\right]\left[{\mathbf{a}_{FD}^{H},% \mathbf{a}_{{\rm{H}^{2}}AD}^{H}}\right]\left[\begin{array}[]{*{20}{c}}1&0\\ \mathbf{0}&{\mathbf{\Upsilon}_{A}}\end{array}\right]+\mathbf{I}= bold_SNR [ start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_Υ start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ] [ start_ARRAY start_ROW start_CELL bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL bold_a start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] [ bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT , bold_a start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ] [ start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_Υ start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ] + bold_I (93)
=𝐒𝐍𝐑[𝐚FD𝐚FDH𝟎𝟎𝚼AH𝐚H2AD𝐚H2ADH𝚼A]+𝐈=[𝐑FD𝟎𝟎𝐑H2AD]absent𝐒𝐍𝐑delimited-[]subscript𝐚𝐹𝐷superscriptsubscript𝐚𝐹𝐷𝐻0missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression0superscriptsubscript𝚼𝐴𝐻subscript𝐚superscriptH2𝐴𝐷superscriptsubscript𝐚superscriptH2𝐴𝐷𝐻subscript𝚼𝐴missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression𝐈delimited-[]subscript𝐑𝐹𝐷0missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression0subscript𝐑superscriptH2𝐴𝐷missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression\displaystyle=\mathbf{SNR}\left[\begin{array}[]{*{20}{c}}{\mathbf{a}_{FD}}{% \mathbf{a}_{FD}^{H}}&\mathbf{0}\\ \mathbf{0}&{\mathbf{\Upsilon}_{A}^{H}}{\mathbf{a}_{{\rm{H}^{2}}AD}}{\mathbf{a}% _{{\rm{H}^{2}}AD}^{H}}{\mathbf{\Upsilon}_{A}}\end{array}\right]+\mathbf{I}=% \left[\begin{array}[]{*{20}{c}}{\mathbf{R}_{FD}}&\mathbf{0}\\ \mathbf{0}&{\mathbf{R}_{{\rm{H}^{2}}AD}}\end{array}\right]= bold_SNR [ start_ARRAY start_ROW start_CELL bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL start_CELL bold_0 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_Υ start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Υ start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ] + bold_I = [ start_ARRAY start_ROW start_CELL bold_R start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT end_CELL start_CELL bold_0 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_R start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ] (98)

Then, 𝐅𝐲subscript𝐅𝐲\mathbf{F}_{\mathbf{y}}bold_F start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT is given by (101),

𝐅𝐲subscript𝐅𝐲\displaystyle\mathbf{F}_{\mathbf{y}}bold_F start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT =𝐓𝐫{[𝐑FD1𝐑FDθ𝐑FD1𝐑FDθ𝟎𝟎𝐑H2AD1𝐑H2ADθ𝐑H2AD1𝐑H2ADθ]}=𝐅FD+𝐅H2ADabsent𝐓𝐫delimited-[]superscriptsubscript𝐑𝐹𝐷1subscript𝐑𝐹𝐷𝜃superscriptsubscript𝐑𝐹𝐷1subscript𝐑𝐹𝐷𝜃0missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression0superscriptsubscript𝐑superscriptH2𝐴𝐷1subscript𝐑superscriptH2𝐴𝐷𝜃superscriptsubscript𝐑superscriptH2𝐴𝐷1subscript𝐑superscriptH2𝐴𝐷𝜃missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsubscript𝐅𝐹𝐷subscript𝐅superscriptH2𝐴𝐷\displaystyle=\mathbf{Tr}\left\{\left[{\begin{array}[]{*{20}{c}}\mathbf{R}_{FD% }^{-1}\frac{\partial\mathbf{R}_{{FD}}}{\partial\theta}\mathbf{R}_{FD}^{-1}% \frac{\partial\mathbf{R}_{FD}}{\partial\theta}&\mathbf{0}\\ \mathbf{0}&{\mathbf{R}_{{\rm{H}^{2}}AD}^{-1}\frac{\partial\mathbf{R}_{{\rm{H}^% {2}}AD}}{\partial\theta}\mathbf{R}_{{\rm{H}^{2}}AD}^{-1}\frac{\partial\mathbf{% R}_{{\rm{H}^{2}}AD}}{\partial\theta}}\end{array}}\right]\right\}=\mathbf{F}_{% FD}+\mathbf{F}_{{\rm{H}^{2}}AD}= bold_Tr { [ start_ARRAY start_ROW start_CELL bold_R start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG bold_R start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG end_CELL start_CELL bold_0 end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_R start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG bold_R start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ] } = bold_F start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT + bold_F start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT (101)

where 𝐅FDsubscript𝐅𝐹𝐷\mathbf{F}_{FD}bold_F start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT is

𝐅FD=8π2𝐒𝐍𝐑cos2θ0d22λ2subscript𝐅𝐹𝐷8superscript𝜋2𝐒𝐍𝐑superscript2subscript𝜃0superscriptsuperscript𝑑2superscript𝜆2\displaystyle\mathbf{F}_{FD}=\frac{{8{\pi^{2}}\mathbf{SNR}{{\cos}^{2}}{\theta_% {0}}\mathop{{d^{2}}}\limits^{-}}}{{\lambda^{2}}}bold_F start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT = divide start_ARG 8 italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_SNR roman_cos start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_BIGOP italic_d start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_BIGOP start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT end_ARG start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (102)

And the deriving process of 𝐅H2ADsubscript𝐅superscriptH2𝐴𝐷\mathbf{F}_{{\rm{H}^{2}}AD}bold_F start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT as follows

𝐲H2ADsubscript𝐲superscriptH2𝐴𝐷\displaystyle\mathbf{y}_{{\rm{H}^{2}}AD}bold_y start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT =[𝐲1,𝐲2,,𝐲H]T=𝚼AH𝐚H2𝐀𝐃s+𝐰absentsuperscriptsubscript𝐲1subscript𝐲2subscript𝐲𝐻𝑇superscriptsubscript𝚼𝐴𝐻subscript𝐚superscriptH2𝐀𝐃𝑠𝐰\displaystyle=\left[\mathbf{y}_{1},\mathbf{y}_{2},\cdots,\mathbf{y}_{H}\right]% ^{T}=\mathbf{\Upsilon}_{A}^{H}\mathbf{a_{{\rm{H}^{2}}AD}}s+\mathbf{w}= [ bold_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , bold_y start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT = bold_Υ start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_AD end_POSTSUBSCRIPT italic_s + bold_w
=Δ[𝚼A,1H𝟎𝟎𝟎𝚼A,2H𝟎𝟎𝟎𝚼A,HH]𝐚H2ADs+𝐰superscriptΔdelimited-[]superscriptsubscript𝚼𝐴1𝐻000superscriptsubscript𝚼𝐴2𝐻000superscriptsubscript𝚼𝐴𝐻𝐻subscript𝐚superscriptH2𝐴𝐷𝑠𝐰\displaystyle\mathop{=}\limits^{\Delta}\left[\begin{array}[]{cccc}\mathbf{% \Upsilon}_{A,1}^{H}&\mathbf{0}&\cdots&\mathbf{0}\\ \mathbf{0}&\mathbf{\Upsilon}_{A,2}^{H}&\cdots&\mathbf{0}\\ \vdots&\vdots&\ddots&\vdots\\ \mathbf{0}&\mathbf{0}&\cdots&\mathbf{\Upsilon}_{A,H}^{H}\end{array}\right]% \mathbf{a}_{{\rm{H}^{2}}AD}s+\mathbf{w}= start_POSTSUPERSCRIPT roman_Δ end_POSTSUPERSCRIPT [ start_ARRAY start_ROW start_CELL bold_Υ start_POSTSUBSCRIPT italic_A , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL start_CELL bold_0 end_CELL start_CELL ⋯ end_CELL start_CELL bold_0 end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_Υ start_POSTSUBSCRIPT italic_A , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL bold_0 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_0 end_CELL start_CELL ⋯ end_CELL start_CELL bold_Υ start_POSTSUBSCRIPT italic_A , italic_H end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ] bold_a start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT italic_s + bold_w (107)

where

𝐚H2AD=[τ1𝐚1T,τ2𝐚2T,,τH𝐚HT]Tsubscript𝐚superscriptH2𝐴𝐷superscriptsubscript𝜏1superscriptsubscript𝐚1𝑇subscript𝜏2superscriptsubscript𝐚2𝑇subscript𝜏𝐻superscriptsubscript𝐚𝐻𝑇𝑇\mathbf{a}_{{\rm{H}^{2}}AD}=\left[\tau_{1}\mathbf{a}_{1}^{T},\tau_{2}\mathbf{a% }_{2}^{T},\cdots,\tau_{H}\mathbf{a}_{H}^{T}\right]^{T}bold_a start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT = [ italic_τ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , italic_τ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , ⋯ , italic_τ start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT (108)

And τhsubscript𝜏\tau_{h}italic_τ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT is the phase of the hhitalic_hth array corresponding to the leftmost antenna

τh={1,h=1ej2πλdsinθ0hj=1h1Nhj,h>1\tau_{h}=\left\{\begin{array}[]{ll}1&,h=1\\ e^{j\frac{2\pi}{\lambda}d\sin\theta_{0}\sum_{h_{j}=1}^{h-1}N_{h_{j}}}&,h>1\end% {array}\right.italic_τ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = { start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL , italic_h = 1 end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUPERSCRIPT italic_j divide start_ARG 2 italic_π end_ARG start_ARG italic_λ end_ARG italic_d roman_sin italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_h - 1 end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_h start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_CELL start_CELL , italic_h > 1 end_CELL end_ROW end_ARRAY (109)

Thus, 𝐑H2𝐀𝐃subscript𝐑superscriptH2𝐀𝐃\mathbf{R_{{\rm{H}^{2}}AD}}bold_R start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_AD end_POSTSUBSCRIPT is given by (122)

𝐑H2ADsubscript𝐑superscriptH2𝐴𝐷\displaystyle\mathbf{R}_{\mathbf{\rm{H}^{2}}AD}bold_R start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT =𝐒𝐍𝐑[𝚼A,1H𝟎𝟎𝟎𝚼A,2H𝟎𝟎𝟎𝚼A,HH][τ1𝐚1τ2𝐚2τH𝐚H][τ1H𝐚1H,,τHH𝐚HH][𝚼A,1𝟎𝟎𝟎𝚼A,2𝟎𝟎𝟎𝚼A,H]+𝐈absent𝐒𝐍𝐑delimited-[]superscriptsubscript𝚼𝐴1𝐻000superscriptsubscript𝚼𝐴2𝐻000superscriptsubscript𝚼𝐴𝐻𝐻delimited-[]subscript𝜏1subscript𝐚1subscript𝜏2subscript𝐚2subscript𝜏𝐻subscript𝐚𝐻superscriptsubscript𝜏1𝐻superscriptsubscript𝐚1𝐻superscriptsubscript𝜏𝐻𝐻superscriptsubscript𝐚𝐻𝐻delimited-[]subscript𝚼𝐴1000subscript𝚼𝐴2000subscript𝚼𝐴𝐻𝐈\displaystyle=\mathbf{SNR}\left[\begin{array}[]{cccc}\mathbf{\Upsilon}_{A,1}^{% H}&\mathbf{0}&\cdots&\mathbf{0}\\ \mathbf{0}&\mathbf{\Upsilon}_{A,2}^{H}&\cdots&\mathbf{0}\\ \vdots&\vdots&\ddots&\vdots\\ \mathbf{0}&\mathbf{0}&\cdots&\mathbf{\Upsilon}_{A,H}^{H}\end{array}\right]% \left[\begin{array}[]{c}\tau_{1}\mathbf{a}_{1}\\ \tau_{2}\mathbf{a}_{2}\\ \vdots\\ \tau_{H}\mathbf{a}_{H}\end{array}\right]\left[\tau_{1}^{H}\mathbf{a}_{1}^{H},% \cdots,\tau_{H}^{H}\mathbf{a}_{H}^{H}\right]\left[\begin{array}[]{cccc}\mathbf% {\Upsilon}_{A,1}&\mathbf{0}&\cdots&\mathbf{0}\\ \mathbf{0}&\mathbf{\Upsilon}_{A,2}&\cdots&\mathbf{0}\\ \vdots&\vdots&\ddots&\vdots\\ \mathbf{0}&\mathbf{0}&\cdots&\mathbf{\Upsilon}_{A,H}\end{array}\right]+\mathbf% {I}= bold_SNR [ start_ARRAY start_ROW start_CELL bold_Υ start_POSTSUBSCRIPT italic_A , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL start_CELL bold_0 end_CELL start_CELL ⋯ end_CELL start_CELL bold_0 end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_Υ start_POSTSUBSCRIPT italic_A , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL bold_0 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_0 end_CELL start_CELL ⋯ end_CELL start_CELL bold_Υ start_POSTSUBSCRIPT italic_A , italic_H end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ] [ start_ARRAY start_ROW start_CELL italic_τ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_τ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_τ start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] [ italic_τ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT , ⋯ , italic_τ start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ] [ start_ARRAY start_ROW start_CELL bold_Υ start_POSTSUBSCRIPT italic_A , 1 end_POSTSUBSCRIPT end_CELL start_CELL bold_0 end_CELL start_CELL ⋯ end_CELL start_CELL bold_0 end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_Υ start_POSTSUBSCRIPT italic_A , 2 end_POSTSUBSCRIPT end_CELL start_CELL ⋯ end_CELL start_CELL bold_0 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL bold_0 end_CELL start_CELL bold_0 end_CELL start_CELL ⋯ end_CELL start_CELL bold_Υ start_POSTSUBSCRIPT italic_A , italic_H end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] + bold_I (122)
=𝐒𝐍𝐑𝐝𝐢𝐚𝐠{[𝚼A,1H𝐚1𝐚1H𝚼A,1,𝚼A,2H𝐚2𝐚2H𝚼A,2,,𝚼A,HH𝐚H𝐚HH𝚼A,H]}+𝐈absent𝐒𝐍𝐑𝐝𝐢𝐚𝐠superscriptsubscript𝚼𝐴1𝐻subscript𝐚1superscriptsubscript𝐚1𝐻subscript𝚼𝐴1superscriptsubscript𝚼𝐴2𝐻subscript𝐚2superscriptsubscript𝐚2𝐻subscript𝚼𝐴2superscriptsubscript𝚼𝐴𝐻𝐻subscript𝐚𝐻superscriptsubscript𝐚𝐻𝐻subscript𝚼𝐴𝐻𝐈\displaystyle=\mathbf{SNR}\mathbf{diag}\left\{\left[\mathbf{\Upsilon}_{A,1}^{H% }\mathbf{a}_{1}\mathbf{a}_{1}^{H}\mathbf{\Upsilon}_{A,1},\mathbf{\Upsilon}_{A,% 2}^{H}\mathbf{a}_{2}\mathbf{a}_{2}^{H}\mathbf{\Upsilon}_{A,2},\cdots,\mathbf{% \Upsilon}_{A,H}^{H}\mathbf{a}_{H}\mathbf{a}_{H}^{H}\mathbf{\Upsilon}_{A,H}% \right]\right\}+\mathbf{I}= bold_SNRdiag { [ bold_Υ start_POSTSUBSCRIPT italic_A , 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Υ start_POSTSUBSCRIPT italic_A , 1 end_POSTSUBSCRIPT , bold_Υ start_POSTSUBSCRIPT italic_A , 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Υ start_POSTSUBSCRIPT italic_A , 2 end_POSTSUBSCRIPT , ⋯ , bold_Υ start_POSTSUBSCRIPT italic_A , italic_H end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_a start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT bold_a start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_Υ start_POSTSUBSCRIPT italic_A , italic_H end_POSTSUBSCRIPT ] } + bold_I
=𝐒𝐍𝐑𝐝𝐢𝐚𝐠{[𝐑𝐲1,𝐑𝐲2,,𝐑𝐲H]}absent𝐒𝐍𝐑𝐝𝐢𝐚𝐠subscript𝐑subscript𝐲1subscript𝐑subscript𝐲2subscript𝐑subscript𝐲𝐻\displaystyle=\mathbf{SNR}\mathbf{diag}\left\{\left[\mathbf{R}_{\mathbf{y}_{1}% },\mathbf{R}_{\mathbf{y}_{2}},\cdots,\mathbf{R}_{\mathbf{y}_{H}}\right]\right\}= bold_SNRdiag { [ bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ⋯ , bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] } (123)

According to (75) and (122), 𝐅H2ADsubscript𝐅superscriptH2𝐴𝐷\mathbf{F}_{{\rm{H}^{2}}AD}bold_F start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT is written as (124),

𝐅H2AD=subscript𝐅superscriptH2𝐴𝐷absent\displaystyle\mathbf{F}_{\mathbf{\rm{H}^{2}}AD}=bold_F start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT = =𝐓𝐫{𝐝𝐢𝐚𝐠([𝐑𝐲11𝐑𝐲1θ𝐑𝐲11𝐑𝐲1θ,𝐑𝐲21𝐑𝐲2θ𝐑𝐲21𝐑𝐲2θ,,𝐑𝐲H1𝐑𝐲Hθ𝐑𝐲H1𝐑𝐲Hθ])}=h=1H𝐅habsent𝐓𝐫𝐝𝐢𝐚𝐠superscriptsubscript𝐑subscript𝐲11subscript𝐑subscript𝐲1𝜃superscriptsubscript𝐑subscript𝐲11subscript𝐑subscript𝐲1𝜃superscriptsubscript𝐑subscript𝐲21subscript𝐑subscript𝐲2𝜃superscriptsubscript𝐑subscript𝐲21subscript𝐑subscript𝐲2𝜃superscriptsubscript𝐑subscript𝐲𝐻1subscript𝐑subscript𝐲𝐻𝜃superscriptsubscript𝐑subscript𝐲𝐻1subscript𝐑subscript𝐲𝐻𝜃superscriptsubscript1𝐻subscript𝐅\displaystyle=\mathbf{Tr}\left\{\mathbf{diag}\left(\left[\mathbf{R}_{\mathbf{y% }_{1}}^{-1}\frac{\partial\mathbf{R}_{\mathbf{y}_{1}}}{\partial\theta}\mathbf{R% }_{\mathbf{y}_{1}}^{-1}\frac{\partial\mathbf{R}_{\mathbf{y}_{1}}}{\partial% \theta},\mathbf{R}_{\mathbf{y}_{2}}^{-1}\frac{\partial\mathbf{R}_{\mathbf{y}_{% 2}}}{\partial\theta}\mathbf{R}_{\mathbf{y}_{2}}^{-1}\frac{\partial\mathbf{R}_{% \mathbf{y}_{2}}}{\partial\theta},\cdots,\mathbf{R}_{\mathbf{y}_{H}}^{-1}\frac{% \partial\mathbf{R}_{\mathbf{y}_{H}}}{\partial\theta}\mathbf{R}_{\mathbf{y}_{H}% }^{-1}\frac{\partial\mathbf{R}_{\mathbf{y}_{H}}}{\partial\theta}\right]\right)% \right\}=\sum_{h=1}^{H}\mathbf{F}_{h}= bold_Tr { bold_diag ( [ bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG , bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG , ⋯ , bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT divide start_ARG ∂ bold_R start_POSTSUBSCRIPT bold_y start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG ∂ italic_θ end_ARG ] ) } = ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT bold_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT (124)

where 𝐅hsubscript𝐅\mathbf{F}_{h}bold_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT in accordance with [24] expressed by

𝐅h=8π2𝐒𝐍𝐑2cos2θ0λ2MhΞ[112bh(θ0)4Mh2Kh2(Kh21)d2+MhKhΞ(bh(θ0)μ2+Kh{bh2(θ0)μ})],subscript𝐅8superscript𝜋2superscript𝐒𝐍𝐑2superscript2subscript𝜃0superscript𝜆2subscript𝑀Ξdelimited-[]112superscriptnormsubscript𝑏subscript𝜃04superscriptsubscript𝑀2superscriptsubscript𝐾2superscriptsubscript𝐾21superscript𝑑2subscript𝑀subscript𝐾Ξsuperscriptnormsubscript𝑏subscript𝜃0𝜇2subscript𝐾superscriptsubscript𝑏2subscript𝜃0𝜇\displaystyle\mathbf{F}_{h}=\frac{{8{\pi^{2}}\mathbf{SNR}^{2}}{{\cos}^{2}}{% \theta_{0}}}{{{\lambda^{2}}{M_{h}}\Xi}}\left[{\frac{1}{{12}}{{\left\|{{b_{h}}(% {\theta_{0}})}\right\|}^{4}}M_{h}^{2}K_{h}^{2}\left({K_{h}^{2}-1}\right){d^{2}% }+\frac{{{M_{h}}{K_{h}}}}{\Xi}\left({{{\left\|{{b_{h}}({\theta_{0}})\mu}\right% \|}^{2}}+{K_{h}}\Re\left\{{{b_{h}}^{2}({\theta_{0}})\mu}\right\}}\right)}% \right],bold_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT = divide start_ARG 8 italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_SNR start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_cos start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT roman_Ξ end_ARG [ divide start_ARG 1 end_ARG start_ARG 12 end_ARG ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ∥ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 1 ) italic_d start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + divide start_ARG italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG start_ARG roman_Ξ end_ARG ( ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_μ ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT roman_ℜ { italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_μ } ) ] , (125)

The CRLB can be given by

CRLB=1L𝐅𝐲1=1L(𝐅FD+𝐅H2AD)1𝐶𝑅𝐿𝐵1𝐿superscriptsubscript𝐅𝐲11𝐿superscriptsubscript𝐅𝐹𝐷subscript𝐅superscriptH2𝐴𝐷1\displaystyle CRLB=\frac{1}{L}\mathbf{F}_{\mathbf{y}}^{-1}=\frac{1}{L}\left(% \mathbf{F}_{FD}+\mathbf{F}_{\mathbf{\rm{H}^{2}}AD}\right)^{-1}italic_C italic_R italic_L italic_B = divide start_ARG 1 end_ARG start_ARG italic_L end_ARG bold_F start_POSTSUBSCRIPT bold_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_L end_ARG ( bold_F start_POSTSUBSCRIPT italic_F italic_D end_POSTSUBSCRIPT + bold_F start_POSTSUBSCRIPT roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_A italic_D end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT (126)

Then, submitting (102) and (124) into (126), we can obtain the closed-form expression of the CRLB.

CRLB=λ28Lπ2𝐒𝐍𝐑cos2θ0[d22+h=1H(bh(θ0)4MhKh2(Kh21)d212Ξ+KhΞ2(bh(θ0)μ2+Kh{bh2(θ0)μ}))]𝐶𝑅𝐿𝐵superscript𝜆28𝐿superscript𝜋2𝐒𝐍𝐑superscript2subscript𝜃0delimited-[]superscriptsuperscript𝑑2superscriptsubscript1𝐻superscriptnormsubscript𝑏subscript𝜃04subscript𝑀superscriptsubscript𝐾2superscriptsubscript𝐾21superscript𝑑212Ξsubscript𝐾superscriptΞ2superscriptnormsubscript𝑏subscript𝜃0𝜇2subscript𝐾superscriptsubscript𝑏2subscript𝜃0𝜇\displaystyle CRLB=\frac{\lambda^{2}}{{8L{\pi^{2}}\mathbf{SNR}{{\cos}^{2}}{% \theta_{0}}}\left[\mathop{{d^{2}}}\limits^{-}+\sum_{h=1}^{H}\left({\frac{{{% \left\|{{b_{h}}({\theta_{0}})}\right\|}^{4}}M_{h}K_{h}^{2}\left({K_{h}^{2}-1}% \right){d^{2}}}{{12\Xi}}+\frac{{{K_{h}}}}{\Xi^{2}}\left({{{\left\|{{b_{h}}({% \theta_{0}})\mu}\right\|}^{2}}+{K_{h}}\Re\left\{{{b_{h}}^{2}({\theta_{0}})\mu}% \right\}}\right)}\right)\right]}italic_C italic_R italic_L italic_B = divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 8 italic_L italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_SNR roman_cos start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT [ start_BIGOP italic_d start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_BIGOP start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_h = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( divide start_ARG ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ∥ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 1 ) italic_d start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 12 roman_Ξ end_ARG + divide start_ARG italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT end_ARG start_ARG roman_Ξ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ( ∥ italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_μ ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT roman_ℜ { italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_μ } ) ) ] end_ARG (127)

The derivation of CRLB for proposed FDH2superscriptH2\rm{H}^{2}roman_H start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPTAD structure is completed.

Data Availability

The data supporting the conclusions of this article are included in the article.

Conflicts of Interest

The authors declare that they have no conficts of interest.

References

  • [1] Z. Zhu, S. Guo, J. Chen, S. Xue, Z. Xu, P. Wu, G. Cui, and L. Kong, “Non-line-of-sight targets localization algorithm via joint estimation of dod and doa,” IEEE Transactions on Instrumentation and Measurement, vol. 72, pp. 1–11, 2023.
  • [2] D. An, S. Chang, M. Hwang, Y. Youn, D. Kim, C. Lee, and W. Hong, “Diagnosis and modification of propagating electromagnetic waves using doa systems and em skins,” IEEE Transactions on Antennas and Propagation, vol. 72, no. 4, pp. 3629–3640, 2024.
  • [3] A. Gorcin and H. Arslan, “A two-antenna single rf front-end doa estimation system for wireless communications signals,” IEEE Transactions on Antennas and Propagation, vol. 62, no. 10, pp. 5321–5333, 2014.
  • [4] R. Dong, S. Jiang, X. Hua, Y. Teng, F. Shu, and J. Wang, “Low-complexity joint phase adjustment and receive beamforming for directional modulation networks via irs,” IEEE open journal of the Communications Society, vol. 3, pp. 1234–1243, 2022.
  • [5] F. Shu, X. Wu, J. Hu, J. Li, R. Chen, and J. Wang, “Secure and precise wireless transmission for random-subcarrier-selection-based directional modulation transmit antenna array,” IEEE Journal on Selected Areas in Communications, vol. 36, no. 4, pp. 890–904, 2018.
  • [6] F. Shu, Y. Teng, J. Li, M. Huang, W. Shi, J. Li, Y. Wu, and J. Wang, “Enhanced secrecy rate maximization for directional modulation networks via irs,” IEEE Transactions on Communications, vol. 69, no. 12, pp. 8388–8401, 2021.
  • [7] F. Shu, T. Shen, L. Xu, Y. Qin, S. Wan, S. **, X. You, and J. Wang, “Directional modulation: A physical-layer security solution to b5g and future wireless networks,” IEEE Network, vol. 34, no. 2, pp. 210–216, 2020.
  • [8] H. A. Kassir, I. T. Rekanos, P. I. Lazaridis, T. V. Yioultsis, N. V. Kantartzis, C. S. Antonopoulos, G. K. Karagiannidis, and Z. D. Zaharis, “Doa estimation for 6g communication systems,” in 2023 12th International Conference on Modern Circuits and Systems Technologies (MOCAST), 2023, pp. 1–4.
  • [9] J. Zheng and Q. Zhang, “Secrecy outage probability of multiple-input–multiple-output secure internet of things communication systems,” IEEE Internet of Things Journal, vol. 11, no. 6, pp. 9843–9853, 2024.
  • [10] R. O. Schmidt, A signal subspace approach to multiple emitter location and spectral estimation.   Stanford University, 1982.
  • [11] R. Roy, A. Paulraj, and T. Kailath, “Esprit–a subspace rotation approach to estimation of parameters of cisoids in noise,” IEEE transactions on acoustics, speech, and signal processing, vol. 34, no. 5, pp. 1340–1342, 1986.
  • [12] A. Barabell, “Improving the resolution performance of eigenstructure-based direction-finding algorithms,” in ICASSP’83. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 8.   IEEE, 1983, pp. 336–339.
  • [13] B. D. Rao and K. S. Hari, “Performance analysis of root-music,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 37, no. 12, pp. 1939–1949, 1989.
  • [14] X. Zhang, X. Gao, and D. Xu, “Multi-invariance esprit-based blind doa estimation for mc-cdma with an antenna array,” IEEE Transactions on Vehicular Technology, vol. 58, no. 8, pp. 4686–4690, 2009.
  • [15] M. Pesavento, A. B. Gershman, and M. Haardt, “Unitary root-music with a real-valued eigendecomposition: A theoretical and experimental performance study,” IEEE transactions on signal processing, vol. 48, no. 5, pp. 1306–1314, 2000.
  • [16] F.-G. Yan, M. **, S. Liu, and X.-L. Qiao, “Real-valued music for efficient direction estimation with arbitrary array geometries,” IEEE Transactions on Signal Processing, vol. 62, no. 6, pp. 1548–1560, 2014.
  • [17] D. Zhang, Y. Zhang, G. Zheng, C. Feng, and J. Tang, “Improved doa estimation algorithm for co-prime linear arrays using root-music algorithm,” Electronics Letters, vol. 53, no. 18, pp. 1277–1279, 2017.
  • [18] D. Hu, Y. Zhang, L. He, and J. Wu, “Low-complexity deep-learning-based doa estimation for hybrid massive mimo systems with uniform circular arrays,” IEEE Wireless Communications Letters, vol. 9, no. 1, pp. 83–86, 2020.
  • [19] G. K. Papageorgiou, M. Sellathurai, and Y. C. Eldar, “Deep networks for direction-of-arrival estimation in low snr,” IEEE Transactions on Signal Processing, vol. 69, pp. 3714–3729, 2021.
  • [20] J. Ma, M. Wang, Y. Chen, and H. Wang, “Deep convolutional network-assisted multiple direction-of-arrival estimation,” IEEE Signal Processing Letters, 2024.
  • [21] J. A. Zhang, W. Ni, P. Cheng, and Y. Lu, “Angle-of-arrival estimation using different phase shifts across subarrays in localized hybrid arrays,” IEEE Communications Letters, vol. 20, no. 11, pp. 2205–2208, 2016.
  • [22] S.-F. Chuang, W.-R. Wu, and Y.-T. Liu, “High-resolution aoa estimation for hybrid antenna arrays,” IEEE Transactions on Antennas and Propagation, vol. 63, no. 7, pp. 2955–2968, 2015.
  • [23] B. Shi, N. Chen, X. Zhu, Y. Qian, Y. Zhang, F. Shu, and J. Wang, “Impact of low-resolution adc on doa estimation performance for massive mimo receive array,” IEEE Systems Journal, vol. 16, no. 2, pp. 2635–2638, 2022.
  • [24] F. Shu, Y. Qin, T. Liu, L. Gui, Y. Zhang, J. Li, and Z. Han, “Low-Complexity and High-Resolution DOA Estimation for Hybrid Analog and Digital Massive MIMO Receive Array,” IEEE Trans. Commun, vol. 66, no. 6, pp. 2487–2501, 2018.
  • [25] B. Shi, X. Jiang, N. Chen, Y. Teng, J. Lu, F. Shu, J. Zou, J. Li, and J. Wang, “Fast ambiguous doa elimination method of doa measurement for hybrid massive mimo receiver,” Science China Information Sciences, vol. 65, p. 159302, 2022.
  • [26] Y. Chen, X. Zhan, F. Shu, Q. Jie, X. Cheng, Z. Zhuang, and J. Wang, “Two low-complexity doa estimators for massive/ultra-massive mimo receive array,” IEEE Wireless Communications Letters, vol. 11, no. 11, pp. 2385–2389, 2022.
  • [27] F. Shu, X. Zhan, W. Cai, M. Huang, Q. Jie, Y. Li, B. Shi, and J. Wang, “Machine-learning-aided massive hybrid analog and digital MIMO DOA estimation for future wireless networks,” CoRR, vol. abs/2201.04452, 2022. [Online]. Available: https://arxiv.longhoe.net/abs/2201.04452
  • [28] B. Friedlander, “The root-MUSIC algorithm for direction finding with interpolated arrays,” Signal Processing, 1993.
  • [29] T. E. Tuncer and B. Friedlander, Classical and Modern Direction-of-Arrival Estimation.   Burlington, MA 01803, USA, 2009.