Near-field Beam Training with Sparse DFT Codebook

Cong Zhou, Chenyu Wu, Changsheng You, , Jiasi Zhou, Shuo Shi Cong Zhou, Chenyu Wu and Shuo Shi are with the School of Electronic and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China. (e-mail: [email protected], [email protected], [email protected]).Changsheng You is with the Department of Electronic and Electrical Engineering, Southern University of Science and Technology, Shenzhen 518055, China. (e-mails: [email protected]).Jiasi Zhou is with the School of Medical Information and Engineering, Xuzhou Medical University, Xuzhou, 221004, China. (e-mails: [email protected]).Corresponding author: Chenyu Wu and Changsheng You.
Abstract

Extremely large-scale arrays (XL-arrays) have emerged as one promising technology to improve the spectral efficiency and spatial resolution in future sixth generation (6G) wireless systems. The drastic increase in the number of antennas renders the communication users more likely to be located in the near-field region, which requires a more accurate spherical (instead of planar) wavefront propagation modeling. However, this also inevitably incurs unaffordable beam training overhead when performing a two-dimensional (2D) beam-search in both the angular and range domains. To address this issue, we first introduce in this paper a new sparse discrete Fourier transform (DFT) codebook, which exhibits the angular periodicity in the received beam pattern at the user. This thus motivates us to propose a three-phase beam training scheme. Specifically, in the first phase, we utilize the sparse DFT codebook for beam swee** in an angular subspace and estimate candidate user angles according to the received beam pattern. Then, a central subarray is activated to scan specific candidate angles for resolving the issue of angular ambiguity for identifying the user angle. In the third phase, the polar-domain codebook is applied in the estimated angle to search the best effective user range. Finally, numerical results show that our proposed beam training scheme enabled by the sparse DFT codebook achieves 98.67% beam training overhead reduction as compared to the exhaustive-search scheme, yet without compromising rate performance in the high signal-to-ratio (SNR) regime.

Index Terms:
Extremely large-scale array, near-field communications, beam training, DFT codebook, sparse array.

I Introduction

Extremely large-scale arrays/surfaces (XL-arrays/surfaces) have been envisioned as one of the key ingredients to drive the evolution of six generation (6G) wireless systems[1, 2, 3, 4]. Specifically, XL-arrays/surfaces with a significant number of antennas can be deployed at the base station (BS) to achieve ultra-high spectral efficiency and spatial resolution, hence accommodating the escalating demands for new applications such as spanning metaverse and digital twin [5, 6, 7]. The drastic increase in the number of antennas in high-frequency bands represents a qualitative paradigm shift in the electromagnetic (EM) propagation modeling, giving rise to the new near-field communications [8].

Particularly, different from the far-field EM propagation which is simply approximated by planar waves, the near-field channel modeling necessitates the use of more accurate spherical waves[9, 10, 11]. As such, near-field communications possess several unique properties in contrast to far-field communications. First, the spherical wavefront characteristic opens up the possibility of near-field beamfocusing, for which the beam energy can be concentrated at a specific location/region rather than a spatial angle typically for far-field beamforming[12, 13, 14]. The beam-focusing capability of near-field communications enables XL-array to flexibly form highly directional beams in both the angle and range domain, and hence can be leveraged in various applications to improve the system performance, such as mitigating the inter-user interference, improving the accuracy of sensing and localization[15, 16, 17], and enhancing the charging efficiency of wireless power transfer[18]. Second, the rank of line-of-sight (LoS) channels for near-field multiple-input multiple-output (MIMO) communication systems can be larger than one, hence enhancing the spatial multiplexing gains [19]. In this paper, we propose an efficient near-field beam training scheme with an sparse discrete Fourier transform (DFT) codebook by using the sparse antenna activation method, which significantly reduces the beam training overhead.

I-A Related Works

I-A1 Near-Field Wireless Systems

Near-field communications bring new opportunities and challenges, which has motivated upsurging research interest. For example, in [12], the authors studied the beamfocusing design based on the fully-digital architectures, hybrid phase shifter-based precoders, and dynamic metasurface antenna architecture for XL-MIMO arrays. It is shown that near-field beamfocusing provides new degree of freedom (DoF) to mitigate the interference among users, even when they are located at the same spatial angle. In addition, a new concept of location division multiple access (LDMA) was proposed in [20], which exploits orthogonality of the near-field beamfocusing vectors in the range domain to serve different users at the same angle. The authors in [21] developed a framework for analyzing and designing XL-MIMO systems with spatial non-stationarity. It was revealed that the performance of the proposed framework approaches that of the conventional full-antenna array based designs albeit with lower complexity. In addition, a holographic metasurface antennas (HMAs) based multi-user system was investigated in [22], where the digital transmit precoder and the analog HMA weighting matrix were jointly optimized to minimize the transmit power. The authors in [23] considered a simultaneous wireless information and power transfer (SWIPT) system, where energy harvesting (EH) and information decoding receivers are located in the near- and far-field regions of the XL-array, respectively. The beam scheduling and power allocation were jointly optimized to maximize the weighted-sum power harvested at EH receivers [23]. Moreover, a directional modulation system was developed for near-field physical layer security systems[24]. Specifically, a fully analog precoding algorithm along with artificial noise and power allocation was proposed to realize secure transmission in both the angular and range domains. Near-field sensing with XL-array was studied in [25], where the closed-form expressions of the Cramér-Rao Bounds for both the angle and range estimations are derived.

I-A2 Near-field Beam Training

In high-frequency bands, direct channel estimation methods may not be very effective due to severe path-loss and signal misalignment. As such, beam training is efficient in establishing initial links with high signal-to-noise ratio (SNR) for data transmission and channel state information (CSI) acquisition [26]. However, due to the spherical wave propagation, near-field beam training is more challenging compared with its far-field counterpart as it requires a joint beam search over both the angular and range domain. Particularly, conventional far-field beam training will suffer from significant performance loss in the near-field region due to the so-called energy-spread effect, for which the energy of a far-field beamformer is no longer steered towards one angle, but spread in multiple angles. Hence, the conventional far-field beam training method cannot be directly applied to the near-field beam training. To address this issue, the authors in [27] proposed a new codebook design in the polar domain, for which the angular domain is uniformly sampled whereas the range domain is non-uniformly sampled. One can simply invoke this codebook for the exhaustive-search based near-field beam training. However, the overhead of this scheme is the product of the number of antennas and range samples, which is prohibitively high for the implementation of XL-arrays. To reduce the overhead, the authors in [28] proposed a novel two-phase near-field beam training method. This method leveraged a key observation that the true user angle approximately lies in the middle of an angular support with high received SNRs. Based on this observation, one can first estimate the user angle by using far-field DFT beams, and then estimate the user range with the polar-domain codebook in [27]. Besides, deep learning techniques have also been exploited in [29] and [30] to reduce the near-field beam training overhead, where deep neural networks (DNN) are trained based on conventional far-field codebooks and near-field codebook, respectively. However, the training overhead of these methods scales linearly with the number of antennas, which is still unaffordable for communication systems. This issue motivates the design of efficient hierarchical beam training schemes for near-field communications to reduce the training overhead to the logarithmic order, e.g., [31, 32, 33]. However, hierarchical beam training schemes suffer from several inherent drawbacks. First, hierarchical methods usually require frequent feedback and neglect underlying transmission delays. Moreover, there exists the error propagation issue due to the progressive beam search [34].

I-B Motivations and Contributions

The existing works on near-field beam training inevitably incur unacceptable overhead, while the off-grid channel estimation methods face highly computational complexity such as high dimensional matrix inversion. Moreover, the beam training overhead of these existing works scales linearly with the number of antennas apart from hierarchical beam training schemes, which suffers from several inherent drawbacks such as error propagation and user feed-back overhead [34]. Motivated by the above, this paper explores a new sparse DFT codebook and a three-phase beam training scheme to reduce the near-field beam training overhead, which scales with the square root of the number of antennas. The main contributions are summarized as follows.

  • First, we propose a novel sparse DFT codebook by sparsely activating the XL-array antennas and constructing the reduced DFT codebook with the equivalent sparse linear array (SLA). Specifically, the sparse DFT codebook consists of sparse far-field channel response vectors, which is sampled from far-field channel response vectors. Then, we characterize the received beam pattern at the near-field user when the sparse DFT codebook is used for beam swee**. Interestingly, it is shown that the received beam pattern exhibits periodicity in the angular domain, while there still exists the energy-spread effect. Then, we show that the user angle information is contained in a period of the received beam pattern at the user and can be estimated via a defined angular support.

  • Second, we propose a novel three-phase beam training scheme based on the sparse DFT codebook. Specifically, in the first phase, we utilize a small number of the sparse DFT codewords to sweep an angular subspace and estimate one candidate user angle according to the middle of the defined angular support. Then, in the second phase, we activate a central subarray to resolve the angular ambiguity by virtue of the periodicity of the received beam pattern at the user. Subsequently, the polar-domain codebook is utilized to search the best user range in the estimated user angle.

  • Finally, extensive numerical results are presented to demonstrate the effectiveness of our proposed beam training scheme enabled by the sparse DFT codebook. It is shown that the proposed three-phase beam training scheme can achieve nearly the same performance with the exhaustive-search beam training method in the high-SNR regime, while reducing more than 98% of the training overhead. In the low-SNR regime, the proposed scheme suffers from slight performance loss, while the effective rate still significantly exceeds all benchmark schemes due to lower beam training overhead.

I-C Organization and Notations

The remainder of this paper is organized as follows. System model is presented in Sections II. In Section III, we introduce several benchmarks. Section IV provides comprehensive analysis of the received beam pattern at the user with the sparse DFT codebook. Then, in Section V, the proposed three-phase beam training scheme is elaborated. Finally, numerical results are provided in section VI to demonstrate the effectiveness of the proposed beam training scheme followed by the conclusions made in Section VII.

Notations: Vectors and matrices are respectively denoted by lower-case and upper-case boldface letters. The symbol ||\left|\cdot\right|| ⋅ | represents the absolute value, while \left\lVert\cdot\right\lVert∥ ⋅ ∥ denotes the l2subscript𝑙2l_{2}italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT norm. Moreover, we use ()Hsuperscript𝐻(\cdot)^{H}( ⋅ ) start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT to denote the conjugate transpose operation. Finally, the Hadamard product is represented by direct-product\odot. The key symbols used in this paper are listed in Table I.

Table I: List of main symbols and their physical meanings.
N𝑁Nitalic_N Number of BS antennas U𝑈Uitalic_U Antenna activation interval
D𝐷Ditalic_D Array aperture size M𝑀Mitalic_M Number of antennas of the activated subarray
λ𝜆\lambdaitalic_λ Carrier wavelength z𝑧zitalic_z AWGN
d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT Antenna spacing Q𝑄Qitalic_Q Number of antennas of the activated SLA
𝐡nearHsubscriptsuperscript𝐡𝐻near\mathbf{h}^{H}_{\rm near}bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_near end_POSTSUBSCRIPT Near-field channel κ𝜅\kappaitalic_κ Rician factor
L𝐿Litalic_L Number of channel paths Ptotsubscript𝑃tot{{{P}}_{\rm tot}}italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT Transmit SNR
θ0subscript𝜃0\theta_{0}italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT BS center-user spatial angle 𝒲DFTsubscript𝒲DFT{{\mathcal{W}}}_{\rm{DFT}}caligraphic_W start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT Sparse DFT codebook
r0subscript𝑟0r_{0}italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT BS center-user range β𝛽\betaitalic_β Channel gain
𝐛(r0,θ0)𝐛subscript𝑟0subscript𝜃0\mathbf{b}\left(r_{0},\theta_{0}\right)bold_b ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) Near-field channel steering vector ΔΔ\Deltaroman_Δ Difference of spatial angles
𝐰𝐰{\mathbf{w}}bold_w Beamforming vector f(r0,θ0;θ)𝑓subscript𝑟0subscript𝜃0𝜃f({r_{0}},{\theta_{0}};\theta)italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) Received beam pattern
𝒱Subsubscript𝒱Sub{{\mathcal{V}}}_{\rm{Sub}}caligraphic_V start_POSTSUBSCRIPT roman_Sub end_POSTSUBSCRIPT DFT codebook for the activated subarray 𝒳¯Polsubscript¯𝒳Pol\bar{\mathcal{X}}_{\rm{Pol}}over¯ start_ARG caligraphic_X end_ARG start_POSTSUBSCRIPT roman_Pol end_POSTSUBSCRIPT Polar-domain codebook

II System Model

We consider a single-user XL-array downlink communication system, where the BS is equipped with a dense uniform linear array (ULA) with N𝑁Nitalic_N antennas. In this section, the near-field channel and signal model for the ULA are introduced.

II-A Near-field Channel Model

We assume that the dense ULA is situated at the y𝑦yitalic_y-axis and centered at the origin. Specifically, each antenna of XL-array is located at (0,nd00𝑛subscript𝑑00,nd_{0}0 , italic_n italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT), where n𝒩{0,±1,,±N12}𝑛𝒩0plus-or-minus1plus-or-minus𝑁12n\in\mathcal{N}\triangleq\{0,\pm 1,\cdots,\pm\frac{N-1}{2}\}italic_n ∈ caligraphic_N ≜ { 0 , ± 1 , ⋯ , ± divide start_ARG italic_N - 1 end_ARG start_ARG 2 end_ARG } and d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT respectively denote the antenna index and inter spacing. For the dense ULA, we have d0=λ2subscript𝑑0𝜆2d_{0}=\frac{\lambda}{2}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = divide start_ARG italic_λ end_ARG start_ARG 2 end_ARG, where λ𝜆\lambdaitalic_λ represents the carrier wavelength. Moreover, the single user is assumed to be located in the Fresnel near-field region of the XL-array where the BS-user range r0subscript𝑟0r_{0}italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is larger than the Fresnel distance ZF=max{dR,1.2D}subscript𝑍Fsubscript𝑑𝑅1.2𝐷Z_{\rm F}=\max{\{d_{R},1.2D\}}italic_Z start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT = roman_max { italic_d start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT , 1.2 italic_D } and smaller than the Rayleigh distance ZR=2D2λsubscript𝑍R2superscript𝐷2𝜆Z_{\rm R}=\frac{2D^{2}}{\lambda}italic_Z start_POSTSUBSCRIPT roman_R end_POSTSUBSCRIPT = divide start_ARG 2 italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_λ end_ARG with D=(N1)d0𝐷𝑁1subscript𝑑0D=(N-1)d_{0}italic_D = ( italic_N - 1 ) italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT denoting the array aperture. Moreover, dRsubscript𝑑𝑅d_{R}italic_d start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT is proven to be several wavelengths in [35] and the Fresnel distance can be simplified by ZF=1.2Dsubscript𝑍F1.2𝐷Z_{\rm F}=1.2Ditalic_Z start_POSTSUBSCRIPT roman_F end_POSTSUBSCRIPT = 1.2 italic_D. Hence, the line-of-sight (LoS) channel follows the uniform spherical wave (USW) model [36]. For example, when N=257𝑁257N=257italic_N = 257 and f=30𝑓30f=30italic_f = 30 GHz, the Rayleigh distance is approximately 328328328328 m, which makes the user more likely to be located in the near-field region. Then the general multi-path channel from the XL-array to the user can be modeled as [23]

𝐡nearH=Nβ𝐛H(r0,θ0)+=1LNLβ𝐛H(r¯,θ¯),subscriptsuperscript𝐡𝐻near𝑁𝛽superscript𝐛𝐻subscript𝑟0subscript𝜃0superscriptsubscript1𝐿𝑁𝐿subscript𝛽superscript𝐛𝐻subscript¯𝑟subscript¯𝜃\mathbf{h}^{H}_{\rm near}=\sqrt{N}\beta\mathbf{b}^{H}(r_{0},\theta_{0})+\sum_{% \ell=1}^{L}\sqrt{\frac{N}{L}}\beta_{\ell}\mathbf{b}^{H}(\bar{r}_{\ell},\bar{% \theta}_{\ell}),bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_near end_POSTSUBSCRIPT = square-root start_ARG italic_N end_ARG italic_β bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L end_POSTSUPERSCRIPT square-root start_ARG divide start_ARG italic_N end_ARG start_ARG italic_L end_ARG end_ARG italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( over¯ start_ARG italic_r end_ARG start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT , over¯ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) , (1)

which includes one LoS path 𝐡LoSHsuperscriptsubscript𝐡LoS𝐻\mathbf{h}_{\rm LoS}^{H}bold_h start_POSTSUBSCRIPT roman_LoS end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT and L𝐿Litalic_L non-LoS (NLoS) paths. Herein, the parameters r0subscript𝑟0r_{0}italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT (r¯subscript¯𝑟{\bar{r}_{\ell}}over¯ start_ARG italic_r end_ARG start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT) and θ0subscript𝜃0\theta_{0}italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT (θ¯subscript¯𝜃\bar{\theta}_{\ell}over¯ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT) represent the range and spatial angle of the LoS (\ellroman_ℓ-th NLoS) signal path. Moreover, the parameters β𝛽\betaitalic_β and βsubscript𝛽\beta_{\ell}italic_β start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT denote the LoS path and \ellroman_ℓ-th NLoS path gain, respectively. Mathematically, β𝛽\betaitalic_β can be modeled as [37]

β=κκ+1β0r0eȷ2πr0λ,𝛽𝜅𝜅1subscript𝛽0subscript𝑟0superscript𝑒italic-ȷ2𝜋subscript𝑟0𝜆\beta=\sqrt{\frac{\kappa}{\kappa+1}}\frac{\sqrt{\beta_{0}}}{r_{{0}}}e^{-\frac{% \jmath 2\pi r_{{0}}}{\lambda}},italic_β = square-root start_ARG divide start_ARG italic_κ end_ARG start_ARG italic_κ + 1 end_ARG end_ARG divide start_ARG square-root start_ARG italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_ȷ 2 italic_π italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_λ end_ARG end_POSTSUPERSCRIPT , (2)

where κ𝜅\kappaitalic_κ and β0subscript𝛽0\beta_{0}italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT represent the Rician factor and reference channel gain at a range of 1 m, respectively.

Refer to caption
Figure 1: A narrow-band Near-field XL-array communication system.

In this paper, we mainly consider the near-field communication scenarios in high-frequency bands such as millimeter-wave (mmWave) and even terahertz (THz). In these scenarios, the NLoS channel paths exhibit negligible power owing to the severe path-loss and shadowing effects [38]. Therefore, we only consider the LoS channel and the BS-user channel can be approximated as 𝐡nearH𝐡LoSsubscriptsuperscript𝐡𝐻nearsubscript𝐡LoS\mathbf{h}^{H}_{\rm near}\approx\mathbf{h}_{\rm LoS}bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_near end_POSTSUBSCRIPT ≈ bold_h start_POSTSUBSCRIPT roman_LoS end_POSTSUBSCRIPT 111The case where comparable multi-path components exist is more complicated and discussions are provided in the Section V-D. We will evaluate the Rician fading channel in the simulation results.. Based on USW model, the near-field LoS channel from BS\touser can be modeled as [28]

𝐡nearHNβ𝐛H(r0,θ0),subscriptsuperscript𝐡𝐻near𝑁𝛽superscript𝐛𝐻subscript𝑟0subscript𝜃0\mathbf{h}^{H}_{\rm near}\approx\sqrt{N}\beta\mathbf{b}^{H}(r_{0},\theta_{0}),bold_h start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_near end_POSTSUBSCRIPT ≈ square-root start_ARG italic_N end_ARG italic_β bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , (3)

where 𝐛H(r0,θ0)superscript𝐛𝐻subscript𝑟0subscript𝜃0\mathbf{b}^{H}(r_{0},\theta_{0})bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) denotes the near-field channel steering vector, defined as [39]

[𝐛H(r0,θ0)]n=1Neȷ2πrnλ,n𝒩,formulae-sequencesubscriptdelimited-[]superscript𝐛𝐻subscript𝑟0subscript𝜃0𝑛1𝑁superscript𝑒italic-ȷ2𝜋subscript𝑟𝑛𝜆for-all𝑛𝒩\left[\mathbf{b}^{H}\left(r_{0},\theta_{0}\right)\right]_{n}=\frac{1}{\sqrt{N}% }e^{-\frac{\jmath 2\pi r_{n}}{\lambda}},\forall n\in\mathcal{N},[ bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_N end_ARG end_ARG italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_ȷ 2 italic_π italic_r start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_λ end_ARG end_POSTSUPERSCRIPT , ∀ italic_n ∈ caligraphic_N , (4)

with rn=r02+n2d022r0θ0nd0subscript𝑟𝑛superscriptsubscript𝑟02superscript𝑛2superscriptsubscript𝑑022subscript𝑟0subscript𝜃0𝑛subscript𝑑0r_{n}=\sqrt{r_{0}^{2}+{n}^{2}d_{0}^{2}-2r_{0}\theta_{0}nd_{0}}italic_r start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = square-root start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 2 italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_n italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG denoting range between the n𝑛nitalic_n-th antenna and the user. Moreover, θ0=cosϕ0[1,1]subscript𝜃0subscriptitalic-ϕ011\theta_{0}=\cos\phi_{0}\in[-1,1]italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = roman_cos italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ [ - 1 , 1 ] represents the spatial angle at the BS, with ϕ0subscriptitalic-ϕ0\phi_{0}italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT denoting the physical angle-of-departure (AoD) from the BS center to the user. Further, by means of Fresnel approximation, rnsubscript𝑟𝑛r_{n}italic_r start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT can be approximated as

rnr0nd0θ0+n2d02(1θ02)2r0,subscript𝑟𝑛subscript𝑟0𝑛subscript𝑑0subscript𝜃0superscript𝑛2superscriptsubscript𝑑021superscriptsubscript𝜃022subscript𝑟0r_{n}\approx r_{0}-nd_{0}\theta_{0}+\frac{n^{2}d_{0}^{2}(1-\theta_{0}^{2})}{2r% _{0}},italic_r start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≈ italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_n italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + divide start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG start_ARG 2 italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG , (5)

which is shown to be accurate in [5].

II-B Signal Model

Let x𝑥x\in\mathbb{C}italic_x ∈ blackboard_C denote the transmitted symbol by the BS with unit power and 𝐰¯N×1¯𝐰superscript𝑁1\bar{\mathbf{w}}\in\mathbb{C}^{N\times 1}over¯ start_ARG bold_w end_ARG ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × 1 end_POSTSUPERSCRIPT represent the beamforming vector [40]. Then the received signal at the user is given by

y(𝐰¯)=NβPtot𝐛H(r0,θ0)𝐰¯x+z,𝑦¯𝐰𝑁𝛽subscript𝑃totsuperscript𝐛𝐻subscript𝑟0subscript𝜃0¯𝐰𝑥𝑧\displaystyle y(\bar{\mathbf{w}})=\sqrt{N}\beta\sqrt{P_{\rm tot}}\mathbf{b}^{H% }(r_{0},\theta_{0})\bar{\mathbf{w}}x+z,italic_y ( over¯ start_ARG bold_w end_ARG ) = square-root start_ARG italic_N end_ARG italic_β square-root start_ARG italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT end_ARG bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) over¯ start_ARG bold_w end_ARG italic_x + italic_z , (6)

where z𝑧zitalic_z is the received additive white Gaussian noise (AWGN) and z𝒞𝒩(0,σ2)similar-to𝑧𝒞𝒩0superscript𝜎2z\sim\mathcal{CN}\left(0,\sigma^{2}\right)italic_z ∼ caligraphic_C caligraphic_N ( 0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). Moreover, Ptotsubscript𝑃totP_{\rm tot}italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT denotes the total transmit power of the BS. Then, the achievable rate in bits/second/hertz (bps/Hz) is given by

R=log2(1+PtotN|β|2|𝐛H(r0,θ0)𝐰¯|2σ2).𝑅subscript21subscript𝑃tot𝑁superscript𝛽2superscriptsuperscript𝐛𝐻subscript𝑟0subscript𝜃0¯𝐰2superscript𝜎2R=\log_{2}\left(1+\frac{P_{\rm tot}N|\beta|^{2}|\mathbf{b}^{H}(r_{0},\theta_{0% })\bar{\mathbf{w}}|^{2}}{\sigma^{2}}\right).italic_R = roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 + divide start_ARG italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT italic_N | italic_β | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) over¯ start_ARG bold_w end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) . (7)

III Benchmark Beam Training Schemes

In this section, two benchmark beam training schemes and their drawbacks are presented.

III-A 2D Exhaustive-Search Beam Training Method

The authors in [27] proposed a polar-domain codebook, each steering a focusing beam to a specific location. In particular, the angular domain is uniformly sampled, while the range domain is non-uniformly sampled. Specifically, the polar-domain codebook is given by

𝒲¯Pol={𝒲¯1,,𝒲¯n¯,,𝒲¯N},subscript¯𝒲Polsubscript¯𝒲1subscript¯𝒲¯𝑛subscript¯𝒲𝑁\bar{\mathcal{W}}_{\rm{Pol}}=\{\bar{\mathcal{W}}_{1},\cdots,\bar{\mathcal{W}}_% {\bar{n}},\cdots,\bar{\mathcal{W}}_{N}\},over¯ start_ARG caligraphic_W end_ARG start_POSTSUBSCRIPT roman_Pol end_POSTSUBSCRIPT = { over¯ start_ARG caligraphic_W end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG caligraphic_W end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG caligraphic_W end_ARG start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } , (8)

where 𝒲¯n¯={𝐰¯n¯,1,𝐰¯n¯,v,𝐰¯n¯,V}subscript¯𝒲¯𝑛subscript¯𝐰¯𝑛1subscript¯𝐰¯𝑛𝑣subscript¯𝐰¯𝑛𝑉\bar{\mathcal{W}}_{\bar{n}}=\{\bar{\mathbf{w}}_{\bar{n},1}\cdots,\bar{\mathbf{% w}}_{\bar{n},v},\cdots\bar{\mathbf{w}}_{\bar{n},V}\}over¯ start_ARG caligraphic_W end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT = { over¯ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG , 1 end_POSTSUBSCRIPT ⋯ , over¯ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG , italic_v end_POSTSUBSCRIPT , ⋯ over¯ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG , italic_V end_POSTSUBSCRIPT } denotes the sub-codebook steering V𝑉Vitalic_V beams towards the angles θn¯=2n¯N+1N,n¯𝒩¯{1,2,,N}formulae-sequencesubscript𝜃¯𝑛2¯𝑛𝑁1𝑁for-all¯𝑛¯𝒩12𝑁\theta_{\bar{n}}=\frac{2{\bar{n}}-N+1}{N},\forall{\bar{n}}\in\bar{\mathcal{N}}% \triangleq\{1,2,\cdots,N\}italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT = divide start_ARG 2 over¯ start_ARG italic_n end_ARG - italic_N + 1 end_ARG start_ARG italic_N end_ARG , ∀ over¯ start_ARG italic_n end_ARG ∈ over¯ start_ARG caligraphic_N end_ARG ≜ { 1 , 2 , ⋯ , italic_N }. Mathematically, we have 𝐰¯n¯,v=𝐛(rn¯,v,θn¯)subscript¯𝐰¯𝑛𝑣𝐛subscript𝑟¯𝑛𝑣subscript𝜃¯𝑛\bar{\mathbf{w}}_{{\bar{n}},v}=\mathbf{b}\left(r_{{\bar{n}},v},\theta_{\bar{n}% }\right)over¯ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG , italic_v end_POSTSUBSCRIPT = bold_b ( italic_r start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG , italic_v end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT ) where rn¯,v=1vαΔ(1θn¯2),v𝒱{1,2,3,V}formulae-sequencesubscript𝑟¯𝑛𝑣1𝑣subscript𝛼Δ1superscriptsubscript𝜃¯𝑛2for-all𝑣𝒱123𝑉r_{{\bar{n}},v}=\frac{1}{v}\alpha_{\Delta}\left(1-\theta_{{\bar{n}}}^{2}\right% ),\quad\forall v\in\mathcal{V}\triangleq\{1,2,3,\cdots V\}italic_r start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG , italic_v end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_v end_ARG italic_α start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ( 1 - italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) , ∀ italic_v ∈ caligraphic_V ≜ { 1 , 2 , 3 , ⋯ italic_V } with αΔN2d022λβΔ2subscript𝛼Δsuperscript𝑁2superscriptsubscript𝑑022𝜆superscriptsubscript𝛽Δ2\alpha_{\Delta}\triangleq\frac{N^{2}d_{0}^{2}}{2\lambda\beta_{\Delta}^{2}}italic_α start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ≜ divide start_ARG italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_λ italic_β start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG. Moreover, βΔsubscript𝛽Δ\beta_{\Delta}italic_β start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT is a constant corresponding to the quantization loss in the range domain [27]. Given the polar-domain codebook 𝒲¯Polsubscript¯𝒲Pol\bar{\mathcal{W}}_{\rm{Pol}}over¯ start_ARG caligraphic_W end_ARG start_POSTSUBSCRIPT roman_Pol end_POSTSUBSCRIPT, a two-dimensional exhaustive search method can be directly applied in both the angular and range domains to search the best codeword, which yields the maximum received SNR at the user. The beam training overhead of this exhaustive-search beam training method is T(ex)=NVsuperscript𝑇ex𝑁𝑉T^{\rm{(ex)}}=NVitalic_T start_POSTSUPERSCRIPT ( roman_ex ) end_POSTSUPERSCRIPT = italic_N italic_V, which is proportional to the product of the number of antennas and range samples. When the number of antenna is large, the beam training overhead is unaffordable.

III-B Two-Phase Near-field Beam Training

To further reduce the beam training overhead, the authors in [28] proposed a two-phase near-field beam training method, which explored the so-called energy-spread phenomenon. Specifically, when the far-field DFT codebook is used for the angular swee** in the near-field, it is observed that the user angle approximated lies in the middle of an angular support region, which is to be defined in Section IV. Mathematically, the conventional DFT codebook is given by

𝐰¯n¯subscript¯𝐰¯𝑛\displaystyle\bar{\mathbf{w}}_{\bar{n}}over¯ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT =𝐚(θn¯)1N[1,eȷπθn¯,,eȷπ(N1)θn¯],absent𝐚subscript𝜃¯𝑛1𝑁1superscript𝑒italic-ȷ𝜋subscript𝜃¯𝑛superscript𝑒italic-ȷ𝜋𝑁1subscript𝜃¯𝑛\displaystyle=\mathbf{a}(\theta_{\bar{n}})\triangleq\frac{1}{\sqrt{N}}\left[1,% e^{-\jmath\pi\theta_{\bar{n}}},\cdots,e^{-\jmath\pi(N-1)\theta_{\bar{n}}}% \right],= bold_a ( italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT ) ≜ divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_N end_ARG end_ARG [ 1 , italic_e start_POSTSUPERSCRIPT - italic_ȷ italic_π italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_ȷ italic_π ( italic_N - 1 ) italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] , (9)

where θn¯=2n¯N+1N,n¯𝒩¯formulae-sequencesubscript𝜃¯𝑛2¯𝑛𝑁1𝑁for-all¯𝑛¯𝒩\theta_{\bar{n}}=\frac{2{\bar{n}}-N+1}{N},\forall{\bar{n}}\in\bar{\mathcal{N}}italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT = divide start_ARG 2 over¯ start_ARG italic_n end_ARG - italic_N + 1 end_ARG start_ARG italic_N end_ARG , ∀ over¯ start_ARG italic_n end_ARG ∈ over¯ start_ARG caligraphic_N end_ARG. Given this observation, they explore the conventional DFT codebook in the first phase to perform beam swee**, which estimates the user angle information. Then, given the candidate user angle, the polar-domain codebook in (8) is used to search the best user range in the second phase. The beam training overhead of the two-phase beam training method is T(2P)=N+KVsuperscript𝑇2P𝑁𝐾𝑉T^{(\rm 2P)}=N+KVitalic_T start_POSTSUPERSCRIPT ( 2 roman_P ) end_POSTSUPERSCRIPT = italic_N + italic_K italic_V with K𝐾Kitalic_K representing the number of candidate user angles. Although this method significantly reduces the beam training overhead of the exhaustive-search method, the overhead of the two-phase beam training method is still proportional to the number of antennas, which is prohibitively high as N𝑁Nitalic_N is sufficiently large.

To address the above issues, we propose a new near-field beam training method using a proposed sparse DFT codebook, which is equivalent to sparsely activating the dense ULA equipped by the BS, yielding extremely lower overhead as compared with various benchmark schemes.

IV Received Beam Pattern of the Sparse DFT Codebook

In this section, we first introduce the sparse DFT codebook and then analyze its received beam pattern. Moreover, we design a periodical beam training codebook to reduce overhead and propose to activate a central subarray for resolving the angular ambiguity. The main definitions in this section are given as follows, including the received beam pattern and angular support.

Definition 1.

Given a fixed near-field user located at (r0,θ0)subscript𝑟0subscript𝜃0(r_{0},\theta_{0})( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) and an arbitrary far-field beamforming vector 𝐰¯=𝐚(θ)¯𝐰𝐚𝜃\bar{\mathbf{w}}=\mathbf{a}(\theta)over¯ start_ARG bold_w end_ARG = bold_a ( italic_θ ) steering the beam towards the angle θ𝜃\thetaitalic_θ, the received beam pattern at the user is defined as

f(r0,θ0;θ)|𝐛H(r0,θ0)𝐚(θ)|,θ.𝑓subscript𝑟0subscript𝜃0𝜃superscript𝐛𝐻subscript𝑟0subscript𝜃0𝐚𝜃for-all𝜃f(r_{0},\theta_{0};\mathbf{\theta})\triangleq|\mathbf{b}^{H}(r_{0},\theta_{0})% \mathbf{a}(\theta)|,\forall\theta.italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) ≜ | bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_a ( italic_θ ) | , ∀ italic_θ . (10)
Definition 2.

Given a near-field channel response vector 𝐛H(r0,θ0)superscript𝐛𝐻subscript𝑟0subscript𝜃0\mathbf{b}^{H}(r_{0},\theta_{0})bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) and a far-field beamforming vector 𝐰¯=𝐚(θ)¯𝐰𝐚𝜃\bar{\mathbf{w}}=\mathbf{a}(\theta)over¯ start_ARG bold_w end_ARG = bold_a ( italic_θ ) with θ𝜃\theta\in\mathcal{L}italic_θ ∈ caligraphic_L, the 3333-dB angular support 𝒜μ(r0,θ0)superscriptsubscript𝒜𝜇subscript𝑟0subscript𝜃0\mathcal{A}_{\mu}^{\mathcal{L}}(r_{0},\theta_{0})caligraphic_A start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_L end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) in the region {\mathcal{L}}caligraphic_L is defined by [28]

𝒜μ(r0,θ0)={θf(r0,θ0,θ)>κmaxθf(r0,θ0,θ)},subscriptsuperscript𝒜𝜇subscript𝑟0subscript𝜃0conditional-set𝜃𝑓subscript𝑟0subscript𝜃0𝜃𝜅subscript𝜃𝑓subscript𝑟0subscript𝜃0𝜃\displaystyle\mathcal{A}^{\mathcal{L}}_{\mu}(r_{0},\theta_{0})\!\!=\!\!\left\{% \theta\mid f\left(r_{0},\theta_{0},\theta\right)\!>\!\kappa\max_{\theta\in% \mathcal{L}}f\left(r_{0},\theta_{0},\theta\right)\right\},caligraphic_A start_POSTSUPERSCRIPT caligraphic_L end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = { italic_θ ∣ italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ ) > italic_κ roman_max start_POSTSUBSCRIPT italic_θ ∈ caligraphic_L end_POSTSUBSCRIPT italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ ) } , (11)

where κ=10μ/10𝜅superscript10𝜇10\kappa=10^{\mu/10}italic_κ = 10 start_POSTSUPERSCRIPT italic_μ / 10 end_POSTSUPERSCRIPT. Moreover, let θleftsubscript𝜃left\theta_{\rm left}italic_θ start_POSTSUBSCRIPT roman_left end_POSTSUBSCRIPT and θrightsubscript𝜃right\theta_{\rm right}italic_θ start_POSTSUBSCRIPT roman_right end_POSTSUBSCRIPT be the smallest and largest angle in 𝒜μ(r0,θ0)subscriptsuperscript𝒜𝜇subscript𝑟0subscript𝜃0\mathcal{A}^{\mathcal{L}}_{\mu}(r_{0},\theta_{0})caligraphic_A start_POSTSUPERSCRIPT caligraphic_L end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ). Then, its angular support width is defined as

Γμ(r0,θ0)=θrightθleft.superscriptsubscriptΓ𝜇subscript𝑟0subscript𝜃0subscript𝜃rightsubscript𝜃left\Gamma_{\mu}^{\mathcal{L}}(r_{0},\theta_{0})=\theta_{\rm right}-\theta_{\rm left}.roman_Γ start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_L end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = italic_θ start_POSTSUBSCRIPT roman_right end_POSTSUBSCRIPT - italic_θ start_POSTSUBSCRIPT roman_left end_POSTSUBSCRIPT . (12)
Refer to caption
Figure 2: Illustration of sparse linear array.

IV-A Sparse DFT Codebook

Each column of the DFT Codebook in (9) is a far-field channel response vector. For the sparse DFT codebook, we sample each column of the original DFT codebook using an interval of U𝑈Uitalic_U, while the positions that are not sampled are padded with zeros. As such, n¯¯𝑛\bar{n}over¯ start_ARG italic_n end_ARG-th column of the sparse DFT Codebook 𝐰~n¯N×1subscript~𝐰¯𝑛superscript𝑁1\tilde{\mathbf{w}}_{\bar{n}}\in\mathbb{C}^{N\times 1}over~ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × 1 end_POSTSUPERSCRIPT is given by 𝐰~n¯=𝐰¯n¯𝐧(U)subscript~𝐰¯𝑛direct-productsubscript¯𝐰¯𝑛𝐧𝑈\tilde{\mathbf{w}}_{\bar{n}}=\bar{\mathbf{w}}_{\bar{n}}\odot\mathbf{n}(U)over~ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT = over¯ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT ⊙ bold_n ( italic_U ), where

𝐧H(U)=[1,0,,0,U11,,1]\mathbf{n}^{H}(U)=[1,\underbrace{0,\cdots,0,}_{U-1}1,\cdots,1]bold_n start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_U ) = [ 1 , under⏟ start_ARG 0 , ⋯ , 0 , end_ARG start_POSTSUBSCRIPT italic_U - 1 end_POSTSUBSCRIPT 1 , ⋯ , 1 ] (13)

denotes the sampling vector. It can be verified that the number of non-zero elements in 𝐰~n¯subscript~𝐰¯𝑛\tilde{\mathbf{w}}_{\bar{n}}over~ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT is Q=N1U+1𝑄𝑁1𝑈1Q=\frac{N-1}{U}+1italic_Q = divide start_ARG italic_N - 1 end_ARG start_ARG italic_U end_ARG + 1 (assuming Q𝑄Qitalic_Q is an integer for convenience). Moreover, we rearrange the non-zero elements in 𝐰~n¯subscript~𝐰¯𝑛\tilde{\mathbf{w}}_{\bar{n}}over~ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT into a new vector 𝐰n¯Q×1subscript𝐰¯𝑛superscript𝑄1{\mathbf{w}_{\bar{n}}}\in\mathbb{C}^{Q\times 1}bold_w start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT ∈ blackboard_C start_POSTSUPERSCRIPT italic_Q × 1 end_POSTSUPERSCRIPT, which is given by

𝐰n¯=𝐚SLA(θn¯)=1Q[1,eȷπUθn¯,,eȷπ(Q1)Uθn¯]H,subscript𝐰¯𝑛subscript𝐚SLAsubscript𝜃¯𝑛1𝑄superscript1superscript𝑒italic-ȷ𝜋𝑈subscript𝜃¯𝑛superscript𝑒italic-ȷ𝜋𝑄1𝑈subscript𝜃¯𝑛𝐻{\mathbf{w}}_{\bar{n}}=\mathbf{a}_{\rm SLA}(\theta_{\bar{n}})=\frac{1}{\sqrt{Q% }}\bigg{[}1,e^{\jmath{\pi}U\theta_{\bar{n}}},\cdots,e^{\jmath{\pi}(Q-1)U\theta% _{\bar{n}}}\bigg{]}^{H}\!\!,bold_w start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT = bold_a start_POSTSUBSCRIPT roman_SLA end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT ) = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_Q end_ARG end_ARG [ 1 , italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π italic_U italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , ⋯ , italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π ( italic_Q - 1 ) italic_U italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT , (14)

referred to as the sparse far-field beamforming vector.

It is noteworthy that this sampling method is equivalent to sparsely activating a number of antennas of the the XL-array with an interval of U𝑈Uitalic_U, which effectively transforms the equipped ULA into an SLA with an inter-element spacing of Ud0𝑈subscript𝑑0Ud_{0}italic_U italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, as illustrated in Fig. 2. Furthermore, for brevity, we denote the channel response vector of the effective SLA as

[𝐛SLAH(r0,θ0)]q=1Qeȷ2πrqλ,q𝒬,formulae-sequencesubscriptdelimited-[]subscriptsuperscript𝐛𝐻SLAsubscript𝑟0subscript𝜃0𝑞1𝑄superscript𝑒italic-ȷ2𝜋subscript𝑟𝑞𝜆for-all𝑞𝒬\left[\mathbf{b}^{H}_{\rm SLA}\left(r_{0},\theta_{0}\right)\right]_{q}=\frac{1% }{\sqrt{Q}}e^{-\frac{\jmath 2\pi r_{q}}{\lambda}},\forall q\in\mathcal{Q},[ bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_SLA end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_Q end_ARG end_ARG italic_e start_POSTSUPERSCRIPT - divide start_ARG italic_ȷ 2 italic_π italic_r start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT end_ARG start_ARG italic_λ end_ARG end_POSTSUPERSCRIPT , ∀ italic_q ∈ caligraphic_Q , (15)

where 𝒬{0,±1,,±Q12}𝒬0plus-or-minus1plus-or-minus𝑄12\mathcal{Q}\triangleq\{0,\pm 1,\cdots,\pm\frac{Q-1}{2}\}caligraphic_Q ≜ { 0 , ± 1 , ⋯ , ± divide start_ARG italic_Q - 1 end_ARG start_ARG 2 end_ARG } denotes the set of the SLA antenna index and rq=r022qUd0r0θ0+(qUd0)2subscript𝑟𝑞superscriptsubscript𝑟022𝑞𝑈subscript𝑑0subscript𝑟0subscript𝜃0superscript𝑞𝑈subscript𝑑02r_{q}=\sqrt{r_{0}^{2}-2qUd_{0}r_{0}\theta_{0}+(qUd_{0})^{2}}italic_r start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT = square-root start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 2 italic_q italic_U italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( italic_q italic_U italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG represents the range between the user and q𝑞qitalic_q-th antenna of the SLA. Similar to (5), rqsubscript𝑟𝑞r_{q}italic_r start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT can be approximated as rqrqUd0θ0+q2(Ud0)2(1θ02)2r0subscript𝑟𝑞𝑟𝑞𝑈subscript𝑑0subscript𝜃0superscript𝑞2superscript𝑈subscript𝑑021superscriptsubscript𝜃022subscript𝑟0r_{q}\approx r-qUd_{0}\theta_{0}+\frac{q^{2}(Ud_{0})^{2}(1-\theta_{0}^{2})}{2r% _{0}}italic_r start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ≈ italic_r - italic_q italic_U italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + divide start_ARG italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_U italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( 1 - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG start_ARG 2 italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG with Fresnel approximation. Then, the received signal with beamforming vector 𝐰n¯subscript𝐰¯𝑛\mathbf{w}_{\bar{n}}bold_w start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT at the user is rewritten as

y(𝐰n¯)=QβPtot𝐛SLAH(r0,θ0)𝐰n¯x+z.𝑦subscript𝐰¯𝑛𝑄𝛽subscript𝑃totsubscriptsuperscript𝐛𝐻SLAsubscript𝑟0subscript𝜃0subscript𝐰¯𝑛𝑥𝑧\displaystyle y(\mathbf{w}_{\bar{n}})=\sqrt{Q}\beta\sqrt{P_{\rm tot}}\mathbf{b% }^{H}_{\rm SLA}(r_{0},\theta_{0})\mathbf{w}_{\bar{n}}x+z.italic_y ( bold_w start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT ) = square-root start_ARG italic_Q end_ARG italic_β square-root start_ARG italic_P start_POSTSUBSCRIPT roman_tot end_POSTSUBSCRIPT end_ARG bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_SLA end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_w start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT italic_x + italic_z . (16)

Moreover, the received beam pattern at the user in (10) can be rewritten as

f(r0,θ0;θn¯)𝑓subscript𝑟0subscript𝜃0subscript𝜃¯𝑛\displaystyle f(r_{0},\theta_{0};\theta_{\bar{n}})italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT ) =|𝐛H(r0,θ0)𝐰¯n¯|=|𝐛SLAH(r0,θ0)𝐰n¯|absentsuperscript𝐛𝐻subscript𝑟0subscript𝜃0subscript¯𝐰¯𝑛subscriptsuperscript𝐛𝐻SLAsubscript𝑟0subscript𝜃0subscript𝐰¯𝑛\displaystyle=|\mathbf{b}^{H}(r_{0},\theta_{0}){\bar{\mathbf{w}}_{\bar{n}}}|=|% \mathbf{b}^{H}_{\rm SLA}(r_{0},\theta_{0}){\mathbf{w}}_{\bar{n}}|= | bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) over¯ start_ARG bold_w end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT | = | bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_SLA end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_w start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT | (17)
=|𝐛SLAH(r0,θ0)𝐚SLA(θn¯)|.absentsuperscriptsubscript𝐛SLA𝐻subscript𝑟0subscript𝜃0subscript𝐚SLAsubscript𝜃¯𝑛\displaystyle=|\mathbf{b}_{\rm SLA}^{H}(r_{0},\theta_{0})\mathbf{a}_{\rm SLA}(% \theta_{\bar{n}})|.= | bold_b start_POSTSUBSCRIPT roman_SLA end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_a start_POSTSUBSCRIPT roman_SLA end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT ) | .

IV-B Near-field Received Beam Pattern

To obtain more insights, we first characterize the received beam pattern at the user with sparse far-field beamforming vectors spanning in the continuous spatial angles. Let 𝐰(θ)=𝐚SLA(θ)𝐰𝜃subscript𝐚SLA𝜃\mathbf{w}(\theta)=\mathbf{a}_{\rm SLA}(\theta)bold_w ( italic_θ ) = bold_a start_POSTSUBSCRIPT roman_SLA end_POSTSUBSCRIPT ( italic_θ ) represent a sparse far-field beamforming vector, for which θ=θn¯,n¯𝒩¯formulae-sequence𝜃subscript𝜃¯𝑛for-all¯𝑛¯𝒩\theta=\theta_{\bar{n}},\forall{\bar{n}}\in{\bar{\mathcal{N}}}italic_θ = italic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG end_POSTSUBSCRIPT , ∀ over¯ start_ARG italic_n end_ARG ∈ over¯ start_ARG caligraphic_N end_ARG is the discrete sampled angle. We first characterize the received beam pattern of the sparse far-field beamforming vector 𝐰𝐰{\mathbf{w}}bold_w as follows.

Lemma 1.

For a sampled beamforming vector 𝐰𝐰{\mathbf{w}}bold_w parameterized by {Q,U}𝑄𝑈\{Q,U\}{ italic_Q , italic_U }, the received beam pattern at the user is given by

f(r0,θ0;θ)𝑓subscript𝑟0subscript𝜃0𝜃\displaystyle f\left(r_{0},\theta_{0};\theta\right)italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ )
(a1)subscript𝑎1\displaystyle\!\!\!\!\!\!\overset{(a_{1})}{\approx}start_OVERACCENT ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG ≈ end_ARG 1Q|q𝒬exp(ȷπqUΔB1+ȷπλq2(Ud0)21θ02r0B2)|1𝑄subscript𝑞𝒬subscriptitalic-ȷ𝜋𝑞𝑈Δsubscript𝐵1subscriptitalic-ȷ𝜋𝜆superscript𝑞2superscript𝑈subscript𝑑021superscriptsubscript𝜃02subscript𝑟0subscript𝐵2\displaystyle\frac{1}{Q}\!\!\left|\sum_{q\in\mathcal{Q}}\!\exp{\left(% \underbrace{\jmath{\pi qU\Delta}}_{B_{1}}+\underbrace{\jmath{\frac{\pi}{% \lambda}q^{2}(Ud_{0})^{2}\frac{1-\theta_{0}^{2}}{r_{0}}}}_{B_{2}}\right)}\!% \right|\!divide start_ARG 1 end_ARG start_ARG italic_Q end_ARG | ∑ start_POSTSUBSCRIPT italic_q ∈ caligraphic_Q end_POSTSUBSCRIPT roman_exp ( under⏟ start_ARG italic_ȷ italic_π italic_q italic_U roman_Δ end_ARG start_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + under⏟ start_ARG italic_ȷ divide start_ARG italic_π end_ARG start_ARG italic_λ end_ARG italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_U italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG 1 - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG end_ARG start_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) | (18)
\displaystyle\triangleq f^(r0,θ0;θ),^𝑓subscript𝑟0subscript𝜃0𝜃\displaystyle\hat{f}(r_{0},\theta_{0};\theta),over^ start_ARG italic_f end_ARG ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) ,

where Δθθ0Δ𝜃subscript𝜃0\Delta\triangleq\theta-\theta_{\rm 0}roman_Δ ≜ italic_θ - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and (a1)subscript𝑎1(a_{1})( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) is due to the Fresnel approximation and shown to be accurate in [41].

Proposition 1 (The periodicity of θ𝜃\thetaitalic_θ).

f(r0,θ0;θ)𝑓subscript𝑟0subscript𝜃0𝜃f\left(r_{0},\theta_{0};\theta\right)italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) is a periodic function of θ𝜃\thetaitalic_θ with the period of 2U2𝑈\frac{2}{U}divide start_ARG 2 end_ARG start_ARG italic_U end_ARG. Mathematically, we have

f(r0,θ0;θ)=f(r0,θ0;θ+2kU),k.formulae-sequence𝑓subscript𝑟0subscript𝜃0𝜃𝑓subscript𝑟0subscript𝜃0𝜃2𝑘𝑈for-all𝑘f\left(r_{0},\theta_{0};\theta\right)=f\left(r_{0},\theta_{0};\theta+\frac{2k}% {U}\right),\forall k\in\mathbb{Z}.italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) = italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ + divide start_ARG 2 italic_k end_ARG start_ARG italic_U end_ARG ) , ∀ italic_k ∈ blackboard_Z . (19)

Proof: For an arbitrary integer k𝑘kitalic_k and 𝐚(θ+2kU)𝐚𝜃2𝑘𝑈{\bf{a}}(\theta+\frac{2k}{U})bold_a ( italic_θ + divide start_ARG 2 italic_k end_ARG start_ARG italic_U end_ARG ), we have

[𝐚(θ+2k/U)]qsubscriptdelimited-[]𝐚𝜃2𝑘𝑈𝑞\displaystyle{[{\bf{a}}(\theta+{2k}/{U})]_{q}}[ bold_a ( italic_θ + 2 italic_k / italic_U ) ] start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT =exp(ȷπqU(θ+2k/U))absentitalic-ȷ𝜋𝑞𝑈𝜃2𝑘𝑈\displaystyle=\exp(\jmath\pi qU(\theta+{2k}/{U}))= roman_exp ( italic_ȷ italic_π italic_q italic_U ( italic_θ + 2 italic_k / italic_U ) )
=exp(ȷπqUθ)exp(ȷ2πkqU)absentitalic-ȷ𝜋𝑞𝑈𝜃italic-ȷ2𝜋𝑘𝑞𝑈\displaystyle=\exp(\jmath\pi qU\theta)\exp(\jmath 2\pi kqU)= roman_exp ( italic_ȷ italic_π italic_q italic_U italic_θ ) roman_exp ( italic_ȷ 2 italic_π italic_k italic_q italic_U )
=exp(ȷπqUθ)=[𝐚(θ)]q.absentitalic-ȷ𝜋𝑞𝑈𝜃subscriptdelimited-[]𝐚𝜃𝑞\displaystyle=\exp(\jmath\pi qU\theta)={[{\bf{a}}(\theta)]_{q}}.= roman_exp ( italic_ȷ italic_π italic_q italic_U italic_θ ) = [ bold_a ( italic_θ ) ] start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT .

Hence, we have f(r0,θ0;θ)=f(r0,θ0;θ+k2U)𝑓subscript𝑟0subscript𝜃0𝜃𝑓subscript𝑟0subscript𝜃0𝜃𝑘2𝑈f\left(r_{0},\theta_{0};\theta\right)=f\left(r_{0},\theta_{0};\theta+k\frac{2}% {U}\right)italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) = italic_f ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ + italic_k divide start_ARG 2 end_ARG start_ARG italic_U end_ARG ) and thus complete the proof.  \Box

Refer to caption
Figure 3: Received beam pattern, where N=257𝑁257N=257italic_N = 257, U=16𝑈16U=16italic_U = 16 and f=30𝑓30f=30italic_f = 30 GHz. The actual user angle is θ0=0subscript𝜃00\theta_{0}=0italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0 marked by the red line.

In Fig. 3, we plot the received beam pattern versus the spatial angle θ𝜃\thetaitalic_θ, where the periodicity of θ𝜃\thetaitalic_θ is exhibited. Moreover, it is observed that the user angle is located in the middle of the angular support 𝒜μ0(r0,θ0)subscriptsuperscript𝒜subscript0𝜇subscript𝑟0subscript𝜃0\mathcal{A}^{\mathcal{L}_{0}}_{\mu}(r_{0},\theta_{0})caligraphic_A start_POSTSUPERSCRIPT caligraphic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), where 0=[θ01/U,θ0+1/U)subscript0subscript𝜃01𝑈subscript𝜃01𝑈\mathcal{L}_{0}=[\theta_{0}-1/U,\theta_{0}+1/U)caligraphic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = [ italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 / italic_U , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 / italic_U ) (a period 2U2𝑈\frac{2}{U}divide start_ARG 2 end_ARG start_ARG italic_U end_ARG). Then, in the following, we prove that this observation holds for arbitrary user location (r0,θ0)subscript𝑟0subscript𝜃0(r_{0},\theta_{0})( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ).

Due to the periodicity of θ𝜃\thetaitalic_θ, we only need to focus on the region 0subscript0\mathcal{L}_{0}caligraphic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, i.e., Δ[1/U,1/U)Δ1𝑈1𝑈\Delta\in[-1/U,1/U)roman_Δ ∈ [ - 1 / italic_U , 1 / italic_U ), which inexplicitly contains useful user angle information. Given Δ[1/U,1/U)Δ1𝑈1𝑈\Delta\in[-1/U,1/U)roman_Δ ∈ [ - 1 / italic_U , 1 / italic_U ), (1) can be approximated as follows.

Lemma 2.

When Δ[1/U,1/U)Δ1𝑈1𝑈\Delta\in[-1/U,1/U)roman_Δ ∈ [ - 1 / italic_U , 1 / italic_U ), the received beam pattern f^(r0,θ0;θ)^𝑓subscript𝑟0subscript𝜃0𝜃\hat{f}(r_{0},\theta_{0};\theta)over^ start_ARG italic_f end_ARG ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) in (1) can be approximated as

f^(r0,θ0;θ)|G(β1,β2)|.^𝑓subscript𝑟0subscript𝜃0𝜃𝐺subscript𝛽1subscript𝛽2\displaystyle\hat{f}\left(r_{0},\theta_{0};\theta\right)\approx\left|G(\beta_{% 1},\beta_{2})\right|.over^ start_ARG italic_f end_ARG ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) ≈ | italic_G ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) | . (20)

Specifically, we have

G(β1,β2)C^(β1,β2)+ȷ(S^(β1,β2)2β2,G(\beta_{1},\beta_{2})\triangleq\frac{\widehat{C}(\beta_{1},\beta_{2})+\jmath(% \widehat{S}(\beta_{1},\beta_{2})}{2\beta_{2}},italic_G ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≜ divide start_ARG over^ start_ARG italic_C end_ARG ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) + italic_ȷ ( over^ start_ARG italic_S end_ARG ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG 2 italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG , (21)

where C^(β1,β2)C(β1+β2)C(β1β2)^𝐶subscript𝛽1subscript𝛽2𝐶subscript𝛽1subscript𝛽2𝐶subscript𝛽1subscript𝛽2\widehat{C}(\beta_{1},\beta_{2})\triangleq{C}(\beta_{1}+\beta_{2})-C(\beta_{1}% -\beta_{2})over^ start_ARG italic_C end_ARG ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≜ italic_C ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_C ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) and S^(β1,β2)S(β1+β2)S(β1β2)^𝑆subscript𝛽1subscript𝛽2𝑆subscript𝛽1subscript𝛽2𝑆subscript𝛽1subscript𝛽2\widehat{S}(\beta_{1},\beta_{2})\triangleq S(\beta_{1}+\beta_{2})-S(\beta_{1}-% \beta_{2})over^ start_ARG italic_S end_ARG ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≜ italic_S ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_S ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). Further, C(x)𝐶𝑥C(x)italic_C ( italic_x ) and S(x)𝑆𝑥S(x)italic_S ( italic_x ) are Fresnel integrals, which are given by

C(x)=0xcos(π2t2)dt,S(x)=0xsin(π2t2)dt.formulae-sequence𝐶𝑥superscriptsubscript0𝑥𝜋2superscript𝑡2differential-d𝑡𝑆𝑥superscriptsubscript0𝑥𝜋2superscript𝑡2differential-d𝑡C(x)=\int_{0}^{x}\cos(\frac{\pi}{2}t^{2}){\rm d}t,S(x)=\int_{0}^{x}\sin(\frac{% \pi}{2}t^{2}){\rm d}t.italic_C ( italic_x ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT roman_cos ( divide start_ARG italic_π end_ARG start_ARG 2 end_ARG italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) roman_d italic_t , italic_S ( italic_x ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT roman_sin ( divide start_ARG italic_π end_ARG start_ARG 2 end_ARG italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) roman_d italic_t .

Moreover, β1subscript𝛽1\beta_{1}italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and β2subscript𝛽2\beta_{2}italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are given by

β1=Δr0d0(1θ02),β2=QU2d0(1θ02)r0.formulae-sequencesubscript𝛽1Δsubscript𝑟0subscript𝑑01superscriptsubscript𝜃02subscript𝛽2𝑄𝑈2subscript𝑑01superscriptsubscript𝜃02subscript𝑟0\beta_{1}=\Delta\sqrt{\frac{r_{0}}{d_{0}(1-\theta_{0}^{2})}},\leavevmode% \nobreak\ \leavevmode\nobreak\ \beta_{2}=\frac{QU}{2}\sqrt{\frac{d_{0}(1-% \theta_{0}^{2})}{r_{0}}}.italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = roman_Δ square-root start_ARG divide start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 1 - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG end_ARG , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = divide start_ARG italic_Q italic_U end_ARG start_ARG 2 end_ARG square-root start_ARG divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 1 - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG end_ARG . (22)

Proof: Please refer to Appendix A.  \Box

As observed from (22), we have

β1β2=QUΔ2D2d0Δ.subscript𝛽1subscript𝛽2𝑄𝑈Δ2𝐷2subscript𝑑0Δ\beta_{1}\beta_{2}=\frac{QU\Delta}{2}\approx\frac{D}{2d_{0}}\Delta.italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = divide start_ARG italic_Q italic_U roman_Δ end_ARG start_ARG 2 end_ARG ≈ divide start_ARG italic_D end_ARG start_ARG 2 italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG roman_Δ .

Hence, the function G()𝐺G(\cdot)italic_G ( ⋅ ) can be rewritten as a function of {Δ,β2}Δsubscript𝛽2\{\Delta,\beta_{2}\}{ roman_Δ , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT }. It is observed that if the array aperture is fixed, β2subscript𝛽2\beta_{2}italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is only determined by the user location (r0,θ0)subscript𝑟0subscript𝜃0(r_{0},\theta_{0})( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), which indicates that each user location corresponds to a specific β2subscript𝛽2\beta_{2}italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Furthermore, as the user moves farther from the BS (consequently, the user is more likely to be located in the far-field region), the value of β2subscript𝛽2\beta_{2}italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT decreases. Supposing that N=257𝑁257N=257italic_N = 257, U=16𝑈16U=16italic_U = 16 and f=30𝑓30f=30italic_f = 30 GHz and the user is located at the Fresnel and focusing region (θ0=0subscript𝜃00\theta_{0}=0italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0), we have β2[1.68,3.57]subscript𝛽21.683.57\beta_{2}\in[1.68,3.57]italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ [ 1.68 , 3.57 ]. In Fig. 4, we numerically show the function of G()𝐺G(\cdot)italic_G ( ⋅ ). Importantly, for any near-field users, the so-called energy-spread phenomenon [27] is still observed. Moreover, due to the period of θ𝜃\thetaitalic_θ, the energy-spread effects exist in each period interval (2U2𝑈\frac{2}{U}divide start_ARG 2 end_ARG start_ARG italic_U end_ARG). Then, two key observations are obtained.

Refer to caption
Figure 4: Received beam pattern in one period, where N=257𝑁257N=257italic_N = 257, U=16𝑈16U=16italic_U = 16 and f=30𝑓30f=30italic_f = 30 GHz.
Refer to caption
Figure 5: Illustration of the proposed three-phase training scheme based on the sparse DFT codebook
Observation 1.

In Figs. 3 and 4, the energy-spread effects exhibit in the whole radiating near-field region, and the received beam pattern with sampled beamforming vector 𝐰𝐰\mathbf{w}bold_w contains useful user angle information:

  • 1)

    User angle information: It is observed that the actual user angle approximately locates in the middle of the 3333 dB angular support in one period where θ0=[θ01/U,θ0+1/U]𝜃subscript0subscript𝜃01𝑈subscript𝜃01𝑈\theta\in{\mathcal{L}_{0}}=[\theta_{0}-1/U,\theta_{0}+1/U]italic_θ ∈ caligraphic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = [ italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 / italic_U , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 / italic_U ]. Mathematically, we have

    θ0Med(𝒜30(θ0,r0)).subscript𝜃0Medsuperscriptsubscript𝒜3subscript0subscript𝜃0subscript𝑟0\theta_{0}\approx{\rm{Med}}(\mathcal{A}_{3}^{\mathcal{L}_{0}}(\theta_{0},r_{0}% )).italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≈ roman_Med ( caligraphic_A start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) . (23)

    Moreover, we define u𝑢uitalic_u-th angular support as 𝒜3u(θ0,r0),superscriptsubscript𝒜3subscript𝑢subscript𝜃0subscript𝑟0\mathcal{A}_{3}^{\mathcal{L}_{u}}(\theta_{0},r_{0}),caligraphic_A start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , where u=0+2uUsubscript𝑢subscript02𝑢𝑈{\mathcal{L}_{u}}={\mathcal{L}_{0}+\frac{2u}{U}}caligraphic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = caligraphic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + divide start_ARG 2 italic_u end_ARG start_ARG italic_U end_ARG, u𝒰{±1,±2,,±(U1)}for-all𝑢𝒰plus-or-minus1plus-or-minus2plus-or-minus𝑈1\forall u\in\mathcal{U}\triangleq\{\pm 1,\pm 2,\cdots,\pm(U-1)\}∀ italic_u ∈ caligraphic_U ≜ { ± 1 , ± 2 , ⋯ , ± ( italic_U - 1 ) }. If we can employ the middle of u𝑢uitalic_u-th angular support denoted by θuMed(𝒜3u(θ0,r0))subscript𝜃𝑢Medsuperscriptsubscript𝒜3subscript𝑢subscript𝜃0subscript𝑟0\theta_{u}\approx{\rm{Med}}(\mathcal{A}_{3}^{\mathcal{L}_{u}}(\theta_{0},r_{0}))italic_θ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ≈ roman_Med ( caligraphic_A start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ), the BS can infer that the user angle is among the following candidate angles θuMed(𝒜3u(θ0,r0)),subscript𝜃𝑢Medsuperscriptsubscript𝒜3subscript𝑢subscript𝜃0subscript𝑟0\theta_{u}\approx{\rm{Med}}(\mathcal{A}_{3}^{\mathcal{L}_{u}}(\theta_{0},r_{0}% )),italic_θ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ≈ roman_Med ( caligraphic_A start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT caligraphic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) , where c=u+2cUsubscriptcsubscript𝑢2𝑐𝑈{\mathcal{L}_{\rm c}}={\mathcal{L}_{u}+\frac{2c}{U}}caligraphic_L start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT = caligraphic_L start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT + divide start_ARG 2 italic_c end_ARG start_ARG italic_U end_ARG, c𝒰for-all𝑐𝒰\forall c\in\mathcal{U}∀ italic_c ∈ caligraphic_U, due to the period of 2U2𝑈\frac{2}{U}divide start_ARG 2 end_ARG start_ARG italic_U end_ARG.

  • 2)

    Although energy spread effects can provide actual user angle information, it is worth noting that when the user is located near the boundary of Fresnel region (in Fig. 4, the boundary corresponds to β2=3.57subscript𝛽23.57\beta_{2}=3.57italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 3.57 and r0=7.42subscript𝑟07.42r_{0}=7.42italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 7.42m ), the angular support is distorted, which affects the accuracy of the beam training results. Moreover, by taking the noise and power fluctuation into account, the actual user angle may slightly deviate from the middle of the angular support.

Observation 1 indicates that the actual user angle can be estimated by finding the middle of the angular support in a period. In other words, the sparse DFT codebook within the angle range of a period (for example θ[1/U,1/U)𝜃1𝑈1𝑈\theta\in[-1/U,1/U)italic_θ ∈ [ - 1 / italic_U , 1 / italic_U )) contained all information for which the BS can infer U𝑈Uitalic_U candidate angles for actual user angle.

V Proposed Beam Training Scheme Enabled by Sparse DFT Codebook

In this section, we propose a three-phase near-field beam training method enabled by the sparse DFT codebook. Then, an optimization problem is formulated to minimize the beam training overhead.

V-A Phase 1: Beam Swee** with the Sparse DFT codebook

In Section IV-B, we show that the received beam pattern with the sparse DFT codebook exhibits a period of 2/U2𝑈2/U2 / italic_U and the user angle information can be inferred from the middle of the angular support within the region 0subscript0\mathcal{L}_{0}caligraphic_L start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

This motivates us to perform angular swee** in a period of θ𝜃\thetaitalic_θ (steering beams varying from 1/U1𝑈-1/U- 1 / italic_U to 1/U1𝑈1/U1 / italic_U) to estimate the potential user angles in the first phase, thereby further decreasing the beam training overhead. Considering that the beam width of the sparse far-field beamforming vector (14) is 4QU4𝑄𝑈\frac{4}{QU}divide start_ARG 4 end_ARG start_ARG italic_Q italic_U end_ARG [42], we sample the angular space as

θs=2sQU1QU,s=1,,QU.formulae-sequencesubscript𝜃𝑠2𝑠𝑄𝑈1𝑄𝑈𝑠1𝑄𝑈\theta_{s}=\frac{2s-QU-1}{QU},s=1,\cdots,QU.italic_θ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = divide start_ARG 2 italic_s - italic_Q italic_U - 1 end_ARG start_ARG italic_Q italic_U end_ARG , italic_s = 1 , ⋯ , italic_Q italic_U .

Then, the sparse DFT codebook in the first phase to perform the angular swee** is given by

𝒲DFT={𝐰QUQ+12,,𝐰g,,𝐰QU+Q12},subscript𝒲DFTsubscript𝐰𝑄𝑈𝑄12subscript𝐰𝑔subscript𝐰𝑄𝑈𝑄12{{\mathcal{W}}}_{\rm{DFT}}\!\!=\!\!\{{{\mathbf{w}}}_{\frac{QU-Q+1}{2}},\!% \cdots\!,{{\mathbf{w}}}_{g},\cdots,{{\mathbf{w}}}_{\frac{QU+Q-1}{2}}\},caligraphic_W start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT = { bold_w start_POSTSUBSCRIPT divide start_ARG italic_Q italic_U - italic_Q + 1 end_ARG start_ARG 2 end_ARG end_POSTSUBSCRIPT , ⋯ , bold_w start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT , ⋯ , bold_w start_POSTSUBSCRIPT divide start_ARG italic_Q italic_U + italic_Q - 1 end_ARG start_ARG 2 end_ARG end_POSTSUBSCRIPT } , (24)

where 𝐰g=𝐚SLA(θg)subscript𝐰𝑔subscript𝐚SLAsubscript𝜃𝑔{{\mathbf{w}}}_{g}=\mathbf{a}_{\rm SLA}(\theta_{g})bold_w start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT = bold_a start_POSTSUBSCRIPT roman_SLA end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ), θg=2gQU1QUsubscript𝜃𝑔2𝑔𝑄𝑈1𝑄𝑈\theta_{g}=\frac{2g-QU-1}{QU}italic_θ start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT = divide start_ARG 2 italic_g - italic_Q italic_U - 1 end_ARG start_ARG italic_Q italic_U end_ARG and the index g𝒢{QUQ+12,,QU+Q12}𝑔𝒢𝑄𝑈𝑄12𝑄𝑈𝑄12g\in\mathcal{G}\triangleq\{\frac{QU-Q+1}{2},\cdots,\frac{QU+Q-1}{2}\}italic_g ∈ caligraphic_G ≜ { divide start_ARG italic_Q italic_U - italic_Q + 1 end_ARG start_ARG 2 end_ARG , ⋯ , divide start_ARG italic_Q italic_U + italic_Q - 1 end_ARG start_ARG 2 end_ARG }.

Then, the BS sequentially transmits Q𝑄Qitalic_Q pilot symbols with the sparse DFT codebook in (24), while it tunes beam angles varying from 1/U1𝑈-1/U- 1 / italic_U to 1/U1𝑈1/U1 / italic_U as illustrated in Fig. 5(a). For each codeword, the received signal power at the user is given by

p(𝐰g)=|Qβ𝐛H(r0,θ0)𝐰gx+z|2,g𝒢.formulae-sequence𝑝subscript𝐰𝑔superscript𝑄𝛽superscript𝐛𝐻subscript𝑟0subscript𝜃0subscript𝐰𝑔𝑥𝑧2for-all𝑔𝒢p(\mathbf{w}_{g})=|\sqrt{Q}\beta\mathbf{b}^{H}(r_{0},\theta_{0})\mathbf{w}_{g}% x+z|^{2},\leavevmode\nobreak\ \forall g\in\mathcal{G}.italic_p ( bold_w start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) = | square-root start_ARG italic_Q end_ARG italic_β bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_w start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT italic_x + italic_z | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ∀ italic_g ∈ caligraphic_G . (25)

However, it is worth noting that when we transmit sparse far-field beamforming vectors varying from 1/U1𝑈-1/U- 1 / italic_U to 1/U1𝑈1/U1 / italic_U, the received beam pattern may exhibit a shifted angular support within two periods instead of a whole angular support in one period as illustrated in Fig. 6. To obtain a regular angular support, the user needs to perform the received-beam-pattern shifting based on the index of the codeword 𝐰gsubscript𝐰𝑔\mathbf{w}_{g}bold_w start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT with the lowest power. We denote the codeword with the lowest power as 𝐰subscript𝐰\mathbf{w}_{\ell}bold_w start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT corresponding to the angle θsubscript𝜃\theta_{\ell}italic_θ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT. Specifically, we shift the angle larger than θsubscript𝜃\theta_{\ell}italic_θ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT by one period (2/U2𝑈2/U2 / italic_U) as illustrated in Fig. 6. Then, the shifted indices of the codewords are arranged in a vector 𝐬=[Q+1,,Q+2,,]𝐬𝑄1𝑄2\mathbf{s}=[{\ell-Q+1},\cdots,{\ell-Q+2},\cdots,\ell]bold_s = [ roman_ℓ - italic_Q + 1 , ⋯ , roman_ℓ - italic_Q + 2 , ⋯ , roman_ℓ ]. Moreover, the equivalent received power of the shifted codewords can be recast as

𝒫={\displaystyle\mathcal{P}=\{caligraphic_P = { p(𝐰+1),p(𝐰+2),,p(𝐰(QU+Q1)/2),𝑝subscript𝐰1𝑝subscript𝐰2𝑝subscript𝐰𝑄𝑈𝑄12\displaystyle p(\mathbf{w}_{\ell+1}),p(\mathbf{w}_{\ell+2}),\cdots,p(\mathbf{w% }_{(QU+Q-1)/2}),italic_p ( bold_w start_POSTSUBSCRIPT roman_ℓ + 1 end_POSTSUBSCRIPT ) , italic_p ( bold_w start_POSTSUBSCRIPT roman_ℓ + 2 end_POSTSUBSCRIPT ) , ⋯ , italic_p ( bold_w start_POSTSUBSCRIPT ( italic_Q italic_U + italic_Q - 1 ) / 2 end_POSTSUBSCRIPT ) ,
p(𝐰(QUQ+1)/2),p(𝐰(QUQ+3)/2),,p(𝐰)}.\displaystyle p(\mathbf{w}_{(QU-Q+1)/2}),p(\mathbf{w}_{(QU-Q+3)/2}),\cdots,p(% \mathbf{w}_{\ell})\}.italic_p ( bold_w start_POSTSUBSCRIPT ( italic_Q italic_U - italic_Q + 1 ) / 2 end_POSTSUBSCRIPT ) , italic_p ( bold_w start_POSTSUBSCRIPT ( italic_Q italic_U - italic_Q + 3 ) / 2 end_POSTSUBSCRIPT ) , ⋯ , italic_p ( bold_w start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) } .

As such, we obtain a complete and regular angular support, which involves the user’s angle information based on Observation 1. Specifically, the indices of shifted codewords with significantly high power received by the users are given by

𝒮={𝐬δ|p(𝐰δ)>κmax𝒫},𝒮conditional-setsubscript𝐬𝛿𝑝subscript𝐰𝛿𝜅𝒫\mathcal{S}=\{\mathbf{s}_{\delta}|p({\mathbf{w}_{\delta}})>\kappa\max\mathcal{% P}\},caligraphic_S = { bold_s start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT | italic_p ( bold_w start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT ) > italic_κ roman_max caligraphic_P } , (26)

where κ0.5𝜅0.5\kappa\approx 0.5italic_κ ≈ 0.5. Then, the estimated angle is given by θsˇ=2sˇQU1QUsubscript𝜃ˇ𝑠2ˇ𝑠𝑄𝑈1𝑄𝑈\theta_{\check{s}}=\frac{2\check{s}-QU-1}{QU}italic_θ start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG end_POSTSUBSCRIPT = divide start_ARG 2 overroman_ˇ start_ARG italic_s end_ARG - italic_Q italic_U - 1 end_ARG start_ARG italic_Q italic_U end_ARG, where sˇ=Med(𝒮)ˇ𝑠Med𝒮\check{s}=\rm{Med}(\mathcal{S})overroman_ˇ start_ARG italic_s end_ARG = roman_Med ( caligraphic_S ) and the corresponding codeword is 𝐰sˇsubscript𝐰ˇ𝑠\mathbf{w}_{\check{s}}bold_w start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG end_POSTSUBSCRIPT. Then, according to the periodicity, the BS can infer U𝑈Uitalic_U candidate user angles, which are given by

θc=θsˇ+2uU,u𝒰.formulae-sequencesubscript𝜃csubscript𝜃ˇ𝑠2𝑢𝑈for-all𝑢𝒰\theta_{\rm c}={\theta}_{\check{s}}+\frac{2u}{U},\forall u\in\mathcal{U}.italic_θ start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG end_POSTSUBSCRIPT + divide start_ARG 2 italic_u end_ARG start_ARG italic_U end_ARG , ∀ italic_u ∈ caligraphic_U . (27)

V-B Phase 2: Angular Ambiguity Elimination

In the second phase, we propose an efficient method to resolve the angular ambiguity. The key idea is to utilize a central subarray to sequentially examine the candidate angles in (27). Specifically, we activate a central subarray with M𝑀Mitalic_M antennas to eliminate the angular ambiguity, for which the codebook 𝒱Subsubscript𝒱Sub{{\mathcal{V}}}_{\rm{Sub}}caligraphic_V start_POSTSUBSCRIPT roman_Sub end_POSTSUBSCRIPT is presented as follows

𝒱Sub={𝐯1,,𝐯s,,𝐯QU},subscript𝒱Subsubscript𝐯1subscript𝐯𝑠subscript𝐯𝑄𝑈{{\mathcal{V}}}_{\rm{Sub}}\!\!=\!\!\{{{\mathbf{v}}}_{1},\!\cdots\!,{{\mathbf{v% }}}_{s},\cdots,{{\mathbf{v}}}_{QU}\},caligraphic_V start_POSTSUBSCRIPT roman_Sub end_POSTSUBSCRIPT = { bold_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , bold_v start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , ⋯ , bold_v start_POSTSUBSCRIPT italic_Q italic_U end_POSTSUBSCRIPT } , (28)

where 𝐯ssubscript𝐯𝑠{\mathbf{v}}_{s}bold_v start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is given by

𝐯s=1M[0,,0NM2,eȷπNM2θs,,eȷπN+M22θs,0,,0NM2,]H.{\mathbf{v}}_{s}=\!\!\frac{1}{\sqrt{M}}\bigg{[}\underbrace{0,\cdots,0}_{\frac{% N-M}{2}},e^{\jmath{\pi}\frac{N-M}{2}\theta_{s}},\!\cdots\!,e^{\jmath{\pi}\frac% {N+M-2}{2}\theta_{s}},\underbrace{0,\cdots,0}_{\frac{N-M}{2}},\bigg{]}^{H}\!\!.bold_v start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG square-root start_ARG italic_M end_ARG end_ARG [ under⏟ start_ARG 0 , ⋯ , 0 end_ARG start_POSTSUBSCRIPT divide start_ARG italic_N - italic_M end_ARG start_ARG 2 end_ARG end_POSTSUBSCRIPT , italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π divide start_ARG italic_N - italic_M end_ARG start_ARG 2 end_ARG italic_θ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , ⋯ , italic_e start_POSTSUPERSCRIPT italic_ȷ italic_π divide start_ARG italic_N + italic_M - 2 end_ARG start_ARG 2 end_ARG italic_θ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , under⏟ start_ARG 0 , ⋯ , 0 end_ARG start_POSTSUBSCRIPT divide start_ARG italic_N - italic_M end_ARG start_ARG 2 end_ARG end_POSTSUBSCRIPT , ] start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT . (29)
Refer to caption
Figure 6: The shift of the received beam pattern with the sparse DFT codebook in a period, where N=257𝑁257N=257italic_N = 257, U=16𝑈16U=16italic_U = 16 and f=30𝑓30f=30italic_f = 30 GHz. The actual user angle is θ0=0.2subscript𝜃00.2\theta_{0}=0.2italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0.2

It is noteworthy that there are two criteria for ensuring the effectiveness of angular ambiguity elimination. The first one is that the user should be located in the far-field region of the activated central subarray to avoid the energy-spread effect. Moreover, considering that the interval between two adjacent candidate angles is 2U2𝑈\frac{2}{U}divide start_ARG 2 end_ARG start_ARG italic_U end_ARG, extra angular ambiguity will be introduced if the beam width of the central subarray is wider than 4U4𝑈\frac{4}{U}divide start_ARG 4 end_ARG start_ARG italic_U end_ARG. Therefore, the second criterion is that the beam width of the central subarray is required to be smaller than 4U4𝑈\frac{4}{U}divide start_ARG 4 end_ARG start_ARG italic_U end_ARG, which ensures no interference between two adjacent candidate angles.

To satisfy the first criterion, we set the Rayleigh distance of the subarray to be less than the Fresnel distance of the XL-array, which is given by

2M2d02λ1.2D.2superscript𝑀2superscriptsubscript𝑑02𝜆1.2𝐷\frac{2M^{2}d_{0}^{2}}{\lambda}\leq 1.2D.divide start_ARG 2 italic_M start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_λ end_ARG ≤ 1.2 italic_D . (30)

Then, (30) can be simplified as M1.2(N1)𝑀1.2𝑁1M\leq\sqrt{1.2{(N-1)}}italic_M ≤ square-root start_ARG 1.2 ( italic_N - 1 ) end_ARG.

For the second criterion, we have

4M4U,4𝑀4𝑈\frac{4}{M}\leq\frac{4}{U},divide start_ARG 4 end_ARG start_ARG italic_M end_ARG ≤ divide start_ARG 4 end_ARG start_ARG italic_U end_ARG , (31)

which can be simplified to MU𝑀𝑈M\geq Uitalic_M ≥ italic_U. Therefore, the number of antennas of the central subarray needs to satisfy

UM1.2(N1).𝑈𝑀1.2𝑁1U\leq M\leq\sqrt{{1.2(N-1)}}.italic_U ≤ italic_M ≤ square-root start_ARG 1.2 ( italic_N - 1 ) end_ARG . (32)

Moreover, from (32), the activation interval U𝑈Uitalic_U has the following constraint U1.2(N1)𝑈1.2𝑁1U\leq\sqrt{1.2{(N-1)}}italic_U ≤ square-root start_ARG 1.2 ( italic_N - 1 ) end_ARG. For example, given a setup where N=257𝑁257N=257italic_N = 257 and U=16𝑈16U=16italic_U = 16, we have 16M17.516𝑀17.516\leq M\leq 17.516 ≤ italic_M ≤ 17.5. In Fig. 7, we plot the beam width of the subarray with M=17𝑀17M=17italic_M = 17. It is observed that two adjacent candidate user angles have no considerable mutual interference, which indicates the effectiveness of angular ambiguity elimination.

Refer to caption
Figure 7: Illustration of angle ambiguity eliminated by the central subarray, where N=257𝑁257N=257italic_N = 257, M=17𝑀17M=17italic_M = 17 and f=30𝑓30f=30italic_f = 30 GHz. The beam is steered towards an candidate angle 1/161161/161 / 16. The red lines are two adjacent candidate user angles 1/16116-1/16- 1 / 16 and 1/161161/161 / 16.

Based on the above, the BS activates a central subarray comprising M𝑀Mitalic_M antennas satisfying (32) to resolve the angular ambiguity. For the candidate user angles θsˇ+u2U,u𝒰subscript𝜃ˇ𝑠𝑢2𝑈for-all𝑢𝒰{\theta}_{\check{s}}+u\frac{2}{U},\forall u\in\mathcal{U}italic_θ start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG end_POSTSUBSCRIPT + italic_u divide start_ARG 2 end_ARG start_ARG italic_U end_ARG , ∀ italic_u ∈ caligraphic_U in (27), the codewords chosen from (28) to resolve the angular ambiguity are 𝐯sˇ+uQ,u𝒰subscript𝐯ˇ𝑠𝑢𝑄for-all𝑢𝒰\mathbf{v}_{\check{s}+uQ},\forall u\in\mathcal{U}bold_v start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG + italic_u italic_Q end_POSTSUBSCRIPT , ∀ italic_u ∈ caligraphic_U. Then, the BS sequentially transmits U𝑈Uitalic_U pilot symbols with beamforming vectors {𝐯sˇ+uQ}subscript𝐯ˇ𝑠𝑢𝑄\{\mathbf{v}_{\check{s}+uQ}\}{ bold_v start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG + italic_u italic_Q end_POSTSUBSCRIPT } as illustrated in Fig. 5 (b). For each codeword, the received signal power at the user in the second phase is given by

p(𝐯sˇ+uQ)=|Mβ𝐛H(r0,θ0)𝐯sˇ+uQx+z|2,u𝒰.formulae-sequence𝑝subscript𝐯ˇ𝑠𝑢𝑄superscript𝑀𝛽superscript𝐛𝐻subscript𝑟0subscript𝜃0subscript𝐯ˇ𝑠𝑢𝑄𝑥𝑧2for-all𝑢𝒰p(\mathbf{v}_{\check{s}+uQ})=|\sqrt{M}\beta\mathbf{b}^{H}(r_{0},\theta_{0})% \mathbf{v}_{\check{s}+uQ}x+z|^{2},\leavevmode\nobreak\ \forall u\in\mathcal{U}.italic_p ( bold_v start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG + italic_u italic_Q end_POSTSUBSCRIPT ) = | square-root start_ARG italic_M end_ARG italic_β bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_v start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG + italic_u italic_Q end_POSTSUBSCRIPT italic_x + italic_z | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ∀ italic_u ∈ caligraphic_U . (33)

Then, we can obtain the optimal codeword 𝐯ssubscript𝐯superscript𝑠\mathbf{v}_{s^{\ast}}bold_v start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT via simple comparisons of received signal power in (33), which is given by

s=argmaxu𝒰p(𝐯sˇ+uQ).superscript𝑠subscript𝑢𝒰𝑝subscript𝐯ˇ𝑠𝑢𝑄{s}^{\ast}=\arg\max_{u\in\mathcal{U}}p(\mathbf{v}_{\check{s}+uQ}).italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_arg roman_max start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_p ( bold_v start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG + italic_u italic_Q end_POSTSUBSCRIPT ) . (34)

Then, the estimated user angle can be obtained by θs=2sQU1QUsubscript𝜃superscript𝑠2superscript𝑠𝑄𝑈1𝑄𝑈\theta_{s^{\ast}}=\frac{2s^{\ast}-QU-1}{QU}italic_θ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = divide start_ARG 2 italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_Q italic_U - 1 end_ARG start_ARG italic_Q italic_U end_ARG.

V-C Phase 3: Beam Swee** with Polar-domain Codebook

Once we determine the user angle θssubscript𝜃superscript𝑠\theta_{s^{\ast}}italic_θ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT in the second phase, we can use the polar-domain codebook to perform the range domain swee**, achieving beamfocusing gain in the near-field. The polar-domain codebook utilized in the third phase is given by

𝒳¯Pol={𝒳1,,𝒳s,,𝒳QU},subscript¯𝒳Polsubscript𝒳1subscript𝒳𝑠subscript𝒳𝑄𝑈\bar{\mathcal{X}}_{\rm{Pol}}=\{{\mathcal{X}}_{1},\cdots,{\mathcal{X}}_{s},% \cdots,{\mathcal{X}}_{QU}\},over¯ start_ARG caligraphic_X end_ARG start_POSTSUBSCRIPT roman_Pol end_POSTSUBSCRIPT = { caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , caligraphic_X start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , ⋯ , caligraphic_X start_POSTSUBSCRIPT italic_Q italic_U end_POSTSUBSCRIPT } , (35)

where 𝒳s={𝐱s,1,,𝐱s,v,,𝐱s,V}subscript𝒳𝑠subscript𝐱𝑠1subscript𝐱𝑠𝑣subscript𝐱𝑠𝑉{\mathcal{X}}_{s}=\{{\mathbf{x}}_{s,1},\cdots,{\mathbf{x}}_{s,v},\cdots,{% \mathbf{x}}_{s,V}\}caligraphic_X start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = { bold_x start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT , ⋯ , bold_x start_POSTSUBSCRIPT italic_s , italic_v end_POSTSUBSCRIPT , ⋯ , bold_x start_POSTSUBSCRIPT italic_s , italic_V end_POSTSUBSCRIPT } and 𝐱s,v=𝐛H(rs,v,θs)subscript𝐱𝑠𝑣superscript𝐛𝐻subscript𝑟𝑠𝑣subscript𝜃𝑠{\mathbf{x}}_{s,v}=\mathbf{b}^{H}(r_{s,v},\theta_{s})bold_x start_POSTSUBSCRIPT italic_s , italic_v end_POSTSUBSCRIPT = bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT italic_s , italic_v end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) with rs,v=1vαΔ(1θs2)subscript𝑟𝑠𝑣1𝑣subscript𝛼Δ1superscriptsubscript𝜃𝑠2r_{s,v}=\frac{1}{v}\alpha_{\Delta}\left(1-\theta_{s}^{2}\right)italic_r start_POSTSUBSCRIPT italic_s , italic_v end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_v end_ARG italic_α start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ( 1 - italic_θ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) [27]. Moreover, V𝑉Vitalic_V denotes the number of range samples.

Specifically, the codewords employed in the third phase are given by 𝒳s={𝐱s,1,,𝐱s,v,,𝐱s,V}subscript𝒳superscript𝑠subscript𝐱superscript𝑠1subscript𝐱superscript𝑠𝑣subscript𝐱superscript𝑠𝑉\mathcal{X}_{s^{\ast}}=\{{\mathbf{x}}_{s^{\ast},1},\cdots,{\mathbf{x}}_{s^{% \ast},v},\cdots,{\mathbf{x}}_{s^{\ast},V}\}caligraphic_X start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = { bold_x start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , 1 end_POSTSUBSCRIPT , ⋯ , bold_x start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_v end_POSTSUBSCRIPT , ⋯ , bold_x start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_V end_POSTSUBSCRIPT }. Then, the BS activates the whole XL-array and sequentially transmits training V𝑉Vitalic_V symbols in the estimated user angle θssubscript𝜃superscript𝑠\theta_{s^{\ast}}italic_θ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT with codewords 𝒳ssubscript𝒳superscript𝑠\mathcal{X}_{s^{\ast}}caligraphic_X start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT as illustrated in Fig 5(c). For each codeword, the received signal power at the user is given by

p(𝐱s,v)=|Nβ𝐛H(r0,θ0)𝐱s,vx+z|2,v𝒱.formulae-sequence𝑝subscript𝐱superscript𝑠𝑣superscript𝑁𝛽superscript𝐛𝐻subscript𝑟0subscript𝜃0subscript𝐱superscript𝑠𝑣𝑥𝑧2for-all𝑣𝒱p({\mathbf{x}}_{s^{\ast},v})=|\sqrt{N}\beta\mathbf{b}^{H}(r_{0},\theta_{0}){% \mathbf{x}}_{s^{\ast},v}x+z|^{2},\leavevmode\nobreak\ \forall v\in\mathcal{V}.italic_p ( bold_x start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_v end_POSTSUBSCRIPT ) = | square-root start_ARG italic_N end_ARG italic_β bold_b start_POSTSUPERSCRIPT italic_H end_POSTSUPERSCRIPT ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_x start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_v end_POSTSUBSCRIPT italic_x + italic_z | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ∀ italic_v ∈ caligraphic_V . (36)

Subsequently, the optimal codeword is determined through straightforward comparisons of received signal power in (36) and the best codeword index is given by

v=argmaxv𝒱p(𝐱s,v).superscript𝑣subscript𝑣𝒱𝑝subscript𝐱superscript𝑠𝑣{v}^{\ast}=\arg\max_{v\in\mathcal{V}}p({\mathbf{x}}_{s^{\ast},v}).italic_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_arg roman_max start_POSTSUBSCRIPT italic_v ∈ caligraphic_V end_POSTSUBSCRIPT italic_p ( bold_x start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_v end_POSTSUBSCRIPT ) . (37)

Therefore, we can obtain the user location (rs,v,θs)subscript𝑟superscript𝑠superscript𝑣subscript𝜃superscript𝑠(r_{{s}^{\ast},{v}^{\ast}},\theta_{{s}^{\ast}})( italic_r start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) with rs,v=1vαΔ(1θs2)subscript𝑟superscript𝑠superscript𝑣1superscript𝑣subscript𝛼Δ1superscriptsubscript𝜃superscript𝑠2r_{{s}^{\ast},{v}^{\ast}}=\frac{1}{{v}^{\ast}}\alpha_{\Delta}\left(1-\theta_{{% s}^{\ast}}^{2}\right)italic_r start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG italic_α start_POSTSUBSCRIPT roman_Δ end_POSTSUBSCRIPT ( 1 - italic_θ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ).

The detailed procedures of the proposed three-phase beam training method are summarized in Algorithm 1.

Algorithm 1 Proposed Three-phase Beam Training Method
1:  Phase 1: Angular swee** in the subspace.
2:  Use the sparse DFT codebook 𝒲DFTsubscript𝒲DFT\mathcal{W}_{\rm DFT}caligraphic_W start_POSTSUBSCRIPT roman_DFT end_POSTSUBSCRIPT for the beam swee** in the angular subspace [1/U,1/U)1𝑈1𝑈[-1/U,1/U)[ - 1 / italic_U , 1 / italic_U ).
3:  Perform the received-beam-pattern shifting to obtain a regular angular support 𝒫𝒫\mathcal{P}caligraphic_P.
4:  Obtain U𝑈Uitalic_U candidate user angles θc=θsˇ+2uU,u𝒰formulae-sequencesubscript𝜃csubscript𝜃ˇ𝑠2𝑢𝑈for-all𝑢𝒰\theta_{\rm c}={\theta}_{\check{s}}+\frac{2u}{U},\forall u\in\mathcal{U}italic_θ start_POSTSUBSCRIPT roman_c end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG end_POSTSUBSCRIPT + divide start_ARG 2 italic_u end_ARG start_ARG italic_U end_ARG , ∀ italic_u ∈ caligraphic_U according to the middle of the angular support 𝒫𝒫\mathcal{P}caligraphic_P.
5:  Phase 2: Resolve angular ambiguity.
6:  Activate a central subarray with M𝑀Mitalic_M antennas and use codebook 𝒱Subsubscript𝒱Sub{{\mathcal{V}}}_{\rm{Sub}}caligraphic_V start_POSTSUBSCRIPT roman_Sub end_POSTSUBSCRIPT to examine U𝑈Uitalic_U candidate angles estimated in the first phase.
7:  Obtain the best user angle θssubscript𝜃superscript𝑠\theta_{s^{\ast}}italic_θ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT according to the highest power received at the user with respective to p(𝐯sˇ+uQ)𝑝subscript𝐯ˇ𝑠𝑢𝑄p(\mathbf{v}_{\check{s}+uQ})italic_p ( bold_v start_POSTSUBSCRIPT overroman_ˇ start_ARG italic_s end_ARG + italic_u italic_Q end_POSTSUBSCRIPT ).
8:  Phase 3: Range swee**.
9:  Use polar-domain codebook 𝒳¯Polsubscript¯𝒳Pol\bar{\mathcal{X}}_{\rm{Pol}}over¯ start_ARG caligraphic_X end_ARG start_POSTSUBSCRIPT roman_Pol end_POSTSUBSCRIPT with codewords 𝒳ssubscript𝒳superscript𝑠\mathcal{X}_{s^{\ast}}caligraphic_X start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT to sweep the range domain in the user angle θssubscript𝜃superscript𝑠\theta_{s^{\ast}}italic_θ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT.
10:  Obtain the best user range rs,vsubscript𝑟superscript𝑠superscript𝑣r_{s^{\ast},v^{\ast}}italic_r start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_v start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT according to the highest power received at the user with respective to p(𝐱s,v)𝑝subscript𝐱superscript𝑠𝑣p({\mathbf{x}}_{s^{\ast},v})italic_p ( bold_x start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_v end_POSTSUBSCRIPT ).

V-D Discussions

Beam training overhead: In the first phase, the overhead is Q=N1U+1𝑄𝑁1𝑈1Q=\frac{N-1}{U}+1italic_Q = divide start_ARG italic_N - 1 end_ARG start_ARG italic_U end_ARG + 1. Moreover, the overhead of the second and third phases are U𝑈Uitalic_U and V𝑉Vitalic_V, respectively. Finally, the overall overhead of the proposed beam training scheme is T(3P)=Q+U+Vsuperscript𝑇3P𝑄𝑈𝑉T^{\rm(3P)}=Q+U+Vitalic_T start_POSTSUPERSCRIPT ( 3 roman_P ) end_POSTSUPERSCRIPT = italic_Q + italic_U + italic_V. Given Q=N1U+1+𝑄𝑁1𝑈1superscriptQ=\frac{N-1}{U}+1\in\mathbb{Z}^{+}italic_Q = divide start_ARG italic_N - 1 end_ARG start_ARG italic_U end_ARG + 1 ∈ blackboard_Z start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT with +superscript\mathbb{Z}^{+}blackboard_Z start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT denoting the positive integer set, the beam training overhead of the proposed method enabled by the sparse DFT codebook can be recast as

T(3P)=N1U+U+V+1.superscript𝑇3P𝑁1𝑈𝑈𝑉1T^{\rm(3P)}=\frac{N-1}{U}+U+V+1.italic_T start_POSTSUPERSCRIPT ( 3 roman_P ) end_POSTSUPERSCRIPT = divide start_ARG italic_N - 1 end_ARG start_ARG italic_U end_ARG + italic_U + italic_V + 1 . (38)

It can be easily obtained from (38) that as U𝑈Uitalic_U increases, the overhead during the first phase decreases, while it increases during the subsequent phase, and vice versa. Thus, there is a fundamental trade-off between the first and second phase of the beam training method depending on the value of U𝑈Uitalic_U. Then, we aim to minimize the beam training overhead T(3P)superscript𝑇3PT^{\rm(3P)}italic_T start_POSTSUPERSCRIPT ( 3 roman_P ) end_POSTSUPERSCRIPT and the optimization problem is formulated as

(𝐏𝟏)::𝐏𝟏absent\displaystyle({\bf P1}):\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ ( bold_P1 ) : minUF(U)=N1U+U+V+1subscript𝑈𝐹𝑈𝑁1𝑈𝑈𝑉1\displaystyle\min_{U}\leavevmode\nobreak\ F(U)=\frac{N-1}{U}+U+V+1roman_min start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT italic_F ( italic_U ) = divide start_ARG italic_N - 1 end_ARG start_ARG italic_U end_ARG + italic_U + italic_V + 1
s.t.U1.2(N1),formulae-sequences.t𝑈1.2𝑁1\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ {\text{s}}{\text{.t}}{% \rm{.}}\leavevmode\nobreak\ U\leq\sqrt{{1.2(N-1)}},roman_s .t . italic_U ≤ square-root start_ARG 1.2 ( italic_N - 1 ) end_ARG , (39a)
U{N1U+1+,U+}.𝑈formulae-sequence𝑁1𝑈1superscript𝑈superscript\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ U\in\mathcal{F}% \triangleq\{\frac{N-1}{U}\!+\!1\in\mathbb{Z}^{+},U\in\mathbb{Z}^{+}\}.italic_U ∈ caligraphic_F ≜ { divide start_ARG italic_N - 1 end_ARG start_ARG italic_U end_ARG + 1 ∈ blackboard_Z start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT , italic_U ∈ blackboard_Z start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT } . (39b)

If we remove the integer constraint (39b), (𝐏𝟏)𝐏𝟏({\bf P1})( bold_P1 ) is a convex problem. Then, Problem (𝐏𝟏)𝐏𝟏({\bf P1})( bold_P1 ) can be rewritten as

(𝐏𝟐)::𝐏𝟐absent\displaystyle({\bf P2}):\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ ( bold_P2 ) : minUF(U)=N1U+U+V+1subscript𝑈𝐹𝑈𝑁1𝑈𝑈𝑉1\displaystyle\min_{U}\leavevmode\nobreak\ F(U)=\frac{N-1}{U}+U+V+1roman_min start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT italic_F ( italic_U ) = divide start_ARG italic_N - 1 end_ARG start_ARG italic_U end_ARG + italic_U + italic_V + 1
s.t.U1.2(N1)4.formulae-sequences.t𝑈41.2𝑁1\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ {\text{s}}{\text{.t}}{% \rm{.}}\leavevmode\nobreak\ U\leq\sqrt[4]{1.2{(N-1)}}.roman_s .t . italic_U ≤ nth-root start_ARG 4 end_ARG start_ARG 1.2 ( italic_N - 1 ) end_ARG . (40a)

Problem (P2) is an convex optimization problem, whose optimal solution can be easily obtained as follows due to convexity.

Lemma 3.

The optimal solution to Problem (P2) is given by UP2=(N1)subscriptsuperscript𝑈subscript𝑃2𝑁1U^{\ast}_{P_{2}}=\sqrt{(N-1)}italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = square-root start_ARG ( italic_N - 1 ) end_ARG. Moreover, the overhead of the proposed method is T(3P)=2(N1)+V+1superscript𝑇3P2𝑁1𝑉1T^{(\rm 3P)}=2\sqrt{(N-1)}+V+1italic_T start_POSTSUPERSCRIPT ( 3 roman_P ) end_POSTSUPERSCRIPT = 2 square-root start_ARG ( italic_N - 1 ) end_ARG + italic_V + 1.

Proof: It is observed that N1U+U+V+12N1+V+1𝑁1𝑈𝑈𝑉12𝑁1𝑉1\frac{N-1}{U}+U+V+1\geq 2\sqrt{N-1}+V+1divide start_ARG italic_N - 1 end_ARG start_ARG italic_U end_ARG + italic_U + italic_V + 1 ≥ 2 square-root start_ARG italic_N - 1 end_ARG + italic_V + 1, where the equal holds with U=(N1)𝑈𝑁1U=\sqrt{(N-1)}italic_U = square-root start_ARG ( italic_N - 1 ) end_ARG. Moreover, considering U=(N1)1.2(N1)𝑈𝑁11.2𝑁1U=\sqrt{(N-1)}\leq\sqrt{1.2{(N-1)}}italic_U = square-root start_ARG ( italic_N - 1 ) end_ARG ≤ square-root start_ARG 1.2 ( italic_N - 1 ) end_ARG, the optimal solution to Problem (𝐏𝟐)𝐏𝟐({\bf P2})( bold_P2 ) is UP2=(N1)subscriptsuperscript𝑈subscript𝑃2𝑁1U^{\ast}_{P_{2}}=\sqrt{(N-1)}italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = square-root start_ARG ( italic_N - 1 ) end_ARG with the beam training overhead T(3P)=2(N1)+V+1superscript𝑇3P2𝑁1𝑉1T^{(\rm 3P)}=2\sqrt{(N-1)}+V+1italic_T start_POSTSUPERSCRIPT ( 3 roman_P ) end_POSTSUPERSCRIPT = 2 square-root start_ARG ( italic_N - 1 ) end_ARG + italic_V + 1.  \Box

When the integer constraint (39b) is taken into account, a suboptimal solution to Problem (P1) is given by

U^=argmin{|UP2f|,f}.^𝑈subscriptsubscriptsuperscript𝑈subscriptP2𝑓𝑓\hat{U}=\arg\min\limits_{\mathcal{F}}\{|U^{\ast}_{\rm P_{2}}-f|,f\in\mathcal{F% }\}.over^ start_ARG italic_U end_ARG = roman_arg roman_min start_POSTSUBSCRIPT caligraphic_F end_POSTSUBSCRIPT { | italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_f | , italic_f ∈ caligraphic_F } . (41)

Next, we present an example to illustrate the low overhead feature of the proposed beam training method. We consider a setup with N=1025𝑁1025N=1025italic_N = 1025 and V=5𝑉5V=5italic_V = 5. Then, U=32𝑈32U=32italic_U = 32 can minimize T(3P)superscript𝑇3PT^{(\rm 3P)}italic_T start_POSTSUPERSCRIPT ( 3 roman_P ) end_POSTSUPERSCRIPT according to the optimal solution to Problem (𝐏𝟏)𝐏𝟏({\bf P1})( bold_P1 ). The beam training overhead of the proposed multi-beam training scheme in this setup is T(3P)=2N1+V+1=64+5+1=70superscript𝑇3P2𝑁1𝑉1645170T^{(\rm 3P)}=2\sqrt{N-1}+V+1=64+5+1=70italic_T start_POSTSUPERSCRIPT ( 3 roman_P ) end_POSTSUPERSCRIPT = 2 square-root start_ARG italic_N - 1 end_ARG + italic_V + 1 = 64 + 5 + 1 = 70, which is significantly smaller than that of the exhaustive-search method (T(EX)=5280superscript𝑇EX5280T^{(\rm EX)}=5280italic_T start_POSTSUPERSCRIPT ( roman_EX ) end_POSTSUPERSCRIPT = 5280) and the two-phase near-field beam training method (T(2P)=1061superscript𝑇2P1061T^{(\rm 2P)}=1061italic_T start_POSTSUPERSCRIPT ( 2 roman_P ) end_POSTSUPERSCRIPT = 1061).

Remark 1 (Improved scheme: Middle-K𝐾Kitalic_K angle selection).

Due to the power fluctuation [28], the middle angle of the angular support may not be accurate. To improve the estimation accuracy, we can select the middle-K𝐾Kitalic_K angles of the angular support instead of selecting one potential angle in the first procedure. Specifically, in the third phase, we should perform the beam swee** in the range domain in the K𝐾Kitalic_K potential angles θs¯k,k=1,,Kformulae-sequencesubscript𝜃subscript¯𝑠𝑘𝑘1𝐾\theta_{\bar{s}_{k}},k=1,\cdots,Kitalic_θ start_POSTSUBSCRIPT over¯ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_k = 1 , ⋯ , italic_K to determine an optimal polar-domain codeword. The overhead of the middle-K𝐾Kitalic_K angle scheme is given by TMidK(3P)=N1U+U+KVsubscriptsuperscript𝑇3PMid𝐾𝑁1𝑈𝑈𝐾𝑉T^{(\rm 3P)}_{{\rm Mid}-K}=\frac{N-1}{U}+U+KVitalic_T start_POSTSUPERSCRIPT ( 3 roman_P ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Mid - italic_K end_POSTSUBSCRIPT = divide start_ARG italic_N - 1 end_ARG start_ARG italic_U end_ARG + italic_U + italic_K italic_V. It is noteworthy that this does not significantly increase the beam training overhead, which is still proportional to N𝑁\sqrt{N}square-root start_ARG italic_N end_ARG.

Remark 2 (Estimation error).

From the above analysis, the accuracy of the proposed beam training scheme enabled by sparse DFT codebook is mainly dependent on two factors. First, the sampling resolution is a key factor because the proposed beam training scheme is an on-grid channel estimation method. Therefore, when the sampling interval is small enough, the performance of proposed multi-beam training scheme approaches that of the optimal beamformer without noise taking into account. Second, noise is another key issue. The estimated angle is derived from the received signal power in different time, consequently influenced by the received SNR. Hence, a higher SNR is expected to achieve more accurate angle estimation, which will be numerically verified in Section VI.

Remark 3 (Multi-path channels).

With respect to multi-path channel cases, we divide the extension of the proposed algorithm into two cases according to different Rician parameters.

  • LoS-dominant channel: When the Rician factor is large enough (or equivalently the LoS path is dominant), we can regard the NLoS components as environmental noise. Therefore, the proposed beam training scheme based on the sparse DFT codebook still holds as this method only depends on the LoS path component.

  • Comparable multi-path components: This case is much more complicated. Considering that the accuracy depends on the periodic angular supports, NLoS components may bring about randomly overlapped received beam pattern in the angular domain, which poses challenges to the angle estimation in the first phase of our proposed beam training scheme. Therefore, the case with comparable multi-path components is left as a topic for our future works.

Remark 4 (Universal in both near- and far-field communications).

The proposed beam scheme can be applied to both near- and far-field communications. We can identify the near- or far-field user according to the angular support width [43]. Specifically, in cases where the angular support width is small (e.g., only contains one candidate angle), it signifies that the user is located in the far-field region. Then, there is no need to perform the range estimation in the third phase (see Section V-C). In other words, for far-field users, only the first two phases of the proposed beam training scheme need to be executed.

Refer to caption
Figure 8: Achievable rate versus SNR.

VI Numerical Results

Numerical results are provided to validate the effectiveness of the proposed near-field beam training scheme in this section. We first present the system parameters and benchmark schemes followed by the performance comparison under numerous setups.

VI-A System Setup and Benchmark Schemes

We set the system parameters as follows. We consider that the XL-array is equipped with N=257𝑁257N=257italic_N = 257 antennas and f=30𝑓30f=30italic_f = 30 GHz. The transmit power and reference channel gain at 1111 m are set as Ptol=30subscript𝑃tol30P_{\rm tol}=30italic_P start_POSTSUBSCRIPT roman_tol end_POSTSUBSCRIPT = 30 dBm and β0=(4πλ)2=62subscript𝛽0superscript4𝜋𝜆262\beta_{0}=(\frac{4\pi}{\lambda})^{2}=-62italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( divide start_ARG 4 italic_π end_ARG start_ARG italic_λ end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = - 62 dB, respectively. Moreover, the noise power is σ2=80superscript𝜎280\sigma^{2}=-80italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = - 80 dBm. According to (𝐏𝟏)𝐏𝟏({\bf P1})( bold_P1 ), the optimal activation interval is set as U=16𝑈16U=16italic_U = 16. Furthermore, the antenna number of the activated central subarray is M=17𝑀17M=17italic_M = 17. With respective to NLoS paths, we set L=2𝐿2L=2italic_L = 2 and κk=30subscript𝜅𝑘30\kappa_{k}=30italic_κ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 30 dB [29, 32]. The reference SNR for a user is defined by γ=NPtolβ0r02σ2𝛾𝑁subscript𝑃tolsubscript𝛽0superscriptsubscript𝑟02superscript𝜎2\gamma=\frac{NP_{\rm tol}\beta_{0}}{r_{\rm 0}^{2}\sigma^{2}}italic_γ = divide start_ARG italic_N italic_P start_POSTSUBSCRIPT roman_tol end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG [28]. To characterize the overhead, we assume that the total transmission time and a pilot symbol time are Ttol=0.2subscript𝑇tol0.2T_{\mathrm{tol}}=0.2italic_T start_POSTSUBSCRIPT roman_tol end_POSTSUBSCRIPT = 0.2 ms and Ts=0.1subscript𝑇s0.1T_{\mathrm{s}}=0.1italic_T start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT = 0.1 µsmicrosecond\mathrm{\SIUnitSymbolMicro s}roman_µ roman_s [44], respectively. Then, the effective rate is defined by REff=(1ToverTsTtol)Rsubscript𝑅Eff1subscript𝑇oversubscript𝑇ssubscript𝑇tol𝑅R_{\mathrm{Eff}}=\left(1-\frac{T_{\rm over}T_{\mathrm{s}}}{T_{\mathrm{tol}}}% \right)Ritalic_R start_POSTSUBSCRIPT roman_Eff end_POSTSUBSCRIPT = ( 1 - divide start_ARG italic_T start_POSTSUBSCRIPT roman_over end_POSTSUBSCRIPT italic_T start_POSTSUBSCRIPT roman_s end_POSTSUBSCRIPT end_ARG start_ARG italic_T start_POSTSUBSCRIPT roman_tol end_POSTSUBSCRIPT end_ARG ) italic_R, where Toversubscript𝑇overT_{\rm over}italic_T start_POSTSUBSCRIPT roman_over end_POSTSUBSCRIPT denotes the beam training overhead of each scheme. All the numerical results are averaged over 1000 channel realizations. The following benchmark schemes are considered for performance comparison:

Refer to caption
Figure 9: Achievable rate versus user range.
  • Perfect-CSI based beamforming: This scheme assumes that the BS perfectly aligns the user near-field channel and beamforming vector is set by 𝐰¯=𝐛(r0,θ0)¯𝐰𝐛subscript𝑟0subscript𝜃0\bar{\mathbf{w}}=\mathbf{b}(r_{0},\theta_{0})over¯ start_ARG bold_w end_ARG = bold_b ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) Obviously, this scheme is the performance upper bound for all methods.

  • Least square channel estimation: This scheme is a classic off-grid channel estimation method where the user estimates the channels by N𝑁Nitalic_N pilot symbols transmitted by the BS in the downlink. The estimated channel is given by 𝐡^LS=(𝐗H𝐗)1𝐗H𝐲subscript^𝐡LSsuperscriptsuperscript𝐗H𝐗1superscript𝐗H𝐲\hat{\mathbf{h}}_{\mathrm{LS}}=\left(\mathbf{X}^{\mathrm{H}}\mathbf{X}\right)^% {-1}\mathbf{X}^{\mathrm{H}}\mathbf{y}over^ start_ARG bold_h end_ARG start_POSTSUBSCRIPT roman_LS end_POSTSUBSCRIPT = ( bold_X start_POSTSUPERSCRIPT roman_H end_POSTSUPERSCRIPT bold_X ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT bold_X start_POSTSUPERSCRIPT roman_H end_POSTSUPERSCRIPT bold_y, where 𝐗N×N𝐗superscript𝑁𝑁\mathbf{X}\in\mathbb{C}^{N\times N}bold_X ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT denotes the pilot matrix for each user and 𝐲=𝐗𝐡+𝐳N×1𝐲𝐗𝐡𝐳superscript𝑁1\mathbf{y}=\mathbf{X}\mathbf{h}+\mathbf{z}\in\mathbb{C}^{N\times 1}bold_y = bold_Xh + bold_z ∈ blackboard_C start_POSTSUPERSCRIPT italic_N × 1 end_POSTSUPERSCRIPT represents the received signal vector with 𝐳𝒞𝒩(𝟎,σ2𝐈)similar-to𝐳𝒞𝒩0superscript𝜎2𝐈\mathbf{z}\sim\mathcal{CN}\left(\mathbf{0},\sigma^{2}\mathbf{I}\right)bold_z ∼ caligraphic_C caligraphic_N ( bold_0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_I ). Moreover, 𝐡𝐡\mathbf{h}bold_h is shown in (1). Obviously, the pilot overhead of this scheme is T(LS)=Nsuperscript𝑇LS𝑁T^{({\rm LS})}=Nitalic_T start_POSTSUPERSCRIPT ( roman_LS ) end_POSTSUPERSCRIPT = italic_N.

  • Exhaustive-search beam training: This scheme is detailed in Section III-A. Due to the different angle sampling interval, the overhead of this scheme is modified by T(ex)=QUVsuperscript𝑇ex𝑄𝑈𝑉T^{\rm{(ex)}}=QUVitalic_T start_POSTSUPERSCRIPT ( roman_ex ) end_POSTSUPERSCRIPT = italic_Q italic_U italic_V.

  • Two-phase beam training: This scheme is detailed in Section III-B. Due to the different angle sampling interval, the overhead of this scheme is revised by T(2P)=QU+KVsuperscript𝑇2P𝑄𝑈𝐾𝑉T^{\rm{(2P)}}=QU+KVitalic_T start_POSTSUPERSCRIPT ( 2 roman_P ) end_POSTSUPERSCRIPT = italic_Q italic_U + italic_K italic_V.

  • Far-field beam training based on DFT codebook: Conventional DFT codebook is used to sweep the whole angular domain for choosing a best codeword for which the maximum received signal power is achieved at the user. The beam training overhead of this scheme is T(Far)=QUsuperscript𝑇Far𝑄𝑈T^{\rm(Far)}=QUitalic_T start_POSTSUPERSCRIPT ( roman_Far ) end_POSTSUPERSCRIPT = italic_Q italic_U.

Refer to caption
Figure 10: Effective rate versus user range.
Refer to caption
Figure 11: Effective rate versus SNR.

VI-B Performance Analysis

In Fig. 8, we plot the achievable rate R𝑅Ritalic_R versus the reference SNR γ𝛾\gammaitalic_γ under different beam training schemes. Some key observations can be concluded as follows. First, our proposed near-field beam training scheme achieves very close performance to the two-phase beam training and exhaustive search based beam training scheme, especially in the high-SNR regime (i.e., larger than 26 dB). Second, with the decrease of the reference SNR, the achievable rate attained by off-grid channel estimation degrades more dramatically than other schemes, and becomes inferior to our proposed scheme when the reference SNR is lower than 26 dB. This is because the direct channel estimation is more sensitive to the received SNR as the XL-array beamforming vector may not be well aligned with the channel path during signaling. Finally, there is a large performance gap between the conventional far-field beam training scheme and other schemes dedicated to near-field communications, which implies that far-field beam training is no longer effective for next generation wireless systems with more antennas.

Then, Fig. 11 illustrates the effective rate REffsubscript𝑅EffR_{\mathrm{Eff}}italic_R start_POSTSUBSCRIPT roman_Eff end_POSTSUBSCRIPT versus the reference SNR γ𝛾\gammaitalic_γ. Interestingly, our proposed scheme outperforms other schemes in terms of the effective rate except for the perfect-CSI based beamforming. This is because our scheme achieves close or even superior achievable rates compared with other benchmarks (as have shown in Fig. 8), but with far less training overhead. Moreover, the exhaustive search based near-field beam training scheme is not practically applicable due to small effective rate caused by large training overhead, as shown in Fig. 11.

Refer to caption
Figure 12: Achievable rate versus Rician factor.
Refer to caption
Figure 13: Achievable rate versus the number of antennas in the central subarray.

Fig. 9 shows the effect of user range on the achievable rate. It can be observed that the proposed scheme with K=3𝐾3K=3italic_K = 3 exhibits approximately the same performance as the exhaustive-search based and two-phase near-field beam training schemes for all user ranges. This is attributed to the smart design of the sparse DFT codebook. This method leads to a periodical energy-spread effect during beam training, and hence the former key observation in [28] can be leveraged for beam training. Second, the achievable rate performance of the proposed scheme largely outperforms the far-field beam training when the user range is less than 50 m and gradually converges to that of the far-field beam training. This verifies the universality of our proposed scheme for both near-field and far-field beam training cases. In addition, the relationship between effective rate and user range is depicted in Fig. 10. Considering the overhead of beam training, the effective rate attained by our scheme is only slightly lower than (less than 1 bps/Hz) that of the perfect-CSI based schemes, which further verifies its effectiveness.

In Fig. 12, we evaluate the impact of Rician factor on the system achievable rate. It can be observed that the achievable rates of all schemes increase with the Rician factor at first and gradually saturate when the Rician factor approximates 10 dB. Moreover, the two-phase beam training scheme slightly outperforms the proposed scheme when the Rician factor is less than 5 dB. This is because our scheme is more sensitive to noise due to operations such as received-beam-pattern shifting, since NLoS components can be treated as a form of environment noise.

Refer to caption
Figure 14: Achievable rate versus number of antennas.

In Fig. 13, we plot the achievable rate versus the number of antennas in the central subarray. It can be observed that the achievable rate of the proposed scheme suffers from significant performance loss when the number of antennas in the central subarray is sufficiently small or large. This can be explained by two facts: 1) When M𝑀Mitalic_M is sufficiently large (Violation of Criteria 1 in Section V-B), the user are more likely to be located in the near-field region of the central subarray, for which the energy-spread effect is dominant and thus results in significant performance loss; 2) When M𝑀Mitalic_M is sufficiently small (Violation of Criteria 2 in Section V-B), the beam width of the central subarray becomes too large, introducing extra ambiguity and thus failing to distinguish candidate user angles.

Finally, we plot the achievable rate versus the number of XL-array antennas in Fig. 14, where the user ranges are fixed at 80 m. It is observed that the achievable rates increase as there are more antennas for all schemes except for the far-field beam training scheme. This is because as the number of antennas increases, the near-field effect is more prominent, and hence the far-field beam training is no longer effective even when the user is located at a relatively far range (i.e., 80 m) from the XL-array.

VII Conclusion

In this paper, we proposed a novel near-field beam training scheme enabled by the sparse DFT codebook (sparse far-field beamforming vectors) to construct periodic received beam pattern at the user. To this end, we showed that the angular periodicity of the received beam pattern boosts reduction in swee** space, thereby significantly decreasing beam training overhead. Specifically, the middle of the angular support within a period contains the user angle information. Then, an activated central subarray can resolve the angular ambiguity followed by polar-domain codebook swee** in the best user angle. Finally, numerical results were presented to show that the proposed beam training scheme can achieve nearly the same performance in the high-SNR regime with the exhaustive-search scheme, while significantly reducing the beam training overhead.

Appendix A

Proof of Lemma 2

From (1), when Δ[1/U,1/U)Δ1𝑈1𝑈\Delta\in[-1/U,1/U)roman_Δ ∈ [ - 1 / italic_U , 1 / italic_U ), B1subscript𝐵1B_{1}italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is not a constant at 2kπ2𝑘𝜋2k\pi2 italic_k italic_π for arbitrary integer k𝑘kitalic_k as q𝑞qitalic_q changes. Hence, we have

f^(r0,θ0;θ)=^𝑓subscript𝑟0subscript𝜃0𝜃absent\displaystyle\hat{f}\left(r_{0},\theta_{0};\theta\right)=over^ start_ARG italic_f end_ARG ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) = 1Q|q𝒬exp(ȷπ(A1q+A2q2))|1𝑄subscript𝑞𝒬italic-ȷ𝜋subscript𝐴1𝑞subscript𝐴2superscript𝑞2\displaystyle\frac{1}{Q}\!\!\left|\sum_{q\in\mathcal{Q}}\!\exp{\left(\jmath\pi% (A_{1}q+A_{2}q^{2})\right)}\right|divide start_ARG 1 end_ARG start_ARG italic_Q end_ARG | ∑ start_POSTSUBSCRIPT italic_q ∈ caligraphic_Q end_POSTSUBSCRIPT roman_exp ( italic_ȷ italic_π ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_q + italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ) | (42)
=\displaystyle== 1Q|q𝒬exp(ȷπA2(q+A12A2)2)|1𝑄subscript𝑞𝒬italic-ȷ𝜋subscript𝐴2superscript𝑞subscript𝐴12subscript𝐴22\displaystyle\frac{1}{Q}\!\!\left|\sum_{q\in\mathcal{Q}}\!\exp{\left(\jmath\pi A% _{2}(q+\frac{A_{1}}{2A_{2}})^{2}\right)}\right|divide start_ARG 1 end_ARG start_ARG italic_Q end_ARG | ∑ start_POSTSUBSCRIPT italic_q ∈ caligraphic_Q end_POSTSUBSCRIPT roman_exp ( italic_ȷ italic_π italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_q + divide start_ARG italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) |

where A1=UΔsubscript𝐴1𝑈ΔA_{1}=U\Deltaitalic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_U roman_Δ and A2=(Ud0)2λ1θ02r0subscript𝐴2superscript𝑈subscript𝑑02𝜆1superscriptsubscript𝜃02subscript𝑟0A_{2}=\frac{(Ud_{0})^{2}}{\lambda}\frac{1-\theta_{0}^{2}}{r_{0}}italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = divide start_ARG ( italic_U italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_λ end_ARG divide start_ARG 1 - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG.

Then, the summation in (42) can be approximated by an integral, which is given by

f^(r0,θ0;θ)(b1)1Q|Q2Q2exp(ȷπ(A1q+A2q2)dq)|^𝑓subscript𝑟0subscript𝜃0𝜃subscript𝑏11𝑄superscriptsubscript𝑄2𝑄2italic-ȷ𝜋subscript𝐴1𝑞subscript𝐴2superscript𝑞2d𝑞\displaystyle\hat{f}\left(r_{0},\theta_{0};\theta\right)\overset{{(b_{1})}}{% \approx}\frac{1}{Q}\!\!\left|\int_{-\frac{Q}{2}}^{\frac{Q}{2}}\!\exp{\left(% \jmath\pi(A_{1}q+A_{2}q^{2})\mathrm{d}q\right)}\right|over^ start_ARG italic_f end_ARG ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG ≈ end_ARG divide start_ARG 1 end_ARG start_ARG italic_Q end_ARG | ∫ start_POSTSUBSCRIPT - divide start_ARG italic_Q end_ARG start_ARG 2 end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG italic_Q end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT roman_exp ( italic_ȷ italic_π ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_q + italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) roman_d italic_q ) | (43)
=(b2)1Q|12A22A2(Q2+A12A2)2A2(Q2+A12A2)e(ȷπt22)dt|subscript𝑏21𝑄12subscript𝐴2superscriptsubscript2subscript𝐴2𝑄2subscript𝐴12subscript𝐴22subscript𝐴2𝑄2subscript𝐴12subscript𝐴2superscript𝑒italic-ȷ𝜋superscript𝑡22differential-d𝑡\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \overset{{(b_{2% })}}{=}\frac{1}{Q}\!\!\left|\frac{1}{\sqrt{2A_{2}}}\int_{\sqrt{2A_{2}}(-\frac{% Q}{2}+\frac{A_{1}}{2A_{2}})}^{\sqrt{2A_{2}}(\frac{Q}{2}+\frac{A_{1}}{2A_{2}})}% \!e^{(\frac{\jmath\pi t^{2}}{2})}\mathrm{d}t\right|start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG = end_ARG divide start_ARG 1 end_ARG start_ARG italic_Q end_ARG | divide start_ARG 1 end_ARG start_ARG square-root start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG end_ARG ∫ start_POSTSUBSCRIPT square-root start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ( - divide start_ARG italic_Q end_ARG start_ARG 2 end_ARG + divide start_ARG italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT square-root start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ( divide start_ARG italic_Q end_ARG start_ARG 2 end_ARG + divide start_ARG italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ) end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT ( divide start_ARG italic_ȷ italic_π italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT roman_d italic_t |
=|02A2(Q2+A12A2)e(ȷπt22)dt02A2(Q2+A12A2)e(ȷπt22)dt2A2Q|,absentsuperscriptsubscript02subscript𝐴2𝑄2subscript𝐴12subscript𝐴2superscript𝑒italic-ȷ𝜋superscript𝑡22differential-d𝑡superscriptsubscript02subscript𝐴2𝑄2subscript𝐴12subscript𝐴2superscript𝑒italic-ȷ𝜋superscript𝑡22differential-d𝑡2subscript𝐴2𝑄\displaystyle=\!\!\left|\!\!\frac{\int_{0}^{\sqrt{2A_{2}}(\frac{Q}{2}+\frac{A_% {1}}{2A_{2}})}\!\!e^{(\frac{\jmath\pi t^{2}}{2})}\!\mathrm{d}t\!\!-\!\!\int_{0% }^{\sqrt{2A_{2}}(-\frac{Q}{2}+\frac{A_{1}}{2A_{2}})}\!\!e^{(\frac{\jmath\pi t^% {2}}{2})}\mathrm{d}t}{\sqrt{2A_{2}}Q}\!\right|,= | divide start_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT square-root start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ( divide start_ARG italic_Q end_ARG start_ARG 2 end_ARG + divide start_ARG italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ) end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT ( divide start_ARG italic_ȷ italic_π italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT roman_d italic_t - ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT square-root start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ( - divide start_ARG italic_Q end_ARG start_ARG 2 end_ARG + divide start_ARG italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ) end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT ( divide start_ARG italic_ȷ italic_π italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT roman_d italic_t end_ARG start_ARG square-root start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG italic_Q end_ARG | ,

where (b1)subscript𝑏1(b_{1})( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) is due to the approximation from the summation to the integral and (b2)subscript𝑏2(b_{2})( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) is obtained by setting A2(q+A12A2)2=t22subscript𝐴2superscript𝑞subscript𝐴12subscript𝐴22superscript𝑡22A_{2}(q+\frac{A_{1}}{2A_{2}})^{2}=\frac{t^{2}}{2}italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_q + divide start_ARG italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = divide start_ARG italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG. Let β1=A12A2=Δr0d0(1θ02)subscript𝛽1subscript𝐴12subscript𝐴2Δsubscript𝑟0subscript𝑑01superscriptsubscript𝜃02\beta_{1}=\frac{A_{1}}{2A_{2}}=\Delta\sqrt{\frac{r_{0}}{d_{0}(1-\theta_{0}^{2}% )}}italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = divide start_ARG italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG = roman_Δ square-root start_ARG divide start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 1 - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG end_ARG and β2=A2Q2=QU2d0(1θ02)r0subscript𝛽2subscript𝐴2𝑄2𝑄𝑈2subscript𝑑01superscriptsubscript𝜃02subscript𝑟0\beta_{2}=\frac{\sqrt{A_{2}}Q}{2}=\frac{QU}{2}\sqrt{\frac{d_{0}(1-\theta_{0}^{% 2})}{r_{0}}}italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = divide start_ARG square-root start_ARG italic_A start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG italic_Q end_ARG start_ARG 2 end_ARG = divide start_ARG italic_Q italic_U end_ARG start_ARG 2 end_ARG square-root start_ARG divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 1 - italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG end_ARG, (43) can be simplified as

f^(r0,θ0;θ)^𝑓subscript𝑟0subscript𝜃0𝜃\displaystyle\hat{f}\left(r_{0},\theta_{0};\theta\right)over^ start_ARG italic_f end_ARG ( italic_r start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ; italic_θ ) =|0β1+β2e(ȷπt22)dt0β1β2e(ȷπt22)dt2β2|absentsuperscriptsubscript0subscript𝛽1subscript𝛽2superscript𝑒italic-ȷ𝜋superscript𝑡22differential-d𝑡superscriptsubscript0subscript𝛽1subscript𝛽2superscript𝑒italic-ȷ𝜋superscript𝑡22differential-d𝑡2subscript𝛽2\displaystyle=\left|\frac{\int_{0}^{\beta_{1}+\beta_{2}}e^{(\frac{\jmath\pi t^% {2}}{2})}\mathrm{d}t-\int_{0}^{\beta_{1}-\beta_{2}}e^{(\frac{\jmath\pi t^{2}}{% 2})}\mathrm{d}t}{2\beta_{2}}\right|= | divide start_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT ( divide start_ARG italic_ȷ italic_π italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT roman_d italic_t - ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT ( divide start_ARG italic_ȷ italic_π italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT roman_d italic_t end_ARG start_ARG 2 italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG | (44)
=G(β1,β2),absent𝐺subscript𝛽1subscript𝛽2\displaystyle=G(\beta_{1},\beta_{2}),= italic_G ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ,

where G(β1,β2)(C^(β1,β2)+ȷ(S^(β1,β2))/(2β2)G(\beta_{1},\beta_{2})\triangleq(\widehat{C}(\beta_{1},\beta_{2})+\jmath(% \widehat{S}(\beta_{1},\beta_{2}))/(2\beta_{2})italic_G ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≜ ( over^ start_ARG italic_C end_ARG ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) + italic_ȷ ( over^ start_ARG italic_S end_ARG ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ) / ( 2 italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ), C^(β1,β2)C(β1+β2)C(β1β2)^𝐶subscript𝛽1subscript𝛽2𝐶subscript𝛽1subscript𝛽2𝐶subscript𝛽1subscript𝛽2\widehat{C}(\beta_{1},\beta_{2})\triangleq{C}(\beta_{1}+\beta_{2})-C(\beta_{1}% -\beta_{2})over^ start_ARG italic_C end_ARG ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≜ italic_C ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_C ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) and S^(β1,β2)S(β1+β2)S(β1β2)^𝑆subscript𝛽1subscript𝛽2𝑆subscript𝛽1subscript𝛽2𝑆subscript𝛽1subscript𝛽2\widehat{S}(\beta_{1},\beta_{2})\triangleq S(\beta_{1}+\beta_{2})-S(\beta_{1}-% \beta_{2})over^ start_ARG italic_S end_ARG ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≜ italic_S ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_S ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). Specifically, C(x)=0xcos(π2t2)dt𝐶𝑥superscriptsubscript0𝑥𝜋2superscript𝑡2differential-d𝑡C(x)=\int_{0}^{x}\cos(\frac{\pi}{2}t^{2}){\rm d}titalic_C ( italic_x ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT roman_cos ( divide start_ARG italic_π end_ARG start_ARG 2 end_ARG italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) roman_d italic_t and S(x)=0xsin(π2t2)dt𝑆𝑥superscriptsubscript0𝑥𝜋2superscript𝑡2differential-d𝑡S(x)=\int_{0}^{x}\sin(\frac{\pi}{2}t^{2}){\rm d}titalic_S ( italic_x ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT roman_sin ( divide start_ARG italic_π end_ARG start_ARG 2 end_ARG italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) roman_d italic_t are the Fresnel integrals. The proof of Lemma 2 .

References

  • [1] M. Cui, Z. Wu, Y. Lu, X. Wei, and L. Dai, “Near-field MIMO communications for 6G: Fundamentals, challenges, potentials, and future directions,” IEEE Commun. Mag., vol. 61, no. 1, pp. 40–46, Jan. 2023.
  • [2] C. You, Y. Zhang, C. Wu, Y. Zeng, B. Zheng, L. Chen, L. Dai, and A. L. Swindlehurst, “Near-field beam management for extremely large-scale array communications,” arXiv preprint arXiv:2306.16206, 2023.
  • [3] Z. Wang, J. Zhang, H. Du, W. E. I. Sha, B. Ai, D. Niyato, and M. Debbah, “Extremely large-scale MIMO: Fundamentals, challenges, solutions, and future Directions,” IEEE Wireless Commun., 2023, Early Access.
  • [4] Q. Wu, S. Zhang, B. Zheng, C. You, and R. Zhang, “Intelligent reflecting surface-aided wireless communications: A tutorial,” IEEE Trans. Commun., vol. 69, no. 5, pp. 3313–3351, May 2021.
  • [5] Y. Liu, Z. Wang, J. Xu, C. Ouyang, X. Mu, and R. Schober, “Near-field communications: A tutorial review,” IEEE Open J. Commun. Society, vol. 4, pp. 1999–2049, Aug. 2023.
  • [6] L. U. Khan, W. Saad, D. Niyato, Z. Han, and C. S. Hong, “Digital-twin-enabled 6G: Vision, architectural trends, and future directions,” IEEE Commun. Mag., vol. 60, no. 1, pp. 74–80, Jan. 2022.
  • [7] A. Paul, K. Singh, M.-H. T. Nguyen, C. Pan, and C.-P. Li, “Digital twin-assisted space-air-ground integrated networks for vehicular edge computing,” IEEE J. Sel. Topics Signal Process., vol. 18, no. 1, pp. 66–82, Jan. 2024.
  • [8] Y. Zhang, B. Di, H. Zhang, and L. Song, “Near-far field beamforming for holographic multiple-input multiple-output,” J. Commun. Inf. Netw., vol. 8, no. 2, pp. 99–110, Jun. 2023.
  • [9] E. Björnson, Ö. T. Demir, and L. Sanguinetti, “A primer on near-field beamforming for arrays and reconfigurable intelligent surfaces,” in Proc. 55th Asilomar Conf. Signals Syst. Comput., Pacific Grove, CA, USA, Nov. 2021, pp. 105–112.
  • [10] C. You, Y. Cai, Y. Liu, M. Di Renzo, T. M. Duman, A. Yener, and A. L. Swindlehurst, “Next generation advanced transceiver technologies for 6G,” arXiv preprint arXiv:2403.16458, 2024.
  • [11] J. An, C. Yuen, L. Dai, M. D. Renzo, M. Debbah, and L. Hanzo, “Near-field communications: Research advances, potential, and challenges,” IEEE Commun. Mag., 2024,Early Access.
  • [12] H. Zhang, N. Shlezinger, F. Guidi, D. Dardari, M. F. Imani, and Y. C. Eldar, “Beam focusing for near-field multiuser MIMO communications,” IEEE Trans. Wireless Commun., vol. 21, no. 9, pp. 7476–7490, Sept. 2022.
  • [13] H. Zhang, N. Shlezinger, F. Guidi, D. Dardari, and Y. C. Eldar, “6G wireless communications: From far-field beam steering to near-field beam focusing,” IEEE Commun. Mag., vol. 61, no. 4, pp. 72–77, Apr. 2023.
  • [14] P. Nepa and A. Buffi, “Near-field-focused microwave antennas: Near-field sha** and implementation,” IEEE Antennas Propag. Mag., vol. 59, no. 3, pp. 42–53, Jun. 2017.
  • [15] J. Cong, C. You, J. Li, L. Chen, B. Zheng, Y. Liu, W. Wu, Y. Gong, S. **, and R. Zhang, “Near-field integrated sensing and communication: Opportunities and challenges,” arXiv preprint arXiv:2310.01342, 2023.
  • [16] Y. Chen, Z. Ren, J. Xu, Y. Zeng, D. W. K. Ng, and S. Cui, “Integrated sensing, communication, and powering (ISCAP): Towards multi-functional 6G wireless networks,” arXiv preprint arXiv:2401.03516, 2024.
  • [17] Y. Liu, C. Ouyang, Z. Wang, J. Xu, X. Mu, and A. L. Swindlehurst, “Near-field communications: A comprehensive survey,” arXiv preprint arXiv:2401.05900, 2024.
  • [18] H. Zhang, N. Shlezinger, F. Guidi, D. Dardari, M. F. Imani, and Y. C. Eldar, “Near-field wireless power transfer for 6G internet of everything mobile networks: Opportunities and challenges,” IEEE Commun. Mag., vol. 60, no. 3, pp. 12–18, Mar. 2022.
  • [19] Y. Han, S. **, M. Matthaiou, T. Q. S. Quek, and C.-K. Wen, “Toward extra large-scale MIMO: New channel properties and low-cost designs,” IEEE Internet of Things J., vol. 10, no. 16, pp. 14 569–14 594, Aug. 2023.
  • [20] Z. Wu and L. Dai, “Multiple access for near-field communications: SDMA or LDMA?” IEEE J. Sel. Areas Commun., vol. 41, no. 6, pp. 1918–1935, Jun. 2023.
  • [21] K. Zhi, C. Pan, H. Ren, K. K. Chai, C.-X. Wang, R. Schober, and X. You, “Performance analysis and low-complexity design for XL-MIMO with near-field spatial non-stationarities,” IEEE J. Sel. Areas Commun., vol. 42, no. 6, pp. 1656–1672, Apr. 2024.
  • [22] Y. Li, S. Gong, H. Liu, C. Xing, N. Zhao, and X. Wang, “Near-field beamforming optimization for holographic XL-MIMO multiuser systems,” IEEE Trans. Commun., vol. 72, no. 4, pp. 2309–2323, Apr. 2024.
  • [23] Y. Zhang and C. You, “SWIPT in mixed near- and far-field channels: Joint beam scheduling and power allocation,” IEEE J. Sel. Areas Commun., Early Access, 2024.
  • [24] J. Chen, Y. Xiao, K. Liu, Y. Zhong, X. Lei, and M. Xiao, “Physical layer security for near-field communications via directional modulation,” IEEE Trans. Veh. Technol., Early Access, 2024, doi:10.1109/TVT.2024.3382324.
  • [25] H. Wang, Z. Xiao, and Y. Zeng, “Cramér-rao bounds for near-field sensing with extremely large-scale MIMO,” IEEE Trans. Signal Process, Jan. 2024.
  • [26] Y. Zhang, B. Di, H. Zhang, M. Dong, L. Yang, and L. Song, “Dual codebook design for intelligent omni-surface aided communications,” IEEE Trans. Wireless Commun., vol. 21, no. 11, pp. 9232–9245, May 2022.
  • [27] M. Cui and L. Dai, “Channel estimation for extremely large-scale MIMO: Far-field or near-field?” IEEE Trans. Commun., vol. 70, no. 4, pp. 2663–2677, Apr. 2022.
  • [28] Y. Zhang, X. Wu, and C. You, “Fast near-field beam training for extremely large-scale array,” IEEE Wireless Commun. Lett., vol. 11, no. 12, pp. 2625–2629, Dec. 2022.
  • [29] W. Liu, H. Ren, C. Pan, and J. Wang, “Deep learning based beam training for extremely large-scale massive MIMO in near-field domain,” IEEE Commun. Lett., vol. 27, no. 1, pp. 170–174, Jan. 2023.
  • [30] G. Jiang and C. Qi, “Near-field beam training based on deep learning for extremely large-scale MIMO,” IEEE Commun. Lett., vol. 27, no. 8, pp. 2063–2067, Aug. 2023.
  • [31] C. Wu, C. You, Y. Liu, L. Chen, and S. Shi, “Two-stage hierarchical beam training for near-field communications,” IEEE Trans. Veh. Technol., vol. 73, no. 2, pp. 2032–2044, Feb. 2024.
  • [32] X. Shi, J. Wang, Z. Sun, and J. Song, “Spatial-chirp codebook-based hierarchical beam training for extremely large-scale massive MIMO,” IEEE Trans. Wireless Commun., 2023, Early Access.
  • [33] Y. Lu, Z. Zhang, and L. Dai, “Hierarchical beam training for extremely large-scale MIMO: From far-field to near-field,” IEEE Trans. Commun., vol. 72, no. 4, pp. 2247–2259, Apr. 2024.
  • [34] S. Noh, M. D. Zoltowski, and D. J. Love, “Multi-resolution codebook based beamforming sequence design in millimeter-wave systems,” in Proc. IEEE Global Commun. Conf. (GLOBECOM), San Diego, CA, USA, 2015, pp. 1–6.
  • [35] C. Ouyang, Z. Wang, B. Zhao, X. Zhang, and Y. Liu, “On the impact of reactive region on the near-field channel gain,” arXiv preprint arXiv:2404.08343, 2024.
  • [36] H. Lu, Y. Zeng, C. You, Y. Han, J. Zhang, Z. Wang, Z. Dong, S. **, C.-X. Wang, T. Jiang et al., “A tutorial on near-field XL-MIMO communications towards 6G,” arXiv preprint arXiv:2310.11044, 2023.
  • [37] S. Zhang and R. Zhang, “Capacity characterization for intelligent reflecting surface aided MIMO communication,” IEEE J. Sel. Areas Commun., vol. 38, no. 8, pp. 1823–1838, Jun. 2020.
  • [38] W. Liu, C. Pan, H. Ren, F. Shu, S. **, and J. Wang, “Low-overhead beam training scheme for extremely large-scale RIS in near field,” IEEE Trans. Commun., vol. 71, no. 8, pp. 4924–4940, May 2023.
  • [39] Y. Zhang, C. You, L. Chen, and B. Zheng, “Mixed near- and far-field communications for extremely large-scale array: An interference perspective,” IEEE Commun. Lett., vol. 27, no. 9, pp. 2496–2500, Sept. 2023.
  • [40] A. F. Molisch, V. V. Ratnam, S. Han, Z. Li, S. L. H. Nguyen, L. Li, and K. Haneda, “Hybrid beamforming for massive MIMO: A survey,” IEEE Commun. Mag., vol. 55, no. 9, pp. 134–141, Sep. 2017.
  • [41] A. Kosasih and E. Björnson, “Finite beam depth analysis for large arrays,” arXiv preprint arXiv:2306.12367, 2023.
  • [42] H. Wang and Y. Zeng, “Can sparse arrays outperform collocated arrays for future wireless communications?” arXiv preprint arXiv:2307.07925, 2023.
  • [43] X. Wu, C. You, J. Li, and Y. Zhang, “Near-field beam training: Joint angle and range estimation with DFT codebook,” IEEE Trans. Wireless Commun., Early Access, 2024, doi:10.1109/TWC.2024.3385749.
  • [44] W. Liu, C. Pan, H. Ren, J. Wang, R. Schober, and L. Hanzo, “Near-field multiuser beam-training for extremely large-scale MIMO systems,” arXiv preprint arXiv:2402.13597, 2024.