Near-field Beam Training with Sparse DFT Codebook

Cong Zhou, Chenyu Wu, Changsheng You, , Jiasi Zhou, Shuo Shi Cong Zhou, Chenyu Wu and Shuo Shi are with the School of Electronic and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China. (e-mail: [email protected], [email protected], [email protected]).Changsheng You is with the Department of Electronic and Electrical Engineering, Southern University of Science and Technology, Shenzhen 518055, China. (e-mails: [email protected]).Jiasi Zhou is with the School of Medical Information and Engineering, Xuzhou Medical University, Xuzhou, 221004, China. (e-mails: [email protected]).Corresponding author: Chenyu Wu and Changsheng You.

Abstract

Extremely large-scale arrays (XL-arrays) have emerged as one promising technology to improve the spectral efficiency and spatial resolution in future sixth generation (6G) wireless systems. The drastic increase in the number of antennas renders the communication users more likely to be located in the near-field region, which requires a more accurate spherical (instead of planar) wavefront propagation modeling. However, this also inevitably incurs unaffordable beam training overhead when performing a two-dimensional (2D) beam-search in both the angular and range domains. To address this issue, we first introduce in this paper a new sparse discrete Fourier transform (DFT) codebook, which exhibits the angular periodicity in the received beam pattern at the user. This thus motivates us to propose a three-phase beam training scheme. Specifically, in the first phase, we utilize the sparse DFT codebook for beam swee** in an angular subspace and estimate candidate user angles according to the received beam pattern. Then, a central subarray is activated to scan specific candidate angles for resolving the issue of angular ambiguity for identifying the user angle. In the third phase, the polar-domain codebook is applied in the estimated angle to search the best effective user range. Finally, numerical results show that our proposed beam training scheme enabled by the sparse DFT codebook achieves 98.67% beam training overhead reduction as compared to the exhaustive-search scheme, yet without compromising rate performance in the high signal-to-ratio (SNR) regime.

Index Terms:

Extremely large-scale array, near-field communications, beam training, DFT codebook, sparse array.

I Introduction

Extremely large-scale arrays/surfaces (XL-arrays/surfaces) have been envisioned as one of the key ingredients to drive the evolution of six generation (6G) wireless systems[1, 2, 3, 4]. Specifically, XL-arrays/surfaces with a significant number of antennas can be deployed at the base station (BS) to achieve ultra-high spectral efficiency and spatial resolution, hence accommodating the escalating demands for new applications such as spanning metaverse and digital twin [5, 6, 7]. The drastic increase in the number of antennas in high-frequency bands represents a qualitative paradigm shift in the electromagnetic (EM) propagation modeling, giving rise to the new near-field communications [8].

Particularly, different from the far-field EM propagation which is simply approximated by planar waves, the near-field channel modeling necessitates the use of more accurate spherical waves[9, 10, 11]. As such, near-field communications possess several unique properties in contrast to far-field communications. First, the spherical wavefront characteristic opens up the possibility of near-field beamfocusing, for which the beam energy can be concentrated at a specific location/region rather than a spatial angle typically for far-field beamforming[12, 13, 14]. The beam-focusing capability of near-field communications enables XL-array to flexibly form highly directional beams in both the angle and range domain, and hence can be leveraged in various applications to improve the system performance, such as mitigating the inter-user interference, improving the accuracy of sensing and localization[15, 16, 17], and enhancing the charging efficiency of wireless power transfer[18]. Second, the rank of line-of-sight (LoS) channels for near-field multiple-input multiple-output (MIMO) communication systems can be larger than one, hence enhancing the spatial multiplexing gains [19]. In this paper, we propose an efficient near-field beam training scheme with an sparse discrete Fourier transform (DFT) codebook by using the sparse antenna activation method, which significantly reduces the beam training overhead.

I-A Related Works

I-A1 Near-Field Wireless Systems

Near-field communications bring new opportunities and challenges, which has motivated upsurging research interest. For example, in [12], the authors studied the beamfocusing design based on the fully-digital architectures, hybrid phase shifter-based precoders, and dynamic metasurface antenna architecture for XL-MIMO arrays. It is shown that near-field beamfocusing provides new degree of freedom (DoF) to mitigate the interference among users, even when they are located at the same spatial angle. In addition, a new concept of location division multiple access (LDMA) was proposed in [20], which exploits orthogonality of the near-field beamfocusing vectors in the range domain to serve different users at the same angle. The authors in [21] developed a framework for analyzing and designing XL-MIMO systems with spatial non-stationarity. It was revealed that the performance of the proposed framework approaches that of the conventional full-antenna array based designs albeit with lower complexity. In addition, a holographic metasurface antennas (HMAs) based multi-user system was investigated in [22], where the digital transmit precoder and the analog HMA weighting matrix were jointly optimized to minimize the transmit power. The authors in [23] considered a simultaneous wireless information and power transfer (SWIPT) system, where energy harvesting (EH) and information decoding receivers are located in the near- and far-field regions of the XL-array, respectively. The beam scheduling and power allocation were jointly optimized to maximize the weighted-sum power harvested at EH receivers [23]. Moreover, a directional modulation system was developed for near-field physical layer security systems[24]. Specifically, a fully analog precoding algorithm along with artificial noise and power allocation was proposed to realize secure transmission in both the angular and range domains. Near-field sensing with XL-array was studied in [25], where the closed-form expressions of the Cramér-Rao Bounds for both the angle and range estimations are derived.

I-A2 Near-field Beam Training

In high-frequency bands, direct channel estimation methods may not be very effective due to severe path-loss and signal misalignment. As such, beam training is efficient in establishing initial links with high signal-to-noise ratio (SNR) for data transmission and channel state information (CSI) acquisition [26]. However, due to the spherical wave propagation, near-field beam training is more challenging compared with its far-field counterpart as it requires a joint beam search over both the angular and range domain. Particularly, conventional far-field beam training will suffer from significant performance loss in the near-field region due to the so-called energy-spread effect, for which the energy of a far-field beamformer is no longer steered towards one angle, but spread in multiple angles. Hence, the conventional far-field beam training method cannot be directly applied to the near-field beam training. To address this issue, the authors in [27] proposed a new codebook design in the polar domain, for which the angular domain is uniformly sampled whereas the range domain is non-uniformly sampled. One can simply invoke this codebook for the exhaustive-search based near-field beam training. However, the overhead of this scheme is the product of the number of antennas and range samples, which is prohibitively high for the implementation of XL-arrays. To reduce the overhead, the authors in [28] proposed a novel two-phase near-field beam training method. This method leveraged a key observation that the true user angle approximately lies in the middle of an angular support with high received SNRs. Based on this observation, one can first estimate the user angle by using far-field DFT beams, and then estimate the user range with the polar-domain codebook in [27]. Besides, deep learning techniques have also been exploited in [29] and [30] to reduce the near-field beam training overhead, where deep neural networks (DNN) are trained based on conventional far-field codebooks and near-field codebook, respectively. However, the training overhead of these methods scales linearly with the number of antennas, which is still unaffordable for communication systems. This issue motivates the design of efficient hierarchical beam training schemes for near-field communications to reduce the training overhead to the logarithmic order, e.g., [31, 32, 33]. However, hierarchical beam training schemes suffer from several inherent drawbacks. First, hierarchical methods usually require frequent feedback and neglect underlying transmission delays. Moreover, there exists the error propagation issue due to the progressive beam search [34].

I-B Motivations and Contributions

The existing works on near-field beam training inevitably incur unacceptable overhead, while the off-grid channel estimation methods face highly computational complexity such as high dimensional matrix inversion. Moreover, the beam training overhead of these existing works scales linearly with the number of antennas apart from hierarchical beam training schemes, which suffers from several inherent drawbacks such as error propagation and user feed-back overhead [34]. Motivated by the above, this paper explores a new sparse DFT codebook and a three-phase beam training scheme to reduce the near-field beam training overhead, which scales with the square root of the number of antennas. The main contributions are summarized as follows.

•

First, we propose a novel sparse DFT codebook by sparsely activating the XL-array antennas and constructing the reduced DFT codebook with the equivalent sparse linear array (SLA). Specifically, the sparse DFT codebook consists of sparse far-field channel response vectors, which is sampled from far-field channel response vectors. Then, we characterize the received beam pattern at the near-field user when the sparse DFT codebook is used for beam swee**. Interestingly, it is shown that the received beam pattern exhibits periodicity in the angular domain, while there still exists the energy-spread effect. Then, we show that the user angle information is contained in a period of the received beam pattern at the user and can be estimated via a defined angular support.
•

Second, we propose a novel three-phase beam training scheme based on the sparse DFT codebook. Specifically, in the first phase, we utilize a small number of the sparse DFT codewords to sweep an angular subspace and estimate one candidate user angle according to the middle of the defined angular support. Then, in the second phase, we activate a central subarray to resolve the angular ambiguity by virtue of the periodicity of the received beam pattern at the user. Subsequently, the polar-domain codebook is utilized to search the best user range in the estimated user angle.
•

Finally, extensive numerical results are presented to demonstrate the effectiveness of our proposed beam training scheme enabled by the sparse DFT codebook. It is shown that the proposed three-phase beam training scheme can achieve nearly the same performance with the exhaustive-search beam training method in the high-SNR regime, while reducing more than 98% of the training overhead. In the low-SNR regime, the proposed scheme suffers from slight performance loss, while the effective rate still significantly exceeds all benchmark schemes due to lower beam training overhead.

I-C Organization and Notations

The remainder of this paper is organized as follows. System model is presented in Sections II. In Section III, we introduce several benchmarks. Section IV provides comprehensive analysis of the received beam pattern at the user with the sparse DFT codebook. Then, in Section V, the proposed three-phase beam training scheme is elaborated. Finally, numerical results are provided in section VI to demonstrate the effectiveness of the proposed beam training scheme followed by the conclusions made in Section VII.

Notations: Vectors and matrices are respectively denoted by lower-case and upper-case boldface letters. The symbol $\left|\cdot\right|$ represents the absolute value, while $\left\lVert\cdot\right\lVert$ denotes the $l_{2}$ norm. Moreover, we use $(\cdot)^{H}$ to denote the conjugate transpose operation. Finally, the Hadamard product is represented by $\odot$ . The key symbols used in this paper are listed in Table I.

Table I: List of main symbols and their physical meanings.

$N$	Number of BS antennas	$U$	Antenna activation interval
$D$	Array aperture size	$M$	Number of antennas of the activated subarray
$\lambda$	Carrier wavelength	$z$	AWGN
$d_{0}$	Antenna spacing	$Q$	Number of antennas of the activated SLA
$\mathbf{h}^{H}_{\rm near}$	Near-field channel	$\kappa$	Rician factor
$L$	Number of channel paths	${{{P}}_{\rm tot}}$	Transmit SNR
$\theta_{0}$	BS center-user spatial angle	${{\mathcal{W}}}_{\rm{DFT}}$	Sparse DFT codebook
$r_{0}$	BS center-user range	$\beta$	Channel gain
$\mathbf{b}\left(r_{0},\theta_{0}\right)$	Near-field channel steering vector	$\Delta$	Difference of spatial angles
${\mathbf{w}}$	Beamforming vector	$f({r_{0}},{\theta_{0}};\theta)$	Received beam pattern
${{\mathcal{V}}}_{\rm{Sub}}$	DFT codebook for the activated subarray	$\bar{\mathcal{X}}_{\rm{Pol}}$	Polar-domain codebook

II System Model

We consider a single-user XL-array downlink communication system, where the BS is equipped with a dense uniform linear array (ULA) with $N$ antennas. In this section, the near-field channel and signal model for the ULA are introduced.

II-A Near-field Channel Model

We assume that the dense ULA is situated at the $y$ -axis and centered at the origin. Specifically, each antenna of XL-array is located at ( $0,nd_{0}$ ), where $n\in\mathcal{N}\triangleq\{0,\pm 1,\cdots,\pm\frac{N-1}{2}\}$ and $d_{0}$ respectively denote the antenna index and inter spacing. For the dense ULA, we have $d_{0}=\frac{\lambda}{2}$ , where $\lambda$ represents the carrier wavelength. Moreover, the single user is assumed to be located in the Fresnel near-field region of the XL-array where the BS-user range $r_{0}$ is larger than the Fresnel distance $Z_{\rm F}=\max{\{d_{R},1.2D\}}$ and smaller than the Rayleigh distance $Z_{\rm R}=\frac{2D^{2}}{\lambda}$ with $D=(N-1)d_{0}$ denoting the array aperture. Moreover, $d_{R}$ is proven to be several wavelengths in [35] and the Fresnel distance can be simplified by $Z_{\rm F}=1.2D$ . Hence, the line-of-sight (LoS) channel follows the uniform spherical wave (USW) model [36]. For example, when $N=257$ and $f=30$ GHz, the Rayleigh distance is approximately $328$ m, which makes the user more likely to be located in the near-field region. Then the general multi-path channel from the XL-array to the user can be modeled as [23]

\mathbf{h}^{H}_{\rm near}=\sqrt{N}\beta\mathbf{b}^{H}(r_{0},\theta_{0})+\sum_{% \ell=1}^{L}\sqrt{\frac{N}{L}}\beta_{\ell}\mathbf{b}^{H}(\bar{r}_{\ell},\bar{% \theta}_{\ell}),

(1)

which includes one LoS path $\mathbf{h}_{\rm LoS}^{H}$ and $L$ non-LoS (NLoS) paths. Herein, the parameters $r_{0}$ ( ${\bar{r}_{\ell}}$ ) and $\theta_{0}$ ( $\bar{\theta}_{\ell}$ ) represent the range and spatial angle of the LoS ( $\ell$ -th NLoS) signal path. Moreover, the parameters $\beta$ and $\beta_{\ell}$ denote the LoS path and $\ell$ -th NLoS path gain, respectively. Mathematically, $\beta$ can be modeled as [37]

\beta=\sqrt{\frac{\kappa}{\kappa+1}}\frac{\sqrt{\beta_{0}}}{r_{{0}}}e^{-\frac{% \jmath 2\pi r_{{0}}}{\lambda}},

(2)

where $\kappa$ and $\beta_{0}$ represent the Rician factor and reference channel gain at a range of 1 m, respectively.

Refer to caption — Figure 1: A narrow-band Near-field XL-array communication system.

In this paper, we mainly consider the near-field communication scenarios in high-frequency bands such as millimeter-wave (mmWave) and even terahertz (THz). In these scenarios, the NLoS channel paths exhibit negligible power owing to the severe path-loss and shadowing effects [38]. Therefore, we only consider the LoS channel and the BS-user channel can be approximated as $\mathbf{h}^{H}_{\rm near}\approx\mathbf{h}_{\rm LoS}$ ¹¹1The case where comparable multi-path components exist is more complicated and discussions are provided in the Section V-D. We will evaluate the Rician fading channel in the simulation results.. Based on USW model, the near-field LoS channel from BS $\to$ user can be modeled as [28]

\mathbf{h}^{H}_{\rm near}\approx\sqrt{N}\beta\mathbf{b}^{H}(r_{0},\theta_{0}),

(3)

where $\mathbf{b}^{H}(r_{0},\theta_{0})$ denotes the near-field channel steering vector, defined as [39]

\left[\mathbf{b}^{H}\left(r_{0},\theta_{0}\right)\right]_{n}=\frac{1}{\sqrt{N}% }e^{-\frac{\jmath 2\pi r_{n}}{\lambda}},\forall n\in\mathcal{N},

(4)

with $r_{n}=\sqrt{r_{0}^{2}+{n}^{2}d_{0}^{2}-2r_{0}\theta_{0}nd_{0}}$ denoting range between the $n$ -th antenna and the user. Moreover, $\theta_{0}=\cos\phi_{0}\in[-1,1]$ represents the spatial angle at the BS, with $\phi_{0}$ denoting the physical angle-of-departure (AoD) from the BS center to the user. Further, by means of Fresnel approximation, $r_{n}$ can be approximated as

r_{n}\approx r_{0}-nd_{0}\theta_{0}+\frac{n^{2}d_{0}^{2}(1-\theta_{0}^{2})}{2r% _{0}},

(5)

which is shown to be accurate in [5].

II-B Signal Model

Let $x\in\mathbb{C}$ denote the transmitted symbol by the BS with unit power and $\bar{\mathbf{w}}\in\mathbb{C}^{N\times 1}$ represent the beamforming vector [40]. Then the received signal at the user is given by

\displaystyle y(\bar{\mathbf{w}})=\sqrt{N}\beta\sqrt{P_{\rm tot}}\mathbf{b}^{H% }(r_{0},\theta_{0})\bar{\mathbf{w}}x+z,

(6)

where $z$ is the received additive white Gaussian noise (AWGN) and $z\sim\mathcal{CN}\left(0,\sigma^{2}\right)$ . Moreover, $P_{\rm tot}$ denotes the total transmit power of the BS. Then, the achievable rate in bits/second/hertz (bps/Hz) is given by

R=\log_{2}\left(1+\frac{P_{\rm tot}N|\beta|^{2}|\mathbf{b}^{H}(r_{0},\theta_{0% })\bar{\mathbf{w}}|^{2}}{\sigma^{2}}\right).

(7)

III Benchmark Beam Training Schemes

In this section, two benchmark beam training schemes and their drawbacks are presented.

III-A 2D Exhaustive-Search Beam Training Method

The authors in [27] proposed a polar-domain codebook, each steering a focusing beam to a specific location. In particular, the angular domain is uniformly sampled, while the range domain is non-uniformly sampled. Specifically, the polar-domain codebook is given by

\bar{\mathcal{W}}_{\rm{Pol}}=\{\bar{\mathcal{W}}_{1},\cdots,\bar{\mathcal{W}}_% {\bar{n}},\cdots,\bar{\mathcal{W}}_{N}\},

(8)

where $\bar{\mathcal{W}}_{\bar{n}}=\{\bar{\mathbf{w}}_{\bar{n},1}\cdots,\bar{\mathbf{% w}}_{\bar{n},v},\cdots\bar{\mathbf{w}}_{\bar{n},V}\}$ denotes the sub-codebook steering $V$ beams towards the angles $\theta_{\bar{n}}=\frac{2{\bar{n}}-N+1}{N},\forall{\bar{n}}\in\bar{\mathcal{N}}% \triangleq\{1,2,\cdots,N\}$ . Mathematically, we have $\bar{\mathbf{w}}_{{\bar{n}},v}=\mathbf{b}\left(r_{{\bar{n}},v},\theta_{\bar{n}% }\right)$ where $r_{{\bar{n}},v}=\frac{1}{v}\alpha_{\Delta}\left(1-\theta_{{\bar{n}}}^{2}\right% ),\quad\forall v\in\mathcal{V}\triangleq\{1,2,3,\cdots V\}$ with $\alpha_{\Delta}\triangleq\frac{N^{2}d_{0}^{2}}{2\lambda\beta_{\Delta}^{2}}$ . Moreover, $\beta_{\Delta}$ is a constant corresponding to the quantization loss in the range domain [27]. Given the polar-domain codebook $\bar{\mathcal{W}}_{\rm{Pol}}$ , a two-dimensional exhaustive search method can be directly applied in both the angular and range domains to search the best codeword, which yields the maximum received SNR at the user. The beam training overhead of this exhaustive-search beam training method is $T^{\rm{(ex)}}=NV$ , which is proportional to the product of the number of antennas and range samples. When the number of antenna is large, the beam training overhead is unaffordable.

III-B Two-Phase Near-field Beam Training

To further reduce the beam training overhead, the authors in [28] proposed a two-phase near-field beam training method, which explored the so-called energy-spread phenomenon. Specifically, when the far-field DFT codebook is used for the angular swee** in the near-field, it is observed that the user angle approximated lies in the middle of an angular support region, which is to be defined in Section IV. Mathematically, the conventional DFT codebook is given by

\displaystyle\bar{\mathbf{w}}_{\bar{n}}

\displaystyle=\mathbf{a}(\theta_{\bar{n}})\triangleq\frac{1}{\sqrt{N}}\left[1,% e^{-\jmath\pi\theta_{\bar{n}}},\cdots,e^{-\jmath\pi(N-1)\theta_{\bar{n}}}% \right],

(9)

where $\theta_{\bar{n}}=\frac{2{\bar{n}}-N+1}{N},\forall{\bar{n}}\in\bar{\mathcal{N}}$ . Given this observation, they explore the conventional DFT codebook in the first phase to perform beam swee**, which estimates the user angle information. Then, given the candidate user angle, the polar-domain codebook in (8) is used to search the best user range in the second phase. The beam training overhead of the two-phase beam training method is $T^{(\rm 2P)}=N+KV$ with $K$ representing the number of candidate user angles. Although this method significantly reduces the beam training overhead of the exhaustive-search method, the overhead of the two-phase beam training method is still proportional to the number of antennas, which is prohibitively high as $N$ is sufficiently large.

To address the above issues, we propose a new near-field beam training method using a proposed sparse DFT codebook, which is equivalent to sparsely activating the dense ULA equipped by the BS, yielding extremely lower overhead as compared with various benchmark schemes.

IV Received Beam Pattern of the Sparse DFT Codebook

In this section, we first introduce the sparse DFT codebook and then analyze its received beam pattern. Moreover, we design a periodical beam training codebook to reduce overhead and propose to activate a central subarray for resolving the angular ambiguity. The main definitions in this section are given as follows, including the received beam pattern and angular support.

Definition 1.

Given a fixed near-field user located at $(r_{0},\theta_{0})$ and an arbitrary far-field beamforming vector $\bar{\mathbf{w}}=\mathbf{a}(\theta)$ steering the beam towards the angle $\theta$ , the received beam pattern at the user is defined as

f(r_{0},\theta_{0};\mathbf{\theta})\triangleq|\mathbf{b}^{H}(r_{0},\theta_{0})% \mathbf{a}(\theta)|,\forall\theta.

(10)

Definition 2.

Given a near-field channel response vector $\mathbf{b}^{H}(r_{0},\theta_{0})$ and a far-field beamforming vector $\bar{\mathbf{w}}=\mathbf{a}(\theta)$ with $\theta\in\mathcal{L}$ , the $3$ -dB angular support $\mathcal{A}_{\mu}^{\mathcal{L}}(r_{0},\theta_{0})$ in the region ${\mathcal{L}}$ is defined by [28]

\displaystyle\mathcal{A}^{\mathcal{L}}_{\mu}(r_{0},\theta_{0})\!\!=\!\!\left\{% \theta\mid f\left(r_{0},\theta_{0},\theta\right)\!>\!\kappa\max_{\theta\in% \mathcal{L}}f\left(r_{0},\theta_{0},\theta\right)\right\},

(11)

where $\kappa=10^{\mu/10}$ . Moreover, let $\theta_{\rm left}$ and $\theta_{\rm right}$ be the smallest and largest angle in $\mathcal{A}^{\mathcal{L}}_{\mu}(r_{0},\theta_{0})$ . Then, its angular support width is defined as

\Gamma_{\mu}^{\mathcal{L}}(r_{0},\theta_{0})=\theta_{\rm right}-\theta_{\rm left}.

(12)

IV-A Sparse DFT Codebook

Each column of the DFT Codebook in (9) is a far-field channel response vector. For the sparse DFT codebook, we sample each column of the original DFT codebook using an interval of $U$ , while the positions that are not sampled are padded with zeros. As such, $\bar{n}$ -th column of the sparse DFT Codebook $\tilde{\mathbf{w}}_{\bar{n}}\in\mathbb{C}^{N\times 1}$ is given by $\tilde{\mathbf{w}}_{\bar{n}}=\bar{\mathbf{w}}_{\bar{n}}\odot\mathbf{n}(U)$ , where

\mathbf{n}^{H}(U)=[1,\underbrace{0,\cdots,0,}_{U-1}1,\cdots,1]

(13)

denotes the sampling vector. It can be verified that the number of non-zero elements in $\tilde{\mathbf{w}}_{\bar{n}}$ is $Q=\frac{N-1}{U}+1$ (assuming $Q$ is an integer for convenience). Moreover, we rearrange the non-zero elements in $\tilde{\mathbf{w}}_{\bar{n}}$ into a new vector ${\mathbf{w}_{\bar{n}}}\in\mathbb{C}^{Q\times 1}$ , which is given by

{\mathbf{w}}_{\bar{n}}=\mathbf{a}_{\rm SLA}(\theta_{\bar{n}})=\frac{1}{\sqrt{Q% }}\bigg{[}1,e^{\jmath{\pi}U\theta_{\bar{n}}},\cdots,e^{\jmath{\pi}(Q-1)U\theta% _{\bar{n}}}\bigg{]}^{H}\!\!,

(14)

referred to as the sparse far-field beamforming vector.

It is noteworthy that this sampling method is equivalent to sparsely activating a number of antennas of the the XL-array with an interval of $U$ , which effectively transforms the equipped ULA into an SLA with an inter-element spacing of $Ud_{0}$ , as illustrated in Fig. 2. Furthermore, for brevity, we denote the channel response vector of the effective SLA as

\left[\mathbf{b}^{H}_{\rm SLA}\left(r_{0},\theta_{0}\right)\right]_{q}=\frac{1% }{\sqrt{Q}}e^{-\frac{\jmath 2\pi r_{q}}{\lambda}},\forall q\in\mathcal{Q},

(15)

where $\mathcal{Q}\triangleq\{0,\pm 1,\cdots,\pm\frac{Q-1}{2}\}$ denotes the set of the SLA antenna index and $r_{q}=\sqrt{r_{0}^{2}-2qUd_{0}r_{0}\theta_{0}+(qUd_{0})^{2}}$ represents the range between the user and $q$ -th antenna of the SLA. Similar to (5), $r_{q}$ can be approximated as $r_{q}\approx r-qUd_{0}\theta_{0}+\frac{q^{2}(Ud_{0})^{2}(1-\theta_{0}^{2})}{2r% _{0}}$ with Fresnel approximation. Then, the received signal with beamforming vector $\mathbf{w}_{\bar{n}}$ at the user is rewritten as

\displaystyle y(\mathbf{w}_{\bar{n}})=\sqrt{Q}\beta\sqrt{P_{\rm tot}}\mathbf{b% }^{H}_{\rm SLA}(r_{0},\theta_{0})\mathbf{w}_{\bar{n}}x+z.

(16)

Moreover, the received beam pattern at the user in (10) can be rewritten as

	$\displaystyle f(r_{0},\theta_{0};\theta_{\bar{n}})$	$\displaystyle=\|\mathbf{b}^{H}(r_{0},\theta_{0}){\bar{\mathbf{w}}_{\bar{n}}}\|=\|% \mathbf{b}^{H}_{\rm SLA}(r_{0},\theta_{0}){\mathbf{w}}_{\bar{n}}\|$		(17)
		$\displaystyle=\|\mathbf{b}_{\rm SLA}^{H}(r_{0},\theta_{0})\mathbf{a}_{\rm SLA}(% \theta_{\bar{n}})\|.$		(17)

IV-B Near-field Received Beam Pattern

To obtain more insights, we first characterize the received beam pattern at the user with sparse far-field beamforming vectors spanning in the continuous spatial angles. Let $\mathbf{w}(\theta)=\mathbf{a}_{\rm SLA}(\theta)$ represent a sparse far-field beamforming vector, for which $\theta=\theta_{\bar{n}},\forall{\bar{n}}\in{\bar{\mathcal{N}}}$ is the discrete sampled angle. We first characterize the received beam pattern of the sparse far-field beamforming vector ${\mathbf{w}}$ as follows.

Lemma 1.

For a sampled beamforming vector ${\mathbf{w}}$ parameterized by $\{Q,U\}$ , the received beam pattern at the user is given by

	$\displaystyle f\left(r_{0},\theta_{0};\theta\right)$
$\displaystyle\!\!\!\!\!\!\overset{(a_{1})}{\approx}$	$\displaystyle\frac{1}{Q}\!\!\left\|\sum_{q\in\mathcal{Q}}\!\exp{\left(% \underbrace{\jmath{\pi qU\Delta}}_{B_{1}}+\underbrace{\jmath{\frac{\pi}{% \lambda}q^{2}(Ud_{0})^{2}\frac{1-\theta_{0}^{2}}{r_{0}}}}_{B_{2}}\right)}\!% \right\|\!$	(18)
$\displaystyle\triangleq$	$\displaystyle\hat{f}(r_{0},\theta_{0};\theta),$

where $\Delta\triangleq\theta-\theta_{\rm 0}$ and $(a_{1})$ is due to the Fresnel approximation and shown to be accurate in [41].

Proposition 1 (The periodicity of $\theta$ ).

$f\left(r_{0},\theta_{0};\theta\right)$ is a periodic function of $\theta$ with the period of $\frac{2}{U}$ . Mathematically, we have

f\left(r_{0},\theta_{0};\theta\right)=f\left(r_{0},\theta_{0};\theta+\frac{2k}% {U}\right),\forall k\in\mathbb{Z}.

(19)

Proof: For an arbitrary integer $k$ and ${\bf{a}}(\theta+\frac{2k}{U})$ , we have

	$\displaystyle{[{\bf{a}}(\theta+{2k}/{U})]_{q}}$	$\displaystyle=\exp(\jmath\pi qU(\theta+{2k}/{U}))$
		$\displaystyle=\exp(\jmath\pi qU\theta)\exp(\jmath 2\pi kqU)$
		$\displaystyle=\exp(\jmath\pi qU\theta)={[{\bf{a}}(\theta)]_{q}}.$

Hence, we have $f\left(r_{0},\theta_{0};\theta\right)=f\left(r_{0},\theta_{0};\theta+k\frac{2}% {U}\right)$ and thus complete the proof. $\Box$

In Fig. 3, we plot the received beam pattern versus the spatial angle $\theta$ , where the periodicity of $\theta$ is exhibited. Moreover, it is observed that the user angle is located in the middle of the angular support $\mathcal{A}^{\mathcal{L}_{0}}_{\mu}(r_{0},\theta_{0})$ , where $\mathcal{L}_{0}=[\theta_{0}-1/U,\theta_{0}+1/U)$ (a period $\frac{2}{U}$ ). Then, in the following, we prove that this observation holds for arbitrary user location $(r_{0},\theta_{0})$ .

Due to the periodicity of $\theta$ , we only need to focus on the region $\mathcal{L}_{0}$ , i.e., $\Delta\in[-1/U,1/U)$ , which inexplicitly contains useful user angle information. Given $\Delta\in[-1/U,1/U)$ , (1) can be approximated as follows.

Lemma 2.

When $\Delta\in[-1/U,1/U)$ , the received beam pattern $\hat{f}(r_{0},\theta_{0};\theta)$ in (1) can be approximated as

\displaystyle\hat{f}\left(r_{0},\theta_{0};\theta\right)\approx\left|G(\beta_{% 1},\beta_{2})\right|.

(20)

Specifically, we have

G(\beta_{1},\beta_{2})\triangleq\frac{\widehat{C}(\beta_{1},\beta_{2})+\jmath(% \widehat{S}(\beta_{1},\beta_{2})}{2\beta_{2}},

(21)

where $\widehat{C}(\beta_{1},\beta_{2})\triangleq{C}(\beta_{1}+\beta_{2})-C(\beta_{1}% -\beta_{2})$ and $\widehat{S}(\beta_{1},\beta_{2})\triangleq S(\beta_{1}+\beta_{2})-S(\beta_{1}-% \beta_{2})$ . Further, $C(x)$ and $S(x)$ are Fresnel integrals, which are given by

C(x)=\int_{0}^{x}\cos(\frac{\pi}{2}t^{2}){\rm d}t,S(x)=\int_{0}^{x}\sin(\frac{% \pi}{2}t^{2}){\rm d}t.

Moreover, $\beta_{1}$ and $\beta_{2}$ are given by

\beta_{1}=\Delta\sqrt{\frac{r_{0}}{d_{0}(1-\theta_{0}^{2})}},\leavevmode% \nobreak\ \leavevmode\nobreak\ \beta_{2}=\frac{QU}{2}\sqrt{\frac{d_{0}(1-% \theta_{0}^{2})}{r_{0}}}.

(22)

Proof: Please refer to Appendix A. $\Box$

As observed from (22), we have

\beta_{1}\beta_{2}=\frac{QU\Delta}{2}\approx\frac{D}{2d_{0}}\Delta.

Hence, the function $G(\cdot)$ can be rewritten as a function of $\{\Delta,\beta_{2}\}$ . It is observed that if the array aperture is fixed, $\beta_{2}$ is only determined by the user location $(r_{0},\theta_{0})$ , which indicates that each user location corresponds to a specific $\beta_{2}$ . Furthermore, as the user moves farther from the BS (consequently, the user is more likely to be located in the far-field region), the value of $\beta_{2}$ decreases. Supposing that $N=257$ , $U=16$ and $f=30$ GHz and the user is located at the Fresnel and focusing region ( $\theta_{0}=0$ ), we have $\beta_{2}\in[1.68,3.57]$ . In Fig. 4, we numerically show the function of $G(\cdot)$ . Importantly, for any near-field users, the so-called energy-spread phenomenon [27] is still observed. Moreover, due to the period of $\theta$ , the energy-spread effects exist in each period interval ( $\frac{2}{U}$ ). Then, two key observations are obtained.

Observation 1.

In Figs. 3 and 4, the energy-spread effects exhibit in the whole radiating near-field region, and the received beam pattern with sampled beamforming vector $\mathbf{w}$ contains useful user angle information:

User angle information: It is observed that the actual user angle approximately locates in the middle of the $3$ dB angular support in one period where $\theta\in{\mathcal{L}_{0}}=[\theta_{0}-1/U,\theta_{0}+1/U]$ . Mathematically, we have

\theta_{0}\approx{\rm{Med}}(\mathcal{A}_{3}^{\mathcal{L}_{0}}(\theta_{0},r_{0}% )).

(23)

Moreover, we define $u$ -th angular support as $\mathcal{A}_{3}^{\mathcal{L}_{u}}(\theta_{0},r_{0}),$ where ${\mathcal{L}_{u}}={\mathcal{L}_{0}+\frac{2u}{U}}$ , $\forall u\in\mathcal{U}\triangleq\{\pm 1,\pm 2,\cdots,\pm(U-1)\}$ . If we can employ the middle of $u$ -th angular support denoted by $\theta_{u}\approx{\rm{Med}}(\mathcal{A}_{3}^{\mathcal{L}_{u}}(\theta_{0},r_{0}))$ , the BS can infer that the user angle is among the following candidate angles $\theta_{u}\approx{\rm{Med}}(\mathcal{A}_{3}^{\mathcal{L}_{u}}(\theta_{0},r_{0}% )),$ where ${\mathcal{L}_{\rm c}}={\mathcal{L}_{u}+\frac{2c}{U}}$ , $\forall c\in\mathcal{U}$ , due to the period of $\frac{2}{U}$ .

2)

Although energy spread effects can provide actual user angle information, it is worth noting that when the user is located near the boundary of Fresnel region (in Fig. 4, the boundary corresponds to $\beta_{2}=3.57$ and $r_{0}=7.42$ m ), the angular support is distorted, which affects the accuracy of the beam training results. Moreover, by taking the noise and power fluctuation into account, the actual user angle may slightly deviate from the middle of the angular support.

Observation 1 indicates that the actual user angle can be estimated by finding the middle of the angular support in a period. In other words, the sparse DFT codebook within the angle range of a period (for example $\theta\in[-1/U,1/U)$ ) contained all information for which the BS can infer $U$ candidate angles for actual user angle.

V Proposed Beam Training Scheme Enabled by Sparse DFT Codebook

In this section, we propose a three-phase near-field beam training method enabled by the sparse DFT codebook. Then, an optimization problem is formulated to minimize the beam training overhead.

V-A Phase 1: Beam Swee** with the Sparse DFT codebook

In Section IV-B, we show that the received beam pattern with the sparse DFT codebook exhibits a period of $2/U$ and the user angle information can be inferred from the middle of the angular support within the region $\mathcal{L}_{0}$ .

This motivates us to perform angular swee** in a period of $\theta$ (steering beams varying from $-1/U$ to $1/U$ ) to estimate the potential user angles in the first phase, thereby further decreasing the beam training overhead. Considering that the beam width of the sparse far-field beamforming vector (14) is $\frac{4}{QU}$ [42], we sample the angular space as

\theta_{s}=\frac{2s-QU-1}{QU},s=1,\cdots,QU.

Then, the sparse DFT codebook in the first phase to perform the angular swee** is given by

{{\mathcal{W}}}_{\rm{DFT}}\!\!=\!\!\{{{\mathbf{w}}}_{\frac{QU-Q+1}{2}},\!% \cdots\!,{{\mathbf{w}}}_{g},\cdots,{{\mathbf{w}}}_{\frac{QU+Q-1}{2}}\},

(24)

where ${{\mathbf{w}}}_{g}=\mathbf{a}_{\rm SLA}(\theta_{g})$ , $\theta_{g}=\frac{2g-QU-1}{QU}$ and the index $g\in\mathcal{G}\triangleq\{\frac{QU-Q+1}{2},\cdots,\frac{QU+Q-1}{2}\}$ .

Then, the BS sequentially transmits $Q$ pilot symbols with the sparse DFT codebook in (24), while it tunes beam angles varying from $-1/U$ to $1/U$ as illustrated in Fig. 5(a). For each codeword, the received signal power at the user is given by

p(\mathbf{w}_{g})=|\sqrt{Q}\beta\mathbf{b}^{H}(r_{0},\theta_{0})\mathbf{w}_{g}% x+z|^{2},\leavevmode\nobreak\ \forall g\in\mathcal{G}.

(25)

However, it is worth noting that when we transmit sparse far-field beamforming vectors varying from $-1/U$ to $1/U$ , the received beam pattern may exhibit a shifted angular support within two periods instead of a whole angular support in one period as illustrated in Fig. 6. To obtain a regular angular support, the user needs to perform the received-beam-pattern shifting based on the index of the codeword $\mathbf{w}_{g}$ with the lowest power. We denote the codeword with the lowest power as $\mathbf{w}_{\ell}$ corresponding to the angle $\theta_{\ell}$ . Specifically, we shift the angle larger than $\theta_{\ell}$ by one period ( $2/U$ ) as illustrated in Fig. 6. Then, the shifted indices of the codewords are arranged in a vector $\mathbf{s}=[{\ell-Q+1},\cdots,{\ell-Q+2},\cdots,\ell]$ . Moreover, the equivalent received power of the shifted codewords can be recast as

	$\displaystyle\mathcal{P}=\{$	$\displaystyle p(\mathbf{w}_{\ell+1}),p(\mathbf{w}_{\ell+2}),\cdots,p(\mathbf{w% }_{(QU+Q-1)/2}),$
		$\displaystyle p(\mathbf{w}_{(QU-Q+1)/2}),p(\mathbf{w}_{(QU-Q+3)/2}),\cdots,p(% \mathbf{w}_{\ell})\}.$

As such, we obtain a complete and regular angular support, which involves the user’s angle information based on Observation 1. Specifically, the indices of shifted codewords with significantly high power received by the users are given by

\mathcal{S}=\{\mathbf{s}_{\delta}|p({\mathbf{w}_{\delta}})>\kappa\max\mathcal{% P}\},

(26)

where $\kappa\approx 0.5$ . Then, the estimated angle is given by $\theta_{\check{s}}=\frac{2\check{s}-QU-1}{QU}$ , where $\check{s}=\rm{Med}(\mathcal{S})$ and the corresponding codeword is $\mathbf{w}_{\check{s}}$ . Then, according to the periodicity, the BS can infer $U$ candidate user angles, which are given by

\theta_{\rm c}={\theta}_{\check{s}}+\frac{2u}{U},\forall u\in\mathcal{U}.

(27)

V-B Phase 2: Angular Ambiguity Elimination

In the second phase, we propose an efficient method to resolve the angular ambiguity. The key idea is to utilize a central subarray to sequentially examine the candidate angles in (27). Specifically, we activate a central subarray with $M$ antennas to eliminate the angular ambiguity, for which the codebook ${{\mathcal{V}}}_{\rm{Sub}}$ is presented as follows

{{\mathcal{V}}}_{\rm{Sub}}\!\!=\!\!\{{{\mathbf{v}}}_{1},\!\cdots\!,{{\mathbf{v% }}}_{s},\cdots,{{\mathbf{v}}}_{QU}\},

(28)

where ${\mathbf{v}}_{s}$ is given by

{\mathbf{v}}_{s}=\!\!\frac{1}{\sqrt{M}}\bigg{[}\underbrace{0,\cdots,0}_{\frac{% N-M}{2}},e^{\jmath{\pi}\frac{N-M}{2}\theta_{s}},\!\cdots\!,e^{\jmath{\pi}\frac% {N+M-2}{2}\theta_{s}},\underbrace{0,\cdots,0}_{\frac{N-M}{2}},\bigg{]}^{H}\!\!.

(29)

It is noteworthy that there are two criteria for ensuring the effectiveness of angular ambiguity elimination. The first one is that the user should be located in the far-field region of the activated central subarray to avoid the energy-spread effect. Moreover, considering that the interval between two adjacent candidate angles is $\frac{2}{U}$ , extra angular ambiguity will be introduced if the beam width of the central subarray is wider than $\frac{4}{U}$ . Therefore, the second criterion is that the beam width of the central subarray is required to be smaller than $\frac{4}{U}$ , which ensures no interference between two adjacent candidate angles.

To satisfy the first criterion, we set the Rayleigh distance of the subarray to be less than the Fresnel distance of the XL-array, which is given by

\frac{2M^{2}d_{0}^{2}}{\lambda}\leq 1.2D.

(30)

Then, (30) can be simplified as $M\leq\sqrt{1.2{(N-1)}}$ .

For the second criterion, we have

\frac{4}{M}\leq\frac{4}{U},

(31)

which can be simplified to $M\geq U$ . Therefore, the number of antennas of the central subarray needs to satisfy

U\leq M\leq\sqrt{{1.2(N-1)}}.

(32)

Moreover, from (32), the activation interval $U$ has the following constraint $U\leq\sqrt{1.2{(N-1)}}$ . For example, given a setup where $N=257$ and $U=16$ , we have $16\leq M\leq 17.5$ . In Fig. 7, we plot the beam width of the subarray with $M=17$ . It is observed that two adjacent candidate user angles have no considerable mutual interference, which indicates the effectiveness of angular ambiguity elimination.

Based on the above, the BS activates a central subarray comprising $M$ antennas satisfying (32) to resolve the angular ambiguity. For the candidate user angles ${\theta}_{\check{s}}+u\frac{2}{U},\forall u\in\mathcal{U}$ in (27), the codewords chosen from (28) to resolve the angular ambiguity are $\mathbf{v}_{\check{s}+uQ},\forall u\in\mathcal{U}$ . Then, the BS sequentially transmits $U$ pilot symbols with beamforming vectors $\{\mathbf{v}_{\check{s}+uQ}\}$ as illustrated in Fig. 5 (b). For each codeword, the received signal power at the user in the second phase is given by

p(\mathbf{v}_{\check{s}+uQ})=|\sqrt{M}\beta\mathbf{b}^{H}(r_{0},\theta_{0})% \mathbf{v}_{\check{s}+uQ}x+z|^{2},\leavevmode\nobreak\ \forall u\in\mathcal{U}.

(33)

Then, we can obtain the optimal codeword $\mathbf{v}_{s^{\ast}}$ via simple comparisons of received signal power in (33), which is given by

{s}^{\ast}=\arg\max_{u\in\mathcal{U}}p(\mathbf{v}_{\check{s}+uQ}).

(34)

Then, the estimated user angle can be obtained by $\theta_{s^{\ast}}=\frac{2s^{\ast}-QU-1}{QU}$ .

V-C Phase 3: Beam Swee** with Polar-domain Codebook

Once we determine the user angle $\theta_{s^{\ast}}$ in the second phase, we can use the polar-domain codebook to perform the range domain swee**, achieving beamfocusing gain in the near-field. The polar-domain codebook utilized in the third phase is given by

\bar{\mathcal{X}}_{\rm{Pol}}=\{{\mathcal{X}}_{1},\cdots,{\mathcal{X}}_{s},% \cdots,{\mathcal{X}}_{QU}\},

(35)

where ${\mathcal{X}}_{s}=\{{\mathbf{x}}_{s,1},\cdots,{\mathbf{x}}_{s,v},\cdots,{% \mathbf{x}}_{s,V}\}$ and ${\mathbf{x}}_{s,v}=\mathbf{b}^{H}(r_{s,v},\theta_{s})$ with $r_{s,v}=\frac{1}{v}\alpha_{\Delta}\left(1-\theta_{s}^{2}\right)$ [27]. Moreover, $V$ denotes the number of range samples.

Specifically, the codewords employed in the third phase are given by $\mathcal{X}_{s^{\ast}}=\{{\mathbf{x}}_{s^{\ast},1},\cdots,{\mathbf{x}}_{s^{% \ast},v},\cdots,{\mathbf{x}}_{s^{\ast},V}\}$ . Then, the BS activates the whole XL-array and sequentially transmits training $V$ symbols in the estimated user angle $\theta_{s^{\ast}}$ with codewords $\mathcal{X}_{s^{\ast}}$ as illustrated in Fig 5(c). For each codeword, the received signal power at the user is given by

p({\mathbf{x}}_{s^{\ast},v})=|\sqrt{N}\beta\mathbf{b}^{H}(r_{0},\theta_{0}){% \mathbf{x}}_{s^{\ast},v}x+z|^{2},\leavevmode\nobreak\ \forall v\in\mathcal{V}.

(36)

Subsequently, the optimal codeword is determined through straightforward comparisons of received signal power in (36) and the best codeword index is given by

{v}^{\ast}=\arg\max_{v\in\mathcal{V}}p({\mathbf{x}}_{s^{\ast},v}).

(37)

Therefore, we can obtain the user location $(r_{{s}^{\ast},{v}^{\ast}},\theta_{{s}^{\ast}})$ with $r_{{s}^{\ast},{v}^{\ast}}=\frac{1}{{v}^{\ast}}\alpha_{\Delta}\left(1-\theta_{{% s}^{\ast}}^{2}\right)$ .

The detailed procedures of the proposed three-phase beam training method are summarized in Algorithm 1.

Algorithm 1 Proposed Three-phase Beam Training Method

1: Phase 1: Angular swee** in the subspace.

2: Use the sparse DFT codebook

\mathcal{W}_{\rm DFT}

for the beam swee** in the angular subspace

[-1/U,1/U)

3: Perform the received-beam-pattern shifting to obtain a regular angular support

\mathcal{P}

4: Obtain

U

candidate user angles

\theta_{\rm c}={\theta}_{\check{s}}+\frac{2u}{U},\forall u\in\mathcal{U}

according to the middle of the angular support

\mathcal{P}

5: Phase 2: Resolve angular ambiguity.

6: Activate a central subarray with

M

antennas and use codebook

{{\mathcal{V}}}_{\rm{Sub}}

to examine

U

candidate angles estimated in the first phase.

7: Obtain the best user angle

\theta_{s^{\ast}}

according to the highest power received at the user with respective to

p(\mathbf{v}_{\check{s}+uQ})

8: Phase 3: Range swee**.

9: Use polar-domain codebook

\bar{\mathcal{X}}_{\rm{Pol}}

with codewords

\mathcal{X}_{s^{\ast}}

to sweep the range domain in the user angle

\theta_{s^{\ast}}

10: Obtain the best user range

r_{s^{\ast},v^{\ast}}

according to the highest power received at the user with respective to

p({\mathbf{x}}_{s^{\ast},v})

V-D Discussions

Beam training overhead: In the first phase, the overhead is $Q=\frac{N-1}{U}+1$ . Moreover, the overhead of the second and third phases are $U$ and $V$ , respectively. Finally, the overall overhead of the proposed beam training scheme is $T^{\rm(3P)}=Q+U+V$ . Given $Q=\frac{N-1}{U}+1\in\mathbb{Z}^{+}$ with $\mathbb{Z}^{+}$ denoting the positive integer set, the beam training overhead of the proposed method enabled by the sparse DFT codebook can be recast as

T^{\rm(3P)}=\frac{N-1}{U}+U+V+1.

(38)

It can be easily obtained from (38) that as $U$ increases, the overhead during the first phase decreases, while it increases during the subsequent phase, and vice versa. Thus, there is a fundamental trade-off between the first and second phase of the beam training method depending on the value of $U$ . Then, we aim to minimize the beam training overhead $T^{\rm(3P)}$ and the optimization problem is formulated as


$\displaystyle({\bf P1}):\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\$	$\displaystyle\min_{U}\leavevmode\nobreak\ F(U)=\frac{N-1}{U}+U+V+1$
	$\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ {\text{s}}{\text{.t}}{% \rm{.}}\leavevmode\nobreak\ U\leq\sqrt{{1.2(N-1)}},$	(39a)
	$\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ U\in\mathcal{F}% \triangleq\{\frac{N-1}{U}\!+\!1\in\mathbb{Z}^{+},U\in\mathbb{Z}^{+}\}.$	(39b)

If we remove the integer constraint (39b), $({\bf P1})$ is a convex problem. Then, Problem $({\bf P1})$ can be rewritten as


$\displaystyle({\bf P2}):\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\$	$\displaystyle\min_{U}\leavevmode\nobreak\ F(U)=\frac{N-1}{U}+U+V+1$
	$\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ {\text{s}}{\text{.t}}{% \rm{.}}\leavevmode\nobreak\ U\leq\sqrt[4]{1.2{(N-1)}}.$	(40a)

Problem (P2) is an convex optimization problem, whose optimal solution can be easily obtained as follows due to convexity.

Lemma 3.

The optimal solution to Problem (P2) is given by $U^{\ast}_{P_{2}}=\sqrt{(N-1)}$ . Moreover, the overhead of the proposed method is $T^{(\rm 3P)}=2\sqrt{(N-1)}+V+1$ .

Proof: It is observed that $\frac{N-1}{U}+U+V+1\geq 2\sqrt{N-1}+V+1$ , where the equal holds with $U=\sqrt{(N-1)}$ . Moreover, considering $U=\sqrt{(N-1)}\leq\sqrt{1.2{(N-1)}}$ , the optimal solution to Problem $({\bf P2})$ is $U^{\ast}_{P_{2}}=\sqrt{(N-1)}$ with the beam training overhead $T^{(\rm 3P)}=2\sqrt{(N-1)}+V+1$ . $\Box$

When the integer constraint (39b) is taken into account, a suboptimal solution to Problem (P1) is given by

\hat{U}=\arg\min\limits_{\mathcal{F}}\{|U^{\ast}_{\rm P_{2}}-f|,f\in\mathcal{F% }\}.

(41)

Next, we present an example to illustrate the low overhead feature of the proposed beam training method. We consider a setup with $N=1025$ and $V=5$ . Then, $U=32$ can minimize $T^{(\rm 3P)}$ according to the optimal solution to Problem $({\bf P1})$ . The beam training overhead of the proposed multi-beam training scheme in this setup is $T^{(\rm 3P)}=2\sqrt{N-1}+V+1=64+5+1=70$ , which is significantly smaller than that of the exhaustive-search method ( $T^{(\rm EX)}=5280$ ) and the two-phase near-field beam training method ( $T^{(\rm 2P)}=1061$ ).

Remark 1 (Improved scheme: Middle- $K$ angle selection).

Due to the power fluctuation [28], the middle angle of the angular support may not be accurate. To improve the estimation accuracy, we can select the middle- $K$ angles of the angular support instead of selecting one potential angle in the first procedure. Specifically, in the third phase, we should perform the beam swee** in the range domain in the $K$ potential angles $\theta_{\bar{s}_{k}},k=1,\cdots,K$ to determine an optimal polar-domain codeword. The overhead of the middle- $K$ angle scheme is given by $T^{(\rm 3P)}_{{\rm Mid}-K}=\frac{N-1}{U}+U+KV$ . It is noteworthy that this does not significantly increase the beam training overhead, which is still proportional to $\sqrt{N}$ .

Remark 2 (Estimation error).

From the above analysis, the accuracy of the proposed beam training scheme enabled by sparse DFT codebook is mainly dependent on two factors. First, the sampling resolution is a key factor because the proposed beam training scheme is an on-grid channel estimation method. Therefore, when the sampling interval is small enough, the performance of proposed multi-beam training scheme approaches that of the optimal beamformer without noise taking into account. Second, noise is another key issue. The estimated angle is derived from the received signal power in different time, consequently influenced by the received SNR. Hence, a higher SNR is expected to achieve more accurate angle estimation, which will be numerically verified in Section VI.

Remark 3 (Multi-path channels).

With respect to multi-path channel cases, we divide the extension of the proposed algorithm into two cases according to different Rician parameters.

•

LoS-dominant channel: When the Rician factor is large enough (or equivalently the LoS path is dominant), we can regard the NLoS components as environmental noise. Therefore, the proposed beam training scheme based on the sparse DFT codebook still holds as this method only depends on the LoS path component.
•

Comparable multi-path components: This case is much more complicated. Considering that the accuracy depends on the periodic angular supports, NLoS components may bring about randomly overlapped received beam pattern in the angular domain, which poses challenges to the angle estimation in the first phase of our proposed beam training scheme. Therefore, the case with comparable multi-path components is left as a topic for our future works.

Remark 4 (Universal in both near- and far-field communications).

The proposed beam scheme can be applied to both near- and far-field communications. We can identify the near- or far-field user according to the angular support width [43]. Specifically, in cases where the angular support width is small (e.g., only contains one candidate angle), it signifies that the user is located in the far-field region. Then, there is no need to perform the range estimation in the third phase (see Section V-C). In other words, for far-field users, only the first two phases of the proposed beam training scheme need to be executed.

VI Numerical Results

Numerical results are provided to validate the effectiveness of the proposed near-field beam training scheme in this section. We first present the system parameters and benchmark schemes followed by the performance comparison under numerous setups.

VI-A System Setup and Benchmark Schemes

We set the system parameters as follows. We consider that the XL-array is equipped with $N=257$ antennas and $f=30$ GHz. The transmit power and reference channel gain at $1$ m are set as $P_{\rm tol}=30$ dBm and $\beta_{0}=(\frac{4\pi}{\lambda})^{2}=-62$ dB, respectively. Moreover, the noise power is $\sigma^{2}=-80$ dBm. According to $({\bf P1})$ , the optimal activation interval is set as $U=16$ . Furthermore, the antenna number of the activated central subarray is $M=17$ . With respective to NLoS paths, we set $L=2$ and $\kappa_{k}=30$ dB [29, 32]. The reference SNR for a user is defined by $\gamma=\frac{NP_{\rm tol}\beta_{0}}{r_{\rm 0}^{2}\sigma^{2}}$ [28]. To characterize the overhead, we assume that the total transmission time and a pilot symbol time are $T_{\mathrm{tol}}=0.2$ ms and $T_{\mathrm{s}}=0.1$ $\mathrm{\SIUnitSymbolMicro s}$ [44], respectively. Then, the effective rate is defined by $R_{\mathrm{Eff}}=\left(1-\frac{T_{\rm over}T_{\mathrm{s}}}{T_{\mathrm{tol}}}% \right)R$ , where $T_{\rm over}$ denotes the beam training overhead of each scheme. All the numerical results are averaged over 1000 channel realizations. The following benchmark schemes are considered for performance comparison:

•

Perfect-CSI based beamforming: This scheme assumes that the BS perfectly aligns the user near-field channel and beamforming vector is set by $\bar{\mathbf{w}}=\mathbf{b}(r_{0},\theta_{0})$ Obviously, this scheme is the performance upper bound for all methods.
•

Least square channel estimation: This scheme is a classic off-grid channel estimation method where the user estimates the channels by $N$ pilot symbols transmitted by the BS in the downlink. The estimated channel is given by $\hat{\mathbf{h}}_{\mathrm{LS}}=\left(\mathbf{X}^{\mathrm{H}}\mathbf{X}\right)^% {-1}\mathbf{X}^{\mathrm{H}}\mathbf{y}$ , where $\mathbf{X}\in\mathbb{C}^{N\times N}$ denotes the pilot matrix for each user and $\mathbf{y}=\mathbf{X}\mathbf{h}+\mathbf{z}\in\mathbb{C}^{N\times 1}$ represents the received signal vector with $\mathbf{z}\sim\mathcal{CN}\left(\mathbf{0},\sigma^{2}\mathbf{I}\right)$ . Moreover, $\mathbf{h}$ is shown in (1). Obviously, the pilot overhead of this scheme is $T^{({\rm LS})}=N$ .
•

Exhaustive-search beam training: This scheme is detailed in Section III-A. Due to the different angle sampling interval, the overhead of this scheme is modified by $T^{\rm{(ex)}}=QUV$ .
•

Two-phase beam training: This scheme is detailed in Section III-B. Due to the different angle sampling interval, the overhead of this scheme is revised by $T^{\rm{(2P)}}=QU+KV$ .
•

Far-field beam training based on DFT codebook: Conventional DFT codebook is used to sweep the whole angular domain for choosing a best codeword for which the maximum received signal power is achieved at the user. The beam training overhead of this scheme is $T^{\rm(Far)}=QU$ .

VI-B Performance Analysis

In Fig. 8, we plot the achievable rate $R$ versus the reference SNR $\gamma$ under different beam training schemes. Some key observations can be concluded as follows. First, our proposed near-field beam training scheme achieves very close performance to the two-phase beam training and exhaustive search based beam training scheme, especially in the high-SNR regime (i.e., larger than 26 dB). Second, with the decrease of the reference SNR, the achievable rate attained by off-grid channel estimation degrades more dramatically than other schemes, and becomes inferior to our proposed scheme when the reference SNR is lower than 26 dB. This is because the direct channel estimation is more sensitive to the received SNR as the XL-array beamforming vector may not be well aligned with the channel path during signaling. Finally, there is a large performance gap between the conventional far-field beam training scheme and other schemes dedicated to near-field communications, which implies that far-field beam training is no longer effective for next generation wireless systems with more antennas.

Then, Fig. 11 illustrates the effective rate $R_{\mathrm{Eff}}$ versus the reference SNR $\gamma$ . Interestingly, our proposed scheme outperforms other schemes in terms of the effective rate except for the perfect-CSI based beamforming. This is because our scheme achieves close or even superior achievable rates compared with other benchmarks (as have shown in Fig. 8), but with far less training overhead. Moreover, the exhaustive search based near-field beam training scheme is not practically applicable due to small effective rate caused by large training overhead, as shown in Fig. 11.

Fig. 9 shows the effect of user range on the achievable rate. It can be observed that the proposed scheme with $K=3$ exhibits approximately the same performance as the exhaustive-search based and two-phase near-field beam training schemes for all user ranges. This is attributed to the smart design of the sparse DFT codebook. This method leads to a periodical energy-spread effect during beam training, and hence the former key observation in [28] can be leveraged for beam training. Second, the achievable rate performance of the proposed scheme largely outperforms the far-field beam training when the user range is less than 50 m and gradually converges to that of the far-field beam training. This verifies the universality of our proposed scheme for both near-field and far-field beam training cases. In addition, the relationship between effective rate and user range is depicted in Fig. 10. Considering the overhead of beam training, the effective rate attained by our scheme is only slightly lower than (less than 1 bps/Hz) that of the perfect-CSI based schemes, which further verifies its effectiveness.

In Fig. 12, we evaluate the impact of Rician factor on the system achievable rate. It can be observed that the achievable rates of all schemes increase with the Rician factor at first and gradually saturate when the Rician factor approximates 10 dB. Moreover, the two-phase beam training scheme slightly outperforms the proposed scheme when the Rician factor is less than 5 dB. This is because our scheme is more sensitive to noise due to operations such as received-beam-pattern shifting, since NLoS components can be treated as a form of environment noise.

In Fig. 13, we plot the achievable rate versus the number of antennas in the central subarray. It can be observed that the achievable rate of the proposed scheme suffers from significant performance loss when the number of antennas in the central subarray is sufficiently small or large. This can be explained by two facts: 1) When $M$ is sufficiently large (Violation of Criteria 1 in Section V-B), the user are more likely to be located in the near-field region of the central subarray, for which the energy-spread effect is dominant and thus results in significant performance loss; 2) When $M$ is sufficiently small (Violation of Criteria 2 in Section V-B), the beam width of the central subarray becomes too large, introducing extra ambiguity and thus failing to distinguish candidate user angles.

Finally, we plot the achievable rate versus the number of XL-array antennas in Fig. 14, where the user ranges are fixed at 80 m. It is observed that the achievable rates increase as there are more antennas for all schemes except for the far-field beam training scheme. This is because as the number of antennas increases, the near-field effect is more prominent, and hence the far-field beam training is no longer effective even when the user is located at a relatively far range (i.e., 80 m) from the XL-array.

VII Conclusion

In this paper, we proposed a novel near-field beam training scheme enabled by the sparse DFT codebook (sparse far-field beamforming vectors) to construct periodic received beam pattern at the user. To this end, we showed that the angular periodicity of the received beam pattern boosts reduction in swee** space, thereby significantly decreasing beam training overhead. Specifically, the middle of the angular support within a period contains the user angle information. Then, an activated central subarray can resolve the angular ambiguity followed by polar-domain codebook swee** in the best user angle. Finally, numerical results were presented to show that the proposed beam training scheme can achieve nearly the same performance in the high-SNR regime with the exhaustive-search scheme, while significantly reducing the beam training overhead.

Appendix A

Proof of Lemma 2

From (1), when $\Delta\in[-1/U,1/U)$ , $B_{1}$ is not a constant at $2k\pi$ for arbitrary integer $k$ as $q$ changes. Hence, we have

	$\displaystyle\hat{f}\left(r_{0},\theta_{0};\theta\right)=$	$\displaystyle\frac{1}{Q}\!\!\left\|\sum_{q\in\mathcal{Q}}\!\exp{\left(\jmath\pi% (A_{1}q+A_{2}q^{2})\right)}\right\|$		(42)
	$\displaystyle=$	$\displaystyle\frac{1}{Q}\!\!\left\|\sum_{q\in\mathcal{Q}}\!\exp{\left(\jmath\pi A% _{2}(q+\frac{A_{1}}{2A_{2}})^{2}\right)}\right\|$		(42)

where $A_{1}=U\Delta$ and $A_{2}=\frac{(Ud_{0})^{2}}{\lambda}\frac{1-\theta_{0}^{2}}{r_{0}}$ .

Then, the summation in (42) can be approximated by an integral, which is given by

		$\displaystyle\hat{f}\left(r_{0},\theta_{0};\theta\right)\overset{{(b_{1})}}{% \approx}\frac{1}{Q}\!\!\left\|\int_{-\frac{Q}{2}}^{\frac{Q}{2}}\!\exp{\left(% \jmath\pi(A_{1}q+A_{2}q^{2})\mathrm{d}q\right)}\right\|$		(43)
		$\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \overset{{(b_{2% })}}{=}\frac{1}{Q}\!\!\left\|\frac{1}{\sqrt{2A_{2}}}\int_{\sqrt{2A_{2}}(-\frac{% Q}{2}+\frac{A_{1}}{2A_{2}})}^{\sqrt{2A_{2}}(\frac{Q}{2}+\frac{A_{1}}{2A_{2}})}% \!e^{(\frac{\jmath\pi t^{2}}{2})}\mathrm{d}t\right\|$
		$\displaystyle=\!\!\left\|\!\!\frac{\int_{0}^{\sqrt{2A_{2}}(\frac{Q}{2}+\frac{A_% {1}}{2A_{2}})}\!\!e^{(\frac{\jmath\pi t^{2}}{2})}\!\mathrm{d}t\!\!-\!\!\int_{0% }^{\sqrt{2A_{2}}(-\frac{Q}{2}+\frac{A_{1}}{2A_{2}})}\!\!e^{(\frac{\jmath\pi t^% {2}}{2})}\mathrm{d}t}{\sqrt{2A_{2}}Q}\!\right\|,$

where $(b_{1})$ is due to the approximation from the summation to the integral and $(b_{2})$ is obtained by setting $A_{2}(q+\frac{A_{1}}{2A_{2}})^{2}=\frac{t^{2}}{2}$ . Let $\beta_{1}=\frac{A_{1}}{2A_{2}}=\Delta\sqrt{\frac{r_{0}}{d_{0}(1-\theta_{0}^{2}% )}}$ and $\beta_{2}=\frac{\sqrt{A_{2}}Q}{2}=\frac{QU}{2}\sqrt{\frac{d_{0}(1-\theta_{0}^{% 2})}{r_{0}}}$ , (43) can be simplified as

	$\displaystyle\hat{f}\left(r_{0},\theta_{0};\theta\right)$	$\displaystyle=\left\|\frac{\int_{0}^{\beta_{1}+\beta_{2}}e^{(\frac{\jmath\pi t^% {2}}{2})}\mathrm{d}t-\int_{0}^{\beta_{1}-\beta_{2}}e^{(\frac{\jmath\pi t^{2}}{% 2})}\mathrm{d}t}{2\beta_{2}}\right\|$		(44)
		$\displaystyle=G(\beta_{1},\beta_{2}),$		(44)

where $G(\beta_{1},\beta_{2})\triangleq(\widehat{C}(\beta_{1},\beta_{2})+\jmath(% \widehat{S}(\beta_{1},\beta_{2}))/(2\beta_{2})$ , $\widehat{C}(\beta_{1},\beta_{2})\triangleq{C}(\beta_{1}+\beta_{2})-C(\beta_{1}% -\beta_{2})$ and $\widehat{S}(\beta_{1},\beta_{2})\triangleq S(\beta_{1}+\beta_{2})-S(\beta_{1}-% \beta_{2})$ . Specifically, $C(x)=\int_{0}^{x}\cos(\frac{\pi}{2}t^{2}){\rm d}t$ and $S(x)=\int_{0}^{x}\sin(\frac{\pi}{2}t^{2}){\rm d}t$ are the Fresnel integrals. The proof of Lemma 2 .

References

[1] M. Cui, Z. Wu, Y. Lu, X. Wei, and L. Dai, “Near-field MIMO communications for 6G: Fundamentals, challenges, potentials, and future directions,” IEEE Commun. Mag., vol. 61, no. 1, pp. 40–46, Jan. 2023.
[2] C. You, Y. Zhang, C. Wu, Y. Zeng, B. Zheng, L. Chen, L. Dai, and A. L. Swindlehurst, “Near-field beam management for extremely large-scale array communications,” arXiv preprint arXiv:2306.16206, 2023.
[3] Z. Wang, J. Zhang, H. Du, W. E. I. Sha, B. Ai, D. Niyato, and M. Debbah, “Extremely large-scale MIMO: Fundamentals, challenges, solutions, and future Directions,” IEEE Wireless Commun., 2023, Early Access.
[4] Q. Wu, S. Zhang, B. Zheng, C. You, and R. Zhang, “Intelligent reflecting surface-aided wireless communications: A tutorial,” IEEE Trans. Commun., vol. 69, no. 5, pp. 3313–3351, May 2021.
[5] Y. Liu, Z. Wang, J. Xu, C. Ouyang, X. Mu, and R. Schober, “Near-field communications: A tutorial review,” IEEE Open J. Commun. Society, vol. 4, pp. 1999–2049, Aug. 2023.
[6] L. U. Khan, W. Saad, D. Niyato, Z. Han, and C. S. Hong, “Digital-twin-enabled 6G: Vision, architectural trends, and future directions,” IEEE Commun. Mag., vol. 60, no. 1, pp. 74–80, Jan. 2022.
[7] A. Paul, K. Singh, M.-H. T. Nguyen, C. Pan, and C.-P. Li, “Digital twin-assisted space-air-ground integrated networks for vehicular edge computing,” IEEE J. Sel. Topics Signal Process., vol. 18, no. 1, pp. 66–82, Jan. 2024.
[8] Y. Zhang, B. Di, H. Zhang, and L. Song, “Near-far field beamforming for holographic multiple-input multiple-output,” J. Commun. Inf. Netw., vol. 8, no. 2, pp. 99–110, Jun. 2023.
[9] E. Björnson, Ö. T. Demir, and L. Sanguinetti, “A primer on near-field beamforming for arrays and reconfigurable intelligent surfaces,” in Proc. 55th Asilomar Conf. Signals Syst. Comput., Pacific Grove, CA, USA, Nov. 2021, pp. 105–112.
[10] C. You, Y. Cai, Y. Liu, M. Di Renzo, T. M. Duman, A. Yener, and A. L. Swindlehurst, “Next generation advanced transceiver technologies for 6G,” arXiv preprint arXiv:2403.16458, 2024.
[11] J. An, C. Yuen, L. Dai, M. D. Renzo, M. Debbah, and L. Hanzo, “Near-field communications: Research advances, potential, and challenges,” IEEE Commun. Mag., 2024,Early Access.
[12] H. Zhang, N. Shlezinger, F. Guidi, D. Dardari, M. F. Imani, and Y. C. Eldar, “Beam focusing for near-field multiuser MIMO communications,” IEEE Trans. Wireless Commun., vol. 21, no. 9, pp. 7476–7490, Sept. 2022.
[13] H. Zhang, N. Shlezinger, F. Guidi, D. Dardari, and Y. C. Eldar, “6G wireless communications: From far-field beam steering to near-field beam focusing,” IEEE Commun. Mag., vol. 61, no. 4, pp. 72–77, Apr. 2023.
[14] P. Nepa and A. Buffi, “Near-field-focused microwave antennas: Near-field sha** and implementation,” IEEE Antennas Propag. Mag., vol. 59, no. 3, pp. 42–53, Jun. 2017.
[15] J. Cong, C. You, J. Li, L. Chen, B. Zheng, Y. Liu, W. Wu, Y. Gong, S. **, and R. Zhang, “Near-field integrated sensing and communication: Opportunities and challenges,” arXiv preprint arXiv:2310.01342, 2023.
[16] Y. Chen, Z. Ren, J. Xu, Y. Zeng, D. W. K. Ng, and S. Cui, “Integrated sensing, communication, and powering (ISCAP): Towards multi-functional 6G wireless networks,” arXiv preprint arXiv:2401.03516, 2024.
[17] Y. Liu, C. Ouyang, Z. Wang, J. Xu, X. Mu, and A. L. Swindlehurst, “Near-field communications: A comprehensive survey,” arXiv preprint arXiv:2401.05900, 2024.
[18] H. Zhang, N. Shlezinger, F. Guidi, D. Dardari, M. F. Imani, and Y. C. Eldar, “Near-field wireless power transfer for 6G internet of everything mobile networks: Opportunities and challenges,” IEEE Commun. Mag., vol. 60, no. 3, pp. 12–18, Mar. 2022.
[19] Y. Han, S. **, M. Matthaiou, T. Q. S. Quek, and C.-K. Wen, “Toward extra large-scale MIMO: New channel properties and low-cost designs,” IEEE Internet of Things J., vol. 10, no. 16, pp. 14 569–14 594, Aug. 2023.
[20] Z. Wu and L. Dai, “Multiple access for near-field communications: SDMA or LDMA?” IEEE J. Sel. Areas Commun., vol. 41, no. 6, pp. 1918–1935, Jun. 2023.
[21] K. Zhi, C. Pan, H. Ren, K. K. Chai, C.-X. Wang, R. Schober, and X. You, “Performance analysis and low-complexity design for XL-MIMO with near-field spatial non-stationarities,” IEEE J. Sel. Areas Commun., vol. 42, no. 6, pp. 1656–1672, Apr. 2024.
[22] Y. Li, S. Gong, H. Liu, C. Xing, N. Zhao, and X. Wang, “Near-field beamforming optimization for holographic XL-MIMO multiuser systems,” IEEE Trans. Commun., vol. 72, no. 4, pp. 2309–2323, Apr. 2024.
[23] Y. Zhang and C. You, “SWIPT in mixed near- and far-field channels: Joint beam scheduling and power allocation,” IEEE J. Sel. Areas Commun., Early Access, 2024.
[24] J. Chen, Y. Xiao, K. Liu, Y. Zhong, X. Lei, and M. Xiao, “Physical layer security for near-field communications via directional modulation,” IEEE Trans. Veh. Technol., Early Access, 2024, doi:10.1109/TVT.2024.3382324.
[25] H. Wang, Z. Xiao, and Y. Zeng, “Cramér-rao bounds for near-field sensing with extremely large-scale MIMO,” IEEE Trans. Signal Process, Jan. 2024.
[26] Y. Zhang, B. Di, H. Zhang, M. Dong, L. Yang, and L. Song, “Dual codebook design for intelligent omni-surface aided communications,” IEEE Trans. Wireless Commun., vol. 21, no. 11, pp. 9232–9245, May 2022.
[27] M. Cui and L. Dai, “Channel estimation for extremely large-scale MIMO: Far-field or near-field?” IEEE Trans. Commun., vol. 70, no. 4, pp. 2663–2677, Apr. 2022.
[28] Y. Zhang, X. Wu, and C. You, “Fast near-field beam training for extremely large-scale array,” IEEE Wireless Commun. Lett., vol. 11, no. 12, pp. 2625–2629, Dec. 2022.
[29] W. Liu, H. Ren, C. Pan, and J. Wang, “Deep learning based beam training for extremely large-scale massive MIMO in near-field domain,” IEEE Commun. Lett., vol. 27, no. 1, pp. 170–174, Jan. 2023.
[30] G. Jiang and C. Qi, “Near-field beam training based on deep learning for extremely large-scale MIMO,” IEEE Commun. Lett., vol. 27, no. 8, pp. 2063–2067, Aug. 2023.
[31] C. Wu, C. You, Y. Liu, L. Chen, and S. Shi, “Two-stage hierarchical beam training for near-field communications,” IEEE Trans. Veh. Technol., vol. 73, no. 2, pp. 2032–2044, Feb. 2024.
[32] X. Shi, J. Wang, Z. Sun, and J. Song, “Spatial-chirp codebook-based hierarchical beam training for extremely large-scale massive MIMO,” IEEE Trans. Wireless Commun., 2023, Early Access.
[33] Y. Lu, Z. Zhang, and L. Dai, “Hierarchical beam training for extremely large-scale MIMO: From far-field to near-field,” IEEE Trans. Commun., vol. 72, no. 4, pp. 2247–2259, Apr. 2024.
[34] S. Noh, M. D. Zoltowski, and D. J. Love, “Multi-resolution codebook based beamforming sequence design in millimeter-wave systems,” in Proc. IEEE Global Commun. Conf. (GLOBECOM), San Diego, CA, USA, 2015, pp. 1–6.
[35] C. Ouyang, Z. Wang, B. Zhao, X. Zhang, and Y. Liu, “On the impact of reactive region on the near-field channel gain,” arXiv preprint arXiv:2404.08343, 2024.
[36] H. Lu, Y. Zeng, C. You, Y. Han, J. Zhang, Z. Wang, Z. Dong, S. **, C.-X. Wang, T. Jiang et al., “A tutorial on near-field XL-MIMO communications towards 6G,” arXiv preprint arXiv:2310.11044, 2023.
[37] S. Zhang and R. Zhang, “Capacity characterization for intelligent reflecting surface aided MIMO communication,” IEEE J. Sel. Areas Commun., vol. 38, no. 8, pp. 1823–1838, Jun. 2020.
[38] W. Liu, C. Pan, H. Ren, F. Shu, S. **, and J. Wang, “Low-overhead beam training scheme for extremely large-scale RIS in near field,” IEEE Trans. Commun., vol. 71, no. 8, pp. 4924–4940, May 2023.
[39] Y. Zhang, C. You, L. Chen, and B. Zheng, “Mixed near- and far-field communications for extremely large-scale array: An interference perspective,” IEEE Commun. Lett., vol. 27, no. 9, pp. 2496–2500, Sept. 2023.
[40] A. F. Molisch, V. V. Ratnam, S. Han, Z. Li, S. L. H. Nguyen, L. Li, and K. Haneda, “Hybrid beamforming for massive MIMO: A survey,” IEEE Commun. Mag., vol. 55, no. 9, pp. 134–141, Sep. 2017.
[41] A. Kosasih and E. Björnson, “Finite beam depth analysis for large arrays,” arXiv preprint arXiv:2306.12367, 2023.
[42] H. Wang and Y. Zeng, “Can sparse arrays outperform collocated arrays for future wireless communications?” arXiv preprint arXiv:2307.07925, 2023.
[43] X. Wu, C. You, J. Li, and Y. Zhang, “Near-field beam training: Joint angle and range estimation with DFT codebook,” IEEE Trans. Wireless Commun., Early Access, 2024, doi:10.1109/TWC.2024.3385749.
[44] W. Liu, C. Pan, H. Ren, J. Wang, R. Schober, and L. Hanzo, “Near-field multiuser beam-training for extremely large-scale MIMO systems,” arXiv preprint arXiv:2402.13597, 2024.

		$\displaystyle\hat{f}\left(r_{0},\theta_{0};\theta\right)\overset{{(b_{1})}}{% \approx}\frac{1}{Q}\!\!\left\|\int_{-\frac{Q}{2}}^{\frac{Q}{2}}\!\exp{\left(% \jmath\pi(A_{1}q+A_{2}q^{2})\mathrm{d}q\right)}\right\|$		(43)
		$\displaystyle\leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ % \leavevmode\nobreak\ \leavevmode\nobreak\ \leavevmode\nobreak\ \overset{{(b_{2% })}}{=}\frac{1}{Q}\!\!\left\|\frac{1}{\sqrt{2A_{2}}}\int_{\sqrt{2A_{2}}(-\frac{% Q}{2}+\frac{A_{1}}{2A_{2}})}^{\sqrt{2A_{2}}(\frac{Q}{2}+\frac{A_{1}}{2A_{2}})}% \!e^{(\frac{\jmath\pi t^{2}}{2})}\mathrm{d}t\right\|$
		$\displaystyle=\!\!\left\|\!\!\frac{\int_{0}^{\sqrt{2A_{2}}(\frac{Q}{2}+\frac{A_% {1}}{2A_{2}})}\!\!e^{(\frac{\jmath\pi t^{2}}{2})}\!\mathrm{d}t\!\!-\!\!\int_{0% }^{\sqrt{2A_{2}}(-\frac{Q}{2}+\frac{A_{1}}{2A_{2}})}\!\!e^{(\frac{\jmath\pi t^% {2}}{2})}\mathrm{d}t}{\sqrt{2A_{2}}Q}\!\right\|,$