License: arXiv.org perpetual non-exclusive license
arXiv:2306.07008v3 [quant-ph] 19 Dec 2023

Quantum phase estimation by compressed sensing

Changhao Yi State Key Laboratory of Surface Physics and Department of Physics, Fudan University, Shanghai 200433, China Institute for Nanoelectronic Devices and Quantum Computing, Fudan University, Shanghai 200433, China    Cunlu Zhou    Jun Takahashi Center for Quantum Information and Control, Department of Physics and Astronomy, University of New Mexico, NM 87131, USA
Abstract

As a signal recovery algorithm, compressed sensing is particularly useful when the data has low-complexity and samples are rare, which matches perfectly with the task of quantum phase estimation (QPE). In this work we present a new Heisenberg-limited QPE algorithm for early fault-tolerant quantum computers based on compressed sensing. More specifically, given many copies of a proper initial state and queries to a specific unitary matrix, our algorithm is able to recover the phase with a total runtime 𝒪(ϵ1polylog(ϵ1))𝒪superscriptitalic-ϵ1polysuperscriptitalic-ϵ1\mathcal{O}(\epsilon^{-1}\text{poly}\log(\epsilon^{-1}))caligraphic_O ( italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT poly roman_log ( italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) ), where ϵitalic-ϵ\epsilonitalic_ϵ is the desired accuracy. Moreover, the maximal runtime satisfies Tmaxϵπmuch-less-thansubscript𝑇italic-ϵ𝜋T_{\max}\epsilon\ll\piitalic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT italic_ϵ ≪ italic_π, which is comparable to the state-of-the-art algorithms, and our algorithm is also robust against certain amount of noise from sampling and state preparation.

1 Introduction

Quantum phase estimation (QPE) [1] is one of the most useful subroutines in quantum computing and plays an important role in many promising quantum applications [2, 3, 4, 5]. Given a unitary matrix U𝑈Uitalic_U and one of its eigenvectors |ΦketΦ|\Phi\rangle| roman_Φ ⟩ such that U|Φ=ei2πθ|Φ𝑈ketΦsuperscript𝑒𝑖2𝜋𝜃ketΦU|\Phi\rangle=e^{i2\pi\theta}|\Phi\rangleitalic_U | roman_Φ ⟩ = italic_e start_POSTSUPERSCRIPT italic_i 2 italic_π italic_θ end_POSTSUPERSCRIPT | roman_Φ ⟩, the task of QPE is to estimate phase θ𝜃\thetaitalic_θ with high probability within a given accuracy guarantee. The problem of estimating multiple phases of U𝑈Uitalic_U has been referred to the quantum eigenvalue estimation problem (QEEP) [5, 6, 7, 8, 9, 10]. When we set the unitary matrix U𝑈Uitalic_U as the evolution operator under a Hamiltonian H𝐻Hitalic_H, the task of QPE is equivalent to estimating a specific energy level E0subscript𝐸0E_{0}italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT with accuracy ϵitalic-ϵ\epsilonitalic_ϵ [7, 8].

While fully fault-tolerant quantum computers may still be years away from realization, early fault-tolerant quantum computers with limited number of logical qubits and limited circuit depth are expected to be realized much sooner and to solve nontrivial tasks that demonstrate practical quantum advantages. Given the crucial role of QPE in many of such tasks, it becomes imperative to design QPE algorithms specifically tailored for early fault-tolerant quantum computers. The standard textbook QPE algorithm [11] does not require an exact eigenstate as the initial state and takes only one measurement, but it uses a large amount of ancilla qubits and controlled operations, which is fairly demanding in experiment. Although Kitaev’s original iterative QPE algorithm [1] only uses one ancilla qubit and one controlled operation (see Fig. 1), it requires the initial state to be an exact eigenstate which can be a difficult task by itself. Therefore, both of them are not suitable for early fault-tolerant quantum computers.

Much of the recent work [7, 12, 13] in QPE for early fault-tolerant quantum computers has focused on designing better protocols to improve various aspects of Kitaev’s original QPE algorithm. More specifically, the following properties are desired in designing such algorithms:

  • The quantum circuit should be simple, using at most one ancilla qubit and one controlled operation.

  • The initial state is not necessarily an exact eigenstate of U𝑈Uitalic_U.

  • The total runtime achieves the Heisenberg limit, i.e., the total cost should be
    𝒪(ϵ1polylog(ϵ1δ1))𝒪superscriptitalic-ϵ1polysuperscriptitalic-ϵ1superscript𝛿1\mathcal{O}(\epsilon^{-1}\operatorname{poly}\log(\epsilon^{-1}\delta^{-1}))caligraphic_O ( italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_poly roman_log ( italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_δ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) ) for estimating the phase θ𝜃\thetaitalic_θ to accuracy ϵitalic-ϵ\epsilonitalic_ϵ with probability 1δ1𝛿1-\delta1 - italic_δ.

  • When the overlap of the initial state and the targeted eigenstate is large, the maximal runtime Tmaxsubscript𝑇T_{\max}italic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT (hence the maximum circuit depth) can be much smaller than π/ϵ𝜋italic-ϵ\pi/\epsilonitalic_π / italic_ϵ.

Refer to caption
Figure 1: The one-ancilla quantum circuit used in Kitaev-type QPE algorithms. The measurement is done in Z𝑍Zitalic_Z basis. In terms of the measurement outcome, we regard the |0ket0|0\rangle| 0 ⟩ state as obtaining value +11+1+ 1, and the |1ket1|1\rangle| 1 ⟩ state as obtaining value 11-1- 1. 𝐇𝐇\mathbf{H}bold_H is the Hadamard gate. 𝐖𝐖\mathbf{W}bold_W has two choices: when 𝐖=I𝐖𝐼\mathbf{W}=Ibold_W = italic_I, the measurement outcome is ±1plus-or-minus1\pm 1± 1 with probability (1±Re(Φ|U(t)|Φ))/2plus-or-minus1Requantum-operator-productΦ𝑈𝑡Φ2(1\pm\text{Re}(\langle\Phi|U(t)|\Phi\rangle))/2( 1 ± Re ( ⟨ roman_Φ | italic_U ( italic_t ) | roman_Φ ⟩ ) ) / 2 respectively. When 𝐖=S𝐖superscript𝑆\mathbf{W}=S^{\dagger}bold_W = italic_S start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT, the complex conjugation of the phase gate, the measurement outcome is ±1plus-or-minus1\pm 1± 1 with probability (1±Im(Φ|U(t)|Φ))/2plus-or-minus1Imquantum-operator-productΦ𝑈𝑡Φ2(1\pm\text{Im}(\langle\Phi|U(t)|\Phi\rangle))/2( 1 ± Im ( ⟨ roman_Φ | italic_U ( italic_t ) | roman_Φ ⟩ ) ) / 2 instead. After taking the average over many test outcomes, we obtain an estimate of the true signal Φ|U(t)|Φquantum-operator-productΦ𝑈𝑡Φ\langle\Phi|U(t)|\Phi\rangle⟨ roman_Φ | italic_U ( italic_t ) | roman_Φ ⟩.

The procedure of these algorithms can be separated into the quantum part and the classical part. Usually, the quantum part is a combination of Hamiltonian simulation and the Hadamard test (see Fig. 1). Hamiltonian simulation algorithms [14] are used to prepare the evolution operator U(t)𝑈𝑡U(t)italic_U ( italic_t ). Longer evolution time requires more quantum gates, and the best known circuit complexity for running U(t)𝑈𝑡U(t)italic_U ( italic_t ) without ancilla qubits is almost linear in H|t|norm𝐻𝑡\|H\|\cdot|t|∥ italic_H ∥ ⋅ | italic_t | [15, 16]. The Hadamard test produces information about U(t)𝑈𝑡U(t)italic_U ( italic_t ) in the form of a complex signal. Specifically, given an initial state |ΦketΦ|\Phi\rangle| roman_Φ ⟩, Φ|U(t)|Φquantum-operator-productΦ𝑈𝑡Φ\langle\Phi|U(t)|\Phi\rangle⟨ roman_Φ | italic_U ( italic_t ) | roman_Φ ⟩ can be estimated with many runs of Hadamard tests. For QPE, the signal Φ|U(t)|Φquantum-operator-productΦ𝑈𝑡Φ\langle\Phi|U(t)|\Phi\rangle⟨ roman_Φ | italic_U ( italic_t ) | roman_Φ ⟩ is dominated by a single complex sinusoidal function (i.e., f(t)=Aeikt𝑓𝑡𝐴superscript𝑒𝑖𝑘𝑡f(t)=Ae^{ikt}italic_f ( italic_t ) = italic_A italic_e start_POSTSUPERSCRIPT italic_i italic_k italic_t end_POSTSUPERSCRIPT), and the objective is to estimate its frequency k𝑘kitalic_k. For QEEP, the signal is regarded as a linear combination of multiple complex sinusoidal functions (i.e., f(t)=nAneiknt𝑓𝑡subscript𝑛subscript𝐴𝑛superscript𝑒𝑖subscript𝑘𝑛𝑡f(t)=\sum_{n}A_{n}e^{ik_{n}t}italic_f ( italic_t ) = ∑ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_i italic_k start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_t end_POSTSUPERSCRIPT)). The target signal f(t)𝑓𝑡f(t)italic_f ( italic_t ) is said to have length-N𝑁Nitalic_N and sparsity-K𝐾Kitalic_K if t𝑡titalic_t takes value from integers in [1,N]1𝑁[1,N][ 1 , italic_N ] and contains K𝐾Kitalic_K distinct frequencies {kn}subscript𝑘𝑛\{k_{n}\}{ italic_k start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT }. For QEEP, K𝐾Kitalic_K is usually assumed to be much smaller than N𝑁Nitalic_N (for QPE, K=1𝐾1K=1italic_K = 1). The classical aspect of QPE and QEEP involves estimating frequencies from these statistically sampled sparse signals, a process akin to the objectives of sparse Fourier transformation algorithms [17].

There are several aspects of evaluating the performance of a sparse Fourier transformation algorithm. The runtime complexity, the sample complexity, and the resolution are all important ingredients to consider. Here the runtime complexity quantifies how long the algorithm takes, and the sample complexity measures the number of time-domain signal samples required in the algorithm. For example, the Fast Fourier Transformation algorithm [18] has runtime complexity 𝒪(NlogN)𝒪𝑁𝑁\mathcal{O}(N\log N)caligraphic_O ( italic_N roman_log italic_N ) with sample complexity 𝒪(N)𝒪𝑁\mathcal{O}(N)caligraphic_O ( italic_N ). So far, the best runtime complexity is 𝒪(Klogc(N)log(N/K))𝒪𝐾superscript𝑐𝑁𝑁𝐾\mathcal{O}(K\log^{c}(N)\log(N/K))caligraphic_O ( italic_K roman_log start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ( italic_N ) roman_log ( italic_N / italic_K ) ) with c>2𝑐2c>2italic_c > 2 [19], and the most sample-efficient algorithm requires only 𝒪(KlogKlogN)𝒪𝐾𝐾𝑁\mathcal{O}(K\log K\log N)caligraphic_O ( italic_K roman_log italic_K roman_log italic_N ) samples [20]. In practical scenarios, we most likely have noisy data, necessitating the need for algorithmic robustness. Classical algorithms such as Multiple Signal Classification (MUSIC) [21] and Estimation of Signal Parameters via Rational Invariance Techniques (ESPRIT) [22] are two examples of the robust signal processing algorithms. Given the unique characteristics of our quantum setting, we prioritize the sample complexity, resolution, and robustness of an algorithm.

Many of these classical signal processing algorithms have been used in the task of QEEP. To the best of our knowledge, [6] was the first attempt to solve QEEP with Hadamard tests, where QEEP was treated as a time-series analysis problem. Later, [7] emphasized the importance of the Heisenberg-limited scaling, and listed a few other requirements for the post-processing algorithm. By applying the Fourier-filter function techniques they succeeded in designing the first Heisenberg-limited QPE algorithm for early fault-tolerant quantum computers. Their algorithm was further improved by other work [5, 8], where the maximal evolution time was largely reduced. Two recent QPE algorithms [12, 13], inspired by Robust Phase Estimation [23, 24, 25], were proposed. Both algorithms have a hierarchy structure, where the unit evolution times are taken from {τ0,2τ0,,2J1τ0}subscript𝜏02subscript𝜏0superscript2𝐽1subscript𝜏0\{\tau_{0},2\tau_{0},\cdots,2^{J-1}\tau_{0}\}{ italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , ⋯ , 2 start_POSTSUPERSCRIPT italic_J - 1 end_POSTSUPERSCRIPT italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }. These recent algorithms improved the relation between Tmaxsubscript𝑇T_{\max}italic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, the initial overlap p0subscript𝑝0p_{0}italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and the final accuracy ϵitalic-ϵ\epsilonitalic_ϵ. When the overlap p0subscript𝑝0p_{0}italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is large, the work of [12] reduces the prefactor β𝛽\betaitalic_β in the maximum runtime scaling Tmax=β/ϵsubscript𝑇𝛽italic-ϵT_{\max}=\beta/\epsilonitalic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = italic_β / italic_ϵ by using a subroutine called the quantum complex exponential least squares (QCELS). In contrast to [7] in which the prefactor β𝛽\betaitalic_β is at least π𝜋\piitalic_π, the prefactor in [12] can be arbitrarily close to 00 as p01subscript𝑝01p_{0}\to 1italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT → 1. The more recent work [13] further improved the requirement on p0subscript𝑝0p_{0}italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. In [26] and [27], the last two QPE algorithms have been extended to the QEEP set up.

In this work we present a new algorithmic framework of solving both QPE and QEEP using compressed sensing [28, 29, 30], which has not been carefully considered as far as we know. Compressed sensing is a prominent algorithm for signal recovery with wide applications in various domains such as time-frequency analysis, image processing, and quantum state tomography [31, 32, 33]. The framework of compressed sensing assumes the sparsity of the signal and recovers the entire signal from a few samples by solving a linear/semidefinite programming (SDP) problem. Here, the sparsity refers to the number of sinusoidal functions in the signal being small. The small number of required samples and the robustness makes compressed sensing an appealing choice for designing QPE algorithm.

Our main contribution is a simple and robust classical post-processing algorithm for QPE based on compressed sensing. In QPE, we regard both the uncertainty from the Hadamard tests and the inaccuracy of initial state preparation as noise. To extract target frequency f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT (which has a linear relationship with the target energy E0subscript𝐸0E_{0}italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT) in the presence of noise, our main idea is to use the robust recovery property of convex relaxation algorithm [29], a modified version of the vanilla compressed sensing. For signal vectors with size N𝑁Nitalic_N, when the frequency f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is nearly on-grid (f0k0/N,k0formulae-sequencesubscript𝑓0subscript𝑘0𝑁subscript𝑘0f_{0}\approx k_{0}/N,k_{0}\in\mathbb{Z}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≈ italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_N , italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ blackboard_Z) and the noise for each sample is bounded by a constant, the convex relaxation algorithm can recover f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT with only 𝒪(logN)𝒪𝑁\mathcal{O}(\log N)caligraphic_O ( roman_log italic_N ) samples, which satisfies the Heisenberg limit. With no prior knowledge about f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT (i.e., f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT could be off-grid), our algorithm can still find a grid shift parameter ν𝜈\nuitalic_ν such that after shifting the signal by f0f0νsubscript𝑓0subscript𝑓0𝜈f_{0}\to f_{0}-\nuitalic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT → italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_ν, the new signal becomes nearly on-grid, and the convex relaxation algorithm can still be applied. By searching the optimal grid shift parameter in a finite set, eventually the accuracy of f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is βN1𝛽superscript𝑁1\beta N^{-1}italic_β italic_N start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, where β𝛽\betaitalic_β is a constant related to the noise and the interval between the grid shift parameters. In terms of the maximum runtime Tmaxsubscript𝑇T_{\max}italic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, since the samples of the compressed sensing algorithm are integers in [1,N]1𝑁[1,N][ 1 , italic_N ], Tmaxsubscript𝑇T_{\max}italic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT scales linearly in N𝑁Nitalic_N, and Ttotalsubscript𝑇totalT_{\text{total}}italic_T start_POSTSUBSCRIPT total end_POSTSUBSCRIPT is 𝒪(NlogN)𝒪𝑁𝑁\mathcal{O}(N\log N)caligraphic_O ( italic_N roman_log italic_N ).

The rest of the paper is organized as follows. We start with preliminaries about QEEP and compressed sensing in Sec. 2. We then introduce our QPE algorithm based on compressed sensing in Sec. 3 and prove several analytical results including its Heisenberg-limit scaling. In Sec. 4 we numerically test the performance of our algorithm and compare it to previous works. Finally, we summarize several open problems and potential future research directions in Sec. 5.

2 Preliminaries

The notations frequently used in the main text are summarized in Table 1.

Table 1: Notations used in the main text
Notation Meaning
y(t)𝑦𝑡y(t)italic_y ( italic_t ) sampled time-domain signal
y0(t)superscript𝑦0𝑡y^{0}(t)italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ) ideal time-domain signal
z(t)𝑧𝑡z(t)italic_z ( italic_t ) noise
x(k)𝑥𝑘x(k)italic_x ( italic_k ) ideal frequency-domain signal
𝒯𝒯\mathcal{T}caligraphic_T set of evolution times
r𝑟ritalic_r sampling ratio
ΩΩ\Omegaroman_Ω samples in compressed sensing
τ𝜏\tauitalic_τ unit time step
ϵitalic-ϵ\epsilonitalic_ϵ accuracy on energy level
δ𝛿\deltaitalic_δ failure probability
η𝜂\etaitalic_η noise tolerance for each signal

2.1 QEEP as a signal recovery problem

In this section, we express the QEEP as a sparse signal recovery problem. Given a specific Hamiltonian with spectrum decomposition H=EP𝐻subscriptsubscript𝐸subscript𝑃H=\sum_{\ell}E_{\ell}P_{\ell}italic_H = ∑ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT, where {E}subscript𝐸\{E_{\ell}\}{ italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT } are energy levels and {P}subscript𝑃\{P_{\ell}\}{ italic_P start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT } are projectors onto the corresponding eigenstates, and an initial state |ΦketΦ|\Phi\rangle| roman_Φ ⟩, the time-domain signal in QEEP is given as

y0(t)=Φ|eiHt|Φ=Φ|P|ΦeiEt.superscript𝑦0𝑡quantum-operator-productΦsuperscript𝑒𝑖𝐻𝑡Φsubscriptquantum-operator-productΦsubscript𝑃Φsuperscript𝑒𝑖subscript𝐸𝑡y^{0}(t)=\langle\Phi|e^{-iHt}|\Phi\rangle=\sum_{\ell}\langle\Phi|P_{\ell}|\Phi% \rangle e^{-iE_{\ell}t}.italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ) = ⟨ roman_Φ | italic_e start_POSTSUPERSCRIPT - italic_i italic_H italic_t end_POSTSUPERSCRIPT | roman_Φ ⟩ = ∑ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⟨ roman_Φ | italic_P start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | roman_Φ ⟩ italic_e start_POSTSUPERSCRIPT - italic_i italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_t end_POSTSUPERSCRIPT . (1)

Denote the set of energy levels that have non-zero overlaps with |ΦketΦ|\Phi\rangle| roman_Φ ⟩ as the target set

Ξ={E:Φ|P|Φ>0}.Ξconditional-setsubscript𝐸quantum-operator-productΦsubscript𝑃Φ0\Xi=\{E_{\ell}:\langle\Phi|P_{\ell}|\Phi\rangle>0\}.roman_Ξ = { italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT : ⟨ roman_Φ | italic_P start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT | roman_Φ ⟩ > 0 } . (2)

Each element in ΞΞ\Xiroman_Ξ corresponds to a sinusoidal function whose frequency is determined by Esubscript𝐸E_{\ell}italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT. We assume |Ξ|Ξ|\Xi|| roman_Ξ | is small so that y0(t)superscript𝑦0𝑡y^{0}(t)italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ) can be regarded as a sparse signal. More specifically, in compressed sensing algorithms, sparsity usually means that |Ξ|=𝒪(log(N))Ξ𝒪𝑁|\Xi|=\mathcal{O}(\log(N))| roman_Ξ | = caligraphic_O ( roman_log ( italic_N ) ) [28], where N𝑁Nitalic_N is the size of the discrete signals.

The task of a QEEP algorithm is equivalent to estimating ΞΞ\Xiroman_Ξ within certain accuracy level using the data obtained from the Hadamrd tests. Assuming the output of the QEEP algorithm is ΞsuperscriptΞ\Xi^{\ast}roman_Ξ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, we require |Ξ|=|Ξ|ΞsuperscriptΞ|\Xi|=|\Xi^{\ast}|| roman_Ξ | = | roman_Ξ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT |, and

EΞ,minEΞ|EE|ϵ.formulae-sequencefor-all𝐸Ξsubscriptsuperscript𝐸superscriptΞ𝐸superscript𝐸italic-ϵ\forall E\in\Xi,\quad\min_{E^{\ast}\in\Xi^{\ast}}|E-E^{\ast}|\leq\epsilon.∀ italic_E ∈ roman_Ξ , roman_min start_POSTSUBSCRIPT italic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∈ roman_Ξ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | italic_E - italic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | ≤ italic_ϵ . (3)

We have the freedom to choose the set of evolution times to use in the algorithm. Denote the set of evolution times as 𝒯𝒯\mathcal{T}caligraphic_T. For each time t𝒯𝑡𝒯t\in\mathcal{T}italic_t ∈ caligraphic_T, y0(t)superscript𝑦0𝑡y^{0}(t)italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ) can be obtained from averaging over the Hadamard tests. More precisely, when 𝐖=I𝐖𝐼\mathbf{W}=Ibold_W = italic_I, the measurement outcome in Fig. 1 is a random variable

hx(t)={+1,p=12(1+Re(y0(t))),1,p=12(1Re(y0(t))).subscript𝑥𝑡cases1𝑝121Resuperscript𝑦0𝑡𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒1𝑝121Resuperscript𝑦0𝑡𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒h_{x}(t)=\begin{cases}+1,\quad p=\frac{1}{2}(1+\text{Re}(y^{0}(t))),\\ -1,\quad p=\frac{1}{2}(1-\text{Re}(y^{0}(t))).\\ \end{cases}italic_h start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_t ) = { start_ROW start_CELL + 1 , italic_p = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( 1 + Re ( italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ) ) ) , end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL - 1 , italic_p = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( 1 - Re ( italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ) ) ) . end_CELL start_CELL end_CELL end_ROW (4)

Similarly, when 𝐖=S𝐖superscript𝑆\mathbf{W}=S^{\dagger}bold_W = italic_S start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT, we obtain

hy(t)={+1,p=12(1+Im(y0(t))),1,p=12(1Im(y0(t))).subscript𝑦𝑡cases1𝑝121Imsuperscript𝑦0𝑡𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒1𝑝121Imsuperscript𝑦0𝑡𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒h_{y}(t)=\begin{cases}+1,\quad p=\frac{1}{2}(1+\text{Im}(y^{0}(t))),\\ -1,\quad p=\frac{1}{2}(1-\text{Im}(y^{0}(t))).\\ \end{cases}italic_h start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ( italic_t ) = { start_ROW start_CELL + 1 , italic_p = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( 1 + Im ( italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ) ) ) , end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL - 1 , italic_p = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( 1 - Im ( italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ) ) ) . end_CELL start_CELL end_CELL end_ROW (5)

The summation of the two gives the estimate of y0(t)superscript𝑦0𝑡y^{0}(t)italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ):

E[h(t)]=E[hx(t)+ihy(t)]=y0(t).𝐸delimited-[]𝑡𝐸delimited-[]subscript𝑥𝑡𝑖subscript𝑦𝑡superscript𝑦0𝑡E[h(t)]=E[h_{x}(t)+ih_{y}(t)]=y^{0}(t).italic_E [ italic_h ( italic_t ) ] = italic_E [ italic_h start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT ( italic_t ) + italic_i italic_h start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ( italic_t ) ] = italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ) . (6)

After sampling the random variable h(t)𝑡h(t)italic_h ( italic_t ) for M𝑀Mitalic_M times, we obtain a noisy signal:

y(t)=h(t)¯=y0(t)+z(t),𝑦𝑡¯𝑡superscript𝑦0𝑡𝑧𝑡y(t)=\overline{h(t)}=y^{0}(t)+z(t),italic_y ( italic_t ) = over¯ start_ARG italic_h ( italic_t ) end_ARG = italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_t ) + italic_z ( italic_t ) , (7)

then we use the noisy samples {(t,y(t)),t𝒯}𝑡𝑦𝑡𝑡𝒯\{(t,y(t)),t\in\mathcal{T}\}{ ( italic_t , italic_y ( italic_t ) ) , italic_t ∈ caligraphic_T } to recover ΞΞ\Xiroman_Ξ. Here z(t)𝑧𝑡z(t)italic_z ( italic_t ) originates from the statistical uncertainty of the Hadamard tests. Hoeffding’s inequality ensures that with probability 1δ1superscript𝛿1-\delta^{\prime}1 - italic_δ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, we have

|z(t)|=𝒪(1Mlog1δ).𝑧𝑡𝒪1𝑀1superscript𝛿|z(t)|{=}\mathcal{O}\left(\sqrt{\frac{1}{M}\log\frac{1}{\delta^{\prime}}}% \right).| italic_z ( italic_t ) | = caligraphic_O ( square-root start_ARG divide start_ARG 1 end_ARG start_ARG italic_M end_ARG roman_log divide start_ARG 1 end_ARG start_ARG italic_δ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG end_ARG ) . (8)

In the rest of the paper, the meanings of z(t)𝑧𝑡z(t)italic_z ( italic_t ) are not identical, but they always represent the part of the signal that should be considered as noise. Introduce the noise tolerance parameter η𝜂\etaitalic_η such that the signal recovery algorithm can recover accurate estimates for {E}subscript𝐸\{E_{\ell}\}{ italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT } as long as the noise of each signal is not larger than η𝜂\etaitalic_η. Thus to guarantee |z(t)|<η𝑧𝑡𝜂|z(t)|<\eta| italic_z ( italic_t ) | < italic_η for all t𝒯𝑡𝒯t\in\mathcal{T}italic_t ∈ caligraphic_T with probability 1δ1𝛿1-\delta1 - italic_δ, we require δ𝒪(δ|𝒯|1)𝛿𝒪superscript𝛿superscript𝒯1\delta\leq\mathcal{O}(\delta^{\prime}|\mathcal{T}|^{-1})italic_δ ≤ caligraphic_O ( italic_δ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | caligraphic_T | start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ), so M𝑀Mitalic_M would be proportional to 𝒪(log(|𝒯|/δ)/η2)𝒪𝒯𝛿superscript𝜂2\mathcal{O}(\log(|\mathcal{T}|/\delta)/\eta^{2})caligraphic_O ( roman_log ( | caligraphic_T | / italic_δ ) / italic_η start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). For a rigorous proof of this, see Appendix A of [12].

The total experiment cost can be captured by the total runtime Ttotalsubscript𝑇totalT_{\text{total}}italic_T start_POSTSUBSCRIPT total end_POSTSUBSCRIPT, which reflects the total circuit depth for completing the entire algorithm. In Hamiltonian simulation, the circuit complexity of constructing operator eiHtsuperscript𝑒𝑖𝐻𝑡e^{-iHt}italic_e start_POSTSUPERSCRIPT - italic_i italic_H italic_t end_POSTSUPERSCRIPT scales linearly with H|t|norm𝐻𝑡\|H\|\cdot|t|∥ italic_H ∥ ⋅ | italic_t | [15] (t𝑡titalic_t can be negative). Thus, in this set up, the total runtime is

Ttotal=t𝒯M×|t|=𝒪(log(|𝒯|/δ)η2t|𝒯||t|).subscript𝑇totalsubscript𝑡𝒯𝑀𝑡𝒪𝒯𝛿superscript𝜂2subscript𝑡𝒯𝑡T_{\text{total}}=\sum_{t\in\mathcal{T}}M\times|t|=\mathcal{O}\left(\log(|% \mathcal{T}|/\delta)\cdot\eta^{-2}\cdot\sum_{t\in|\mathcal{T}|}|t|\right).italic_T start_POSTSUBSCRIPT total end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_t ∈ caligraphic_T end_POSTSUBSCRIPT italic_M × | italic_t | = caligraphic_O ( roman_log ( | caligraphic_T | / italic_δ ) ⋅ italic_η start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT ⋅ ∑ start_POSTSUBSCRIPT italic_t ∈ | caligraphic_T | end_POSTSUBSCRIPT | italic_t | ) . (9)

For instance, if the signal recovery algorithm has parameters η=𝒪(1),maxt𝒯|t|=𝒪(ϵ1)formulae-sequence𝜂𝒪1subscript𝑡𝒯𝑡𝒪superscriptitalic-ϵ1\eta=\mathcal{O}(1),\max_{t\in\mathcal{T}}|t|=\mathcal{O}(\epsilon^{-1})italic_η = caligraphic_O ( 1 ) , roman_max start_POSTSUBSCRIPT italic_t ∈ caligraphic_T end_POSTSUBSCRIPT | italic_t | = caligraphic_O ( italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ), and |𝒯|=𝒪(polylog(ϵ1))𝒯𝒪polysuperscriptitalic-ϵ1|\mathcal{T}|=\mathcal{O}(\text{poly}\log(\epsilon^{-1}))| caligraphic_T | = caligraphic_O ( poly roman_log ( italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) ), then it achieves the Heisenberg limit. We will see that the algorithm we have using compressed sensing fits this description. The maximal runtime Tmax=maxt𝒯|t|subscript𝑇subscript𝑡𝒯𝑡T_{\max}=\max_{t\in\mathcal{T}}|t|italic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = roman_max start_POSTSUBSCRIPT italic_t ∈ caligraphic_T end_POSTSUBSCRIPT | italic_t |, which reflects the maximum circuit depth, is particularly important for early fault-tolerant quantum computers.

When ΞΞ\Xiroman_Ξ contains only one energy level, the task becomes QPE. In QPE, without loss of generality111In this work we don’t consider the hardness of the preparation of the initial state. From the point view of phase estimation, there’s nothing special about the ground state energy compared to other eigenvalues as long as one can prepare an initial state that is close enough to the target eigenstate., we will be mainly discussing the estimation of the ground energy E0subscript𝐸0E_{0}italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, i.e., the smallest eigenvalue of H𝐻Hitalic_H. In general, we do not expect to be able to prepare the exact ground state but assume that the initial state has a large overlap with the ground state:

|Φ=1γ|Ψ+γ|Ψ.ketΦ1𝛾ketΨ𝛾ketsuperscriptΨperpendicular-to|\Phi\rangle=\sqrt{1-\gamma}|\Psi\rangle+\sqrt{\gamma}|\Psi^{\perp}\rangle.| roman_Φ ⟩ = square-root start_ARG 1 - italic_γ end_ARG | roman_Ψ ⟩ + square-root start_ARG italic_γ end_ARG | roman_Ψ start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT ⟩ . (10)

Note that γ𝛾\gammaitalic_γ equals to 1p01subscript𝑝0\sqrt{1-p_{0}}square-root start_ARG 1 - italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG in the Introduction. As long as γ𝛾\gammaitalic_γ is small enough, the signal is still dominated by eiE0tsuperscript𝑒𝑖subscript𝐸0𝑡e^{-iE_{0}t}italic_e start_POSTSUPERSCRIPT - italic_i italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_t end_POSTSUPERSCRIPT, and the ground energy E0subscript𝐸0E_{0}italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT can be estimated efficiently.

2.2 Compressed sensing for signal recovery

In this section we introduce some necessary concepts and notations for compressed sensing. Given a vector v=[v1,v2,,vN]𝑣superscriptsubscript𝑣1subscript𝑣2subscript𝑣𝑁topv=[v_{1},v_{2},\cdots,v_{N}]^{\top}italic_v = [ italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_v start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, its 1111-norm, 2222-norm and \infty norm are defined as

v1=n=1N|vn|,v2=(n=1N|vn|2)1/2,v=maxn|vn|.formulae-sequencesubscriptnorm𝑣1superscriptsubscript𝑛1𝑁subscript𝑣𝑛formulae-sequencesubscriptnorm𝑣2superscriptsuperscriptsubscript𝑛1𝑁superscriptsubscript𝑣𝑛212subscriptnorm𝑣subscript𝑛subscript𝑣𝑛\|v\|_{1}=\sum_{n=1}^{N}|v_{n}|,\quad\|v\|_{2}=\left(\sum_{n=1}^{N}|v_{n}|^{2}% \right)^{1/2},\quad\|v\|_{\infty}=\max_{n}|v_{n}|.∥ italic_v ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT | italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | , ∥ italic_v ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ( ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT | italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT , ∥ italic_v ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT = roman_max start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | . (11)

In the following paragraph, we use k𝑘kitalic_k to label the indices of entries in frequency domain, and use n𝑛nitalic_n to label the indices of entries in time domain. Denote the set of integers from 1 to N𝑁Nitalic_N as [N]delimited-[]𝑁[N][ italic_N ]. In regular compressed sensing, we deal with a time-domain discrete signal y0superscript𝑦0y^{0}italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT in the form of

yn0=fcfei2πfn,n[N]formulae-sequencesubscriptsuperscript𝑦0𝑛subscript𝑓subscript𝑐𝑓superscript𝑒𝑖2𝜋𝑓𝑛𝑛delimited-[]𝑁y^{0}_{n}=\sum_{f\in\mathcal{F}}c_{f}e^{-i2\pi fn},\quad n\in[N]italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_f italic_n end_POSTSUPERSCRIPT , italic_n ∈ [ italic_N ] (12)

where cf[0,1],fcf=1,f[0,1)formulae-sequencesubscript𝑐𝑓01formulae-sequencesubscript𝑓subscript𝑐𝑓1𝑓01c_{f}\in[0,1],\sum_{f}c_{f}=1,f\in[0,1)italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∈ [ 0 , 1 ] , ∑ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = 1 , italic_f ∈ [ 0 , 1 ), and \mathcal{F}caligraphic_F is the set of frequency support. The time-domain signal can thus be written as an N𝑁Nitalic_N-dimensional vector

y0=[y10,y20,,yN0].superscript𝑦0superscriptsubscriptsuperscript𝑦01subscriptsuperscript𝑦02subscriptsuperscript𝑦0𝑁topy^{0}=[y^{0}_{1},y^{0}_{2},\cdots,y^{0}_{N}]^{\top}.italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = [ italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT . (13)

Define the Fourier matrix as Fkn=ei2πkn/N,k,n[N]formulae-sequencesubscript𝐹𝑘𝑛superscript𝑒𝑖2𝜋𝑘𝑛𝑁𝑘𝑛delimited-[]𝑁F_{kn}=e^{-i2\pi kn/N},\ k,n\in[N]italic_F start_POSTSUBSCRIPT italic_k italic_n end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_k italic_n / italic_N end_POSTSUPERSCRIPT , italic_k , italic_n ∈ [ italic_N ]. By labeling its columns with {wn,n[N]}subscript𝑤𝑛𝑛delimited-[]𝑁\{w_{n},n\in[N]\}{ italic_w start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ∈ [ italic_N ] }, we have

F=[w1,w2,,wN].𝐹subscript𝑤1subscript𝑤2subscript𝑤𝑁F=[w_{1},\ w_{2},\ \cdots,\ w_{N}].italic_F = [ italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_w start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] . (14)

Throughout the paper, if the frequency f𝑓fitalic_f satisfies

k[N],f=k/N,formulae-sequence𝑘delimited-[]𝑁𝑓𝑘𝑁\exists k\in[N],\quad f=k/N,∃ italic_k ∈ [ italic_N ] , italic_f = italic_k / italic_N , (15)

then we say f𝑓fitalic_f is on-grid, otherwise it is off-grid. For a frequency f[0,1)𝑓01f\in[0,1)italic_f ∈ [ 0 , 1 ), we define its off-grid deviation as

ν=fkf/N, where kf=argmink|fk/N|.formulae-sequence𝜈𝑓subscript𝑘𝑓𝑁 where subscript𝑘𝑓subscript𝑘𝑓𝑘𝑁\nu=f-k_{f}/N,\text{ where }k_{f}=\arg\min_{k}|f-k/N|.italic_ν = italic_f - italic_k start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT / italic_N , where italic_k start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | italic_f - italic_k / italic_N | . (16)

If all f𝑓f\in\mathcal{F}italic_f ∈ caligraphic_F are on-grid satisfying Eq. (15), the frequency-domain signal x𝑥xitalic_x can be written in the form of a vector as

x=1NFy0,xk=fcfδ(kNf).formulae-sequence𝑥1𝑁superscript𝐹superscript𝑦0subscript𝑥𝑘subscript𝑓subscript𝑐𝑓𝛿𝑘𝑁𝑓x=\frac{1}{N}F^{\dagger}y^{0},\quad x_{k}=\sum_{f\in\mathcal{F}}c_{f}{\delta% \left(k-Nf\right)}.italic_x = divide start_ARG 1 end_ARG start_ARG italic_N end_ARG italic_F start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT , italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_δ ( italic_k - italic_N italic_f ) . (17)

It is a sparse vector with |||\mathcal{F}|| caligraphic_F | non-zero entries. The purpose of compressed sensing is then to recover x𝑥xitalic_x from a few noisy samples of {yt0}subscriptsuperscript𝑦0𝑡\{y^{0}_{t}\}{ italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT }.

The algorithm is accomplished in the following scenario. Choose a sampling ratio r𝑟ritalic_r, and assign each integer n𝑛nitalic_n in [N]delimited-[]𝑁[N][ italic_N ] a random variable 1nsubscript1𝑛1_{n}1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT that satisfies

P(1n=1)=r,P(1n=0)=1r.formulae-sequence𝑃subscript1𝑛1𝑟𝑃subscript1𝑛01𝑟P(1_{n}=1)=r,\quad P(1_{n}=0)=1-r.italic_P ( 1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 1 ) = italic_r , italic_P ( 1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 0 ) = 1 - italic_r . (18)

Draw one sample from each 1n,n[N]subscript1𝑛𝑛delimited-[]𝑁1_{n},n\in[N]1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_n ∈ [ italic_N ], and denote the set of integers with 1n=1subscript1𝑛11_{n}=11 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 1 as the sample set ΩΩ\Omegaroman_Ω. The size of the sample set concentrates around Nr𝑁𝑟Nritalic_N italic_r. Based on the choice of ΩΩ\Omegaroman_Ω, we define the partial inverse Fourier transformation and the signal samples as

FΩ=[wn1,wn2,wn|Ω|],subscript𝐹Ωsubscript𝑤subscript𝑛1subscript𝑤subscript𝑛2subscript𝑤subscript𝑛Ω\displaystyle F_{\Omega}=[w_{n_{1}},\ w_{n_{2}},\ \cdots\ w_{n_{|\Omega|}}],italic_F start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT = [ italic_w start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_w start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ⋯ italic_w start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT | roman_Ω | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] , (19)
yΩ0=[yn10,yn20,yn|Ω|0].subscriptsuperscript𝑦0Ωsuperscriptsubscriptsuperscript𝑦0subscript𝑛1subscriptsuperscript𝑦0subscript𝑛2subscriptsuperscript𝑦0subscript𝑛Ωtop\displaystyle y^{0}_{\Omega}=[y^{0}_{n_{1}},\ y^{0}_{n_{2}},\ \cdots\ y^{0}_{n% _{|\Omega|}}]^{\top}.italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT = [ italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ⋯ italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT | roman_Ω | end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT . (20)

Note that F𝐹Fitalic_F has dimension N×N𝑁𝑁N\times Nitalic_N × italic_N, while FΩsubscript𝐹ΩF_{\Omega}italic_F start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT has dimension N×|Ω|𝑁ΩN\times|\Omega|italic_N × | roman_Ω |. With these notations, the compressed sensing algorithm can be thought as solving the optimization problem

mins1,s.t.FΩs=yΩ0,sn,\min\|s\|_{1},\quad s.t.\quad F_{\Omega}s=y^{0}_{\Omega},\quad{s\in\mathbb{R}^% {n},}roman_min ∥ italic_s ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s . italic_t . italic_F start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT italic_s = italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT , italic_s ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , (21)

which can be rewritten as a linear programming problem. When Nr=𝒪(logN)𝑁𝑟𝒪𝑁Nr=\mathcal{O}(\log N)italic_N italic_r = caligraphic_O ( roman_log italic_N ) and the frequency support \mathcal{F}caligraphic_F is sparse in the sense that ||𝒪(logN)𝒪𝑁|\mathcal{F}|\leq\mathcal{O}(\log N)| caligraphic_F | ≤ caligraphic_O ( roman_log italic_N ), the optimal solution s#superscript𝑠#s^{\#}italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT equals to the frequency-domain signal x𝑥xitalic_x with high probability. Rigorous statements can be found in [28].

Provided that the signal has extra noise yn=yn0+znsubscript𝑦𝑛subscriptsuperscript𝑦0𝑛subscript𝑧𝑛y_{n}=y^{0}_{n}+z_{n}italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT where |zn|ηsubscript𝑧𝑛𝜂|z_{n}|\leq\eta| italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ≤ italic_η, the signal can be approximately recovered by the convex relaxation algorithm [29]:

mins1,s.t.FΩsyΩ2|Ω|η.\min\|s\|_{1},\quad s.t.\quad\|F_{\Omega}s-y_{\Omega}\|_{2}\leq\sqrt{|\Omega|}\eta.roman_min ∥ italic_s ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s . italic_t . ∥ italic_F start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT italic_s - italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ square-root start_ARG | roman_Ω | end_ARG italic_η . (22)

The difference between s#superscript𝑠#s^{\#}italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT and x𝑥xitalic_x depends on the size of η𝜂\etaitalic_η.

The uniqueness and robustness of compressed sensing solution can be analyzed through the so-called dual certificate [30]. In optimization, the dual certificate is the optimal solution to the dual problem. In a regular compressed sensing task where all frequencies are on-grid, for each frequency support \mathcal{F}caligraphic_F and each sample set ΩΩ\Omegaroman_Ω, the dual certificate is an N𝑁Nitalic_N-dimensional random vector p𝑝pitalic_p such that

V,p=FΩV,𝑉𝑝subscriptsuperscript𝐹Ω𝑉\displaystyle\exists V,\quad p=F^{\dagger}_{\Omega}V,∃ italic_V , italic_p = italic_F start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT italic_V , (23)
k,k/N,pk=1,formulae-sequencefor-all𝑘𝑘𝑁subscript𝑝𝑘1\displaystyle\forall k,\ k/N\in\mathcal{F},\quad p_{k}=1,∀ italic_k , italic_k / italic_N ∈ caligraphic_F , italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 1 , (24)
k,k/N,|pk|<1ε(Ω),formulae-sequencefor-all𝑘𝑘𝑁subscript𝑝𝑘1𝜀Ω\displaystyle\forall k,\ k/N\not\in\mathcal{F},\quad|p_{k}|<1-\varepsilon(% \Omega),∀ italic_k , italic_k / italic_N ∉ caligraphic_F , | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | < 1 - italic_ε ( roman_Ω ) , (25)

where ε(Ω)(0,1)𝜀Ω01\varepsilon(\Omega)\in(0,1)italic_ε ( roman_Ω ) ∈ ( 0 , 1 ) is determined by the size of ΩΩ\Omegaroman_Ω. The existence of such a dual certificate is called the exact reconstruction principle [28]. Using dual certificate as a pivot, we can quantify the part of s#superscript𝑠#s^{\#}italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT that is not supported on \mathcal{F}caligraphic_F, which further helps us determine the accuracy of the algorithm. The basic idea is to separate the entries of s#superscript𝑠#s^{\#}italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT into two parts:

k/N|sk#|,k/N|sk#|.subscript𝑘𝑁subscriptsuperscript𝑠#𝑘subscript𝑘𝑁subscriptsuperscript𝑠#𝑘\sum_{k/N\in\mathcal{F}}|s^{\#}_{k}|,\quad\sum_{k/N\not\in\mathcal{F}}|s^{\#}_% {k}|.∑ start_POSTSUBSCRIPT italic_k / italic_N ∈ caligraphic_F end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | , ∑ start_POSTSUBSCRIPT italic_k / italic_N ∉ caligraphic_F end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | . (26)

Because the true frequency-domain solution xksubscript𝑥𝑘x_{k}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT satisfies the constraint in Eq. (22), the 1-norm of s#superscript𝑠#s^{\#}italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT is upper bounded by the 1-norm of x𝑥xitalic_x. On the other hand, the norm of the inner product between s#superscript𝑠#s^{\#}italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT and p𝑝pitalic_p can be well-estimated, and the restrictions on the entries of p𝑝pitalic_p enable us to deal with k/N|sk#|subscript𝑘𝑁subscriptsuperscript𝑠#𝑘\sum_{k/N\in\mathcal{F}}|s^{\#}_{k}|∑ start_POSTSUBSCRIPT italic_k / italic_N ∈ caligraphic_F end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | and k/N|sk#|subscript𝑘𝑁subscriptsuperscript𝑠#𝑘\sum_{k/N\not\in\mathcal{F}}|s^{\#}_{k}|∑ start_POSTSUBSCRIPT italic_k / italic_N ∉ caligraphic_F end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | separately. The combination of these considerations eventually provides us an upper bound on k/N|sk#|subscript𝑘𝑁subscriptsuperscript𝑠#𝑘\sum_{k/N\not\in\mathcal{F}}|s^{\#}_{k}|∑ start_POSTSUBSCRIPT italic_k / italic_N ∉ caligraphic_F end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT |.

To write the QEEP signal in Eq. (1) in the form of the compressed sensing signal in Eq. (12), we introduce a unit time step τ𝜏\tauitalic_τ, such that

yn0=y0(nτ),𝒯={nτ,nΩ}.formulae-sequencesubscriptsuperscript𝑦0𝑛superscript𝑦0𝑛𝜏𝒯𝑛𝜏𝑛Ωy^{0}_{n}=y^{0}(n\tau),\quad\mathcal{T}=\{n\tau,n\in\Omega\}.italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_n italic_τ ) , caligraphic_T = { italic_n italic_τ , italic_n ∈ roman_Ω } . (27)

In the same framework, we have the following correspondence:

f=Eτ2π,={Eτ2π,EΞ}.formulae-sequence𝑓subscript𝐸𝜏2𝜋subscript𝐸𝜏2𝜋subscript𝐸Ξf=\frac{E_{\ell}\tau}{2\pi},\quad\mathcal{F}=\left\{\frac{E_{\ell}\tau}{2\pi},% \quad E_{\ell}\in\Xi\right\}.italic_f = divide start_ARG italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_τ end_ARG start_ARG 2 italic_π end_ARG , caligraphic_F = { divide start_ARG italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_τ end_ARG start_ARG 2 italic_π end_ARG , italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ∈ roman_Ξ } . (28)

The target energy levels ΞΞ\Xiroman_Ξ can be directly obtained from the frequency support \mathcal{F}caligraphic_F. Note that the frequencies are defined on [0,1)01[0,1)[ 0 , 1 ). To keep the order of energy levels unchanged, it is necessary to have

,Eτ[0,2π).for-allsubscript𝐸𝜏02𝜋\forall\ell,\quad E_{\ell}\tau\in[0,2\pi).∀ roman_ℓ , italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_τ ∈ [ 0 , 2 italic_π ) . (29)

This condition can always be satisfied by adding a constant to the Hamiltonian H𝐻Hitalic_H and choosing τ𝜏\tauitalic_τ properly.

3 Single eigenvalue estimation

In this section, we provide a quantitative restriction on the noise tolerance η𝜂\etaitalic_η in Eq. (22). Then by considering the combination of three sources of inexactness, namely the uncertainty of the Hadamard test, the inaccuracy of the initial state, and the off-grid deviation as noise, we prove performance bounds for our algorithm.

Let us start with the case when the signal contains only one frequency ={f0}subscript𝑓0\mathcal{F}=\{f_{0}\}caligraphic_F = { italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } and the frequency is on-grid: f0=χ/N,χ[N]formulae-sequencesubscript𝑓0𝜒𝑁𝜒delimited-[]𝑁f_{0}=\chi/N,\chi\in[N]italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_χ / italic_N , italic_χ ∈ [ italic_N ], for which we are given the values of

yn0=ei2πχn/Nsubscriptsuperscript𝑦0𝑛superscript𝑒𝑖2𝜋𝜒𝑛𝑁y^{0}_{n}=e^{-i2\pi\chi n/N}italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_χ italic_n / italic_N end_POSTSUPERSCRIPT (30)

on a random sample set ΩΩ\Omegaroman_Ω. Note that the frequency-domain signal is xk=δk,χsubscript𝑥𝑘subscript𝛿𝑘𝜒x_{k}=\delta_{k,\chi}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_δ start_POSTSUBSCRIPT italic_k , italic_χ end_POSTSUBSCRIPT. The following lemma shows how its dual certificate can be constructed.

Lemma 1.

Given values of signal yn0=ei2πχn/Nsubscriptsuperscript𝑦0𝑛superscript𝑒𝑖2𝜋𝜒𝑛𝑁y^{0}_{n}=e^{-i2\pi\chi n/N}italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_χ italic_n / italic_N end_POSTSUPERSCRIPT on a random sample set Ωnormal-Ω\Omegaroman_Ω with sampling ratio

r>35Nln(5Nδ),𝑟35𝑁5𝑁𝛿r>\frac{35}{N}\ln\left(\frac{5N}{\delta}\right),italic_r > divide start_ARG 35 end_ARG start_ARG italic_N end_ARG roman_ln ( divide start_ARG 5 italic_N end_ARG start_ARG italic_δ end_ARG ) , (31)

the random vector p=FΩyΩ0/|Ω|𝑝subscriptsuperscript𝐹normal-†normal-Ωsubscriptsuperscript𝑦0normal-Ωnormal-Ωp=F^{\dagger}_{\Omega}y^{0}_{\Omega}/|\Omega|italic_p = italic_F start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT / | roman_Ω | satisfies

pχ=1;|pk|12,kχformulae-sequencesubscript𝑝𝜒1formulae-sequencesubscript𝑝𝑘12𝑘𝜒p_{\chi}=1;\quad|p_{k}|\leq\frac{1}{2},\ k\neq\chiitalic_p start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT = 1 ; | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ≤ divide start_ARG 1 end_ARG start_ARG 2 end_ARG , italic_k ≠ italic_χ (32)

with probability at least 1δ1𝛿1-\delta1 - italic_δ.

Proof.

See Appendix A. ∎

We can now prove the robust recovery of algorithm Eq. (22) using Lemma 1.

Lemma 2.

Given a noisy signal yn=ei2πχn/N+znsubscript𝑦𝑛superscript𝑒𝑖2𝜋𝜒𝑛𝑁subscript𝑧𝑛y_{n}=e^{-i2\pi\chi n/N}+z_{n}italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_χ italic_n / italic_N end_POSTSUPERSCRIPT + italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and a random sample set Ωnormal-Ω\Omegaroman_Ω with sampling ratio satisfying Lemma 1, if the noise satisfies zΩ2|Ω|ηsubscriptnormsubscript𝑧normal-Ω2normal-Ω𝜂\|z_{\Omega}\|_{2}\leq\sqrt{|\Omega|}\eta∥ italic_z start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ square-root start_ARG | roman_Ω | end_ARG italic_η, the solution s#superscript𝑠normal-#s^{\#}italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT to Eq. (22) satisfies:

kχ|sk#|4η,|sχ#|14ηformulae-sequencesubscript𝑘𝜒subscriptsuperscript𝑠#𝑘4𝜂subscriptsuperscript𝑠#𝜒14𝜂\sum_{k\neq\chi}|s^{\#}_{k}|\leq 4\eta,\quad|s^{\#}_{\chi}|\geq 1-4\eta∑ start_POSTSUBSCRIPT italic_k ≠ italic_χ end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ≤ 4 italic_η , | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT | ≥ 1 - 4 italic_η (33)

with probability at least 1δ1𝛿1-\delta1 - italic_δ.

Proof.

See Appendix B. ∎

By Lemma 2 we see that as long as η<1/8𝜂18\eta<1/8italic_η < 1 / 8, we have |sχ#|>kn|sχ#|subscriptsuperscript𝑠#𝜒subscript𝑘𝑛subscriptsuperscript𝑠#𝜒|s^{\#}_{\chi}|>\sum_{k\neq n}|s^{\#}_{\chi}|| italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT | > ∑ start_POSTSUBSCRIPT italic_k ≠ italic_n end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT |, so the solution to Eq. (22) still satisfies χ=argmax|sk#|𝜒subscriptsuperscript𝑠#𝑘\chi=\arg\max|s^{\#}_{k}|italic_χ = roman_arg roman_max | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT |.

In practical situations we cannot assume that the frequency is always perfectly on-grid, but this is not a problem. First, when Nf0𝑁subscript𝑓0Nf_{0}italic_N italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is very close to an integer, Eq. (22) can still approximately recovery the signal. The following proposition quantifies how large the off-grid deviation can be.

Proposition 1.

Given a signal yn=ei2π(χ+ω)n/Nsubscript𝑦𝑛superscript𝑒𝑖2𝜋𝜒𝜔𝑛𝑁y_{n}=e^{-i2\pi(\chi+\omega)n/N}italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π ( italic_χ + italic_ω ) italic_n / italic_N end_POSTSUPERSCRIPT with χN,ω(0.5,0.5]formulae-sequence𝜒subscript𝑁𝜔0.50.5\chi\in\mathbb{Z}_{N},\,\omega\in(-0.5,0.5]italic_χ ∈ blackboard_Z start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , italic_ω ∈ ( - 0.5 , 0.5 ], and a set of samples Ωnormal-Ω\Omegaroman_Ω with sampling ratio satisfying Lemma 1, if |ω|<1/16π𝜔116𝜋|\omega|<1/16\pi| italic_ω | < 1 / 16 italic_π, then with probability at least 1δ1𝛿1-\delta1 - italic_δ, the optimal solution s#superscript𝑠normal-#s^{\#}italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT to Eq. (21) satisfies

χ=argmax|sk#|.𝜒subscriptsuperscript𝑠#𝑘\chi=\arg\max|s^{\#}_{k}|.italic_χ = roman_arg roman_max | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | . (34)
Proof.

In this situation, we have

zn=ei2πχn/N(ei2πωn/N1),subscript𝑧𝑛superscript𝑒𝑖2𝜋𝜒𝑛𝑁superscript𝑒𝑖2𝜋𝜔𝑛𝑁1\displaystyle{z_{n}=e^{-i2\pi\chi n/N}(e^{i2\pi\omega n/N}-1)},italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_χ italic_n / italic_N end_POSTSUPERSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_i 2 italic_π italic_ω italic_n / italic_N end_POSTSUPERSCRIPT - 1 ) , (35)
|zn|=2|sin(πωn/N)|<2π|ω|n/N.subscript𝑧𝑛conditional2𝜋𝜔𝑛𝑁bra2𝜋𝜔𝑛𝑁\displaystyle|z_{n}|=2|\sin(\pi\omega n/N)|<2\pi|\omega|n/N.| italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | = 2 | roman_sin ( italic_π italic_ω italic_n / italic_N ) | < 2 italic_π | italic_ω | italic_n / italic_N . (36)

According to Lemma 2, one sufficient condition for robust recovery is |zn|1/8subscript𝑧𝑛18|z_{n}|\leq 1/8| italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ≤ 1 / 8 for all n𝑛nitalic_n. Hence it is enough to have |ω|1/16π𝜔116𝜋|\omega|\leq 1/16\pi| italic_ω | ≤ 1 / 16 italic_π. ∎

The |ω|𝜔|\omega|| italic_ω | in Proposition 34 represents the off-grid deviation. If |ω|1/16π𝜔116𝜋|\omega|\leq 1/16\pi| italic_ω | ≤ 1 / 16 italic_π, the solution f#=argmax|sk#|/Nsuperscript𝑓#subscriptsuperscript𝑠#𝑘𝑁f^{\#}=\arg\max|s^{\#}_{k}|/Nitalic_f start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT = roman_arg roman_max | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | / italic_N is still a good approximation of the true frequency (χ+ω)/N𝜒𝜔𝑁(\chi+\omega)/N( italic_χ + italic_ω ) / italic_N, and the accuracy still scales as 𝒪(N1)𝒪superscript𝑁1\mathcal{O}(N^{-1})caligraphic_O ( italic_N start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ). When the off-grid deviation is large, we can find a grid-shift parameter ν𝜈\nuitalic_ν such that after the transformation ynynei2πνn/Nsubscript𝑦𝑛subscript𝑦𝑛superscript𝑒𝑖2𝜋𝜈𝑛𝑁y_{n}\to y_{n}e^{i2\pi\nu n/N}italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT → italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_i 2 italic_π italic_ν italic_n / italic_N end_POSTSUPERSCRIPT, the new signal is nearly on-grid. The details are given in Algorithm 1.

Besides the Hadamard tests and the off-grid deviation, the third type of noise comes from the preparation of the initial state. In practice, it is hard to prepare the exact ground state |Ψ0ketsubscriptΨ0|\Psi_{0}\rangle| roman_Ψ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟩. Suppose the actual initial state is

|Φ=1γ|Ψ0+0γ|Ψ,ketΦ1𝛾ketsubscriptΨ0subscript0subscript𝛾ketsubscriptΨ|\Phi\rangle=\sqrt{1-\gamma}|\Psi_{0}\rangle+\sum_{\ell\neq 0}\sqrt{\gamma_{% \ell}}|\Psi_{\ell}\rangle,| roman_Φ ⟩ = square-root start_ARG 1 - italic_γ end_ARG | roman_Ψ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟩ + ∑ start_POSTSUBSCRIPT roman_ℓ ≠ 0 end_POSTSUBSCRIPT square-root start_ARG italic_γ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG | roman_Ψ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ⟩ , (37)

with 0γ=γsubscript0subscript𝛾𝛾\sum_{\ell\neq 0}\gamma_{\ell}=\gamma∑ start_POSTSUBSCRIPT roman_ℓ ≠ 0 end_POSTSUBSCRIPT italic_γ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT = italic_γ, then yn0subscriptsuperscript𝑦0𝑛y^{0}_{n}italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT can be expanded as

Φ|U(nτ)|Φ=(1γ)eiE0nτ+0γeiEnτ=eiE0nτ+𝒪(γ).quantum-operator-productΦ𝑈𝑛𝜏Φ1𝛾superscript𝑒𝑖subscript𝐸0𝑛𝜏subscript0subscript𝛾superscript𝑒𝑖subscript𝐸𝑛𝜏superscript𝑒𝑖subscript𝐸0𝑛𝜏𝒪𝛾\langle\Phi|U(n\tau)|\Phi\rangle=(1-\gamma)e^{-iE_{0}n\tau}+\sum_{\ell\neq 0}% \gamma_{\ell}e^{-iE_{\ell}n\tau}=e^{-iE_{0}n\tau}+\mathcal{O}(\gamma).⟨ roman_Φ | italic_U ( italic_n italic_τ ) | roman_Φ ⟩ = ( 1 - italic_γ ) italic_e start_POSTSUPERSCRIPT - italic_i italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_n italic_τ end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT roman_ℓ ≠ 0 end_POSTSUBSCRIPT italic_γ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_i italic_E start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT italic_n italic_τ end_POSTSUPERSCRIPT = italic_e start_POSTSUPERSCRIPT - italic_i italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_n italic_τ end_POSTSUPERSCRIPT + caligraphic_O ( italic_γ ) . (38)

Note that only eiE0nτsuperscript𝑒𝑖subscript𝐸0𝑛𝜏e^{-iE_{0}n\tau}italic_e start_POSTSUPERSCRIPT - italic_i italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_n italic_τ end_POSTSUPERSCRIPT is the target signal and we can treat the 𝒪(γ)𝒪𝛾\mathcal{O}(\gamma)caligraphic_O ( italic_γ ) term as noise as well. Thus, let E0τ=2π(χ+ω)/Nsubscript𝐸0𝜏2𝜋𝜒𝜔𝑁E_{0}\tau=2\pi(\chi+\omega)/Nitalic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_τ = 2 italic_π ( italic_χ + italic_ω ) / italic_N, then each sampled signal can be decomposed as

ynsubscript𝑦𝑛\displaystyle y_{n}italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT =Φ|U(nτ)|Φ+hnabsentquantum-operator-productΦ𝑈𝑛𝜏Φsubscript𝑛\displaystyle=\langle\Phi|U(n\tau)|\Phi\rangle+h_{n}= ⟨ roman_Φ | italic_U ( italic_n italic_τ ) | roman_Φ ⟩ + italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT (39)
=Ψ|U(nτ)|Ψ+hn+𝒪(γ)absentquantum-operator-productΨ𝑈𝑛𝜏Ψsubscript𝑛𝒪𝛾\displaystyle=\langle\Psi|U(n\tau)|\Psi\rangle+h_{n}+\mathcal{O}(\gamma)= ⟨ roman_Ψ | italic_U ( italic_n italic_τ ) | roman_Ψ ⟩ + italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + caligraphic_O ( italic_γ )
=eiE0nτ+hn+𝒪(γ)absentsuperscript𝑒𝑖subscript𝐸0𝑛𝜏subscript𝑛𝒪𝛾\displaystyle=e^{-iE_{0}n\tau}+h_{n}+\mathcal{O}(\gamma)= italic_e start_POSTSUPERSCRIPT - italic_i italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_n italic_τ end_POSTSUPERSCRIPT + italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + caligraphic_O ( italic_γ )
=ei2π(χ+ν)n/N+hn+𝒪(γ)+ei2π(χ+ν)n/N(ei2π(ων)n/N1)absentsuperscript𝑒𝑖2𝜋𝜒𝜈𝑛𝑁subscript𝑛𝒪𝛾superscript𝑒𝑖2𝜋𝜒𝜈𝑛𝑁superscript𝑒𝑖2𝜋𝜔𝜈𝑛𝑁1\displaystyle=e^{-i2\pi(\chi+\nu)n/N}+h_{n}+\mathcal{O}(\gamma)+e^{-i2\pi(\chi% +\nu)n/N}(e^{-i2\pi(\omega-\nu)n/N}-1)= italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π ( italic_χ + italic_ν ) italic_n / italic_N end_POSTSUPERSCRIPT + italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + caligraphic_O ( italic_γ ) + italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π ( italic_χ + italic_ν ) italic_n / italic_N end_POSTSUPERSCRIPT ( italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π ( italic_ω - italic_ν ) italic_n / italic_N end_POSTSUPERSCRIPT - 1 )

where hnsubscript𝑛h_{n}italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT originates from the Hadamard test, 𝒪(γ)𝒪𝛾\mathcal{O}(\gamma)caligraphic_O ( italic_γ ) originates from the state preparation, and ν𝜈\nuitalic_ν is a grid-shift parameter. Recall that in Lemma 2, as long as η<1/8𝜂18\eta<1/8italic_η < 1 / 8, the noise does not break down the signal recovery (after shifting the entire signal by ynynei2πνn/Nsubscript𝑦𝑛subscript𝑦𝑛superscript𝑒𝑖2𝜋𝜈𝑛𝑁y_{n}\to y_{n}e^{i2\pi\nu n/N}italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT → italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_i 2 italic_π italic_ν italic_n / italic_N end_POSTSUPERSCRIPT). We separate η𝜂\etaitalic_η into three parts. Let 2γη12𝛾subscript𝜂12\gamma\leq\eta_{1}2 italic_γ ≤ italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, |hn|η2subscript𝑛subscript𝜂2|h_{n}|\leq\eta_{2}| italic_h start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ≤ italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, 2π|ων|<η32𝜋𝜔𝜈subscript𝜂32\pi|\omega-\nu|<\eta_{3}2 italic_π | italic_ω - italic_ν | < italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, and η1+η2+η31/8subscript𝜂1subscript𝜂2subscript𝜂318\eta_{1}+\eta_{2}+\eta_{3}\leq 1/8italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ≤ 1 / 8. Then given η1subscript𝜂1\eta_{1}italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, η2subscript𝜂2\eta_{2}italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, η3subscript𝜂3\eta_{3}italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, the fidelity error upper bound 1η12/41superscriptsubscript𝜂124\sqrt{1-\eta_{1}^{2}/4}square-root start_ARG 1 - italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 4 end_ARG, the number of Hadamard tests that is at least 𝒪(log|Ω|/η22)𝒪Ωsuperscriptsubscript𝜂22\mathcal{O}(\log|\Omega|/\eta_{2}^{2})caligraphic_O ( roman_log | roman_Ω | / italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), and the number of grid shifts as 2π/η32𝜋subscript𝜂3\lceil 2\pi/\eta_{3}\rceil⌈ 2 italic_π / italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ⌉, the complete algorithm for QPE is presented in Algorithm 1.

Algorithm 1 QPE
1:accuracy level N𝑁Nitalic_N, unit time step τ𝜏\tauitalic_τ, failure probability δ1,δ2subscript𝛿1subscript𝛿2\delta_{1},\delta_{2}italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, noise tolerance parameters η1,η2,η3subscript𝜂1subscript𝜂2subscript𝜂3\eta_{1},\eta_{2},\eta_{3}italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, Hamiltonian H𝐻Hitalic_H, and an initial state |ΦketΦ|\Phi\rangle| roman_Φ ⟩ with overlap larger than 1η12/41superscriptsubscript𝜂124\sqrt{1-\eta_{1}^{2}/4}square-root start_ARG 1 - italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 4 end_ARG.
2:E=2πk/Nsuperscript𝐸2𝜋superscript𝑘𝑁E^{\ast}=2\pi k^{\ast}/Nitalic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = 2 italic_π italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT / italic_N.
3:Sample integers from [N]delimited-[]𝑁[N][ italic_N ] each with sampling ratio r=Θ(ln(N/δ1))𝑟Θ𝑁subscript𝛿1r=\Theta(\ln(N/\delta_{1}))italic_r = roman_Θ ( roman_ln ( italic_N / italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ), denoted by ΩΩ\Omegaroman_Ω.
4:for nΩ𝑛Ωn\in\Omegaitalic_n ∈ roman_Ω do
5:     Prepare the initial state |ΦketΦ|\Phi\rangle| roman_Φ ⟩ and unitary operator eiHnτsuperscript𝑒𝑖𝐻𝑛𝜏e^{-iHn\tau}italic_e start_POSTSUPERSCRIPT - italic_i italic_H italic_n italic_τ end_POSTSUPERSCRIPT;
6:     Perform Hadamard tests on eiHnτ|Φsuperscript𝑒𝑖𝐻𝑛𝜏ketΦe^{-iHn\tau}|\Phi\rangleitalic_e start_POSTSUPERSCRIPT - italic_i italic_H italic_n italic_τ end_POSTSUPERSCRIPT | roman_Φ ⟩ for 𝒪(log(|Ω|/δ2)/η22)𝒪Ωsubscript𝛿2superscriptsubscript𝜂22\mathcal{O}(\log(|\Omega|/\delta_{2})/\eta_{2}^{2})caligraphic_O ( roman_log ( | roman_Ω | / italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) / italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) times;
7:     Calculate the average value of the test outcomes as signal ynsubscript𝑦𝑛y_{n}italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.
8:end for
9:for j=1,2,,2π/η3𝑗122𝜋subscript𝜂3j=1,2,\cdots,\lceil 2\pi/\eta_{3}\rceilitalic_j = 1 , 2 , ⋯ , ⌈ 2 italic_π / italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ⌉ do
10:     Set ν=0.5+j/2π/η3𝜈0.5𝑗2𝜋subscript𝜂3\nu=-0.5+j/\lceil 2\pi/\eta_{3}\rceilitalic_ν = - 0.5 + italic_j / ⌈ 2 italic_π / italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ⌉;
11:     Generate a new signal set {y~n=ynei2πνn/N,nΩ\{\tilde{y}_{n}=y_{n}\cdot e^{i2\pi\nu n/N},n\in\Omega{ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⋅ italic_e start_POSTSUPERSCRIPT italic_i 2 italic_π italic_ν italic_n / italic_N end_POSTSUPERSCRIPT , italic_n ∈ roman_Ω};
12:     Set η=η1+η2+η3𝜂subscript𝜂1subscript𝜂2subscript𝜂3\eta=\eta_{1}+\eta_{2}+\eta_{3}italic_η = italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT. Solve
mins1,s.t.FΩsy~Ω2|Ω|η\min\|s\|_{1},\quad s.t.\quad\|F_{\Omega}s-\tilde{y}_{\Omega}\|_{2}\leq\sqrt{|% \Omega|}\etaroman_min ∥ italic_s ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s . italic_t . ∥ italic_F start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT italic_s - over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ square-root start_ARG | roman_Ω | end_ARG italic_η
to obtain solution s#superscript𝑠#s^{\#}italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT;
13:     Set kν=argmax|sk#|+νsubscript𝑘𝜈subscriptsuperscript𝑠#𝑘𝜈k_{\nu}=\arg\max|s^{\#}_{k}|+\nuitalic_k start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT = roman_arg roman_max | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | + italic_ν;
14:     Calculate the total empirical error nΩ|ynei2πkνn/N|2subscript𝑛Ωsuperscriptsubscript𝑦𝑛superscript𝑒𝑖2𝜋subscript𝑘𝜈𝑛𝑁2\sum_{n\in\Omega}|y_{n}-e^{-i2\pi k_{\nu}n/N}|^{2}∑ start_POSTSUBSCRIPT italic_n ∈ roman_Ω end_POSTSUBSCRIPT | italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_k start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT italic_n / italic_N end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT.
15:end for
16:Find the frequency kνsubscript𝑘𝜈k_{\nu}italic_k start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT with the smallest total empirical error, set it as ksuperscript𝑘k^{\ast}italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT.
Theorem 1.

Given the ground state |Ψketnormal-Ψ|\Psi\rangle| roman_Ψ ⟩, the ground-state energy E0subscript𝐸0E_{0}italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and the initial state |Φ=1γ|Ψ+γ|Ψketnormal-Φ1𝛾ketnormal-Ψ𝛾ketsuperscriptnormal-Ψperpendicular-to|\Phi\rangle=\sqrt{1-\gamma}|\Psi\rangle+\sqrt{\gamma}|\Psi^{\perp}\rangle| roman_Φ ⟩ = square-root start_ARG 1 - italic_γ end_ARG | roman_Ψ ⟩ + square-root start_ARG italic_γ end_ARG | roman_Ψ start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT ⟩, if the noise tolerance parameters satisfy

η1+η2+η318,η12γ,formulae-sequencesubscript𝜂1subscript𝜂2subscript𝜂318subscript𝜂12𝛾\eta_{1}+\eta_{2}+\eta_{3}\leq\frac{1}{8},\quad{\eta_{1}\geq 2\gamma},italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 8 end_ARG , italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ 2 italic_γ , (40)

and the accuracy level satisfies N100𝑁100N\geq 100italic_N ≥ 100, then with probability at least (1δ1)(1δ2)(1δ3)31subscript𝛿11subscript𝛿2superscript1subscript𝛿33(1-\delta_{1})(1-\delta_{2})(1-\delta_{3})^{3}( 1 - italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ( 1 - italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ( 1 - italic_δ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT, where δ3=(δ1/4N)1/4subscript𝛿3superscriptsubscript𝛿14𝑁14\delta_{3}=(\delta_{1}/4N)^{1/4}italic_δ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = ( italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 4 italic_N ) start_POSTSUPERSCRIPT 1 / 4 end_POSTSUPERSCRIPT, the output of Algorithm 1 satisfies

|E0E|mod 2π/τπNτ[1.0132η3+62(η1+η2)]=ϵ,subscriptsubscript𝐸0superscript𝐸mod 2𝜋𝜏𝜋𝑁𝜏delimited-[]1.0132subscript𝜂362subscript𝜂1subscript𝜂2italic-ϵ|E_{0}-E^{\ast}|_{\text{mod }2\pi/\tau}\leq\frac{\pi}{N\tau}\left[\frac{1.01% \sqrt{3}}{2}\eta_{3}+\frac{\sqrt{6}}{2}(\eta_{1}+\eta_{2})\right]=\epsilon,| italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT | start_POSTSUBSCRIPT mod 2 italic_π / italic_τ end_POSTSUBSCRIPT ≤ divide start_ARG italic_π end_ARG start_ARG italic_N italic_τ end_ARG [ divide start_ARG 1.01 square-root start_ARG 3 end_ARG end_ARG start_ARG 2 end_ARG italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT + divide start_ARG square-root start_ARG 6 end_ARG end_ARG start_ARG 2 end_ARG ( italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ] = italic_ϵ , (41)

and the cost of the algorithm satisfies

TmaxNτ,subscript𝑇𝑁𝜏\displaystyle T_{\max}\leq N\tau,italic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ≤ italic_N italic_τ , (42)
T𝑡𝑜𝑡𝑎𝑙=𝒪(Nτlog(N/δ1η3)log(log(N/δ1η3)/δ2)/η22).subscript𝑇𝑡𝑜𝑡𝑎𝑙𝒪𝑁𝜏𝑁subscript𝛿1subscript𝜂3𝑁subscript𝛿1subscript𝜂3subscript𝛿2superscriptsubscript𝜂22\displaystyle T_{\text{total}}=\mathcal{O}\left(N\tau\cdot\log(N/\delta_{1}% \eta_{3})\cdot\log(\log(N/\delta_{1}\eta_{3})/\delta_{2})/\eta_{2}^{2}\right).italic_T start_POSTSUBSCRIPT total end_POSTSUBSCRIPT = caligraphic_O ( italic_N italic_τ ⋅ roman_log ( italic_N / italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) ⋅ roman_log ( roman_log ( italic_N / italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) / italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) / italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) . (43)
Proof.

Since δ2subscript𝛿2\delta_{2}italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT denotes the total failure rate of the Hadamard tests, by Hoeffding’s inequality log(|Ω|δ21)/η22Ωsuperscriptsubscript𝛿21superscriptsubscript𝜂22\log(|\Omega|\delta_{2}^{-1})/\eta_{2}^{2}roman_log ( | roman_Ω | italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) / italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT Hadamard tests are enough to guarantee that for all nΩ𝑛Ωn\in\Omegaitalic_n ∈ roman_Ω, the additive error from the Hadamard tests is smaller than η2subscript𝜂2\eta_{2}italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT with probability 1δ21subscript𝛿21-\delta_{2}1 - italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

Now we analyze the accuracy on E0subscript𝐸0E_{0}italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Let k0=NE0τ/2πsubscript𝑘0𝑁subscript𝐸0𝜏2𝜋k_{0}=NE_{0}\tau/2\piitalic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_N italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_τ / 2 italic_π. Define a(k)𝑎𝑘a(k)italic_a ( italic_k ) as following

a(k)=[ei2πk/N,ei2πk2/N,,ei2πkN/N].𝑎𝑘superscriptsuperscript𝑒𝑖2𝜋𝑘𝑁superscript𝑒𝑖2𝜋𝑘2𝑁superscript𝑒𝑖2𝜋𝑘𝑁𝑁topa(k)=[e^{-i2\pi k/N},e^{-i2\pi k2/N},\cdots,e^{-i2\pi kN/N}]^{\top}.italic_a ( italic_k ) = [ italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_k / italic_N end_POSTSUPERSCRIPT , italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_k 2 / italic_N end_POSTSUPERSCRIPT , ⋯ , italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_k italic_N / italic_N end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT . (44)

Let a(k)Ω𝑎subscript𝑘Ωa(k)_{\Omega}italic_a ( italic_k ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT be the projection of a(k)𝑎𝑘a(k)italic_a ( italic_k ) on the random set ΩΩ\Omegaroman_Ω. The sampled signal becomes

yΩ=a(k0)Ω+zΩ,|zn|η¯=η1+η2.formulae-sequencesubscript𝑦Ω𝑎subscriptsubscript𝑘0Ωsubscript𝑧Ωsubscript𝑧𝑛¯𝜂subscript𝜂1subscript𝜂2y_{\Omega}=a(k_{0})_{\Omega}+z_{\Omega},\quad|z_{n}|\leq\bar{\eta}=\eta_{1}+% \eta_{2}.italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT = italic_a ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT + italic_z start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT , | italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ≤ over¯ start_ARG italic_η end_ARG = italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT . (45)

Define the total empirical error function as

(k)=a(k)Ωa(k0)ΩzΩ2.𝑘subscriptnorm𝑎subscript𝑘Ω𝑎subscriptsubscript𝑘0Ωsubscript𝑧Ω2\mathcal{E}(k)=\|a(k)_{\Omega}-a(k_{0})_{\Omega}-z_{\Omega}\|_{2}.caligraphic_E ( italic_k ) = ∥ italic_a ( italic_k ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_a ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_z start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT . (46)

The choice of the grid shift parameters and the sampling ratio guarantee that, among the set of solutions 𝒦={kν}𝒦subscript𝑘𝜈\mathcal{K}=\{k_{\nu}\}caligraphic_K = { italic_k start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT }, there exists at least one k1𝒦subscript𝑘1𝒦k_{1}\in\mathcal{K}italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ caligraphic_K satisfying

|k1k0|mod Nη32π,(k1)|Ω|ηformulae-sequencesubscriptsubscript𝑘1subscript𝑘0mod 𝑁subscript𝜂32𝜋subscript𝑘1Ω𝜂|k_{1}-k_{0}|_{\text{mod }N}\leq\frac{\eta_{3}}{2\pi},\quad\mathcal{E}(k_{1})% \leq\sqrt{|\Omega|}\eta| italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | start_POSTSUBSCRIPT mod italic_N end_POSTSUBSCRIPT ≤ divide start_ARG italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_π end_ARG , caligraphic_E ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ≤ square-root start_ARG | roman_Ω | end_ARG italic_η (47)

with probability at least 1δ11subscript𝛿11-\delta_{1}1 - italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. We omit the notationmodNmoduloabsent𝑁\mod Nroman_mod italic_N from now on. If k1subscript𝑘1k_{1}italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is the only one in 𝒦𝒦\mathcal{K}caligraphic_K that minimizes the function, then k=k1superscript𝑘subscript𝑘1k^{\ast}=k_{1}italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, and the accuracy on ksuperscript𝑘k^{\ast}italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is η3/2πsubscript𝜂32𝜋\eta_{3}/2\piitalic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT / 2 italic_π. Otherwise, suppose there exists (k2)(k1)subscript𝑘2subscript𝑘1\mathcal{E}(k_{2})\leq\mathcal{E}(k_{1})caligraphic_E ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≤ caligraphic_E ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) (no matter k2subscript𝑘2k_{2}italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT originates from a successful compressed sensing algorithm or not), we need to quantify the relation between |k1k0|subscript𝑘1subscript𝑘0|k_{1}-k_{0}|| italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | and |k2k0|subscript𝑘2subscript𝑘0|k_{2}-k_{0}|| italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | and show that |k2k0|=𝒪(η)subscript𝑘2subscript𝑘0𝒪𝜂|k_{2}-k_{0}|=\mathcal{O}(\eta)| italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | = caligraphic_O ( italic_η ) with high probability.

Our strategy is the following: if |k2k0||k1k0|subscript𝑘2subscript𝑘0subscript𝑘1subscript𝑘0|k_{2}-k_{0}|\leq|k_{1}-k_{0}|| italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ≤ | italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT |, then this already gives an error upper bound η3/2πsubscript𝜂32𝜋\eta_{3}/2\piitalic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT / 2 italic_π; thus we only consider the case where |k2k0|>|k1k0|subscript𝑘2subscript𝑘0subscript𝑘1subscript𝑘0|k_{2}-k_{0}|>|k_{1}-k_{0}|| italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | > | italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT |. Suppose we can find positive β0,β1,κsubscript𝛽0subscript𝛽1𝜅\beta_{0},\beta_{1},\kappaitalic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_κ such that for all |kk0|κ𝑘subscript𝑘0𝜅|k-k_{0}|\leq\kappa| italic_k - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ≤ italic_κ,

a(k)Ωa(k0)Ω2[|Ω|β0|kk0|,|Ω|β1|kk0|],subscriptnorm𝑎subscript𝑘Ω𝑎subscriptsubscript𝑘0Ω2Ωsubscript𝛽0𝑘subscript𝑘0Ωsubscript𝛽1𝑘subscript𝑘0\|a(k)_{\Omega}-a(k_{0})_{\Omega}\|_{2}\in\left[\sqrt{|\Omega|}\beta_{0}|k-k_{% 0}|,\sqrt{|\Omega|}\beta_{1}|k-k_{0}|\right],∥ italic_a ( italic_k ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_a ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ [ square-root start_ARG | roman_Ω | end_ARG italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_k - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | , square-root start_ARG | roman_Ω | end_ARG italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_k - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ] , (48)

then if |k2k0|<κsubscript𝑘2subscript𝑘0𝜅|k_{2}-k_{0}|<\kappa| italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | < italic_κ, we have |k1k0|<|k2k0|<κsubscript𝑘1subscript𝑘0subscript𝑘2subscript𝑘0𝜅|k_{1}-k_{0}|<|k_{2}-k_{0}|<\kappa| italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | < | italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | < italic_κ as well, and

(k2)(k1),subscript𝑘2subscript𝑘1\displaystyle\mathcal{E}(k_{2})\leq\mathcal{E}(k_{1}),caligraphic_E ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≤ caligraphic_E ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , (49)
a(k2)Ωa(k0)ΩzΩ2a(k1)Ωa(k0)ΩzΩ2,subscriptnorm𝑎subscriptsubscript𝑘2Ω𝑎subscriptsubscript𝑘0Ωsubscript𝑧Ω2subscriptnorm𝑎subscriptsubscript𝑘1Ω𝑎subscriptsubscript𝑘0Ωsubscript𝑧Ω2\displaystyle\|a(k_{2})_{\Omega}-a(k_{0})_{\Omega}-z_{\Omega}\|_{2}\leq\|a(k_{% 1})_{\Omega}-a(k_{0})_{\Omega}-z_{\Omega}\|_{2},∥ italic_a ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_a ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_z start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ ∥ italic_a ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_a ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_z start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , (50)
a(k2)Ωa(k0)Ω2|Ω|η¯a(k1)Ωa(k0)Ω2+|Ω|η¯,subscriptnorm𝑎subscriptsubscript𝑘2Ω𝑎subscriptsubscript𝑘0Ω2Ω¯𝜂subscriptnorm𝑎subscriptsubscript𝑘1Ω𝑎subscriptsubscript𝑘0Ω2Ω¯𝜂\displaystyle\|a(k_{2})_{\Omega}-a(k_{0})_{\Omega}\|_{2}-\sqrt{|\Omega|}\bar{% \eta}\leq\|a(k_{1})_{\Omega}-a(k_{0})_{\Omega}\|_{2}+\sqrt{|\Omega|}\bar{\eta},∥ italic_a ( italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_a ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - square-root start_ARG | roman_Ω | end_ARG over¯ start_ARG italic_η end_ARG ≤ ∥ italic_a ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_a ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + square-root start_ARG | roman_Ω | end_ARG over¯ start_ARG italic_η end_ARG , (51)
|Ω|β0|k2k0||Ω|β1|k1k0|+2|Ω|η¯,Ωsubscript𝛽0subscript𝑘2subscript𝑘0Ωsubscript𝛽1subscript𝑘1subscript𝑘02Ω¯𝜂\displaystyle\sqrt{|\Omega|}\beta_{0}|k_{2}-k_{0}|\leq\sqrt{|\Omega|}\beta_{1}% |k_{1}-k_{0}|+2\sqrt{|\Omega|}\bar{\eta},square-root start_ARG | roman_Ω | end_ARG italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ≤ square-root start_ARG | roman_Ω | end_ARG italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | + 2 square-root start_ARG | roman_Ω | end_ARG over¯ start_ARG italic_η end_ARG , (52)
|k2k0|β1β0η32π+2η¯β0.subscript𝑘2subscript𝑘0subscript𝛽1subscript𝛽0subscript𝜂32𝜋2¯𝜂subscript𝛽0\displaystyle|k_{2}-k_{0}|\leq\frac{\beta_{1}}{\beta_{0}}\cdot\frac{\eta_{3}}{% 2\pi}+\frac{2\bar{\eta}}{\beta_{0}}.| italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ≤ divide start_ARG italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ⋅ divide start_ARG italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_π end_ARG + divide start_ARG 2 over¯ start_ARG italic_η end_ARG end_ARG start_ARG italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG . (53)

On the other hand, suppose for all k𝒦,kk1,formulae-sequence𝑘𝒦𝑘subscript𝑘1k\in\mathcal{K},\,k\neq k_{1},italic_k ∈ caligraphic_K , italic_k ≠ italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , in region |kk0|>κ𝑘subscript𝑘0𝜅|k-k_{0}|>\kappa| italic_k - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | > italic_κ, we have a constant lower bound:

a(k)Ωa(k0)Ω2|Ω|c.subscriptnorm𝑎subscript𝑘Ω𝑎subscriptsubscript𝑘0Ω2Ω𝑐\|a(k)_{\Omega}-a(k_{0})_{\Omega}\|_{2}\geq\sqrt{|\Omega|}\cdot c.∥ italic_a ( italic_k ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_a ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ square-root start_ARG | roman_Ω | end_ARG ⋅ italic_c . (54)

Using the same argument, (k)(k1)|Ω|η𝑘subscript𝑘1Ω𝜂\mathcal{E}(k)\leq\mathcal{E}(k_{1})\leq\sqrt{|\Omega|}\etacaligraphic_E ( italic_k ) ≤ caligraphic_E ( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ≤ square-root start_ARG | roman_Ω | end_ARG italic_η leads to

cη+η¯<1/4.𝑐𝜂¯𝜂14\displaystyle c\leq\eta+\bar{\eta}<1/4.italic_c ≤ italic_η + over¯ start_ARG italic_η end_ARG < 1 / 4 . (55)

Therefore, if c>1/4𝑐14c>1/4italic_c > 1 / 4, we can rule out the possibility of |k2k0|>κsubscript𝑘2subscript𝑘0𝜅|k_{2}-k_{0}|>\kappa| italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | > italic_κ.

Regard a(k)Ωa(k0)Ω22subscriptsuperscriptnorm𝑎subscript𝑘Ω𝑎subscriptsubscript𝑘0Ω22\|a(k)_{\Omega}-a(k_{0})_{\Omega}\|^{2}_{2}∥ italic_a ( italic_k ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_a ( italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT as the summation of random variables Xn=4sin2(nπ(kk0)/N)1nsubscriptsuperscript𝑋𝑛4superscript2𝑛𝜋𝑘subscript𝑘0𝑁subscript1𝑛X^{\prime}_{n}=4\sin^{2}(n\pi(k-k_{0})/N)1_{n}italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = 4 roman_sin start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_n italic_π ( italic_k - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) / italic_N ) 1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. According to Lemma 90 (in Appendix C), with probability at least

(1exp(Nr/36))(12exp(Nr/24))>(1exp(Nr/36))3>[1(δ1/4N)1/4]3,1𝑁𝑟3612𝑁𝑟24superscript1𝑁𝑟363superscriptdelimited-[]1superscriptsubscript𝛿14𝑁143\begin{split}(1-\exp(-Nr/36))(1-2\exp(-Nr/24))&>(1-\exp(-Nr/36))^{3}\\ &>[1-(\delta_{1}/4N)^{1/4}]^{3},\end{split}start_ROW start_CELL ( 1 - roman_exp ( - italic_N italic_r / 36 ) ) ( 1 - 2 roman_exp ( - italic_N italic_r / 24 ) ) end_CELL start_CELL > ( 1 - roman_exp ( - italic_N italic_r / 36 ) ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL > [ 1 - ( italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / 4 italic_N ) start_POSTSUPERSCRIPT 1 / 4 end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , end_CELL end_ROW (56)

the following parameters suffice for Eq. (48) and Eq. (54):

κ=1/2,β0=8/3,β1=1.012π,c=5/6.formulae-sequence𝜅12formulae-sequencesubscript𝛽083formulae-sequencesubscript𝛽11.012𝜋𝑐56\kappa=1/2,\quad\beta_{0}=\sqrt{8/3},\quad\beta_{1}=1.01\sqrt{2}\pi,\quad c=% \sqrt{5/6}.italic_κ = 1 / 2 , italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = square-root start_ARG 8 / 3 end_ARG , italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1.01 square-root start_ARG 2 end_ARG italic_π , italic_c = square-root start_ARG 5 / 6 end_ARG . (57)

Thus, because 5/6>1/45614\sqrt{5/6}>1/4square-root start_ARG 5 / 6 end_ARG > 1 / 4, k2subscript𝑘2k_{2}italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT must belongs to |k2k0|<1/2subscript𝑘2subscript𝑘012|k_{2}-k_{0}|<1/2| italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | < 1 / 2. In this region, we have

|k2k0|1.013π2η32π+η¯2/3=1.013η34+6η¯4,subscript𝑘2subscript𝑘01.013𝜋2subscript𝜂32𝜋¯𝜂231.013subscript𝜂346¯𝜂4|k_{2}-k_{0}|\leq\frac{1.01\sqrt{3}\pi}{2}\cdot\frac{\eta_{3}}{2\pi}+\frac{% \bar{\eta}}{\sqrt{2/3}}=\frac{1.01\sqrt{3}\eta_{3}}{4}+\frac{\sqrt{6}\bar{\eta% }}{4},| italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | ≤ divide start_ARG 1.01 square-root start_ARG 3 end_ARG italic_π end_ARG start_ARG 2 end_ARG ⋅ divide start_ARG italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_π end_ARG + divide start_ARG over¯ start_ARG italic_η end_ARG end_ARG start_ARG square-root start_ARG 2 / 3 end_ARG end_ARG = divide start_ARG 1.01 square-root start_ARG 3 end_ARG italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_ARG start_ARG 4 end_ARG + divide start_ARG square-root start_ARG 6 end_ARG over¯ start_ARG italic_η end_ARG end_ARG start_ARG 4 end_ARG , (58)

which determines the accuracy of |EE0|superscript𝐸subscript𝐸0|E^{\ast}-E_{0}|| italic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT |.

The maximal runtime equals to maxnΩnτsubscript𝑛Ω𝑛𝜏\max_{n\in\Omega}n\tauroman_max start_POSTSUBSCRIPT italic_n ∈ roman_Ω end_POSTSUBSCRIPT italic_n italic_τ. Because ΩΩ\Omegaroman_Ω is sampled from [N]delimited-[]𝑁[N][ italic_N ], we have TmaxNτsubscript𝑇𝑁𝜏T_{\max}\leq N\tauitalic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ≤ italic_N italic_τ. The upper bound on Ttotalsubscript𝑇totalT_{\text{total}}italic_T start_POSTSUBSCRIPT total end_POSTSUBSCRIPT can be obtained from Eq. (9) and related parameters in the algorithm. ∎

Remark.

In order to express Tmaxsubscript𝑇T_{\max}italic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT and T𝑡𝑜𝑡𝑎𝑙subscript𝑇𝑡𝑜𝑡𝑎𝑙T_{\text{total}}italic_T start_POSTSUBSCRIPT total end_POSTSUBSCRIPT in terms of ϵitalic-ϵ\epsilonitalic_ϵ and γ𝛾\gammaitalic_γ, we can choose η2,η3subscript𝜂2subscript𝜂3\eta_{2},\eta_{3}italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT to be smaller than η1subscript𝜂1\eta_{1}italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and obtain the following approximation:

N=𝒪(max{η1,η2,η3}ϵ)=𝒪(γϵ1).𝑁𝒪subscript𝜂1subscript𝜂2subscript𝜂3italic-ϵ𝒪𝛾superscriptitalic-ϵ1N=\mathcal{O}\left(\frac{\max\{\eta_{1},\eta_{2},\eta_{3}\}}{\epsilon}\right)=% \mathcal{O}(\gamma\epsilon^{-1}).italic_N = caligraphic_O ( divide start_ARG roman_max { italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT } end_ARG start_ARG italic_ϵ end_ARG ) = caligraphic_O ( italic_γ italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) . (59)

Thus, because Tmax=𝒪(N) and T𝑡𝑜𝑡𝑎𝑙=𝒪(Nlog(N/δ3)/η22)subscript𝑇𝒪𝑁 and subscript𝑇𝑡𝑜𝑡𝑎𝑙𝒪𝑁𝑁subscript𝛿3superscriptsubscript𝜂22T_{\max}=\mathcal{O}(N)\text{ and }T_{\text{total}}=\mathcal{O}(N\log(N/\delta% _{3})/\eta_{2}^{2})italic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = caligraphic_O ( italic_N ) and italic_T start_POSTSUBSCRIPT total end_POSTSUBSCRIPT = caligraphic_O ( italic_N roman_log ( italic_N / italic_δ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) / italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), we have

Tmax=𝒪(γϵ1),T𝑡𝑜𝑡𝑎𝑙=𝒪(γ1ϵ1log(ϵ1)).formulae-sequencesubscript𝑇𝒪𝛾superscriptitalic-ϵ1subscript𝑇𝑡𝑜𝑡𝑎𝑙𝒪superscript𝛾1superscriptitalic-ϵ1superscriptitalic-ϵ1T_{\max}=\mathcal{O}(\gamma\epsilon^{-1}),\quad T_{\text{total}}=\mathcal{O}% \left(\gamma^{-1}\epsilon^{-1}\log(\epsilon^{-1})\right).italic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = caligraphic_O ( italic_γ italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) , italic_T start_POSTSUBSCRIPT total end_POSTSUBSCRIPT = caligraphic_O ( italic_γ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_log ( italic_ϵ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) ) . (60)
Refer to caption
Figure 2: Searching for the optimal grid shift parameter. Set N=1000𝑁1000N=1000italic_N = 1000, |Ω|=50Ω50|\Omega|=50| roman_Ω | = 50, and η=0.344𝜂0.344\eta=0.344italic_η = 0.344. In this numerical test, the noiseless signal is 0.9ei2πf0t+0.1ei2πf1t0.9superscript𝑒𝑖2𝜋subscript𝑓0𝑡0.1superscript𝑒𝑖2𝜋subscript𝑓1𝑡0.9e^{-i2\pi f_{0}t}+0.1e^{-i2\pi f_{1}t}0.9 italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_t end_POSTSUPERSCRIPT + 0.1 italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_t end_POSTSUPERSCRIPT with f0=0.02025subscript𝑓00.02025f_{0}=0.02025italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0.02025 and f1=0.1subscript𝑓10.1f_{1}=0.1italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0.1. Each noisy signal is sampled from NH=100subscript𝑁𝐻100N_{H}=100italic_N start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT = 100 Hadamard tests. The set of grid shift parameters is {0.5+j/20}0.5𝑗20\{-0.5+j/20\}{ - 0.5 + italic_j / 20 }. As shown by Fig a), the grid shift parameter that minimizes the total empirical error is exactly ν=0.25𝜈0.25\nu=0.25italic_ν = 0.25. Fig b) shows the value kνsubscript𝑘𝜈k_{\nu}italic_k start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT as a function of the grid shift parameter ν𝜈\nuitalic_ν. Here k0=20.25subscript𝑘020.25k_{0}=20.25italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 20.25 is linked to the target frequency f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Fig c) shows the comparison between the sampled data and the recovered data.

In Fig. 2 we demonstrate our algorithm with a simple test case where the target frequency is f0=(χ+ω)/Nsubscript𝑓0𝜒𝜔𝑁f_{0}=(\chi+\omega)/Nitalic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ( italic_χ + italic_ω ) / italic_N with N=1000,χ=20formulae-sequence𝑁1000𝜒20N=1000,\chi=20italic_N = 1000 , italic_χ = 20, and ω=0.25𝜔0.25\omega=0.25italic_ω = 0.25. As shown in Fig. 2, when the grid shift parameter is in region [0,0.5]00.5[0,0.5][ 0 , 0.5 ], the integer parts of {kν}k_{\nu}\}italic_k start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT } are all 20, which matches with the χ𝜒\chiitalic_χ. In the same region, the total empirical error is approximately a quadratic function of the grid shift parameter.

For this test, the three noise tolerance parameters are

η1=0.1,η20.1,η3=π10,formulae-sequencesubscript𝜂10.1formulae-sequencesubscript𝜂20.1subscript𝜂3𝜋10\eta_{1}=0.1,\quad\eta_{2}\approx 0.1,\quad\eta_{3}=\frac{\pi}{10},italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0.1 , italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≈ 0.1 , italic_η start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = divide start_ARG italic_π end_ARG start_ARG 10 end_ARG , (61)

therefore η0.344𝜂0.344\eta\approx 0.344italic_η ≈ 0.344. Clearly, the noise tolerance parameters do not satisfy Eq. (40), but the algorithm still outputs the true frequency f=0.02025superscript𝑓0.02025f^{\ast}=0.02025italic_f start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = 0.02025. This numerical experiment demonstrates that the parameters in Lemma 2 and Theorem 1 can potentially be further improved.

It is worth mentioning that if we assume the energies of H𝐻Hitalic_H are nearly on-grid simultaneously, then our algorithm can be directly applied to the multiple phase estimation. Indeed, suppose that there are |Ξ|Ξ|\Xi|| roman_Ξ | eigenvalues with non-negligible amplitudes, then for each compressed sensing task, we can choose the |Ξ|Ξ|\Xi|| roman_Ξ | largest entries of s#superscript𝑠#s^{\#}italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT as a trial solution for the frequency domain signal. Similarly, by trying all the grid shift parameters we can obtain an optimal solution ssuperscript𝑠s^{\ast}italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, and the output is related to the |Ξ|Ξ|\Xi|| roman_Ξ | largest entries of ssuperscript𝑠s^{\ast}italic_s start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT:

E^l=2π(nl+ν)Nτ,l=1,2,,|Ξ|.formulae-sequencesubscript^𝐸𝑙2𝜋subscript𝑛𝑙superscript𝜈𝑁𝜏𝑙12Ξ\hat{E}_{l}=\frac{2\pi(n_{l}+\nu^{\ast})}{N\tau},\quad l=1,2,\cdots,|\Xi|.over^ start_ARG italic_E end_ARG start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT = divide start_ARG 2 italic_π ( italic_n start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT + italic_ν start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_N italic_τ end_ARG , italic_l = 1 , 2 , ⋯ , | roman_Ξ | . (62)

Unfortunately, there is no guarantee that all the frequencies are nearly on-grid. For example, if f0=χ0/N,f1=(χ1+0.5)/Nformulae-sequencesubscript𝑓0subscript𝜒0𝑁subscript𝑓1subscript𝜒10.5𝑁f_{0}=\chi_{0}/N,f_{1}=(\chi_{1}+0.5)/Nitalic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_χ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_N , italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( italic_χ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + 0.5 ) / italic_N, our compressed sensing algorithm is not guaranteed to work no matter how we choose the grid shift parameters. Essentially, the frequencies we try to estimate are continuous parameters, however, we can only find solution in a discrete set. This problem is formally termed as basis mismatch in signal analysis.

4 Comparison to previous works

In this section, we compare our algorithm with previous results. First, we briefly introduce the two types of QCELS algorithms for QPE: the first one [12] (named multi-level QCELS) has a hierarchy structure similar to Robust Phase Estimation and it is used for QPE, the second one [26] (named MM-QCELS) is designed for QEEP.

The outline of the two algorithms can be described as follows. The two algorithms share a hierarchy structure, namely, the algorithm can be divided into several hierarchies. At each hierarchy, the algorithm outputs an estimate of the target energy, and in the next hierarchy, it searches for solutions in a narrower region and obtains a new estimate. At each hierarchy the multilevel QCELS first picks a time scale: {τ0,2τ0,,2J1τ0}subscript𝜏02subscript𝜏0superscript2𝐽1subscript𝜏0\{\tau_{0},2\tau_{0},\cdots,2^{J-1}\tau_{0}\}{ italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , 2 italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , ⋯ , 2 start_POSTSUPERSCRIPT italic_J - 1 end_POSTSUPERSCRIPT italic_τ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }, then performs the Hadamard tests for t=τ,2τ,,N0τ𝑡𝜏2𝜏subscript𝑁0𝜏t=\tau,2\tau,\cdots,N_{0}\tauitalic_t = italic_τ , 2 italic_τ , ⋯ , italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_τ separately. Here N0subscript𝑁0N_{0}italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is a constant. Given the value of the noisy signal y(t)𝑦𝑡y(t)italic_y ( italic_t ) at N0subscript𝑁0N_{0}italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT different times, the algorithm outputs the estimate for E𝐸Eitalic_E by minimizing the following cost function:

L(r,E)=1N0n=1N0|reiEnτy(nτ)|2.𝐿𝑟𝐸1subscript𝑁0superscriptsubscript𝑛1subscript𝑁0superscript𝑟superscript𝑒𝑖𝐸𝑛𝜏𝑦𝑛𝜏2L(r,E)=\frac{1}{N_{0}}\sum_{n=1}^{N_{0}}\left|re^{-iEn\tau}-y(n\tau)\right|^{2}.italic_L ( italic_r , italic_E ) = divide start_ARG 1 end_ARG start_ARG italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | italic_r italic_e start_POSTSUPERSCRIPT - italic_i italic_E italic_n italic_τ end_POSTSUPERSCRIPT - italic_y ( italic_n italic_τ ) | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (63)

This algorithm is proved to be efficient for single-phase estimation but not for multiple-phases estimation. The authors later proposed another relevant algorithm (MM-QCELS). In this method, they adapt the previous algorithm in two steps: the times at which the Hadamard tests are performed become random samples drawn from a probability distribution, and the cost function is changed to

L(r,E)=1|𝒯|t𝒯|k=1KrkeiEkty(t)|2.𝐿𝑟𝐸1𝒯subscript𝑡𝒯superscriptsuperscriptsubscript𝑘1𝐾subscript𝑟𝑘superscript𝑒𝑖subscript𝐸𝑘𝑡𝑦𝑡2L(\vec{r},\vec{E})=\frac{1}{|\mathcal{T}|}\sum_{t\in\mathcal{T}}\left|\sum_{k=% 1}^{K}r_{k}e^{-iE_{k}t}-y(t)\right|^{2}.italic_L ( over→ start_ARG italic_r end_ARG , over→ start_ARG italic_E end_ARG ) = divide start_ARG 1 end_ARG start_ARG | caligraphic_T | end_ARG ∑ start_POSTSUBSCRIPT italic_t ∈ caligraphic_T end_POSTSUBSCRIPT | ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_i italic_E start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_t end_POSTSUPERSCRIPT - italic_y ( italic_t ) | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (64)

Moreover, when applying MM-QCELS to single-phase estimation, the hierarchy structure can be removed, so that the algorithm only has one-level.

We show ythe between multi-level QCELS and Algorithm 1 in Fig. 3. Here we perform a similar numerical test as the one in Sec. V. A of [12] using our algorithm. It turns out for the specific choice of sampling ratio, the scalings of averaged error versus Tmaxsubscript𝑇T_{\max}italic_T start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT are almost the same, while our Ttotalsubscript𝑇totalT_{\text{total}}italic_T start_POSTSUBSCRIPT total end_POSTSUBSCRIPT is slightly larger than theirs.

Refer to caption
Figure 3: A numerical test that demonstrates the Heisenberg-limit scaling of Algorithm 1. In this example we consider the 8-site transverse field Ising model H=jZjZj+14jXj𝐻subscript𝑗subscript𝑍𝑗subscript𝑍𝑗14subscript𝑗subscript𝑋𝑗H=-\sum_{j}Z_{j}Z_{j+1}-4\sum_{j}X_{j}italic_H = - ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_Z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_Z start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT - 4 ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT with periodic boundary condition and normalize it by HπH/4H𝐻𝜋𝐻4norm𝐻H\to\pi H/4\|H\|italic_H → italic_π italic_H / 4 ∥ italic_H ∥. We set the initial state as the ground state of jZjZj+1jXjsubscript𝑗subscript𝑍𝑗subscript𝑍𝑗1subscript𝑗subscript𝑋𝑗-\sum_{j}Z_{j}Z_{j+1}-\sum_{j}X_{j}- ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_Z start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_Z start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, which gives γ0.303𝛾0.303\gamma\approx 0.303italic_γ ≈ 0.303. Each data is obtained by averaging over 50 runs of the algorithm. We use N𝑁Nitalic_N from 61616161 to 511511511511 and set the sample size as 2lnN2𝑁2\lceil\ln N\rceil2 ⌈ roman_ln italic_N ⌉. We set the parameter η𝜂\etaitalic_η as 2π/Nν+2lnN/NH2𝜋subscript𝑁𝜈2𝑁subscript𝑁𝐻2\pi/N_{\nu}+\sqrt{2\lceil\ln N\rceil/N_{H}}2 italic_π / italic_N start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT + square-root start_ARG 2 ⌈ roman_ln italic_N ⌉ / italic_N start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT end_ARG (notice that the initial infidelity γ𝛾\gammaitalic_γ is not included in η𝜂\etaitalic_η). For each signal, we sample over NH=100subscript𝑁𝐻100N_{H}=100italic_N start_POSTSUBSCRIPT italic_H end_POSTSUBSCRIPT = 100 Hadamard tests and choose grid shift parameter from {0.5+n/Nν}0.5𝑛subscript𝑁𝜈\{-0.5+n/N_{\nu}\}{ - 0.5 + italic_n / italic_N start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT } with Nν=100subscript𝑁𝜈100N_{\nu}=100italic_N start_POSTSUBSCRIPT italic_ν end_POSTSUBSCRIPT = 100. The model we use here is the same as the one used in Fig. 4. of [12]. The code can be found in [34].

Essentially, the idea of compressed sensing is similar to that of the one-level MM-QCELS. As mentioned in the discussion section of [26], the one-level MM-QCELS may be able to estimate the dominant phase efficiently. In both methods, one intends to fit the sampled data by an ansatz of the signal. The difference lies in the target function to optimize. In one-level MM-QCELS, the target function is the total empirical error in time domain. In compressed sensing, the target function is the 1-norm of the frequency domain signal.

For single-phase estimation, our compressed-sensing-based algorithm does not have a prominent advantage comparing to multi-level QCELS. However, for specific multiple-phase estimation tasks, our Algorithm 1 can perform better than the one-level MM-QCELS algorithm (see for example Fig. 4). This is because compressed sensing works for on-grid sparse signals in general, and the energy levels of H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT times τ𝜏\tauitalic_τ can happen to be nearly on-grid at the same time.

Refer to caption
Figure 4: Apply Algorithm 1 to a QEEP task. The model we use here is the same as the one used in Fig. 7 of [26]. In the test, we choose |Ω|=4lnNΩ4𝑁|\Omega|=4\lceil\ln N\rceil| roman_Ω | = 4 ⌈ roman_ln italic_N ⌉. Each data point is obtained by averaging over 10 runs of the algorithm. The left figure shows the error scaling of the dominant phase. The right figure shows the total runtime. For this specific model, Algorithm 1 performs better in both accuracy and total runtime. The code can be found in [34].

5 Discussion

In this paper we presented a simple and robust algorithm for quantum phase estimation using compressed sensing. For the single eigenvalue estimation (i.e., QPE), we rigorously established its Heisenberg-limit scaling and numerically demonstrated its comparable performance compared to the other state-of-the-art QPE algorithms. We further did some exploratory work for the case of the multiple eigenvalues estimation (i.e., QEEP) and showed that empirically for certain scenarios our algorithm can be more sample-efficient than previous algorithms. Potentially in practice one can use our algorithm to solve the QEEP as a first attempt, and if the signal recovered by the algorithm fits the data well enough, then one already obtains a good estimation of the eigenvalues; otherwise we can switch to the other algorithms. It would be an excellent future direction to design a more complete compressed sensing based QEEP algorithm that performs well in practice while achieving the Heisenberg limit.

One main contribution of our work is the new connection between QPE and compressed sensing we established. While some aspects of our algorithm are still not ideal, e.g., our algorithm currently requires a higher initial overlap γ𝛾\gammaitalic_γ compared to the other recent works [7, 12, 13], we believe that our work provides a solid first step towards a more robust and resource-efficient compressed sensing based QPE algorithm for early fault-tolerant quantum computers, especially considering the rich literature on compressed sensing and its efficiency and robustness demonstrated in practical applications. Moreover, in practice before running any QPE algorithms, one always needs to prepare the initial state based on the choice of the Hamiltonian first. As long as we only require a constant overlap with the target eigenstate, the cost in state preparation should be approximately in the same order.

Lastly, we summarize a few open questions here:

  • As we mentioned along the numerical demonstration in Fig. 2, the results suggest that our algorithm is more robust than what we were able to prove rigorously. It therefore should be possible to further improve the noise tolerance in Lemma 2, which would also result in reducing the total runtime in our analysis.

  • Instead of sampling discrete time steps, we can also sample the evolution times on a continuous region. Similar to the setup in [26], we can design a probability distribution of evolution times q(t)𝑞𝑡q(t)italic_q ( italic_t ), which then, as long as Eq[|t|]=𝒪(N1)subscript𝐸𝑞delimited-[]𝑡𝒪superscript𝑁1E_{q}[|t|]=\mathcal{O}(N^{-1})italic_E start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT [ | italic_t | ] = caligraphic_O ( italic_N start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ), the Heisenberg limit is still satisfied. Theoretically, continuous Fourier sampling can further reduce the sample complexity and the maximal runtime.

  • Designing Heisenberg-limited QEEP algorithms based on compressed sensing. One solution that we tried is the off-grid compressed sensing [35]. See Appendix D for a detailed discussion. Numerically the accuracy of the off-grid algorithm seems not as good as the other state-of-the-art methods. It would be an excellent direction to further improve this method, or find an alternative solution that solves the basis mismatch problem in QEEP.

  • Finding the optimal grid shift parameter by optimization instead of trying every grid shift parameter in a trial set.

6 Acknowledgement

We thank Tianyu Wang for helpful discussions. C.Y. acknowledges support from the National Natural Science Foundation of China (Grant No. 92165109), National Key Research and Development Program of China (Grant No. 2022YFA1404204), and Shanghai Municipal Science and Technology Major Project (Grant No. 2019SHZDZX01). C.Z. and J.T. acknowledge support from the U.S. National Science Foundation under Grant No. 2116246, the U.S. Department of Energy, Office of Science, National Quantum Information Science Research Centers, and Quantum Systems Accelerator.

References

  • [1] A Yu Kitaev. Quantum measurements and the abelian stabilizer problem. arXiv preprint quant-ph/9511026, 1995.
  • [2] Peter W Shor. Polynomial-time algorithms for prime factorization and discrete logarithms on a quantum computer. SIAM review, 41(2):303–332, 1999.
  • [3] Daniel S Abrams and Seth Lloyd. Quantum algorithm providing exponential speed increase for finding eigenvalues and eigenvectors. Physical Review Letters, 83(24):5162, 1999.
  • [4] Sam McArdle, Suguru Endo, Alán Aspuru-Guzik, Simon C Benjamin, and Xiao Yuan. Quantum computational chemistry. Reviews of Modern Physics, 92(1):015003, 2020.
  • [5] Ruizhe Zhang, Guoming Wang, and Peter Johnson. Computing ground state properties with early fault-tolerant quantum computers. Quantum, 6:761, 2022.
  • [6] Rolando D Somma. Quantum eigenvalue estimation via time series analysis. New Journal of Physics, 21(12):123025, 2019.
  • [7] Lin Lin and Yu Tong. Heisenberg-limited ground-state energy estimation for early fault-tolerant quantum computers. PRX Quantum, 3(1):010318, 2022.
  • [8] Guoming Wang, Daniel Stilck-França, Ruizhe Zhang, Shuchen Zhu, and Peter D Johnson. Quantum algorithm for ground state energy estimation using circuit depth with exponentially improved dependence on precision. arXiv preprint arXiv:2209.06811, 2022.
  • [9] Thomas E O’Brien, Brian Tarasinski, and Barbara M Terhal. Quantum phase estimation of multiple eigenvalues for small-scale (noisy) experiments. New Journal of Physics, 21(2):023022, 2019.
  • [10] Alicja Dutkiewicz, Barbara M. Terhal, and Thomas E O’Brien. Heisenberg-limited quantum phase estimation of multiple eigenvalues with few control qubits. Quantum, 6:830, oct 2022.
  • [11] Michael A Nielsen and Isaac Chuang. Quantum computation and quantum information, 2010.
  • [12] Zhiyan Ding and Lin Lin. Even shorter quantum circuit for phase estimation on early fault-tolerant quantum computers with applications to ground-state energy estimation. PRX Quantum, 4:020331, May 2023.
  • [13] Hongkang Ni, Haoya Li, and Lexing Ying. On low-depth algorithms for quantum phase estimation. arXiv preprint arXiv:2302.02454, 2023.
  • [14] Iulia M Georgescu, Sahel Ashhab, and Franco Nori. Quantum simulation. Reviews of Modern Physics, 86(1):153, 2014.
  • [15] Andrew M Childs and Yuan Su. Nearly optimal lattice simulation by product formulas. Physical review letters, 123(5):050503, 2019.
  • [16] Andrew M Childs, Yuan Su, Minh C Tran, Nathan Wiebe, and Shuchen Zhu. Theory of trotter error with commutator scaling. Physical Review X, 11(1):011020, 2021.
  • [17] Haitham Hassanieh, Piotr Indyk, Dina Katabi, and Eric Price. Nearly optimal sparse fourier transform. In Proceedings of the forty-fourth annual ACM symposium on Theory of computing, pages 563–578, 2012.
  • [18] William T Cochran, James W Cooley, David L Favin, Howard D Helms, Reginald A Kaenel, William W Lang, George C Maling, David E Nelson, Charles M Rader, and Peter D Welch. What is the fast fourier transform? Proceedings of the IEEE, 55(10):1664–1674, 1967.
  • [19] Anna C Gilbert, Shan Muthukrishnan, and Martin Strauss. Improved time bounds for near-optimal sparse fourier representations. In Wavelets XI, volume 5914, pages 398–412. SPIE, 2005.
  • [20] Piotr Indyk, Michael Kapralov, and Eric Price. (nearly) sample-optimal sparse fourier transform. In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms, pages 480–499. SIAM, 2014.
  • [21] Wen**g Liao and Albert Fannjiang. Music for single-snapshot spectral estimation: Stability and super-resolution. Applied and Computational Harmonic Analysis, 40(1):33–67, 2016.
  • [22] Richard Roy and Thomas Kailath. Esprit-estimation of signal parameters via rotational invariance techniques. IEEE Transactions on acoustics, speech, and signal processing, 37(7):984–995, 1989.
  • [23] BL Higgins, DW Berry, SD Bartlett, MW Mitchell, HM Wiseman, and GJ Pryde. Demonstrating heisenberg-limited unambiguous phase estimation without adaptive measurements. New Journal of Physics, 11(7):073023, 2009.
  • [24] Shelby Kimmel, Guang Hao Low, and Theodore J Yoder. Robust calibration of a universal single-qubit gate set via robust phase estimation. Physical Review A, 92(6):062315, 2015.
  • [25] Federico Belliardo and Vittorio Giovannetti. Achieving heisenberg scaling with maximally entangled states: An analytic upper bound for the attainable root-mean-square error. Physical Review A, 102(4), oct 2020.
  • [26] Zhiyan Ding and Lin Lin. Simultaneous estimation of multiple eigenvalues with short-depth quantum circuit on early fault-tolerant quantum computers. arXiv preprint arXiv:2303.05714, 2023.
  • [27] Haoya Li, Hongkang Ni, and Lexing Ying. On low-depth quantum algorithms for robust multiple-phase estimation. arXiv preprint arXiv:2303.08099, 2023.
  • [28] Emmanuel J Candes and Terence Tao. Near-optimal signal recovery from random projections: Universal encoding strategies? IEEE transactions on information theory, 52(12):5406–5425, 2006.
  • [29] Emmanuel J Candes, Justin K Romberg, and Terence Tao. Stable signal recovery from incomplete and inaccurate measurements. Communications on Pure and Applied Mathematics: A Journal Issued by the Courant Institute of Mathematical Sciences, 59(8):1207–1223, 2006.
  • [30] Emmanuel J Candès, Justin Romberg, and Terence Tao. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Transactions on information theory, 52(2):489–509, 2006.
  • [31] David Gross, Yi-Kai Liu, Steven T Flammia, Stephen Becker, and Jens Eisert. Quantum state tomography via compressed sensing. Physical review letters, 105(15):150401, 2010.
  • [32] A. Smith, C. A. Riofrí o, B. E. Anderson, H. Sosa-Martinez, I. H. Deutsch, and P. S. Jessen. Quantum state tomography by continuous measurement and compressed sensing. Physical Review A, 87(3), mar 2013.
  • [33] Amir Kalev, Robert L. Kosut, and Ivan H. Deutsch. Quantum tomography protocols with positivity are compressed sensing protocols. npj Quantum Information, 1(1):15018, Dec 2015.
  • [34] https://github.com/CYI1995/QEEP/tree/main/Paper_QPE.
  • [35] Gongguo Tang, Badri Narayan Bhaskar, Parikshit Shah, and Benjamin Recht. Compressed sensing off the grid. IEEE transactions on information theory, 59(11):7465–7490, 2013.
  • [36] Joel A Tropp et al. An introduction to matrix concentration inequalities. Foundations and Trends® in Machine Learning, 8(1-2):1–230, 2015.
  • [37] Yohann De Castro and Fabrice Gamboa. Exact reconstruction using beurling minimal extrapolation. Journal of Mathematical Analysis and applications, 395(1):336–354, 2012.
  • [38] Clarice Poon, Nicolas Keriven, and Gabriel Peyré. The geometry of off-the-grid compressed sensing. Foundations of Computational Mathematics, 23(1):241–327, 2023.

Appendix A Proof of Lemma 1

The outline of proof largely follows Lemma 6.6 of [28].

Proof.

By the definition of p𝑝pitalic_p, we can verify that pχ=1subscript𝑝𝜒1p_{\chi}=1italic_p start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT = 1. For all kχ𝑘𝜒k\neq\chiitalic_k ≠ italic_χ, we have

pk=1|Ω|nΩei2πk¯n/N,k¯=kχ.formulae-sequencesubscript𝑝𝑘1Ωsubscript𝑛Ωsuperscript𝑒𝑖2𝜋¯𝑘𝑛𝑁¯𝑘𝑘𝜒p_{k}=\frac{1}{|\Omega|}\sum_{n\in\Omega}e^{i2\pi\bar{k}n/N},\quad\bar{k}=k-\chi.italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG | roman_Ω | end_ARG ∑ start_POSTSUBSCRIPT italic_n ∈ roman_Ω end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_i 2 italic_π over¯ start_ARG italic_k end_ARG italic_n / italic_N end_POSTSUPERSCRIPT , over¯ start_ARG italic_k end_ARG = italic_k - italic_χ . (65)

Our target is to estimate P(|pk|>1/2)𝑃subscript𝑝𝑘12P(|p_{k}|>1/2)italic_P ( | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | > 1 / 2 ). Using

P(|pk||Ω|>Nr/4)P(|pk||Ω|>|Ω|/2|Ω|>Nr/2),𝑃subscript𝑝𝑘Ω𝑁𝑟4𝑃subscript𝑝𝑘ΩΩ2Ω𝑁𝑟2\displaystyle P(|p_{k}|\cdot|\Omega|>Nr/4)\geq P(|p_{k}|\cdot|\Omega|>|\Omega|% /2\cap|\Omega|>Nr/2),italic_P ( | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ⋅ | roman_Ω | > italic_N italic_r / 4 ) ≥ italic_P ( | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ⋅ | roman_Ω | > | roman_Ω | / 2 ∩ | roman_Ω | > italic_N italic_r / 2 ) , (66)
P(AB)=P(A)+P(B)P(AB)P(A)+P(B)1,𝑃𝐴𝐵𝑃𝐴𝑃𝐵𝑃𝐴𝐵𝑃𝐴𝑃𝐵1\displaystyle P(A\cap B)=P(A)+P(B)-P(A\cup B)\geq P(A)+P(B)-1,italic_P ( italic_A ∩ italic_B ) = italic_P ( italic_A ) + italic_P ( italic_B ) - italic_P ( italic_A ∪ italic_B ) ≥ italic_P ( italic_A ) + italic_P ( italic_B ) - 1 , (67)
P(|pk||Ω|>Nr/4)P(|pk||Ω|>|Ω|/2)+P(|Ω|>Nr/2)1,𝑃subscript𝑝𝑘Ω𝑁𝑟4𝑃subscript𝑝𝑘ΩΩ2𝑃Ω𝑁𝑟21\displaystyle P(|p_{k}|\cdot|\Omega|>Nr/4)\geq P(|p_{k}|\cdot|\Omega|>|\Omega|% /2)+P(|\Omega|>Nr/2)-1,italic_P ( | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ⋅ | roman_Ω | > italic_N italic_r / 4 ) ≥ italic_P ( | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ⋅ | roman_Ω | > | roman_Ω | / 2 ) + italic_P ( | roman_Ω | > italic_N italic_r / 2 ) - 1 , (68)

we obtain

P(|pk|>1/2)P(|Ω|Nr/2)+P(|pk||Ω|>Nr/4).𝑃subscript𝑝𝑘12𝑃Ω𝑁𝑟2𝑃subscript𝑝𝑘Ω𝑁𝑟4P(|p_{k}|>1/2)\leq P(|\Omega|\leq Nr/2)+P(|p_{k}|\cdot|\Omega|>Nr/4).italic_P ( | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | > 1 / 2 ) ≤ italic_P ( | roman_Ω | ≤ italic_N italic_r / 2 ) + italic_P ( | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ⋅ | roman_Ω | > italic_N italic_r / 4 ) . (69)

Using Bernstein’s inequality, the first term is bounded by

P(|Ω|Nr/2)<exp(328Nr1r).𝑃Ω𝑁𝑟2328𝑁𝑟1𝑟P(|\Omega|\leq Nr/2)<\exp\left(-\frac{3}{28}\frac{Nr}{1-r}\right).italic_P ( | roman_Ω | ≤ italic_N italic_r / 2 ) < roman_exp ( - divide start_ARG 3 end_ARG start_ARG 28 end_ARG divide start_ARG italic_N italic_r end_ARG start_ARG 1 - italic_r end_ARG ) . (70)

To estimate the second term, we associate |pk|subscript𝑝𝑘|p_{k}|| italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | with the summation of a sequence of random matrices:

Mn=Rn(1nr),Rn=(cos(2πnk¯/N)sin(2πnk¯/N)sin(2πnk¯/N)cos(2πnk¯/N)).formulae-sequencesubscript𝑀𝑛subscript𝑅𝑛subscript1𝑛𝑟subscript𝑅𝑛2𝜋𝑛¯𝑘𝑁2𝜋𝑛¯𝑘𝑁2𝜋𝑛¯𝑘𝑁2𝜋𝑛¯𝑘𝑁M_{n}=R_{n}(1_{n}-r),\quad R_{n}=\left(\begin{array}[]{cc}\cos(2\pi n\bar{k}/N% )&\sin(2\pi n\bar{k}/N)\\ -\sin(2\pi n\bar{k}/N)&\cos(2\pi n\bar{k}/N)\end{array}\right).italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( 1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_r ) , italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ( start_ARRAY start_ROW start_CELL roman_cos ( 2 italic_π italic_n over¯ start_ARG italic_k end_ARG / italic_N ) end_CELL start_CELL roman_sin ( 2 italic_π italic_n over¯ start_ARG italic_k end_ARG / italic_N ) end_CELL end_ROW start_ROW start_CELL - roman_sin ( 2 italic_π italic_n over¯ start_ARG italic_k end_ARG / italic_N ) end_CELL start_CELL roman_cos ( 2 italic_π italic_n over¯ start_ARG italic_k end_ARG / italic_N ) end_CELL end_ROW end_ARRAY ) . (71)

where {1n}subscript1𝑛\{1_{n}\}{ 1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } are defined in Eq. (18). Because when k¯0¯𝑘0\bar{k}\neq 0over¯ start_ARG italic_k end_ARG ≠ 0, n[N]Rnr=0subscript𝑛delimited-[]𝑁subscript𝑅𝑛𝑟0\sum_{n\in[N]}R_{n}r=0∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_r = 0, the summation of Mnsubscript𝑀𝑛M_{n}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is

n[N]Mn=n[N]Rn1n=nΩRnsubscript𝑛delimited-[]𝑁subscript𝑀𝑛subscript𝑛delimited-[]𝑁subscript𝑅𝑛subscript1𝑛subscript𝑛Ωsubscript𝑅𝑛\sum_{n\in[N]}M_{n}=\sum_{n\in[N]}R_{n}1_{n}=\sum_{n\in\Omega}R_{n}∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT 1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_n ∈ roman_Ω end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT (72)

which implies

n[N]Mn=nΩRn=|nΩei2πnk¯/N|=|Ω||pk|.normsubscript𝑛delimited-[]𝑁subscript𝑀𝑛normsubscript𝑛Ωsubscript𝑅𝑛subscript𝑛Ωsuperscript𝑒𝑖2𝜋𝑛¯𝑘𝑁Ωsubscript𝑝𝑘\|\sum_{n\in[N]}M_{n}\|=\|\sum_{n\in\Omega}R_{n}\|=|\sum_{n\in\Omega}e^{i2\pi n% \bar{k}/N}|=|\Omega|\cdot|p_{k}|.∥ ∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∥ = ∥ ∑ start_POSTSUBSCRIPT italic_n ∈ roman_Ω end_POSTSUBSCRIPT italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∥ = | ∑ start_POSTSUBSCRIPT italic_n ∈ roman_Ω end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_i 2 italic_π italic_n over¯ start_ARG italic_k end_ARG / italic_N end_POSTSUPERSCRIPT | = | roman_Ω | ⋅ | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | . (73)

The matrix Bernstein’s inequality (Theorem 1.6.2 of [36]) states that

P(n[N]Mn>λ)<4exp(λ2/2v+Lλ/3)𝑃normsubscript𝑛delimited-[]𝑁subscript𝑀𝑛𝜆4superscript𝜆22𝑣𝐿𝜆3P\left(\|\sum_{n\in[N]}M_{n}\|>\lambda\right)<4\exp\left(-\frac{\lambda^{2}/2}% {v+L\lambda/3}\right)italic_P ( ∥ ∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∥ > italic_λ ) < 4 roman_exp ( - divide start_ARG italic_λ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 end_ARG start_ARG italic_v + italic_L italic_λ / 3 end_ARG ) (74)

with

v=n[N]E[MnMn]=Nr(1r),L=maxnMn=1r.formulae-sequence𝑣normsubscript𝑛delimited-[]𝑁𝐸delimited-[]subscript𝑀𝑛subscriptsuperscript𝑀𝑛𝑁𝑟1𝑟𝐿subscript𝑛normsubscript𝑀𝑛1𝑟\displaystyle v=\|\sum_{n\in[N]}E[M_{n}M^{\dagger}_{n}]\|=Nr(1-r),\quad L=\max% _{n}\|M_{n}\|=1-r.italic_v = ∥ ∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_E [ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_M start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] ∥ = italic_N italic_r ( 1 - italic_r ) , italic_L = roman_max start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∥ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∥ = 1 - italic_r . (75)

Choose λ=Nr/4𝜆𝑁𝑟4\lambda=Nr/4italic_λ = italic_N italic_r / 4, we obtain

P(n[N]Mn>Nr/4)<4exp(3104Nr1r).𝑃normsubscript𝑛delimited-[]𝑁subscript𝑀𝑛𝑁𝑟443104𝑁𝑟1𝑟P(\|\sum_{n\in[N]}M_{n}\|>Nr/4)<4\exp\left(-\frac{3}{104}\frac{Nr}{1-r}\right).italic_P ( ∥ ∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∥ > italic_N italic_r / 4 ) < 4 roman_exp ( - divide start_ARG 3 end_ARG start_ARG 104 end_ARG divide start_ARG italic_N italic_r end_ARG start_ARG 1 - italic_r end_ARG ) . (76)

We want the P(|Ω|Nr/2)+P(|pk||Ω|>Nr/4)𝑃Ω𝑁𝑟2𝑃subscript𝑝𝑘Ω𝑁𝑟4P(|\Omega|\leq Nr/2)+P(|p_{k}|\cdot|\Omega|>Nr/4)italic_P ( | roman_Ω | ≤ italic_N italic_r / 2 ) + italic_P ( | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ⋅ | roman_Ω | > italic_N italic_r / 4 ) to be smaller than δ/N𝛿𝑁\delta/Nitalic_δ / italic_N. This condition leads to

exp(3104Nr1r)<δ5N,3104𝑁𝑟1𝑟𝛿5𝑁\displaystyle\exp\left(-\frac{3}{104}\frac{Nr}{1-r}\right)<\frac{\delta}{5N},roman_exp ( - divide start_ARG 3 end_ARG start_ARG 104 end_ARG divide start_ARG italic_N italic_r end_ARG start_ARG 1 - italic_r end_ARG ) < divide start_ARG italic_δ end_ARG start_ARG 5 italic_N end_ARG , (77)
r>1043Nln(5Nδ)/[1+1043Nln(5Nδ)].𝑟1043𝑁5𝑁𝛿delimited-[]11043𝑁5𝑁𝛿\displaystyle r>\frac{104}{3N}\ln\left(\frac{5N}{\delta}\right)/\left[1+\frac{% 104}{3N}\ln\left(\frac{5N}{\delta}\right)\right].italic_r > divide start_ARG 104 end_ARG start_ARG 3 italic_N end_ARG roman_ln ( divide start_ARG 5 italic_N end_ARG start_ARG italic_δ end_ARG ) / [ 1 + divide start_ARG 104 end_ARG start_ARG 3 italic_N end_ARG roman_ln ( divide start_ARG 5 italic_N end_ARG start_ARG italic_δ end_ARG ) ] . (78)

Therefore, if we choose

r>35Nln(5Nδ),𝑟35𝑁5𝑁𝛿r>\frac{35}{N}\ln\left(\frac{5N}{\delta}\right),italic_r > divide start_ARG 35 end_ARG start_ARG italic_N end_ARG roman_ln ( divide start_ARG 5 italic_N end_ARG start_ARG italic_δ end_ARG ) , (79)

then p𝑝pitalic_p can serve as a dual certificate with probability at least 1δ1𝛿1-\delta1 - italic_δ.

Appendix B Proof of Lemma 2

Proof.

In this problem, we have

yΩ=yΩ0+zΩ,y0=Fx,xk=δk,χ.formulae-sequencesubscript𝑦Ωsubscriptsuperscript𝑦0Ωsubscript𝑧Ωformulae-sequencesuperscript𝑦0superscript𝐹𝑥subscript𝑥𝑘subscript𝛿𝑘𝜒y_{\Omega}=y^{0}_{\Omega}+z_{\Omega},\quad y^{0}=F^{\dagger}x,\quad x_{k}=% \delta_{k,\chi}.italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT = italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT + italic_z start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT , italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = italic_F start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_x , italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_δ start_POSTSUBSCRIPT italic_k , italic_χ end_POSTSUBSCRIPT . (80)

According to Lemma 1, with high probability the dual certificate p=FΩyΩ0/|Ω|𝑝subscriptsuperscript𝐹Ωsubscriptsuperscript𝑦0ΩΩp=F^{\dagger}_{\Omega}y^{0}_{\Omega}/|\Omega|italic_p = italic_F start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT / | roman_Ω | satisfies

pχ=1;|pk|12,kχ.formulae-sequencesubscript𝑝𝜒1formulae-sequencesubscript𝑝𝑘12for-all𝑘𝜒p_{\chi}=1;\quad|p_{k}|\leq\frac{1}{2},\ \forall k\neq\chi.italic_p start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT = 1 ; | italic_p start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ≤ divide start_ARG 1 end_ARG start_ARG 2 end_ARG , ∀ italic_k ≠ italic_χ . (81)

Calculate the inner product to obtain

s#,psuperscript𝑠#𝑝\displaystyle\langle s^{\#},p\rangle⟨ italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT , italic_p ⟩ =1|Ω|s#,FΩyΩ0absent1Ωsuperscript𝑠#subscriptsuperscript𝐹Ωsubscriptsuperscript𝑦0Ω\displaystyle=\frac{1}{|\Omega|}\langle s^{\#},F^{\dagger}_{\Omega}y^{0}_{% \Omega}\rangle= divide start_ARG 1 end_ARG start_ARG | roman_Ω | end_ARG ⟨ italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT , italic_F start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ⟩ (82)
=1|Ω|FΩs#,yΩ0absent1Ωsubscript𝐹Ωsuperscript𝑠#subscriptsuperscript𝑦0Ω\displaystyle=\frac{1}{|\Omega|}\langle F_{\Omega}s^{\#},y^{0}_{\Omega}\rangle= divide start_ARG 1 end_ARG start_ARG | roman_Ω | end_ARG ⟨ italic_F start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT , italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ⟩
=1|Ω|yΩ+z~Ω,yΩ0absent1Ωsubscript𝑦Ωsubscript~𝑧Ωsubscriptsuperscript𝑦0Ω\displaystyle=\frac{1}{|\Omega|}\langle y_{\Omega}+\tilde{z}_{\Omega},y^{0}_{% \Omega}\rangle= divide start_ARG 1 end_ARG start_ARG | roman_Ω | end_ARG ⟨ italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT + over~ start_ARG italic_z end_ARG start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT , italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ⟩
=1|Ω|yΩ0+zΩ+z~Ω,yΩ0absent1Ωsubscriptsuperscript𝑦0Ωsubscript𝑧Ωsubscript~𝑧Ωsubscriptsuperscript𝑦0Ω\displaystyle=\frac{1}{|\Omega|}\langle y^{0}_{\Omega}+z_{\Omega}+\tilde{z}_{% \Omega},y^{0}_{\Omega}\rangle= divide start_ARG 1 end_ARG start_ARG | roman_Ω | end_ARG ⟨ italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT + italic_z start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT + over~ start_ARG italic_z end_ARG start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT , italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ⟩
=1+1|Ω|zΩ+z~Ω,yΩ0.absent11Ωsubscript𝑧Ωsubscript~𝑧Ωsubscriptsuperscript𝑦0Ω\displaystyle=1+\frac{1}{|\Omega|}\langle z_{\Omega}+\tilde{z}_{\Omega},y^{0}_% {\Omega}\rangle.= 1 + divide start_ARG 1 end_ARG start_ARG | roman_Ω | end_ARG ⟨ italic_z start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT + over~ start_ARG italic_z end_ARG start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT , italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ⟩ .

Here z~Ωsubscript~𝑧Ω\tilde{z}_{\Omega}over~ start_ARG italic_z end_ARG start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT comes from the fact that FΩs#yΩsubscript𝐹Ωsuperscript𝑠#subscript𝑦ΩF_{\Omega}s^{\#}-y_{\Omega}italic_F start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT - italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT can have a small deviation that satisfies z~Ω2|Ω|ηsubscriptnormsubscript~𝑧Ω2Ω𝜂\|\tilde{z}_{\Omega}\|_{2}\leq\sqrt{|\Omega|}\eta∥ over~ start_ARG italic_z end_ARG start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ square-root start_ARG | roman_Ω | end_ARG italic_η. Furthermore,

1|Ω||zΩ,yΩ0|1|Ω|nΩ|zn|1|Ω|nΩ|zn|2η.1Ωsubscript𝑧Ωsubscriptsuperscript𝑦0Ω1Ωsubscript𝑛Ωsubscript𝑧𝑛1Ωsubscript𝑛Ωsuperscriptsubscript𝑧𝑛2𝜂\frac{1}{|\Omega|}|\langle z_{\Omega},y^{0}_{\Omega}\rangle|\leq\frac{1}{|% \Omega|}\sum_{n\in\Omega}|z_{n}|\leq\frac{1}{\sqrt{|\Omega|}}\sqrt{\sum_{n\in% \Omega}|z_{n}|^{2}}\leq\eta.divide start_ARG 1 end_ARG start_ARG | roman_Ω | end_ARG | ⟨ italic_z start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT , italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ⟩ | ≤ divide start_ARG 1 end_ARG start_ARG | roman_Ω | end_ARG ∑ start_POSTSUBSCRIPT italic_n ∈ roman_Ω end_POSTSUBSCRIPT | italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ≤ divide start_ARG 1 end_ARG start_ARG square-root start_ARG | roman_Ω | end_ARG end_ARG square-root start_ARG ∑ start_POSTSUBSCRIPT italic_n ∈ roman_Ω end_POSTSUBSCRIPT | italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ≤ italic_η . (83)

Thus,

12η|s#,p|1+2η.12𝜂superscript𝑠#𝑝12𝜂1-2\eta\leq|\langle s^{\#},p\rangle|\leq 1+2\eta.1 - 2 italic_η ≤ | ⟨ italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT , italic_p ⟩ | ≤ 1 + 2 italic_η . (84)

On the other hand, x𝑥xitalic_x is a feasible solution, hence s#1x1=1subscriptnormsuperscript𝑠#1subscriptnorm𝑥11\|s^{\#}\|_{1}\leq\|x\|_{1}=1∥ italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ ∥ italic_x ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1, and

|s#,p||sχ#|+12kχ|sk#|112kχ|sk#|.superscript𝑠#𝑝subscriptsuperscript𝑠#𝜒12subscript𝑘𝜒subscriptsuperscript𝑠#𝑘112subscript𝑘𝜒subscriptsuperscript𝑠#𝑘|\langle s^{\#},p\rangle|\leq|s^{\#}_{\chi}|+\frac{1}{2}\sum_{k\neq\chi}|s^{\#% }_{k}|\leq 1-\frac{1}{2}\sum_{k\neq\chi}|s^{\#}_{k}|.| ⟨ italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT , italic_p ⟩ | ≤ | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT | + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_k ≠ italic_χ end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ≤ 1 - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_k ≠ italic_χ end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | . (85)

Combining them together, we obtain

kχ|sk#|4η,|sχ#|12η12kχ|sk#|14η.formulae-sequencesubscript𝑘𝜒subscriptsuperscript𝑠#𝑘4𝜂subscriptsuperscript𝑠#𝜒12𝜂12subscript𝑘𝜒subscriptsuperscript𝑠#𝑘14𝜂\sum_{k\neq\chi}|s^{\#}_{k}|\leq 4\eta,\quad|s^{\#}_{\chi}|\geq 1-2\eta-\frac{% 1}{2}\sum_{k\neq\chi}|s^{\#}_{k}|\geq 1-4\eta.∑ start_POSTSUBSCRIPT italic_k ≠ italic_χ end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ≤ 4 italic_η , | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_χ end_POSTSUBSCRIPT | ≥ 1 - 2 italic_η - divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_k ≠ italic_χ end_POSTSUBSCRIPT | italic_s start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ≥ 1 - 4 italic_η . (86)

Appendix C Concentration of Xnsubscript𝑋𝑛X_{n}italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT

Here we define

|k|mod N=minz|kzN|,subscript𝑘mod 𝑁subscript𝑧𝑘𝑧𝑁|k|_{\text{mod }N}=\min_{z\in\mathbb{Z}}|k-zN|,| italic_k | start_POSTSUBSCRIPT mod italic_N end_POSTSUBSCRIPT = roman_min start_POSTSUBSCRIPT italic_z ∈ blackboard_Z end_POSTSUBSCRIPT | italic_k - italic_z italic_N | , (87)

then we can prove

Lemma 3.

For N100𝑁100N\geq 100italic_N ≥ 100, consider the summation of random variables

Xn=sin2(nπk/N)1nsubscript𝑋𝑛superscript2𝑛𝜋𝑘𝑁subscript1𝑛X_{n}=\sin^{2}(n\pi k/N)1_{n}italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = roman_sin start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_n italic_π italic_k / italic_N ) 1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT (88)

with k𝑘kitalic_k being a parameter in region (N,N)𝑁𝑁(-N,N)( - italic_N , italic_N ) and 1nsubscript1𝑛1_{n}1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is defined in Eq. (18). For each |k|mod N>1/2subscript𝑘mod 𝑁12|k|_{\text{mod }N}>1/2| italic_k | start_POSTSUBSCRIPT mod italic_N end_POSTSUBSCRIPT > 1 / 2, with probability at least 1exp(Nr/36)1𝑁𝑟361-\exp(-Nr/36)1 - roman_exp ( - italic_N italic_r / 36 ), we have

n=1NXn>5Nr24.superscriptsubscript𝑛1𝑁subscript𝑋𝑛5𝑁𝑟24\sum_{n=1}^{N}X_{n}>\frac{5Nr}{24}.∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT > divide start_ARG 5 italic_N italic_r end_ARG start_ARG 24 end_ARG . (89)

With probability at least 12exp(Nr/24)12𝑁𝑟241-2\exp(-Nr/24)1 - 2 roman_exp ( - italic_N italic_r / 24 ), we have for all |k|mod N1/2subscript𝑘mod 𝑁12|k|_{\text{mod }N}\leq 1/2| italic_k | start_POSTSUBSCRIPT mod italic_N end_POSTSUBSCRIPT ≤ 1 / 2,

n=1NXn[2k23Nr,1.012π2k22Nr].superscriptsubscript𝑛1𝑁subscript𝑋𝑛2superscript𝑘23𝑁𝑟superscript1.012superscript𝜋2superscript𝑘22𝑁𝑟\sum_{n=1}^{N}X_{n}\in\left[\frac{2k^{2}}{3}Nr,\frac{1.01^{2}\pi^{2}k^{2}}{2}% Nr\right].∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ [ divide start_ARG 2 italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 3 end_ARG italic_N italic_r , divide start_ARG 1.01 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG italic_N italic_r ] . (90)
Proof.

Start with the expectation value of Sn=n[N]Xnsubscript𝑆𝑛subscript𝑛delimited-[]𝑁subscript𝑋𝑛S_{n}=\sum_{n\in[N]}X_{n}italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT:

E[Sn]𝐸delimited-[]subscript𝑆𝑛\displaystyle E[S_{n}]italic_E [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] =rn=1Nsin2(nπk/N)absent𝑟superscriptsubscript𝑛1𝑁superscript2𝑛𝜋𝑘𝑁\displaystyle=r\sum_{n=1}^{N}\sin^{2}(n\pi k/N)= italic_r ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT roman_sin start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_n italic_π italic_k / italic_N ) (91)
=rn=1N2ei2nπk/Nei2nπk/N4absent𝑟superscriptsubscript𝑛1𝑁2superscript𝑒𝑖2𝑛𝜋𝑘𝑁superscript𝑒𝑖2𝑛𝜋𝑘𝑁4\displaystyle=r\sum_{n=1}^{N}\frac{2-e^{i2n\pi k/N}-e^{-i2n\pi k/N}}{4}= italic_r ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT divide start_ARG 2 - italic_e start_POSTSUPERSCRIPT italic_i 2 italic_n italic_π italic_k / italic_N end_POSTSUPERSCRIPT - italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_n italic_π italic_k / italic_N end_POSTSUPERSCRIPT end_ARG start_ARG 4 end_ARG (92)
=Nr2r4n=1N(ei2nπk/N+ei2nπk/N)absent𝑁𝑟2𝑟4superscriptsubscript𝑛1𝑁superscript𝑒𝑖2𝑛𝜋𝑘𝑁superscript𝑒𝑖2𝑛𝜋𝑘𝑁\displaystyle=\frac{Nr}{2}-\frac{r}{4}\sum_{n=1}^{N}(e^{i2n\pi k/N}+e^{-i2n\pi k% /N})= divide start_ARG italic_N italic_r end_ARG start_ARG 2 end_ARG - divide start_ARG italic_r end_ARG start_ARG 4 end_ARG ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ( italic_e start_POSTSUPERSCRIPT italic_i 2 italic_n italic_π italic_k / italic_N end_POSTSUPERSCRIPT + italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_n italic_π italic_k / italic_N end_POSTSUPERSCRIPT ) (93)
=Nr2r2cos(πk(N+1)N)sin(πk)sin(πk/N).absent𝑁𝑟2𝑟2𝜋𝑘𝑁1𝑁𝜋𝑘𝜋𝑘𝑁\displaystyle=\frac{Nr}{2}-\frac{r}{2}\cos\left(\frac{\pi k(N+1)}{N}\right)% \frac{\sin(\pi k)}{\sin(\pi k/N)}.= divide start_ARG italic_N italic_r end_ARG start_ARG 2 end_ARG - divide start_ARG italic_r end_ARG start_ARG 2 end_ARG roman_cos ( divide start_ARG italic_π italic_k ( italic_N + 1 ) end_ARG start_ARG italic_N end_ARG ) divide start_ARG roman_sin ( italic_π italic_k ) end_ARG start_ARG roman_sin ( italic_π italic_k / italic_N ) end_ARG . (94)

We verify that when |k|mod N(1/2,1]subscript𝑘mod 𝑁121|k|_{\text{mod }N}\in(1/2,1]| italic_k | start_POSTSUBSCRIPT mod italic_N end_POSTSUBSCRIPT ∈ ( 1 / 2 , 1 ], cos(πk(N+1)/N)<0,sin(πk)/sin(πk/N)0formulae-sequence𝜋𝑘𝑁1𝑁0𝜋𝑘𝜋𝑘𝑁0\cos(\pi k(N+1)/N)<0,\sin(\pi k)/\sin(\pi k/N)\geq 0roman_cos ( italic_π italic_k ( italic_N + 1 ) / italic_N ) < 0 , roman_sin ( italic_π italic_k ) / roman_sin ( italic_π italic_k / italic_N ) ≥ 0, thus E[Sn]Nr/2𝐸delimited-[]subscript𝑆𝑛𝑁𝑟2E[S_{n}]\geq Nr/2italic_E [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] ≥ italic_N italic_r / 2; when |k|mod N>1subscript𝑘mod 𝑁1|k|_{\text{mod }N}>1| italic_k | start_POSTSUBSCRIPT mod italic_N end_POSTSUBSCRIPT > 1, we use

E[Sn]>Nr2Nr2|sin(πk)Nsin(πk/N)|.𝐸delimited-[]subscript𝑆𝑛𝑁𝑟2𝑁𝑟2𝜋𝑘𝑁𝜋𝑘𝑁{E[S_{n}]>\frac{Nr}{2}-\frac{Nr}{2}\left|\frac{\sin(\pi k)}{N\sin(\pi k/N)}% \right|.}italic_E [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] > divide start_ARG italic_N italic_r end_ARG start_ARG 2 end_ARG - divide start_ARG italic_N italic_r end_ARG start_ARG 2 end_ARG | divide start_ARG roman_sin ( italic_π italic_k ) end_ARG start_ARG italic_N roman_sin ( italic_π italic_k / italic_N ) end_ARG | . (95)

Notice that sin(πk)/(Nsin(πk/N))𝜋𝑘𝑁𝜋𝑘𝑁\sin(\pi k)/(N\sin(\pi k/N))roman_sin ( italic_π italic_k ) / ( italic_N roman_sin ( italic_π italic_k / italic_N ) ) is the Dirichlet kernel, which is close to a delta function. With N>100𝑁100N>100italic_N > 100, one can verify that for |k|mod N>1subscript𝑘mod 𝑁1|k|_{\text{mod }N}>1| italic_k | start_POSTSUBSCRIPT mod italic_N end_POSTSUBSCRIPT > 1, because the second peak of the Dirichlet kernel is in region (1,2)12(1,2)( 1 , 2 ), we have

|sin(πk)Nsin(πk/N)|<1Nsin(2π/N)<16,𝜋𝑘𝑁𝜋𝑘𝑁1𝑁2𝜋𝑁16\displaystyle\left|\frac{\sin(\pi k)}{N\sin(\pi k/N)}\right|<\frac{1}{N\sin(2% \pi/N)}<\frac{1}{6},| divide start_ARG roman_sin ( italic_π italic_k ) end_ARG start_ARG italic_N roman_sin ( italic_π italic_k / italic_N ) end_ARG | < divide start_ARG 1 end_ARG start_ARG italic_N roman_sin ( 2 italic_π / italic_N ) end_ARG < divide start_ARG 1 end_ARG start_ARG 6 end_ARG , (96)
E[Sn]>512Nr.𝐸delimited-[]subscript𝑆𝑛512𝑁𝑟\displaystyle E[S_{n}]>\frac{5}{12}Nr.italic_E [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] > divide start_ARG 5 end_ARG start_ARG 12 end_ARG italic_N italic_r . (97)

Therefore, when |k|mod N>1/2subscript𝑘mod 𝑁12|k|_{\text{mod }N}>1/2| italic_k | start_POSTSUBSCRIPT mod italic_N end_POSTSUBSCRIPT > 1 / 2, we have E[Sn]>5Nr/12𝐸delimited-[]subscript𝑆𝑛5𝑁𝑟12E[S_{n}]>5Nr/12italic_E [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] > 5 italic_N italic_r / 12, and Xn[0,1]subscript𝑋𝑛01X_{n}\in[0,1]italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ [ 0 , 1 ] for all n𝑛nitalic_n. By Bernstein’s inequality,

P[SnE[Sn]<E[Sn]/2]<exp(E[Sn]2/8rsin4(πnk/N)+E[Sn]/6).𝑃delimited-[]subscript𝑆𝑛𝐸delimited-[]subscript𝑆𝑛𝐸delimited-[]subscript𝑆𝑛2𝐸superscriptdelimited-[]subscript𝑆𝑛28𝑟superscript4𝜋𝑛𝑘𝑁𝐸delimited-[]subscript𝑆𝑛6P\left[S_{n}-E[S_{n}]<-E[S_{n}]/2\right]<\exp\left(-\frac{E[S_{n}]^{2}/8}{r% \sum\sin^{4}(\pi nk/N)+E[S_{n}]/6}\right).italic_P [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_E [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] < - italic_E [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] / 2 ] < roman_exp ( - divide start_ARG italic_E [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8 end_ARG start_ARG italic_r ∑ roman_sin start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ( italic_π italic_n italic_k / italic_N ) + italic_E [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] / 6 end_ARG ) . (98)

After calculation, sin4(πnk/N)superscript4𝜋𝑛𝑘𝑁\sum\sin^{4}(\pi nk/N)∑ roman_sin start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ( italic_π italic_n italic_k / italic_N ) is close to 3N/83𝑁83N/83 italic_N / 8. Thus, with probability at least

1exp(Nr/36),1𝑁𝑟361-\exp(-Nr/36),1 - roman_exp ( - italic_N italic_r / 36 ) , (99)

we have

n[N]Xn=Sn>12E[Sn]>5Nr24.subscript𝑛delimited-[]𝑁subscript𝑋𝑛subscript𝑆𝑛12𝐸delimited-[]subscript𝑆𝑛5𝑁𝑟24\sum_{n\in[N]}X_{n}=S_{n}>\frac{1}{2}E[S_{n}]>\frac{5Nr}{24}.∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT > divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_E [ italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] > divide start_ARG 5 italic_N italic_r end_ARG start_ARG 24 end_ARG . (100)

This is the first conclusion. In the other case where |k|mod N1/2subscript𝑘mod 𝑁12|k|_{\text{mod }N}\leq 1/2| italic_k | start_POSTSUBSCRIPT mod italic_N end_POSTSUBSCRIPT ≤ 1 / 2, the sin\sinroman_sin function satisfies

1|sin(πk)πk|2π,1𝜋𝑘𝜋𝑘2𝜋1\geq\left|\frac{\sin(\pi k)}{\pi k}\right|\geq\frac{2}{\pi},1 ≥ | divide start_ARG roman_sin ( italic_π italic_k ) end_ARG start_ARG italic_π italic_k end_ARG | ≥ divide start_ARG 2 end_ARG start_ARG italic_π end_ARG , (101)

therefore we have

4k2N2n[N]n21nn[N]Xnπ2k2N2n[N]n21n4superscript𝑘2superscript𝑁2subscript𝑛delimited-[]𝑁superscript𝑛2subscript1𝑛subscript𝑛delimited-[]𝑁subscript𝑋𝑛superscript𝜋2superscript𝑘2superscript𝑁2subscript𝑛delimited-[]𝑁superscript𝑛2subscript1𝑛\frac{4k^{2}}{N^{2}}\sum_{n\in[N]}n^{2}1_{n}\leq\sum_{n\in[N]}X_{n}\leq\frac{% \pi^{2}k^{2}}{N^{2}}\sum_{n\in[N]}n^{2}1_{n}divide start_ARG 4 italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ divide start_ARG italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_n ∈ [ italic_N ] end_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT (102)

Similarly, using Bernstein’s inequality, we obtain

P[|n21nrn2|>r2n2]2exp((rn2)2/8rn4+N2rn2/6).𝑃delimited-[]superscript𝑛2subscript1𝑛𝑟superscript𝑛2𝑟2superscript𝑛22superscript𝑟superscript𝑛228𝑟superscript𝑛4superscript𝑁2𝑟superscript𝑛26P\left[\left|\sum n^{2}1_{n}-r\sum n^{2}\right|>\frac{r}{2}\sum n^{2}\right]% \leq 2\exp\left(-\frac{(r\sum n^{2})^{2}/8}{r\sum n^{4}+N^{2}r\sum n^{2}/6}% \right).italic_P [ | ∑ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 1 start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_r ∑ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | > divide start_ARG italic_r end_ARG start_ARG 2 end_ARG ∑ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ≤ 2 roman_exp ( - divide start_ARG ( italic_r ∑ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8 end_ARG start_ARG italic_r ∑ italic_n start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT + italic_N start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_r ∑ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 6 end_ARG ) . (103)

Because n2=(N+1)(2N+1)/6N3/3,nn4N5/5formulae-sequencesuperscript𝑛2𝑁12𝑁16superscript𝑁33subscript𝑛superscript𝑛4superscript𝑁55\sum n^{2}=(N+1)(2N+1)/6\approx N^{3}/3,\sum_{n}n^{4}\approx N^{5}/5∑ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ( italic_N + 1 ) ( 2 italic_N + 1 ) / 6 ≈ italic_N start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT / 3 , ∑ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ≈ italic_N start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT / 5, with probability at least 12exp(Nr/24)12𝑁𝑟241-2\exp(-Nr/24)1 - 2 roman_exp ( - italic_N italic_r / 24 ), we have

Nr2E[n2]nΩn23Nr2E[n2],𝑁𝑟2𝐸delimited-[]superscript𝑛2subscript𝑛Ωsuperscript𝑛23𝑁𝑟2𝐸delimited-[]superscript𝑛2\displaystyle\frac{Nr}{2}E[n^{2}]\leq\sum_{n\in\Omega}n^{2}\leq\frac{3Nr}{2}E[% n^{2}],divide start_ARG italic_N italic_r end_ARG start_ARG 2 end_ARG italic_E [ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ≤ ∑ start_POSTSUBSCRIPT italic_n ∈ roman_Ω end_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ divide start_ARG 3 italic_N italic_r end_ARG start_ARG 2 end_ARG italic_E [ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] , (104)
2k2rNE[n2]nΩXn3π2k2r2NE[n2].2superscript𝑘2𝑟𝑁𝐸delimited-[]superscript𝑛2subscript𝑛Ωsubscript𝑋𝑛3superscript𝜋2superscript𝑘2𝑟2𝑁𝐸delimited-[]superscript𝑛2\displaystyle\frac{2k^{2}r}{N}E[n^{2}]\leq\sum_{n\in\Omega}X_{n}\leq\frac{3\pi% ^{2}k^{2}r}{2N}E[n^{2}].divide start_ARG 2 italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG italic_N end_ARG italic_E [ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ≤ ∑ start_POSTSUBSCRIPT italic_n ∈ roman_Ω end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ divide start_ARG 3 italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG 2 italic_N end_ARG italic_E [ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] . (105)

After simplification, the second conclusion is obtained. ∎

Appendix D Off-grid compressed sensing

In this section we discuss an off-grid compressed sensing algorithm for the multiple eigenvalue estimation. Although the algorithm we will be describing works well numerically, the rigorous proof of its Heisenberg limit is hard, the full investigation of which will be left for future work.

In regular compressed sensing, the optimization algorithm Eq. (21) can be rewritten in the following concise form:

minFr1,s.t.rΩ=yΩ,\min\|Fr\|_{1},\quad s.t.\quad r_{\Omega}=y_{\Omega},roman_min ∥ italic_F italic_r ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s . italic_t . italic_r start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT , (106)

where r𝑟ritalic_r is the time-domain ansatz of the signal instead of the frequency-domain ansatz. The idea of the off-grid compressed sensing is similar, except we need another vector norm for the continuous values of frequencies [35]:

minr𝒜,s.t.rΩ=yΩ,\displaystyle\min\|r\|_{\mathcal{A}},\quad s.t.\quad r_{\Omega}=y_{\Omega},roman_min ∥ italic_r ∥ start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT , italic_s . italic_t . italic_r start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT , (107)
r𝒜=infcf0,ϕf[0,2π),f[0,1){fcf:rn=fcfei(2πfn+ϕf)}.subscriptnorm𝑟𝒜subscriptinfimumformulae-sequencesubscript𝑐𝑓0formulae-sequencesubscriptitalic-ϕ𝑓02𝜋𝑓01conditional-setsubscript𝑓subscript𝑐𝑓subscript𝑟𝑛subscript𝑓subscript𝑐𝑓superscript𝑒𝑖2𝜋𝑓𝑛subscriptitalic-ϕ𝑓\displaystyle\|r\|_{\mathcal{A}}=\inf_{c_{f}\geq 0,\phi_{f}\in[0,2\pi),f\in[0,% 1)}\left\{\sum_{f}c_{f}:r_{n}=\sum_{f}c_{f}e^{i(2\pi fn+\phi_{f})}\right\}.∥ italic_r ∥ start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT = roman_inf start_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ≥ 0 , italic_ϕ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∈ [ 0 , 2 italic_π ) , italic_f ∈ [ 0 , 1 ) end_POSTSUBSCRIPT { ∑ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT : italic_r start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_i ( 2 italic_π italic_f italic_n + italic_ϕ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT } . (108)

The new vector norm 𝒜\|\cdot\|_{\mathcal{A}}∥ ⋅ ∥ start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT is called the atomic norm, which can be written as the solution of an SDP (see Proposition 2.1 of [35]):

r𝒜=infu,v{12Ntr(Toep(u))+v2:[Toep(u)rrv]0},subscriptnorm𝑟𝒜subscriptinfimum𝑢𝑣conditional-set12𝑁trToep𝑢𝑣2succeeds-or-equalsmatrixToep𝑢𝑟superscript𝑟𝑣0\|r\|_{\mathcal{A}}=\inf_{u,v}\left\{\frac{1}{2N}\text{tr}(\text{Toep}(u))+% \frac{v}{2}:\begin{bmatrix}\text{Toep}(u)&r\\ r^{\dagger}&v\end{bmatrix}\succeq 0\right\},∥ italic_r ∥ start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT = roman_inf start_POSTSUBSCRIPT italic_u , italic_v end_POSTSUBSCRIPT { divide start_ARG 1 end_ARG start_ARG 2 italic_N end_ARG tr ( Toep ( italic_u ) ) + divide start_ARG italic_v end_ARG start_ARG 2 end_ARG : [ start_ARG start_ROW start_CELL Toep ( italic_u ) end_CELL start_CELL italic_r end_CELL end_ROW start_ROW start_CELL italic_r start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT end_CELL start_CELL italic_v end_CELL end_ROW end_ARG ] ⪰ 0 } , (109)

where u𝑢uitalic_u is an (2N+1)2𝑁1(2N+1)( 2 italic_N + 1 )-dimensional vector (in the off-grid case, instead of choosing n𝑛nitalic_n from [N]delimited-[]𝑁[N][ italic_N ], we allow it to take values from N𝑁-N- italic_N to N𝑁Nitalic_N), v𝑣vitalic_v is a real number, and Toep(u)Toep𝑢\text{Toep}(u)Toep ( italic_u ) represents a Toeplitz matrix whose first column is u𝑢uitalic_u:

(Toep(u))mn=umn,un=un.formulae-sequencesubscriptToep𝑢𝑚𝑛subscript𝑢𝑚𝑛subscript𝑢𝑛superscriptsubscript𝑢𝑛(\text{Toep}(u))_{mn}=u_{m-n},\quad u_{-n}=u_{n}^{\dagger}.( Toep ( italic_u ) ) start_POSTSUBSCRIPT italic_m italic_n end_POSTSUBSCRIPT = italic_u start_POSTSUBSCRIPT italic_m - italic_n end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT - italic_n end_POSTSUBSCRIPT = italic_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT . (110)

Thus, the full optimization task for the off-grid compressed sensing is

minu,r,v12Ntr(Toep(u))+v2,s.t.[Toep(u)rrv]0,rΩ=yΩ.\min_{u,r,v}\frac{1}{2N}\text{tr}(\text{Toep}(u))+\frac{v}{2},\quad s.t.\quad% \begin{bmatrix}\text{Toep}(u)&r\\ r^{\dagger}&v\end{bmatrix}\succeq 0,\quad r_{\Omega}=y_{\Omega}.roman_min start_POSTSUBSCRIPT italic_u , italic_r , italic_v end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 italic_N end_ARG tr ( Toep ( italic_u ) ) + divide start_ARG italic_v end_ARG start_ARG 2 end_ARG , italic_s . italic_t . [ start_ARG start_ROW start_CELL Toep ( italic_u ) end_CELL start_CELL italic_r end_CELL end_ROW start_ROW start_CELL italic_r start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT end_CELL start_CELL italic_v end_CELL end_ROW end_ARG ] ⪰ 0 , italic_r start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT . (111)

Suppose the error tolerance for each signal is η𝜂\etaitalic_η, then the robust version that allows small noise is given as

minu,r,v12rΩyΩ22+λ(12Ntr(Toep(u))+v2),s.t.[Toep(u)rrv]0,\min_{u,r,v}\frac{1}{2}\|r_{\Omega}-y_{\Omega}\|_{2}^{2}+\lambda\left(\frac{1}% {2N}\text{tr}(\text{Toep}(u))+\frac{v}{2}\right),\quad s.t.\quad\begin{bmatrix% }\text{Toep}(u)&r\\ r^{\dagger}&v\end{bmatrix}\succeq 0,roman_min start_POSTSUBSCRIPT italic_u , italic_r , italic_v end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ italic_r start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_λ ( divide start_ARG 1 end_ARG start_ARG 2 italic_N end_ARG tr ( Toep ( italic_u ) ) + divide start_ARG italic_v end_ARG start_ARG 2 end_ARG ) , italic_s . italic_t . [ start_ARG start_ROW start_CELL Toep ( italic_u ) end_CELL start_CELL italic_r end_CELL end_ROW start_ROW start_CELL italic_r start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT end_CELL start_CELL italic_v end_CELL end_ROW end_ARG ] ⪰ 0 , (112)

with λ=Θ(η|Ω|/||)𝜆Θ𝜂Ω\lambda=\Theta(\eta\sqrt{|\Omega|/|\mathcal{F}|})italic_λ = roman_Θ ( italic_η square-root start_ARG | roman_Ω | / | caligraphic_F | end_ARG ). This is the so-called Beurling-Lasso algorithm [37].

The next theorem shows the effciency of the algorithm, which is simply a rewriting of the results in [38] with our notations.

Theorem 2.

Suppose we have the signal

yn=yn0+zn,yn0=fcfei2πfn,n=N,N+1,,N,formulae-sequencesubscript𝑦𝑛subscriptsuperscript𝑦0𝑛subscript𝑧𝑛formulae-sequencesubscriptsuperscript𝑦0𝑛subscript𝑓subscript𝑐𝑓superscript𝑒𝑖2𝜋𝑓𝑛𝑛𝑁𝑁1𝑁y_{n}=y^{0}_{n}+z_{n},\quad y^{0}_{n}=\sum_{f\in\mathcal{F}}c_{f}e^{i2\pi fn},% \quad n=-N,-N+1,\cdots,N,italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_i 2 italic_π italic_f italic_n end_POSTSUPERSCRIPT , italic_n = - italic_N , - italic_N + 1 , ⋯ , italic_N , (113)

where cfsubscript𝑐𝑓c_{f}italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT are positive numbers with summation 1, f[0,1)𝑓01f\in[0,1)italic_f ∈ [ 0 , 1 ). We know the values of ynsubscript𝑦𝑛y_{n}italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT on the set Ωnormal-Ω\Omegaroman_Ω, a set of integers sampled from N𝑁-N- italic_N to N𝑁Nitalic_N. The solution to Eq. (112) r#superscript𝑟normal-#r^{\#}italic_r start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT can be decomposed as

rn#=g𝒢dgei2πgn.subscriptsuperscript𝑟#𝑛subscript𝑔𝒢subscript𝑑𝑔superscript𝑒𝑖2𝜋𝑔𝑛r^{\#}_{n}=\sum_{g\in\mathcal{G}}d_{g}e^{i2\pi gn}.italic_r start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_g ∈ caligraphic_G end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_i 2 italic_π italic_g italic_n end_POSTSUPERSCRIPT . (114)

To quantify the similarity between 𝒢𝒢\mathcal{G}caligraphic_G and \mathcal{F}caligraphic_F, define

Δ=minmminf1f2|f1f2+m|,subscriptΔsubscript𝑚subscriptsubscript𝑓1subscript𝑓2subscript𝑓1subscript𝑓2𝑚\displaystyle\Delta_{\mathcal{F}}=\min_{m\in\mathbb{Z}}\min_{f_{1}\neq f_{2}% \in\mathcal{F}}|f_{1}-f_{2}+m|,roman_Δ start_POSTSUBSCRIPT caligraphic_F end_POSTSUBSCRIPT = roman_min start_POSTSUBSCRIPT italic_m ∈ blackboard_Z end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≠ italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ caligraphic_F end_POSTSUBSCRIPT | italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_m | , (115)
Rf𝑛𝑒𝑎𝑟={k:π23N(N+4)(kf)21128},subscriptsuperscript𝑅𝑛𝑒𝑎𝑟𝑓conditional-set𝑘superscript𝜋23𝑁𝑁4superscript𝑘𝑓21128\displaystyle R^{\text{near}}_{f}=\left\{k:\frac{\pi^{2}}{3}N(N+4)(k-f)^{2}% \leq\frac{1}{128}\right\},italic_R start_POSTSUPERSCRIPT near end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = { italic_k : divide start_ARG italic_π start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 3 end_ARG italic_N ( italic_N + 4 ) ( italic_k - italic_f ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 128 end_ARG } , (116)
R𝑓𝑎𝑟=[0,1)fRf𝑛𝑒𝑎𝑟.superscript𝑅𝑓𝑎𝑟01subscript𝑓subscriptsuperscript𝑅𝑛𝑒𝑎𝑟𝑓\displaystyle R^{\text{far}}=[0,1)\char 92\bigcup_{f\in\mathcal{F}}R^{\text{% near}}_{f}.italic_R start_POSTSUPERSCRIPT far end_POSTSUPERSCRIPT = [ 0 , 1 ) “ ⋃ start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT italic_R start_POSTSUPERSCRIPT near end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT . (117)

Then the estimator for f𝑓fitalic_f is

f^=average value of {g:gRf𝑛𝑒𝑎𝑟},^𝑓average value of conditional-set𝑔𝑔subscriptsuperscript𝑅𝑛𝑒𝑎𝑟𝑓\hat{f}=\text{average value of }\{g:g\in R^{\text{near}}_{f}\},over^ start_ARG italic_f end_ARG = average value of { italic_g : italic_g ∈ italic_R start_POSTSUPERSCRIPT near end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT } , (118)

and the estimator for cfsubscript𝑐𝑓c_{f}italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT is

c^f=gRf𝑛𝑒𝑎𝑟dg.subscript^𝑐𝑓subscript𝑔subscriptsuperscript𝑅𝑛𝑒𝑎𝑟𝑓subscript𝑑𝑔\hat{c}_{f}=\sum_{g\in R^{\text{near}}_{f}}d_{g}.over^ start_ARG italic_c end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_g ∈ italic_R start_POSTSUPERSCRIPT near end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT . (119)

If N128,zΩ2|Ω|η,Δ>2||1/4N(N+4),λ=Θ(η|Ω|/||)formulae-sequence𝑁128formulae-sequencesubscriptnormsubscript𝑧normal-Ω2normal-Ω𝜂formulae-sequencesubscriptnormal-Δ2superscript14𝑁𝑁4𝜆normal-Θ𝜂normal-ΩN\geq 128,\,\|z_{\Omega}\|_{2}\leq\sqrt{|\Omega|}\eta,\,\Delta_{\mathcal{F}}>% \frac{2|\mathcal{F}|^{1/4}}{\sqrt{N(N+4)}},\,\lambda=\Theta(\eta\sqrt{|\Omega|% /|\mathcal{F}|})italic_N ≥ 128 , ∥ italic_z start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ square-root start_ARG | roman_Ω | end_ARG italic_η , roman_Δ start_POSTSUBSCRIPT caligraphic_F end_POSTSUBSCRIPT > divide start_ARG 2 | caligraphic_F | start_POSTSUPERSCRIPT 1 / 4 end_POSTSUPERSCRIPT end_ARG start_ARG square-root start_ARG italic_N ( italic_N + 4 ) end_ARG end_ARG , italic_λ = roman_Θ ( italic_η square-root start_ARG | roman_Ω | / | caligraphic_F | end_ARG ), and

|Ω|𝒪(max{||logNδ,||log2||δ}),Ω𝒪𝑁𝛿superscript2𝛿|\Omega|\geq\mathcal{O}\left(\max\left\{|\mathcal{F}|\log\frac{N}{\delta},|% \mathcal{F}|\log^{2}\frac{|\mathcal{F}|}{\delta}\right\}\right),| roman_Ω | ≥ caligraphic_O ( roman_max { | caligraphic_F | roman_log divide start_ARG italic_N end_ARG start_ARG italic_δ end_ARG , | caligraphic_F | roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT divide start_ARG | caligraphic_F | end_ARG start_ARG italic_δ end_ARG } ) , (120)

introduce a constant ε00.000504subscript𝜀00.000504\varepsilon_{0}\approx 0.000504italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≈ 0.000504, then we have

|c^fcf|8|Ω|η+(8λ+|Ω|η)||+4λ+1ε0ε0(|Ω|η+λ||)22λ,subscript^𝑐𝑓subscript𝑐𝑓8Ω𝜂8𝜆Ω𝜂4𝜆1subscript𝜀0subscript𝜀0superscriptΩ𝜂𝜆22𝜆\displaystyle|\hat{c}_{f}-c_{f}|\leq 8\sqrt{|\Omega|}\eta+(8\lambda+\sqrt{|% \Omega|}\eta)\sqrt{|\mathcal{F}|}+4\lambda+\frac{1-\varepsilon_{0}}{% \varepsilon_{0}}\frac{(\sqrt{|\Omega|}\eta+\lambda\sqrt{|\mathcal{F}|})^{2}}{2% \lambda},| over^ start_ARG italic_c end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT - italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT | ≤ 8 square-root start_ARG | roman_Ω | end_ARG italic_η + ( 8 italic_λ + square-root start_ARG | roman_Ω | end_ARG italic_η ) square-root start_ARG | caligraphic_F | end_ARG + 4 italic_λ + divide start_ARG 1 - italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG divide start_ARG ( square-root start_ARG | roman_Ω | end_ARG italic_η + italic_λ square-root start_ARG | caligraphic_F | end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_λ end_ARG , (121)
|f^f|mod 1186N,gR𝑓𝑎𝑟|dg|(|Ω|η+λ||)22ε0λformulae-sequencesubscript^𝑓𝑓mod 1186𝑁subscript𝑔superscript𝑅𝑓𝑎𝑟subscript𝑑𝑔superscriptΩ𝜂𝜆22subscript𝜀0𝜆\displaystyle|\hat{f}-f|_{\text{mod }1}\leq\frac{1}{8\sqrt{6}N},\quad\sum_{g% \in R^{\text{far}}}|d_{g}|\leq\frac{(\sqrt{|\Omega|}\eta+\lambda\sqrt{|% \mathcal{F}|})^{2}}{2\varepsilon_{0}\lambda}| over^ start_ARG italic_f end_ARG - italic_f | start_POSTSUBSCRIPT mod 1 end_POSTSUBSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 8 square-root start_ARG 6 end_ARG italic_N end_ARG , ∑ start_POSTSUBSCRIPT italic_g ∈ italic_R start_POSTSUPERSCRIPT far end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | italic_d start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT | ≤ divide start_ARG ( square-root start_ARG | roman_Ω | end_ARG italic_η + italic_λ square-root start_ARG | caligraphic_F | end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_λ end_ARG (122)

with probability at least 1δ1𝛿1-\delta1 - italic_δ.

The more detailed discussions can be found in Proposition 1, Proposition 2, and Appendix C of [38]. Roughly speaking, similarly to the regular compressed sensing, 𝒪(logN)𝒪𝑁\mathcal{O}(\log N)caligraphic_O ( roman_log italic_N ) random samples are enough to approximately recover the noisy signal. But the output of the off-grid compressed sensing r#superscript𝑟#r^{\#}italic_r start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT is an estimate of the time-domain signal, not the frequency-domain signal. We need an extra step to recover the frequency support. In signal recovery, when the minimal frequency gap of the signal has lower bound N1superscript𝑁1N^{-1}italic_N start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, we say the frequency support is well-separated and can be recovered easily. However, we do not have any conclusion about the minimal frequency gap of the output signal r#superscript𝑟#r^{\#}italic_r start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT, thus the original method in [35] that depends on the reconstruction of dual certificate is not guaranteed to work. Besides, in practice the performance of the original algorithm is very sensitive to noise, which makes it impractical. We need a different post-processing procedure to recover an estimation of \mathcal{F}caligraphic_F from r#superscript𝑟#r^{\#}italic_r start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT.

The multiple signal classification (MUSIC) algorithm [21] is one strong candidate for this. The algorithm is constructed on the Hankel matrix that is defined as (when t𝑡titalic_t takes value from N𝑁-N- italic_N to N𝑁Nitalic_N)

(Hank(y))mn=ym+nN1,m,n[N].formulae-sequencesubscriptHank𝑦𝑚𝑛subscript𝑦𝑚𝑛𝑁1𝑚𝑛delimited-[]𝑁(\text{Hank}(y))_{mn}=y_{m+n-N-1},\quad m,n\in[N].( Hank ( italic_y ) ) start_POSTSUBSCRIPT italic_m italic_n end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_m + italic_n - italic_N - 1 end_POSTSUBSCRIPT , italic_m , italic_n ∈ [ italic_N ] . (123)

For a noisy signal yn=yn0+znsubscript𝑦𝑛subscriptsuperscript𝑦0𝑛subscript𝑧𝑛y_{n}=y^{0}_{n}+z_{n}italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_z start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, we can further separate the Hankel matrix into two parts:

Hank(y)=Hank(y0)+Hank(z).Hank𝑦Hanksuperscript𝑦0Hank𝑧\text{Hank}(y)=\text{Hank}(y^{0})+\text{Hank}(z).Hank ( italic_y ) = Hank ( italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) + Hank ( italic_z ) . (124)

The Hankel matrix has the following decomposition:

Hank(y0)Hanksuperscript𝑦0\displaystyle\text{Hank}(y^{0})Hank ( italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) =fcfei2π(N1)fa(f)a(f),absentsubscript𝑓subscript𝑐𝑓superscript𝑒𝑖2𝜋𝑁1𝑓𝑎superscript𝑓top𝑎𝑓\displaystyle=\sum_{f\in\mathcal{F}}c_{f}e^{-i2\pi(N-1)f}a(f)^{\top}a(f),= ∑ start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π ( italic_N - 1 ) italic_f end_POSTSUPERSCRIPT italic_a ( italic_f ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_a ( italic_f ) , (125)
a(f)𝑎𝑓\displaystyle a(f)italic_a ( italic_f ) =[ei2πfei2π2fei2πNf].absentsuperscriptdelimited-[]superscript𝑒𝑖2𝜋𝑓superscript𝑒𝑖2𝜋2𝑓superscript𝑒𝑖2𝜋𝑁𝑓top\displaystyle=[e^{-i2\pi f}\ e^{-i2\pi 2f}\ \cdots\ e^{-i2\pi Nf}]^{\top}.= [ italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_f end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π 2 italic_f end_POSTSUPERSCRIPT ⋯ italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_N italic_f end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT . (126)

Therefore, the set of {a(f),f}𝑎𝑓𝑓\{a(f),f\in\mathcal{F}\}{ italic_a ( italic_f ) , italic_f ∈ caligraphic_F } can be found in the space spanned by the vectors with the |||\mathcal{F}|| caligraphic_F | largest singular values of Hank(y0)Hanksuperscript𝑦0\text{Hank}(y^{0})Hank ( italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ). Define the projector onto the null space of Hank(y0)Hanksuperscript𝑦0\text{Hank}(y^{0})Hank ( italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) as Pnullsubscript𝑃nullP_{\text{null}}italic_P start_POSTSUBSCRIPT null end_POSTSUBSCRIPT. Define the noise-space correlation function as

C0(k)=Pnulla(k)2a(k)2.subscript𝐶0𝑘subscriptnormsubscript𝑃null𝑎𝑘2subscriptnorm𝑎𝑘2C_{0}(k)=\frac{\|P_{\text{null}}a(k)\|_{2}}{\|a(k)\|_{2}}.italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_k ) = divide start_ARG ∥ italic_P start_POSTSUBSCRIPT null end_POSTSUBSCRIPT italic_a ( italic_k ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_a ( italic_k ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG . (127)

It turns out C0(k)=0subscript𝐶0𝑘0C_{0}(k)=0italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_k ) = 0 if and only if k𝑘k\in\mathcal{F}italic_k ∈ caligraphic_F (see Theorem 1 of [21]). Similarly, we can define C(k)𝐶𝑘C(k)italic_C ( italic_k ) as the noise-space correlation function for y𝑦yitalic_y, and as long as Hank(z)Hank𝑧\text{Hank}(z)Hank ( italic_z ) does not perturb the null space too much, the local minima of C(k)𝐶𝑘C(k)italic_C ( italic_k ) can serve as estimator for \mathcal{F}caligraphic_F.

Combining the MUSIC algorithm with the off-grid compressed sensing we obtain Algorithm 2 for the QEEP. A numerical test that demonstrates the efficiency of the algorithm is shown in Fig. 5.

Algorithm 2 QEEP
1: accuracy level N𝑁Nitalic_N, time step τ𝜏\tauitalic_τ, size of frequency support |||\mathcal{F}|| caligraphic_F |, expected failure probability δ1,δ2subscript𝛿1subscript𝛿2\delta_{1},\delta_{2}italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, Hamiltonian H𝐻Hitalic_H, and an initial state |ΦketΦ|\Phi\rangle| roman_Φ ⟩.
2: the |||\mathcal{F}|| caligraphic_F | smallest local minima of C(k)𝐶𝑘C(k)italic_C ( italic_k ).
3:Sample 𝒪(max{||log(N/δ1),||log2(||/δ1)})𝒪𝑁subscript𝛿1superscript2subscript𝛿1\mathcal{O}\left(\max\left\{|\mathcal{F}|\log(N/\delta_{1}),|\mathcal{F}|\log^% {2}(|\mathcal{F}|/\delta_{1})\right\}\right)caligraphic_O ( roman_max { | caligraphic_F | roman_log ( italic_N / italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , | caligraphic_F | roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( | caligraphic_F | / italic_δ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) } ) integers from [N,N]𝑁𝑁[-N,N][ - italic_N , italic_N ] as ΩΩ\Omegaroman_Ω.
4:for nΩ𝑛Ωn\in\Omegaitalic_n ∈ roman_Ω do
5:     Prepare the initial state |ΦketΦ|\Phi\rangle| roman_Φ ⟩ and the unitary operator eiHnτsuperscript𝑒𝑖𝐻𝑛𝜏e^{-iHn\tau}italic_e start_POSTSUPERSCRIPT - italic_i italic_H italic_n italic_τ end_POSTSUPERSCRIPT;
6:     Perform Hadamard tests on eiHnτ|Φsuperscript𝑒𝑖𝐻𝑛𝜏ketΦe^{-iHn\tau}|\Phi\rangleitalic_e start_POSTSUPERSCRIPT - italic_i italic_H italic_n italic_τ end_POSTSUPERSCRIPT | roman_Φ ⟩ for 𝒪(log(|Ω|δ21))𝒪Ωsuperscriptsubscript𝛿21\mathcal{O}(\log(|\Omega|\delta_{2}^{-1}))caligraphic_O ( roman_log ( | roman_Ω | italic_δ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) ) times;
7:     Calculate the average value of the test outcomes as signal ytsubscript𝑦𝑡y_{t}italic_y start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT.
8:end for
9:Solve
minu,r,v12rΩyΩ22+λ(12Ntr(Toep(u))+v2),s.t.[Toep(u)rrv]0\min_{u,r,v}\frac{1}{2}\|r_{\Omega}-y_{\Omega}\|_{2}^{2}+\lambda\left(\frac{1}% {2N}\text{tr}(\text{Toep}(u))+\frac{v}{2}\right),\quad s.t.\quad\begin{bmatrix% }\text{Toep}(u)&r\\ r^{\dagger}&v\end{bmatrix}\succeq 0roman_min start_POSTSUBSCRIPT italic_u , italic_r , italic_v end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∥ italic_r start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_λ ( divide start_ARG 1 end_ARG start_ARG 2 italic_N end_ARG tr ( Toep ( italic_u ) ) + divide start_ARG italic_v end_ARG start_ARG 2 end_ARG ) , italic_s . italic_t . [ start_ARG start_ROW start_CELL Toep ( italic_u ) end_CELL start_CELL italic_r end_CELL end_ROW start_ROW start_CELL italic_r start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT end_CELL start_CELL italic_v end_CELL end_ROW end_ARG ] ⪰ 0
to obtain solution u#,r#,v#superscript𝑢#superscript𝑟#superscript𝑣#u^{\#},r^{\#},v^{\#}italic_u start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT , italic_r start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT , italic_v start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT.
10:Perform singular value decomposition on Hank(r#)Hanksuperscript𝑟#\text{Hank}(r^{\#})Hank ( italic_r start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT ):
Hank(r#)=[u1,u2,,u||]diag{σ1,σ2,,σ||,}[v1,v2,,v||],Hanksuperscript𝑟#subscript𝑢1subscript𝑢2subscript𝑢diagsubscript𝜎1subscript𝜎2subscript𝜎superscriptsubscript𝑣1subscript𝑣2subscript𝑣\text{Hank}(r^{\#})=[u_{1},u_{2},\cdots,u_{|\mathcal{F}|}\cdots]\text{diag}\{% \sigma_{1},\sigma_{2},\cdots,\sigma_{|\mathcal{F}|},\cdots\}[v_{1},v_{2},% \cdots,v_{|\mathcal{F}|}\cdots]^{\dagger},Hank ( italic_r start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT ) = [ italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_u start_POSTSUBSCRIPT | caligraphic_F | end_POSTSUBSCRIPT ⋯ ] diag { italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_σ start_POSTSUBSCRIPT | caligraphic_F | end_POSTSUBSCRIPT , ⋯ } [ italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_v start_POSTSUBSCRIPT | caligraphic_F | end_POSTSUBSCRIPT ⋯ ] start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT ,
where {σ1,σ2,,σ||}subscript𝜎1subscript𝜎2subscript𝜎\{\sigma_{1},\sigma_{2},\cdots,\sigma_{|\mathcal{F}|}\}{ italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_σ start_POSTSUBSCRIPT | caligraphic_F | end_POSTSUBSCRIPT } are the |||\mathcal{F}|| caligraphic_F | largest singular values.
11:Find the projector onto span{u1,u2,,u||}spansubscript𝑢1subscript𝑢2subscript𝑢\text{span}\{u_{1},u_{2},\cdots,u_{|\mathcal{F}|}\}span { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_u start_POSTSUBSCRIPT | caligraphic_F | end_POSTSUBSCRIPT } as P1subscript𝑃1P_{1}italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.
12:Compute the noise-space correlation function: C(k)=(IP1)a(k)2/a(k)2𝐶𝑘subscriptnorm𝐼subscript𝑃1𝑎𝑘2subscriptnorm𝑎𝑘2C(k)=\|(I-P_{1})a(k)\|_{2}/\|a(k)\|_{2}italic_C ( italic_k ) = ∥ ( italic_I - italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_a ( italic_k ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT / ∥ italic_a ( italic_k ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.
Refer to caption
Figure 5: Signal recovery by off-grid compressed sensing and MUSIC algorithm. The example uses signal yn0=13ei2πf1n+13ei2πf1n+13ei2πf3nsubscriptsuperscript𝑦0𝑛13superscript𝑒𝑖2𝜋subscript𝑓1𝑛13superscript𝑒𝑖2𝜋subscript𝑓1𝑛13superscript𝑒𝑖2𝜋subscript𝑓3𝑛y^{0}_{n}=\frac{1}{3}e^{-i2\pi f_{1}n}+\frac{1}{3}e^{-i2\pi f_{1}n}+\frac{1}{3% }e^{-i2\pi f_{3}n}italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 3 end_ARG italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 3 end_ARG italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_n end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 3 end_ARG italic_e start_POSTSUPERSCRIPT - italic_i 2 italic_π italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_n end_POSTSUPERSCRIPT with f1=0.032,f2=0.064formulae-sequencesubscript𝑓10.032subscript𝑓20.064f_{1}=0.032,\,f_{2}=0.064italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0.032 , italic_f start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 0.064, and f3=0.128subscript𝑓30.128f_{3}=0.128italic_f start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 0.128. We set N=100𝑁100N=100italic_N = 100 (totally 201 points), |Ω|=50Ω50|\Omega|=50| roman_Ω | = 50, and λ=0.05𝜆0.05\lambda=0.05italic_λ = 0.05. The sampled data comes from taking the average over 500 Hadamard tests. Fig a) shows the comparison between the real parts of the true signal y0superscript𝑦0y^{0}italic_y start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT, the sampled signal yΩsubscript𝑦Ωy_{\Omega}italic_y start_POSTSUBSCRIPT roman_Ω end_POSTSUBSCRIPT, and the recovered signal from the off-grid compressed sensing r#superscript𝑟#r^{\#}italic_r start_POSTSUPERSCRIPT # end_POSTSUPERSCRIPT. Fig b) shows the outcomes of the MUSIC algorithm. The local minima set of C(k)𝐶𝑘C(k)italic_C ( italic_k ) is the estimator for \mathcal{F}caligraphic_F. Fig c) shows the inverse of C(k)𝐶𝑘C(k)italic_C ( italic_k ).