Quantum phase estimation by compressed sensing
Abstract
As a signal recovery algorithm, compressed sensing is particularly useful when the data has low-complexity and samples are rare, which matches perfectly with the task of quantum phase estimation (QPE). In this work we present a new Heisenberg-limited QPE algorithm for early fault-tolerant quantum computers based on compressed sensing. More specifically, given many copies of a proper initial state and queries to a specific unitary matrix, our algorithm is able to recover the phase with a total runtime , where is the desired accuracy. Moreover, the maximal runtime satisfies , which is comparable to the state-of-the-art algorithms, and our algorithm is also robust against certain amount of noise from sampling and state preparation.
1 Introduction
Quantum phase estimation (QPE) [1] is one of the most useful subroutines in quantum computing and plays an important role in many promising quantum applications [2, 3, 4, 5]. Given a unitary matrix and one of its eigenvectors such that , the task of QPE is to estimate phase with high probability within a given accuracy guarantee. The problem of estimating multiple phases of has been referred to the quantum eigenvalue estimation problem (QEEP) [5, 6, 7, 8, 9, 10]. When we set the unitary matrix as the evolution operator under a Hamiltonian , the task of QPE is equivalent to estimating a specific energy level with accuracy [7, 8].
While fully fault-tolerant quantum computers may still be years away from realization, early fault-tolerant quantum computers with limited number of logical qubits and limited circuit depth are expected to be realized much sooner and to solve nontrivial tasks that demonstrate practical quantum advantages. Given the crucial role of QPE in many of such tasks, it becomes imperative to design QPE algorithms specifically tailored for early fault-tolerant quantum computers. The standard textbook QPE algorithm [11] does not require an exact eigenstate as the initial state and takes only one measurement, but it uses a large amount of ancilla qubits and controlled operations, which is fairly demanding in experiment. Although Kitaev’s original iterative QPE algorithm [1] only uses one ancilla qubit and one controlled operation (see Fig. 1), it requires the initial state to be an exact eigenstate which can be a difficult task by itself. Therefore, both of them are not suitable for early fault-tolerant quantum computers.
Much of the recent work [7, 12, 13] in QPE for early fault-tolerant quantum computers has focused on designing better protocols to improve various aspects of Kitaev’s original QPE algorithm. More specifically, the following properties are desired in designing such algorithms:
-
•
The quantum circuit should be simple, using at most one ancilla qubit and one controlled operation.
-
•
The initial state is not necessarily an exact eigenstate of .
-
•
The total runtime achieves the Heisenberg limit, i.e., the total cost should be
for estimating the phase to accuracy with probability . -
•
When the overlap of the initial state and the targeted eigenstate is large, the maximal runtime (hence the maximum circuit depth) can be much smaller than .
![Refer to caption](extracted/5304001/hadamard.jpg)
The procedure of these algorithms can be separated into the quantum part and the classical part. Usually, the quantum part is a combination of Hamiltonian simulation and the Hadamard test (see Fig. 1). Hamiltonian simulation algorithms [14] are used to prepare the evolution operator . Longer evolution time requires more quantum gates, and the best known circuit complexity for running without ancilla qubits is almost linear in [15, 16]. The Hadamard test produces information about in the form of a complex signal. Specifically, given an initial state , can be estimated with many runs of Hadamard tests. For QPE, the signal is dominated by a single complex sinusoidal function (i.e., ), and the objective is to estimate its frequency . For QEEP, the signal is regarded as a linear combination of multiple complex sinusoidal functions (i.e., )). The target signal is said to have length- and sparsity- if takes value from integers in and contains distinct frequencies . For QEEP, is usually assumed to be much smaller than (for QPE, ). The classical aspect of QPE and QEEP involves estimating frequencies from these statistically sampled sparse signals, a process akin to the objectives of sparse Fourier transformation algorithms [17].
There are several aspects of evaluating the performance of a sparse Fourier transformation algorithm. The runtime complexity, the sample complexity, and the resolution are all important ingredients to consider. Here the runtime complexity quantifies how long the algorithm takes, and the sample complexity measures the number of time-domain signal samples required in the algorithm. For example, the Fast Fourier Transformation algorithm [18] has runtime complexity with sample complexity . So far, the best runtime complexity is with [19], and the most sample-efficient algorithm requires only samples [20]. In practical scenarios, we most likely have noisy data, necessitating the need for algorithmic robustness. Classical algorithms such as Multiple Signal Classification (MUSIC) [21] and Estimation of Signal Parameters via Rational Invariance Techniques (ESPRIT) [22] are two examples of the robust signal processing algorithms. Given the unique characteristics of our quantum setting, we prioritize the sample complexity, resolution, and robustness of an algorithm.
Many of these classical signal processing algorithms have been used in the task of QEEP. To the best of our knowledge, [6] was the first attempt to solve QEEP with Hadamard tests, where QEEP was treated as a time-series analysis problem. Later, [7] emphasized the importance of the Heisenberg-limited scaling, and listed a few other requirements for the post-processing algorithm. By applying the Fourier-filter function techniques they succeeded in designing the first Heisenberg-limited QPE algorithm for early fault-tolerant quantum computers. Their algorithm was further improved by other work [5, 8], where the maximal evolution time was largely reduced. Two recent QPE algorithms [12, 13], inspired by Robust Phase Estimation [23, 24, 25], were proposed. Both algorithms have a hierarchy structure, where the unit evolution times are taken from . These recent algorithms improved the relation between , the initial overlap , and the final accuracy . When the overlap is large, the work of [12] reduces the prefactor in the maximum runtime scaling by using a subroutine called the quantum complex exponential least squares (QCELS). In contrast to [7] in which the prefactor is at least , the prefactor in [12] can be arbitrarily close to as . The more recent work [13] further improved the requirement on . In [26] and [27], the last two QPE algorithms have been extended to the QEEP set up.
In this work we present a new algorithmic framework of solving both QPE and QEEP using compressed sensing [28, 29, 30], which has not been carefully considered as far as we know. Compressed sensing is a prominent algorithm for signal recovery with wide applications in various domains such as time-frequency analysis, image processing, and quantum state tomography [31, 32, 33]. The framework of compressed sensing assumes the sparsity of the signal and recovers the entire signal from a few samples by solving a linear/semidefinite programming (SDP) problem. Here, the sparsity refers to the number of sinusoidal functions in the signal being small. The small number of required samples and the robustness makes compressed sensing an appealing choice for designing QPE algorithm.
Our main contribution is a simple and robust classical post-processing algorithm for QPE based on compressed sensing. In QPE, we regard both the uncertainty from the Hadamard tests and the inaccuracy of initial state preparation as noise. To extract target frequency (which has a linear relationship with the target energy ) in the presence of noise, our main idea is to use the robust recovery property of convex relaxation algorithm [29], a modified version of the vanilla compressed sensing. For signal vectors with size , when the frequency is nearly on-grid () and the noise for each sample is bounded by a constant, the convex relaxation algorithm can recover with only samples, which satisfies the Heisenberg limit. With no prior knowledge about (i.e., could be off-grid), our algorithm can still find a grid shift parameter such that after shifting the signal by , the new signal becomes nearly on-grid, and the convex relaxation algorithm can still be applied. By searching the optimal grid shift parameter in a finite set, eventually the accuracy of is , where is a constant related to the noise and the interval between the grid shift parameters. In terms of the maximum runtime , since the samples of the compressed sensing algorithm are integers in , scales linearly in , and is .
The rest of the paper is organized as follows. We start with preliminaries about QEEP and compressed sensing in Sec. 2. We then introduce our QPE algorithm based on compressed sensing in Sec. 3 and prove several analytical results including its Heisenberg-limit scaling. In Sec. 4 we numerically test the performance of our algorithm and compare it to previous works. Finally, we summarize several open problems and potential future research directions in Sec. 5.
2 Preliminaries
The notations frequently used in the main text are summarized in Table 1.
Notation | Meaning |
---|---|
sampled time-domain signal | |
ideal time-domain signal | |
noise | |
ideal frequency-domain signal | |
set of evolution times | |
sampling ratio | |
samples in compressed sensing | |
unit time step | |
accuracy on energy level | |
failure probability | |
noise tolerance for each signal |
2.1 QEEP as a signal recovery problem
In this section, we express the QEEP as a sparse signal recovery problem. Given a specific Hamiltonian with spectrum decomposition , where are energy levels and are projectors onto the corresponding eigenstates, and an initial state , the time-domain signal in QEEP is given as
(1) |
Denote the set of energy levels that have non-zero overlaps with as the target set
(2) |
Each element in corresponds to a sinusoidal function whose frequency is determined by . We assume is small so that can be regarded as a sparse signal. More specifically, in compressed sensing algorithms, sparsity usually means that [28], where is the size of the discrete signals.
The task of a QEEP algorithm is equivalent to estimating within certain accuracy level using the data obtained from the Hadamrd tests. Assuming the output of the QEEP algorithm is , we require , and
(3) |
We have the freedom to choose the set of evolution times to use in the algorithm. Denote the set of evolution times as . For each time , can be obtained from averaging over the Hadamard tests. More precisely, when , the measurement outcome in Fig. 1 is a random variable
(4) |
Similarly, when , we obtain
(5) |
The summation of the two gives the estimate of :
(6) |
After sampling the random variable for times, we obtain a noisy signal:
(7) |
then we use the noisy samples to recover . Here originates from the statistical uncertainty of the Hadamard tests. Hoeffding’s inequality ensures that with probability , we have
(8) |
In the rest of the paper, the meanings of are not identical, but they always represent the part of the signal that should be considered as noise. Introduce the noise tolerance parameter such that the signal recovery algorithm can recover accurate estimates for as long as the noise of each signal is not larger than . Thus to guarantee for all with probability , we require , so would be proportional to . For a rigorous proof of this, see Appendix A of [12].
The total experiment cost can be captured by the total runtime , which reflects the total circuit depth for completing the entire algorithm. In Hamiltonian simulation, the circuit complexity of constructing operator scales linearly with [15] ( can be negative). Thus, in this set up, the total runtime is
(9) |
For instance, if the signal recovery algorithm has parameters , and , then it achieves the Heisenberg limit. We will see that the algorithm we have using compressed sensing fits this description. The maximal runtime , which reflects the maximum circuit depth, is particularly important for early fault-tolerant quantum computers.
When contains only one energy level, the task becomes QPE. In QPE, without loss of generality111In this work we don’t consider the hardness of the preparation of the initial state. From the point view of phase estimation, there’s nothing special about the ground state energy compared to other eigenvalues as long as one can prepare an initial state that is close enough to the target eigenstate., we will be mainly discussing the estimation of the ground energy , i.e., the smallest eigenvalue of . In general, we do not expect to be able to prepare the exact ground state but assume that the initial state has a large overlap with the ground state:
(10) |
Note that equals to in the Introduction. As long as is small enough, the signal is still dominated by , and the ground energy can be estimated efficiently.
2.2 Compressed sensing for signal recovery
In this section we introduce some necessary concepts and notations for compressed sensing. Given a vector , its -norm, -norm and norm are defined as
(11) |
In the following paragraph, we use to label the indices of entries in frequency domain, and use to label the indices of entries in time domain. Denote the set of integers from 1 to as . In regular compressed sensing, we deal with a time-domain discrete signal in the form of
(12) |
where , and is the set of frequency support. The time-domain signal can thus be written as an -dimensional vector
(13) |
Define the Fourier matrix as . By labeling its columns with , we have
(14) |
Throughout the paper, if the frequency satisfies
(15) |
then we say is on-grid, otherwise it is off-grid. For a frequency , we define its off-grid deviation as
(16) |
If all are on-grid satisfying Eq. (15), the frequency-domain signal can be written in the form of a vector as
(17) |
It is a sparse vector with non-zero entries. The purpose of compressed sensing is then to recover from a few noisy samples of .
The algorithm is accomplished in the following scenario. Choose a sampling ratio , and assign each integer in a random variable that satisfies
(18) |
Draw one sample from each , and denote the set of integers with as the sample set . The size of the sample set concentrates around . Based on the choice of , we define the partial inverse Fourier transformation and the signal samples as
(19) | |||
(20) |
Note that has dimension , while has dimension . With these notations, the compressed sensing algorithm can be thought as solving the optimization problem
(21) |
which can be rewritten as a linear programming problem. When and the frequency support is sparse in the sense that , the optimal solution equals to the frequency-domain signal with high probability. Rigorous statements can be found in [28].
Provided that the signal has extra noise where , the signal can be approximately recovered by the convex relaxation algorithm [29]:
(22) |
The difference between and depends on the size of .
The uniqueness and robustness of compressed sensing solution can be analyzed through the so-called dual certificate [30]. In optimization, the dual certificate is the optimal solution to the dual problem. In a regular compressed sensing task where all frequencies are on-grid, for each frequency support and each sample set , the dual certificate is an -dimensional random vector such that
(23) | |||
(24) | |||
(25) |
where is determined by the size of . The existence of such a dual certificate is called the exact reconstruction principle [28]. Using dual certificate as a pivot, we can quantify the part of that is not supported on , which further helps us determine the accuracy of the algorithm. The basic idea is to separate the entries of into two parts:
(26) |
Because the true frequency-domain solution satisfies the constraint in Eq. (22), the 1-norm of is upper bounded by the 1-norm of . On the other hand, the norm of the inner product between and can be well-estimated, and the restrictions on the entries of enable us to deal with and separately. The combination of these considerations eventually provides us an upper bound on .
To write the QEEP signal in Eq. (1) in the form of the compressed sensing signal in Eq. (12), we introduce a unit time step , such that
(27) |
In the same framework, we have the following correspondence:
(28) |
The target energy levels can be directly obtained from the frequency support . Note that the frequencies are defined on . To keep the order of energy levels unchanged, it is necessary to have
(29) |
This condition can always be satisfied by adding a constant to the Hamiltonian and choosing properly.
3 Single eigenvalue estimation
In this section, we provide a quantitative restriction on the noise tolerance in Eq. (22). Then by considering the combination of three sources of inexactness, namely the uncertainty of the Hadamard test, the inaccuracy of the initial state, and the off-grid deviation as noise, we prove performance bounds for our algorithm.
Let us start with the case when the signal contains only one frequency and the frequency is on-grid: , for which we are given the values of
(30) |
on a random sample set . Note that the frequency-domain signal is . The following lemma shows how its dual certificate can be constructed.
Lemma 1.
Given values of signal on a random sample set with sampling ratio
(31) |
the random vector satisfies
(32) |
with probability at least .
Proof.
See Appendix A. ∎
Lemma 2.
Proof.
See Appendix B. ∎
In practical situations we cannot assume that the frequency is always perfectly on-grid, but this is not a problem. First, when is very close to an integer, Eq. (22) can still approximately recovery the signal. The following proposition quantifies how large the off-grid deviation can be.
Proposition 1.
Proof.
In this situation, we have
(35) | |||
(36) |
According to Lemma 2, one sufficient condition for robust recovery is for all . Hence it is enough to have . ∎
The in Proposition 34 represents the off-grid deviation. If , the solution is still a good approximation of the true frequency , and the accuracy still scales as . When the off-grid deviation is large, we can find a grid-shift parameter such that after the transformation , the new signal is nearly on-grid. The details are given in Algorithm 1.
Besides the Hadamard tests and the off-grid deviation, the third type of noise comes from the preparation of the initial state. In practice, it is hard to prepare the exact ground state . Suppose the actual initial state is
(37) |
with , then can be expanded as
(38) |
Note that only is the target signal and we can treat the term as noise as well. Thus, let , then each sampled signal can be decomposed as
(39) | ||||
where originates from the Hadamard test, originates from the state preparation, and is a grid-shift parameter. Recall that in Lemma 2, as long as , the noise does not break down the signal recovery (after shifting the entire signal by ). We separate into three parts. Let , , , and . Then given , , , the fidelity error upper bound , the number of Hadamard tests that is at least , and the number of grid shifts as , the complete algorithm for QPE is presented in Algorithm 1.
Theorem 1.
Given the ground state , the ground-state energy , and the initial state , if the noise tolerance parameters satisfy
(40) |
and the accuracy level satisfies , then with probability at least , where , the output of Algorithm 1 satisfies
(41) |
and the cost of the algorithm satisfies
(42) | |||
(43) |
Proof.
Since denotes the total failure rate of the Hadamard tests, by Hoeffding’s inequality Hadamard tests are enough to guarantee that for all , the additive error from the Hadamard tests is smaller than with probability .
Now we analyze the accuracy on . Let . Define as following
(44) |
Let be the projection of on the random set . The sampled signal becomes
(45) |
Define the total empirical error function as
(46) |
The choice of the grid shift parameters and the sampling ratio guarantee that, among the set of solutions , there exists at least one satisfying
(47) |
with probability at least . We omit the notation from now on. If is the only one in that minimizes the function, then , and the accuracy on is . Otherwise, suppose there exists (no matter originates from a successful compressed sensing algorithm or not), we need to quantify the relation between and and show that with high probability.
Our strategy is the following: if , then this already gives an error upper bound ; thus we only consider the case where . Suppose we can find positive such that for all ,
(48) |
then if , we have as well, and
(49) | |||
(50) | |||
(51) | |||
(52) | |||
(53) |
On the other hand, suppose for all in region , we have a constant lower bound:
(54) |
Using the same argument, leads to
(55) |
Therefore, if , we can rule out the possibility of .
Regard as the summation of random variables . According to Lemma 90 (in Appendix C), with probability at least
(56) |
the following parameters suffice for Eq. (48) and Eq. (54):
(57) |
Thus, because , must belongs to . In this region, we have
(58) |
which determines the accuracy of .
The maximal runtime equals to . Because is sampled from , we have . The upper bound on can be obtained from Eq. (9) and related parameters in the algorithm. ∎
Remark.
In order to express and in terms of and , we can choose to be smaller than and obtain the following approximation:
(59) |
Thus, because , we have
(60) |
![Refer to caption](x1.png)
In Fig. 2 we demonstrate our algorithm with a simple test case where the target frequency is with , and . As shown in Fig. 2, when the grid shift parameter is in region , the integer parts of { are all 20, which matches with the . In the same region, the total empirical error is approximately a quadratic function of the grid shift parameter.
For this test, the three noise tolerance parameters are
(61) |
therefore . Clearly, the noise tolerance parameters do not satisfy Eq. (40), but the algorithm still outputs the true frequency . This numerical experiment demonstrates that the parameters in Lemma 2 and Theorem 1 can potentially be further improved.
It is worth mentioning that if we assume the energies of are nearly on-grid simultaneously, then our algorithm can be directly applied to the multiple phase estimation. Indeed, suppose that there are eigenvalues with non-negligible amplitudes, then for each compressed sensing task, we can choose the largest entries of as a trial solution for the frequency domain signal. Similarly, by trying all the grid shift parameters we can obtain an optimal solution , and the output is related to the largest entries of :
(62) |
Unfortunately, there is no guarantee that all the frequencies are nearly on-grid. For example, if , our compressed sensing algorithm is not guaranteed to work no matter how we choose the grid shift parameters. Essentially, the frequencies we try to estimate are continuous parameters, however, we can only find solution in a discrete set. This problem is formally termed as basis mismatch in signal analysis.
4 Comparison to previous works
In this section, we compare our algorithm with previous results. First, we briefly introduce the two types of QCELS algorithms for QPE: the first one [12] (named multi-level QCELS) has a hierarchy structure similar to Robust Phase Estimation and it is used for QPE, the second one [26] (named MM-QCELS) is designed for QEEP.
The outline of the two algorithms can be described as follows. The two algorithms share a hierarchy structure, namely, the algorithm can be divided into several hierarchies. At each hierarchy, the algorithm outputs an estimate of the target energy, and in the next hierarchy, it searches for solutions in a narrower region and obtains a new estimate. At each hierarchy the multilevel QCELS first picks a time scale: , then performs the Hadamard tests for separately. Here is a constant. Given the value of the noisy signal at different times, the algorithm outputs the estimate for by minimizing the following cost function:
(63) |
This algorithm is proved to be efficient for single-phase estimation but not for multiple-phases estimation. The authors later proposed another relevant algorithm (MM-QCELS). In this method, they adapt the previous algorithm in two steps: the times at which the Hadamard tests are performed become random samples drawn from a probability distribution, and the cost function is changed to
(64) |
Moreover, when applying MM-QCELS to single-phase estimation, the hierarchy structure can be removed, so that the algorithm only has one-level.
We show ythe between multi-level QCELS and Algorithm 1 in Fig. 3. Here we perform a similar numerical test as the one in Sec. V. A of [12] using our algorithm. It turns out for the specific choice of sampling ratio, the scalings of averaged error versus are almost the same, while our is slightly larger than theirs.
![Refer to caption](extracted/5304001/QPE.jpeg)
Essentially, the idea of compressed sensing is similar to that of the one-level MM-QCELS. As mentioned in the discussion section of [26], the one-level MM-QCELS may be able to estimate the dominant phase efficiently. In both methods, one intends to fit the sampled data by an ansatz of the signal. The difference lies in the target function to optimize. In one-level MM-QCELS, the target function is the total empirical error in time domain. In compressed sensing, the target function is the 1-norm of the frequency domain signal.
For single-phase estimation, our compressed-sensing-based algorithm does not have a prominent advantage comparing to multi-level QCELS. However, for specific multiple-phase estimation tasks, our Algorithm 1 can perform better than the one-level MM-QCELS algorithm (see for example Fig. 4). This is because compressed sensing works for on-grid sparse signals in general, and the energy levels of times can happen to be nearly on-grid at the same time.
![Refer to caption](extracted/5304001/QEEP.jpeg)
5 Discussion
In this paper we presented a simple and robust algorithm for quantum phase estimation using compressed sensing. For the single eigenvalue estimation (i.e., QPE), we rigorously established its Heisenberg-limit scaling and numerically demonstrated its comparable performance compared to the other state-of-the-art QPE algorithms. We further did some exploratory work for the case of the multiple eigenvalues estimation (i.e., QEEP) and showed that empirically for certain scenarios our algorithm can be more sample-efficient than previous algorithms. Potentially in practice one can use our algorithm to solve the QEEP as a first attempt, and if the signal recovered by the algorithm fits the data well enough, then one already obtains a good estimation of the eigenvalues; otherwise we can switch to the other algorithms. It would be an excellent future direction to design a more complete compressed sensing based QEEP algorithm that performs well in practice while achieving the Heisenberg limit.
One main contribution of our work is the new connection between QPE and compressed sensing we established. While some aspects of our algorithm are still not ideal, e.g., our algorithm currently requires a higher initial overlap compared to the other recent works [7, 12, 13], we believe that our work provides a solid first step towards a more robust and resource-efficient compressed sensing based QPE algorithm for early fault-tolerant quantum computers, especially considering the rich literature on compressed sensing and its efficiency and robustness demonstrated in practical applications. Moreover, in practice before running any QPE algorithms, one always needs to prepare the initial state based on the choice of the Hamiltonian first. As long as we only require a constant overlap with the target eigenstate, the cost in state preparation should be approximately in the same order.
Lastly, we summarize a few open questions here:
-
•
As we mentioned along the numerical demonstration in Fig. 2, the results suggest that our algorithm is more robust than what we were able to prove rigorously. It therefore should be possible to further improve the noise tolerance in Lemma 2, which would also result in reducing the total runtime in our analysis.
-
•
Instead of sampling discrete time steps, we can also sample the evolution times on a continuous region. Similar to the setup in [26], we can design a probability distribution of evolution times , which then, as long as , the Heisenberg limit is still satisfied. Theoretically, continuous Fourier sampling can further reduce the sample complexity and the maximal runtime.
-
•
Designing Heisenberg-limited QEEP algorithms based on compressed sensing. One solution that we tried is the off-grid compressed sensing [35]. See Appendix D for a detailed discussion. Numerically the accuracy of the off-grid algorithm seems not as good as the other state-of-the-art methods. It would be an excellent direction to further improve this method, or find an alternative solution that solves the basis mismatch problem in QEEP.
-
•
Finding the optimal grid shift parameter by optimization instead of trying every grid shift parameter in a trial set.
6 Acknowledgement
We thank Tianyu Wang for helpful discussions. C.Y. acknowledges support from the National Natural Science Foundation of China (Grant No. 92165109), National Key Research and Development Program of China (Grant No. 2022YFA1404204), and Shanghai Municipal Science and Technology Major Project (Grant No. 2019SHZDZX01). C.Z. and J.T. acknowledge support from the U.S. National Science Foundation under Grant No. 2116246, the U.S. Department of Energy, Office of Science, National Quantum Information Science Research Centers, and Quantum Systems Accelerator.
References
- [1] A Yu Kitaev. Quantum measurements and the abelian stabilizer problem. arXiv preprint quant-ph/9511026, 1995.
- [2] Peter W Shor. Polynomial-time algorithms for prime factorization and discrete logarithms on a quantum computer. SIAM review, 41(2):303–332, 1999.
- [3] Daniel S Abrams and Seth Lloyd. Quantum algorithm providing exponential speed increase for finding eigenvalues and eigenvectors. Physical Review Letters, 83(24):5162, 1999.
- [4] Sam McArdle, Suguru Endo, Alán Aspuru-Guzik, Simon C Benjamin, and Xiao Yuan. Quantum computational chemistry. Reviews of Modern Physics, 92(1):015003, 2020.
- [5] Ruizhe Zhang, Guoming Wang, and Peter Johnson. Computing ground state properties with early fault-tolerant quantum computers. Quantum, 6:761, 2022.
- [6] Rolando D Somma. Quantum eigenvalue estimation via time series analysis. New Journal of Physics, 21(12):123025, 2019.
- [7] Lin Lin and Yu Tong. Heisenberg-limited ground-state energy estimation for early fault-tolerant quantum computers. PRX Quantum, 3(1):010318, 2022.
- [8] Guoming Wang, Daniel Stilck-França, Ruizhe Zhang, Shuchen Zhu, and Peter D Johnson. Quantum algorithm for ground state energy estimation using circuit depth with exponentially improved dependence on precision. arXiv preprint arXiv:2209.06811, 2022.
- [9] Thomas E O’Brien, Brian Tarasinski, and Barbara M Terhal. Quantum phase estimation of multiple eigenvalues for small-scale (noisy) experiments. New Journal of Physics, 21(2):023022, 2019.
- [10] Alicja Dutkiewicz, Barbara M. Terhal, and Thomas E O’Brien. Heisenberg-limited quantum phase estimation of multiple eigenvalues with few control qubits. Quantum, 6:830, oct 2022.
- [11] Michael A Nielsen and Isaac Chuang. Quantum computation and quantum information, 2010.
- [12] Zhiyan Ding and Lin Lin. Even shorter quantum circuit for phase estimation on early fault-tolerant quantum computers with applications to ground-state energy estimation. PRX Quantum, 4:020331, May 2023.
- [13] Hongkang Ni, Haoya Li, and Lexing Ying. On low-depth algorithms for quantum phase estimation. arXiv preprint arXiv:2302.02454, 2023.
- [14] Iulia M Georgescu, Sahel Ashhab, and Franco Nori. Quantum simulation. Reviews of Modern Physics, 86(1):153, 2014.
- [15] Andrew M Childs and Yuan Su. Nearly optimal lattice simulation by product formulas. Physical review letters, 123(5):050503, 2019.
- [16] Andrew M Childs, Yuan Su, Minh C Tran, Nathan Wiebe, and Shuchen Zhu. Theory of trotter error with commutator scaling. Physical Review X, 11(1):011020, 2021.
- [17] Haitham Hassanieh, Piotr Indyk, Dina Katabi, and Eric Price. Nearly optimal sparse fourier transform. In Proceedings of the forty-fourth annual ACM symposium on Theory of computing, pages 563–578, 2012.
- [18] William T Cochran, James W Cooley, David L Favin, Howard D Helms, Reginald A Kaenel, William W Lang, George C Maling, David E Nelson, Charles M Rader, and Peter D Welch. What is the fast fourier transform? Proceedings of the IEEE, 55(10):1664–1674, 1967.
- [19] Anna C Gilbert, Shan Muthukrishnan, and Martin Strauss. Improved time bounds for near-optimal sparse fourier representations. In Wavelets XI, volume 5914, pages 398–412. SPIE, 2005.
- [20] Piotr Indyk, Michael Kapralov, and Eric Price. (nearly) sample-optimal sparse fourier transform. In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms, pages 480–499. SIAM, 2014.
- [21] Wen**g Liao and Albert Fannjiang. Music for single-snapshot spectral estimation: Stability and super-resolution. Applied and Computational Harmonic Analysis, 40(1):33–67, 2016.
- [22] Richard Roy and Thomas Kailath. Esprit-estimation of signal parameters via rotational invariance techniques. IEEE Transactions on acoustics, speech, and signal processing, 37(7):984–995, 1989.
- [23] BL Higgins, DW Berry, SD Bartlett, MW Mitchell, HM Wiseman, and GJ Pryde. Demonstrating heisenberg-limited unambiguous phase estimation without adaptive measurements. New Journal of Physics, 11(7):073023, 2009.
- [24] Shelby Kimmel, Guang Hao Low, and Theodore J Yoder. Robust calibration of a universal single-qubit gate set via robust phase estimation. Physical Review A, 92(6):062315, 2015.
- [25] Federico Belliardo and Vittorio Giovannetti. Achieving heisenberg scaling with maximally entangled states: An analytic upper bound for the attainable root-mean-square error. Physical Review A, 102(4), oct 2020.
- [26] Zhiyan Ding and Lin Lin. Simultaneous estimation of multiple eigenvalues with short-depth quantum circuit on early fault-tolerant quantum computers. arXiv preprint arXiv:2303.05714, 2023.
- [27] Haoya Li, Hongkang Ni, and Lexing Ying. On low-depth quantum algorithms for robust multiple-phase estimation. arXiv preprint arXiv:2303.08099, 2023.
- [28] Emmanuel J Candes and Terence Tao. Near-optimal signal recovery from random projections: Universal encoding strategies? IEEE transactions on information theory, 52(12):5406–5425, 2006.
- [29] Emmanuel J Candes, Justin K Romberg, and Terence Tao. Stable signal recovery from incomplete and inaccurate measurements. Communications on Pure and Applied Mathematics: A Journal Issued by the Courant Institute of Mathematical Sciences, 59(8):1207–1223, 2006.
- [30] Emmanuel J Candès, Justin Romberg, and Terence Tao. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Transactions on information theory, 52(2):489–509, 2006.
- [31] David Gross, Yi-Kai Liu, Steven T Flammia, Stephen Becker, and Jens Eisert. Quantum state tomography via compressed sensing. Physical review letters, 105(15):150401, 2010.
- [32] A. Smith, C. A. Riofrí o, B. E. Anderson, H. Sosa-Martinez, I. H. Deutsch, and P. S. Jessen. Quantum state tomography by continuous measurement and compressed sensing. Physical Review A, 87(3), mar 2013.
- [33] Amir Kalev, Robert L. Kosut, and Ivan H. Deutsch. Quantum tomography protocols with positivity are compressed sensing protocols. npj Quantum Information, 1(1):15018, Dec 2015.
- [34] https://github.com/CYI1995/QEEP/tree/main/Paper_QPE.
- [35] Gongguo Tang, Badri Narayan Bhaskar, Parikshit Shah, and Benjamin Recht. Compressed sensing off the grid. IEEE transactions on information theory, 59(11):7465–7490, 2013.
- [36] Joel A Tropp et al. An introduction to matrix concentration inequalities. Foundations and Trends® in Machine Learning, 8(1-2):1–230, 2015.
- [37] Yohann De Castro and Fabrice Gamboa. Exact reconstruction using beurling minimal extrapolation. Journal of Mathematical Analysis and applications, 395(1):336–354, 2012.
- [38] Clarice Poon, Nicolas Keriven, and Gabriel Peyré. The geometry of off-the-grid compressed sensing. Foundations of Computational Mathematics, 23(1):241–327, 2023.
Appendix A Proof of Lemma 1
The outline of proof largely follows Lemma 6.6 of [28].
Proof.
By the definition of , we can verify that . For all , we have
(65) |
Our target is to estimate . Using
(66) | |||
(67) | |||
(68) |
we obtain
(69) |
Using Bernstein’s inequality, the first term is bounded by
(70) |
To estimate the second term, we associate with the summation of a sequence of random matrices:
(71) |
where are defined in Eq. (18). Because when , , the summation of is
(72) |
which implies
(73) |
The matrix Bernstein’s inequality (Theorem 1.6.2 of [36]) states that
(74) |
with
(75) |
Choose , we obtain
(76) |
We want the to be smaller than . This condition leads to
(77) | |||
(78) |
Therefore, if we choose
(79) |
then can serve as a dual certificate with probability at least .
∎
Appendix B Proof of Lemma 2
Proof.
In this problem, we have
(80) |
According to Lemma 1, with high probability the dual certificate satisfies
(81) |
Calculate the inner product to obtain
(82) | ||||
Here comes from the fact that can have a small deviation that satisfies . Furthermore,
(83) |
Thus,
(84) |
On the other hand, is a feasible solution, hence , and
(85) |
Combining them together, we obtain
(86) |
∎
Appendix C Concentration of
Here we define
(87) |
then we can prove
Lemma 3.
For , consider the summation of random variables
(88) |
with being a parameter in region and is defined in Eq. (18). For each , with probability at least , we have
(89) |
With probability at least , we have for all ,
(90) |
Proof.
Start with the expectation value of :
(91) | ||||
(92) | ||||
(93) | ||||
(94) |
We verify that when , , thus ; when , we use
(95) |
Notice that is the Dirichlet kernel, which is close to a delta function. With , one can verify that for , because the second peak of the Dirichlet kernel is in region , we have
(96) | |||
(97) |
Therefore, when , we have , and for all . By Bernstein’s inequality,
(98) |
After calculation, is close to . Thus, with probability at least
(99) |
we have
(100) |
This is the first conclusion. In the other case where , the function satisfies
(101) |
therefore we have
(102) |
Similarly, using Bernstein’s inequality, we obtain
(103) |
Because , with probability at least , we have
(104) | |||
(105) |
After simplification, the second conclusion is obtained. ∎
Appendix D Off-grid compressed sensing
In this section we discuss an off-grid compressed sensing algorithm for the multiple eigenvalue estimation. Although the algorithm we will be describing works well numerically, the rigorous proof of its Heisenberg limit is hard, the full investigation of which will be left for future work.
In regular compressed sensing, the optimization algorithm Eq. (21) can be rewritten in the following concise form:
(106) |
where is the time-domain ansatz of the signal instead of the frequency-domain ansatz. The idea of the off-grid compressed sensing is similar, except we need another vector norm for the continuous values of frequencies [35]:
(107) | |||
(108) |
The new vector norm is called the atomic norm, which can be written as the solution of an SDP (see Proposition 2.1 of [35]):
(109) |
where is an -dimensional vector (in the off-grid case, instead of choosing from , we allow it to take values from to ), is a real number, and represents a Toeplitz matrix whose first column is :
(110) |
Thus, the full optimization task for the off-grid compressed sensing is
(111) |
Suppose the error tolerance for each signal is , then the robust version that allows small noise is given as
(112) |
with . This is the so-called Beurling-Lasso algorithm [37].
The next theorem shows the effciency of the algorithm, which is simply a rewriting of the results in [38] with our notations.
Theorem 2.
Suppose we have the signal
(113) |
where are positive numbers with summation 1, . We know the values of on the set , a set of integers sampled from to . The solution to Eq. (112) can be decomposed as
(114) |
To quantify the similarity between and , define
(115) | |||
(116) | |||
(117) |
Then the estimator for is
(118) |
and the estimator for is
(119) |
If , and
(120) |
introduce a constant , then we have
(121) | |||
(122) |
with probability at least .
The more detailed discussions can be found in Proposition 1, Proposition 2, and Appendix C of [38]. Roughly speaking, similarly to the regular compressed sensing, random samples are enough to approximately recover the noisy signal. But the output of the off-grid compressed sensing is an estimate of the time-domain signal, not the frequency-domain signal. We need an extra step to recover the frequency support. In signal recovery, when the minimal frequency gap of the signal has lower bound , we say the frequency support is well-separated and can be recovered easily. However, we do not have any conclusion about the minimal frequency gap of the output signal , thus the original method in [35] that depends on the reconstruction of dual certificate is not guaranteed to work. Besides, in practice the performance of the original algorithm is very sensitive to noise, which makes it impractical. We need a different post-processing procedure to recover an estimation of from .
The multiple signal classification (MUSIC) algorithm [21] is one strong candidate for this. The algorithm is constructed on the Hankel matrix that is defined as (when takes value from to )
(123) |
For a noisy signal , we can further separate the Hankel matrix into two parts:
(124) |
The Hankel matrix has the following decomposition:
(125) | ||||
(126) |
Therefore, the set of can be found in the space spanned by the vectors with the largest singular values of . Define the projector onto the null space of as . Define the noise-space correlation function as
(127) |
It turns out if and only if (see Theorem 1 of [21]). Similarly, we can define as the noise-space correlation function for , and as long as does not perturb the null space too much, the local minima of can serve as estimator for .
Combining the MUSIC algorithm with the off-grid compressed sensing we obtain Algorithm 2 for the QEEP. A numerical test that demonstrates the efficiency of the algorithm is shown in Fig. 5.
![Refer to caption](extracted/5304001/off_grid.jpeg)