The In-Medium Similarity Renormalization Group at Finite Temperature

Isaac G. Smith, Heiko Hergert, Scott K. Bogner Facility for Rare Isotope Beams, Michigan State University, East Lansing, MI 48824 Department of Physics and Astronomy, Michigan State University, East Lansing, MI 48824
Abstract

The study of nuclei at finite temperature is of immense interest for many areas of nuclear astrophysics and nuclear-reaction science. A variety of ab initio methods are now available for computing the properties of nuclei from interactions rooted in Quantum Chromodynamics, but applications have largely been limited to zero temperature. In the present work, we extend one such method, the In-Medium Similarity Renormalization Group (IMSRG), to finite temperature. Using an exactly-solvable schematic model that captures essential features of nuclear interactions, we show that the FT-IMSRG can accurately determine the energetics of nuclei at finite temperature, and we explore the accuracy of the FT-IMSRG in different parameter regimes, e.g., strong and weak pairing. In anticipation of FT-IMSRG applications for finite nuclei and infinite matter, we discuss differences arising from the choice of working with the canonical and the grand canonical ensembles. In future work, we will apply the FT-IMSRG with realistic nuclear interactions to compute nuclear structure and reaction properties at finite temperature, which are important ingredients for understanding nucleosynthesis in stellar environments, or modeling reactions of hot compound nuclei.

I Introduction

Efforts to describe the properties of atomic nuclei based on nuclear forces that are rooted in Quantum Chromodynamics have made significant progress in recent decades. So-called ab initio nuclear many-body calculations have been performed for hundreds of nuclei up to the Z50similar-to𝑍50Z\sim 50italic_Z ∼ 50 region, and results for even heavier nuclei are published with increasing frequency [1, 2, 3, 4, 5, 6, 7, 8, 9]. The main challenge for most of the methods that are used in these kinds of calculations is the sheer size of the many-body Hilbert spaces, which need to encompass tens or hundreds of nucleons and their degrees of freedom, as modeled by a chosen single-particle basis. The dimension of the many-body basis scales as (NA)binomial𝑁𝐴\binom{N}{A}( FRACOP start_ARG italic_N end_ARG start_ARG italic_A end_ARG ) with the number of (indistinguishable) particles A𝐴Aitalic_A and single-particle states N𝑁Nitalic_N, hence exact solutions of the (stationary) nuclear Schrödinger equation through diagonalization of the Hamiltonian matrix are only feasible for nuclei of mass A20𝐴20A\leq 20italic_A ≤ 20. In order to efficiently study heavier nuclei from first principles, it is necessary to develop methods to approximate the solution to the many-body problem in polynomial time. Several such methods have been developed that can extract properties of specific energy eigenstates, most frequently the ground state (see, e.g., [1] and references therein).

The common first step in a large number of many-body approaches is the construction of a reference state for the many-body basis via a Hartree-Fock (HF) calculation (see, e.g., [10]). Hartree-Fock is a variational method to approximate the ground state of a many-body Hamiltonian as a single Slater determinant by constructing an optimal single-particle basis that minimizes the energy at the mean field level. The HF solution then serves as the basis for so-called beyond mean-field methods, which improve the approximation of the ground state in a systematic fashion and converge to the exact solution in a well-defined way. Approaches like many-body perturbation theory (MBPT) or the non-perturbative Coupled Cluster (CC) and Self-Consistent Green’s Function (SCGF) methods have been very successful in approximating solutions to the nuclear many-body problem (see [1, 11, 12] and references therein).

The beyond mean-field method we will focus on in this work is the In-Medium Similarity Renormalization Group (IMSRG) [13, 14, 15, 1]. The IMSRG applies a continuous unitary transformation to the Hamiltonian, with the goal of extracting the ground state energy (or potentially the energy of a selected excited state, see [16]). Its variants have been used with great success in the prediction of ground state and excited state properties in a wide range of nuclei [14, 15, 17, 18, 6, 7, 8, 19].

Thus far, applications of the IMSRG and other modern beyond-mean field methods for finite nuclei have been limited to zero temperature, although ab initio studies of infinite matter at finite temperature based on modern nuclear interactions have been performed with a number of methods (see [20] and references therein). Extensions of mean-field methods, e.g., the Finite Temperature HF (FT-HF) and Hartree-Fock-Bogoliubov (FT-HFB) [21, 22, 23], and beyond mean-field methods like Shell-Model Monte Carlo [24, 25] have been developed decades ago, but they rely on schematic Hamiltonians or effective interactions whose parameters are fitted to data. More recently, several groups have studied nuclear ground-state properties and their response using non-relativistic and relativistic energy density functionals as input [26, 27, 28, 29, 30, 31].

In this work, we present the formalism for the finite-temperature extension of the IMSRG (FT-IMSRG), and assess its performance using an exactly solvable schematic model [32]. Our goal is to set the stage for ab initio calculations of nuclear properties, decay and reaction rates at finite temperatures. They are important ingredients for understanding nucleosynthesis processes in hot, stellar environments [33, 34], or reactions involving hot compound nuclei, e.g., neutron-induced fission [35]. Such efforts will be aided by the IMSRG’s capabilities for tracking how correlations are resummed into effective in-medium interactions, akin to the spirit of nuclear Density Functional Theory (DFT) [14, 36]. This will allow us to link the aforementioned finite-temperature DFT work and ab initio methodology, and provide insight into the successes and failures of either approach.

Our discussion of the FT-IMSRG is organized as follows. In Section II, we present a summary of the IMSRG at zero temperature, before describing our implementation of FT-HF in Section III.1 and the FT-IMSRG flow equations in Section III.2. In Section IV, we assess the performance of the FT-IMSRG: We introduce our schematic model in Section IV.1 and present results for a system of four fermions in eight single-particle states in Section IV.2, before discussing larger systems in Section IV.3. In Section IV.4, we compare results obtained by working in the canonical and grand canonical ensembles, respectively. In Section IV.5, we demonstrate the computation of free energy from the FT-IMSRG results. Finally, we conclude in Section V. Appendix A provides additional details on the calculation of occupation numbers in the canonical and grand-canonical ensembles.

II Zero-Temperature IMSRG

The first step in setting up a zero-temperature IMSRG calculation is the choice of a reference state |ΦketΦ|\Phi\rangle| roman_Φ ⟩ [14], e.g., a HF Slater determinant serving as a first approximation to the system’s ground state. A complete many-body basis can then be constructed from |ΦketΦ|\Phi\rangle| roman_Φ ⟩, its 1-particle-1-hole excitations, its 2-particle-2-hole excitations, and so on. The goal of the IMSRG is to continuously apply a unitary transformation U(s)𝑈𝑠U(s)italic_U ( italic_s ) to the Hamiltonian in order to decouple |ΦketΦ|\Phi\rangle| roman_Φ ⟩ from its excitations as s𝑠s\rightarrow\inftyitalic_s → ∞. This is shown schematically in Fig. 1.

Refer to caption
Figure 1: A schematic view of IMSRG decoupling [14]. The many-body Hamiltonian is depicted in blocks, with the reference state |ΦketΦ|\Phi\rangle| roman_Φ ⟩ and its 1-particle-1-hole excitations, 2-particle-2-hole excitations, and 3-particle-3-hole excitations (see text). During the IMSRG flow, the reference state is decoupled from its excitations.

To do this, we first express the Hamiltonian as

H^=K^+V^=pqKpqapaq+14pqrsVpqrsapaqasar.^𝐻^𝐾^𝑉subscript𝑝𝑞subscript𝐾𝑝𝑞subscriptsuperscript𝑎𝑝subscript𝑎𝑞14subscript𝑝𝑞𝑟𝑠subscript𝑉𝑝𝑞𝑟𝑠subscriptsuperscript𝑎𝑝subscriptsuperscript𝑎𝑞subscript𝑎𝑠subscript𝑎𝑟\hat{H}=\hat{K}+\hat{V}=\sum_{pq}K_{pq}a^{\dagger}_{p}a_{q}+\frac{1}{4}\sum_{% pqrs}V_{pqrs}a^{\dagger}_{p}a^{\dagger}_{q}a_{s}a_{r}\,.over^ start_ARG italic_H end_ARG = over^ start_ARG italic_K end_ARG + over^ start_ARG italic_V end_ARG = ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 4 end_ARG ∑ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT . (1)

Here, we use K^^𝐾\hat{K}over^ start_ARG italic_K end_ARG for the kinetic energy to avoid confusion with the temperature later on, and we adopt chemistry conventions for labeling the HF single-particle states, where a,b,𝑎𝑏a,b,\ldotsitalic_a , italic_b , … refer to unoccupied (particle) states, i,j,𝑖𝑗i,j,\ldotsitalic_i , italic_j , … to occupied (hole) states, and p,q,𝑝𝑞p,q,\ldotsitalic_p , italic_q , … run over the entire basis. Next, we introduce normal-ordering with respect to the reference state |ΦketΦ|\Phi\rangle| roman_Φ ⟩, defined by

{apaq}apaqΦ|apaq|Φ=apaqρpq,superscriptsubscript𝑎𝑝subscript𝑎𝑞superscriptsubscript𝑎𝑝subscript𝑎𝑞quantum-operator-productΦsuperscriptsubscript𝑎𝑝subscript𝑎𝑞Φsuperscriptsubscript𝑎𝑝subscript𝑎𝑞subscript𝜌𝑝𝑞\{a_{p}^{\dagger}a_{q}\}\equiv a_{p}^{\dagger}a_{q}-\langle\Phi|a_{p}^{\dagger% }a_{q}|\Phi\rangle=a_{p}^{\dagger}a_{q}-\rho_{pq},{ italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT } ≡ italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT - ⟨ roman_Φ | italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT | roman_Φ ⟩ = italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT - italic_ρ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT , (2)

where Φ|apaq|Φquantum-operator-productΦsuperscriptsubscript𝑎𝑝subscript𝑎𝑞Φ\langle\Phi|a_{p}^{\dagger}a_{q}|\Phi\rangle⟨ roman_Φ | italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT | roman_Φ ⟩ is also called the (Wick) contraction of apsuperscriptsubscript𝑎𝑝a_{p}^{\dagger}italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT and aqsubscript𝑎𝑞a_{q}italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT, which is equal to the one-body density matrix ρpqsubscript𝜌𝑝𝑞\rho_{pq}italic_ρ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT of the reference state. In the HF basis, this density matrix is diagonal,

ρpqHF=npδpq,superscriptsubscript𝜌𝑝𝑞HFsubscript𝑛𝑝subscript𝛿𝑝𝑞\rho_{pq}^{\textrm{HF}}=n_{p}\delta_{pq},italic_ρ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT HF end_POSTSUPERSCRIPT = italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT , (3)

and its eigenvalues, the occupation numbers npsubscript𝑛𝑝n_{p}italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT, are 1111 when the pthsuperscript𝑝thp^{\rm th}italic_p start_POSTSUPERSCRIPT roman_th end_POSTSUPERSCRIPT single-particle state is occupied in the reference state (hole state) and 00 when the pthsuperscript𝑝thp^{\rm th}italic_p start_POSTSUPERSCRIPT roman_th end_POSTSUPERSCRIPT single-particle state is unoccupied in the reference state (particle state). A detailed description of this normal ordering scheme, including the normal ordering of general A-body operators, can be found in [14, 15].

We then write the Hamiltonian in terms of normal-ordered operators:

H^^𝐻\displaystyle\hat{H}over^ start_ARG italic_H end_ARG =E^+f^+Γ^absent^𝐸^𝑓^Γ\displaystyle=\hat{E}+\hat{f}+\hat{\Gamma}= over^ start_ARG italic_E end_ARG + over^ start_ARG italic_f end_ARG + over^ start_ARG roman_Γ end_ARG
=E+pqfpq{apaq}+14pqrsΓpqrs{apaqasar}.absent𝐸subscript𝑝𝑞subscript𝑓𝑝𝑞subscriptsuperscript𝑎𝑝subscript𝑎𝑞14subscript𝑝𝑞𝑟𝑠subscriptΓ𝑝𝑞𝑟𝑠subscriptsuperscript𝑎𝑝subscriptsuperscript𝑎𝑞subscript𝑎𝑠subscript𝑎𝑟\displaystyle=E+\sum_{pq}f_{pq}\{a^{\dagger}_{p}a_{q}\}+\frac{1}{4}\sum_{pqrs}% \Gamma_{pqrs}\{a^{\dagger}_{p}a^{\dagger}_{q}a_{s}a_{r}\}.= italic_E + ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT } + divide start_ARG 1 end_ARG start_ARG 4 end_ARG ∑ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT } . (4)

The 0-, 1-, and 2-body parts of the normal-ordered Hamiltonian are respectively given by

E𝐸\displaystyle Eitalic_E =pKppnp+12pqVpqpqnpnqabsentsubscript𝑝subscript𝐾𝑝𝑝subscript𝑛𝑝12subscript𝑝𝑞subscript𝑉𝑝𝑞𝑝𝑞subscript𝑛𝑝subscript𝑛𝑞\displaystyle=\sum_{p}K_{pp}n_{p}+\frac{1}{2}\sum_{pq}V_{pqpq}n_{p}n_{q}= ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_p italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_p italic_q italic_p italic_q end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT (5)
fpqsubscript𝑓𝑝𝑞\displaystyle f_{pq}italic_f start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT =Kpq+pVprqrnrabsentsubscript𝐾𝑝𝑞subscript𝑝subscript𝑉𝑝𝑟𝑞𝑟subscript𝑛𝑟\displaystyle=K_{pq}+\sum_{p}V_{prqr}n_{r}= italic_K start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_p italic_r italic_q italic_r end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT (6)
ΓpqrssubscriptΓ𝑝𝑞𝑟𝑠\displaystyle\Gamma_{pqrs}roman_Γ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT =Vpqrs,absentsubscript𝑉𝑝𝑞𝑟𝑠\displaystyle=V_{pqrs},= italic_V start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT , (7)

which can be derived directly from the n-body normal ordering in [14]. We see that E𝐸Eitalic_E, f𝑓fitalic_f, and ΓΓ\Gammaroman_Γ now depend on the “medium” via the dependence on the occupation numbers of the reference state. We note that

E=Φ|H^|Φ,𝐸quantum-operator-productΦ^𝐻ΦE=\langle\Phi|\hat{H}|\Phi\rangle,italic_E = ⟨ roman_Φ | over^ start_ARG italic_H end_ARG | roman_Φ ⟩ , (8)

making E𝐸Eitalic_E the expectation value of energy in the reference state. When we evolve the Hamiltonian as shown in Fig. 1, E𝐸Eitalic_E will evolve towards an exact eigenvalue of H^^𝐻\hat{H}over^ start_ARG italic_H end_ARG (up to truncation effects), which is usually the desired ground-state energy. In the present work, we only deal with schematic models which include one- and two-body interactions in the initial Hamiltonian. Three-body interactions in the initial Hamiltonian can also be accounted for in the normal ordered 0-, 1-, and 2-body terms, see [14, 15].

The IMSRG flow is defined through the operator differential equation

ddsH^(s)=[η^(s),H^(s)]𝑑𝑑𝑠^𝐻𝑠^𝜂𝑠^𝐻𝑠\frac{d}{ds}\hat{H}(s)=[\hat{\eta}(s),\hat{H}(s)]divide start_ARG italic_d end_ARG start_ARG italic_d italic_s end_ARG over^ start_ARG italic_H end_ARG ( italic_s ) = [ over^ start_ARG italic_η end_ARG ( italic_s ) , over^ start_ARG italic_H end_ARG ( italic_s ) ] (9)

where the generator η^^𝜂\hat{\eta}over^ start_ARG italic_η end_ARG is a function of the Hamiltonian that is chosen to induce the wanted behavior out of the IMSRG flow, as will be discussed below. We note that for this flow to be unitary, the generator must be anti-Hermitian. Plugging the Hamiltonian (1) and a similarly structured η^^𝜂\hat{\eta}over^ start_ARG italic_η end_ARG into the flow equation (9) at s=0𝑠0s=0italic_s = 0, we immediately find that the commutator includes three-body operators. These and further higher-rank contributions to the Hamiltonian will be induced by the IMSRG flow as we evolve. In order to close the system of flow equations, we truncate operators at the normal-ordered two-body level, in the so-called normal-ordered two-body (NO2B) approximation. We then have the IMSRG(2) flow equations:

dEds𝑑𝐸𝑑𝑠\displaystyle\frac{dE}{ds}divide start_ARG italic_d italic_E end_ARG start_ARG italic_d italic_s end_ARG =pq(npnq)ηpqfqpabsentsubscript𝑝𝑞subscript𝑛𝑝subscript𝑛𝑞subscript𝜂𝑝𝑞subscript𝑓𝑞𝑝\displaystyle=\sum_{pq}(n_{p}-n_{q})\eta_{pq}f_{qp}= ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ) italic_η start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_q italic_p end_POSTSUBSCRIPT
+12pqrsηpqrsΓrspqnpnqn¯rn¯s12subscript𝑝𝑞𝑟𝑠subscript𝜂𝑝𝑞𝑟𝑠subscriptΓ𝑟𝑠𝑝𝑞subscript𝑛𝑝subscript𝑛𝑞subscript¯𝑛𝑟subscript¯𝑛𝑠\displaystyle+\frac{1}{2}\sum_{pqrs}\eta_{pqrs}\Gamma_{rspq}n_{p}n_{q}\bar{n}_% {r}\bar{n}_{s}+ divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_r italic_s italic_p italic_q end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT (10)
dfpqds𝑑subscript𝑓𝑝𝑞𝑑𝑠\displaystyle\frac{df_{pq}}{ds}divide start_ARG italic_d italic_f start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT end_ARG start_ARG italic_d italic_s end_ARG =r(1+Ppq)ηprfrqabsentsubscript𝑟1subscript𝑃𝑝𝑞subscript𝜂𝑝𝑟subscript𝑓𝑟𝑞\displaystyle=\sum_{r}(1+P_{pq})\eta_{pr}f_{rq}= ∑ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( 1 + italic_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) italic_η start_POSTSUBSCRIPT italic_p italic_r end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_r italic_q end_POSTSUBSCRIPT
+rs(nrns)(ηrsΓsprqfrsηsprq)subscript𝑟𝑠subscript𝑛𝑟subscript𝑛𝑠subscript𝜂𝑟𝑠subscriptΓ𝑠𝑝𝑟𝑞subscript𝑓𝑟𝑠subscript𝜂𝑠𝑝𝑟𝑞\displaystyle+\sum_{rs}(n_{r}-n_{s})(\eta_{rs}\Gamma_{sprq}-f_{rs}\eta_{sprq})+ ∑ start_POSTSUBSCRIPT italic_r italic_s end_POSTSUBSCRIPT ( italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) ( italic_η start_POSTSUBSCRIPT italic_r italic_s end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_s italic_p italic_r italic_q end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_r italic_s end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_s italic_p italic_r italic_q end_POSTSUBSCRIPT )
+12rst(nrnsn¯t+n¯rn¯snt)(1+Ppq)ηtprsΓrstq12subscript𝑟𝑠𝑡subscript𝑛𝑟subscript𝑛𝑠subscript¯𝑛𝑡subscript¯𝑛𝑟subscript¯𝑛𝑠subscript𝑛𝑡1subscript𝑃𝑝𝑞subscript𝜂𝑡𝑝𝑟𝑠subscriptΓ𝑟𝑠𝑡𝑞\displaystyle+\frac{1}{2}\sum_{rst}(n_{r}n_{s}\bar{n}_{t}+\bar{n}_{r}\bar{n}_{% s}n_{t})(1+P_{pq})\eta_{tprs}\Gamma_{rstq}+ divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_r italic_s italic_t end_POSTSUBSCRIPT ( italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ( 1 + italic_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) italic_η start_POSTSUBSCRIPT italic_t italic_p italic_r italic_s end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_r italic_s italic_t italic_q end_POSTSUBSCRIPT (11)
dΓpqrsds𝑑subscriptΓ𝑝𝑞𝑟𝑠𝑑𝑠\displaystyle\frac{d\Gamma_{pqrs}}{ds}divide start_ARG italic_d roman_Γ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT end_ARG start_ARG italic_d italic_s end_ARG =t[(1Ppq)(ηptΓtqrsfptηtqrs)\displaystyle=\sum_{t}\bigg{[}(1-P_{pq})(\eta_{pt}\Gamma_{tqrs}-f_{pt}\eta_{% tqrs})= ∑ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT [ ( 1 - italic_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) ( italic_η start_POSTSUBSCRIPT italic_p italic_t end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_t italic_q italic_r italic_s end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_p italic_t end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_t italic_q italic_r italic_s end_POSTSUBSCRIPT )
(1Prs)(ηtrΓpqtsftrηpqts)]\displaystyle\qquad-(1-P_{rs})(\eta_{tr}\Gamma_{pqts}-f_{tr}\eta_{pqts})\bigg{]}- ( 1 - italic_P start_POSTSUBSCRIPT italic_r italic_s end_POSTSUBSCRIPT ) ( italic_η start_POSTSUBSCRIPT italic_t italic_r end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_p italic_q italic_t italic_s end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_t italic_r end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_p italic_q italic_t italic_s end_POSTSUBSCRIPT ) ]
+tu[12(1ntnu)(ηpqtuΓtursΓpqtuηturs)\displaystyle+\sum_{tu}\bigg{[}\frac{1}{2}(1-n_{t}-n_{u})(\eta_{pqtu}\Gamma_{% turs}-\Gamma_{pqtu}\eta_{turs})+ ∑ start_POSTSUBSCRIPT italic_t italic_u end_POSTSUBSCRIPT [ divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( 1 - italic_n start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) ( italic_η start_POSTSUBSCRIPT italic_p italic_q italic_t italic_u end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_t italic_u italic_r italic_s end_POSTSUBSCRIPT - roman_Γ start_POSTSUBSCRIPT italic_p italic_q italic_t italic_u end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_t italic_u italic_r italic_s end_POSTSUBSCRIPT )
+(ntnu)(1Ppq)(1Prs)ηtpurΓuqts].\displaystyle\qquad+(n_{t}-n_{u})(1-P_{pq})(1-P_{rs})\eta_{tpur}\Gamma_{uqts}% \bigg{]}.+ ( italic_n start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) ( 1 - italic_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) ( 1 - italic_P start_POSTSUBSCRIPT italic_r italic_s end_POSTSUBSCRIPT ) italic_η start_POSTSUBSCRIPT italic_t italic_p italic_u italic_r end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_u italic_q italic_t italic_s end_POSTSUBSCRIPT ] . (12)

Here, n¯p=1npsubscript¯𝑛𝑝1subscript𝑛𝑝\bar{n}_{p}=1-n_{p}over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = 1 - italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT is the hole occupation number, and Ppqsubscript𝑃𝑝𝑞P_{pq}italic_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT is an operator which switches the p𝑝pitalic_p and q𝑞qitalic_q indices in subsequent expressions. Naively, with a single-particle basis size N𝑁Nitalic_N, these equations scale computationally as 𝒪(N6)𝒪superscript𝑁6\mathcal{O}(N^{6})caligraphic_O ( italic_N start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT ), but by differentiating between particle and hole states, it is possible to reduce this to 𝒪(Np4Nh2)𝒪superscriptsubscript𝑁𝑝4superscriptsubscript𝑁2\mathcal{O}(N_{p}^{4}N_{h}^{2})caligraphic_O ( italic_N start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), were Npsubscript𝑁𝑝N_{p}italic_N start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and Nhsubscript𝑁N_{h}italic_N start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT are respectively the number of particle and hole states [16].

There are several generators that can be used to get the desired decoupling. We seek to decouple the reference state and its particle-hole excitations, which means we want to suppress the so-called “off-diagonal” matrix elements of the form

Φ|H^{aaai}|Φ=faiquantum-operator-productΦ^𝐻subscriptsuperscript𝑎𝑎subscript𝑎𝑖Φsubscript𝑓𝑎𝑖\langle\Phi|\hat{H}\{a^{\dagger}_{a}a_{i}\}|\Phi\rangle=f_{ai}⟨ roman_Φ | over^ start_ARG italic_H end_ARG { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } | roman_Φ ⟩ = italic_f start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT (13)

and

Φ|H^{aaabajai}|Φ=Γabij.quantum-operator-productΦ^𝐻subscriptsuperscript𝑎𝑎subscriptsuperscript𝑎𝑏subscript𝑎𝑗subscript𝑎𝑖ΦsubscriptΓ𝑎𝑏𝑖𝑗\langle\Phi|\hat{H}\{a^{\dagger}_{a}a^{\dagger}_{b}a_{j}a_{i}\}|\Phi\rangle=% \Gamma_{abij}.⟨ roman_Φ | over^ start_ARG italic_H end_ARG { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } | roman_Φ ⟩ = roman_Γ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT . (14)

To achieve this [14, 15], we use a generator of the form

η^^𝜂\displaystyle\hat{\eta}over^ start_ARG italic_η end_ARG =η^1B+η^2Babsentsuperscript^𝜂1Bsuperscript^𝜂2B\displaystyle=\hat{\eta}^{\textrm{1B}}+\hat{\eta}^{\textrm{2B}}= over^ start_ARG italic_η end_ARG start_POSTSUPERSCRIPT 1B end_POSTSUPERSCRIPT + over^ start_ARG italic_η end_ARG start_POSTSUPERSCRIPT 2B end_POSTSUPERSCRIPT
=aiηai{aaai}+14abijηabij{aaabajai}absentsubscript𝑎𝑖subscript𝜂𝑎𝑖subscriptsuperscript𝑎𝑎subscript𝑎𝑖14subscript𝑎𝑏𝑖𝑗subscript𝜂𝑎𝑏𝑖𝑗subscriptsuperscript𝑎𝑎subscriptsuperscript𝑎𝑏subscript𝑎𝑗subscript𝑎𝑖\displaystyle=\sum_{ai}\eta_{ai}\{a^{\dagger}_{a}a_{i}\}+\frac{1}{4}\sum_{abij% }\eta_{abij}\{a^{\dagger}_{a}a^{\dagger}_{b}a_{j}a_{i}\}= ∑ start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } + divide start_ARG 1 end_ARG start_ARG 4 end_ARG ∑ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }
H.c.,H.c.\displaystyle\qquad-\textrm{H.c.},- H.c. , (15)

where ηaisubscript𝜂𝑎𝑖\eta_{ai}italic_η start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT and ηabijsubscript𝜂𝑎𝑏𝑖𝑗\eta_{abij}italic_η start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT are related respectively to the off-diagonal matrix elements we seek to eliminate, faisubscript𝑓𝑎𝑖f_{ai}italic_f start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT and ΓabijsubscriptΓ𝑎𝑏𝑖𝑗\Gamma_{abij}roman_Γ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT.

A simple yet effective generator is the so-called White generator [14, 37]

η^Wsuperscript^𝜂W\displaystyle\hat{\eta}^{\textrm{W}}over^ start_ARG italic_η end_ARG start_POSTSUPERSCRIPT W end_POSTSUPERSCRIPT =aifaiΔai{aaai}+14abijΓabijΔabij{aaabajai}absentsubscript𝑎𝑖subscript𝑓𝑎𝑖subscriptΔ𝑎𝑖subscriptsuperscript𝑎𝑎subscript𝑎𝑖14subscript𝑎𝑏𝑖𝑗subscriptΓ𝑎𝑏𝑖𝑗subscriptΔ𝑎𝑏𝑖𝑗subscriptsuperscript𝑎𝑎subscriptsuperscript𝑎𝑏subscript𝑎𝑗subscript𝑎𝑖\displaystyle=\sum_{ai}\frac{f_{ai}}{\Delta_{ai}}\{a^{\dagger}_{a}a_{i}\}+% \frac{1}{4}\sum_{abij}\frac{\Gamma_{abij}}{\Delta_{abij}}\{a^{\dagger}_{a}a^{% \dagger}_{b}a_{j}a_{i}\}= ∑ start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT divide start_ARG italic_f start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT end_ARG start_ARG roman_Δ start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT end_ARG { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } + divide start_ARG 1 end_ARG start_ARG 4 end_ARG ∑ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT divide start_ARG roman_Γ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG roman_Δ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT end_ARG { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }
H.c.,H.c.\displaystyle\qquad-\textrm{H.c.},- H.c. , (16)

where

ΔaisubscriptΔ𝑎𝑖\displaystyle\Delta_{ai}roman_Δ start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT =Φ|{aiaa}H^{aaai}|ΦΦ|H^|Φabsentquantum-operator-productΦsubscriptsuperscript𝑎𝑖subscript𝑎𝑎^𝐻subscriptsuperscript𝑎𝑎subscript𝑎𝑖Φquantum-operator-productΦ^𝐻Φ\displaystyle=\langle\Phi|\{a^{\dagger}_{i}a_{a}\}\hat{H}\{a^{\dagger}_{a}a_{i% }\}|\Phi\rangle-\langle\Phi|\hat{H}|\Phi\rangle= ⟨ roman_Φ | { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT } over^ start_ARG italic_H end_ARG { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } | roman_Φ ⟩ - ⟨ roman_Φ | over^ start_ARG italic_H end_ARG | roman_Φ ⟩
=faafiiΓaiaiabsentsubscript𝑓𝑎𝑎subscript𝑓𝑖𝑖subscriptΓ𝑎𝑖𝑎𝑖\displaystyle=f_{aa}-f_{ii}-\Gamma_{aiai}= italic_f start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_i italic_i end_POSTSUBSCRIPT - roman_Γ start_POSTSUBSCRIPT italic_a italic_i italic_a italic_i end_POSTSUBSCRIPT (17)

is the unperturbed energy difference between the reference state and its p-h excitation, and

ΔabijsubscriptΔ𝑎𝑏𝑖𝑗\displaystyle\Delta_{abij}roman_Δ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT =Φ|{aiajabaa}H^{aaabajai}|ΦΦ|H^|Φabsentquantum-operator-productΦsubscriptsuperscript𝑎𝑖subscriptsuperscript𝑎𝑗subscript𝑎𝑏subscript𝑎𝑎^𝐻subscriptsuperscript𝑎𝑎subscriptsuperscript𝑎𝑏subscript𝑎𝑗subscript𝑎𝑖Φquantum-operator-productΦ^𝐻Φ\displaystyle=\langle\Phi|\{a^{\dagger}_{i}a^{\dagger}_{j}a_{b}a_{a}\}\hat{H}% \{a^{\dagger}_{a}a^{\dagger}_{b}a_{j}a_{i}\}|\Phi\rangle-\langle\Phi|\hat{H}|\Phi\rangle= ⟨ roman_Φ | { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT } over^ start_ARG italic_H end_ARG { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } | roman_Φ ⟩ - ⟨ roman_Φ | over^ start_ARG italic_H end_ARG | roman_Φ ⟩
=faa+fbbfiifjj+Γabab+Γijijabsentsubscript𝑓𝑎𝑎subscript𝑓𝑏𝑏subscript𝑓𝑖𝑖subscript𝑓𝑗𝑗subscriptΓ𝑎𝑏𝑎𝑏subscriptΓ𝑖𝑗𝑖𝑗\displaystyle=f_{aa}+f_{bb}-f_{ii}-f_{jj}+\Gamma_{abab}+\Gamma_{ijij}= italic_f start_POSTSUBSCRIPT italic_a italic_a end_POSTSUBSCRIPT + italic_f start_POSTSUBSCRIPT italic_b italic_b end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_i italic_i end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_j italic_j end_POSTSUBSCRIPT + roman_Γ start_POSTSUBSCRIPT italic_a italic_b italic_a italic_b end_POSTSUBSCRIPT + roman_Γ start_POSTSUBSCRIPT italic_i italic_j italic_i italic_j end_POSTSUBSCRIPT
ΓaiaiΓbjbjΓajajΓbibisubscriptΓ𝑎𝑖𝑎𝑖subscriptΓ𝑏𝑗𝑏𝑗subscriptΓ𝑎𝑗𝑎𝑗subscriptΓ𝑏𝑖𝑏𝑖\displaystyle\qquad-\Gamma_{aiai}-\Gamma_{bjbj}-\Gamma_{ajaj}-\Gamma_{bibi}- roman_Γ start_POSTSUBSCRIPT italic_a italic_i italic_a italic_i end_POSTSUBSCRIPT - roman_Γ start_POSTSUBSCRIPT italic_b italic_j italic_b italic_j end_POSTSUBSCRIPT - roman_Γ start_POSTSUBSCRIPT italic_a italic_j italic_a italic_j end_POSTSUBSCRIPT - roman_Γ start_POSTSUBSCRIPT italic_b italic_i italic_b italic_i end_POSTSUBSCRIPT (18)

is the unperturbed energy difference between the reference state and its 2p-2h excitation. A variation, the White arctan generator,

η^arctansuperscript^𝜂arctan\displaystyle\hat{\eta}^{\textrm{arctan}}over^ start_ARG italic_η end_ARG start_POSTSUPERSCRIPT arctan end_POSTSUPERSCRIPT =12aiarctan(2faiΔai){aaai}absent12subscript𝑎𝑖2subscript𝑓𝑎𝑖subscriptΔ𝑎𝑖subscriptsuperscript𝑎𝑎subscript𝑎𝑖\displaystyle=\frac{1}{2}\sum_{ai}\arctan\left(\frac{2f_{ai}}{\Delta_{ai}}% \right)\{a^{\dagger}_{a}a_{i}\}= divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT roman_arctan ( divide start_ARG 2 italic_f start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT end_ARG start_ARG roman_Δ start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT end_ARG ) { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }
+18abijarctan(2ΓabijΔabij){aaabajai}18subscript𝑎𝑏𝑖𝑗2subscriptΓ𝑎𝑏𝑖𝑗subscriptΔ𝑎𝑏𝑖𝑗subscriptsuperscript𝑎𝑎subscriptsuperscript𝑎𝑏subscript𝑎𝑗subscript𝑎𝑖\displaystyle+\frac{1}{8}\sum_{abij}\arctan\left(\frac{2\Gamma_{abij}}{\Delta_% {abij}}\right)\{a^{\dagger}_{a}a^{\dagger}_{b}a_{j}a_{i}\}+ divide start_ARG 1 end_ARG start_ARG 8 end_ARG ∑ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT roman_arctan ( divide start_ARG 2 roman_Γ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG roman_Δ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT end_ARG ) { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }
H.c.H.c.\displaystyle\qquad-\textrm{H.c.}- H.c. (19)

is useful in cases where the energy denominators ΔaisubscriptΔ𝑎𝑖\Delta_{ai}roman_Δ start_POSTSUBSCRIPT italic_a italic_i end_POSTSUBSCRIPT and ΔabijsubscriptΔ𝑎𝑏𝑖𝑗\Delta_{abij}roman_Δ start_POSTSUBSCRIPT italic_a italic_b italic_i italic_j end_POSTSUBSCRIPT become small [15, 37]. Using either of these generators keeps the overall computational scaling of the IMSRG at 𝒪(Np4Nh2)𝒪superscriptsubscript𝑁𝑝4superscriptsubscript𝑁2\mathcal{O}(N_{p}^{4}N_{h}^{2})caligraphic_O ( italic_N start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ).

Finally, we note that the IMSRG can determine observable quantities besides energy. Given any operator O^^𝑂\hat{O}over^ start_ARG italic_O end_ARG written in the HF basis

O^^𝑂\displaystyle\hat{O}over^ start_ARG italic_O end_ARG =O^1B+O^2Babsentsuperscript^𝑂1Bsuperscript^𝑂2B\displaystyle=\hat{O}^{\textrm{1B}}+\hat{O}^{\textrm{2B}}= over^ start_ARG italic_O end_ARG start_POSTSUPERSCRIPT 1B end_POSTSUPERSCRIPT + over^ start_ARG italic_O end_ARG start_POSTSUPERSCRIPT 2B end_POSTSUPERSCRIPT
=pqOpq1Bapaq+pqrsOpqrs2Bapaqasar,absentsubscript𝑝𝑞subscriptsuperscript𝑂1B𝑝𝑞subscriptsuperscript𝑎𝑝subscript𝑎𝑞subscript𝑝𝑞𝑟𝑠subscriptsuperscript𝑂2B𝑝𝑞𝑟𝑠subscriptsuperscript𝑎𝑝subscriptsuperscript𝑎𝑞subscript𝑎𝑠subscript𝑎𝑟\displaystyle=\sum_{pq}O^{\textrm{1B}}_{pq}a^{\dagger}_{p}a_{q}+\sum_{pqrs}O^{% \textrm{2B}}_{pqrs}a^{\dagger}_{p}a^{\dagger}_{q}a_{s}a_{r},= ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_O start_POSTSUPERSCRIPT 1B end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT italic_O start_POSTSUPERSCRIPT 2B end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT , (20)

the IMSRG flow equation for O^^𝑂\hat{O}over^ start_ARG italic_O end_ARG is

ddsO^(s)=[η^(s),O^(s)].𝑑𝑑𝑠^𝑂𝑠^𝜂𝑠^𝑂𝑠\frac{d}{ds}\hat{O}(s)=[\hat{\eta}(s),\hat{O}(s)].divide start_ARG italic_d end_ARG start_ARG italic_d italic_s end_ARG over^ start_ARG italic_O end_ARG ( italic_s ) = [ over^ start_ARG italic_η end_ARG ( italic_s ) , over^ start_ARG italic_O end_ARG ( italic_s ) ] . (21)

The normal ordering (Eqs. (5)–(7)) remains unchanged, just replacing K^^𝐾\hat{K}over^ start_ARG italic_K end_ARG and V^^𝑉\hat{V}over^ start_ARG italic_V end_ARG with O^1Bsuperscript^𝑂1B\hat{O}^{\textrm{1B}}over^ start_ARG italic_O end_ARG start_POSTSUPERSCRIPT 1B end_POSTSUPERSCRIPT and O^2Bsuperscript^𝑂2B\hat{O}^{\textrm{2B}}over^ start_ARG italic_O end_ARG start_POSTSUPERSCRIPT 2B end_POSTSUPERSCRIPT respectively. The normal-ordered flow equations (Eqs. (II)–(II)) also remain unchanged, replacing E𝐸Eitalic_E, f^^𝑓\hat{f}over^ start_ARG italic_f end_ARG and Γ^^Γ\hat{\Gamma}over^ start_ARG roman_Γ end_ARG with the normal-ordered zero-, one- and two-body parts of O^^𝑂\hat{O}over^ start_ARG italic_O end_ARG respectively. Note that the generator is the same generator used by the Hamiltonian flow, calculated based on the Hamiltonian’s normal-ordered matrix elements, hence Eqs. (9) and (21) must be solved concurrently. Alternatively, one can use the so-called Magnus operator formulation of the IMSRG to extract a parameterization of the unitary transformation, which can then be used to construct any O^(s)^𝑂𝑠\hat{O}(s)over^ start_ARG italic_O end_ARG ( italic_s ) at a later time [38, 14].

III Finite-Temperature IMSRG

III.1 Finite-Temperature Hartree-Fock

We first describe our implementation of finite-temperature HF (FT-HF) [39, 23, 40]. Zero-temperature HF attempts to find a single Slater determinant that minimizes the energy of the system. In the finite-temperature case, we will attempt to find the thermal ensemble that minimizes the system’s free energy

F=ESβ.𝐹𝐸𝑆𝛽F=E-\frac{S}{\beta}.italic_F = italic_E - divide start_ARG italic_S end_ARG start_ARG italic_β end_ARG . (22)

Here, S𝑆Sitalic_S is the entropy and β1/kBT𝛽1subscript𝑘𝐵𝑇\beta\equiv 1/k_{B}Titalic_β ≡ 1 / italic_k start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT italic_T is the inverse temperature. To do this, we will no longer be able to simply use occupation numbers of 0 and 1, and therefore there is no sense in talking about particles and holes. Instead, the occupation number npsubscript𝑛𝑝n_{p}italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT represents the average number of nucleons occupying the pthsuperscript𝑝thp^{\rm th}italic_p start_POSTSUPERSCRIPT roman_th end_POSTSUPERSCRIPT single-particle state. For fermions, which must obey the Pauli exclusion principle, this is equivalent to the probability that, upon measurement, there will be a nucleon occupying the pthsuperscript𝑝thp^{\rm th}italic_p start_POSTSUPERSCRIPT roman_th end_POSTSUPERSCRIPT single-particle state. Thus, we maintain that 0np10subscript𝑛𝑝10\leq n_{p}\leq 10 ≤ italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≤ 1 and np=Asubscript𝑛𝑝𝐴\sum n_{p}=A∑ italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_A where A𝐴Aitalic_A is the total number of nucleons.

We begin by expressing the Hamiltonian

H^=k^+v^=αβkαβaαaβ+14αβγδvαβγδaαaβaδaγ,^𝐻^𝑘^𝑣subscript𝛼𝛽subscript𝑘𝛼𝛽subscriptsuperscript𝑎𝛼subscript𝑎𝛽14subscript𝛼𝛽𝛾𝛿subscript𝑣𝛼𝛽𝛾𝛿subscriptsuperscript𝑎𝛼subscriptsuperscript𝑎𝛽subscript𝑎𝛿subscript𝑎𝛾\hat{H}=\hat{k}+\hat{v}=\sum_{\alpha\beta}k_{\alpha\beta}a^{\dagger}_{\alpha}a% _{\beta}+\frac{1}{4}\sum_{\alpha\beta\gamma\delta}v_{\alpha\beta\gamma\delta}a% ^{\dagger}_{\alpha}a^{\dagger}_{\beta}a_{\delta}a_{\gamma}\,,over^ start_ARG italic_H end_ARG = over^ start_ARG italic_k end_ARG + over^ start_ARG italic_v end_ARG = ∑ start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT + divide start_ARG 1 end_ARG start_ARG 4 end_ARG ∑ start_POSTSUBSCRIPT italic_α italic_β italic_γ italic_δ end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_α italic_β italic_γ italic_δ end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_β end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_δ end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_γ end_POSTSUBSCRIPT , (23)

where we use Greek indices to indicate a general working basis of single-particle states. We will then define the effective single-particle (HF) Hamiltonian f^^𝑓\hat{f}over^ start_ARG italic_f end_ARG, starting with the ansatz

f^(0)=k^.superscript^𝑓0^𝑘\hat{f}^{(0)}=\hat{k}.over^ start_ARG italic_f end_ARG start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = over^ start_ARG italic_k end_ARG . (24)

We diagonalize f^^𝑓\hat{f}over^ start_ARG italic_f end_ARG, yielding eigenvalues {ϵp}subscriptitalic-ϵ𝑝\{\epsilon_{p}\}{ italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } and eigenstates that define a unitary similarity transformation u^^𝑢\hat{u}over^ start_ARG italic_u end_ARG.

Then, we calculate the free-energy minimizing occupation numbers assuming A𝐴Aitalic_A independent fermions with energy levels {ϵp}subscriptitalic-ϵ𝑝\{\epsilon_{p}\}{ italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT }. This can be done in either the canonical or grand-canonical ensembles — additional details are described in Appendix A. In summary, while the canonical ensemble is best-suited for a finite nucleus, calculating occupation numbers in the canonical ensemble scales exponentially with the size of the many-body basis because we need to perform traces over the ensemble. For realistic applications of the FT-IMSRG, we must resort to the grand canonical ensemble, where the calculation of occupation numbers merely scales linearly with the single-particle basis size. This comes with a loss of accuracy, which will be quantified in Section IV.4. When A𝐴Aitalic_A becomes large, however, the grand canonical ensemble occupation numbers approach those of the canonical ensemble, as the relative deviation of particle number in the grand canonical ensemble behaves as 1/A1𝐴1/\sqrt{A}1 / square-root start_ARG italic_A end_ARG (see [41]).

Using the occupation numbers npsubscript𝑛𝑝n_{p}italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and the previously mentioned unitary transformation, we iteratively construct the one-body density matrix

ραβ(i)=pnp(i)uαp(i)uβp(i),superscriptsubscript𝜌𝛼𝛽𝑖subscript𝑝superscriptsubscript𝑛𝑝𝑖superscriptsubscript𝑢𝛼𝑝𝑖superscriptsubscript𝑢𝛽𝑝𝑖\rho_{\alpha\beta}^{(i)}=\sum_{p}n_{p}^{(i)}u_{\alpha p}^{(i)}u_{\beta p}^{(i)% }\,,italic_ρ start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_α italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_β italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , (25)

as well as a new f^^𝑓\hat{f}over^ start_ARG italic_f end_ARG and HF energy E𝐸Eitalic_E using

fαβ(i+1)superscriptsubscript𝑓𝛼𝛽𝑖1\displaystyle f_{\alpha\beta}^{(i+1)}italic_f start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT =kαβ+γδvαγβδργδ(i)absentsubscript𝑘𝛼𝛽subscript𝛾𝛿subscript𝑣𝛼𝛾𝛽𝛿superscriptsubscript𝜌𝛾𝛿𝑖\displaystyle=k_{\alpha\beta}+\sum_{\gamma\delta}v_{\alpha\gamma\beta\delta}% \rho_{\gamma\delta}^{(i)}= italic_k start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_γ italic_δ end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_α italic_γ italic_β italic_δ end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_γ italic_δ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT (26)
E(i+1)superscript𝐸𝑖1\displaystyle E^{(i+1)}italic_E start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT =αβkαβραβ(i)+12αβγδvαβγδραγ(i)ρβδ(i).absentsubscript𝛼𝛽subscript𝑘𝛼𝛽superscriptsubscript𝜌𝛼𝛽𝑖12subscript𝛼𝛽𝛾𝛿subscript𝑣𝛼𝛽𝛾𝛿superscriptsubscript𝜌𝛼𝛾𝑖superscriptsubscript𝜌𝛽𝛿𝑖\displaystyle=\sum_{\alpha\beta}k_{\alpha\beta}\rho_{\alpha\beta}^{(i)}+\frac{% 1}{2}\sum_{\alpha\beta\gamma\delta}v_{\alpha\beta\gamma\delta}\rho_{\alpha% \gamma}^{(i)}\rho_{\beta\delta}^{(i)}.= ∑ start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT + divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_α italic_β italic_γ italic_δ end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_α italic_β italic_γ italic_δ end_POSTSUBSCRIPT italic_ρ start_POSTSUBSCRIPT italic_α italic_γ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT italic_ρ start_POSTSUBSCRIPT italic_β italic_δ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT . (27)

Note that in the case of a diagonal density matrix (which will occur if f^^𝑓\hat{f}over^ start_ARG italic_f end_ARG is diagonal), these equations resemble the normal ordering of Eqs. (5) and (6).

We summarize the process below:

  1. 1.

    Diagonalize f^^𝑓\hat{f}over^ start_ARG italic_f end_ARG, yielding eigenvalues {ϵp}subscriptitalic-ϵ𝑝\{\epsilon_{p}\}{ italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } and a unitary transformation u^^𝑢\hat{u}over^ start_ARG italic_u end_ARG.

  2. 2.

    Calculate the occupation numbers npsubscript𝑛𝑝n_{p}italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT based on the eigenvalues {ϵp}subscriptitalic-ϵ𝑝\{\epsilon_{p}\}{ italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT }.

  3. 3.

    Construct the density matrix via Eq. (25).

  4. 4.

    Construct a new f^^𝑓\hat{f}over^ start_ARG italic_f end_ARG and HF energy E𝐸Eitalic_E using Eqs. (26) and (27).

We repeat these steps until the energy E𝐸Eitalic_E is converged, which we define to be the case once

|E(i+1)E(i)|A105.superscript𝐸𝑖1superscript𝐸𝑖𝐴superscript105\frac{\left|E^{(i+1)}-E^{(i)}\right|}{A}\leq 10^{-5}.divide start_ARG | italic_E start_POSTSUPERSCRIPT ( italic_i + 1 ) end_POSTSUPERSCRIPT - italic_E start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT | end_ARG start_ARG italic_A end_ARG ≤ 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT . (28)

Finally, we transform the Hamiltonian (Eq. (23)) to the HF basis (Eq. (1)), using the eigenvalues {np}subscript𝑛𝑝\{n_{p}\}{ italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } and eigenstates {Up}subscript𝑈𝑝\{U_{p}\}{ italic_U start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } of the final density matrix:

Kpq=αβUpαkαβUβq.subscript𝐾𝑝𝑞subscript𝛼𝛽subscriptsuperscript𝑈𝑝𝛼subscript𝑘𝛼𝛽subscript𝑈𝛽𝑞K_{pq}=\sum_{\alpha\beta}U^{*}_{p\alpha}k_{\alpha\beta}U_{\beta q}.italic_K start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_α end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT italic_U start_POSTSUBSCRIPT italic_β italic_q end_POSTSUBSCRIPT . (29)

and

Vpqrs=αβγδUpαUqβvαβγδUγrUδs.subscript𝑉𝑝𝑞𝑟𝑠subscript𝛼𝛽𝛾𝛿subscriptsuperscript𝑈𝑝𝛼subscriptsuperscript𝑈𝑞𝛽subscript𝑣𝛼𝛽𝛾𝛿subscript𝑈𝛾𝑟subscript𝑈𝛿𝑠V_{pqrs}=\sum_{\alpha\beta\gamma\delta}U^{*}_{p\alpha}U^{*}_{q\beta}v_{\alpha% \beta\gamma\delta}U_{\gamma r}U_{\delta s}.italic_V start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_α italic_β italic_γ italic_δ end_POSTSUBSCRIPT italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_α end_POSTSUBSCRIPT italic_U start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q italic_β end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_α italic_β italic_γ italic_δ end_POSTSUBSCRIPT italic_U start_POSTSUBSCRIPT italic_γ italic_r end_POSTSUBSCRIPT italic_U start_POSTSUBSCRIPT italic_δ italic_s end_POSTSUBSCRIPT . (30)

III.2 The FT-IMSRG Flow

The FT-IMSRG flow has two key differences from the zero-temperature IMSRG described in Section II. First, our reference is no longer a Slater determinant, but a reference ensemble given by FT-HF. We take the normal-ordered Hamiltonian in the HF basis to be given by Eqs. (5)–(7), but using the occupation numbers given by FT-HF. Note that this represents an exact re-writing of the Hamiltonian when working in the grand canonical ensemble, but is only approximate when working in the canonical ensemble. This is because in the latter case, Wick’s theorem picks up correction terms that have no analogs in the zero-temperature version, see [42]. Since these correction terms scale exponentially with the number of single-particle basis states, we presently omit them in our canonical ensemble calculations. Our results presented in Section IV imply that, at least for the schematic model studied here, this is a reasonable approximation. In future work, we will explore if the corrections laid out in [42] can be cast into the generalized normal-ordering formalism of Mukherjee and Kutzelnigg[43]. In this way, it might be possible to remove the current approximations in our FT-IMSRG canonical ensemble calculations by switching over to something that resembles a finite-temperature extension of the multi-reference IMSRG [15, 1].

The interpretation of the energy E𝐸Eitalic_E must also be adapted. Eq. (8) still holds, but E𝐸Eitalic_E is to be interpreted as the internal energy of the reference ensemble. Additionally, expectation values are taken to mean

O^=Tr(ρ^O^),delimited-⟨⟩^𝑂Tr^𝜌^𝑂\langle\hat{O}\rangle=\operatorname{Tr}\left(\hat{\rho}\hat{O}\right),⟨ over^ start_ARG italic_O end_ARG ⟩ = roman_Tr ( over^ start_ARG italic_ρ end_ARG over^ start_ARG italic_O end_ARG ) , (31)

where O^^𝑂\hat{O}over^ start_ARG italic_O end_ARG is any many-body operator and ρ^^𝜌\hat{\rho}over^ start_ARG italic_ρ end_ARG is the statistical operator of the FT-HF reference ensemble. The FT-IMSRG flow equations are identical to Eqs. (II)-(II), once again using the occupation numbers from FT-HF. The evolution of other observables also remains the same.

The second key difference involves the generators to be used. Since we cannot distinguish between particles and holes, in our decoupling scheme we seek to suppress all expectation values of the form

H^{apaq}=n¯pnqfpqdelimited-⟨⟩^𝐻subscriptsuperscript𝑎𝑝subscript𝑎𝑞subscript¯𝑛𝑝subscript𝑛𝑞subscript𝑓𝑝𝑞\langle\hat{H}\{a^{\dagger}_{p}a_{q}\}\rangle=\bar{n}_{p}n_{q}f_{pq}⟨ over^ start_ARG italic_H end_ARG { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT } ⟩ = over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT (32)

and

H^{apaqasar}=n¯pn¯qnrnsΓpqrs,delimited-⟨⟩^𝐻subscriptsuperscript𝑎𝑝subscriptsuperscript𝑎𝑞subscript𝑎𝑠subscript𝑎𝑟subscript¯𝑛𝑝subscript¯𝑛𝑞subscript𝑛𝑟subscript𝑛𝑠subscriptΓ𝑝𝑞𝑟𝑠\langle\hat{H}\{a^{\dagger}_{p}a^{\dagger}_{q}a_{s}a_{r}\}\rangle=\bar{n}_{p}% \bar{n}_{q}n_{r}n_{s}\Gamma_{pqrs},⟨ over^ start_ARG italic_H end_ARG { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT } ⟩ = over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT , (33)

in analogy to the off-diagonal matrix elements (13),(14) of the zero-temperature formulation. Analogously to the zero-temperature case, we look for generators of the form

η^^𝜂\displaystyle\hat{\eta}over^ start_ARG italic_η end_ARG =pqn¯pnqηpq{apaq}absentsubscript𝑝𝑞subscript¯𝑛𝑝subscript𝑛𝑞subscript𝜂𝑝𝑞subscriptsuperscript𝑎𝑝subscript𝑎𝑞\displaystyle=\sum_{pq}\bar{n}_{p}n_{q}\eta_{pq}\{a^{\dagger}_{p}a_{q}\}= ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT }
+14pqrsn¯pn¯qnrnsηpqrs{apaqasar}H.c.14subscript𝑝𝑞𝑟𝑠subscript¯𝑛𝑝subscript¯𝑛𝑞subscript𝑛𝑟subscript𝑛𝑠subscript𝜂𝑝𝑞𝑟𝑠subscriptsuperscript𝑎𝑝subscriptsuperscript𝑎𝑞subscript𝑎𝑠subscript𝑎𝑟H.c.\displaystyle\hphantom{=}+\frac{1}{4}\sum_{pqrs}\bar{n}_{p}\bar{n}_{q}n_{r}n_{% s}\eta_{pqrs}\{a^{\dagger}_{p}a^{\dagger}_{q}a_{s}a_{r}\}-\textrm{H.c.}+ divide start_ARG 1 end_ARG start_ARG 4 end_ARG ∑ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT } - H.c. (34)

The energy denominators used in our generators will also become

ΔpqsubscriptΔ𝑝𝑞\displaystyle\Delta_{pq}roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ={aqap}H^{apaq}H^absentdelimited-⟨⟩subscriptsuperscript𝑎𝑞subscript𝑎𝑝^𝐻subscriptsuperscript𝑎𝑝subscript𝑎𝑞delimited-⟨⟩^𝐻\displaystyle=\langle\{a^{\dagger}_{q}a_{p}\}\hat{H}\{a^{\dagger}_{p}a_{q}\}% \rangle-\langle\hat{H}\rangle= ⟨ { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } over^ start_ARG italic_H end_ARG { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT } ⟩ - ⟨ over^ start_ARG italic_H end_ARG ⟩
=n¯pnq(n¯pfppnqfqqn¯pnqΓpqpq)absentsubscript¯𝑛𝑝subscript𝑛𝑞subscript¯𝑛𝑝subscript𝑓𝑝𝑝subscript𝑛𝑞subscript𝑓𝑞𝑞subscript¯𝑛𝑝subscript𝑛𝑞subscriptΓ𝑝𝑞𝑝𝑞\displaystyle=\bar{n}_{p}n_{q}\left(\bar{n}_{p}f_{pp}-n_{q}f_{qq}-\bar{n}_{p}n% _{q}\Gamma_{pqpq}\right)= over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ( over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_p italic_p end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_q italic_q end_POSTSUBSCRIPT - over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_p italic_q italic_p italic_q end_POSTSUBSCRIPT ) (35)

and

ΔpqrssubscriptΔ𝑝𝑞𝑟𝑠\displaystyle\Delta_{pqrs}roman_Δ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT ={arasaqap}H^{apaqasar}H^absentdelimited-⟨⟩subscriptsuperscript𝑎𝑟subscriptsuperscript𝑎𝑠subscript𝑎𝑞subscript𝑎𝑝^𝐻subscriptsuperscript𝑎𝑝subscriptsuperscript𝑎𝑞subscript𝑎𝑠subscript𝑎𝑟delimited-⟨⟩^𝐻\displaystyle=\langle\{a^{\dagger}_{r}a^{\dagger}_{s}a_{q}a_{p}\}\hat{H}\{a^{% \dagger}_{p}a^{\dagger}_{q}a_{s}a_{r}\}\rangle-\langle\hat{H}\rangle= ⟨ { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } over^ start_ARG italic_H end_ARG { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT } ⟩ - ⟨ over^ start_ARG italic_H end_ARG ⟩
=n¯pn¯qnrns(n¯pfpp+n¯qfqqnrfrrnsfss\displaystyle=\bar{n}_{p}\bar{n}_{q}n_{r}n_{s}\bigg{(}\bar{n}_{p}f_{pp}+\bar{n% }_{q}f_{qq}-n_{r}f_{rr}-n_{s}f_{ss}= over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_p italic_p end_POSTSUBSCRIPT + over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_q italic_q end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_r italic_r end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_s italic_s end_POSTSUBSCRIPT
+n¯pn¯qΓpqpq+nrnsΓrsrsn¯pnrΓprprsubscript¯𝑛𝑝subscript¯𝑛𝑞subscriptΓ𝑝𝑞𝑝𝑞subscript𝑛𝑟subscript𝑛𝑠subscriptΓ𝑟𝑠𝑟𝑠subscript¯𝑛𝑝subscript𝑛𝑟subscriptΓ𝑝𝑟𝑝𝑟\displaystyle\qquad+\bar{n}_{p}\bar{n}_{q}\Gamma_{pqpq}+n_{r}n_{s}\Gamma_{rsrs% }-\bar{n}_{p}n_{r}\Gamma_{prpr}+ over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_p italic_q italic_p italic_q end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_r italic_s italic_r italic_s end_POSTSUBSCRIPT - over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_p italic_r italic_p italic_r end_POSTSUBSCRIPT
n¯qnsΓqsqsn¯pnsΓpspsn¯qnrΓqrqr).\displaystyle\qquad-\bar{n}_{q}n_{s}\Gamma_{qsqs}-\bar{n}_{p}n_{s}\Gamma_{psps% }-\bar{n}_{q}n_{r}\Gamma_{qrqr}\bigg{)}.- over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_q italic_s italic_q italic_s end_POSTSUBSCRIPT - over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_p italic_s italic_p italic_s end_POSTSUBSCRIPT - over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_q italic_r italic_q italic_r end_POSTSUBSCRIPT ) . (36)

To determine the appropriate generator to use, we insist that in the low-temperature limit (where the occupation numbers become 0 and 1), our generator approaches a zero-temperature generator. If we naively attempt to generalize the White generator to finite-temperature, we will find that due to the occupation factors n¯pnqsubscript¯𝑛𝑝subscript𝑛𝑞\bar{n}_{p}n_{q}over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT and n¯pn¯qnrnssubscript¯𝑛𝑝subscript¯𝑛𝑞subscript𝑛𝑟subscript𝑛𝑠\bar{n}_{p}\bar{n}_{q}n_{r}n_{s}over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT appearing in both the numerators and denominators, we will be left with many nonzero and potentially divergent generator matrix elements for non-ph or -pphh entries. To address this problematic behavior, we use the arctan variant of the generator

η^^𝜂\displaystyle\hat{\eta}over^ start_ARG italic_η end_ARG =12pqn¯pnqarctan(2fpqΔpq){apaq}absent12subscript𝑝𝑞subscript¯𝑛𝑝subscript𝑛𝑞2subscript𝑓𝑝𝑞subscriptΔ𝑝𝑞subscriptsuperscript𝑎𝑝subscript𝑎𝑞\displaystyle=\frac{1}{2}\sum_{pq}\bar{n}_{p}n_{q}\arctan\left(\frac{2f_{pq}}{% \Delta_{pq}}\right)\{a^{\dagger}_{p}a_{q}\}= divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT roman_arctan ( divide start_ARG 2 italic_f start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT end_ARG start_ARG roman_Δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT end_ARG ) { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT }
+18pqrsn¯pn¯qnrnsarctan(2ΓpqrsΔpqrs){apaqasar}18subscript𝑝𝑞𝑟𝑠subscript¯𝑛𝑝subscript¯𝑛𝑞subscript𝑛𝑟subscript𝑛𝑠2subscriptΓ𝑝𝑞𝑟𝑠subscriptΔ𝑝𝑞𝑟𝑠subscriptsuperscript𝑎𝑝subscriptsuperscript𝑎𝑞subscript𝑎𝑠subscript𝑎𝑟\displaystyle+\frac{1}{8}\sum_{pqrs}\bar{n}_{p}\bar{n}_{q}n_{r}n_{s}\arctan% \left(\frac{2\Gamma_{pqrs}}{\Delta_{pqrs}}\right)\{a^{\dagger}_{p}a^{\dagger}_% {q}a_{s}a_{r}\}+ divide start_ARG 1 end_ARG start_ARG 8 end_ARG ∑ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT roman_arctan ( divide start_ARG 2 roman_Γ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT end_ARG start_ARG roman_Δ start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT end_ARG ) { italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT }
H.c.H.c.\displaystyle\qquad-\textrm{H.c.}- H.c. (37)

which does behave properly in the low-temperature limit.

Finally, we note that as we are no longer able to distinguish between particles and holes, the computational scaling of the FT-IMSRG is returned to 𝒪(N6)𝒪superscript𝑁6\mathcal{O}(N^{6})caligraphic_O ( italic_N start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT ).

III.3 Particle Number Variance

One of the complications of using a thermal reference ensemble as opposed to a Slater determinant reference state is that the ensemble can run over states with different particle numbers, and the particle number variance ΔA=A^2A^2Δ𝐴delimited-⟨⟩superscript^𝐴2superscriptdelimited-⟨⟩^𝐴2\Delta A=\langle\hat{A}^{2}\rangle-\langle\hat{A}\rangle^{2}roman_Δ italic_A = ⟨ over^ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⟩ - ⟨ over^ start_ARG italic_A end_ARG ⟩ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT may be nonzero.

We have

A^=papap^𝐴subscript𝑝superscriptsubscript𝑎𝑝subscript𝑎𝑝\hat{A}=\sum_{p}a_{p}^{\dagger}a_{p}over^ start_ARG italic_A end_ARG = ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT (38)

and thus

A^2superscript^𝐴2\displaystyle\hat{A}^{2}over^ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT =pqapapaqaq=pq(apaqaqap+δpqapaq)absentsubscript𝑝𝑞superscriptsubscript𝑎𝑝subscript𝑎𝑝superscriptsubscript𝑎𝑞subscript𝑎𝑞subscript𝑝𝑞superscriptsubscript𝑎𝑝superscriptsubscript𝑎𝑞subscript𝑎𝑞subscript𝑎𝑝subscript𝛿𝑝𝑞superscriptsubscript𝑎𝑝subscript𝑎𝑞\displaystyle=\sum_{pq}a_{p}^{\dagger}a_{p}a_{q}^{\dagger}a_{q}=\sum_{pq}\left% (a_{p}^{\dagger}a_{q}^{\dagger}a_{q}a_{p}+\delta_{pq}a_{p}^{\dagger}a_{q}\right)= ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT + italic_δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT )
=pqapaqaqap+papap.absentsubscript𝑝𝑞superscriptsubscript𝑎𝑝superscriptsubscript𝑎𝑞subscript𝑎𝑞subscript𝑎𝑝subscript𝑝superscriptsubscript𝑎𝑝subscript𝑎𝑝\displaystyle=\sum_{pq}a_{p}^{\dagger}a_{q}^{\dagger}a_{q}a_{p}+\sum_{p}a_{p}^% {\dagger}a_{p}.= ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT . (39)

We can then normal order the above operators:

A^=pnp+p{apap}=A+p{apap}^𝐴subscript𝑝subscript𝑛𝑝subscript𝑝superscriptsubscript𝑎𝑝subscript𝑎𝑝𝐴subscript𝑝superscriptsubscript𝑎𝑝subscript𝑎𝑝\hat{A}=\sum_{p}n_{p}+\sum_{p}\{a_{p}^{\dagger}a_{p}\}=A+\sum_{p}\{a_{p}^{% \dagger}a_{p}\}over^ start_ARG italic_A end_ARG = ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT { italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } = italic_A + ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT { italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } (40)

and

A^2superscript^𝐴2\displaystyle\hat{A}^{2}over^ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT =p(npnp2)+pqnpnq+p{apap}absentsubscript𝑝subscript𝑛𝑝superscriptsubscript𝑛𝑝2subscript𝑝𝑞subscript𝑛𝑝subscript𝑛𝑞subscript𝑝superscriptsubscript𝑎𝑝subscript𝑎𝑝\displaystyle=\sum_{p}(n_{p}-n_{p}^{2})+\sum_{pq}n_{p}n_{q}+\sum_{p}\{a_{p}^{% \dagger}a_{p}\}= ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT { italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT }
+pq{apaqaqap}subscript𝑝𝑞superscriptsubscript𝑎𝑝superscriptsubscript𝑎𝑞subscript𝑎𝑞subscript𝑎𝑝\displaystyle\qquad+\sum_{pq}\{a_{p}^{\dagger}a_{q}^{\dagger}a_{q}a_{p}\}+ ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT { italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } (41)
=pn¯pnp+A2+p{apap}+pq{apaqaqap}.absentsubscript𝑝subscript¯𝑛𝑝subscript𝑛𝑝superscript𝐴2subscript𝑝superscriptsubscript𝑎𝑝subscript𝑎𝑝subscript𝑝𝑞superscriptsubscript𝑎𝑝superscriptsubscript𝑎𝑞subscript𝑎𝑞subscript𝑎𝑝\displaystyle=\sum_{p}\bar{n}_{p}n_{p}+A^{2}+\sum_{p}\{a_{p}^{\dagger}a_{p}\}+% \sum_{pq}\{a_{p}^{\dagger}a_{q}^{\dagger}a_{q}a_{p}\}.= ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT + italic_A start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT { italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } + ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT { italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } . (42)

This means that

A^=Adelimited-⟨⟩^𝐴𝐴\langle\hat{A}\rangle=A⟨ over^ start_ARG italic_A end_ARG ⟩ = italic_A (43)

and

A^2=A2+pn¯pnp,delimited-⟨⟩superscript^𝐴2superscript𝐴2subscript𝑝subscript¯𝑛𝑝subscript𝑛𝑝\langle\hat{A}^{2}\rangle=A^{2}+\sum_{p}\bar{n}_{p}n_{p},⟨ over^ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⟩ = italic_A start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , (44)

so that

ΔA=A^2A^2=pn¯pnp.Δ𝐴delimited-⟨⟩superscript^𝐴2superscriptdelimited-⟨⟩^𝐴2subscript𝑝subscript¯𝑛𝑝subscript𝑛𝑝\Delta A=\langle\hat{A}^{2}\rangle-\langle\hat{A}\rangle^{2}=\sum_{p}\bar{n}_{% p}n_{p}.roman_Δ italic_A = ⟨ over^ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⟩ - ⟨ over^ start_ARG italic_A end_ARG ⟩ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT over¯ start_ARG italic_n end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT . (45)

While it appears that this result is in full generality, the canonical ensemble is defined such that ΔA=0Δ𝐴0\Delta A=0roman_Δ italic_A = 0. In fact, there are corrections to Wick’s theorem for the canonical ensemble [42] which enforce the condition ΔA=0Δ𝐴0\Delta A=0roman_Δ italic_A = 0. As explained above, the evaluation of these corrections is impractical because of their scaling with the many-body basis, and we neglect them at present.

Finally, we note that Eq. (40) holds in both ensembles, and we can use it to determine the IMSRG flow of A^^𝐴\hat{A}over^ start_ARG italic_A end_ARG by calculating [η^,A^]^𝜂^𝐴[\hat{\eta},\hat{A}][ over^ start_ARG italic_η end_ARG , over^ start_ARG italic_A end_ARG ]. At s=0𝑠0s=0italic_s = 0, the zero-body part is

[η^,A^]0B=pq(npnq)ηpqδpq=0,superscript^𝜂^𝐴0𝐵subscript𝑝𝑞subscript𝑛𝑝subscript𝑛𝑞subscript𝜂𝑝𝑞subscript𝛿𝑝𝑞0[\hat{\eta},\hat{A}]^{0B}=\sum_{pq}(n_{p}-n_{q})\eta_{pq}\delta_{pq}=0,[ over^ start_ARG italic_η end_ARG , over^ start_ARG italic_A end_ARG ] start_POSTSUPERSCRIPT 0 italic_B end_POSTSUPERSCRIPT = ∑ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ( italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ) italic_η start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT = 0 , (46)

the one-body part is

[η^,A^]pq1Bsubscriptsuperscript^𝜂^𝐴1𝐵𝑝𝑞\displaystyle[\hat{\eta},\hat{A}]^{1B}_{pq}[ over^ start_ARG italic_η end_ARG , over^ start_ARG italic_A end_ARG ] start_POSTSUPERSCRIPT 1 italic_B end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT =r(1+Ppq)ηprδrqrs(nrns)δrsηsprqabsentsubscript𝑟1subscript𝑃𝑝𝑞subscript𝜂𝑝𝑟subscript𝛿𝑟𝑞subscript𝑟𝑠subscript𝑛𝑟subscript𝑛𝑠subscript𝛿𝑟𝑠subscript𝜂𝑠𝑝𝑟𝑞\displaystyle=\sum_{r}(1+P_{pq})\eta_{pr}\delta_{rq}-\sum_{rs}(n_{r}-n_{s})% \delta_{rs}\eta_{sprq}= ∑ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( 1 + italic_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) italic_η start_POSTSUBSCRIPT italic_p italic_r end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_r italic_q end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_r italic_s end_POSTSUBSCRIPT ( italic_n start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) italic_δ start_POSTSUBSCRIPT italic_r italic_s end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_s italic_p italic_r italic_q end_POSTSUBSCRIPT
=ηpq+ηqp=0absentsubscript𝜂𝑝𝑞subscript𝜂𝑞𝑝0\displaystyle=\eta_{pq}+\eta_{qp}=0= italic_η start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT italic_q italic_p end_POSTSUBSCRIPT = 0 (47)

due to the anti-Hermiticity of η^^𝜂\hat{\eta}over^ start_ARG italic_η end_ARG, and the two-body part is

[η^,A^]pqrs2Bsubscriptsuperscript^𝜂^𝐴2𝐵𝑝𝑞𝑟𝑠\displaystyle[\hat{\eta},\hat{A}]^{2B}_{pqrs}[ over^ start_ARG italic_η end_ARG , over^ start_ARG italic_A end_ARG ] start_POSTSUPERSCRIPT 2 italic_B end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT =t((1Ppq)δptηtqrs(1Prs)δtrηpqts)absentsubscript𝑡1subscript𝑃𝑝𝑞subscript𝛿𝑝𝑡subscript𝜂𝑡𝑞𝑟𝑠1subscript𝑃𝑟𝑠subscript𝛿𝑡𝑟subscript𝜂𝑝𝑞𝑡𝑠\displaystyle=-\sum_{t}\left((1-P_{pq})\delta_{pt}\eta_{tqrs}-(1-P_{rs})\delta% _{tr}\eta_{pqts}\right)= - ∑ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( ( 1 - italic_P start_POSTSUBSCRIPT italic_p italic_q end_POSTSUBSCRIPT ) italic_δ start_POSTSUBSCRIPT italic_p italic_t end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_t italic_q italic_r italic_s end_POSTSUBSCRIPT - ( 1 - italic_P start_POSTSUBSCRIPT italic_r italic_s end_POSTSUBSCRIPT ) italic_δ start_POSTSUBSCRIPT italic_t italic_r end_POSTSUBSCRIPT italic_η start_POSTSUBSCRIPT italic_p italic_q italic_t italic_s end_POSTSUBSCRIPT )
=(ηpqrsηqprs+ηpqrsηpqsr)=0.absentsubscript𝜂𝑝𝑞𝑟𝑠subscript𝜂𝑞𝑝𝑟𝑠subscript𝜂𝑝𝑞𝑟𝑠subscript𝜂𝑝𝑞𝑠𝑟0\displaystyle=-\left(\eta_{pqrs}-\eta_{qprs}+\eta_{pqrs}-\eta_{pqsr}\right)=0.= - ( italic_η start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT - italic_η start_POSTSUBSCRIPT italic_q italic_p italic_r italic_s end_POSTSUBSCRIPT + italic_η start_POSTSUBSCRIPT italic_p italic_q italic_r italic_s end_POSTSUBSCRIPT - italic_η start_POSTSUBSCRIPT italic_p italic_q italic_s italic_r end_POSTSUBSCRIPT ) = 0 . (48)

This implies that the there is no change to A^^𝐴\hat{A}over^ start_ARG italic_A end_ARG from the initial derivative at s=0𝑠0s=0italic_s = 0, especially no induced two-body (or higher-rank) contribution. This also means that A^^𝐴\hat{A}over^ start_ARG italic_A end_ARG remains a pure one-body operator through the evolution, so Eqs. (46)–(48) will be valid for any s𝑠sitalic_s and we see that

ddsA^=[η^(s),A^]=0.𝑑𝑑𝑠^𝐴^𝜂𝑠^𝐴0\frac{d}{ds}\hat{A}=[\hat{\eta}(s),\hat{A}]=0\,.divide start_ARG italic_d end_ARG start_ARG italic_d italic_s end_ARG over^ start_ARG italic_A end_ARG = [ over^ start_ARG italic_η end_ARG ( italic_s ) , over^ start_ARG italic_A end_ARG ] = 0 . (49)

Using this result, it is easy to prove that

[η^,A^2]=A^[η^,A^]+[η^,A^]A^=0.^𝜂superscript^𝐴2^𝐴^𝜂^𝐴^𝜂^𝐴^𝐴0[\hat{\eta},\hat{A}^{2}]=\hat{A}[\hat{\eta},\hat{A}]+[\hat{\eta},\hat{A}]\hat{% A}=0.[ over^ start_ARG italic_η end_ARG , over^ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = over^ start_ARG italic_A end_ARG [ over^ start_ARG italic_η end_ARG , over^ start_ARG italic_A end_ARG ] + [ over^ start_ARG italic_η end_ARG , over^ start_ARG italic_A end_ARG ] over^ start_ARG italic_A end_ARG = 0 . (50)

Thus, the average particle number as well as the particle number variance are invariant under the FT-IMSRG flow, and entirely determined by the reference ensemble.

IV Results

IV.1 The P3H Hamiltonian

To test the FT-IMSRG, we employ the pairing-plus-particle-hole (P3H) model, which is exactly solvable and qualitatively captures important features of nuclear interactions [32]. We work with single particle states {(α,σ)}𝛼𝜎\{(\alpha,\sigma)\}{ ( italic_α , italic_σ ) } where α=1,2,3,,N/2𝛼123𝑁2\alpha=1,2,3,...,N/2italic_α = 1 , 2 , 3 , … , italic_N / 2 is the principal quantum number and σ=+,𝜎\sigma=+,-italic_σ = + , - represents the spin.

The P3H Hamiltonian is governed by the parameters δ𝛿\deltaitalic_δ, g𝑔gitalic_g, and b𝑏bitalic_b, and, in the notation of Eq. (23), has the one- and two-body parts

k^P3H=δασ(α1)aασaασsubscript^𝑘P3H𝛿subscript𝛼𝜎𝛼1subscriptsuperscript𝑎𝛼𝜎subscript𝑎𝛼𝜎\hat{k}_{\textrm{P3H}}=\delta\sum_{\alpha\sigma}(\alpha-1)a^{\dagger}_{\alpha% \sigma}a_{\alpha\sigma}over^ start_ARG italic_k end_ARG start_POSTSUBSCRIPT P3H end_POSTSUBSCRIPT = italic_δ ∑ start_POSTSUBSCRIPT italic_α italic_σ end_POSTSUBSCRIPT ( italic_α - 1 ) italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_α italic_σ end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_α italic_σ end_POSTSUBSCRIPT (51)

and

v^P3Hsubscript^𝑣P3H\displaystyle\hat{v}_{\textrm{P3H}}over^ start_ARG italic_v end_ARG start_POSTSUBSCRIPT P3H end_POSTSUBSCRIPT =g2αβaα+aαaβaβ+absent𝑔2subscript𝛼𝛽subscriptsuperscript𝑎limit-from𝛼subscriptsuperscript𝑎limit-from𝛼subscript𝑎limit-from𝛽subscript𝑎limit-from𝛽\displaystyle=-\frac{g}{2}\sum_{\alpha\beta}a^{\dagger}_{\alpha+}a^{\dagger}_{% \alpha-}a_{\beta-}a_{\beta+}= - divide start_ARG italic_g end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_α italic_β end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_α + end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_α - end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_β - end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_β + end_POSTSUBSCRIPT
b2α,β,γβ(aα+aαaβaγ+\displaystyle\quad-\frac{b}{2}\sum_{\alpha,\beta,\gamma\neq\beta}\bigg{(}a^{% \dagger}_{\alpha+}a^{\dagger}_{\alpha-}a_{\beta-}a_{\gamma+}- divide start_ARG italic_b end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_α , italic_β , italic_γ ≠ italic_β end_POSTSUBSCRIPT ( italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_α + end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_α - end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_β - end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_γ + end_POSTSUBSCRIPT
+aγ+aβaαaα+).\displaystyle\quad\qquad\qquad\qquad+a^{\dagger}_{\gamma+}a^{\dagger}_{\beta-}% a_{\alpha-}a_{\alpha+}\bigg{)}.+ italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_γ + end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT † end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_β - end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_α - end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_α + end_POSTSUBSCRIPT ) . (52)

Here, δ𝛿\deltaitalic_δ is the (constant) spacing between single-particle energy levels, g𝑔gitalic_g is the strength of pairing interaction, and b𝑏bitalic_b controls the strength of pair-breaking, particle-hole type excitations. The structure of the eigenstates will be driven by the competition between the pairing and pair-breaking interactions, as well as the ratios g/δ𝑔𝛿g/\deltaitalic_g / italic_δ and b/δ𝑏𝛿b/\deltaitalic_b / italic_δ, i.e., the ability of the interaction terms to overcome the level spacing. Because of the latter observation, we set δ=1𝛿1\delta=1italic_δ = 1 for the remainder of this paper, without loss of generality. We will express g𝑔gitalic_g, b𝑏bitalic_b, and other energies in units of δ𝛿\deltaitalic_δ, and express β𝛽\betaitalic_β in units of δ1superscript𝛿1\delta^{-1}italic_δ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT.

IV.2 Four Particles in Eight States

We first consider the case of four fermions in eight possible single-particle states (i.e., A=4𝐴4A=4italic_A = 4, N=8𝑁8N=8italic_N = 8). The many-body basis will have dimension 70, so the exact solution can be easily computed for comparison. Note that for this section, occupation numbers will be computed in the canonical ensemble so that the FT-IMSRG can be tested with the most accurate FT-HF input possible.

Fig. 2 plots E𝐸Eitalic_E versus β𝛽\betaitalic_β for different coupling strengths. In general, the FT-IMSRG significantly improves the FT-HF results in accuracy. Particularly at low temperatures (high β𝛽\betaitalic_β), the FT-IMSRG results are extremely close to the exact internal energies.

Refer to caption
Figure 2: Internal energy versus inverse temperature with A=4𝐴4A=4italic_A = 4 and N=8𝑁8N=8italic_N = 8 for g=0.3𝑔0.3g=0.3italic_g = 0.3, b=0.1𝑏0.1b=-0.1italic_b = - 0.1 (left) and g=0.1𝑔0.1g=-0.1italic_g = - 0.1, b=0.4𝑏0.4b=-0.4italic_b = - 0.4 (right). The bottom panels show the low-temperature range, β5.0𝛽5.0\beta\geq 5.0italic_β ≥ 5.0. The FT-IMSRG results (blue) are much closer to the exact results (orange) than the FT-HF results (green) are. The FT-IMSRG results successfully replicate the behavior of the exact results at low temperatures.

As would be expected, the FT-IMSRG performs best for weak coupling in the Hamiltonian (i.e. low |g|𝑔|g|| italic_g | and |b|𝑏|b|| italic_b |). For |g|,|b|0.5𝑔𝑏0.5|g|,|b|\leq 0.5| italic_g | , | italic_b | ≤ 0.5, the FT-IMSRG results consistently demonstrate good agreement with the exact results. This can be seen in Fig. 3. At lower temperature, we see strong agreement for the widest range of parameters. Interestingly, the performance of the FT-IMSRG is weakest at mid-range temperatures, around β=1𝛽1\beta=1italic_β = 1. While it still improves upon the FT-HF results in this temperature range, to achieve a similar accuracy to that achieved at lower temperatures, further improvements to the FT-IMSRG truncation scheme would be necessary (see, e.g., [44, 45, 46] for recent discussions.)

Refer to caption
Figure 3: Relative error in internal energy in parameter space for various inverse temperatures with a logarithmic color scale. Shades of red denote less than 1% error. The FT-IMSRG results display the most accuracy for lower temperatures and parameters of lower magnitude.

When |g|𝑔|g|| italic_g | and |b|𝑏|b|| italic_b | become too large, specifically when they are of opposite sign, the FT-IMSRG frequently diverges (shown in deep blue in Fig. 3). This is because positive g𝑔gitalic_g encourages pairing, and negative b𝑏bitalic_b discourages pair-breaking, and vice versa, leading to a mutual reinforcement.

The relative error of the internal energy as a function of β𝛽\betaitalic_β is shown on a logarithmic plot for various coupling strengths in Fig. 4. Once again, we see the strong agreement between the FT-IMSRG and the exact results, which is strongest with weaker coupling and lower temperatures. The less smooth results in the bottom panel are likely due to numerical complications that arise from the pair-breaking term of the Hamiltonian. The relative errors seen here are comparable to those of Finite-Temperature Coupled Cluster for similar schematic models (see, e.g., [47, 48]).

Refer to caption
Figure 4: Relative error in internal energy versus inverse temperature for b=0𝑏0b=0italic_b = 0 (top) and b=0.2𝑏0.2b=0.2italic_b = 0.2 (bottom) with various values of g𝑔gitalic_g on a logarithmic scale. The FT-HF results are shown with dotted lines and the FT-IMSRG results are shown with solid lines. In all cases, the FT-IMSRG results show a significant improvement compared to the FT-HF results, with better results for weaker coupling.

We also show plots of the correlation energy

Ecorr=EEHFsubscript𝐸corr𝐸subscript𝐸HFE_{\textrm{corr}}=E-E_{\mathrm{HF}}italic_E start_POSTSUBSCRIPT corr end_POSTSUBSCRIPT = italic_E - italic_E start_POSTSUBSCRIPT roman_HF end_POSTSUBSCRIPT (53)

versus the pairing strength g𝑔gitalic_g for different β𝛽\betaitalic_β and b𝑏bitalic_b values in Fig. 5. Interestingly, at higher temperatures, more deviations are observed for negative g𝑔gitalic_g (attractive pairing), and at lower temperatures, more deviations are observed for positive g𝑔gitalic_g (repulsive pairing). This trend is apparent in Fig. 3 as well. The behavior at low temperature can be explained from the fact that the IMSRG(2) is known to under count a subset of fourth-order perturbation theory contributions by a factor of 1/2121/21 / 2, see [46, 14]. Higher temperatures weaken the effects of heavily-favored pairing, but exacerbate the effects of heavily-disfavored pairing so that the interaction cannot be fully accounted for by the IMSRG. This leads to the behavior we observe in the correlation energies as temperature increases.

Refer to caption
Figure 5: Correlation energy versus g𝑔gitalic_g for various values of b𝑏bitalic_b and β𝛽\betaitalic_β. At low temperature, the FT-IMSRG results (blue) and exact results (orange) are in great agreement. Weaker coupling in both b𝑏bitalic_b and g𝑔gitalic_g produce more accurate FT-IMSRG results.

It is also informative to look at how the FT-IMSRG decouples the many-body Hamiltonian matrix at different temperatures (recall the decoupling behavior of the zero-temperature IMSRG shown in Fig. 1). In Fig. 6, we show this for the example of a pure pairing Hamiltonian with g=0.5𝑔0.5g=0.5italic_g = 0.5, as well as the same Hamiltonian when a pair-breaking term is added with b=0.2𝑏0.2b=0.2italic_b = 0.2. The normal-ordered pieces of the Hamiltonian after the FT-IMSRG flow are de-normal ordered and used to build the full many-body Hamiltonian matrix. At low temperatures, the FT-IMSRG decouples only the lowest-energy states as is the case in the zero-temperature IMSRG, while at higher temperatures the FT-IMSRG decouples many more states. However, this comes at the cost of truncation error. After computing the eigenvalues of these matrices, we compare the exact free energies. At β=20𝛽20\beta=20italic_β = 20 (very low temperature), the truncation error is under 0.25% for the pure-pairing Hamiltonian, and under 0.55% once the pair-breaking interaction is added. At higher temperature, with β=2𝛽2\beta=2italic_β = 2, the truncation error is about 0.86% for the pure-pairing Hamiltonian, but grows to a little less than 3% once the pair-breaking interaction is added. These truncation errors can help explain the decrease in the FT-IMSRG’s accuracy at high temperatures.

Refer to caption
Figure 6: The full 70×\times×70 many-body Hamiltonian matrix for A=4𝐴4A=4italic_A = 4 and N=8𝑁8N=8italic_N = 8 and two sets of parameters. We compare the unevolved Hamiltonian at zero temperature (β𝛽\beta\to\inftyitalic_β → ∞, row), to the FT-IMSRG evolved Hamiltonian for β=20𝛽20\beta=20italic_β = 20 (middle row), and for β=2𝛽2\beta=2italic_β = 2 (bottom row). Darker colors correspond to matrix elements with larger absolute values, and matrix elements with a value of zero are shown in white. At low temperatures, the FT-IMSRG decouples only the lowest-energy states, while at higher temperatures the FT-IMSRG decouples many more states.

IV.3 Increasing Particle Number and Basis Size

We now turn to cases with larger values of A𝐴Aitalic_A and N𝑁Nitalic_N, once again computing occupation numbers in the canonical ensemble. In Fig. 7, we show a the relative error in internal energy in parameter space at β=5.0𝛽5.0\beta=5.0italic_β = 5.0 and various values for A𝐴Aitalic_A and N𝑁Nitalic_N. We see a similar pattern to before, where weaker couplings generally lead to better agreement between the FT-IMSRG and exact energies.

Refer to caption
Figure 7: Relative error in internal energy in parameter space for β=5.0𝛽5.0\beta=5.0italic_β = 5.0 and various values of A𝐴Aitalic_A and N𝑁Nitalic_N with a logarithmic color scale. Shades of red denote less than 1% error. The FT-IMSRG results display the most accuracy for higher A𝐴Aitalic_A and lower N𝑁Nitalic_N.

In the plots on the left of Fig. 7, we show the case where the single-particle states are half-filled. Other than convergence issues for some parameters at A=8𝐴8A=8italic_A = 8 and N=16𝑁16N=16italic_N = 16, these plots are quite similar to each other. We see improvement in the FT-IMSRG’s accuracy for nearly all parameters when N𝑁Nitalic_N is fixed and A𝐴Aitalic_A increases. When N𝑁Nitalic_N is increased for fixed A𝐴Aitalic_A, however, the relative error increases slightly for most parameters. This can be understood by noting that the P3H interactions are analogous to unregulated delta function potentials, since the two-body matrix elements do not fall off in strength as the number of single particle states N𝑁Nitalic_N is increased. Therefore, increasing N𝑁Nitalic_N effectively makes the interactions “harder”, resulting in larger errors. Unsurprisingly, this effect is much more pronounced for stronger couplings in the Hamiltonian. Note that in applications of the FT-IMSRG to nuclei, N𝑁Nitalic_N will be increased with a fixed A𝐴Aitalic_A to converge the result with respect to the basis size. In contrast, for the P3H model the N𝑁N\rightarrow\inftyitalic_N → ∞ limit is not well-defined without renormalization.

The behavior of the FT-IMSRG error with respect to changing A𝐴Aitalic_A and N𝑁Nitalic_N can be seen clearly in Fig. 8, which plots the relative error versus β𝛽\betaitalic_β on a logarithmic scale for various values of g𝑔gitalic_g, b𝑏bitalic_b, A𝐴Aitalic_A, and N𝑁Nitalic_N. We see a significant increase in the FT-IMSRG’s accuracy as A𝐴Aitalic_A is increased, and a smaller decrease in its accuracy as N𝑁Nitalic_N is increased. The FT-IMSRG typically improves on the FT-HF results, except for a few instances at high temperature/low β𝛽\betaitalic_β, which likely result from truncation errors. For the pure pairing interaction in the top row, the results are nearly identical as N𝑁Nitalic_N is increased – this is because the pure pairing interaction couples fewer single-particle states than when the pair-breaking interaction is added [49]. It is also notable in the bottom two rows that as N𝑁Nitalic_N is increased, the errors appear to converge from below to a fixed value. Since we would expect the error to converge from above as N𝑁Nitalic_N increases, we suspect that this behavior is caused by the use of the zero-range interaction. We will revisit this result in the future with a renormalized zero-range interaction.

Refer to caption
Figure 8: Relative error in internal energy versus inverse temperature on a logarithmic scale for various values of g𝑔gitalic_g, b𝑏bitalic_b, A𝐴Aitalic_A, and N𝑁Nitalic_N. The FT-HF results are shown with dotted lines and the FT-IMSRG results are shown with solid lines.

IV.4 Comparing the Canonical and Grand Canonical Ensembles

As mentioned previously, while it is a more accurate description of nuclei at finite temperature, the canonical ensemble is too computationally expensive to use for general realistic applications, but we can compute the canonical occupation numbers for the P3H model. Thus, we can explore the differences between the FT-HF/FT-IMSRG results for references built in the canonical and grand canonical ensembles for different model parameters to gain some insight for future applications.

Figure 9 shows the relative difference between the canonical and grand canonical results for both FT-HF and FT-IMSRG. The differences are very similar in both methods, which is not surprising given that the choice of ensemble primarily affects the values of the occupation numbers prodcued by the FT-HF, which then serve as input for the FT-IMSRG, but remain unchanged during the flow. As a comparison of the center and bottom rows shows, the differences seem to be slightly more pronounced in FT-IMSRG than in FT-HF, which is likely the result of truncation effects.

As expected, there is a noticeable decrease in the difference between the ensembles upon increasing A𝐴Aitalic_A (see Sec. III and [42]). Changing N𝑁Nitalic_N, however, has a significantly smaller effect on the comparison between the ensembles. This suggests that, as expected, the two ensembles will become equivalent in the thermodynamic limit, where A𝐴Aitalic_A and N𝑁Nitalic_N both become very large. For systems with A8𝐴8A\geq 8italic_A ≥ 8, it seems safe to use the grand canonical ensemble, but for systems with fewer particles, this approximation does introduce an error in the 1-10% range.

Refer to caption
Figure 9: Relative difference between the canonical ensemble and grand canonical ensemble results versus inverse temperature on a logarithmic scale, using the same parameters as Fig. 8. The FT-HF results are shown with dotted lines and the FT-IMSRG results are shown with solid lines. Increasing A𝐴Aitalic_A brings the canonical and grand canonical results significantly closer together, while decreasing N𝑁Nitalic_N only does this slightly.

With all of this discussion, it is worth recalling that FT-IMSRG in the grand canonical ensemble does produce more accurate results than FT-HF in the grand canonical ensemble when compared to exact results. Thus, the FT-IMSRG remains successful as a post-HF method even in this approximation.

IV.5 Entropy and Free Energy

Finally, we calculate entropy and free energy, quantities of much thermodynamic interest, from the FT-HF and FT-IMSRG results. Following the general idea of [48], we compute the entropy via integration. After running the FT-IMSRG for different values of β𝛽\betaitalic_β, we can express the internal energy as a function E(β)𝐸𝛽E(\beta)italic_E ( italic_β ). This can be inverted to give β𝛽\betaitalic_β as a function β(E)𝛽𝐸\beta(E)italic_β ( italic_E ). We then have

S(β)=E()E(β)β(E)𝑑E.𝑆𝛽superscriptsubscript𝐸𝐸𝛽superscript𝛽superscript𝐸differential-dsuperscript𝐸S(\beta)=\int_{E(\infty)}^{E(\beta)}\beta^{\prime}(E^{\prime})dE^{\prime}.italic_S ( italic_β ) = ∫ start_POSTSUBSCRIPT italic_E ( ∞ ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_E ( italic_β ) end_POSTSUPERSCRIPT italic_β start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_d italic_E start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT . (54)

Since the integrand β(E)𝛽𝐸\beta(E)italic_β ( italic_E ) would be infinite in the zero-temperature limit, but the FT-IMSRG energy is insensitive to variations of β𝛽\betaitalic_β in this regime, we use E()E(10)𝐸𝐸10E(\infty)\approx E(10)italic_E ( ∞ ) ≈ italic_E ( 10 ), which gives an excellent (and controllable, if necessary) approximation.

Fig. 10 shows the relative error in entropy for both FT-HF and FT-IMSRG, going back to the A=4,N=8formulae-sequence𝐴4𝑁8A=4,N=8italic_A = 4 , italic_N = 8 model. Notice that the E()E(βmax)𝐸𝐸subscript𝛽maxE(\infty)\approx E(\beta_{\mathrm{max}})italic_E ( ∞ ) ≈ italic_E ( italic_β start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ) scheme necessitates a 100% error in the entropy at β=βmax𝛽subscript𝛽max\beta=\beta_{\mathrm{max}}italic_β = italic_β start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, which is seen in the figure. Outside of β𝛽\betaitalic_β close to 10, the entropy calculations for both FT-HF and FT-IMSRG hover around 5-10% error, except for the case of weak coupling. This is very similar to what was found for Finite-Temperature Coupled Cluster calculations [47].

Refer to caption
Figure 10: Relative error in entropy versus inverse temperature for A=4𝐴4A=4italic_A = 4 and N=8𝑁8N=8italic_N = 8 with various values of g𝑔gitalic_g and b𝑏bitalic_b on a logarithmic scale. The FT-HF results are shown with dotted lines and the FT-IMSRG results are shown with solid lines. The 100% error at β=10𝛽10\beta=10italic_β = 10 is a result of assuming E()=E(10)𝐸𝐸10E(\infty)=E(10)italic_E ( ∞ ) = italic_E ( 10 ) in the integration limits of Eq. (54). Entropy results are more accurate for weaker coupling, and the difference between FT-IMSRG and FT-HF in terms of entropy is generally small.

The Helmholtz free energy F𝐹Fitalic_F can then be calculated via Eq. (22). Fig. 11 shows the relative error in F𝐹Fitalic_F for both FT-HF and FT-IMSRG (once again with A=4𝐴4A=4italic_A = 4 and N=8𝑁8N=8italic_N = 8). We find that the FT-IMSRG calculation of F𝐹Fitalic_F significantly improves upon that of FT-HF, which is expected as this is heavily influenced by the accuracy of the internal energy calculations. Thus it is sensible that the relative error in F𝐹Fitalic_F resembles the relative error in E𝐸Eitalic_E (see Fig. 4).

Refer to caption
Figure 11: Relative error in Helmholtz free energy versus inverse temperature for A=4𝐴4A=4italic_A = 4 and N=8𝑁8N=8italic_N = 8 with various values of g𝑔gitalic_g and b𝑏bitalic_b on a logarithmic scale. The FT-HF results are shown with dotted lines and the FT-IMSRG results are shown with solid lines. Beyond β2𝛽2\beta\approx 2italic_β ≈ 2, the FT-IMSRG results are consistently more accurate than the FT-HF results.

The free energy is of significant importance because of its relationship to the partition function Z𝑍Zitalic_Z itself:

lnZ=βF.𝑍𝛽𝐹\ln Z=-\beta F.roman_ln italic_Z = - italic_β italic_F . (55)

This allows most thermodynamic properties of interest to be computed solely in terms of F𝐹Fitalic_F, β𝛽\betaitalic_β, and derivatives of F𝐹Fitalic_F. As we have shown that the FT-IMSRG can accurately calculate F𝐹Fitalic_F, it can be used to reliably calculate these ensemble averages.

V Conclusion

In this work, we have extended the IMSRG to finite temperature, and demonstrated that the FT-IMSRG is a useful tool for calculating properties of many-fermion systems at finite temperature. Using the schematic model pairing-plus-particle-hole model that captures essential features of nuclear interactions, we performed a thorough assessment of the properties of FT-HF and FT-IMSRG, setting the stage for realistic applications of the FT-IMSRG to nuclei. We found the best agreement between FT-IMSRG and exact solutions at low temperatures and with weak coupling in the Hamiltonian, but the FT-IMSRG produced highly accurate results for a wide range of parameters and temperatures, improving on FT-HF.

As we looked at models with various different particle numbers A𝐴Aitalic_A and single-particle basis sizes N𝑁Nitalic_N, we showed that the FT-IMSRG results improve in accuracy with higher A𝐴Aitalic_A. We then demonstrated that the choice between the canonical and grand canonical ensembles in the setup of the FT-HF optimized occupation numbers can have noticeable effects on the FT-IMSRG results, but these effects become significantly lessened as the particle number increases. Finally, we used the FT-HF and FT-IMSRG results to calculate entropy and free energy, showing that the FT-IMSRG produces accurate results.

As our next steps, we will perform FT-IMSRG calculations for atomic nuclei with modern nuclear interaction derived from chiral Effective Field Theory. We will investigate the evolution of nuclear structure features like the neutron driplines with increasing temperature, and compute reaction and decay rates that are relevant for understanding nuclear processes in stellar environments, including nucleosynthesis. In parallel, we will pursue the implementation of finite temperature in other IMSRG variants and IMSRG-based hybrid methods [15, 50, 17, 51, 52].

Acknowledgements

We thank A. Ravlić for useful discussions.

This work has been supported in part by the U.S. Department of Energy, Office of Science, Office of Nuclear Physics under Award Number DE-SC0023516, as well as the Jeffrey R. Cole Honors College Research Fund at Michigan State University. SKB is partially supported by National Science Foundation (NSF) Grants PHY-2013047 and PHY-2310020.

References

Appendix A Calculation of Occupation Numbers in the Canonical and Grand Canonical Ensembles

Suppose we have A𝐴Aitalic_A fermions, each able to occupy one of N𝑁Nitalic_N single-particle states, with energies ϵ1,ϵ2,,ϵNsubscriptitalic-ϵ1subscriptitalic-ϵ2subscriptitalic-ϵ𝑁\epsilon_{1},\epsilon_{2},...,\epsilon_{N}italic_ϵ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_ϵ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT. By the Pauli exclusion principle, only one fermion may occupy any given state at a time, so the full many-body basis consists of (NA)binomial𝑁𝐴\binom{N}{A}( FRACOP start_ARG italic_N end_ARG start_ARG italic_A end_ARG ) states.

In the canonical ensemble, the number of particles is fixed at A𝐴Aitalic_A, and the inverse temperature β𝛽\betaitalic_β is fixed, since the system can exchange energy with its surroundings, which is at an inverse temperature β𝛽\betaitalic_β. This is the most natural way to describe nucleons bound in a nucleus.

We now wish to find occupation numbers {ni}subscript𝑛𝑖\{n_{i}\}{ italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } such that the Helmholtz free energy (Eq. (22)) is minimized. Using Boltzmann factors, we have that the probability that the system is in the many-body state where the energy levels ϵj1,ϵj2,,ϵjAsubscriptitalic-ϵsubscript𝑗1subscriptitalic-ϵsubscript𝑗2subscriptitalic-ϵsubscript𝑗𝐴\epsilon_{j_{1}},\epsilon_{j_{2}},...,\epsilon_{j_{A}}italic_ϵ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , … , italic_ϵ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUBSCRIPT are occupied is [53]

e(ϵj1+ϵj2++ϵjA)βk1,k2,,kAe(ϵk1+ϵk2++ϵkA)β=e(ϵj1+ϵj2++ϵjA)βZA,superscript𝑒subscriptitalic-ϵsubscript𝑗1subscriptitalic-ϵsubscript𝑗2subscriptitalic-ϵsubscript𝑗𝐴𝛽subscriptsubscript𝑘1subscript𝑘2subscript𝑘𝐴superscript𝑒subscriptitalic-ϵsubscript𝑘1subscriptitalic-ϵsubscript𝑘2subscriptitalic-ϵsubscript𝑘𝐴𝛽superscript𝑒subscriptitalic-ϵsubscript𝑗1subscriptitalic-ϵsubscript𝑗2subscriptitalic-ϵsubscript𝑗𝐴𝛽subscript𝑍𝐴\frac{e^{-(\epsilon_{j_{1}}+\epsilon_{j_{2}}+...+\epsilon_{j_{A}})\beta}}{\sum% \limits_{k_{1},k_{2},...,k_{A}}e^{-(\epsilon_{k_{1}}+\epsilon_{k_{2}}+...+% \epsilon_{k_{A}})\beta}}=\frac{e^{-(\epsilon_{j_{1}}+\epsilon_{j_{2}}+...+% \epsilon_{j_{A}})\beta}}{Z_{A}},divide start_ARG italic_e start_POSTSUPERSCRIPT - ( italic_ϵ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + … + italic_ϵ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) italic_β end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_k start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - ( italic_ϵ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + … + italic_ϵ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) italic_β end_POSTSUPERSCRIPT end_ARG = divide start_ARG italic_e start_POSTSUPERSCRIPT - ( italic_ϵ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + … + italic_ϵ start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) italic_β end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_ARG , (56)

introducing the partition function in the A𝐴Aitalic_A-body system, ZAsubscript𝑍𝐴Z_{A}italic_Z start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT. The occupation number nisubscript𝑛𝑖n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the sum of these probabilities for all many-body states in which the energy level ϵisubscriptitalic-ϵ𝑖\epsilon_{i}italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is occupied. In the notation of [53], this can be conveniently written as

ni=ZA1{i}eϵiβZA,subscript𝑛𝑖subscriptsuperscript𝑍𝑖𝐴1superscript𝑒subscriptitalic-ϵ𝑖𝛽subscript𝑍𝐴n_{i}=\frac{Z^{\setminus\{i\}}_{A-1}e^{-\epsilon_{i}\beta}}{Z_{A}},italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG italic_Z start_POSTSUPERSCRIPT ∖ { italic_i } end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A - 1 end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_β end_POSTSUPERSCRIPT end_ARG start_ARG italic_Z start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT end_ARG , (57)

where ZA1{i}subscriptsuperscript𝑍𝑖𝐴1Z^{\setminus\{i\}}_{A-1}italic_Z start_POSTSUPERSCRIPT ∖ { italic_i } end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A - 1 end_POSTSUBSCRIPT is the partition function for A1𝐴1A-1italic_A - 1 fermions where the energy level ϵisubscriptitalic-ϵ𝑖\epsilon_{i}italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is removed.

While this formula is simple, due to the presence of the partition functions, the computation time for calculating these occupation numbers scales exponentially as (NA)binomial𝑁𝐴\binom{N}{A}( FRACOP start_ARG italic_N end_ARG start_ARG italic_A end_ARG ). Thus, for realistic problems with a large number of single-particle states, evaluating these occupation numbers is not feasible.

In the grand canonical ensemble, particle number is no longer fixed, meaning the system can exchange particles with its surroundings. We then determine occupation numbers that minimize the grand potential

Ω=FμNΩ𝐹𝜇𝑁\Omega=F-\mu Nroman_Ω = italic_F - italic_μ italic_N (58)

where μ𝜇\muitalic_μ is the chemical potential. Unlike in the canonical ensemble, a convenient formula exists for this: the Fermi-Dirac distribution

np=11+e(ϵpμ)β.subscript𝑛𝑝11superscript𝑒subscriptitalic-ϵ𝑝𝜇𝛽n_{p}=\frac{1}{1+e^{(\epsilon_{p}-\mu)\beta}}.italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 1 + italic_e start_POSTSUPERSCRIPT ( italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT - italic_μ ) italic_β end_POSTSUPERSCRIPT end_ARG . (59)

The chemical potential is determined to ensure that the expected value of the particle number is A𝐴Aitalic_A, i.e.

pnp=A.subscript𝑝subscript𝑛𝑝𝐴\sum_{p}n_{p}=A.∑ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_A . (60)

In practice, one starts with an ansatz μ=12(ϵA+1+ϵA)𝜇12subscriptitalic-ϵ𝐴1subscriptitalic-ϵ𝐴\mu=\frac{1}{2}(\epsilon_{A+1}+\epsilon_{A})italic_μ = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( italic_ϵ start_POSTSUBSCRIPT italic_A + 1 end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT ), and iteratively updates μ𝜇\muitalic_μ until convergence is reached.

The benefit of employing the grand canonical ensemble is that it scales computationally as N𝑁Nitalic_N, as opposed to the exponential scaling of the canonical ensemble. The drawback is that the premise – particle number being variable – does not physically represent the system; Eq. (60) only ensures that the expectation value of the particle number is A𝐴Aitalic_A. For large A𝐴Aitalic_A, the fluctuation in particle number behaves as A𝐴\sqrt{A}square-root start_ARG italic_A end_ARG, which becomes small compared to A𝐴Aitalic_A itself. However, in few-fermion systems, the two ensembles can have non-negligible differences, as demonstrated by our results.