-
Basis set extrapolation from the vanishing counterpoise correction condition
Authors:
Vladimir Fishman,
Emmanouil Semidalas,
Jan M. L. Martin
Abstract:
Basis set extrapolations are typically rationalized either from analytical arguments involving the partial-wave or principal expansions of the correlation energy in helium-like systems, or from fitting extrapolation parameters to reference energetics for a small(ish) training set. Seeking to avoid both, we explore a third alternative: extracting extrapolation parameters from the requirement that the BSSE (basis set superposition error) should vanish at the complete basis set limit. We find this to be a viable approach provided that the underlying basis sets are not too small and reasonably well balanced. For basis sets not augmented by diffuse functions, BSSE minimization and energy fitting yield quite similar parameters.
Submitted 7 May, 2024;
originally announced May 2024.
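The vanishing-BSSE condition described in this abstract can be illustrated with a minimal sketch. It assumes a two-point extrapolation of the form E(L) = E_CBS + A·L^(-alpha) and applies it to the counterpoise correction of a single system; the abstract does not state the functional form or the fitting procedure, so the function names, the closed-form solution, and the numbers below are hypothetical illustrations, not the authors' implementation.

```python
import math


def extrapolate(e_small, e_large, l_small, l_large, alpha):
    """Two-point extrapolation assuming E(L) = E_CBS + A * L**(-alpha)."""
    ratio = (l_large / l_small) ** alpha
    return e_large + (e_large - e_small) / (ratio - 1.0)


def alpha_from_vanishing_cp(cp_small, cp_large, l_small, l_large):
    """Exponent for which the extrapolated counterpoise correction is zero.

    Setting extrapolate(cp_small, cp_large, ...) = 0 and solving for alpha
    gives (l_large / l_small)**alpha = cp_small / cp_large.
    """
    if cp_small * cp_large <= 0.0:
        raise ValueError("corrections must be nonzero and share one sign")
    return math.log(cp_small / cp_large) / math.log(l_large / l_small)


# Hypothetical counterpoise corrections (kcal/mol) for L = 3 and L = 4.
cp_tz, cp_qz = 0.64, 0.27
alpha = alpha_from_vanishing_cp(cp_tz, cp_qz, 3, 4)
residual_bsse = extrapolate(cp_tz, cp_qz, 3, 4, alpha)
```

For a training set rather than a single system, one would presumably minimize an aggregate measure of the extrapolated BSSE over all systems instead of using the closed form.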
-
W4$Λ$: leveraging $Λ$ coupled cluster for accurate computational thermochemistry approaches
Authors:
Emmanouil Semidalas,
Amir Karton,
Jan M. L. Martin
Abstract:
High-accuracy composite wavefunction methods like Weizmann-4 (W4) theory, high-accuracy extrapolated \textit{ab initio} thermochemistry (HEAT), and Feller-Peterson-Dixon (FPD) enable sub-kJ/mol accuracy in gas-phase thermochemical properties. Their biggest computational bottleneck is the evaluation of the valence post-CCSD(T) correction term. We demonstrate here, for the W4-17 thermochemistry benchmark and subsets thereof, that the lambda coupled cluster expansion converges more rapidly and smoothly than the regular coupled cluster series. By means of CCSDT(Q)$_Λ$ and CCSDTQ(5)$_Λ$, we can considerably (up to an order of magnitude) accelerate W4- and W4.3-type calculations without loss in accuracy, leading to the W4$Λ$ and W4.3$Λ$ computational thermochemistry protocols.
Submitted 11 April, 2024; v1 submitted 14 December, 2023;
originally announced December 2023.
-
Is valence CCSD(T) enough for the binding of water clusters? The isomers of (H$_2$O)$_6$ and (H$_2$O)$_{20}$ as a case study
Authors:
Golokesh Santra,
Margarita Shepelenko,
Emmanouil Semidalas,
Jan M. L. Martin
Abstract:
Benchmark calculations on noncovalent interactions typically exclude correlation effects beyond valence CCSD(T) owing to their steep computational cost scaling. In this work, we consider their importance for water clusters, specifically, eight isomers of (H$_2$O)$_6$ and four Wales-Hodges isomers of (H$_2$O)$_{20}$. Higher order connected triples, $T_3$--(T), reduce dissociation energies of the latter by about 0.4 kcal/mol, but this is more than compensated by an increase of up to 0.85 kcal/mol due to connected quadruple excitations. In general, higher-order correlation effects favor more compact isomers over more `spread-out' ones. We also consider additional small effects for balance: scalar relativistics reduce binding in (H$_2$O)$_{20}$ by ca. 0.4 kcal/mol, which fortuitously is compensated by the ca. 0.55 kcal/mol diagonal Born-Oppenheimer correction. Core-valence correlation has the greatest impact, at ca. 1.3 kcal/mol for the icosamer.
Submitted 11 August, 2023;
originally announced August 2023.
-
Post-CCSD(T) corrections to bond distances and vibrational frequencies: the power of $Λ$
Authors:
Maciej Spiegel,
Emmanouil Semidalas,
Jan M. L. Martin,
Megan R. Bentley,
John F. Stanton
Abstract:
The importance of post-CCSD(T) corrections as high as CCSDTQ56 for ground-state spectroscopic constants ($D_e$, $ω_e$, $ω_ex_e$, and $α_e$) has been surveyed for a sample of two dozen mostly heavy-atom diatomics spanning a broad range of static correlation strength. While CCSD(T) is known to be an unusually felicitous `Pauling point' between accuracy and computational cost, its performance leaves something to be desired for molecules with strong static correlation. We find CCSDT(Q)$_Λ$ to be the next `sweet spot' up, of comparable or superior quality to the much more expensive CCSDTQ. A similar comparison applies to CCSDTQ(5)$_Λ$ vs. CCSDTQ5, while CCSDTQ5(6)$_Λ$ is essentially indistinguishable from CCSDTQ56. A composite of CCSD(T)-X2C/ACV5Z-X2C with [CCSDT(Q)$_Λ$ -- CCSD(T)]/cc-pVTZ or even cc-pVDZ basis sets appears highly effective for computational vibrational spectroscopy. Unlike CCSDT(Q), which breaks down for the ozone vibrational frequencies, CCSDT(Q)$_Λ$ handles them gracefully.
Submitted 22 August, 2023; v1 submitted 27 July, 2023;
originally announced July 2023.
-
S66x8 Noncovalent Interactions Revisited: New Benchmark and Performance of Composite Localized Coupled-Cluster Methods
Authors:
Golokesh Santra,
Emmanouil Semidalas,
Nisha Mehta,
Amir Karton,
Jan M. L. Martin
Abstract:
The S66x8 noncovalent interactions benchmark has been re-evaluated at the "sterling silver" level, using explicitly correlated MP2-F12 near the complete basis set limit, CCSD(F12*)/aug-cc-pVTZ-F12, and a (T) correction from conventional CCSD(T)/sano-V{D,T}Z+ calculations. The revised reference values disagree by 0.1 kcal/mol RMS with the original Hobza benchmark and its revision by Brauer et al., but by only 0.04 kcal/mol RMS with the "bronze" level data in Kesharwani et al., Aust. J. Chem. 71, 238-248 (2018). We then used these reference values to assess the performance of localized-orbital coupled cluster approaches with and without counterpoise corrections, such as PNO-LCCSD(T) as implemented in MOLPRO, DLPNO-CCSD(T1) as implemented in ORCA, and LNO-CCSD(T) as implemented in MRCC, for their respective "Normal", "Tight", and "veryTight" settings. We also considered composite approaches combining different basis sets and cutoffs. Furthermore, in order to isolate basis set convergence from domain truncation error, for the aug-cc-pVTZ basis set we compared PNO, DLPNO, and LNO approaches with canonical CCSD(T). We conclude that LNO-CCSD(T) with veryTight criteria performs very well for "raw" (CP-uncorrected) interaction energies, but struggles to reproduce counterpoise-corrected numbers even for veryVeryTight criteria: this means that accurate results can be obtained using either extrapolation from basis sets large enough to quench basis set superposition error (BSSE) such as aug-cc-pV{Q,5}Z, or using a composite scheme such as Tight{T,Q}+1.11[vvTight(T) - Tight(T)]. In contrast, PNO-LCCSD(T) works best with counterpoise, while performance with and without counterpoise is comparable for DLPNO-CCSD(T1). Among more economical methods, the highest accuracies are seen for dRPA75-D3BJ, ωB97M-V, ωB97M(2), revDSD-PBEP86-D4, and DFT(SAPT) with a TDEXX or ATDEXX kernel.
Submitted 27 October, 2022; v1 submitted 2 August, 2022;
originally announced August 2022.
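The composite scheme quoted in this abstract, Tight{T,Q}+1.11[vvTight(T) - Tight(T)], is a simple additivity correction and can be sketched as below. The reading assumed here is that Tight{T,Q} is an already-extrapolated triple/quadruple-zeta interaction energy at Tight thresholds and that the bracket is the threshold-tightening correction in the triple-zeta basis; the function name, argument names, and sample numbers are hypothetical, and only the 1.11 scale factor comes from the abstract.

```python
def lno_composite(e_tight_tq, e_vvtight_t, e_tight_t, scale=1.11):
    """Composite interaction energy with a scaled threshold correction.

    e_tight_tq  : extrapolated {T,Q} energy at Tight thresholds (assumed input)
    e_vvtight_t : triple-zeta energy at veryVeryTight thresholds
    e_tight_t   : triple-zeta energy at Tight thresholds
    scale       : empirical factor, 1.11 in the abstract
    """
    return e_tight_tq + scale * (e_vvtight_t - e_tight_t)


# Hypothetical interaction energies in kcal/mol, for illustration only.
e_composite = lno_composite(-5.00, -4.80, -4.90)
```

The design point is that the expensive veryVeryTight calculation is only needed in the smaller basis set, while the large-basis calculations stay at the cheaper Tight setting.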
-
Automatic generation of complementary auxiliary basis sets (CABS) for explicitly correlated methods
Authors:
Emmanouil Semidalas,
Jan M. L. Martin
Abstract:
Explicitly correlated calculations, aside from the orbital basis set, typically require three auxiliary basis sets: JK (Coulomb-exchange fitting), RI-MP2 (resolution of the identity MP2), and CABS (complementary auxiliary basis set). If unavailable for the orbital basis set and chemical elements of interest, the first two can be auto-generated on the fly using existing algorithms, but not the third. In this paper, we present a quite simple algorithm named autoCABS; a Python implementation under a free software license is offered on GitHub. For the cc-pVnZ-F12 (n=D,T,Q,5) basis sets and the W4-08 thermochemical benchmark, we demonstrate that autoCABS-generated CABS basis sets are comparable in quality to purpose-optimized OptRI basis sets from the literature, and that the quality difference becomes entirely negligible as n increases.
Submitted 13 June, 2022; v1 submitted 14 April, 2022;
originally announced April 2022.
-
The MOBH35 metal-organic barrier heights reconsidered: performance of local-orbital coupled cluster approaches in different static correlation regimes
Authors:
Emmanouil Semidalas,
Jan M. L. Martin
Abstract:
We have revisited the MOBH35 (Metal-Organic Barrier Heights, 35 reactions) benchmark [Iron, M. A.; Janes, T. J. Phys. Chem. A 2019, 123 (17), 3761-3781; ibid. 2019, 123, 6379-6380] for realistic organometallic catalytic reactions, using both canonical CCSD(T) and localized orbital approximations to it. For low levels of static correlation, all of DLPNO-CCSD(T), PNO-LCCSD(T), and LNO-CCSD(T) perform well; for moderately strong levels of static correlation, DLPNO-CCSD(T) and (T1) may break down catastrophically, and PNO-LCCSD(T) is vulnerable as well. In contrast, LNO-CCSD(T) converges smoothly to the canonical CCSD(T) answer with increasingly tight convergence settings. The only two reactions for which our revised MOBH35 reference values differ substantially from the original ones are reaction 9 and to a lesser extent 8, both involving iron. For the purpose of evaluating DFT methods for MOBH35, it would be best to excise reaction 9 entirely as its severe level of static correlation is just too demanding a test. The magnitude of the difference between DLPNO-CCSD(T) and DLPNO-CCSD(T1) is a reasonably good predictor for errors in DLPNO-CCSD(T1) compared to canonical CCSD(T); [...]
Submitted 20 January, 2022; v1 submitted 8 November, 2021;
originally announced November 2021.
-
The S66 Noncovalent Interaction Benchmark Re-examined: Composite Localized Coupled Cluster Approaches
Authors:
Emmanouil Semidalas,
Golokesh Santra,
Nisha Mehta,
Jan M. L. Martin
Abstract:
The S66 non-covalent interactions are studied through localized coupled-cluster methods and general LNO-CCSD(T)-based composite schemes. Very small RMS deviations ($\leq$ 0.05 kcal/mol) for the low-cost composite approaches from the SILVER reference interaction energies of S66 indicate that we can safely avoid carrying out the largest basis set calculations with veryVeryTight thresholds, and apply instead additivity corrections in smaller basis sets. Interestingly, the counterpoise corrections do not have an appreciable effect on the composite schemes. These findings may prove useful for intermolecular and intramolecular NCIs of larger systems.
Submitted 14 November, 2021; v1 submitted 2 November, 2021;
originally announced November 2021.
-
An Exchange-Based Diagnostic for Static Correlation
Authors:
Jan M. L. Martin,
Golokesh Santra,
Emmanouil Semidalas
Abstract:
We propose here a DFT-based diagnostic for static correlation, %TAEX[TPSS@HF - HF], which effectively measures how different the DFT and HF exchange energies are for a given HF density. This and %TAEcorr[TPSS] are two cost-effective a priori estimates of the importance of static correlation. %TAEX[TPSS@HF - HF] contains nearly the same information as the earlier A diagnostic, but may be more intuitive to understand. Principal component and variable clustering analysis of a large number of static correlation diagnostics reveals that much of the variation is explained by just two components, and almost all of it by four; these are blocked by four variable clusters (single excitations; correlation entropy; double excitations; pragmatic energetics).
Submitted 14 November, 2021; v1 submitted 2 November, 2021;
originally announced November 2021.
-
Surprisingly Good Performance of XYG3 Family Functionals Using Scaled KS-MP3 Correlation
Authors:
Golokesh Santra,
Emmanouil Semidalas,
Jan M. L. Martin
Abstract:
By adding a GLPT3 (third-order Görling-Levy perturbation theory, or KS-MP3) term E3 to the XYG7 form for a double hybrid, we are able to bring down WTMAD2 (weighted total mean absolute deviation) for the very large and chemically diverse GMTKN55 benchmark to an unprecedented 1.17 kcal/mol, competitive with much costlier composite wavefunction ab initio approaches. Intriguingly: (a) the introduction of E3 makes an empirical dispersion correction redundant; (b) GGA or mGGA semilocal correlation functionals offer no advantage over LDA in this framework; (c) if a dispersion correction is retained, then simple Slater exchange leads to no significant loss in accuracy. It is possible to create a 6-parameter functional with WTMAD2=1.42 that has no post-LDA DFT components and no dispersion correction in the final energy.
Submitted 14 September, 2021; v1 submitted 29 August, 2021;
originally announced August 2021.
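The WTMAD2 statistic quoted in this abstract can be sketched as follows. The sketch assumes the commonly cited GMTKN55 definition, in which each subset's mean absolute deviation is weighted by its number of systems and by the ratio of 56.84 kcal/mol to the subset's mean absolute reference energy; that formula and constant are not given in the abstract and should be checked against the GMTKN55 definition. The function name and the two-subset data are hypothetical.

```python
def wtmad2(subsets, scale=56.84):
    """Weighted total mean absolute deviation, type 2 (assumed definition).

    subsets : sequence of (n_systems, mean_abs_reference_energy, mad) tuples,
              with energies in kcal/mol
    scale   : assumed average of the subsets' mean absolute reference energies
    """
    subsets = list(subsets)
    total_n = sum(n for n, _, _ in subsets)
    weighted = sum(n * (scale / mean_abs_ref) * mad
                   for n, mean_abs_ref, mad in subsets)
    return weighted / total_n


# Two hypothetical subsets: large reaction energies and small interaction energies.
example = [(10, 56.84, 1.0), (30, 5.684, 0.2)]
value = wtmad2(example)
```

The weighting is what lets subsets with small reference energies, such as noncovalent interactions, count on the same footing as subsets with large reaction energies.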
-
Exploring Avenues Beyond Revised DSD Functionals: II. Random-Phase Approximation and scaled MP3 corrections
Authors:
Golokesh Santra,
Emmanouil Semidalas,
Jan M. L. Martin
Abstract:
For revDSD double hybrids, the Görling-Levy second-order perturbation theory component is an Achilles' heel when applied to systems with significant near-degeneracy ("static") correlation. We have explored its replacement by the direct random phase approximation (dRPA), inspired by the SCS-dRPA75 functional of Kállay and coworkers. The addition to the final energy of both a D4 empirical dispersion correction and a semilocal correlation component leads to significant improvements, with DSD-PBEdRPA75-D4 approaching the performance of revDSD-PBEP86-D4 and the Berkeley $ω$B97M(2). This form appears to be fairly insensitive to the choice of semilocal functional, but does exhibit stronger basis set sensitivity than the PT2-based double hybrids (due to much larger prefactors for the nonlocal correlation). As an alternative, we explored adding an MP3-like correction term (in a medium-sized basis set) to a range-separated $ω$DSD-PBEP86-D4 double hybrid, and found it to have significantly lower WTMAD2 (weighted mean absolute deviation) for the large and chemically diverse GMTKN55 benchmark suite; the added computational cost can be mitigated through density fitting techniques.
Submitted 22 May, 2021; v1 submitted 9 February, 2021;
originally announced February 2021.
-
Canonical and DLPNO-based composite wavefunction methods parametrized against large and chemically diverse training sets. 2. Correlation consistent basis sets, core-valence correlation, and F12 alternatives
Authors:
Emmanouil Semidalas,
Jan M. L. Martin
Abstract:
A hierarchy of wavefunction composite methods (cWFT), based on G4-type cWFT methods available for elements H through Rn, was recently reported by Semidalas and Martin [J. Chem. Theory Comput. 2020, 16, 4238]. We extend this hierarchy by considering the inner-shell correlation energy in the second-order Møller-Plesset correction and replacing the Weigend-Ahlrichs def2-mZVPP(D) basis sets used in the aforementioned paper with complete basis set extrapolation from augmented correlation consistent core-valence triple-zeta, aug-cc-pwCVTZ(-PP), and quadruple-zeta, aug-cc-pwCVQZ(-PP), basis sets, thus creating cc-G4-type methods. For the large and chemically diverse GMTKN55 benchmark suite, they represent a substantial further improvement and bring WTMAD2 (weighted mean absolute deviation) down below 1 kcal/mol. Intriguingly, the lion's share of the improvement comes from better capture of valence correlation; the inclusion of core-valence correlation is almost an order of magnitude less important. These robust correlation consistent cWFT methods approach the CCSD(T) complete basis limit with just one or a few fitted parameters. The DLPNO variants in particular, such as cc-G4-T-DLPNO, are applicable to fairly large molecules at modest computational cost, as is (for a reduced range of elements) a different variant using MP2-F12/cc-pVTZ-F12 for the MP2 component.
Submitted 17 November, 2020; v1 submitted 4 August, 2020;
originally announced August 2020.
-
Canonical and DLPNO-based G4(MP2)XK-inspired composite wavefunction methods parametrized against large and chemically diverse training sets: Are they more accurate and/or robust than double hybrid DFT?
Authors:
Emmanouil Semidalas,
Jan M. L. Martin
Abstract:
The large and chemically diverse GMTKN55 benchmark was used as a training set for parametrizing composite wave function thermochemistry protocols akin to G4(MP2)XK theory (Chan et al., JCTC 2019, 15, 4478-4484). Even after reparametrization, the GMTKN55 WTMAD2 (weighted mean absolute deviation, type 2) for G4(MP2)-XK is actually inferior to that of the best rung-4 DFT functional, wB97M-V. By increasing the basis set for the MP2 part to def2-QZVPPD, we were able to substantially improve performance at modest cost (if an RI-MP2 approximation is made), with WTMAD2 for this G4(MP2)-XK-D method now comparable to the better rung-5 functionals (albeit at greater cost). A three-tier approach with a scaled MP3/def2-TZVPP intermediate step, however, leads to a G4(MP3)-D method that is markedly superior to even the best double hybrids wB97M(2) and revDSD-PBEP86-D4. Evaluating the CCSD(T) component with a triple-zeta, rather than split-valence, basis set yields only a modest further improvement that is incommensurate with the drastic increase in computational cost. G4(MP3)-D and G4(MP2)-XK-D have about 40% better WTMAD2, at similar or lower computational cost, than their counterparts G4 and G4(MP2), respectively: detailed comparison reveals that the difference lies in larger molecules due to basis set incompleteness error. An E2/{T,Q} extrapolation and a CCSD(T)/def2-TZVP step yield the G4-T method, which has high accuracy and just three fitted parameters. Using KS orbitals in MP2 leads to the G4(MP3|KS)-D method, which entirely eliminates the CCSD(T) step and has no steps costlier than scaled MP3; this shows a path forward to further improvements in double-hybrid density functional methods. G4-T-DLPNO, a variant in which post-MP2 corrections are evaluated at the DLPNO-CCSD(T) level, achieves nearly the accuracy of G4-T but is applicable to much larger systems.
Submitted 8 June, 2020;
originally announced June 2020.