-
TREXIO: A File Format and Library for Quantum Chemistry
Authors:
Evgeny Posenitskiy,
Vijay Gopal Chilkuri,
Abdallah Ammar,
Michał Hapka,
Katarzyna Pernal,
Ravindra Shinde,
Edgar Josué Landinez Borda,
Claudia Filippi,
Kosuke Nakano,
Otto Kohulák,
Sandro Sorella,
Pablo de Oliveira Castro,
William Jalby,
Pablo López Rıós,
Ali Alavi,
Anthony Scemama
Abstract:
TREXIO is an open-source file format and library developed for the storage and manipulation of data produced by quantum chemistry calculations. It is designed with the goal of providing a reliable and efficient method of storing and exchanging wave function parameters and matrix elements, making it an important tool for researchers in the field of quantum chemistry. In this work, we present an ove…
▽ More
TREXIO is an open-source file format and library developed for the storage and manipulation of data produced by quantum chemistry calculations. It is designed with the goal of providing a reliable and efficient method of storing and exchanging wave function parameters and matrix elements, making it an important tool for researchers in the field of quantum chemistry. In this work, we present an overview of the TREXIO file format and library. The library consists of a front-end implemented in the C programming language and two different back-ends: a text back-end and a binary back-end utilizing the HDF5 library which enables fast read and write operations. It is compatible with a variety of platforms and has interfaces for the Fortran, Python, and OCaml programming languages. In addition, a suite of tools has been developed to facilitate the use of the TREXIO format and library, including converters for popular quantum chemistry codes and utilities for validating and manipulating data stored in TREXIO files. The simplicity, versatility, and ease of use of TREXIO make it a valuable resource for researchers working with quantum chemistry data.
△ Less
Submitted 30 March, 2023; v1 submitted 28 February, 2023;
originally announced February 2023.
-
Gaussian Processes for Finite Size Extrapolation of Many-Body Simulations
Authors:
Edgar Josué Landinez Borda,
Kenneth O. Berard,
Annette Lopez,
Brenda Rubenstein
Abstract:
Key to being able to accurately model the properties of realistic materials is being able to predict their properties in the thermodynamic limit. Nevertheless, because most many-body electronic structure methods scale as a high-order polynomial, or even exponentially, with system size, directly simulating large systems in their thermodynamic limit rapidly becomes computationally intractable. As a…
▽ More
Key to being able to accurately model the properties of realistic materials is being able to predict their properties in the thermodynamic limit. Nevertheless, because most many-body electronic structure methods scale as a high-order polynomial, or even exponentially, with system size, directly simulating large systems in their thermodynamic limit rapidly becomes computationally intractable. As a result, researchers typically estimate the properties of large systems that approach the thermodynamic limit by extrapolating the properties of smaller, computationally-accessible systems based on relatively simple scaling expressions. In this work, we employ Gaussian processes to more accurately and efficiently extrapolate many-body simulations to their thermodynamic limit. We train our Gaussian processes on Smooth Overlap of Atomic Positions (SOAP) descriptors to extrapolate the energies of one-dimensional hydrogen chains obtained using two high-accuracy many-body methods: Coupled Cluster theory and Auxiliary Field Quantum Monte Carlo (AFQMC). In so doing, we show that Gaussian processes trained on relatively short, 10-30-atom chains can predict the energies of both homogeneous and inhomogeneous hydrogen chains in their thermodynamic limit with sub-milliHartree accuracy. Unlike standard scaling expressions, our GPR-based approach is highly generalizable given representative training data and is not dependent on systems' geometries or dimensionality. This work highlights the potential for machine learning to correct for the finite size effects that routinely complicate the interpretation of finite size many-body simulations.
△ Less
Submitted 3 April, 2024; v1 submitted 19 December, 2021;
originally announced December 2021.
-
A practical approach to Hohenberg-Kohn maps based on many-body correlations: learning the electronic density
Authors:
Edgar Josué Landinez Borda,
Amit Samanta
Abstract:
High throughput screening of materials for technologically relevant areas, like identification of better catalysts, electronic materials, ceramics for high temperature applications and drug discovery, is an emerging topic of research. To facilitate this, density functional theory based (DFT) calculations are routinely used to calculate the electronic structure of a wide variety of materials. Howev…
▽ More
High throughput screening of materials for technologically relevant areas, like identification of better catalysts, electronic materials, ceramics for high temperature applications and drug discovery, is an emerging topic of research. To facilitate this, density functional theory based (DFT) calculations are routinely used to calculate the electronic structure of a wide variety of materials. However, DFT calculations are expensive and the computing cost scales as the cube of the number of electrons present in the system. Thus, it is desirable to generate surrogate models that can mitigate these issues. To this end, we present a two step procedure to predict total energies of large three-dimensional systems (with periodic boundary conditions) with chemical accuracy (1kcal/mol) per atom using a small data set, meaning that such models can be trained on-the-fly. Our procedure is based on the idea of the Hohenberg-Kohn map proposed by Brockherde et al. (Nat. Commun, 8, 872 (2017)) and involves two training models: one, to predict the ground state charge density, $ρ(r)$, directly from the atomic structure, and another to predict the total energy from $ρ(r)$. To predict $ρ(r)$, we use many-body correlation descriptors to accurately describe the neighborhood of a grid point and to predict the total energy we use amplitudes of these many-body correlation descriptors. Utilizing the amplitudes of the many-body descriptors allows for uniquely identifying a structure while accounting for constraints, such as translational invariance; additionally, such a formulation is independent of the charge density grid.
△ Less
Submitted 30 April, 2020; v1 submitted 29 April, 2020;
originally announced April 2020.
-
QMCPACK: Advances in the development, efficiency, and application of auxiliary field and real-space variational and diffusion Quantum Monte Carlo
Authors:
P. R. C. Kent,
Abdulgani Annaberdiyev,
Anouar Benali,
M. Chandler Bennett,
Edgar Josue Landinez Borda,
Peter Doak,
Kenneth D. Jordan,
Jaron T. Krogel,
Ilkka Kylanpaa,
Joonho Lee,
Ye Luo,
Fionn D. Malone,
Cody A. Melton,
Lubos Mitas,
Miguel A. Morales,
Eric Neuscamman,
Fernando A. Reboredo,
Brenda Rubenstein,
Kayahan Saritas,
Shiv Upadhyay,
Hongxia Hao,
Guangming Wang,
Shuai Zhang,
Luning Zhao
Abstract:
We review recent advances in the capabilities of the open source ab initio Quantum Monte Carlo (QMC) package QMCPACK and the workflow tool Nexus used for greater efficiency and reproducibility. The auxiliary field QMC (AFQMC) implementation has been greatly expanded to include k-point symmetries, tensor-hypercontraction, and accelerated graphical processing unit (GPU) support. These scaling and me…
▽ More
We review recent advances in the capabilities of the open source ab initio Quantum Monte Carlo (QMC) package QMCPACK and the workflow tool Nexus used for greater efficiency and reproducibility. The auxiliary field QMC (AFQMC) implementation has been greatly expanded to include k-point symmetries, tensor-hypercontraction, and accelerated graphical processing unit (GPU) support. These scaling and memory reductions greatly increase the number of orbitals that can practically be included in AFQMC calculations, increasing accuracy. Advances in real space methods include techniques for accurate computation of band gaps and for systematically improving the nodal surface of ground state wavefunctions. Results of these calculations can be used to validate application of more approximate electronic structure methods including GW and density functional based techniques. To provide an improved foundation for these calculations we utilize a new set of correlation-consistent effective core potentials (pseudopotentials) that are more accurate than previous sets; these can also be applied in quantum-chemical and other many-body applications, not only QMC. These advances increase the efficiency, accuracy, and range of properties that can be studied in both molecules and materials with QMC and QMCPACK.
△ Less
Submitted 6 May, 2020; v1 submitted 3 March, 2020;
originally announced March 2020.
-
QMCPACK : An open source ab initio Quantum Monte Carlo package for the electronic structure of atoms, molecules, and solids
Authors:
Jeongnim Kim,
Andrew Baczewski,
Todd D. Beaudet,
Anouar Benali,
M. Chandler Bennett,
Mark A. Berrill,
Nick S. Blunt,
Edgar Josue Landinez Borda,
Michele Casula,
David M. Ceperley,
Simone Chiesa,
Bryan K. Clark,
Raymond C. Clay III,
Kris T. Delaney,
Mark Dewing,
Kenneth P. Esler,
Hongxia Hao,
Olle Heinonen,
Paul R. C. Kent,
Jaron T. Krogel,
Ilkka Kylanpaa,
Ying Wai Li,
M. Graham Lopez,
Ye Luo,
Fionn D. Malone
, et al. (23 additional authors not shown)
Abstract:
QMCPACK is an open source quantum Monte Carlo package for ab-initio electronic structure calculations. It supports calculations of metallic and insulating solids, molecules, atoms, and some model Hamiltonians. Implemented real space quantum Monte Carlo algorithms include variational, diffusion, and reptation Monte Carlo. QMCPACK uses Slater-Jastrow type trial wave functions in conjunction with a s…
▽ More
QMCPACK is an open source quantum Monte Carlo package for ab-initio electronic structure calculations. It supports calculations of metallic and insulating solids, molecules, atoms, and some model Hamiltonians. Implemented real space quantum Monte Carlo algorithms include variational, diffusion, and reptation Monte Carlo. QMCPACK uses Slater-Jastrow type trial wave functions in conjunction with a sophisticated optimizer capable of optimizing tens of thousands of parameters. The orbital space auxiliary field quantum Monte Carlo method is also implemented, enabling cross validation between different highly accurate methods. The code is specifically optimized for calculations with large numbers of electrons on the latest high performance computing architectures, including multicore central processing unit (CPU) and graphical processing unit (GPU) systems. We detail the program's capabilities, outline its structure, and give examples of its use in current research calculations. The package is available at http://www.qmcpack.org .
△ Less
Submitted 4 April, 2018; v1 submitted 19 February, 2018;
originally announced February 2018.
-
Non-Orthogonal Multi-Slater Determinant Expansions in Auxiliary Field Quantum Monte Carlo
Authors:
Edgar Josué Landinez Borda,
John A. Gomez,
Miguel A. Morales
Abstract:
The Auxiliary-Field Quantum Monte Carlo (AFQMC) algorithm is a powerful quantum many-body method that can be used successfully as an alternative to standard quantum chemistry approaches to compute the ground state of many body systems, such as molecules and solids, with high accuracy. In this article we use AFQMC with trial wave-functions built from non-orthogonal multi Slater determinant expansio…
▽ More
The Auxiliary-Field Quantum Monte Carlo (AFQMC) algorithm is a powerful quantum many-body method that can be used successfully as an alternative to standard quantum chemistry approaches to compute the ground state of many body systems, such as molecules and solids, with high accuracy. In this article we use AFQMC with trial wave-functions built from non-orthogonal multi Slater determinant expansions to study the energetics of molecular systems, including the 55 molecules of the G1 test set and the isomerization path of the $[Cu_{2}O_{2}]^{2+}$ molecule. The main goal of this study is to show the ability of non-orthogonal multi Slater determinant expansions to produce high-quality, compact trial wave-functions for quantum Monte Carlo methods. We obtain systematically improvable results as the number of determinants is increased, with high accuracy typically obtained with tens of determinants. Great reduction in the average error and traditional statistical indicators are observed in the total and absorption energies of the molecules in the G1 test set with as few as 10-20 determinants. In the case of the relative energies along the isomerization path of the $[Cu_{2}O_{2}]^{2+}$, our results compare favorably with other advanced quantum many-body methods, including DMRG and complete-renormalized CCSD(T). Discrepancies in previous studies for this molecular problem are identified and attributed to the differences in the number of electrons and active spaces considered in such calculations.
△ Less
Submitted 19 July, 2018; v1 submitted 31 January, 2018;
originally announced January 2018.
-
Reply to Comment on "Dislocation Structure and Mobility in hcp $^4$He"
Authors:
Edgar Josué Landinez Borda,
Wei Cai,
Maurice de Koning
Abstract:
In their Comment [arXiv:1609.06174 (2016)], Kuklov and Svistunov argue that (a) our discussion of the role of basal-plane dislocations in mass-flow junction experiments in our recent Letter (Phys. Rev. Lett. 117, 045301 (2016)) is misleading, (b) our results do not provide new insight into dislocation dissociation nor superfluidity of dislocation cores and (c) our calculations lack control of the…
▽ More
In their Comment [arXiv:1609.06174 (2016)], Kuklov and Svistunov argue that (a) our discussion of the role of basal-plane dislocations in mass-flow junction experiments in our recent Letter (Phys. Rev. Lett. 117, 045301 (2016)) is misleading, (b) our results do not provide new insight into dislocation dissociation nor superfluidity of dislocation cores and (c) our calculations lack control of the numerical data. Here we offer our Reply.
△ Less
Submitted 18 November, 2016;
originally announced November 2016.