-
Accurate quantum Monte Carlo forces for machine-learned force fields: Ethanol as a benchmark
Authors:
Emiel Slootman,
Igor Poltavsky,
Ravindra Shinde,
Jacopo Cocomello,
Saverio Moroni,
Alexandre Tkatchenko,
Claudia Filippi
Abstract:
Quantum Monte Carlo (QMC) is a powerful method to calculate accurate energies and forces for molecular systems. In this work, we demonstrate how we can obtain accurate QMC forces for the fluxional ethanol molecule at room temperature by using either multi-determinant Jastrow-Slater wave functions in variational Monte Carlo or just a single determinant in diffusion Monte Carlo. The excellent perfor…
▽ More
Quantum Monte Carlo (QMC) is a powerful method to calculate accurate energies and forces for molecular systems. In this work, we demonstrate how we can obtain accurate QMC forces for the fluxional ethanol molecule at room temperature by using either multi-determinant Jastrow-Slater wave functions in variational Monte Carlo or just a single determinant in diffusion Monte Carlo. The excellent performance of our protocols is assessed against high-level coupled cluster calculations on a diverse set of representative configurations of the system. Finally, we train machine-learning force fields on the QMC forces and compare them to models trained on coupled cluster reference data, showing that a force field based on the diffusion Monte Carlo forces with a single determinant can faithfully reproduce coupled cluster power spectra in molecular dynamics simulations.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Quantum-informed simulations for mechanics of materials: DFTB+MBD framework
Authors:
Zhaoxiang Shen,
Raúl I. Sosa,
Stéphane P. A. Bordas,
Alexandre Tkatchenko,
Jakub Lengiewicz
Abstract:
The macroscopic behaviors of materials are determined by interactions that occur at multiple lengths and time scales. Depending on the application, describing, predicting, and understanding these behaviors require models that rely on insights from electronic and atomic scales. In such cases, classical simplified approximations at those scales are insufficient, and quantum-based modeling is require…
▽ More
The macroscopic behaviors of materials are determined by interactions that occur at multiple lengths and time scales. Depending on the application, describing, predicting, and understanding these behaviors require models that rely on insights from electronic and atomic scales. In such cases, classical simplified approximations at those scales are insufficient, and quantum-based modeling is required. In this paper, we study how quantum effects can modify the mechanical properties of systems relevant to materials engineering. We base our study on a high-fidelity modeling framework that combines two computationally efficient models rooted in quantum first principles: Density Functional Tight Binding (DFTB) and many-body dispersion (MBD). The MBD model is applied to accurately describe non-covalent van der Waals interactions. Through various benchmark applications, we demonstrate the capabilities of this framework and the limitations of simplified modeling. We provide an open-source repository containing all codes, datasets, and examples presented in this work. This repository serves as a practical toolkit that we hope will support the development of future research in effective large-scale and multiscale modeling with quantum-mechanical fidelity.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
Universal Pairwise Interatomic van der Waals Potentials Based on Quantum Drude Oscillators
Authors:
Almaz Khabibrakhmanov,
Dmitry V. Fedorov,
Alexandre Tkatchenko
Abstract:
Repulsive short-range and attractive long-range van der Waals (vdW) forces have an appreciable role in the behavior of extended molecular systems. When using empirical force fields - the most popular computational methods applied to such systems - vdW forces are typically described by Lennard-Jones-like potentials, which unfortunately have a limited predictive power. Here, we present a universal p…
▽ More
Repulsive short-range and attractive long-range van der Waals (vdW) forces have an appreciable role in the behavior of extended molecular systems. When using empirical force fields - the most popular computational methods applied to such systems - vdW forces are typically described by Lennard-Jones-like potentials, which unfortunately have a limited predictive power. Here, we present a universal parameterization of a quantum-mechanical vdW potential, which requires only two free-atom properties - the static dipole polarizability $α_1$ and the dipole-dipole $C_6$ dispersion coefficient. This is achieved by deriving the functional form of the potential from the quantum Drude oscillator (QDO) model, employing scaling laws for the equilibrium distance and the binding energy as well as applying the microscopic law of corresponding states. The vdW-QDO potential is shown to be accurate for vdW binding energy curves, as demonstrated by comparing to ab initio binding curves of 21 noble-gas dimers. The functional form of the vdW-QDO potential has the correct asymptotic behavior both at zero and infinite distances. In addition, it is shown that the damped vdW-QDO potential can accurately describe vdW interactions in dimers consisting of group II elements. Finally, we demonstrate the applicability of the atom-in-molecule vdW-QDO model for predicting accurate dispersion energies for molecular systems. The present work makes an important step towards constructing universal vdW potentials, which could benefit (bio)molecular computational studies.
△ Less
Submitted 26 September, 2023;
originally announced September 2023.
-
Enabling Inverse Design in Chemical Compound Space: Map** Quantum Properties to Structures for Small Organic Molecules
Authors:
Alessio Fallani,
Leonardo Medrano Sandonas,
Alexandre Tkatchenko
Abstract:
Computer-driven molecular design combines the principles of chemistry, physics, and artificial intelligence to identify novel chemical compounds and materials with desired properties for a specific application. In particular, quantum-mechanical (QM) methods combined with machine learning (ML) techniques have accelerated the estimation of accurate molecular properties, providing a direct map** fr…
▽ More
Computer-driven molecular design combines the principles of chemistry, physics, and artificial intelligence to identify novel chemical compounds and materials with desired properties for a specific application. In particular, quantum-mechanical (QM) methods combined with machine learning (ML) techniques have accelerated the estimation of accurate molecular properties, providing a direct map** from 3D molecular structures to their properties. However, the development of reliable and efficient methodologies to enable \emph{inverse map**} in chemical space is a long-standing challenge that has not been accomplished yet. Here, we address this challenge by demonstrating the possibility of parametrizing a given chemical space with a finite set of extensive and intensive QM properties. In doing so, we develop a proof-of-concept implementation that combines a Variational Auto-Encoder (VAE) trained on molecular structures with a property encoder designed to learn the latent representation from a set of QM properties. The result of this joint architecture is a common latent space representation for both structures and properties, which enables property-to-structure map** for small drug-like molecules contained in the QM7-X dataset. We illustrate the capabilities of our approach by conditional generation of \emph{de novo} molecular structures with targeted properties, transition path interpolation for chemical reactions as well as insights into property-structure relationships. Our findings thus provide a proof-of-principle demonstration aiming to enable the inverse property-to-structure design in diverse chemical spaces.
△ Less
Submitted 1 September, 2023;
originally announced September 2023.
-
Force Field Analysis Software and Tools (FFAST): Assessing Machine Learning Force Fields Under the Microscope
Authors:
Gregory Fonseca,
Igor Poltavsky,
Alexandre Tkatchenko
Abstract:
As the sophistication of Machine Learning Force Fields (MLFF) increases to match the complexity of extended molecules and materials, so does the need for tools to properly analyze and assess the practical performance of MLFFs. To go beyond average error metrics and into a complete picture of a model's applicability and limitations, we develop FFAST (Force Field Analysis Software and Tools): a cros…
▽ More
As the sophistication of Machine Learning Force Fields (MLFF) increases to match the complexity of extended molecules and materials, so does the need for tools to properly analyze and assess the practical performance of MLFFs. To go beyond average error metrics and into a complete picture of a model's applicability and limitations, we develop FFAST (Force Field Analysis Software and Tools): a cross-platform software package designed to gain detailed insights into a model's performance and limitations, complete with an easy-to-use graphical user interface. The software allows the user to gauge the performance of many popular state-of-the-art MLFF models on various popular dataset types, providing general prediction error overviews, outlier detection mechanisms, atom-projected errors, and more. It has a 3D visualizer to find and picture problematic configurations, atoms, or clusters in a large dataset. In this paper, the example of the MACE and Nequip models are used on two datasets of interest -- stachyose and docosahexaenoic acid (DHA) -- to illustrate the use cases of the software. With it, it was found that carbons and oxygens involved in or near glycosidic bonds inside the stachyose molecule present increased prediction errors. In addition, prediction errors on DHA rise as the molecule folds, especially for the carboxylic group at the edge of the molecule. We emphasize the need for a systematic assessment of MLFF models for ensuring their successful application to study the dynamics of molecules and materials.
△ Less
Submitted 13 August, 2023;
originally announced August 2023.
-
libMBD: A general-purpose package for scalable quantum many-body dispersion calculations
Authors:
Jan Hermann,
Martin Stöhr,
Szabolcs Góger,
Shayantan Chaudhuri,
Bálint Aradi,
Reinhard J. Maurer,
Alexandre Tkatchenko
Abstract:
Many-body dispersion (MBD) is a powerful framework to treat van der Waals (vdW) dispersion interactions in density-functional theory and related atomistic modeling methods. Several independent implementations of MBD with varying degree of functionality exist across a number of electronic structure codes, which both limits the current users of those codes and complicates dissemination of new varian…
▽ More
Many-body dispersion (MBD) is a powerful framework to treat van der Waals (vdW) dispersion interactions in density-functional theory and related atomistic modeling methods. Several independent implementations of MBD with varying degree of functionality exist across a number of electronic structure codes, which both limits the current users of those codes and complicates dissemination of new variants of MBD. Here, we develop and document libMBD, a library implementation of MBD that is functionally complete, efficient, easy to integrate with any electronic structure code, and already integrated in FHI-aims, DFTB+, VASP, Q-Chem, CASTEP, and Quantum ESPRESSO. libMBD is written in modern Fortran with bindings to C and Python, uses MPI/ScaLAPACK for parallelization, and implements MBD for both finite and periodic systems, with analytical gradients with respect to all input parameters. The computational cost has asymptotic cubic scaling with system size, and evaluation of gradients only changes the prefactor of the scaling law, with libMBD exhibiting strong scaling up to 256 processor cores. Other MBD properties beyond energy and gradients can be calculated with libMBD, such as the charge-density polarization, first-order Coulomb correction, the dielectric function, or the order-by-order expansion of the energy in the dipole interaction. Calculations on supramolecular complexes with MBD-corrected electronic structure methods and a meta-review of previous applications of MBD demonstrate the broad applicability of the libMBD package to treat vdW interactions.
△ Less
Submitted 6 August, 2023;
originally announced August 2023.
-
Modeling Non-Covalent Interatomic Interactions on a Photonic Quantum Computer
Authors:
Matthieu Sarkis,
Alessio Fallani,
Alexandre Tkatchenko
Abstract:
Non-covalent interactions are a key ingredient to determine the structure, stability, and dynamics of materials, molecules, and biological complexes. However, accurately capturing these interactions is a complex quantum many-body problem, with no efficient solution available on classical computers. A widely used model to accurately and efficiently model non-covalent interactions is the Coulomb-cou…
▽ More
Non-covalent interactions are a key ingredient to determine the structure, stability, and dynamics of materials, molecules, and biological complexes. However, accurately capturing these interactions is a complex quantum many-body problem, with no efficient solution available on classical computers. A widely used model to accurately and efficiently model non-covalent interactions is the Coulomb-coupled quantum Drude oscillator (cQDO) many-body Hamiltonian, for which no exact solution is known. We show that the cQDO model lends itself naturally to simulation on a photonic quantum computer, and we calculate the binding energy curve of diatomic systems by leveraging Xanadu's Strawberry Fields photonics library. Our study substantially extends the applicability of quantum computing to atomistic modeling, by showing a proof-of-concept application to non-covalent interactions, beyond the standard electronic-structure problem of small molecules. Remarkably, we find that two coupled bosonic QDOs exhibit a stable bond. In addition, our study suggests efficient functional forms for cQDO wavefunctions that can be optimized on classical computers, and capture the bonded-to-noncovalent transition for increasing interatomic distances. Remarkably, we find that two coupled bosonic QDOs exhibit a stable bond. In addition, our study suggests efficient functional forms for cQDO wavefunctions that can be optimized on classical computers, and capture the bonded-to-noncovalent transition for increasing interatomic distances.
△ Less
Submitted 26 October, 2023; v1 submitted 14 June, 2023;
originally announced June 2023.
-
Molecules in Environments: Towards Systematic Quantum Embedding of Electrons and Drude Oscillators
Authors:
Matej Ditte,
Matteo Barborini,
Leonardo Medrano Sandonas,
Alexandre Tkatchenko
Abstract:
We develop a quantum embedding method that enables accurate and efficient treatment of interactions between molecules and an environment, while explicitly including many-body correlations. The molecule is composed of classical nuclei and quantum electrons, whereas the environment is modeled via charged quantum harmonic oscillators. We construct a general Hamiltonian and introduce a variational ans…
▽ More
We develop a quantum embedding method that enables accurate and efficient treatment of interactions between molecules and an environment, while explicitly including many-body correlations. The molecule is composed of classical nuclei and quantum electrons, whereas the environment is modeled via charged quantum harmonic oscillators. We construct a general Hamiltonian and introduce a variational ansatz for the correlated ground state of the fully interacting molecule/environment system. This wavefunction is optimized via variational Monte Carlo and the ground state energy is subsequently estimated through diffusion Monte Carlo. The proposed scheme allows an explicit many-body treatment of electrostatic, polarization, and dispersion interactions between the molecule and the environment. We study solvation energies and excitation energies of benzene derivatives, obtaining excellent agreement with explicit ab initio calculations and experiment.
△ Less
Submitted 28 March, 2023;
originally announced March 2023.
-
Optimized Quantum Drude Oscillators for Atomic and Molecular Response Properties
Authors:
Szabolcs Góger,
Almaz Khabibrakhmanov,
Ornella Vaccarelli,
Dmitry V. Fedorov,
Alexandre Tkatchenko
Abstract:
The quantum Drude oscillator (QDO) is an efficient yet accurate coarse-grained approach that has been widely used to model electronic and optical response properties of atoms and molecules, as well as polarization and dispersion interactions between them. Three effective parameters (frequency, mass, charge) fully characterize the QDO Hamiltonian and are adjusted to reproduce response properties. H…
▽ More
The quantum Drude oscillator (QDO) is an efficient yet accurate coarse-grained approach that has been widely used to model electronic and optical response properties of atoms and molecules, as well as polarization and dispersion interactions between them. Three effective parameters (frequency, mass, charge) fully characterize the QDO Hamiltonian and are adjusted to reproduce response properties. However, the soaring success of coupled QDOs for many-atom systems remains fundamentally unexplained and the optimal map** between atoms/molecules and oscillators has not been established. Here, we present an optimized parametrization (OQDO) where the parameters are fixed by using only dipolar properties. For the periodic table of elements as well as small molecules, our OQDO model accurately reproduces atomic (spatial) polarization potentials and multipolar dispersion coefficients, elucidating the high promise of the presented model in the development of next-generation quantum-mechanical force fields for (bio)molecular simulations.
△ Less
Submitted 7 December, 2022;
originally announced December 2022.
-
Accurate global machine learning force fields for molecules with hundreds of atoms
Authors:
Stefan Chmiela,
Valentin Vassilev-Galindo,
Oliver T. Unke,
Adil Kabylda,
Huziel E. Sauceda,
Alexandre Tkatchenko,
Klaus-Robert Müller
Abstract:
Global machine learning force fields (MLFFs), that have the capacity to capture collective many-atom interactions in molecular systems, currently only scale up to a few dozen atoms due a considerable growth of the model complexity with system size. For larger molecules, locality assumptions are typically introduced, with the consequence that non-local interactions are poorly or not at all describe…
▽ More
Global machine learning force fields (MLFFs), that have the capacity to capture collective many-atom interactions in molecular systems, currently only scale up to a few dozen atoms due a considerable growth of the model complexity with system size. For larger molecules, locality assumptions are typically introduced, with the consequence that non-local interactions are poorly or not at all described, even if those interactions are contained within the reference ab initio data. Here, we approach this challenge and develop an exact iterative parameter-free approach to train global symmetric gradient domain machine learning (sGDML) force fields for systems with up to several hundred atoms, without resorting to any localization of atomic interactions or other potentially uncontrolled approximations. This means that all atomic degrees of freedom remain fully correlated in the global sGDML FF, allowing the accurate description of complex molecules and materials that present phenomena with far-reaching characteristic correlation lengths. We assess the accuracy and efficiency of our MLFFs on a newly developed MD22 benchmark dataset containing molecules from 42 to 370 atoms. The robustness of our approach is demonstrated in nanosecond long path-integral molecular dynamics simulations for the supramolecular complexes in the MD22 dataset.
△ Less
Submitted 14 November, 2022; v1 submitted 29 September, 2022;
originally announced September 2022.
-
Towards Linearly Scaling and Chemically Accurate Global Machine Learning Force Fields for Large Molecules
Authors:
Adil Kabylda,
Valentin Vassilev-Galindo,
Stefan Chmiela,
Igor Poltavsky,
Alexandre Tkatchenko
Abstract:
Machine learning force fields (MLFFs) are gradually evolving towards enabling molecular dynamics simulations of molecules and materials with ab initio accuracy but at a small fraction of the computational cost. However, several challenges remain to be addressed to enable predictive MLFF simulations of realistic molecules, including: (1) develo** efficient descriptors for non-local interatomic in…
▽ More
Machine learning force fields (MLFFs) are gradually evolving towards enabling molecular dynamics simulations of molecules and materials with ab initio accuracy but at a small fraction of the computational cost. However, several challenges remain to be addressed to enable predictive MLFF simulations of realistic molecules, including: (1) develo** efficient descriptors for non-local interatomic interactions, which are essential to capture long-range molecular fluctuations, and (2) reducing the dimensionality of the descriptors in kernel methods (or a number of parameters in neural networks) to enhance the applicability and interpretability of MLFFs. Here we propose an automatized approach to substantially reduce the number of interatomic descriptor features while preserving the accuracy and increasing the efficiency of MLFFs. To simultaneously address the two stated challenges, we illustrate our approach on the example of the global GDML MLFF; however, our methodology can be equally applied to other models. We found that non-local features (atoms separated by as far as 15$~Å$ in studied systems) are crucial to retain the overall accuracy of the MLFF for peptides, DNA base pairs, fatty acids, and supramolecular complexes. Interestingly, the number of required non-local features in the reduced descriptors becomes comparable to the number of local interatomic features (those below 5$~Å$). These results pave the way to constructing global molecular MLFFs whose cost increases linearly, instead of quadratically, with system size.
△ Less
Submitted 12 April, 2023; v1 submitted 8 September, 2022;
originally announced September 2022.
-
The three-center two-positron bond
Authors:
Jorge Charry,
Félix Moncada,
Matteo Barborini,
Laura Pedraza-González,
Márcio T. do N. Varella,
Alexandre Tkatchenko,
Andrés Reyes
Abstract:
Computational studies have shown that one or more positrons can stabilize two repelling atomic anions through the formation of two-center positronic bonds. In the present work, we study the energetic stability of a system containing two positrons and three hydride anions, namely 2\ce{e^+[H^{3-}_3]}. To this aim, we performed a preliminary scan of the potential energy surface of the system with bot…
▽ More
Computational studies have shown that one or more positrons can stabilize two repelling atomic anions through the formation of two-center positronic bonds. In the present work, we study the energetic stability of a system containing two positrons and three hydride anions, namely 2\ce{e^+[H^{3-}_3]}. To this aim, we performed a preliminary scan of the potential energy surface of the system with both electrons and positron in a spin singlet state, with a multi-component MP2 method, that was further refined with variational and diffusion Monte Carlo calculations, and confirmed an equilibrium geometry with \ce{D_${3h}$} symmetry. The local stability of 2\ce{e^+[H^{3-}_3]} is demonstrated by analyzing the vertical detachment and adiabatic energy dissociation channels. Bonding properties of the positronic compound, such as the equilibrium interatomic distances, force constants, dissociation energies, and bonding densities are compared with those of the purely electronic \ce{H^+_3} and \ce{Li^+_3} systems. Through this analysis, we find compelling similarities between the 2\ce{e^+[H^{3-}_3]} compound and the trilithium cation. Our results strongly point out the formation of a non-electronic three-center two-positron bond, analogous to the well known three-center two-electron counterparts, which is fundamentally distinct from the two-center two-positron bond [D. Bressanini, \textit{J. Chem. Phys.} \textbf{155}, 054306 (2021)], thus extending the concept of positron bonded molecules.
△ Less
Submitted 18 August, 2022;
originally announced August 2022.
-
Second Quantization Approach to Many-Body Dispersion Interactions
Authors:
Matteo Gori,
Philip Kurian,
Alexandre Tkatchenko
Abstract:
The many-body dispersion (MBD) framework is a successful approach for modeling the long-range electronic correlation energy and optical response of systems with thousands of atoms. Inspired by field theory, here we develop a second-quantized MBD formalism (SQ-MBD) that recasts a system of atomic quantum Drude oscillators in a Fock-space representation. SQ-MBD provides (I) tools for projecting obse…
▽ More
The many-body dispersion (MBD) framework is a successful approach for modeling the long-range electronic correlation energy and optical response of systems with thousands of atoms. Inspired by field theory, here we develop a second-quantized MBD formalism (SQ-MBD) that recasts a system of atomic quantum Drude oscillators in a Fock-space representation. SQ-MBD provides (I) tools for projecting observables (interaction energy, transition multipoles, polarizability tensors) on coarse-grained representations of the atomistic system ranging from single atoms to large structural motifs, (ii) a quantum-information framework to analyze correlations and (non)separability among fragments in a given molecular complex, and (iii) a path toward the applicability of the MBD framework to molecular complexes with millions of atoms. The SQ-MBD approach offers novel insights into quantum fluctuations in molecular systems and enables direct coupling of collective plasmon-like MBD degrees of freedom with arbitrary environments, providing a tractable computational framework to treat dispersion interactions and polarization response in intricate systems.
△ Less
Submitted 25 April, 2023; v1 submitted 23 May, 2022;
originally announced May 2022.
-
Accurate Machine Learned Quantum-Mechanical Force Fields for Biomolecular Simulations
Authors:
Oliver T. Unke,
Martin Stöhr,
Stefan Ganscha,
Thomas Unterthiner,
Hartmut Maennel,
Sergii Kashubin,
Daniel Ahlin,
Michael Gastegger,
Leonardo Medrano Sandonas,
Alexandre Tkatchenko,
Klaus-Robert Müller
Abstract:
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes. Accurate MD simulations require computationally demanding quantum-mechanical calculations, being practically limited to short timescales and few atoms. For larger systems, efficient, but much less reliable empirical force fields are used. Recently, machine learned force fields (MLFFs) emerged as an…
▽ More
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes. Accurate MD simulations require computationally demanding quantum-mechanical calculations, being practically limited to short timescales and few atoms. For larger systems, efficient, but much less reliable empirical force fields are used. Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations, offering similar accuracy as ab initio methods at orders-of-magnitude speedup. Until now, MLFFs mainly capture short-range interactions in small molecules or periodic materials, due to the increased complexity of constructing models and obtaining reliable reference data for large molecules, where long-ranged many-body effects become important. This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations (GEMS) by training on "bottom-up" and "top-down" molecular fragments of varying size, from which the relevant physicochemical interactions can be learned. GEMS is applied to study the dynamics of alanine-based peptides and the 46-residue protein crambin in aqueous solution, allowing nanosecond-scale MD simulations of >25k atoms at essentially ab initio quality. Our findings suggest that structural motifs in peptides and proteins are more flexible than previously thought, indicating that simulations at ab initio accuracy might be necessary to understand dynamic biomolecular processes such as protein (mis)folding, drug-protein binding, or allosteric regulation.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
Quantum Machine Learning Corrects Classical Force Fields: Stretching DNA Base Pairs in Explicit Solvent
Authors:
Joshua T. Berryman,
Amirhossein Taghavi,
Florian Mazur,
Alexandre Tkatchenko
Abstract:
In order to improve the accuracy of molecular dynamics simulations, classical force fields are supplemented with a kernel-based machine learning method trained on quantum-mechanical fragment energies. As an example application, a potential-energy surface is generalised for a small DNA duplex, taking into account explicit solvation and long-range electron exchange--correlation effects. Study of the…
▽ More
In order to improve the accuracy of molecular dynamics simulations, classical force fields are supplemented with a kernel-based machine learning method trained on quantum-mechanical fragment energies. As an example application, a potential-energy surface is generalised for a small DNA duplex, taking into account explicit solvation and long-range electron exchange--correlation effects. Study of the corrected potential energy versus extension shows that leading classical DNA models have excessive stiffness with respect to stretching. This discrepancy is found to be common across multiple forcefields. The quantum correction is in qualitative agreement to the experimental thermodynamics for larger DNA double helices, providing a candidate explanation for the general and long-standing discrepancy between single molecule stretching experiments and classical calculations of DNA stretching. The new dataset of quantum calculations and the associated Kernel Modified Molecular Dynamics (KMMD) method should be of general utility in biomolecular simulations. KMMD is made available as part of the AMBER22 simulation software.
△ Less
Submitted 9 September, 2022; v1 submitted 29 March, 2022;
originally announced March 2022.
-
Anisotropic van der Waals Dispersion Forces in Polymers: Structural Symmetry Breaking Leads to Enhanced Conformational Search
Authors:
Mario Galante,
Alexandre Tkatchenko
Abstract:
The modeling of conformations and dynamics of (bio)polymers is of primary importance for understanding physicochemical properties of soft matter. Although short-range interactions such as covalent and hydrogen bonding control the local arrangement of polymers, non-covalent interactions play a dominant role in determining the global conformations. Here we focus on how the inclusion of many-body eff…
▽ More
The modeling of conformations and dynamics of (bio)polymers is of primary importance for understanding physicochemical properties of soft matter. Although short-range interactions such as covalent and hydrogen bonding control the local arrangement of polymers, non-covalent interactions play a dominant role in determining the global conformations. Here we focus on how the inclusion of many-body effects in van der Waals dispersion affects the outcome of geometry optimizations and molecular dynamics simulations of model polymers. We find that delocalized force contributions are key to explore the conformational landscape, as they induce an anisotropic polarization response which efficiently guides the conformation towards globally optimized structures. This is in contrast with the commonly used pair-wise approach, where the atomic polarizabilities lack information on the global geometry. We show that such local approximation causes the conformational search to be obstructed by conformations with unphysically limited spatial symmetry, while the many-body formalism strongly reduces the roughness of the potential energy landscape.
△ Less
Submitted 23 August, 2022; v1 submitted 13 October, 2021;
originally announced October 2021.
-
BIGDML: Towards Exact Machine Learning Force Fields for Materials
Authors:
Huziel E. Sauceda,
Luis E. Gálvez-González,
Stefan Chmiela,
Lauro Oliver Paz-Borbón,
Klaus-Robert Müller,
Alexandre Tkatchenko
Abstract:
Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof. Currently, MLFFs often introduce tradeoffs that restrict their practical applicability to small subsets of chemical space or require exhaustive datasets for training. Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning (BI…
▽ More
Machine-learning force fields (MLFF) should be accurate, computationally and data efficient, and applicable to molecules, materials, and interfaces thereof. Currently, MLFFs often introduce tradeoffs that restrict their practical applicability to small subsets of chemical space or require exhaustive datasets for training. Here, we introduce the Bravais-Inspired Gradient-Domain Machine Learning (BIGDML) approach and demonstrate its ability to construct reliable force fields using a training set with just 10-200 geometries for materials including pristine and defect-containing 2D and 3D semiconductors and metals, as well as chemisorbed and physisorbed atomic and molecular adsorbates on surfaces. The BIGDML model employs the full relevant symmetry group for a given material, does not assume artificial atom types or localization of atomic interactions and exhibits high data efficiency and state-of-the-art energy accuracies (errors substantially below 1 meV per atom) for an extended set of materials. Extensive path-integral molecular dynamics carried out with BIGDML models demonstrate the counterintuitive localization of benzene--graphene dynamics induced by nuclear quantum effects and allow to rationalize the Arrhenius behavior of hydrogen diffusion coefficient in a Pd crystal for a wide range of temperatures.
△ Less
Submitted 8 June, 2021;
originally announced June 2021.
-
Comprehensive Quantum Framework for Describing Retarded and Non-Retarded Molecular Interactions in External Electric Fields
Authors:
Mohammad Reza Karimpour,
Dmitry V. Fedorov,
Alexandre Tkatchenko
Abstract:
We employ various quantum-mechanical approaches for studying the impact of electric fields on both nonretarded and retarded noncovalent interactions between atoms or molecules. To this end, we apply perturbative and non-perturbative methods within the frameworks of quantum mechanics (QM) as well as quantum electrodynamics (QED). In addition, to provide a transparent physical picture of the differe…
▽ More
We employ various quantum-mechanical approaches for studying the impact of electric fields on both nonretarded and retarded noncovalent interactions between atoms or molecules. To this end, we apply perturbative and non-perturbative methods within the frameworks of quantum mechanics (QM) as well as quantum electrodynamics (QED). In addition, to provide a transparent physical picture of the different types of resulting interactions, we employ a stochastic electrodynamic approach based on the zero-point fluctuating field. Atomic response properties are described via harmonic Drude oscillators - an efficient model system that permits an analytical solution and has been convincingly shown to yield accurate results when modeling non-retarded intermolecular interactions. The obtained intermolecular energy contributions are classified as field-induced (FI) electrostatics, FI polarization, and dispersion interactions. The interplay between these three types of interactions enables the manipulation of molecular dimer conformations by applying transversal or longitudinal electric fields along the intermolecular axis. Our framework combining four complementary theoretical approaches paves the way toward a systematic description and improved understanding of molecular interactions when molecules are subject to both external and vacuum fields.
△ Less
Submitted 13 October, 2021; v1 submitted 30 March, 2021;
originally announced March 2021.
-
Molecular Interactions Induced by a Static Electric Field in Quantum Mechanics and Quantum Electrodynamics
Authors:
Mohammad Reza Karimpour,
Dmitry V. Fedorov,
Alexandre Tkatchenko
Abstract:
By means of quantum mechanics and quantum electrodynamics applied to coupled harmonic Drude oscillators, we study the interaction between two neutral atoms or molecules subject to a uniform static electric field. Our focus is to understand the interplay between leading contributions to field-induced electrostatics/polarization and dispersion interactions, as considered within the employed Drude mo…
▽ More
By means of quantum mechanics and quantum electrodynamics applied to coupled harmonic Drude oscillators, we study the interaction between two neutral atoms or molecules subject to a uniform static electric field. Our focus is to understand the interplay between leading contributions to field-induced electrostatics/polarization and dispersion interactions, as considered within the employed Drude model for both non-retarded and retarded regimes. For the first case, we present an exact solution for two coupled oscillators obtained by diagonalizing the corresponding quantum-mechanical Hamiltonian and demonstrate that the external field can control the strength of different intermolecular interactions and relative orientations of the molecules. In the retarded regime described by quantum electrodynamics, our analysis shows that field-induced electrostatic and polarization energies remain unchanged (in isotropic and homogeneous vacuum) compared to the nonretarded case. For interacting species modeled by quantum Drude oscillators, the developed framework based on quantum mechanics and quantum electrodynamics yields the leading contributions to molecular interactions under the combined action of external and vacuum fields.
△ Less
Submitted 13 March, 2022; v1 submitted 30 March, 2021;
originally announced March 2021.
-
Improving Molecular Force Fields Across Configurational Space by Combining Supervised and Unsupervised Machine Learning
Authors:
Gregory Fonseca,
Igor Poltavsky,
Valentin Vassilev-Galindo,
Alexandre Tkatchenko
Abstract:
The training set of atomic configurations is key to the performance of any Machine Learning Force Field (MLFF) and, as such, the training set selection determines the applicability of the MLFF model for predictive molecular simulations. However, most atomistic reference datasets are inhomogeneously distributed across configurational space (CS), thus choosing the training set randomly or according…
▽ More
The training set of atomic configurations is key to the performance of any Machine Learning Force Field (MLFF) and, as such, the training set selection determines the applicability of the MLFF model for predictive molecular simulations. However, most atomistic reference datasets are inhomogeneously distributed across configurational space (CS), thus choosing the training set randomly or according to the probability distribution of the data leads to models whose accuracy is mainly defined by the most common close-to-equilibrium configurations in the reference data. In this work, we combine unsupervised and supervised ML methods to bypass the inherent bias of the data for common configurations, effectively widening the applicability range of MLFF to the fullest capabilities of the dataset. To achieve this goal, we first cluster the CS into subregions similar in terms of geometry and energetics. We iteratively test a given MLFF performance on each subregion and fill the training set of the model with the representatives of the most inaccurate parts of the CS. The proposed approach has been applied to a set of small organic molecules and alanine tetrapeptide, demonstrating an up to two-fold decrease in the root mean squared errors for force predictions of these molecules. This result holds for both kernel-based methods (sGDML and GAP/SOAP models) and deep neural networks (SchNet model). For the latter, the developed approach simultaneously improves both energy and forces, bypassing the compromise to be made when employing mixed energy/force loss functions.
△ Less
Submitted 2 March, 2021;
originally announced March 2021.
-
Challenges for Machine Learning Force Fields in Reproducing Potential Energy Surfaces of Flexible Molecules
Authors:
Valentin Vassilev-Galindo,
Gregory Fonseca,
Igor Poltavsky,
Alexandre Tkatchenko
Abstract:
Dynamics of flexible molecules are often determined by an interplay between local chemical bond fluctuations and conformational changes driven by long-range electrostatics and van der Waals interactions. This interplay between interactions yields complex potential-energy surfaces (PES) with multiple minima and transition paths between them. In this work, we assess the performance of state-of-the-a…
▽ More
Dynamics of flexible molecules are often determined by an interplay between local chemical bond fluctuations and conformational changes driven by long-range electrostatics and van der Waals interactions. This interplay between interactions yields complex potential-energy surfaces (PES) with multiple minima and transition paths between them. In this work, we assess the performance of state-of-the-art Machine Learning (ML) models, namely sGDML, SchNet, GAP/SOAP, and BPNN for reproducing such PES, while using limited amounts of reference data. As a benchmark, we use the cis to trans thermal relaxation in an azobenzene molecule, where at least three different transition mechanisms should be considered. Although GAP/SOAP, SchNet, and sGDML models can globally achieve chemical accuracy of 1 kcal mol-1 with fewer than 1000 training points, predictions greatly depend on the ML method used as well as the local region of the PES being sampled. Within a given ML method, large differences can be found between predictions of close-to-equilibrium and transition regions, as well as for different transition mechanisms. We identify key challenges that the ML models face in learning long-range interactions and the intrinsic limitations of commonly used atom-based descriptors. All in all, our results suggest switching from learning the entire PES within a single model to using multiple local models with optimized descriptors, training sets, and architectures for different parts of complex PES.
△ Less
Submitted 1 March, 2021;
originally announced March 2021.
-
Combining Machine Learning and Computational Chemistry for Predictive Insights Into Chemical Systems
Authors:
John A. Keith,
Valentin Vassilev-Galindo,
Bingqing Cheng,
Stefan Chmiela,
Michael Gastegger,
Klaus-Robert Müller,
Alexandre Tkatchenko
Abstract:
Machine learning models are poised to make a transformative impact on chemical sciences by dramatically accelerating computational algorithms and amplifying insights available from computational chemistry methods. However, achieving this requires a confluence and coaction of expertise in computer science and physical sciences. This review is written for new and experienced researchers working at t…
▽ More
Machine learning models are poised to make a transformative impact on chemical sciences by dramatically accelerating computational algorithms and amplifying insights available from computational chemistry methods. However, achieving this requires a confluence and coaction of expertise in computer science and physical sciences. This review is written for new and experienced researchers working at the intersection of both fields. We first provide concise tutorials of computational chemistry and machine learning methods, showing how insights involving both can be achieved. We then follow with a critical review of noteworthy applications that demonstrate how computational chemistry and machine learning can be used together to provide insightful (and useful) predictions in molecular and materials modeling, retrosyntheses, catalysis, and drug design.
△ Less
Submitted 11 February, 2021;
originally announced February 2021.
-
Quantum-Mechanical Force Balance Between Multipolar Dispersion and Pauli Repulsion in Atomic van der Waals Dimers
Authors:
Ornella Vaccarelli,
Dmitry V. Fedorov,
Martin Stöhr,
Alexandre Tkatchenko
Abstract:
The structure and stability of atomic and molecular systems with van der Waals (vdW) bonding are often determined by the interplay between attractive dispersion interactions and repulsive interactions caused by electron confinement. Arising due to different mechanisms -- electron correlation for dispersion and the Pauli exclusion principle for exchange-repulsion -- these interactions do not appear…
▽ More
The structure and stability of atomic and molecular systems with van der Waals (vdW) bonding are often determined by the interplay between attractive dispersion interactions and repulsive interactions caused by electron confinement. Arising due to different mechanisms -- electron correlation for dispersion and the Pauli exclusion principle for exchange-repulsion -- these interactions do not appear to have a straightforward connection. In this paper, we use a coarse-grained approach for evaluating the exchange energy for two coupled quantum Drude oscillators and investigate the mutual compensation of the attractive and repulsive forces at the equilibrium distance within the multipole expansion of the Coulomb potential. This compensation yields a compact formula relating the vdW radius of an atom to its multipole polarizabilities, $R_{\rm vdW} = A_l^{\,}\, α_l^{{2}/{7(l+1)}}$, where $l$ is the multipole rank and $A_l$ is a conversion factor. Such a relation is compelling because it connects an electronic property of an isolated atom (atomic polarizability) with an equilibrium distance in a dimer composed of two closed-shell atoms. We assess the accuracy of the revealed formula for noble-gas, alkaline-earth, and alkali atoms and show that the $A_l$ can be assumed to be universal constants. Besides a seamless definition of vdW radii, the proposed relation can also be used for the efficient determination of atomic multipole polarizabilities solely based on the corresponding dipole polarizability and the vdW radius. Finally, our work provides a basis for the construction of efficient and minimally-empirical interatomic potentials by combining multipolar interatomic exchange and dispersion forces on an equal footing.
△ Less
Submitted 10 August, 2021; v1 submitted 21 January, 2021;
originally announced January 2021.
-
Four-Dimensional Scaling of Dipole Polarizability in Quantum Systems
Authors:
Peter Szabo,
Szabolcs Goger,
Jorge Charry,
Mohammad Reza Karimpour,
Dmitry V. Fedorov,
Alexandre Tkatchenko
Abstract:
Polarizability is a key response property of physical and chemical systems, which has an impact on intermolecular interactions, spectroscopic observables, and vacuum polarization. The calculation of polarizability for quantum systems involves an infinite sum over all excited (bound and continuum) states, concealing the physical interpretation of polarization mechanisms and complicating the derivat…
▽ More
Polarizability is a key response property of physical and chemical systems, which has an impact on intermolecular interactions, spectroscopic observables, and vacuum polarization. The calculation of polarizability for quantum systems involves an infinite sum over all excited (bound and continuum) states, concealing the physical interpretation of polarization mechanisms and complicating the derivation of efficient response models. Approximate expressions for the dipole polarizability, $α$, rely on different scaling laws $α\propto$ $R^3$, $R^4$, or $R^7$, for various definitions of the system radius $R$. Here, we consider a range of single-particle quantum systems of varying spatial dimensionality and having qualitatively different spectra, demonstrating that their polarizability follows a universal four-dimensional scaling law $α= C (4 μq^2/\hbar^2)L^4$, where $μ$ and $q$ are the (effective) particle mass and charge, $C$ is a dimensionless excitation-energy ratio, and the characteristic length $L$ is defined via the $\mathcal{L}^2$-norm of the position operator. %The applicability of this unified formula is demonstrated by accurately predicting the dipole polarizability of 36 atoms and 1641 small organic~molecules. This unified formula is also applicable to many-particle systems, as shown by} accurately predicting the dipole polarizability of 36 atoms, 1641 small organic \rrr{molecules, and Bloch electrons in periodic systems.
△ Less
Submitted 23 January, 2022; v1 submitted 22 October, 2020;
originally announced October 2020.
-
Machine Learning Force Fields
Authors:
Oliver T. Unke,
Stefan Chmiela,
Huziel E. Sauceda,
Michael Gastegger,
Igor Poltavsky,
Kristof T. Schütt,
Alexandre Tkatchenko,
Klaus-Robert Müller
Abstract:
In recent years, the use of Machine Learning (ML) in computational chemistry has enabled numerous advances previously out of reach due to the computational complexity of traditional electronic-structure methods. One of the most promising applications is the construction of ML-based force fields (FFs), with the aim to narrow the gap between the accuracy of ab initio methods and the efficiency of cl…
▽ More
In recent years, the use of Machine Learning (ML) in computational chemistry has enabled numerous advances previously out of reach due to the computational complexity of traditional electronic-structure methods. One of the most promising applications is the construction of ML-based force fields (FFs), with the aim to narrow the gap between the accuracy of ab initio methods and the efficiency of classical FFs. The key idea is to learn the statistical relation between chemical structure and potential energy without relying on a preconceived notion of fixed chemical bonds or knowledge about the relevant interactions. Such universal ML approximations are in principle only limited by the quality and quantity of the reference data used to train them. This review gives an overview of applications of ML-FFs and the chemical insights that can be obtained from them. The core concepts underlying ML-FFs are described in detail and a step-by-step guide for constructing and testing them from scratch is given. The text concludes with a discussion of the challenges that remain to be overcome by the next generation of ML-FFs.
△ Less
Submitted 12 January, 2021; v1 submitted 14 October, 2020;
originally announced October 2020.
-
Interactions between Large Molecules: Puzzle for Reference Quantum-Mechanical Methods
Authors:
Yasmine S. Al-Hamdani,
Péter R. Nagy,
Dennis Barton,
Mihály Kállay,
Jan Gerit Brandenburg,
Alexandre Tkatchenko
Abstract:
Quantum-mechanical methods are widely used for understanding molecular interactions throughout biology, chemistry, and materials science. Quantum diffusion Monte Carlo (DMC) and coupled cluster with single, double, and perturbative triple excitations [CCSD(T)] are two state-of-the-art and trusted wavefunction methods that have been categorically shown to yield accurate interaction energies for sma…
▽ More
Quantum-mechanical methods are widely used for understanding molecular interactions throughout biology, chemistry, and materials science. Quantum diffusion Monte Carlo (DMC) and coupled cluster with single, double, and perturbative triple excitations [CCSD(T)] are two state-of-the-art and trusted wavefunction methods that have been categorically shown to yield accurate interaction energies for small organic molecules. These methods provide valuable reference information for widely-used semi-empirical and machine learning potentials, especially where experimental information is scarce. However, agreement for systems beyond small molecules is a crucial remaining milestone for cementing the benchmark accuracy of these methods. Approaching such well-converged predictive power in larger molecules has motivated major developments in CCSD(T) as well as DMC algorithms in the past years, resulting in orders of magnitude time-to-solution reductions. Here, we show that CCSD(T) and DMC interaction energies are not in consistent agreement for a set of polarizable supramolecules. Whilst agreement is found for some of the complexes, in a few key systems disagreements of up to 8 kcal/mol remained. This leads to differences of up to 6 orders of magnitude in the corresponding binding association constant at room temperature for systems which are well within the accustomed domain of applicability for both methods. These findings thus indicate that more caution is required when aiming at reproducible non-covalent interactions between extended molecules. Our data contradicts the expectation that the most comprehensive and robust wavefunction methods predict identical non-covalent interactions and indicate an unsolved challenge for benchmark approaches.
△ Less
Submitted 18 September, 2020;
originally announced September 2020.
-
Molecular Force Fields with Gradient-Domain Machine Learning (GDML): Comparison and Synergies with Classical Force Fields
Authors:
Huziel E. Sauceda,
Michael Gastegger,
Stefan Chmiela,
Klaus-Robert Müller,
Alexandre Tkatchenko
Abstract:
Modern machine learning force fields (ML-FF) are able to yield energy and force predictions at the accuracy of high-level $ab~initio$ methods, but at a much lower computational cost. On the other hand, classical molecular mechanics force fields (MM-FF) employ fixed functional forms and tend to be less accurate, but considerably faster and transferable between molecules of the same class. In this w…
▽ More
Modern machine learning force fields (ML-FF) are able to yield energy and force predictions at the accuracy of high-level $ab~initio$ methods, but at a much lower computational cost. On the other hand, classical molecular mechanics force fields (MM-FF) employ fixed functional forms and tend to be less accurate, but considerably faster and transferable between molecules of the same class. In this work, we investigate how both approaches can complement each other. We contrast the ability of ML-FF for reconstructing dynamic and thermodynamic observables to MM-FFs in order to gain a qualitative understanding of the differences between the two approaches. This analysis enables us to modify the generalized AMBER force field (GAFF) by reparametrizing short-range and bonded interactions with more expressive terms to make them more accurate, without sacrificing the key properties that make MM-FFs so successful.
△ Less
Submitted 10 August, 2020;
originally announced August 2020.
-
Coulomb Interactions between Dipolar Quantum Fluctuations in van der Waals Bound Molecules and Materials
Authors:
Martin Stöhr,
Mainak Sadhukhan,
Yasmine S. Al-Hamdani,
Jan Hermann,
Alexandre Tkatchenko
Abstract:
Mutual Coulomb interactions between electrons lead to a plethora of interesting physical and chemical effects, especially if those interactions involve many fluctuating electrons over large spatial scales. Here, we identify and study in detail the Coulomb interaction between dipolar quantum fluctuations in the context of van der Waals complexes and materials. Up to now, the interaction arising fro…
▽ More
Mutual Coulomb interactions between electrons lead to a plethora of interesting physical and chemical effects, especially if those interactions involve many fluctuating electrons over large spatial scales. Here, we identify and study in detail the Coulomb interaction between dipolar quantum fluctuations in the context of van der Waals complexes and materials. Up to now, the interaction arising from the modification of the electron density due to quantum van der Waals interactions was considered to be vanishingly small. We demonstrate that in supramolecular systems and for molecules embedded in nanostructures, such contributions can amount to up to 6 kJ/mol and can even lead to qualitative changes in the long-range vdW interaction. Taking into account these broad implications, we advocate for the systematic assessment of so-called Coulomb singles in large molecular systems and discuss their relevance for explaining several recent puzzling experimental observations of collective behavior in nanostructured materials.
△ Less
Submitted 24 July, 2020;
originally announced July 2020.
-
Fine-Structure Constant Connects the Polarizability of Atoms and Vacuum
Authors:
Alexandre Tkatchenko,
Dmitry V. Fedorov
Abstract:
We examine the recently derived quantum-mechanical relation between atomic polarizabilities and equilibrium internuclear distances in van der Waals (vdW) bonded diatomic systems [Phys. Rev. Lett. {\bf 121}, 183401 (2018)]. For homonuclear dimers, this relation is described by the compact formula $α_{\rm m}^{\rm q} = ΦR_{\rm vdW}^7$, where the constant factor in front of the vdW radius was determin…
▽ More
We examine the recently derived quantum-mechanical relation between atomic polarizabilities and equilibrium internuclear distances in van der Waals (vdW) bonded diatomic systems [Phys. Rev. Lett. {\bf 121}, 183401 (2018)]. For homonuclear dimers, this relation is described by the compact formula $α_{\rm m}^{\rm q} = ΦR_{\rm vdW}^7$, where the constant factor in front of the vdW radius was determined empirically. Here, we derive $Φ= (4πε_0/a_0^4) \times α^{4/3}$ expressed in terms of the vacuum electric permittivity $ε_0$, the Bohr radius $a_0$, and the fine-structure constant $α$. The validity of the obtained formula is confirmed by estimating the value of the fine-structure constant from non-relativistic quantum-mechanical calculations of atomic polarizabilities and equilibrium internuclear vdW distances. The presented derivation allows to interpret the fine-structure constant as the ratio between the polarizability densities of vacuum and matter, whereas the vdW radius becomes a geometrical length scale of atoms endowed by the vacuum field.
△ Less
Submitted 6 July, 2020;
originally announced July 2020.
-
QM7-X: A comprehensive dataset of quantum-mechanical properties spanning the chemical space of small organic molecules
Authors:
Johannes Hoja,
Leonardo Medrano Sandonas,
Brian G. Ernst,
Alvaro Vazquez-Mayagoitia,
Robert A. DiStasio Jr.,
Alexandre Tkatchenko
Abstract:
We introduce QM7-X, a comprehensive dataset of 42 physicochemical properties for $\approx$ 4.2 M equilibrium and non-equilibrium structures of small organic molecules with up to seven non-hydrogen (C, N, O, S, Cl) atoms. To span this fundamentally important region of chemical compound space (CCS), QM7-X includes an exhaustive sampling of (meta-)stable equilibrium structures - comprised of constitu…
▽ More
We introduce QM7-X, a comprehensive dataset of 42 physicochemical properties for $\approx$ 4.2 M equilibrium and non-equilibrium structures of small organic molecules with up to seven non-hydrogen (C, N, O, S, Cl) atoms. To span this fundamentally important region of chemical compound space (CCS), QM7-X includes an exhaustive sampling of (meta-)stable equilibrium structures - comprised of constitutional/structural isomers and stereoisomers, e.g., enantiomers and diastereomers (including cis-/trans- and conformational isomers) - as well as 100 non-equilibrium structural variations thereof to reach a total of $\approx$ 4.2 M molecular structures. Computed at the tightly converged quantum-mechanical PBE0+MBD level of theory, QM7-X contains global (molecular) and local (atom-in-a-molecule) properties ranging from ground state quantities (such as atomization energies and dipole moments) to response quantities (such as polarizability tensors and dispersion coefficients). By providing a systematic, extensive, and tightly-converged dataset of quantum-mechanically computed physicochemical properties, we expect that QM7-X will play a critical role in the development of next-generation machine-learning based models for exploring greater swaths of CCS and performing in silico design of molecules with targeted properties.
△ Less
Submitted 26 June, 2020;
originally announced June 2020.
-
Dynamical Strengthening of Covalent and Non-Covalent Molecular Interactions by Nuclear Quantum Effects at Finite Temperature
Authors:
Huziel E. Sauceda,
Valentin Vassilev-Galindo,
Stefan Chmiela,
Klaus-Robert Müller,
Alexandre Tkatchenko
Abstract:
Nuclear quantum effects (NQE) tend to generate delocalized molecular dynamics due to the inclusion of the zero point energy and its coupling with the anharmonicities in interatomic interactions. Here, we present evidence that NQE often enhance electronic interactions and, in turn, can result in dynamical molecular stabilization at finite temperature. The underlying physical mechanism promoted by N…
▽ More
Nuclear quantum effects (NQE) tend to generate delocalized molecular dynamics due to the inclusion of the zero point energy and its coupling with the anharmonicities in interatomic interactions. Here, we present evidence that NQE often enhance electronic interactions and, in turn, can result in dynamical molecular stabilization at finite temperature. The underlying physical mechanism promoted by NQE depends on the particular interaction under consideration. First, the effective reduction of interatomic distances between functional groups within a molecule can enhance the $n\toπ^*$ interaction by increasing the overlap between molecular orbitals or by strengthening electrostatic interactions between neighboring charge densities. Second, NQE can localize methyl rotors by temporarily changing molecular bond orders and leading to the emergence of localized transient rotor states. Third, for noncovalent van der Waals interactions the strengthening comes from the increase of the polarizability given the expanded average interatomic distances induced by NQE. The implications of these boosted interactions include counterintuitive hydroxyl--hydroxyl bonding, hindered methyl rotor dynamics, and molecular stiffening which generates smoother free-energy surfaces. Our findings yield new insights into the versatile role of nuclear quantum fluctuations in molecules and materials.
△ Less
Submitted 18 June, 2020;
originally announced June 2020.
-
Accurate Many-Body Repulsive Potentials for Density-Functional Tight-Binding from Deep Tensor Neural Networks
Authors:
Martin Stöhr,
Leonardo Medrano Sandonas,
Alexandre Tkatchenko
Abstract:
We combine density-functional tight-binding (DFTB) with deep tensor neural networks (DTNN) to maximize the strengths of both approaches in predicting structural, energetic, and vibrational molecular properties. The DTNN is used to learn a non-linear model for the localized many-body interatomic repulsive energy, which so far has been treated in an atom-pairwise manner in DFTB. Substantially improv…
▽ More
We combine density-functional tight-binding (DFTB) with deep tensor neural networks (DTNN) to maximize the strengths of both approaches in predicting structural, energetic, and vibrational molecular properties. The DTNN is used to learn a non-linear model for the localized many-body interatomic repulsive energy, which so far has been treated in an atom-pairwise manner in DFTB. Substantially improving upon standard DFTB and DTNN, the resulting DFTB-NN$_{\sf{rep}}$ model yields accurate predictions of atomization and isomerization energies, equilibrium geometries, vibrational frequencies and dihedral rotation profiles for a large variety of organic molecules compared to the hybrid DFT-PBE0 functional. Our results highlight the high potential of combining semi-empirical electronic-structure methods with physically-motivated machine learning approaches for predicting localized many-body interactions. We conclude by discussing future advancements of the DFTB-NN$_{\sf{rep}}$ approach that could enable chemically accurate electronic-structure calculations for systems with tens of thousands of atoms.
△ Less
Submitted 18 June, 2020;
originally announced June 2020.
-
Fluctuational Electrodynamics in Atomic and Macroscopic Systems: van der Waals Interactions and Radiative Heat Transfer
Authors:
Prashanth S. Venkataram,
Jan Hermann,
Alexandre Tkatchenko,
Alejandro W. Rodriguez
Abstract:
We present an approach to describing fluctuational electrodynamic (FED) interactions, particularly van der Waals (vdW) interactions as well as radiative heat transfer (RHT), between material bodies of vastly different length scales, allowing for going between atomistic and continuum treatments of the response of each of these bodies as desired. Any local continuum description of electromagnetic (E…
▽ More
We present an approach to describing fluctuational electrodynamic (FED) interactions, particularly van der Waals (vdW) interactions as well as radiative heat transfer (RHT), between material bodies of vastly different length scales, allowing for going between atomistic and continuum treatments of the response of each of these bodies as desired. Any local continuum description of electromagnetic (EM) response is compatible with our approach, while atomistic descriptions in our approach are based on effective electronic and nuclear oscillator degrees of freedom, encapsulating dissipation, short-range electronic correlations, and collective nuclear vibrations (phonons). While our previous works using this approach have focused on presenting novel results, this work focuses on the derivations underlying these methods. First, we show how the distinction between "atomic" and "macroscopic" bodies is ultimately somewhat arbitrary, as formulas for vdW free energies and RHT look very similar regardless of how the distinction is drawn. Next, we demonstrate that the atomistic description of material response in our approach yields EM interaction matrix elements which are expressed in terms of analytical formulas for compact bodies or semianalytical formulas based on Ewald summation for periodic media; we use this to compute vdW interaction free energies as well as RHT powers among small biological molecules in the presence of a metallic plate as well as between parallel graphene sheets in vacuum, showing strong deviations from conventional macroscopic theories due to the confluence of geometry, phonons, and EM retardation effects. Finally, we propose formulas for efficient computation of FED interactions among material bodies in which those that are treated atomistically as well as those treated through continuum methods may have arbitrary shapes, extending previous surface-integral techniques.
△ Less
Submitted 8 May, 2020;
originally announced May 2020.
-
Machine learning force fields and coarse-grained variables in molecular dynamics: application to materials and biological systems
Authors:
Paraskevi Gkeka,
Gabriel Stoltz,
Amir Barati Farimani,
Zineb Belkacemi,
Michele Ceriotti,
John Chodera,
Aaron R. Dinner,
Andrew Ferguson,
Jean-Bernard Maillet,
Hervé Minoux,
Christine Peter,
Fabio Pietrucci,
Ana Silveira,
Alexandre Tkatchenko,
Zofia Trstanova,
Rafal Wiewiora,
Tony Leliévre
Abstract:
Machine learning encompasses a set of tools and algorithms which are now becoming popular in almost all scientific and technological fields. This is true for molecular dynamics as well, where machine learning offers promises of extracting valuable information from the enormous amounts of data generated by simulation of complex systems. We provide here a review of our current understanding of goals…
▽ More
Machine learning encompasses a set of tools and algorithms which are now becoming popular in almost all scientific and technological fields. This is true for molecular dynamics as well, where machine learning offers promises of extracting valuable information from the enormous amounts of data generated by simulation of complex systems. We provide here a review of our current understanding of goals, benefits, and limitations of machine learning techniques for computational studies on atomistic systems, focusing on the construction of empirical force fields from ab-initio databases and the determination of reaction coordinates for free energy computation and enhanced sampling.
△ Less
Submitted 15 April, 2020;
originally announced April 2020.
-
Accurate Molecular Dynamics Enabled by Efficient Physically-Constrained Machine Learning Approaches
Authors:
Stefan Chmiela,
Huziel E. Sauceda,
Alexandre Tkatchenko,
Klaus-Robert Müller
Abstract:
We develop a combined machine learning (ML) and quantum mechanics approach that enables data-efficient reconstruction of flexible molecular force fields from high-level ab initio calculations, through the consideration of fundamental physical constraints. We discuss how such constraints are recovered and incorporated into ML models. Specifically, we use conservation of energy - a fundamental prope…
▽ More
We develop a combined machine learning (ML) and quantum mechanics approach that enables data-efficient reconstruction of flexible molecular force fields from high-level ab initio calculations, through the consideration of fundamental physical constraints. We discuss how such constraints are recovered and incorporated into ML models. Specifically, we use conservation of energy - a fundamental property of closed classical and quantum mechanical systems -- to derive an efficient gradient-domain machine learning (GDML) model. The challenge of constructing conservative force fields is accomplished by learning in a Hilbert space of vector-valued functions that obey the law of energy conservation. We proceed with the development of a multi-partite matching algorithm that enables a fully automated recovery of physically relevant point-group and fluxional symmetries from the training dataset into a symmetric variant of our model. The developed symmetric GDML (sGDML) approach is able to faithfully reproduce global force fields at the accuracy high-level ab initio methods, thus enabling sample intensive tasks like molecular dynamics simulations at that level of accuracy.
△ Less
Submitted 13 December, 2019;
originally announced December 2019.
-
Exploring Chemical Compound Space with Quantum-Based Machine Learning
Authors:
O. Anatole von Lilienfeld,
Klaus-Robert Müller,
Alexandre Tkatchenko
Abstract:
Rational design of compounds with specific properties requires conceptual understanding and fast evaluation of molecular properties throughout chemical compound space (CCS) -- the huge set of all potentially stable molecules. Recent advances in combining quantum mechanical (QM) calculations with machine learning (ML) provide powerful tools for exploring wide swaths of CCS. We present our perspecti…
▽ More
Rational design of compounds with specific properties requires conceptual understanding and fast evaluation of molecular properties throughout chemical compound space (CCS) -- the huge set of all potentially stable molecules. Recent advances in combining quantum mechanical (QM) calculations with machine learning (ML) provide powerful tools for exploring wide swaths of CCS. We present our perspective on this exciting and quickly develo** field by discussing key advances in the development and applications of QM-based ML methods to diverse compounds and properties and outlining the challenges ahead. We argue that significant progress in the exploration and understanding of CCS can be made through a systematic combination of rigorous physical theories, comprehensive synthetic datasets of microscopic and macroscopic properties, and modern ML methods that account for physical and chemical knowledge.
△ Less
Submitted 28 November, 2019; v1 submitted 22 November, 2019;
originally announced November 2019.
-
Machine learning for molecular simulation
Authors:
Frank Noé,
Alexandre Tkatchenko,
Klaus-Robert Müller,
Cecilia Clementi
Abstract:
Machine learning (ML) is transforming all areas of science. The complex and time-consuming calculations in molecular simulations are particularly suitable for a machine learning revolution and have already been profoundly impacted by the application of existing ML methods. Here we review recent ML methods for molecular simulation, with particular focus on (deep) neural networks for the prediction…
▽ More
Machine learning (ML) is transforming all areas of science. The complex and time-consuming calculations in molecular simulations are particularly suitable for a machine learning revolution and have already been profoundly impacted by the application of existing ML methods. Here we review recent ML methods for molecular simulation, with particular focus on (deep) neural networks for the prediction of quantum-mechanical energies and forces, coarse-grained molecular dynamics, the extraction of free energy surfaces and kinetics and generative network approaches to sample molecular equilibrium structures and compute thermodynamics. To explain these methods and illustrate open methodological problems, we review some important principles of molecular physics and describe how they can be incorporated into machine learning structures. Finally, we identify and describe a list of open challenges for the interface between ML and molecular simulation.
△ Less
Submitted 7 November, 2019;
originally announced November 2019.
-
Density-functional model for van der Waals interactions: Unifying many-body atomic approaches with nonlocal functionals
Authors:
Jan Hermann,
Alexandre Tkatchenko
Abstract:
Noncovalent van der Waals (vdW) interactions are responsible for a wide range of phenomena in matter. Popular density-functional methods that treat vdW interactions use disparate physical models for these intricate forces, and as a result the applicability of these methods is often restricted to a subset of relevant molecules and materials. Aiming towards a general-purpose density functional model…
▽ More
Noncovalent van der Waals (vdW) interactions are responsible for a wide range of phenomena in matter. Popular density-functional methods that treat vdW interactions use disparate physical models for these intricate forces, and as a result the applicability of these methods is often restricted to a subset of relevant molecules and materials. Aiming towards a general-purpose density functional model of vdW interactions, here we unify two complementary approaches: nonlocal vdW functionals for polarization and interatomic methods for many-body interactions. The developed nonlocal many-body dispersion method (MBD-NL) increases the accuracy and efficiency of existing vdW functionals and is shown to be broadly applicable to molecules, soft and hard materials including ionic and metallic compounds, as well as organic/inorganic interfaces.
△ Less
Submitted 10 March, 2020; v1 submitted 7 October, 2019;
originally announced October 2019.
-
Construction of Machine Learned Force Fields with Quantum Chemical Accuracy: Applications and Chemical Insights
Authors:
Huziel E. Sauceda,
Stefan Chmiela,
Igor Poltavsky,
Klaus-Robert Müller,
Alexandre Tkatchenko
Abstract:
Highly accurate force fields are a mandatory requirement to generate predictive simulations. Here we present the path for the construction of machine learned molecular force fields by discussing the hierarchical pathway from generating the dataset of reference calculations to the construction of the machine learning model, and the validation of the physics generated by the model. We will use the s…
▽ More
Highly accurate force fields are a mandatory requirement to generate predictive simulations. Here we present the path for the construction of machine learned molecular force fields by discussing the hierarchical pathway from generating the dataset of reference calculations to the construction of the machine learning model, and the validation of the physics generated by the model. We will use the symmetrized gradient-domain machine learning (sGDML) framework due to its ability to reconstruct complex high-dimensional potential-energy surfaces (PES) with high precision even when using just a few hundreds of molecular conformations for training. The data efficiency of the sGDML model allows using reference atomic forces computed with high-level wavefunction-based approaches, such as the $gold$ $standard$ coupled cluster method with single, double, and perturbative triple excitations (CCSD(T)). We demonstrate that the flexible nature of the sGDML framework captures local and non-local electronic interactions (e.g. H-bonding, lone pairs, steric repulsion, changes in hybridization states (e.g. $sp^2 \rightleftharpoons sp^3$), $n\toπ^*$ interactions, and proton transfer) without imposing any restriction on the nature of interatomic potentials. The analysis of sGDML models trained for different molecular structures at different levels of theory (e.g. density functional theory and CCSD(T)) provides empirical evidence that a higher level of theory generates a smoother PES. Additionally, a careful analysis of molecular dynamics simulations yields new qualitative insights into dynamics and vibrational spectroscopy of small molecules close to spectroscopic accuracy.
△ Less
Submitted 18 September, 2019;
originally announced September 2019.
-
Quantum Mechanics of Proteins in Explicit Water: The Role of Plasmon-Like Solute-Solvent Interactions
Authors:
Martin Stöhr,
Alexandre Tkatchenko
Abstract:
Quantum-mechanical van der Waals dispersion interactions play an essential role for both intra-protein and protein-water interactions -- the two main driving forces for the structure and dynamics of proteins in aqueous solution. Typically, these interactions are only treated phenomenologically via pairwise potential terms in classical force fields. Here, we use an explicit quantum-mechanical appro…
▽ More
Quantum-mechanical van der Waals dispersion interactions play an essential role for both intra-protein and protein-water interactions -- the two main driving forces for the structure and dynamics of proteins in aqueous solution. Typically, these interactions are only treated phenomenologically via pairwise potential terms in classical force fields. Here, we use an explicit quantum-mechanical approach based on density-functional tight-binding with the many-body dispersion formalism, which allows us to demonstrate the unexpected relevance of the many-body character of dispersion interactions for protein energetics and the protein-water interaction. In contrast to commonly employed pairwise approaches, many-body effects significantly decrease the relative stability of the native state in the absence of water. In an aqueous environment, the collective character of the protein-water van der Waals interaction counteracts this effect and stabilizes native conformations and transition states. This stabilization arises due to a high degree of delocalization and collectivity of protein-water dispersion interactions, suggesting a remarkable persistence of long-range electron correlation through aqueous environments. Our findings are exemplified on prototypical showcases of proteins forming $β$-sheets, hairpins, and helices, emphasizing the crucial role of plasmon-like solute-solvent interactions in biomolecular systems.
△ Less
Submitted 21 October, 2019; v1 submitted 6 August, 2019;
originally announced August 2019.
-
Unifying machine learning and quantum chemistry -- a deep neural network for molecular wavefunctions
Authors:
K. T. Schütt,
M. Gastegger,
A. Tkatchenko,
K. -R. Müller,
R. J. Maurer
Abstract:
Machine learning advances chemistry and materials science by enabling large-scale exploration of chemical space based on quantum chemical calculations. While these models supply fast and accurate predictions of atomistic chemical properties, they do not explicitly capture the electronic degrees of freedom of a molecule, which limits their applicability for reactive chemistry and chemical analysis.…
▽ More
Machine learning advances chemistry and materials science by enabling large-scale exploration of chemical space based on quantum chemical calculations. While these models supply fast and accurate predictions of atomistic chemical properties, they do not explicitly capture the electronic degrees of freedom of a molecule, which limits their applicability for reactive chemistry and chemical analysis. Here we present a deep learning framework for the prediction of the quantum mechanical wavefunction in a local basis of atomic orbitals from which all other ground-state properties can be derived. This approach retains full access to the electronic structure via the wavefunction at force field-like efficiency and captures quantum mechanics in an analytically differentiable representation. On several examples, we demonstrate that this opens promising avenues to perform inverse design of molecular structures for target electronic property optimisation and a clear path towards increased synergy of machine learning and quantum chemistry.
△ Less
Submitted 24 June, 2019;
originally announced June 2019.
-
Molecular Force Fields with Gradient-Domain Machine Learning: Construction and Application to Dynamics of Small Molecules with Coupled Cluster Forces
Authors:
Huziel E. Sauceda,
Stefan Chmiela,
Igor Poltavsky,
Klaus-Robert Müller,
Alexandre Tkatchenko
Abstract:
We present the construction of molecular force fields for small molecules (less than 25 atoms) using the recently developed symmetrized gradient-domain machine learning (sGDML) approach [Chmiela et al., Nat. Commun. 9, 3887 (2018); Sci. Adv. 3, e1603015 (2017)]. This approach is able to accurately reconstruct complex high-dimensional potential-energy surfaces from just a few 100s of molecular conf…
▽ More
We present the construction of molecular force fields for small molecules (less than 25 atoms) using the recently developed symmetrized gradient-domain machine learning (sGDML) approach [Chmiela et al., Nat. Commun. 9, 3887 (2018); Sci. Adv. 3, e1603015 (2017)]. This approach is able to accurately reconstruct complex high-dimensional potential-energy surfaces from just a few 100s of molecular conformations extracted from ab initio molecular dynamics trajectories. The data efficiency of the sGDML approach implies that atomic forces for these conformations can be computed with high-level wavefunction-based approaches, such as the "gold standard" CCSD(T) method. We demonstrate that the flexible nature of the sGDML model recovers local and non-local electronic interactions (e.g. H-bonding, proton transfer, lone pairs, changes in hybridization states, steric repulsion and $n\toπ^*$ interactions) without imposing any restriction on the nature of interatomic potentials. The analysis of sGDML molecular dynamics trajectories yields new qualitative insights into dynamics and spectroscopy of small molecules close to spectroscopic accuracy.
△ Less
Submitted 31 January, 2019; v1 submitted 19 January, 2019;
originally announced January 2019.
-
sGDML: Constructing Accurate and Data Efficient Molecular Force Fields Using Machine Learning
Authors:
Stefan Chmiela,
Huziel E. Sauceda,
Igor Poltavsky,
Klaus-Robert Müller,
Alexandre Tkatchenko
Abstract:
We present an optimized implementation of the recently proposed symmetric gradient domain machine learning (sGDML) model. The sGDML model is able to faithfully reproduce global potential energy surfaces (PES) for molecules with a few dozen atoms from a limited number of user-provided reference molecular conformations and the associated atomic forces. Here, we introduce a Python software package to…
▽ More
We present an optimized implementation of the recently proposed symmetric gradient domain machine learning (sGDML) model. The sGDML model is able to faithfully reproduce global potential energy surfaces (PES) for molecules with a few dozen atoms from a limited number of user-provided reference molecular conformations and the associated atomic forces. Here, we introduce a Python software package to reconstruct and evaluate custom sGDML force fields (FFs), without requiring in-depth knowledge about the details of the model. A user-friendly command-line interface offers assistance through the complete process of model creation, in an effort to make this novel machine learning approach accessible to broad practitioners. Our paper serves as a documentation, but also includes a practical application example of how to reconstruct and use a PBE0+MBD FF for paracetamol. Finally, we show how to interface sGDML with the FF simulation engines ASE (Larsen et al., J. Phys. Condens. Matter 29, 273002 (2017)) and i-PI (Kapil et al., Comput. Phys. Commun. 236, 214-223 (2019)) to run numerical experiments, including structure optimization, classical and path integral molecular dynamics and nudged elastic band calculations.
△ Less
Submitted 2 March, 2019; v1 submitted 12 December, 2018;
originally announced December 2018.
-
Learning representations of molecules and materials with atomistic neural networks
Authors:
Kristof T. Schütt,
Alexandre Tkatchenko,
Klaus-Robert Müller
Abstract:
Deep Learning has been shown to learn efficient representations for structured data such as image, text or audio. In this chapter, we present neural network architectures that are able to learn efficient representations of molecules and materials. In particular, the continuous-filter convolutional network SchNet accurately predicts chemical properties across compositional and configurational space…
▽ More
Deep Learning has been shown to learn efficient representations for structured data such as image, text or audio. In this chapter, we present neural network architectures that are able to learn efficient representations of molecules and materials. In particular, the continuous-filter convolutional network SchNet accurately predicts chemical properties across compositional and configurational space on a variety of datasets. Beyond that, we analyze the obtained representations to find evidence that their spatial and chemical properties agree with chemical intuition.
△ Less
Submitted 11 December, 2018;
originally announced December 2018.
-
SchNetPack: A Deep Learning Toolbox For Atomistic Systems
Authors:
K. T. Schütt,
P. Kessel,
M. Gastegger,
K. Nicoli,
A. Tkatchenko,
K. -R. Müller
Abstract:
SchNetPack is a toolbox for the development and application of deep neural networks to the prediction of potential energy surfaces and other quantum-chemical properties of molecules and materials. It contains basic building blocks of atomistic neural networks, manages their training and provides simple access to common benchmark datasets. This allows for an easy implementation and evaluation of ne…
▽ More
SchNetPack is a toolbox for the development and application of deep neural networks to the prediction of potential energy surfaces and other quantum-chemical properties of molecules and materials. It contains basic building blocks of atomistic neural networks, manages their training and provides simple access to common benchmark datasets. This allows for an easy implementation and evaluation of new models. For now, SchNetPack includes implementations of (weighted) atomcentered symmetry functions and the deep tensor neural network SchNet as well as ready-to-use scripts that allow to train these models on molecule and material datasets. Based upon the PyTorch deep learning framework, SchNetPack allows to efficiently apply the neural networks to large datasets with millions of reference calculations as well as parallelize the model across multiple GPUs. Finally, SchNetPack provides an interface to the Atomic Simulation Environment in order to make trained models easily accessible to researchers that are not yet familiar with neural networks.
△ Less
Submitted 4 September, 2018;
originally announced September 2018.
-
i-PI 2.0: A Universal Force Engine for Advanced Molecular Simulations
Authors:
Venkat Kapil,
Mariana Rossi,
Ondrej Marsalek,
Riccardo Petraglia,
Yair Litman,
Thomas Spura,
Bingqing Cheng,
Alice Cuzzocrea,
Robert H. Meißner,
David M. Wilkins,
Przemyslaw Juda,
Sébastien P. Bienvenue,
Wei Fang,
Jan Kessler,
Igor Poltavsky,
Steven Vandenbrande,
Jelle Wieme,
Clemence Corminboeuf,
Thomas D. Kühne,
David E. Manolopoulos,
Thomas E. Markland,
Jeremy O. Richardson,
Alexandre Tkatchenko,
Gareth A. Tribello,
Veronique Van Speybroeck
, et al. (1 additional authors not shown)
Abstract:
Progress in the atomic-scale modelling of matter over the past decade has been tremendous. This progress has been brought about by improvements in methods for evaluating interatomic forces that work by either solving the electronic structure problem explicitly, or by computing accurate approximations of the solution and by the development of techniques that use the Born-Oppenheimer (BO) forces to…
▽ More
Progress in the atomic-scale modelling of matter over the past decade has been tremendous. This progress has been brought about by improvements in methods for evaluating interatomic forces that work by either solving the electronic structure problem explicitly, or by computing accurate approximations of the solution and by the development of techniques that use the Born-Oppenheimer (BO) forces to move the atoms on the BO potential energy surface. As a consequence of these developments it is now possible to identify stable or metastable states, to sample configurations consistent with the appropriate thermodynamic ensemble, and to estimate the kinetics of reactions and phase transitions. All too often, however, progress is slowed down by the bottleneck associated with implementing new optimization algorithms and/or sampling techniques into the many existing electronic-structure and empirical-potential codes. To address this problem, we are thus releasing a new version of the i-PI software. This piece of software is an easily extensible framework for implementing advanced atomistic simulation techniques using interatomic potentials and forces calculated by an external driver code. While the original version of the code was developed with a focus on path integral molecular dynamics techniques, this second release of i-PI not only includes several new advanced path integral methods, but also offers other classes of algorithms. In other words, i-PI is moving towards becoming a universal force engine that is both modular and tightly coupled to the driver codes that evaluate the potential energy surface and its derivatives.
△ Less
Submitted 17 September, 2018; v1 submitted 11 August, 2018;
originally announced August 2018.
-
Quantum-chemical insights from interpretable atomistic neural networks
Authors:
Kristof T. Schütt,
Michael Gastegger,
Alexandre Tkatchenko,
Klaus-Robert Müller
Abstract:
With the rise of deep neural networks for quantum chemistry applications, there is a pressing need for architectures that, beyond delivering accurate predictions of chemical properties, are readily interpretable by researchers. Here, we describe interpretation techniques for atomistic neural networks on the example of Behler-Parrinello networks as well as the end-to-end model SchNet. Both models o…
▽ More
With the rise of deep neural networks for quantum chemistry applications, there is a pressing need for architectures that, beyond delivering accurate predictions of chemical properties, are readily interpretable by researchers. Here, we describe interpretation techniques for atomistic neural networks on the example of Behler-Parrinello networks as well as the end-to-end model SchNet. Both models obtain predictions of chemical properties by aggregating atom-wise contributions. These latent variables can serve as local explanations of a prediction and are obtained during training without additional cost. Due to their correspondence to well-known chemical concepts such as atomic energies and partial charges, these atom-wise explanations enable insights not only about the model but more importantly about the underlying quantum-chemical regularities. We generalize from atomistic explanations to 3d space, thus obtaining spatially resolved visualizations which further improve interpretability. Finally, we analyze learned embeddings of chemical elements that exhibit a partial ordering that resembles the order of the periodic table. As the examined neural networks show excellent agreement with chemical knowledge, the presented techniques open up new venues for data-driven research in chemistry, physics and materials science.
△ Less
Submitted 27 June, 2018;
originally announced June 2018.
-
Quantum-Mechanical Relation between Atomic Dipole Polarizability and the van der Waals Radius
Authors:
Dmitry V. Fedorov,
Mainak Sadhukhan,
Martin Stöhr,
Alexandre Tkatchenko
Abstract:
The atomic dipole polarizability, $α$, and the van der Waals (vdW) radius, $R_{\rm vdW}$, are two key quantities to describe vdW interactions between atoms in molecules and materials. Until now, they have been determined independently and separately from each other. Here, we derive the quantum-mechanical relation $R_{\rm vdW} = const. \timesα^{1/7}$ which is markedly different from the common assu…
▽ More
The atomic dipole polarizability, $α$, and the van der Waals (vdW) radius, $R_{\rm vdW}$, are two key quantities to describe vdW interactions between atoms in molecules and materials. Until now, they have been determined independently and separately from each other. Here, we derive the quantum-mechanical relation $R_{\rm vdW} = const. \timesα^{1/7}$ which is markedly different from the common assumption $R_{\rm vdW} \propto α^{1/3}$ based on a classical picture of hard-sphere atoms. As shown for 72 chemical elements between hydrogen and uranium, the obtained formula can be used as a unified definition of the vdW radius solely in terms of the atomic polarizability. For vdW-bonded heteronuclear dimers consisting of atoms $A$ and $B$, the combination rule $α= (α_A + α_B)/2$ provides a remarkably accurate way to calculate their equilibrium interatomic distance. The revealed scaling law allows to reduce the empiricism and improve the accuracy of interatomic vdW potentials, at the same time suggesting the existence of a non-trivial relation between length and volume in quantum systems.
△ Less
Submitted 4 September, 2018; v1 submitted 30 March, 2018;
originally announced March 2018.
-
Reliable and Practical Computational Prediction of Molecular Crystal Polymorphs
Authors:
Johannes Hoja,
Hsin-Yu Ko,
Marcus A. Neumann,
Roberto Car,
Robert A. DiStasio Jr.,
Alexandre Tkatchenko
Abstract:
The ability to reliably predict the structures and stabilities of a molecular crystal and its polymorphs without any prior experimental information would be an invaluable tool for a number of fields, with specific and immediate applications in the design and formulation of pharmaceuticals. In this case, detailed knowledge of the polymorphic energy landscape for an active pharmaceutical ingredient…
▽ More
The ability to reliably predict the structures and stabilities of a molecular crystal and its polymorphs without any prior experimental information would be an invaluable tool for a number of fields, with specific and immediate applications in the design and formulation of pharmaceuticals. In this case, detailed knowledge of the polymorphic energy landscape for an active pharmaceutical ingredient yields profound insight regarding the existence and likelihood of late-appearing polymorphs. However, the computational prediction of the structures and stabilities of molecular crystal polymorphs is particularly challenging due to the high dimensionality of conformational and crystallographic space accompanied by the need for relative (free) energies to within $\approx$ 1 kJ/mol per molecule. In this work, we combine the most successful crystal structure sampling strategy with the most accurate energy ranking strategy of the latest blind test of organic crystal structure prediction (CSP), organized by the Cambridge Crystallographic Data Centre (CCDC). Our final energy ranking is based on first-principles density functional theory (DFT) calculations that include three key physical contributions: (i) a sophisticated treatment of Pauli exchange-repulsion and electron correlation effects with hybrid functionals, (ii) inclusion of many-body van der Waals dispersion interactions, and (iii) account of vibrational free energies. In doing so, this combined approach has an optimal success rate in producing the crystal structures corresponding to the five blind-test molecules. With this practical approach, we demonstrate the feasibility of obtaining reliable structures and stabilities for molecular crystals of pharmaceutical importance, paving the way towards an enhanced fundamental understanding of polymorphic energy landscapes and routine industrial application of molecular CSP methods.
△ Less
Submitted 20 March, 2018;
originally announced March 2018.
-
Fast and accurate quantum Monte Carlo for molecular crystals
Authors:
Andrea Zen,
Jan Gerit Brandenburg,
Jiří Klimeš,
Alexandre Tkatchenko,
Dario Alfè,
Angelos Michaelides
Abstract:
Computer simulation plays a central role in modern day materials science. The utility of a given computational approach depends largely on the balance it provides between accuracy and computational cost. Molecular crystals are a class of materials of great technological importance which are challenging for even the most sophisticated \emph{ab initio} electronic structure theories to accurately des…
▽ More
Computer simulation plays a central role in modern day materials science. The utility of a given computational approach depends largely on the balance it provides between accuracy and computational cost. Molecular crystals are a class of materials of great technological importance which are challenging for even the most sophisticated \emph{ab initio} electronic structure theories to accurately describe. This is partly because they are held together by a balance of weak intermolecular forces but also because the primitive cells of molecular crystals are often substantially larger than those of atomic solids. Here, we demonstrate that diffusion quantum Monte Carlo (DMC) delivers sub-chemical accuracy for a diverse set of molecular crystals at a surprisingly moderate computational cost. As such, we anticipate that DMC can play an important role in understanding and predicting the properties of a large number of molecular crystals, including those built from relatively large molecules which are far beyond reach of other high accuracy methods.
△ Less
Submitted 27 February, 2018;
originally announced February 2018.