Search | arXiv e-print repository

E3x: $\mathrm{E}(3)$-Equivariant Deep Learning Made Easy

Authors: Oliver T. Unke, Hartmut Maennel

Abstract: This work introduces E3x, a software package for building neural networks that are equivariant with respect to the Euclidean group $\mathrm{E}(3)$, consisting of translations, rotations, and reflections of three-dimensional space. Compared to ordinary neural networks, $\mathrm{E}(3)$-equivariant models promise benefits whenever input and/or output data are quantities associated with three-dimensio… ▽ More This work introduces E3x, a software package for building neural networks that are equivariant with respect to the Euclidean group $\mathrm{E}(3)$, consisting of translations, rotations, and reflections of three-dimensional space. Compared to ordinary neural networks, $\mathrm{E}(3)$-equivariant models promise benefits whenever input and/or output data are quantities associated with three-dimensional objects. This is because the numeric values of such quantities (e.g. positions) typically depend on the chosen coordinate system. Under transformations of the reference frame, the values change predictably, but the underlying rules can be difficult to learn for ordinary machine learning models. With built-in $\mathrm{E}(3)$-equivariance, neural networks are guaranteed to satisfy the relevant transformation rules exactly, resulting in superior data efficiency and accuracy. The code for E3x is available from https://github.com/google-research/e3x, detailed documentation and usage examples can be found on https://e3x.readthedocs.io. △ Less

Submitted 17 January, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

arXiv:2309.15126 [pdf, other]

From Peptides to Nanostructures: A Euclidean Transformer for Fast and Stable Machine Learned Force Fields

Authors: J. Thorben Frank, Oliver T. Unke, Klaus-Robert Müller, Stefan Chmiela

Abstract: Recent years have seen vast progress in the development of machine learned force fields (MLFFs) based on ab-initio reference calculations. Despite achieving low test errors, the reliability of MLFFs in molecular dynamics (MD) simulations is facing growing scrutiny due to concerns about instability over extended simulation timescales. Our findings suggest a potential connection between robustness t… ▽ More Recent years have seen vast progress in the development of machine learned force fields (MLFFs) based on ab-initio reference calculations. Despite achieving low test errors, the reliability of MLFFs in molecular dynamics (MD) simulations is facing growing scrutiny due to concerns about instability over extended simulation timescales. Our findings suggest a potential connection between robustness to cumulative inaccuracies and the use of equivariant representations in MLFFs, but the computational cost associated with these representations can limit this advantage in practice. To address this, we propose a transformer architecture called SO3krates that combines sparse equivariant representations (Euclidean variables) with a self-attention mechanism that separates invariant and equivariant information, eliminating the need for expensive tensor products. SO3krates achieves a unique combination of accuracy, stability, and speed that enables insightful analysis of quantum properties of matter on extended time and system size scales. To showcase this capability, we generate stable MD trajectories for flexible peptides and supra-molecular structures with hundreds of atoms. Furthermore, we investigate the PES topology for medium-sized chainlike molecules (e.g., small peptides) by exploring thousands of minima. Remarkably, SO3krates demonstrates the ability to strike a balance between the conflicting demands of stability and the emergence of new minimum-energy conformations beyond the training data, which is crucial for realistic exploration tasks in the field of biochemistry. △ Less

Submitted 16 February, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

arXiv:2209.14865 [pdf, other]

Accurate global machine learning force fields for molecules with hundreds of atoms

Authors: Stefan Chmiela, Valentin Vassilev-Galindo, Oliver T. Unke, Adil Kabylda, Huziel E. Sauceda, Alexandre Tkatchenko, Klaus-Robert Müller

Abstract: Global machine learning force fields (MLFFs), that have the capacity to capture collective many-atom interactions in molecular systems, currently only scale up to a few dozen atoms due a considerable growth of the model complexity with system size. For larger molecules, locality assumptions are typically introduced, with the consequence that non-local interactions are poorly or not at all describe… ▽ More Global machine learning force fields (MLFFs), that have the capacity to capture collective many-atom interactions in molecular systems, currently only scale up to a few dozen atoms due a considerable growth of the model complexity with system size. For larger molecules, locality assumptions are typically introduced, with the consequence that non-local interactions are poorly or not at all described, even if those interactions are contained within the reference ab initio data. Here, we approach this challenge and develop an exact iterative parameter-free approach to train global symmetric gradient domain machine learning (sGDML) force fields for systems with up to several hundred atoms, without resorting to any localization of atomic interactions or other potentially uncontrolled approximations. This means that all atomic degrees of freedom remain fully correlated in the global sGDML FF, allowing the accurate description of complex molecules and materials that present phenomena with far-reaching characteristic correlation lengths. We assess the accuracy and efficiency of our MLFFs on a newly developed MD22 benchmark dataset containing molecules from 42 to 370 atoms. The robustness of our approach is demonstrated in nanosecond long path-integral molecular dynamics simulations for the supramolecular complexes in the MD22 dataset. △ Less

Submitted 14 November, 2022; v1 submitted 29 September, 2022; originally announced September 2022.

arXiv:2205.08306 [pdf, other]

Accurate Machine Learned Quantum-Mechanical Force Fields for Biomolecular Simulations

Authors: Oliver T. Unke, Martin Stöhr, Stefan Ganscha, Thomas Unterthiner, Hartmut Maennel, Sergii Kashubin, Daniel Ahlin, Michael Gastegger, Leonardo Medrano Sandonas, Alexandre Tkatchenko, Klaus-Robert Müller

Abstract: Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes. Accurate MD simulations require computationally demanding quantum-mechanical calculations, being practically limited to short timescales and few atoms. For larger systems, efficient, but much less reliable empirical force fields are used. Recently, machine learned force fields (MLFFs) emerged as an… ▽ More Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes. Accurate MD simulations require computationally demanding quantum-mechanical calculations, being practically limited to short timescales and few atoms. For larger systems, efficient, but much less reliable empirical force fields are used. Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations, offering similar accuracy as ab initio methods at orders-of-magnitude speedup. Until now, MLFFs mainly capture short-range interactions in small molecules or periodic materials, due to the increased complexity of constructing models and obtaining reliable reference data for large molecules, where long-ranged many-body effects become important. This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations (GEMS) by training on "bottom-up" and "top-down" molecular fragments of varying size, from which the relevant physicochemical interactions can be learned. GEMS is applied to study the dynamics of alanine-based peptides and the 46-residue protein crambin in aqueous solution, allowing nanosecond-scale MD simulations of >25k atoms at essentially ab initio quality. Our findings suggest that structural motifs in peptides and proteins are more flexible than previously thought, indicating that simulations at ab initio accuracy might be necessary to understand dynamic biomolecular processes such as protein (mis)folding, drug-protein binding, or allosteric regulation. △ Less

Submitted 17 May, 2022; originally announced May 2022.

arXiv:2203.16205 [pdf, other]

Automatic Identification of Chemical Moieties

Authors: Jonas Lederer, Michael Gastegger, Kristof T. Schütt, Michael Kampffmeyer, Klaus-Robert Müller, Oliver T. Unke

Abstract: In recent years, the prediction of quantum mechanical observables with machine learning methods has become increasingly popular. Message-passing neural networks (MPNNs) solve this task by constructing atomic representations, from which the properties of interest are predicted. Here, we introduce a method to automatically identify chemical moieties (molecular building blocks) from such representati… ▽ More In recent years, the prediction of quantum mechanical observables with machine learning methods has become increasingly popular. Message-passing neural networks (MPNNs) solve this task by constructing atomic representations, from which the properties of interest are predicted. Here, we introduce a method to automatically identify chemical moieties (molecular building blocks) from such representations, enabling a variety of applications beyond property prediction, which otherwise rely on expert knowledge. The required representation can either be provided by a pretrained MPNN, or learned from scratch using only structural information. Beyond the data-driven design of molecular fingerprints, the versatility of our approach is demonstrated by enabling the selection of representative entries in chemical databases, the automatic construction of coarse-grained force fields, as well as the identification of reaction coordinates. △ Less

Submitted 27 April, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

arXiv:2106.02347 [pdf, other]

SE(3)-equivariant prediction of molecular wavefunctions and electronic densities

Authors: Oliver T. Unke, Mihail Bogojeski, Michael Gastegger, Mario Geiger, Tess Smidt, Klaus-Robert Müller

Abstract: Machine learning has enabled the prediction of quantum chemical properties with high accuracy and efficiency, allowing to bypass computationally costly ab initio calculations. Instead of training on a fixed set of properties, more recent approaches attempt to learn the electronic wavefunction (or density) as a central quantity of atomistic systems, from which all other observables can be derived.… ▽ More Machine learning has enabled the prediction of quantum chemical properties with high accuracy and efficiency, allowing to bypass computationally costly ab initio calculations. Instead of training on a fixed set of properties, more recent approaches attempt to learn the electronic wavefunction (or density) as a central quantity of atomistic systems, from which all other observables can be derived. This is complicated by the fact that wavefunctions transform non-trivially under molecular rotations, which makes them a challenging prediction target. To solve this issue, we introduce general SE(3)-equivariant operations and building blocks for constructing deep learning architectures for geometric point cloud data and apply them to reconstruct wavefunctions of atomistic systems with unprecedented accuracy. Our model achieves speedups of over three orders of magnitude compared to ab initio methods and reduces prediction errors by up to two orders of magnitude compared to the previous state-of-the-art. This accuracy makes it possible to derive properties such as energies and forces directly from the wavefunction in an end-to-end manner. We demonstrate the potential of our approach in a transfer learning application, where a model trained on low accuracy reference wavefunctions implicitly learns to correct for electronic many-body interactions from observables computed at a higher level of theory. Such machine-learned wavefunction surrogates pave the way towards novel semi-empirical methods, offering resolution at an electronic level while drastically decreasing computational cost. Additionally, the predicted wavefunctions can serve as initial guess in conventional ab initio methods, decreasing the number of iterations required to arrive at a converged solution, thus leading to significant speedups without any loss of accuracy or robustness. △ Less

Submitted 20 October, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

arXiv:2105.00304 [pdf, other]

doi 10.1038/s41467-021-27504-0

SpookyNet: Learning Force Fields with Electronic Degrees of Freedom and Nonlocal Effects

Authors: Oliver T. Unke, Stefan Chmiela, Michael Gastegger, Kristof T. Schütt, Huziel E. Sauceda, Klaus-Robert Müller

Abstract: Machine-learned force fields (ML-FFs) combine the accuracy of ab initio methods with the efficiency of conventional force fields. However, current ML-FFs typically ignore electronic degrees of freedom, such as the total charge or spin state, and assume chemical locality, which is problematic when molecules have inconsistent electronic states, or when nonlocal effects play a significant role. This… ▽ More Machine-learned force fields (ML-FFs) combine the accuracy of ab initio methods with the efficiency of conventional force fields. However, current ML-FFs typically ignore electronic degrees of freedom, such as the total charge or spin state, and assume chemical locality, which is problematic when molecules have inconsistent electronic states, or when nonlocal effects play a significant role. This work introduces SpookyNet, a deep neural network for constructing ML-FFs with explicit treatment of electronic degrees of freedom and quantum nonlocality. Chemically meaningful inductive biases and analytical corrections built into the network architecture allow it to properly model physical limits. SpookyNet improves upon the current state-of-the-art (or achieves similar performance) on popular quantum chemistry data sets. Notably, it is able to generalize across chemical and conformational space and can leverage the learned chemical insights, e.g. by predicting unknown spin states, thus hel** to close a further important remaining gap for today's machine learning models in quantum chemistry. △ Less

Submitted 20 July, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

arXiv:2104.06099 [pdf, other]

doi 10.1021/acs.jctc.1c00363

Impact of the characteristics of quantum chemical databases on machine learning predictions of tautomerization energies

Authors: Luis Itza Vazquez-Salazar, Eric Boittier, Oliver T. Unke, Markus Meuwly

Abstract: An essential aspect for adequate predictions of chemical properties by machine learning models is the database used for training them. However, studies that analyze how the content and structure of the databases used for training impact the prediction quality are scarce. In this work, we analyze and quantify the relationships learned by a machine learning model (Neural Network) trained on five dif… ▽ More An essential aspect for adequate predictions of chemical properties by machine learning models is the database used for training them. However, studies that analyze how the content and structure of the databases used for training impact the prediction quality are scarce. In this work, we analyze and quantify the relationships learned by a machine learning model (Neural Network) trained on five different reference databases (QM9, PC9, ANI-1E, ANI-1 and ANI-1x) to predict tautomerization energies from molecules in Tautobase. For this, characteristics such as the number of heavy atoms in a molecule, number of atoms of a given element, bond composition, or initial geometry on the quality of the predictions are considered. The results indicate that training on a chemically diverse database is crucial for obtaining good results but also that conformational sampling can partly compensate for limited coverage of chemical diversity. We explicitly demonstrate that when certain types of bonds need to be covered in the target database (Tautobase) but are undersampled in the reference databases the resulting predictions are poor. A quantitative measure for these deficiencies is the Kullback-Leibler divergence between reference and target distributions. Analysis of the results with a TreeMAP algorithm provides deeper understanding of specific deficiencies in the reference data sets. Capitalizing on this information can be used to either improve existing databases or to generate new databases of sufficient diversity for a range of ML applications in chemistry. △ Less

Submitted 13 April, 2021; originally announced April 2021.

Comments: 42 pages, 10 figures of main manuscript plus 21 pages, 12 figures of supporting information

Journal ref: J. Chem. Theory Comput. 2021, 17, 8, 4769-4785

arXiv:2102.03150 [pdf, other]

Equivariant message passing for the prediction of tensorial properties and molecular spectra

Authors: Kristof T. Schütt, Oliver T. Unke, Michael Gastegger

Abstract: Message passing neural networks have become a method of choice for learning on graphs, in particular the prediction of chemical properties and the acceleration of molecular dynamics studies. While they readily scale to large training data sets, previous approaches have proven to be less data efficient than kernel methods. We identify limitations of invariant representations as a major reason and e… ▽ More Message passing neural networks have become a method of choice for learning on graphs, in particular the prediction of chemical properties and the acceleration of molecular dynamics studies. While they readily scale to large training data sets, previous approaches have proven to be less data efficient than kernel methods. We identify limitations of invariant representations as a major reason and extend the message passing formulation to rotationally equivariant representations. On this basis, we propose the polarizable atom interaction neural network (PaiNN) and improve on common molecule benchmarks over previous networks, while reducing model size and inference time. We leverage the equivariant atomwise representations obtained by PaiNN for the prediction of tensorial properties. Finally, we apply this to the simulation of molecular spectra, achieving speedups of 4-5 orders of magnitude compared to the electronic structure reference. △ Less

Submitted 7 June, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

Comments: Accepted at ICML 2021

arXiv:2010.07067 [pdf, other]

doi 10.1021/acs.chemrev.0c01111

Machine Learning Force Fields

Authors: Oliver T. Unke, Stefan Chmiela, Huziel E. Sauceda, Michael Gastegger, Igor Poltavsky, Kristof T. Schütt, Alexandre Tkatchenko, Klaus-Robert Müller

Abstract: In recent years, the use of Machine Learning (ML) in computational chemistry has enabled numerous advances previously out of reach due to the computational complexity of traditional electronic-structure methods. One of the most promising applications is the construction of ML-based force fields (FFs), with the aim to narrow the gap between the accuracy of ab initio methods and the efficiency of cl… ▽ More In recent years, the use of Machine Learning (ML) in computational chemistry has enabled numerous advances previously out of reach due to the computational complexity of traditional electronic-structure methods. One of the most promising applications is the construction of ML-based force fields (FFs), with the aim to narrow the gap between the accuracy of ab initio methods and the efficiency of classical FFs. The key idea is to learn the statistical relation between chemical structure and potential energy without relying on a preconceived notion of fixed chemical bonds or knowledge about the relevant interactions. Such universal ML approximations are in principle only limited by the quality and quantity of the reference data used to train them. This review gives an overview of applications of ML-FFs and the chemical insights that can be obtained from them. The core concepts underlying ML-FFs are described in detail and a step-by-step guide for constructing and testing them from scratch is given. The text concludes with a discussion of the challenges that remain to be overcome by the next generation of ML-FFs. △ Less

Submitted 12 January, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

Journal ref: Chem. Rev. 2021, 121, 16, 10142-10186

arXiv:2003.08171 [pdf, other]

doi 10.1063/5.0008223

Isomerization and Decomposition Reactions of Acetaldehyde Relevant to Atmospheric Processes from Dynamics Simulations on Neural Network-Based Potential Energy Surfaces

Authors: Silvan Käser, Oliver T. Unke, Markus Meuwly

Abstract: Acetaldehyde (AA) isomerization (to vinylalcohol, VA) and decomposition (into either CO+CH$_4$ and H$_2$+H$_2$CCO) is studied using a fully dimensional, reactive potential energy surface represented as a neural network (NN). The NN, trained on 432'399 reference structures from MP2/aug-cc-pVTZ calculations has a MAE of 0.0453 kcal/mol and an RMSE of 1.186 kcal/mol for a test set of 27'399 structure… ▽ More Acetaldehyde (AA) isomerization (to vinylalcohol, VA) and decomposition (into either CO+CH$_4$ and H$_2$+H$_2$CCO) is studied using a fully dimensional, reactive potential energy surface represented as a neural network (NN). The NN, trained on 432'399 reference structures from MP2/aug-cc-pVTZ calculations has a MAE of 0.0453 kcal/mol and an RMSE of 1.186 kcal/mol for a test set of 27'399 structures. For the isomerization process AA $\rightarrow$ VA the minimum dynamical path implies that the C-H vibration, and the C-C-H (with H being the transferring H-atom) and the C-C-O angles are involved to surmount the 68.2 kcal/mol barrier. Using an excess energy of 93.6 kcal/mol - the energy available in the solar spectrum and sufficient to excite to the first electronically excited state - to initialize the molecular dynamics, no isomerization to VA is observed on the 500 ns time scale. Only with excess energies of $\sim$ 127.6 kcal/mol (including the zero point energy of the AA molecule), isomerization occurs on the nanosecond time scale. Given that collisional de-excitation at atmospheric conditions in the stratosphere occurs on the 100 ns time scale, it is concluded that formation of VA following photoexcitation of AA from actinic photons is unlikely. This also limits the relevance of this reaction pathway to be a source for formic acid. △ Less

Submitted 18 March, 2020; originally announced March 2020.

arXiv:2002.02151 [pdf, other]

doi 10.1039/D0CP00668H

Thermal Activation of Methane by MgO$^+$: Temperature Dependent Kinetics, Reactive Molecular Dynamics Simulations and Statistical Modeling

Authors: Brendan C. Sweeny, Hanqing Pan, Asmaa Kassem, Jordan C Sawyer, Shaun G. Ard, Nicholas S. Shuman, Albert A. Viggiano, Sebastian Brickel, Oliver T. Unke, Meenu Upadhyay, Markus Meuwly

Abstract: The kinetics of MgO$^+$ + CH$_4$ was studied experimentally using the variable ion source, temperature adjustable selected ion flow tube (VISTA-SIFT) apparatus from 300 $-$ 600 K and computationally by running and analyzing reactive atomistic simulations. Rates and product branching fractions were determined as a function of temperature. The reaction proceeded with a rate of… ▽ More The kinetics of MgO$^+$ + CH$_4$ was studied experimentally using the variable ion source, temperature adjustable selected ion flow tube (VISTA-SIFT) apparatus from 300 $-$ 600 K and computationally by running and analyzing reactive atomistic simulations. Rates and product branching fractions were determined as a function of temperature. The reaction proceeded with a rate of $k = 5.9 \pm 1.5 10^{-10}(T/300 $ K$)^{-0.5 \pm 0.2}$ cm$^3$ s$^{-1}$. MgOH$^+$ was the dominant product at all temperatures, but Mg$^+$, the co-product of oxygen-atom transfer to form methanol, was observed with a product branching fraction of $0.08 \pm 0.03 (T / 300 $ K$)^{-0.8 \pm 0.7}$. Reactive molecular dynamics simulations using a reactive force field, as well as a neural network yield rate coefficients about one order of magnitude lower. This underestimation of the rates is traced back to the multireference character of the transition state [MgOCH$_4$]$^+$. Statistical modeling of the temperature-dependent kinetics provides further insight into the reactive potential surface. The rate limiting step was found to be consistent with a four-centered activation of the C-H bond, consistent with previous calculations. The product branching was modeled as a competition between dissociation of an insertion intermediate directly after the rate-limiting transition state, and traversing a transition state corresponding to a methyl migration leading to a Mg-CH$_3$OH$^+$ complex, though only if this transition state is stabilized significantly relative to the dissociated MgOH$^+$ + CH$_3$ product channel. An alternative non-statistical mechanism is discussed, whereby a post-transition state bifurcation in the potential surface could allow the reaction to proceed directly from the four-centered TS to the Mg-CH$_3$OH$^+$ complex thereby allowing a more robust competition between the product channels. △ Less

Submitted 6 February, 2020; originally announced February 2020.

arXiv:1911.09475 [pdf, other]

doi 10.1088/1367-2630/ab81b5

Reactive Dynamics and Spectroscopy of Hydrogen Transfer from Neural Network-Based Reactive Potential Energy Surfaces

Authors: Silvan Käser, Oliver T. Unke, Markus Meuwly

Abstract: The in silico exploration of chemical, physical and biological systems requires accurate and efficient energy functions to follow their nuclear dynamics at a molecular and atomistic level. Recently, machine learning tools gained a lot of attention in the field of molecular sciences and simulations and are increasingly used to investigate the dynamics of such systems. Among the various approaches,… ▽ More The in silico exploration of chemical, physical and biological systems requires accurate and efficient energy functions to follow their nuclear dynamics at a molecular and atomistic level. Recently, machine learning tools gained a lot of attention in the field of molecular sciences and simulations and are increasingly used to investigate the dynamics of such systems. Among the various approaches, artificial neural networks (NNs) are one promising tool to learn a representation of potential energy surfaces. This is done by formulating the problem as a map** from a set of atomic positions $\mathbf{x}$ and nuclear charges $Z_i$ to a potential energy $V(\mathbf{x})$. Here, a fully-dimensional, reactive neural network representation for malonaldehyde (MA), acetoacetaldehyde (AAA) and acetylacetone (AcAc) is learned. It is used to run finite-temperature molecular dynamics simulations, and to determine the infrared spectra and the hydrogen transfer rates for the three molecules. The finite-temperature infrared spectrum for MA based on the NN learned on MP2 reference data provides a realistic representation of the low-frequency modes and the H-transfer band whereas the CH vibrations are somewhat too high in frequency. For AAA it is demonstrated that the IR spectroscopy is sensitive to the position of the transferring hydrogen at either the OCH- or OCCH$_3$ end of the molecule. For the hydrogen transfer rates it is demonstrated that the O-O vibration is a gating mode and largely determines the rate at which the hydrogen is transferred between the donor and acceptor. Finally, possibilities to further improve such NN-based potential energy surfaces are explored. They include the transferability of an NN-learned energy function across chemical species (here methylation) and transfer learning from a lower level of reference data (MP2) to a higher level of theory (pair natural orbital-LCCSD(T)). △ Less

Submitted 21 November, 2019; originally announced November 2019.

arXiv:1910.03044 [pdf, other]

doi 10.1088/2632-2153/ab5922

High-Dimensional Potential Energy Surfaces for Molecular Simulations

Authors: Oliver T. Unke, Debasish Koner, Sarbani Patra, Silvan Käser, Markus Meuwly

Abstract: An overview of computational methods to describe high-dimensional potential energy surfaces suitable for atomistic simulations is given. Particular emphasis is put on accuracy, computability, transferability and extensibility of the methods discussed. They include empirical force fields, representations based on reproducing kernels, using permutationally invariant polynomials, and neural network-l… ▽ More An overview of computational methods to describe high-dimensional potential energy surfaces suitable for atomistic simulations is given. Particular emphasis is put on accuracy, computability, transferability and extensibility of the methods discussed. They include empirical force fields, representations based on reproducing kernels, using permutationally invariant polynomials, and neural network-learned representations and combinations thereof. Future directions and potential improvements are discussed primarily from a practical, application-oriented perspective. △ Less

Submitted 7 October, 2019; originally announced October 2019.

Journal ref: Mach. Learn.: Sci. Technol. 1 (2020) 013001

arXiv:1909.08027 [pdf, other]

Machine Learning Potential Energy Surfaces

Authors: Oliver T. Unke, Markus Meuwly

Abstract: Machine Learning techniques can be used to represent high-dimensional potential energy surfaces for reactive chemical systems. Two such methods are based on a reproducing kernel Hilbert space representation or on deep neural networks. They can achieve a sub-1 kcal/mol accuracy with respect to reference data and can be used in studies of chemical dynamics. Their construction and a few typical examp… ▽ More Machine Learning techniques can be used to represent high-dimensional potential energy surfaces for reactive chemical systems. Two such methods are based on a reproducing kernel Hilbert space representation or on deep neural networks. They can achieve a sub-1 kcal/mol accuracy with respect to reference data and can be used in studies of chemical dynamics. Their construction and a few typical examples are briefly summarized in the present contribution. △ Less

Submitted 17 September, 2019; originally announced September 2019.

arXiv:1906.07455 [pdf, ps, other]

doi 10.1063/1.5114981

Reactive Atomistic Simulations of Diels-Alder Reactions: the Importance of Molecular Rotations

Authors: Uxía Rivero, Oliver T. Unke, Markus Meuwly, Stefan Willitsch

Abstract: The Diels-Alder reaction between 2,3-dibromo-1,3-butadiene and maleic anhydride has been studied by means of multisurface adiabatic reactive molecular dynamics and the PhysNet neural network architecture. This system is used as a prototype to explore the concertedness, synchronicity and possible ways of promotion of Diels-Alder reactions. Analysis of the minimum dynamic path indicates that rotatio… ▽ More The Diels-Alder reaction between 2,3-dibromo-1,3-butadiene and maleic anhydride has been studied by means of multisurface adiabatic reactive molecular dynamics and the PhysNet neural network architecture. This system is used as a prototype to explore the concertedness, synchronicity and possible ways of promotion of Diels-Alder reactions. Analysis of the minimum dynamic path indicates that rotational energy is crucial ($\sim$ 65 %) to drive the system towards the transition state in addition to collision energy ($\sim$ 20 %). Comparison with the reaction of butadiene and maleic anhydride shows that the presence of bromine substituents in the diene accentuates the importance of rotational excitation to promote the reaction. At the high total energies at which reactive events are recorded, the reaction is found to be direct and mostly synchronous. △ Less

Submitted 18 June, 2019; originally announced June 2019.

Journal ref: J. Chem. Phys. 151, 104301 (2019)

arXiv:1902.08408 [pdf, other]

doi 10.1021/acs.jctc.9b00181

PhysNet: A Neural Network for Predicting Energies, Forces, Dipole Moments and Partial Charges

Authors: Oliver T. Unke, Markus Meuwly

Abstract: In recent years, machine learning (ML) methods have become increasingly popular in computational chemistry. After being trained on appropriate ab initio reference data, these methods allow to accurately predict the properties of chemical systems, circumventing the need for explicitly solving the electronic Schrödinger equation. Because of their computational efficiency and scalability to large dat… ▽ More In recent years, machine learning (ML) methods have become increasingly popular in computational chemistry. After being trained on appropriate ab initio reference data, these methods allow to accurately predict the properties of chemical systems, circumventing the need for explicitly solving the electronic Schrödinger equation. Because of their computational efficiency and scalability to large datasets, deep neural networks (DNNs) are a particularly promising ML algorithm for chemical applications. This work introduces PhysNet, a DNN architecture designed for predicting energies, forces and dipole moments of chemical systems. PhysNet achieves state-of-the-art performance on the QM9, MD17 and ISO17 benchmarks. Further, two new datasets are generated in order to probe the performance of ML models for describing chemical reactions, long-range interactions, and condensed phase systems. It is shown that explicitly including electrostatics in energy predictions is crucial for a qualitatively correct description of the asymptotic regions of a potential energy surface (PES). PhysNet models trained on a systematically constructed set of small peptide fragments (at most eight heavy atoms) are able to generalize to considerably larger proteins like deca-alanine (Ala$_{10}$): The optimized geometry of helical Ala$_{10}$ predicted by PhysNet is virtually identical to ab initio results (RMSD = 0.21 Å). By running unbiased molecular dynamics (MD) simulations of Ala$_{10}$ on the PhysNet-PES in gas phase, it is found that instead of a helical structure, Ala$_{10}$ folds into a wreath-shaped configuration, which is more stable than the helical form by 0.46 kcal mol$^{-1}$ according to the reference ab initio calculations. △ Less

Submitted 28 March, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

Comments: 23 pages, 9 figures, 7 tables

Journal ref: J. Chem. Theory Comput. 2019, 15, 6, 3678-3693

Showing 1–17 of 17 results for author: Unke, O T