-
Guest Editorial: Special Topic on Software for Atomistic Machine Learning
Authors:
Matthias Rupp,
Emine Küçükbenli,
Gábor Csányi
Abstract:
A survey of the contributions to the Journal of Chemical Physics' Special Topic on Software for Atomistic Machine Learning.
Submitted 28 June, 2024;
originally announced June 2024.
-
Laser driven melt pool resonances through dynamically oscillating energy inputs
Authors:
Marco Rupp,
Karen Schwarzkopf,
Markus Doering,
Shuichiro Hayashi,
Michael Schmidt,
Craig B. Arnold
Abstract:
Spatially selective melting of metal materials by laser irradiation allows for the precise welding as well as the 3D printing of complex metal parts. However, the simple scanning of a conventional Gaussian beam typically results in a melt track with randomly distributed surface features due to the complex and dynamic behavior of the melt pool. In this study, the implications of utilizing a dynamically oscillating energy input on driving melt track fluctuations are investigated. Specifically, the laser intensity and/or intensity distribution is sinusoidally modulated at different scan speeds, and the effect of modulation frequency on the resulting surface features of the melt track is examined. The formation of periodically oriented surface features indicates an evident frequency coupling between the melt pool and the modulation frequency. Moreover, such a frequency coupling becomes most prominent under a specific modulation frequency, suggesting resonant behavior. The insights provided in this study will enable the development of novel methods, allowing for the control and/or mitigation of inherent fluctuations in the melt pool through laser-driven resonances.
Submitted 10 April, 2024;
originally announced April 2024.
-
Modeling RIS from Electromagnetic Principles to Communication Systems--Part I: Synthesis and Characterization of a Scalable Anomalous Reflector
Authors:
Sravan K. R. Vuyyuru,
Le Hao,
Markus Rupp,
Sergei A. Tretyakov,
Risto Valkonen
Abstract:
This work aims to build connections between the electromagnetic and communication aspects of Reconfigurable Intelligent Surfaces (RIS) by proposing a methodology to combine outputs from electromagnetic RIS design into an RIS-tailored system-level simulator and a ray tracer. In this first part of the contribution, a periodic anomalous reflector is designed using an algebraic array antenna scattering synthesis technique that enables electromagnetically accurate modeling of scattering surfaces with both static and reconfigurable scattering characteristics. The multi-mode periodic structure, capable of scattering into several anomalous angles through manipulation of reactive loads, is then cropped into finite-sized arrays, and the quantization effects of the load reactances on the array scattering are analyzed. An experimental anomalous reflector is demonstrated with a comparison between simulated and measured scattering performance. In the second part, the simulated receiving and transmitting scattering patterns of the anomalous reflector are utilized to build an electromagnetically consistent path loss model of an RIS into a system-level simulator. Large-scale fading is analyzed in simple scenarios of RIS-assisted wireless networks to verify the communication model, and an indoor scenario measurement using the manufactured anomalous reflector sample supports the simulation analysis. After verifying the connections between electromagnetic and communication aspects through simulations and measurements, the proposed communication model can be used for a broad range of RIS designs to perform large-scale system-level and ray-tracing simulations in realistic scenarios.
Submitted 19 March, 2024;
originally announced March 2024.
-
Machine Learning, Density Functional Theory, and Experiments to Understand the Photocatalytic Reduction of CO$_2$ by CuPt/TiO$_2$
Authors:
Vaidish Sumaria,
Takat B. Rawal,
Young Feng Li,
David Sommer,
Jake Vikoren,
Robert J. Bondi,
Matthias Rupp,
Amrit Prasad,
Deeptanshu Prasad
Abstract:
The photoconversion of CO$_2$ to hydrocarbons is a sustainable route to its transformation into value-added compounds and, thereby, crucial to mitigating the energy and climate crises. CuPt nanoparticles on TiO$_2$ surfaces have been reported to show promising photoconversion efficiency. For further progress, a mechanistic understanding of the catalytic properties of these CuPt/TiO$_2$ systems is vital. Here, we employ ab initio calculations, machine learning, and photocatalysis experiments to explore their configurational space and examine their reactivity, and find that the interface plays a key role in stabilizing *CO$_2$, *CO, and other CH-containing intermediates, facilitating higher activity and selectivity for methane. A bias-corrected machine-learning interatomic potential trained on density functional theory data enables efficient exploration of the potential energy surfaces of numerous CO$_2$@CuPt/TiO$_2$ configurations via basin-hopping Monte Carlo simulations, greatly accelerating the study of these photocatalyst systems. Our simulations show that CO$_2$ preferentially adsorbs at the interface, with the C atom bonded to a Pt site and one O atom occupying an O-vacancy site. The interface also promotes the formation of *CH and *CH$_2$ intermediates. For confirmation, we synthesize CuPt/TiO$_2$ samples with a variety of compositions, analyze their morphologies and compositions using scanning electron microscopy and energy-dispersive X-ray spectroscopy, and measure their photocatalytic activity. Our computational and experimental findings qualitatively agree and highlight the importance of interface design for selective conversion of CO$_2$ to hydrocarbons.
Submitted 16 February, 2024; v1 submitted 13 February, 2024;
originally announced February 2024.
-
Heat flux for semi-local machine-learning potentials
Authors:
Marcel F. Langer,
Florian Knoop,
Christian Carbogno,
Matthias Scheffler,
Matthias Rupp
Abstract:
The Green-Kubo (GK) method is a rigorous framework for heat transport simulations in materials. However, it requires an accurate description of the potential-energy surface and carefully converged statistics. Machine-learning potentials can achieve the accuracy of first-principles simulations while reaching well beyond their simulation time and length scales at a fraction of the cost. In this paper, we explain how to apply the GK approach to the recent class of message-passing machine-learning potentials, which iteratively consider semi-local interactions beyond the initial interaction cutoff. We derive an adapted heat flux formulation that can be implemented using automatic differentiation without compromising computational efficiency. The approach is demonstrated and validated by calculating the thermal conductivity of zirconium dioxide across temperatures.
Submitted 28 March, 2023; v1 submitted 25 March, 2023;
originally announced March 2023.
-
Ultra-fast interpretable machine-learning potentials
Authors:
Stephen R. Xie,
Matthias Rupp,
Richard G. Hennig
Abstract:
All-atom dynamics simulations are an indispensable quantitative tool in physics, chemistry, and materials science, but large systems and long simulation times remain challenging due to the trade-off between computational efficiency and predictive accuracy. To address this challenge, we combine effective two- and three-body potentials in a cubic B-spline basis with regularized linear regression to obtain machine-learning potentials that are physically interpretable, sufficiently accurate for applications, as fast as the fastest traditional empirical potentials, and two to four orders of magnitude faster than state-of-the-art machine-learning potentials. For data from empirical potentials, we demonstrate exact retrieval of the potential. For data from density functional theory, the predicted energies, forces, and derived properties, including phonon spectra, elastic constants, and melting points, closely match those of the reference method. The introduced potentials might contribute towards accurate all-atom dynamics simulations of large atomistic systems over long time scales.
Submitted 25 May, 2023; v1 submitted 1 October, 2021;
originally announced October 2021.
-
Representations of molecules and materials for interpolation of quantum-mechanical simulations via machine learning
Authors:
Marcel F. Langer,
Alex Goeßmann,
Matthias Rupp
Abstract:
Computational study of molecules and materials from first principles is a cornerstone of physics, chemistry, and materials science, but limited by the cost of accurate and precise simulations. In settings involving many simulations, machine learning can reduce these costs, often by orders of magnitude, by interpolating between reference simulations. This requires representations that describe any molecule or material and support interpolation.
We comprehensively review and discuss current representations and relations between them, using a unified mathematical framework based on many-body functions, group averaging, and tensor products. For selected state-of-the-art representations, we compare energy predictions for organic molecules, binary alloys, and Al-Ga-In sesquioxides in numerical experiments controlled for data distribution, regression method, and hyper-parameter optimization.
Submitted 9 February, 2021; v1 submitted 26 March, 2020;
originally announced March 2020.
-
Chemical diversity in molecular orbital energy predictions with kernel ridge regression
Authors:
Annika Stuke,
Milica Todorović,
Matthias Rupp,
Christian Kunkel,
Kunal Ghosh,
Lauri Himanen,
Patrick Rinke
Abstract:
Instant machine learning predictions of molecular properties are desirable for materials design, but the predictive power of the methodology is mainly tested on well-known benchmark datasets. Here, we investigate the performance of machine learning with kernel ridge regression (KRR) for the prediction of molecular orbital energies on three large datasets: the standard QM9 small organic molecules set, amino acid and dipeptide conformers, and organic crystal-forming molecules extracted from the Cambridge Structural Database. We focus on prediction of highest occupied molecular orbital (HOMO) energies, computed at density-functional level of theory. Two different representations that encode molecular structure are compared: the Coulomb matrix (CM) and the many-body tensor representation (MBTR). We find that KRR performance depends significantly on the chemistry of the underlying dataset and that the MBTR is superior to the CM, predicting HOMO energies with a mean absolute error as low as 0.09 eV. To demonstrate the power of our machine learning method, we apply our model to structures of 10k previously unseen molecules. We gain instant energy predictions that allow us to identify interesting molecules for future applications.
Submitted 25 March, 2019; v1 submitted 20 December, 2018;
originally announced December 2018.
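The Coulomb matrix (CM) compared in the abstract above can be sketched in a few lines. This is a minimal illustration using the commonly cited definition (diagonal entries 0.5·Z^2.4, off-diagonal entries Z_i·Z_j divided by the interatomic distance); the function name, the atomic units, and the toy H2 geometry are assumptions made for illustration, not details taken from the paper.

```python
import numpy as np

def coulomb_matrix(charges, positions):
    """Return the Coulomb matrix for nuclear charges Z and Cartesian positions R."""
    z = np.asarray(charges, dtype=float)
    r = np.asarray(positions, dtype=float)
    # Pairwise distances |R_i - R_j| between all atoms.
    dist = np.linalg.norm(r[:, None, :] - r[None, :, :], axis=-1)
    np.fill_diagonal(dist, 1.0)  # placeholder to avoid dividing by zero on the diagonal
    m = np.outer(z, z) / dist    # off-diagonal: Z_i * Z_j / |R_i - R_j|
    np.fill_diagonal(m, 0.5 * z ** 2.4)  # diagonal: 0.5 * Z_i ** 2.4
    return m

# Hypothetical toy input: two hydrogen atoms 1.4 bohr apart.
h2 = coulomb_matrix([1, 1], [[0.0, 0.0, 0.0], [0.0, 0.0, 1.4]])
```

The matrix depends on atom ordering, so in practice it is usually sorted or otherwise made permutation-invariant before being passed to a kernel; that step is omitted here.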
-
Machine-learned multi-system surrogate models for materials prediction
Authors:
Chandramouli Nyshadham,
Matthias Rupp,
Brayden Bekker,
Alexander V. Shapeev,
Tim Mueller,
Conrad W. Rosenbrock,
Gábor Csányi,
David W. Wingate,
Gus L. W. Hart
Abstract:
Surrogate machine-learning models are transforming computational materials science by predicting properties of materials with the accuracy of ab initio methods at a fraction of the computational cost. We demonstrate surrogate models that simultaneously interpolate energies of different materials on a dataset of 10 binary alloys (AgCu, AlFe, AlMg, AlNi, AlTi, CoNi, CuFe, CuNi, FeV, NbNi) with 10 different species and all possible fcc, bcc and hcp structures up to 8 atoms in the unit cell, 15,950 structures in total. We find that the deviation of prediction errors when increasing the number of simultaneously modeled alloys is less than 1 meV/atom. Several state-of-the-art materials representations and learning algorithms were found to qualitatively agree on the prediction errors of formation enthalpy with relative errors of <2.5% for all systems.
Submitted 20 May, 2019; v1 submitted 24 September, 2018;
originally announced September 2018.
-
Guest Editorial: Special Topic on Data-enabled Theoretical Chemistry
Authors:
Matthias Rupp,
O. Anatole von Lilienfeld,
Kieron Burke
Abstract:
A survey of the contributions to the Special Topic on Data-enabled Theoretical Chemistry is given, including a glossary of relevant machine learning terms.
Submitted 3 July, 2018; v1 submitted 7 June, 2018;
originally announced June 2018.
-
Better than Rician: Modelling millimetre wave channels as Two-Wave with Diffuse Power
Authors:
Erich Zöchmann,
Sebastian Caban,
Christoph F. Mecklenbräuker,
Stefan Pratschner,
Martin Lerch,
Stefan Schwarz,
Markus Rupp
Abstract:
This contribution provides experimental evidence for the two-wave with diffuse power (TWDP) fading model. We have conducted two indoor millimetre wave measurement campaigns with directive horn antennas at both link ends. One horn antenna is mounted in a corner of our laboratory, while the other is steerable and scans azimuth and elevation. Our first measurement campaign is based on scalar network analysis with 7 GHz of bandwidth. Our second measurement campaign obtains magnitude and phase information, additionally sampled directionally at several positions in space. We apply Akaike's information criterion to decide whether Rician fading sufficiently explains the data or the generalized TWDP fading model is necessary. Our results indicate that the TWDP fading hypothesis is favoured over Rician fading in situations where the steerable antenna is pointing towards reflecting objects or is slightly misaligned at line-of-sight. We demonstrate TWDP fading in several different domains, namely, frequency, space, and time.
Submitted 10 April, 2018;
originally announced April 2018.
-
Angle-dependent reflectivity of twill-weave carbon fibre reinforced polymer for millimetre waves
Authors:
Gerald Artner,
Erich Zöchmann,
Stefan Pratschner,
Martin Lerch,
Markus Rupp,
C. F. Mecklenbräuker
Abstract:
Twill-weave carbon fibre reinforced polymer is measured as reflector material for electromagnetic waves at 60 GHz. The reflectivity at millimetre wavelength shows a predominant direction, which coincides with the diagonal pattern of the twill's top layer. In the remaining directions the investigated 2/2 twill-weave composite's reflectivity is almost isotropic.
Submitted 9 November, 2017;
originally announced November 2017.
-
Unified Representation of Molecules and Crystals for Machine Learning
Authors:
Haoyan Huo,
Matthias Rupp
Abstract:
Accurate simulations of atomistic systems from first principles are limited by computational cost. In high-throughput settings, machine learning can reduce these costs significantly by accurately interpolating between reference calculations. For this, kernel learning approaches crucially require a representation that accommodates arbitrary atomistic systems. We introduce a many-body tensor representation that is invariant to translations, rotations, and nuclear permutations of same elements, unique, differentiable, can represent molecules and crystals, and is fast to compute. Empirical evidence for competitive energy and force prediction errors is presented for changes in molecular structure, crystal chemistry, and molecular dynamics using kernel regression and symmetric gradient-domain machine learning as models. Applicability is demonstrated for phase diagrams of Pt-group/transition-metal binary systems.
Submitted 24 November, 2022; v1 submitted 21 April, 2017;
originally announced April 2017.
-
Machine Learning for Quantum Mechanical Properties of Atoms in Molecules
Authors:
Matthias Rupp,
Raghunathan Ramakrishnan,
O. Anatole von Lilienfeld
Abstract:
We introduce machine learning models of quantum mechanical observables of atoms in molecules. Instant out-of-sample predictions for proton and carbon nuclear chemical shifts, atomic core level excitations, and forces on atoms reach accuracies on par with density functional theory reference. Locality is exploited within non-linear regression via local atom-centered coordinate systems. The approach is validated on a diverse set of 9k small organic molecules. Linear scaling of computational cost in system size is demonstrated for saturated polymers with up to sub-mesoscale lengths.
Submitted 25 August, 2015; v1 submitted 2 May, 2015;
originally announced May 2015.
-
Big Data meets Quantum Chemistry Approximations: The $Δ$-Machine Learning Approach
Authors:
Raghunathan Ramakrishnan,
Pavlo O. Dral,
Matthias Rupp,
O. Anatole von Lilienfeld
Abstract:
Chemically accurate and comprehensive studies of the virtual space of all possible molecules are severely limited by the computational cost of quantum chemistry. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approximate legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron correlation energies are possible, for significantly larger molecular sets than used for training. For thermochemical properties of up to 16k constitutional isomers of C$_7$H$_{10}$O$_2$ we present numerical evidence that chemical accuracy can be reached. We also predict electron correlation energy in post Hartree-Fock methods, at the computational cost of Hartree-Fock, and we establish a qualitative relationship between molecular entropy and electron correlation. The transferability of our approach is demonstrated, using semi-empirical quantum chemistry and machine learning models trained on 1 and 10% of 134k organic molecules, to reproduce enthalpies of all remaining molecules at density functional theory level of accuracy.
Submitted 17 March, 2015;
originally announced March 2015.
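The composite strategy in the abstract above (a cheap baseline plus a learned correction) can be illustrated with a toy sketch. Everything concrete here is an assumption for illustration: the one-dimensional input, the two analytic stand-ins for the "cheap" and "accurate" methods, the Gaussian kernel, and the hyperparameter values. Kernel ridge regression is used as a generic regressor and is not claimed to be the paper's exact model.

```python
import numpy as np

def gaussian_kernel(a, b, length_scale):
    """Gaussian kernel matrix between 1D sample arrays a and b."""
    d2 = (a[:, None] - b[None, :]) ** 2
    return np.exp(-d2 / (2.0 * length_scale ** 2))

def fit_delta_model(x_train, y_cheap, y_accurate, length_scale=0.5, reg=1e-8):
    """Fit kernel ridge regression to the difference y_accurate - y_cheap."""
    k = gaussian_kernel(x_train, x_train, length_scale)
    alpha = np.linalg.solve(k + reg * np.eye(len(x_train)), y_accurate - y_cheap)

    def predict(x_new, y_cheap_new):
        # Prediction = cheap baseline + learned correction.
        return y_cheap_new + gaussian_kernel(x_new, x_train, length_scale) @ alpha

    return predict

# Hypothetical stand-ins for an inexpensive and an expensive method.
cheap = lambda x: np.sin(x)
accurate = lambda x: np.sin(x) + 0.3 * np.cos(2.0 * x)

x_train = np.linspace(0.0, 3.0, 20)
predict = fit_delta_model(x_train, cheap(x_train), accurate(x_train))

x_test = np.linspace(0.1, 2.9, 50)
delta_error = np.max(np.abs(predict(x_test, cheap(x_test)) - accurate(x_test)))
baseline_error = np.max(np.abs(cheap(x_test) - accurate(x_test)))
```

The point of the design is that the model only has to learn the difference between the two methods; the baseline still has to be evaluated for every new input.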
-
Understanding Kernel Ridge Regression: Common behaviors from simple functions to density functionals
Authors:
Kevin Vu,
John Snyder,
Li Li,
Matthias Rupp,
Brandon F. Chen,
Tarek Khelif,
Klaus-Robert Müller,
Kieron Burke
Abstract:
Accurate approximations to density functionals have recently been obtained via machine learning (ML). By applying ML to a simple function of one variable without any random sampling, we extract the qualitative dependence of errors on hyperparameters. We find universal features of the behavior in extreme limits, including both very small and very large length scales, and the noise-free limit. We show how such features arise in ML models of density functionals.
Submitted 28 January, 2015; v1 submitted 15 January, 2015;
originally announced January 2015.
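The kind of experiment described in the abstract above, kernel ridge regression on a simple function of one variable with a fixed grid instead of random samples, might look like the sketch below. The target function, grid sizes, Gaussian kernel, and hyperparameter values are all assumptions chosen for illustration and are not taken from the paper.

```python
import numpy as np

def krr_test_error(length_scale, reg=1e-10, n_train=10):
    """Mean absolute test error of Gaussian-kernel ridge regression on a 1D toy function."""
    x_train = np.linspace(0.0, 1.0, n_train)  # evenly spaced grid, no random sampling
    x_test = np.linspace(0.0, 1.0, 201)
    target = lambda x: np.cos(2.0 * np.pi * x)  # hypothetical target function

    def kernel(a, b):
        return np.exp(-(a[:, None] - b[None, :]) ** 2 / (2.0 * length_scale ** 2))

    # Solve (K + reg * I) alpha = y for the regression weights.
    alpha = np.linalg.solve(kernel(x_train, x_train) + reg * np.eye(n_train),
                            target(x_train))
    prediction = kernel(x_test, x_train) @ alpha
    return float(np.mean(np.abs(prediction - target(x_test))))

# Scan the length scale over several orders of magnitude.
errors = {s: krr_test_error(s) for s in (1e-3, 1e-1, 1e1)}
```

With a very small length scale the Gaussian kernel decays to zero between training points, so the model predicts values near zero away from the grid; scanning `length_scale` and `reg` in this way is one simple means of inspecting such limits.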
-
Understanding Machine-learned Density Functionals
Authors:
Li Li,
John C. Snyder,
Isabelle M. Pelaschier,
Jessica Huang,
Uma-Naresh Niranjan,
Paul Duncan,
Matthias Rupp,
Klaus-Robert Müller,
Kieron Burke
Abstract:
Kernel ridge regression is used to approximate the kinetic energy of non-interacting fermions in a one-dimensional box as a functional of their density. The properties of different kernels and methods of cross-validation are explored, and highly accurate energies are achieved. Accurate constrained optimal densities are found via a modified Euler-Lagrange constrained minimization of the total energy. A projected gradient descent algorithm is derived using local principal component analysis. Additionally, a sparse grid representation of the density can be used without degrading the performance of the methods. The implications for machine-learned density functional approximations are discussed.
Submitted 26 May, 2014; v1 submitted 4 April, 2014;
originally announced April 2014.
-
Fourier series of atomic radial distribution functions: A molecular fingerprint for machine learning models of quantum chemical properties
Authors:
O. Anatole von Lilienfeld,
Raghunathan Ramakrishnan,
Matthias Rupp,
Aaron Knoll
Abstract:
We introduce a fingerprint representation of molecules based on a Fourier series of atomic radial distribution functions. This fingerprint is unique (except for chirality), continuous, and differentiable with respect to atomic coordinates and nuclear charges. It is invariant with respect to translation, rotation, and nuclear permutation, and requires no pre-conceived knowledge about chemical bonding, topology, or electronic orbitals. As such it meets many important criteria for a good molecular representation, suggesting its usefulness for machine learning models of molecular properties trained across chemical compound space. To assess the performance of this new descriptor we have trained machine learning models of molecular enthalpies of atomization for training sets with up to 10k organic molecules, drawn at random from a published set of 134k organic molecules. We validate the descriptor on all remaining molecules of the 134k set. For a training set of 5k molecules the fingerprint descriptor achieves a mean absolute error of 8.0 kcal/mol. This is slightly worse than the performance attained using the Coulomb matrix, another popular alternative, reaching 6.2 kcal/mol for the same training and test sets.
Submitted 17 March, 2015; v1 submitted 10 July, 2013;
originally announced July 2013.
-
Orbital-free Bond Breaking via Machine Learning
Authors:
John C. Snyder,
Matthias Rupp,
Katja Hansen,
Leo Blooston,
Klaus-Robert Müller,
Kieron Burke
Abstract:
Machine learning is used to approximate the kinetic energy of one dimensional diatomics as a functional of the electron density. The functional can accurately dissociate a diatomic, and can be systematically improved with training. Highly accurate self-consistent densities and molecular forces are found, indicating the possibility for ab-initio molecular dynamics simulations.
Submitted 7 June, 2013;
originally announced June 2013.
-
Machine Learning of Molecular Electronic Properties in Chemical Compound Space
Authors:
Grégoire Montavon,
Matthias Rupp,
Vivekanand Gobre,
Alvaro Vazquez-Mayagoitia,
Katja Hansen,
Alexandre Tkatchenko,
Klaus-Robert Müller,
O. Anatole von Lilienfeld
Abstract:
The combination of modern scientific computing with electronic structure theory can lead to an unprecedented amount of data amenable to intelligent data analysis for the identification of meaningful, novel, and predictive structure-property relationships. Such relationships enable high-throughput screening for relevant properties in an exponentially growing pool of virtual compounds that are synthetically accessible. Here, we present a machine learning (ML) model, trained on a database of ab initio calculation results for thousands of organic molecules, that simultaneously predicts multiple electronic ground- and excited-state properties. The properties include atomization energy, polarizability, frontier orbital eigenvalues, ionization potential, electron affinity, and excitation energies. The ML model is based on a deep multi-task artificial neural network, exploiting underlying correlations between various molecular properties. The input is identical to ab initio methods, i.e., nuclear charges and Cartesian coordinates of all atoms. For small organic molecules the accuracy of such a "Quantum Machine" is similar, and sometimes superior, to modern quantum-chemical methods, at negligible computational cost.
Submitted 30 May, 2013;
originally announced May 2013.
-
Finding Density Functionals with Machine Learning
Authors:
John C. Snyder,
Matthias Rupp,
Katja Hansen,
Klaus-Robert Müller,
Kieron Burke
Abstract:
Machine learning is used to approximate density functionals. For the model problem of the kinetic energy of non-interacting fermions in 1d, mean absolute errors below 1 kcal/mol on test densities similar to the training set are reached with fewer than 100 training densities. A predictor identifies if a test density is within the interpolation region. Via principal component analysis, a projected functional derivative finds highly accurate self-consistent densities. Challenges for application of our method to real electronic structure problems are discussed.
Submitted 22 December, 2011;
originally announced December 2011.
-
Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning
Authors:
Matthias Rupp,
Alexandre Tkatchenko,
Klaus-Robert Müller,
O. Anatole von Lilienfeld
Abstract:
We introduce a machine learning model to predict atomization energies of a diverse set of organic molecules, based on nuclear charges and atomic positions only. The problem of solving the molecular Schrödinger equation is mapped onto a non-linear statistical regression problem of reduced complexity. Regression models are trained on and compared to atomization energies computed with hybrid density-functional theory. Cross-validation over more than seven thousand small organic molecules yields a mean absolute error of ~10 kcal/mol. Applicability is demonstrated for the prediction of molecular atomization potential energy curves.
Submitted 12 September, 2011;
originally announced September 2011.