-
Comment on "Manifolds of quasi-constant SOAP and ACSF fingerprints and the resulting failure to machine learn four body interactions"
Authors:
Sergey N. Pozdnyakov,
Michael J. Willatt,
Albert P. Bartók,
Christoph Ortner,
Gábor Csányi,
Michele Ceriotti
Abstract:
The "quasi-constant" SOAP and ACSF fingerprint manifolds recently discovered by Parsaeifard and Goedecker are a direct consequence of the presence of degenerate pairs of configurations, a known shortcoming of all low-body-order atom-density correlation representations of molecular structures. Contrary to the configurations that are rigorously singular -- that we demonstrate can only occur in finit…
▽ More
The "quasi-constant" SOAP and ACSF fingerprint manifolds recently discovered by Parsaeifard and Goedecker are a direct consequence of the presence of degenerate pairs of configurations, a known shortcoming of all low-body-order atom-density correlation representations of molecular structures. Contrary to the configurations that are rigorously singular -- that we demonstrate can only occur in finite, discrete sets -- the continuous "quasi-constant" manifolds exhibit low, but non-zero, sensitivity to atomic displacements. Thus, it is possible to build interpolative machine-learning models of high-order interactions along the manifold, even though the numerical instabilities associated with proximity to the exact singularities affect the accuracy and transferability of such models, to an extent that depends on numerical details of the implementation.
△ Less
Submitted 2 May, 2022;
originally announced May 2022.
-
Equivariant representations for molecular Hamiltonians and N-center atomic-scale properties
Authors:
Jigyasa Nigam,
Michael Willatt,
Michele Ceriotti
Abstract:
Symmetry considerations are at the core of the major frameworks used to provide an effective mathematical representation of atomic configurations that is then used in machine-learning models to predict the properties associated with each structure. In most cases, the models rely on a description of atom-centered environments, and are suitable to learn atomic properties, or global observables that…
▽ More
Symmetry considerations are at the core of the major frameworks used to provide an effective mathematical representation of atomic configurations that is then used in machine-learning models to predict the properties associated with each structure. In most cases, the models rely on a description of atom-centered environments, and are suitable to learn atomic properties, or global observables that can be decomposed into atomic contributions. Many quantities that are relevant for quantum mechanical calculations, however -- most notably the single-particle Hamiltonian matrix when written in an atomic-orbital basis -- are not associated with a single center, but with two (or more) atoms in the structure. We discuss a family of structural descriptors that generalize the very successful atom-centered density correlation features to the N-centers case, and show in particular how this construction can be applied to efficiently learn the matrix elements of the (effective) single-particle Hamiltonian written in an atom-centered orbital basis. These N-centers features are fully equivariant -- not only in terms of translations and rotations, but also in terms of permutations of the indices associated with the atoms -- and are suitable to construct symmetry-adapted machine-learning models of new classes of properties of molecules and materials.
△ Less
Submitted 20 December, 2021; v1 submitted 24 September, 2021;
originally announced September 2021.
-
Efficient implementation of atom-density representations
Authors:
Félix Musil,
Max Veit,
Alexander Goscinski,
Guillaume Fraux,
Michael J. Willatt,
Markus Stricker,
Till Junge,
Michele Ceriotti
Abstract:
Physically-motivated and mathematically robust atom-centred representations of molecular structures are key to the success of modern atomistic machine learning (ML) methods. They lie at the foundation of a wide range of methods to predict the properties of both materials and molecules as well as to explore and visualize the chemical compound and configuration space. Recently, it has become clear t…
▽ More
Physically-motivated and mathematically robust atom-centred representations of molecular structures are key to the success of modern atomistic machine learning (ML) methods. They lie at the foundation of a wide range of methods to predict the properties of both materials and molecules as well as to explore and visualize the chemical compound and configuration space. Recently, it has become clear that many of the most effective representations share a fundamental formal connection: that they can all be expressed as a discretization of N-body correlation functions of the local atom density, suggesting the opportunity of standardizing and, more importantly, optimizing the calculation of such representations. We present an implementation, named librascal, whose modular design lends itself both to develo** refinements to the density-based formalism and to rapid prototy** for new developments of rotationally equivariant atomistic representations. As an example, we discuss SOAP features, perhaps the most widely used member of this family of representations, to show how the expansion of the local density can be optimized for any choice of radial basis set. We discuss the representation in the context of a kernel ridge regression model, commonly used with SOAP features, and analyze how the computational effort scales for each of the individual steps of the calculation. By applying data reduction techniques in feature space, we show how to further reduce the total computational cost by at up to a factor of 4 or 5 without affecting the model's symmetry properties and without significantly impacting its accuracy.
△ Less
Submitted 21 January, 2021;
originally announced January 2021.
-
On the Completeness of Atomic Structure Representations
Authors:
Sergey N. Pozdnyakov,
Michael J. Willatt,
Albert P. Bartók,
Christoph Ortner,
Gábor Csányi,
Michele Ceriotti
Abstract:
Many-body descriptors are widely used to represent atomic environments in the construction of machine learned interatomic potentials and more broadly for fitting, classification and embedding tasks on atomic structures. It was generally believed that 3-body descriptors uniquely specify the environment of an atom, up to a rotation and permutation of like atoms. We produce several counterexamples to…
▽ More
Many-body descriptors are widely used to represent atomic environments in the construction of machine learned interatomic potentials and more broadly for fitting, classification and embedding tasks on atomic structures. It was generally believed that 3-body descriptors uniquely specify the environment of an atom, up to a rotation and permutation of like atoms. We produce several counterexamples to this belief, with the consequence that any classifier, regression or embedding model for atom-centred properties that uses 3 (or 4)-body features will incorrectly give identical results for different configurations. Writing global properties (such as total energies) as a sum of many atom-centred contributions mitigates, but does not eliminate, the impact of this fundamental deficiency -- explaining the success of current "machine-learning" force fields. We anticipate the issues that will arise as the desired accuracy increases, and suggest potential solutions.
△ Less
Submitted 5 June, 2020; v1 submitted 31 January, 2020;
originally announced January 2020.
-
Path-integral dynamics of water using curvilinear centroids
Authors:
George Trenins,
Michael J. Willatt,
Stuart C. Althorpe
Abstract:
We develop a path-integral dynamics method for water that resembles centroid molecular dynamics (CMD), except that the centroids are averages of curvilinear, rather than cartesian, bead coordinates. The curvilinear coordinates are used explicitly only when computing the potential of mean force, the components of which are re-expressed in terms of cartesian 'quasi-centroids' (so-called because they…
▽ More
We develop a path-integral dynamics method for water that resembles centroid molecular dynamics (CMD), except that the centroids are averages of curvilinear, rather than cartesian, bead coordinates. The curvilinear coordinates are used explicitly only when computing the potential of mean force, the components of which are re-expressed in terms of cartesian 'quasi-centroids' (so-called because they are close to the cartesian centroids). Cartesian equations of motion are obtained by making small approximations to the quantum Boltzmann distribution. Simulations of the infrared spectra of various water models over 150-600 K show these approximations to be justified: for a two-dimensional OH-bond model, the quasi-centroid molecular dynamics (QCMD) spectra lie close to the exact quantum spectra, and almost on top of the Matsubara dynamics spectra; for gas-phase water, the QCMD spectra are close to the exact quantum spectra; for liquid water and ice (using the q-TIP4P/F surface), the QCMD spectra are close to the CMD spectra at 600 K, and line up with the results of thermostatted ring-polymer molecular dynamics and approximate quantum calculations at 300 and 150 K. The QCMD spectra show no sign of the CMD 'curvature problem' (of erroneous red shifts and broadening). In the liquid and ice simulations, the potential of mean force was evaluated on the fly by generalising an adiabatic CMD algorithm to curvilinear coordinates; the full limit of adiabatic separation needed to be taken, which made the QCMD calculations 8 times more expensive than partially adiabatic CMD at 300 K, and 32 times at 150 K (and the intensities may still not be converged at this temperature). The QCMD method is probably generalisable to many other systems, provided collective bead-coordinates can be identified that yield compact mean-field ring-polymer distributions.
△ Less
Submitted 6 July, 2019;
originally announced July 2019.
-
Atomic-scale representation and statistical learning of tensorial properties
Authors:
Andrea Grisafi,
David M. Wilkins,
Michael J. Willatt,
Michele Ceriotti
Abstract:
This chapter discusses the importance of incorporating three-dimensional symmetries in the context of statistical learning models geared towards the interpolation of the tensorial properties of atomic-scale structures. We focus on Gaussian process regression, and in particular on the construction of structural representations, and the associated kernel functions, that are endowed with the geometri…
▽ More
This chapter discusses the importance of incorporating three-dimensional symmetries in the context of statistical learning models geared towards the interpolation of the tensorial properties of atomic-scale structures. We focus on Gaussian process regression, and in particular on the construction of structural representations, and the associated kernel functions, that are endowed with the geometric covariance properties compatible with those of the learning targets. We summarize the general formulation of such a symmetry-adapted Gaussian process regression model, and how it can be implemented based on a scheme that generalizes the popular smooth overlap of atomic positions representation. We give examples of the performance of this framework when learning the polarizability and the ground-state electron density of a molecule.
△ Less
Submitted 2 April, 2019;
originally announced April 2019.
-
Machine-learning of atomic-scale properties based on physical principles
Authors:
Michele Ceriotti,
Michael J. Willatt,
Gábor Csányi
Abstract:
We briefly summarize the kernel regression approach, as used recently in materials modelling, to fitting functions, particularly potential energy surfaces, and highlight how the linear algebra framework can be used to both predict and train from linear functionals of the potential energy, such as the total energy and atomic forces. We then give a detailed account of the Smooth Overlap of Atomic Po…
▽ More
We briefly summarize the kernel regression approach, as used recently in materials modelling, to fitting functions, particularly potential energy surfaces, and highlight how the linear algebra framework can be used to both predict and train from linear functionals of the potential energy, such as the total energy and atomic forces. We then give a detailed account of the Smooth Overlap of Atomic Positions (SOAP) representation and kernel, showing how it arises from an abstract representation of smooth atomic densities, and how it is related to several popular density-based representations of atomic structure. We also discuss recent generalisations that allow fine control of correlations between different atomic species, prediction and fitting of tensorial properties, and also how to construct structural kernels---applicable to comparing entire molecules or periodic systems---that go beyond an additive combination of local environments.
△ Less
Submitted 30 January, 2019;
originally announced January 2019.
-
Fast and Accurate Uncertainty Estimation in Chemical Machine Learning
Authors:
Felix Musil,
Michael J. Willatt,
Mikhail A. Langovoy,
Michele Ceriotti
Abstract:
We present a scheme to obtain an inexpensive and reliable estimate of the uncertainty associated with the predictions of a machine-learning model of atomic and molecular properties. The scheme is based on resampling, with multiple models being generated based on sub-sampling of the same training data. The accuracy of the uncertainty prediction can be benchmarked by maximum likelihood estimation, w…
▽ More
We present a scheme to obtain an inexpensive and reliable estimate of the uncertainty associated with the predictions of a machine-learning model of atomic and molecular properties. The scheme is based on resampling, with multiple models being generated based on sub-sampling of the same training data. The accuracy of the uncertainty prediction can be benchmarked by maximum likelihood estimation, which can also be used to correct for correlations between resampled models, and to improve the performance of the uncertainty estimation by a cross-validation procedure. In the case of sparse Gaussian Process Regression models, this resampled estimator can be evaluated at negligible cost. We demonstrate the reliability of these estimates for the prediction of molecular energetics, and for the estimation of nuclear chemical shieldings in molecular crystals. Extension to estimate the uncertainty in energy differences, forces, or other correlated predictions is straightforward. This method can be easily applied to other machine learning schemes, and will be beneficial to make data-driven predictions more reliable, and to facilitate training-set optimization and active-learning strategies.
△ Less
Submitted 20 September, 2018;
originally announced September 2018.
-
Atom-Density Representations for Machine Learning
Authors:
Michael J. Willatt,
Felix Musil,
Michele Ceriotti
Abstract:
The applications of machine learning techniques to chemistry and materials science become more numerous by the day. The main challenge is to devise representations of atomic systems that are at the same time complete and concise, so as to reduce the number of reference calculations that are needed to predict the properties of different types of materials reliably. This has led to a proliferation o…
▽ More
The applications of machine learning techniques to chemistry and materials science become more numerous by the day. The main challenge is to devise representations of atomic systems that are at the same time complete and concise, so as to reduce the number of reference calculations that are needed to predict the properties of different types of materials reliably. This has led to a proliferation of alternative ways to convert an atomic structure into an input for a machine-learning model. We introduce an abstract definition of chemical environments that is based on a smoothed atomic density, using a bra-ket notation to emphasize basis set independence and to highlight the connections with some popular choices of representations for describing atomic systems. The correlations between the spatial distribution of atoms and their chemical identities are computed as inner products between these feature kets, which can be given an explicit representation in terms of the expansion of the atom density on orthogonal basis functions, that is equivalent to the smooth overlap of atomic positions (SOAP) power spectrum, but also in real space, corresponding to $n$-body correlations of the atom density. This formalism lays the foundations for a more systematic tuning of the behavior of the representations, by introducing operators that represent the correlations between structure, composition, and the target properties. It provides a unifying picture of recent developments in the field and indicates a way forward towards more effective and computationally affordable machine-learning schemes for molecules and materials.
△ Less
Submitted 28 January, 2019; v1 submitted 1 July, 2018;
originally announced July 2018.
-
Feature Optimization for Atomistic Machine Learning Yields A Data-Driven Construction of the Periodic Table of the Elements
Authors:
Michael J. Willatt,
Félix Musil,
Michele Ceriotti
Abstract:
Machine-learning of atomic-scale properties amounts to extracting correlations between structure, composition and the quantity that one wants to predict. Representing the input structure in a way that best reflects such correlations makes it possible to improve the accuracy of the model for a given amount of reference data. When using a description of the structures that is transparent and well-pr…
▽ More
Machine-learning of atomic-scale properties amounts to extracting correlations between structure, composition and the quantity that one wants to predict. Representing the input structure in a way that best reflects such correlations makes it possible to improve the accuracy of the model for a given amount of reference data. When using a description of the structures that is transparent and well-principled, optimizing the representation might reveal insights into the chemistry of the data set. Here we show how one can generalize the SOAP kernel to introduce a distance-dependent weight that accounts for the multi-scale nature of the interactions, and a description of correlations between chemical species. We show that this improves substantially the performance of ML models of molecular and materials stability, while making it easier to work with complex, multi-component systems and to extend SOAP to coarse-grained intermolecular potentials. The element correlations that give the best performing model show striking similarities with the conventional periodic table of the elements, providing an inspiring example of how machine learning can rediscover, and generalize, intuitive concepts that constitute the foundations of chemistry.
△ Less
Submitted 25 July, 2019; v1 submitted 30 June, 2018;
originally announced July 2018.
-
Relation of centroid molecular dynamics and ring-polymer molecular dynamics to exact quantum dynamics
Authors:
Timothy J. H. Hele,
Michael J. Willatt,
Andrea Muolo,
Stuart C. Althorpe
Abstract:
We recently obtained a quantum-Boltzmann-conserving classical dynamics by making a single change to the derivation of the `Classical Wigner' approximation. Here, we show that the further approximation of this `Matsubara dynamics' gives rise to two popular heuristic methods for treating quantum Boltzmann time-correlation functions: centroid molecular dynamics (CMD) and ring-polymer molecular dynami…
▽ More
We recently obtained a quantum-Boltzmann-conserving classical dynamics by making a single change to the derivation of the `Classical Wigner' approximation. Here, we show that the further approximation of this `Matsubara dynamics' gives rise to two popular heuristic methods for treating quantum Boltzmann time-correlation functions: centroid molecular dynamics (CMD) and ring-polymer molecular dynamics (RPMD). We show that CMD is a mean-field approximation to Matsubara dynamics, obtained by discarding (classical) fluctuations around the centroid, and that RPMD is the result of discarding a term in the Matsubara Liouvillian which shifts the frequencies of these fluctuations. These findings are consistent with previous numerical results, and give explicit formulae for the terms that CMD and RPMD leave out.
△ Less
Submitted 19 May, 2015;
originally announced May 2015.
-
Boltzmann-conserving classical dynamics in quantum time-correlation functions: Matsubara dynamics
Authors:
Timothy J. H. Hele,
Michael J. Willatt,
Andrea Muolo,
Stuart C. Althorpe
Abstract:
We show that a single change in the derivation of the linearized semiclassical-initial value representation (LSC-IVR or classical Wigner approximation) results in a classical dynamics which conserves the quantum Boltzmann distribution. We rederive the (standard) LSC-IVR approach by writing the (exact) quantum time-correlation function in terms of the normal modes of a free ring-polymer (i.e. a dis…
▽ More
We show that a single change in the derivation of the linearized semiclassical-initial value representation (LSC-IVR or classical Wigner approximation) results in a classical dynamics which conserves the quantum Boltzmann distribution. We rederive the (standard) LSC-IVR approach by writing the (exact) quantum time-correlation function in terms of the normal modes of a free ring-polymer (i.e. a discrete imaginary-time Feynman path), taking the limit that the number of polymer beads $N \to \infty$, such that the lowest normal-mode frequencies take their Matsubara values. The change we propose is to truncate the quantum Liouvillian, not explicitly in powers of $\hbar^2$ at $\hbar^0$ (which gives back the standard LSC-IVR approximation), but in the normal-mode derivatives corresponding to the lowest Matsubara frequencies. The resulting Matsubara dynamics is inherently classical (since all terms $\mathcal{O}\left(\hbar^{2}\right)$ disappear from the Matsubara Liouvillian in the limit $N \to \infty$), and conserves the quantum Boltzmann distribution because the Matsubara Hamiltonian is symmetric with respect to imaginary-time translation. Numerical tests show that the Matsubara approximation to the quantum time-correlation function converges with respect to the number of modes, and gives better agreement than LSC- IVR with the exact quantum result. Matsubara dynamics is too computationally expensive to be applied to complex systems, but its further approximation may lead to practical methods.
△ Less
Submitted 9 April, 2015;
originally announced April 2015.