-
Modeling the ferroelectric phase transition in barium titanate with DFT accuracy and converged sampling
Authors:
Lorenzo Gigli,
Alexander Goscinski,
Michele Ceriotti,
Gareth A. Tribello
Abstract:
The accurate description of the structural and thermodynamic properties of ferroelectrics has been one of the most remarkable achievements of Density Functional Theory (DFT). However, running large simulation cells with DFT is computationally demanding, while simulations of small cells are often plagued with non-physical effects that are a consequence of the system's finite size. To avoid these fi…
▽ More
The accurate description of the structural and thermodynamic properties of ferroelectrics has been one of the most remarkable achievements of Density Functional Theory (DFT). However, running large simulation cells with DFT is computationally demanding, while simulations of small cells are often plagued with non-physical effects that are a consequence of the system's finite size. To avoid these finite-size effects one is thus often forced to use empirical models that describe the physics of the material in terms of effective interaction terms, that are fitted using the results from DFT. In this study we use a machine-learning (ML) potential trained on DFT, in combination with accelerated sampling techniques, to converge the thermodynamic properties of Barium Titanate (BTO) with first-principles accuracy and a full atomistic description. Our results indicate that the predicted Curie temperature depends strongly on the choice of DFT functional and system size, because of emergent long-range directional correlations in the local dipole fluctuations. Our findings demonstrate how the combination of ML models and traditional bottom-up modeling allow one to investigate emergent phenomena with the accuracy of first-principles calculations over the large size and time scales afforded by empirical models.
△ Less
Submitted 13 June, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Optimal radial basis for density-based atomic representations
Authors:
Alexander Goscinski,
Félix Musil,
Sergey Pozdnyakov,
Michele Ceriotti
Abstract:
The input of almost every machine learning algorithm targeting the properties of matter at the atomic scale involves a transformation of the list of Cartesian atomic coordinates into a more symmetric representation. Many of the most popular representations can be seen as an expansion of the symmetrized correlations of the atom density, and differ mainly by the choice of basis. Considerable effort…
▽ More
The input of almost every machine learning algorithm targeting the properties of matter at the atomic scale involves a transformation of the list of Cartesian atomic coordinates into a more symmetric representation. Many of the most popular representations can be seen as an expansion of the symmetrized correlations of the atom density, and differ mainly by the choice of basis. Considerable effort has been dedicated to the optimization of the basis set, typically driven by heuristic considerations on the behavior of the regression target. Here we take a different, unsupervised viewpoint, aiming to determine the basis that encodes in the most compact way possible the structural information that is relevant for the dataset at hand. For each training dataset and number of basis functions, one can determine a unique basis that is optimal in this sense, and can be computed at no additional cost with respect to the primitive basis by approximating it with splines. We demonstrate that this construction yields representations that are accurate and computationally efficient, particularly when constructing representations that correspond to high-body order correlations. We present examples that involve both molecular and condensed-phase machine-learning models.
△ Less
Submitted 10 January, 2022; v1 submitted 18 May, 2021;
originally announced May 2021.
-
Efficient implementation of atom-density representations
Authors:
Félix Musil,
Max Veit,
Alexander Goscinski,
Guillaume Fraux,
Michael J. Willatt,
Markus Stricker,
Till Junge,
Michele Ceriotti
Abstract:
Physically-motivated and mathematically robust atom-centred representations of molecular structures are key to the success of modern atomistic machine learning (ML) methods. They lie at the foundation of a wide range of methods to predict the properties of both materials and molecules as well as to explore and visualize the chemical compound and configuration space. Recently, it has become clear t…
▽ More
Physically-motivated and mathematically robust atom-centred representations of molecular structures are key to the success of modern atomistic machine learning (ML) methods. They lie at the foundation of a wide range of methods to predict the properties of both materials and molecules as well as to explore and visualize the chemical compound and configuration space. Recently, it has become clear that many of the most effective representations share a fundamental formal connection: that they can all be expressed as a discretization of N-body correlation functions of the local atom density, suggesting the opportunity of standardizing and, more importantly, optimizing the calculation of such representations. We present an implementation, named librascal, whose modular design lends itself both to develo** refinements to the density-based formalism and to rapid prototy** for new developments of rotationally equivariant atomistic representations. As an example, we discuss SOAP features, perhaps the most widely used member of this family of representations, to show how the expansion of the local density can be optimized for any choice of radial basis set. We discuss the representation in the context of a kernel ridge regression model, commonly used with SOAP features, and analyze how the computational effort scales for each of the individual steps of the calculation. By applying data reduction techniques in feature space, we show how to further reduce the total computational cost by at up to a factor of 4 or 5 without affecting the model's symmetry properties and without significantly impacting its accuracy.
△ Less
Submitted 21 January, 2021;
originally announced January 2021.
-
The role of feature space in atomistic learning
Authors:
Alexander Goscinski,
Guillaume Fraux,
Giulio Imbalzano,
Michele Ceriotti
Abstract:
Eficient, physically-inspired descriptors of the structure and composition of molecules and materials play a key role in the application of machine-learning techniques to atomistic simulations. The proliferation of approaches, as well as the fact that each choice of features can lead to very different behavior depending on how they are used, e.g. by introducing non-linear kernels and non-Euclidean…
▽ More
Eficient, physically-inspired descriptors of the structure and composition of molecules and materials play a key role in the application of machine-learning techniques to atomistic simulations. The proliferation of approaches, as well as the fact that each choice of features can lead to very different behavior depending on how they are used, e.g. by introducing non-linear kernels and non-Euclidean metrics to manipulate them, makes it difficult to objectively compare different methods, and to address fundamental questions on how one feature space is related to another. In this work we introduce a framework to compare different sets of descriptors, and different ways of transforming them by means of metrics and kernels, in terms of the structure of the feature space that they induce. We define diagnostic tools to determine whether alternative feature spaces contain equivalent amounts of information, and whether the common information is substantially distorted when going from one feature space to another. We compare, in particular, representations that are built in terms of n-body correlations of the atom density, quantitatively assessing the information loss associated with the use of low-order features. We also investigate the impact of different choices of basis functions and hyperparameters of the widely used SOAP and Behler-Parrinello features, and investigate how the use of non-linear kernels, and of a Wasserstein-type metric, change the structure of the feature space in comparison to a simpler linear feature space.
△ Less
Submitted 10 December, 2020; v1 submitted 6 September, 2020;
originally announced September 2020.
-
The parallel Grover as dynamic system
Authors:
Alexander Goscinski
Abstract:
A sequential application of the Grover algorithm to solve the iterated search problem has been improved by Ozhigov by parallelizing the application of the oracle. In this work a representation of the parallel Grover as dynamic system of inversion about the mean and Grover operators is given. Within this representation the parallel Grover for $k = 2$ can be interpreted as rotation in three-dimensio…
▽ More
A sequential application of the Grover algorithm to solve the iterated search problem has been improved by Ozhigov by parallelizing the application of the oracle. In this work a representation of the parallel Grover as dynamic system of inversion about the mean and Grover operators is given. Within this representation the parallel Grover for $k = 2$ can be interpreted as rotation in three-dimensional space and it can be shown that the sole application of the parallel Grover operator does not lead to a solution for $k > 2$. We propose a solution for $k = 3$ with a number of approximately $1.51\sqrt{N}$ iterations.
△ Less
Submitted 9 August, 2018;
originally announced August 2018.