-
Accelerated First-Principles Exploration of Structure and Reactivity in Graphene Oxide
Authors:
Zakariya El-Machachi,
Damyan Frantzov,
A. Nijamudheen,
Tigany Zarrouk,
Miguel A. Caro,
Volker L. Deringer
Abstract:
Graphene oxide (GO) materials are widely studied, and yet their atomic-scale structures remain to be fully understood. Here we show that the chemical and configurational space of GO can be rapidly explored by advanced machine-learning methods, combining on-the-fly acceleration for first-principles molecular dynamics with message-passing neural-network potentials. The first step allows for the rapid sampling of chemical structures with very little prior knowledge required; the second step affords state-of-the-art accuracy and predictive power. We apply the method to the thermal reduction of GO, which we describe in a realistic (ten-nanometre scale) structural model. Our simulations are consistent with recent experimental findings and help to rationalise them in atomistic and mechanistic detail. More generally, our work provides a platform for routine, accurate, and predictive simulations of diverse carbonaceous materials.
Submitted 23 May, 2024;
originally announced May 2024.
-
A foundation model for atomistic materials chemistry
Authors:
Ilyes Batatia,
Philipp Benner,
Yuan Chiang,
Alin M. Elena,
Dávid P. Kovács,
Janosh Riebesell,
Xavier R. Advincula,
Mark Asta,
Matthew Avaylon,
William J. Baldwin,
Fabian Berger,
Noam Bernstein,
Arghya Bhowmik,
Samuel M. Blau,
Vlad Cărare,
James P. Darby,
Sandip De,
Flaviano Della Pia,
Volker L. Deringer,
Rokas Elijošius,
Zakariya El-Machachi,
Fabio Falcioni,
Edvin Fako,
Andrea C. Ferrari,
Annalena Genreith-Schriever,
et al. (51 additional authors not shown)
Abstract:
Machine-learned force fields have transformed the atomistic modelling of materials by enabling simulations of ab initio quality on unprecedented time and length scales. However, they are currently limited by: (i) the significant computational and human effort that must go into development and validation of potentials for each particular system of interest; and (ii) a general lack of transferability from one chemical system to the next. Here, using the state-of-the-art MACE architecture, we introduce a single general-purpose ML model, trained on a public database of 150k inorganic crystals, that is capable of running stable molecular dynamics on molecules and materials. We demonstrate the power of the MACE-MP-0 model - and its qualitative and at times quantitative accuracy - on a diverse set of problems in the physical sciences, including the properties of solids, liquids, gases, chemical reactions, interfaces and even the dynamics of a small protein. The model can be applied out of the box and as a starting or "foundation model" for any atomistic system of interest and is thus a step towards democratising the revolution of ML force fields by lowering the barriers to entry.
Submitted 1 March, 2024; v1 submitted 29 December, 2023;
originally announced January 2024.
-
High-dimensional order parameters and neural network classifiers applied to amorphous ices
Authors:
Zoé Faure Beaulieu,
Volker L. Deringer,
Fausto Martelli
Abstract:
Amorphous ice phases are key constituents of water's complex structural landscape. This study investigates the polyamorphic nature of water, focusing on the complexities within low-density amorphous ice (LDA), high-density amorphous ice (HDA), and the recently discovered medium-density amorphous ice (MDA). We use rotationally-invariant, high-dimensional order parameters to capture a wide spectrum of local symmetries for the characterisation of local oxygen environments. We train a neural network (NN) to classify these local environments, and investigate the distinctiveness of MDA within the structural landscape of amorphous ice. Our results highlight the difficulty in accurately differentiating MDA from LDA due to structural similarities. Beyond water, our methodology can be applied to investigate the structural properties and phases of disordered materials.
Submitted 21 December, 2023;
originally announced December 2023.
-
Understanding defects in amorphous silicon with million-atom simulations and machine learning
Authors:
Joe D. Morrow,
Chinonso Ugwumadu,
David A. Drabold,
Stephen R. Elliott,
Andrew L. Goodwin,
Volker L. Deringer
Abstract:
The structure of amorphous silicon is widely thought of as a fourfold-connected random network, and yet it is defective atoms, with fewer or more than four bonds, that make it particularly interesting. Despite many attempts to explain such "dangling-bond" and "floating-bond" defects, respectively, a unified understanding is still missing. Here, we show that atomistic machine-learning methods can reveal the complex structural and energetic landscape of defects in amorphous silicon. We study an ultra-large-scale, quantum-accurate structural model containing a million atoms, and more than ten thousand defects, allowing reliable defect-related statistics to be obtained. We combine structural descriptors and machine-learned local atomic energies to develop a universal classification of the different types of defects in amorphous silicon. The results suggest a revision of the established floating-bond model by showing that fivefold-coordinated atoms in amorphous silicon exhibit a wide range of local environments, and it is shown that fivefold (but not threefold) coordination defects tend to cluster together. Our study provides new insights into one of the most widely studied amorphous solids, and has general implications for modelling and understanding defects in disordered materials beyond silicon alone.
Submitted 31 August, 2023;
originally announced August 2023.
-
Synthetic pre-training for neural-network interatomic potentials
Authors:
John L. A. Gardner,
Kathryn T. Baker,
Volker L. Deringer
Abstract:
Machine learning (ML) based interatomic potentials have transformed the field of atomistic materials modelling. However, ML potentials depend critically on the quality and quantity of quantum-mechanical reference data with which they are trained, and therefore developing datasets and training pipelines is becoming an increasingly central challenge. Leveraging the idea of "synthetic" (artificial) data that is common in other areas of ML research, we here show that synthetic atomistic data, themselves obtained at scale with an existing ML potential, constitute a useful pre-training task for neural-network interatomic potential models. Once pre-trained with a large synthetic dataset, these models can be fine-tuned on a much smaller, quantum-mechanical one, improving numerical accuracy and stability in computational practice. We demonstrate feasibility for a series of equivariant graph-neural-network potentials for carbon, and we carry out initial experiments to test the limits of the approach.
Submitted 24 July, 2023;
originally announced July 2023.
-
Robustness of Local Predictions in Atomistic Machine Learning Models
Authors:
Sanggyu Chong,
Federico Grasselli,
Chiheb Ben Mahmoud,
Joe D. Morrow,
Volker L. Deringer,
Michele Ceriotti
Abstract:
Machine learning (ML) models for molecules and materials commonly rely on a decomposition of the global target quantity into local, atom-centered contributions. This approach is convenient from a computational perspective, enabling large-scale ML-driven simulations with a linear-scaling cost, and also allows for the identification and post-hoc interpretation of contributions from individual chemical environments and motifs to complicated macroscopic properties. However, even though there exist practical justifications for these decompositions, only the global quantity is rigorously defined, and thus it is unclear to what extent the atomistic terms predicted by the model can be trusted. Here, we introduce a quantitative metric, which we call the local prediction rigidity (LPR), that allows one to assess how robust the locally decomposed predictions of ML models are. We investigate the dependence of LPR on aspects of model training, particularly the composition of the training dataset, for a range of different problems from simple toy models to real chemical systems. We present strategies to systematically enhance the LPR, which can be used to improve the robustness, interpretability, and transferability of atomistic ML models.
Submitted 27 June, 2023;
originally announced June 2023.
-
Coarse-grained versus fully atomistic machine learning for zeolitic imidazolate frameworks
Authors:
Zoé Faure Beaulieu,
Thomas C. Nicholas,
John L. A. Gardner,
Andrew L. Goodwin,
Volker L. Deringer
Abstract:
Zeolitic imidazolate frameworks are widely thought of as being analogous to inorganic AB$_{2}$ phases. We test the validity of this assumption by comparing simplified and fully atomistic machine-learning models for local environments in ZIFs. Our work addresses the central question to what extent chemical information can be "coarse-grained" in hybrid framework materials.
Submitted 9 May, 2023;
originally announced May 2023.
-
Synthetic data enable experiments in atomistic machine learning
Authors:
John L. A. Gardner,
Zoé Faure Beaulieu,
Volker L. Deringer
Abstract:
Machine-learning models are increasingly used to predict properties of atoms in chemical systems. There have been major advances in developing descriptors and regression frameworks for this task, typically starting from (relatively) small sets of quantum-mechanical reference data. Larger datasets of this kind are becoming available, but remain expensive to generate. Here we demonstrate the use of a large dataset that we have "synthetically" labelled with per-atom energies from an existing ML potential model. The cheapness of this process, compared to the quantum-mechanical ground truth, allows us to generate millions of datapoints, in turn enabling rapid experimentation with atomistic ML models from the small- to the large-data regime. This approach allows us here to compare regression frameworks in depth, and to explore visualisation based on learned representations. We also show that learning synthetic data labels can be a useful pre-training task for subsequent fine-tuning on small datasets. In the future, we expect that our open-sourced dataset, and similar ones, will be useful in rapidly exploring deep-learning models in the limit of abundant chemical data.
Submitted 29 November, 2022;
originally announced November 2022.
-
How to validate machine-learned interatomic potentials
Authors:
Joe D. Morrow,
John L. A. Gardner,
Volker L. Deringer
Abstract:
Machine learning (ML) approaches enable large-scale atomistic simulations with near-quantum-mechanical accuracy. With the growing availability of these methods there arises a need for careful validation, particularly for physically agnostic models - that is, for potentials which extract the nature of atomic interactions from reference data. Here, we review the basic principles behind ML potentials and their validation for atomic-scale materials modeling. We discuss best practice in defining error metrics based on numerical performance as well as physically guided validation. We give specific recommendations that we hope will be useful for the wider community, including those researchers who intend to use ML potentials for materials "off the shelf".
Submitted 28 November, 2022; v1 submitted 22 November, 2022;
originally announced November 2022.
-
Structure and Bonding in Amorphous Red Phosphorus
Authors:
Yuxing Zhou,
Stephen R. Elliott,
Volker L. Deringer
Abstract:
Amorphous red phosphorus (a-P) is one of the remaining puzzling cases in the structural chemistry of the elements. Here, we elucidate the structure, stability, and chemical bonding in a-P from first principles, combining machine-learning and density-functional theory (DFT) methods. We show that a-P structures exist with a range of energies slightly higher than those of phosphorus nanorods, to which they are closely related, and that the stability of a-P is linked to the degree of structural relaxation and medium-range order. We thus complete the stability range of phosphorus allotropes [Angew. Chem. Int. Ed. 2014, 53, 11629] by now including the previously poorly understood amorphous phase, and we quantify the covalent and van der Waals interactions in all main phases of phosphorus. We also study the electronic densities of states, including those of hydrogenated a-P. Beyond the present study, our structural models are expected to enable wider-ranging first-principles investigations - for example, of a-P-based battery materials.
Submitted 9 November, 2022;
originally announced November 2022.
-
Exploring the configurational space of amorphous graphene with machine-learned atomic energies
Authors:
Zakariya El-Machachi,
Mark Wilson,
Volker L. Deringer
Abstract:
Two-dimensionally extended amorphous carbon ("amorphous graphene") is a prototype system for disorder in 2D, showing a rich and complex configurational space that is yet to be fully understood. Here we explore the nature of amorphous graphene with an atomistic machine-learning (ML) model. We create structural models by introducing defects into ordered graphene through Monte-Carlo bond switching, defining acceptance criteria using the machine-learned local, atomic energies associated with a defect, as well as the nearest-neighbor (NN) environments. We find that physically meaningful structural models arise from ML atomic energies in this way, ranging from continuous random networks to paracrystalline structures. Our results show that ML atomic energies can be used to guide Monte-Carlo structural searches in principle, and that their predictions of local stability can be linked to short- and medium-range order in amorphous graphene. We expect that the former point will be relevant more generally to the study of amorphous materials, and that the latter has wider implications for the interpretation of ML potential models.
Submitted 19 July, 2022;
originally announced July 2022.
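As a rough illustration of the Monte-Carlo acceptance step mentioned in the abstract above, the sketch below uses a standard Metropolis criterion on an energy difference. This is an assumption made for illustration only: the abstract says that acceptance criteria are defined from machine-learned local atomic energies and nearest-neighbor environments, but it does not give their exact form. The function name `metropolis_accept`, its parameters, and the idea of passing a summed local-energy change as `delta_e` are all hypothetical.

```python
import math
import random

# Boltzmann constant in eV/K (CODATA value).
K_B_EV_PER_K = 8.617333262e-5


def metropolis_accept(delta_e, temperature, rng=random.random):
    """Decide whether to accept a trial bond switch.

    delta_e     -- hypothetical change in energy (eV) caused by the trial move,
                   e.g. a sum of ML local atomic energies after minus before
    temperature -- fictitious sampling temperature in K
    rng         -- callable returning a uniform random number in [0, 1)
    """
    if delta_e <= 0.0:
        # Moves that lower the energy are always accepted.
        return True
    # Uphill moves are accepted with Boltzmann probability.
    return rng() < math.exp(-delta_e / (K_B_EV_PER_K * temperature))
```

In a bond-switching loop, a move would be proposed, `delta_e` evaluated for the atoms near the switched bond, and the move kept or reverted according to the returned value.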
-
Designing inorganic semiconductors with cold-rolling processability
Authors:
Xu-Dong Wang,
Jieling Tan,
Jian Ouyang,
Hangming Zhang,
Jiang-Jing Wang,
Yuecun Wang,
Volker L. Deringer,
Jian Zhou,
Wei Zhang,
En Ma
Abstract:
While metals can be readily processed and reshaped by cold rolling, most bulk inorganic semiconductors are brittle materials that tend to fracture when plastically deformed. Manufacturing thin sheets and foils of inorganic semiconductors is therefore a bottleneck problem, severely restricting their use in flexible electronics applications. It was recently reported that a few single-crystalline two-dimensional van der Waals (vdW) semiconductors, such as InSe, are deformable under compressive stress. Here we demonstrate that intralayer fracture toughness can be tailored via compositional design to make inorganic semiconductors processable by cold rolling. We report systematic ab initio calculations covering a range of van der Waals semiconductors homologous to InSe, leading to material-property maps that forecast trends in both the susceptibility to interlayer slip and the intralayer fracture toughness against cracking. GaSe has been predicted, and experimentally confirmed, to be practically amenable to being rolled to large (three quarters) thickness reduction and length extension by a factor of three. Our findings open a new realm of possibility for alloy selection and design towards processing-friendly group-III chalcogenides for flexible electronic and thermoelectric applications.
Submitted 19 June, 2022;
originally announced July 2022.
-
Indirect Learning of Interatomic Potentials for Accelerated Materials Simulations
Authors:
Joe D. Morrow,
Volker L. Deringer
Abstract:
Machine learning (ML) based interatomic potentials are emerging tools for materials simulations but require a trade-off between accuracy and speed. Here we show how one can use one ML potential model to train another: we use an existing, accurate, but more computationally expensive model to generate reference data (locations and labels) for a series of much faster potentials. Without the need for quantum-mechanical reference computations at the secondary stage, extensive reference datasets can be easily generated, and we find that this improves the quality of fast potentials with less flexible functional forms. We apply the technique to disordered silicon, including a simulation of vitrification and polycrystalline grain formation under pressure with a system size of a million atoms. Our work provides conceptual insight into the machine learning of interatomic potential models, and it suggests a route toward accelerated simulations of condensed-phase systems and nanostructured materials.
Submitted 21 February, 2022; v1 submitted 22 November, 2021;
originally announced November 2021.
-
An Accurate and Transferable Machine Learning Potential for Carbon
Authors:
Patrick Rowe,
Volker L. Deringer,
Piero Gasparotto,
Gábor Csányi,
Angelos Michaelides
Abstract:
We present an accurate machine learning (ML) model for atomistic simulations of carbon, constructed using the Gaussian approximation potential (GAP) methodology. The potential, named GAP-20, describes the properties of the bulk crystalline and amorphous phases, crystal surfaces and defect structures with an accuracy approaching that of direct ab initio simulation, but at a significantly reduced cost. We combine structural databases for amorphous carbon and graphene, which we extend substantially by adding suitable configurations, for example, for defects in graphene and other nanostructures. The final potential is fitted to reference data computed using the optB88-vdW density functional theory (DFT) functional. Dispersion interactions, which are crucial to describe multilayer carbonaceous materials, are therefore implicitly included. We additionally account for long-range dispersion interactions using a semianalytical two-body term and show that an improved model can be obtained through an optimisation of the many-body smooth overlap of atomic positions (SOAP) descriptor. We rigorously test the potential on lattice parameters, bond lengths, formation energies and phonon dispersions of numerous carbon allotropes. We compare the formation energies of an extensive set of defect structures, surfaces and surface reconstructions to DFT reference calculations. The present work demonstrates the ability to combine, in the same ML model, the previously attained flexibility required for amorphous carbon [Phys. Rev. B, 95, 094203, (2017)] with the high numerical accuracy necessary for crystalline graphene [Phys. Rev. B, 97, 054303, (2018)], thereby providing an interatomic potential that will be applicable to a wide range of applications concerning diverse forms of bulk and nanostructured carbon.
Submitted 24 June, 2020;
originally announced June 2020.
-
Combining phonon accuracy with high transferability in Gaussian approximation potential models
Authors:
Janine George,
Geoffroy Hautier,
Albert P. Bartók,
Gábor Csányi,
Volker L. Deringer
Abstract:
Machine learning driven interatomic potentials, including Gaussian approximation potential (GAP) models, are emerging tools for atomistic simulations. Here, we address the methodological question of how one can fit GAP models that accurately predict vibrational properties in specific regions of configuration space, whilst retaining flexibility and transferability to others. We use an adaptive regularization of the GAP fit that scales with the absolute force magnitude on any given atom, thereby exploring the Bayesian interpretation of GAP regularization as an "expected error", and its impact on the prediction of physical properties for a material of interest. The approach enables excellent predictions of phonon modes (to within 0.1-0.2 THz) for structurally diverse silicon allotropes, and it can be coupled with existing fitting databases for high transferability. These findings and workflows are expected to be useful for GAP-driven materials modeling more generally.
Submitted 14 May, 2020;
originally announced May 2020.
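The idea in the abstract above of a regularization that scales with the absolute force magnitude on each atom can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the linear scaling, the lower bound, and the names `force_regularization`, `fraction`, and `floor` (with their default values) are hypothetical choices, since the abstract does not specify the functional form.

```python
import math


def force_regularization(forces, fraction=0.1, floor=0.01):
    """Return one regularization value ("expected error") per atom.

    forces   -- iterable of (fx, fy, fz) force components, e.g. in eV/A
    fraction -- hypothetical proportionality factor between the force
                magnitude and the regularization
    floor    -- hypothetical lower bound, so that atoms with near-zero
                forces still receive a finite regularization
    """
    sigmas = []
    for fx, fy, fz in forces:
        magnitude = math.sqrt(fx * fx + fy * fy + fz * fz)
        # Larger forces get a looser (larger) expected error;
        # small forces are fitted more tightly, down to the floor.
        sigmas.append(max(floor, fraction * magnitude))
    return sigmas
```

With a scheme of this kind, near-equilibrium configurations with small forces are fitted tightly, which is the regime that matters for phonons, while strongly distorted configurations are fitted more loosely.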
-
De novo exploration and self-guided learning of potential-energy surfaces
Authors:
Noam Bernstein,
Gábor Csányi,
Volker L. Deringer
Abstract:
Interatomic potential models based on machine learning (ML) are rapidly developing as tools for materials simulations. However, because of their flexibility, they require large fitting databases that are normally created with substantial manual selection and tuning of reference configurations. Here, we show that ML potentials can be built in a largely automated fashion, exploring and fitting potential-energy surfaces from the beginning (de novo) within one and the same protocol. The key enabling step is the use of a configuration-averaged kernel metric that allows one to select the few most relevant structures at each step. The resulting potentials are accurate and robust for the wide range of configurations that occur during structure searching, despite only requiring a relatively small number of single-point DFT calculations on small unit cells. We apply the method to materials with diverse chemical nature and coordination environments, marking a milestone toward the more routine application of ML potentials in physics, chemistry, and materials science.
Submitted 24 May, 2019;
originally announced May 2019.