Search | arXiv e-print repository

Adaptive energy reference for machine-learning models of the electronic density of states

Authors: Wei Bin How, Sanggyu Chong, Federico Grasselli, Kevin K. Huguenin-Dumittan, Michele Ceriotti

Abstract: The electronic density of states (DOS) provides information regarding the distribution of electronic states in a material, and can be used to approximate its optical and electronic properties and therefore guide computational material design. Given its usefulness and relative simplicity, it has been one of the first electronic properties used as target for machine-learning approaches going beyond… ▽ More The electronic density of states (DOS) provides information regarding the distribution of electronic states in a material, and can be used to approximate its optical and electronic properties and therefore guide computational material design. Given its usefulness and relative simplicity, it has been one of the first electronic properties used as target for machine-learning approaches going beyond interatomic potentials. A subtle but important point, well-appreciated in the condensed matter community but usually overlooked in the construction of data-driven models, is that for bulk configurations the absolute energy reference of single-particle energy levels is ill-defined. Only energy differences matter, and quantities derived from the DOS are typically independent on the absolute alignment. We introduce an adaptive scheme that optimizes the energy reference of each structure as part of training, and show that it consistently improves the quality of ML models compared to traditional choices of energy reference, for different classes of materials and different model architectures. On a practical level, we trace the improved performance to the ability of this self-aligning scheme to match the most prominent features in the DOS. More broadly, we believe that this work highlights the importance of incorporating insights into the nature of the physical target into the definition of the architecture and of the appropriate figures of merit for machine-learning models, that translate in better transferability and overall performance. △ Less

Submitted 2 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

arXiv:2406.17747 [pdf, other]

Probing the effects of broken symmetries in machine learning

Authors: Marcel F. Langer, Sergey N. Pozdnyakov, Michele Ceriotti

Abstract: Symmetry is one of the most central concepts in physics, and it is no surprise that it has also been widely adopted as an inductive bias for machine-learning models applied to the physical sciences. This is especially true for models targeting the properties of matter at the atomic scale. Both established and state-of-the-art approaches, with almost no exceptions, are built to be exactly equivaria… ▽ More Symmetry is one of the most central concepts in physics, and it is no surprise that it has also been widely adopted as an inductive bias for machine-learning models applied to the physical sciences. This is especially true for models targeting the properties of matter at the atomic scale. Both established and state-of-the-art approaches, with almost no exceptions, are built to be exactly equivariant to translations, permutations, and rotations of the atoms. Incorporating symmetries -- rotations in particular -- constrains the model design space and implies more complicated architectures that are often also computationally demanding. There are indications that non-symmetric models can easily learn symmetries from data, and that doing so can even be beneficial for the accuracy of the model. We put a model that obeys rotational invariance only approximately to the test, in realistic scenarios involving simulations of gas-phase, liquid, and solid water. We focus specifically on physical observables that are likely to be affected -- directly or indirectly -- by symmetry breaking, finding negligible consequences when the model is used in an interpolative, bulk, regime. Even for extrapolative gas-phase predictions, the model remains very stable, even though symmetry artifacts are noticeable. We also discuss strategies that can be used to systematically reduce the magnitude of symmetry breaking when it occurs, and assess their impact on the convergence of observables. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2405.15224 [pdf, other]

i-PI 3.0: a flexible, efficient framework for advanced atomistic simulations

Authors: Yair Litman, Venkat Kapil, Yotam M. Y. Feldman, Davide Tisi, Tomislav Begušić, Karen Fidanyan, Guillaume Fraux, Jacob Higer, Matthias Kellner, Tao E. Li, Eszter S. Pós, Elia Stocco, George Trenins, Barak Hirshberg, Mariana Rossi, Michele Ceriotti

Abstract: Atomic-scale simulations have progressed tremendously over the past decade, largely due to the availability of interatomic potentials based on machine-learning algorithms. These potentials enable the combination of the accuracy of electronic structure calculations with extensive length and time scales. In this paper, we present a new release of the i-PI code that allows the community to fully bene… ▽ More Atomic-scale simulations have progressed tremendously over the past decade, largely due to the availability of interatomic potentials based on machine-learning algorithms. These potentials enable the combination of the accuracy of electronic structure calculations with extensive length and time scales. In this paper, we present a new release of the i-PI code that allows the community to fully benefit from the rapid developments in the field. i-PI is a Python software that facilitates the integration of different methods and different software tools by using a socket interface for inter-process communication. The current framework is flexible enough to support rapid prototy** and the combination of various simulation techniques, while maintaining a speed that prevents it from becoming the bottleneck in most workflows. We discuss the implementation of several new features, including an efficient algorithm to model bosonic and fermionic exchange, a framework for uncertainty quantification to be used in conjunction with machine-learning potentials, a communication infrastructure that allows deeper integration with electronic-driven simulations, and an approach to simulate coupled photon-nuclear dynamics in optical or plasmonic cavities. For this release, we also improved some computational bottlenecks of the implementation, reducing the overhead associated with using i-PI over a native implementation of molecular dynamics techniques. We show numerical benchmarks using widely adopted machine learning potentials, such as Behler-Parinello, DeepMD and MACE neural networks, and demonstrate that such overhead is negligible for systems containing between 100 and 12000 atoms. △ Less

Submitted 24 May, 2024; originally announced May 2024.

arXiv:2403.02251 [pdf, other]

A prediction rigidity formalism for low-cost uncertainties in trained neural networks

Authors: Filippo Bigi, Sanggyu Chong, Michele Ceriotti, Federico Grasselli

Abstract: Regression methods are fundamental for scientific and technological applications. However, fitted models can be highly unreliable outside of their training domain, and hence the quantification of their uncertainty is crucial in many of their applications. Based on the solution of a constrained optimization problem, we propose "prediction rigidities" as a method to obtain uncertainties of arbitrary… ▽ More Regression methods are fundamental for scientific and technological applications. However, fitted models can be highly unreliable outside of their training domain, and hence the quantification of their uncertainty is crucial in many of their applications. Based on the solution of a constrained optimization problem, we propose "prediction rigidities" as a method to obtain uncertainties of arbitrary pre-trained regressors. We establish a strong connection between our framework and Bayesian inference, and we develop a last-layer approximation that allows the new method to be applied to neural networks. This extension affords cheap uncertainties without any modification to the neural network itself or its training procedure. We show the effectiveness of our method on a wide range of regression tasks, ranging from simple toy models to applications in chemistry and meteorology. △ Less

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2402.16621 [pdf, other]

Uncertainty quantification by direct propagation of shallow ensembles

Authors: Matthias Kellner, Michele Ceriotti

Abstract: Statistical learning algorithms provide a generally-applicable framework to sidestep time-consuming experiments, or accurate physics-based modeling, but they introduce a further source of error on top of the intrinsic limitations of the experimental or theoretical setup. Uncertainty estimation is essential to quantify this error, and make application of data-centric approaches more trustworthy. To… ▽ More Statistical learning algorithms provide a generally-applicable framework to sidestep time-consuming experiments, or accurate physics-based modeling, but they introduce a further source of error on top of the intrinsic limitations of the experimental or theoretical setup. Uncertainty estimation is essential to quantify this error, and make application of data-centric approaches more trustworthy. To ensure that uncertainty quantification is used widely, one should aim for algorithms that are reasonably accurate, but also easy to implement and apply. In particular, including uncertainty quantification on top of an existing architecture should be straightforward, and add minimal computational overhead. Furthermore, it should be easy to manipulate or combine multiple machine-learning predictions, propagating uncertainty over further modeling steps. We compare several well-established uncertainty quantification frameworks against these requirements, and propose a practical approach, which we dub direct propagation of shallow ensembles, that provides a good compromise between ease of use and accuracy. We present benchmarks for generic datasets, and an in-depth study of applications to the field of atomistic machine learning for chemistry and materials. These examples underscore the importance of using a formulation that allows propagating errors without making strong assumptions on the correlations between different predictions of the model. △ Less

Submitted 16 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

arXiv:2401.12936 [pdf, other]

doi 10.1103/PhysRevMaterials.8.065403

Thermal conductivity of Li$_3$PS$_4$ solid electrolytes with ab initio accuracy

Authors: Davide Tisi, Federico Grasselli, Lorenzo Gigli, Michele Ceriotti

Abstract: The vast amount of computational studies on electrical conduction in solid-state electrolytes is not mirrored by comparable efforts addressing thermal conduction, which has been scarcely investigated despite its relevance to thermal management and (over)heating of batteries. The reason for this lies in the complexity of the calculations: on one hand, the diffusion of ionic charge carriers makes la… ▽ More The vast amount of computational studies on electrical conduction in solid-state electrolytes is not mirrored by comparable efforts addressing thermal conduction, which has been scarcely investigated despite its relevance to thermal management and (over)heating of batteries. The reason for this lies in the complexity of the calculations: on one hand, the diffusion of ionic charge carriers makes lattice methods formally unsuitable, due to the lack of equilibrium atomic positions needed for normal-mode expansion. On the other hand, the prohibitive cost of large-scale molecular dynamics (MD) simulations of heat transport in large systems at ab initio levels has hindered the use of MD-based methods. In this paper, we leverage recently developed machine-learning potentials targeting different ab initio functionals (PBEsol, r$^2$SCAN, PBE0) and a state-of-the-art formulation of the Green-Kubo theory of heat transport in multicomponent systems to compute the thermal conductivity of a promising solid-state electrolyte, Li$_3$PS$_4$, in all its polymorphs ($α$, $β$, and $γ$). By comparing MD estimates with lattice methods on the low-temperature, nondiffusive $γ$-Li$_3$PS$_4$, we highlight strong anharmonicities and negligible nuclear quantum effects, hence further justifying MD-based methods even for nondiffusive phases. Finally, for the ion-conducting $α$ and $β$ phases, where the multicomponent Green-Kubo MD approach is mandatory, our simulations indicate a weak temperature dependence of the thermal conductivity, a glass-like behavior due to the effective local disorder characterizing these Li-diffusing phases. △ Less

Submitted 17 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

Comments: 11 pages, 5 figures

Journal ref: Phys. Rev. Materials 8 (2024) 065403

arXiv:2311.15394 [pdf, other]

Natural Aging and Vacancy Trap** in Al-6xxx

Authors: Abhinav C. P. Jain, M. Ceriotti, W. A. Curtin

Abstract: Undesirable natural aging (NA) in Al-6xxx delays subsequent artificial aging (AA) but the size, composition, and evolution of clustering are challenging to measure. Here, atomistic details of early-stage clustering in Al-1\%Mg-0.6\%Si during NA are studied computationally using a chemically-accurate neural-network potential. Feasible growth paths for the preferred $β''$ precipitates identify: domi… ▽ More Undesirable natural aging (NA) in Al-6xxx delays subsequent artificial aging (AA) but the size, composition, and evolution of clustering are challenging to measure. Here, atomistic details of early-stage clustering in Al-1\%Mg-0.6\%Si during NA are studied computationally using a chemically-accurate neural-network potential. Feasible growth paths for the preferred $β''$ precipitates identify: dominant clusters differing from $β''$ motifs; spontaneous vacancy-interstitial formation creating 14-18 solute atom $β''$-like motifs; and lower-energy clusters requiring chemical re-arrangement to form $β''$ nuclei. Quasi-on-lattice kinetic Monte Carlo simulations reveal that 8-14 solute atom clusters form within 1000 s but that growth slows considerably due to vacancy trap** inside clusters, with trap** energies of 0.3-0.5 eV. These findings rationalize why cluster growth and alloy hardness saturate during NA, confirm the concept of ''vacancy prisons", and suggest why clusters must be dissolved during AA before formation of $β''$. This atomistic understanding of NA may enable design of strategies to mitigate negative effects of NA. △ Less

Submitted 26 November, 2023; originally announced November 2023.

arXiv:2311.00844 [pdf, other]

Electronic excited states from physically-constrained machine learning

Authors: Edoardo Cignoni, Divya Suman, Jigyasa Nigam, Lorenzo Cupellini, Benedetta Mennucci, Michele Ceriotti

Abstract: Data-driven techniques are increasingly used to replace electronic-structure calculations of matter. In this context, a relevant question is whether machine learning (ML) should be applied directly to predict the desired properties or be combined explicitly with physically-grounded operations. We present an example of an integrated modeling approach, in which a symmetry-adapted ML model of an effe… ▽ More Data-driven techniques are increasingly used to replace electronic-structure calculations of matter. In this context, a relevant question is whether machine learning (ML) should be applied directly to predict the desired properties or be combined explicitly with physically-grounded operations. We present an example of an integrated modeling approach, in which a symmetry-adapted ML model of an effective Hamiltonian is trained to reproduce electronic excitations from a quantum-mechanical calculation. The resulting model can make predictions for molecules that are much larger and more complex than those that it is trained on, and allows for dramatic computational savings by indirectly targeting the outputs of well-converged calculations while using a parameterization corresponding to a minimal atom-centered basis. These results emphasize the merits of intertwining data-driven techniques with physical approximations, improving the transferability and interpretability of ML models without affecting their accuracy and computational efficiency, and providing a blueprint for develo** ML-augmented electronic-structure methods. △ Less

Submitted 7 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

arXiv:2310.15679 [pdf, other]

Mechanism of charge transport in lithium thiophosphate

Authors: Lorenzo Gigli, Davide Tisi, Federico Grasselli, Michele Ceriotti

Abstract: Lithium ortho-thiophosphate (Li$_3$PS$_4$) has emerged as a promising candidate for solid-state-electrolyte batteries, thanks to its highly conductive phases, cheap components, and large electrochemical stability range. Nonetheless, the microscopic mechanisms of Li-ion transport in Li$_3$PS$_4$ are far to be fully understood, the role of PS$_4$ dynamics in charge transport still being controversia… ▽ More Lithium ortho-thiophosphate (Li$_3$PS$_4$) has emerged as a promising candidate for solid-state-electrolyte batteries, thanks to its highly conductive phases, cheap components, and large electrochemical stability range. Nonetheless, the microscopic mechanisms of Li-ion transport in Li$_3$PS$_4$ are far to be fully understood, the role of PS$_4$ dynamics in charge transport still being controversial. In this work, we build machine learning potentials targeting state-of-the-art DFT references (PBEsol, r$^2$SCAN, and PBE0) to tackle this problem in all known phases of Li$_3$PS$_4$ ($α$, $β$ and $γ$), for large system sizes and timescales. We discuss the physical origin of the observed superionic behavior of Li$_3$PS$_4$: the activation of PS$_4$ flip** drives a structural transition to a highly conductive phase, characterized by an increase of Li-site availability and by a drastic reduction in the activation energy of Li-ion diffusion. We also rule out any paddle-wheel effects of PS$_4$ tetrahedra in the superionic phases -- previously claimed to enhance Li-ion diffusion -- due to the orders-of-magnitude difference between the rate of PS$_4$ flips and Li-ion hops at all temperatures below melting. We finally elucidate the role of inter-ionic dynamical correlations in charge transport, by highlighting the failure of the Nernst-Einstein approximation to estimate the electrical conductivity. Our results show a strong dependence on the target DFT reference, with PBE0 yielding the best quantitative agreement with experimental measurements not only for the electronic band-gap but also for the electrical conductivity of $β$- and $α$-Li$_3$PS$_4$. △ Less

Submitted 10 January, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

Comments: Main: 18 pages, 9 figures; Supporting Information: 13 pages, 13 figures

arXiv:2310.12579 [pdf, other]

Modeling the ferroelectric phase transition in barium titanate with DFT accuracy and converged sampling

Authors: Lorenzo Gigli, Alexander Goscinski, Michele Ceriotti, Gareth A. Tribello

Abstract: The accurate description of the structural and thermodynamic properties of ferroelectrics has been one of the most remarkable achievements of Density Functional Theory (DFT). However, running large simulation cells with DFT is computationally demanding, while simulations of small cells are often plagued with non-physical effects that are a consequence of the system's finite size. To avoid these fi… ▽ More The accurate description of the structural and thermodynamic properties of ferroelectrics has been one of the most remarkable achievements of Density Functional Theory (DFT). However, running large simulation cells with DFT is computationally demanding, while simulations of small cells are often plagued with non-physical effects that are a consequence of the system's finite size. To avoid these finite-size effects one is thus often forced to use empirical models that describe the physics of the material in terms of effective interaction terms, that are fitted using the results from DFT. In this study we use a machine-learning (ML) potential trained on DFT, in combination with accelerated sampling techniques, to converge the thermodynamic properties of Barium Titanate (BTO) with first-principles accuracy and a full atomistic description. Our results indicate that the predicted Curie temperature depends strongly on the choice of DFT functional and system size, because of emergent long-range directional correlations in the local dipole fluctuations. Our findings demonstrate how the combination of ML models and traditional bottom-up modeling allow one to investigate emergent phenomena with the accuracy of first-principles calculations over the large size and time scales afforded by empirical models. △ Less

Submitted 13 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

Comments: 15 pages, 10 figures

arXiv:2310.07604 [pdf, other]

Surface segregation in high-entropy alloys from alchemical machine learning

Authors: Arslan Mazitov, Maximilian A. Springer, Nataliya Lopanitsyna, Guillaume Fraux, Sandip De, Michele Ceriotti

Abstract: High-entropy alloys (HEAs), containing several metallic elements in near-equimolar proportions, have long been of interest for their unique mechanical properties. More recently, they have emerged as a promising platform for the development of novel heterogeneous catalysts, because of the large design space, and the synergistic effects between their components. In this work we use a machine-learnin… ▽ More High-entropy alloys (HEAs), containing several metallic elements in near-equimolar proportions, have long been of interest for their unique mechanical properties. More recently, they have emerged as a promising platform for the development of novel heterogeneous catalysts, because of the large design space, and the synergistic effects between their components. In this work we use a machine-learning potential that can model simultaneously up to 25 transition metals to study the tendency of different elements to segregate at the surface of a HEA. We use as a starting point a potential that was previously developed using exclusively crystalline bulk phases, and show that, thanks to the physically-inspired functional form of the model, adding a much smaller number of defective configurations makes it capable of describing surface phenomena. We then present several computational studies of surface segregation, including both a simulation of a 25-element alloy, that provides a rough estimate of the relative surface propensity of the various elements, and targeted studies of CoCrFeMnNi and IrFeCoNiCu, which provide further validation of the model, and insights to guide the modeling and design of alloys for heterogeneous catalysis. △ Less

Submitted 11 January, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

arXiv:2308.13208 [pdf, other]

Physics-inspired Equivariant Descriptors of Non-bonded Interactions

Authors: Kevin K. Huguenin-Dumittan, Philip Loche, Ni Haoran, Michele Ceriotti

Abstract: One essential ingredient in many machine learning (ML) based methods for atomistic modeling of materials and molecules is the use of locality. While allowing better system-size scaling, this systematically neglects long-range (LR) effects, such as electrostatics or dispersion interaction. We present an extension of the long distance equivariant (LODE) framework that can handle diverse LR interacti… ▽ More One essential ingredient in many machine learning (ML) based methods for atomistic modeling of materials and molecules is the use of locality. While allowing better system-size scaling, this systematically neglects long-range (LR) effects, such as electrostatics or dispersion interaction. We present an extension of the long distance equivariant (LODE) framework that can handle diverse LR interactions in a consistent way, and seamlessly integrates with preexisting methods by building new sets of atom centered features. We provide a direct physical interpretation of these using the multipole expansion, which allows for simpler and more efficient implementations. The framework is applied to simple toy systems as proof of concept, and a heterogeneous set of molecular dimers to push the method to its limits. By generalizing LODE to arbitrary asymptotic behaviors, we provide a coherent approach to treat arbitrary two- and many-body non-bonded interactions in the data-driven modeling of matter. △ Less

Submitted 3 October, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

arXiv:2306.15638 [pdf, other]

Robustness of Local Predictions in Atomistic Machine Learning Models

Authors: Sanggyu Chong, Federico Grasselli, Chiheb Ben Mahmoud, Joe D. Morrow, Volker L. Deringer, Michele Ceriotti

Abstract: Machine learning (ML) models for molecules and materials commonly rely on a decomposition of the global target quantity into local, atom-centered contributions. This approach is convenient from a computational perspective, enabling large-scale ML-driven simulations with a linear-scaling cost, and also allow for the identification and post-hoc interpretation of contributions from individual chemica… ▽ More Machine learning (ML) models for molecules and materials commonly rely on a decomposition of the global target quantity into local, atom-centered contributions. This approach is convenient from a computational perspective, enabling large-scale ML-driven simulations with a linear-scaling cost, and also allow for the identification and post-hoc interpretation of contributions from individual chemical environments and motifs to complicated macroscopic properties. However, even though there exist practical justifications for these decompositions, only the global quantity is rigorously defined, and thus it is unclear to what extent the atomistic terms predicted by the model can be trusted. Here, we introduce a quantitative metric, which we call the local prediction rigidity (LPR), that allows one to assess how robust the locally decomposed predictions of ML models are. We investigate the dependence of LPR on the aspects of model training, particularly the composition of training dataset, for a range of different problems from simple toy models to real chemical systems. We present strategies to systematically enhance the LPR, which can be used to improve the robustness, interpretability, and transferability of atomistic ML models. △ Less

Submitted 27 June, 2023; originally announced June 2023.

arXiv:2306.08429 [pdf, other]

doi 10.1021/ct500950z

Probing the unfolded configurations of a $β$-hairpin using sketch-map

Authors: Albert Ardevol, Gareth A. Tribello, Michele Ceriotti, Michele Parrinello

Abstract: This work examines the conformational ensemble involved in $β$-hairpin folding by means of advanced molecular dynamics simulations and dimensionality reduction. A fully atomistic description of the protein and the surrounding solvent molecules is used and this complex energy landscape is sampled by means of parallel tempering metadynamics simulations. The ensemble of configurations explored is ana… ▽ More This work examines the conformational ensemble involved in $β$-hairpin folding by means of advanced molecular dynamics simulations and dimensionality reduction. A fully atomistic description of the protein and the surrounding solvent molecules is used and this complex energy landscape is sampled by means of parallel tempering metadynamics simulations. The ensemble of configurations explored is analysed using the recently proposed sketch-map algorithm. Further simulations allow us to probe how mutations affect the structures adopted by this protein. We find that many of the configurations adopted by a mutant are the same as those adopted by the wild type protein. Furthermore, certain mutations destabilize secondary structure containing configurations by preventing the formation of hydrogen bonds or by promoting the formation of new intramolecular contacts. Our analysis demonstrates that machine-learning techniques can be used to study the energy landscapes of complex molecules and that the visualizations that are generated in this way provide a natural basis for examining how the stabilities of particular configurations of the molecule are affected by factors such as temperature or structural mutations. △ Less

Submitted 14 June, 2023; originally announced June 2023.

Comments: Pre-print of doi:10.1021/ct500950z, reproduced with permission from the ACS

Journal ref: Journal of Chemical Theory and Computation 11(3), 1086-1093 (2015)

arXiv:2305.19302 [pdf, other]

Smooth, exact rotational symmetrization for deep learning on point clouds

Authors: Sergey N. Pozdnyakov, Michele Ceriotti

Abstract: Point clouds are versatile representations of 3D objects and have found widespread application in science and engineering. Many successful deep-learning models have been proposed that use them as input. The domain of chemical and materials modeling is especially challenging because exact compliance with physical constraints is highly desirable for a model to be usable in practice. These constraint… ▽ More Point clouds are versatile representations of 3D objects and have found widespread application in science and engineering. Many successful deep-learning models have been proposed that use them as input. The domain of chemical and materials modeling is especially challenging because exact compliance with physical constraints is highly desirable for a model to be usable in practice. These constraints include smoothness and invariance with respect to translations, rotations, and permutations of identical atoms. If these requirements are not rigorously fulfilled, atomistic simulations might lead to absurd outcomes even if the model has excellent accuracy. Consequently, dedicated architectures, which achieve invariance by restricting their design space, have been developed. General-purpose point-cloud models are more varied but often disregard rotational symmetry. We propose a general symmetrization method that adds rotational equivariance to any given model while preserving all the other requirements. Our approach simplifies the development of better atomic-scale machine-learning schemes by relaxing the constraints on the design space and making it possible to incorporate ideas that proved effective in other domains. We demonstrate this idea by introducing the Point Edge Transformer (PET) architecture, which is not intrinsically equivariant but achieves state-of-the-art performance on several benchmark datasets of molecules and solids. A-posteriori application of our general protocol makes PET exactly equivariant, with minimal changes to its accuracy. △ Less

Submitted 6 February, 2024; v1 submitted 30 May, 2023; originally announced May 2023.

Comments: Enhancing figures; minor polishing

arXiv:2303.04124 [pdf, other]

Wigner kernels: body-ordered equivariant machine learning without a basis

Authors: Filippo Bigi, Sergey N. Pozdnyakov, Michele Ceriotti

Abstract: Machine-learning models based on a point-cloud representation of a physical object are ubiquitous in scientific applications and particularly well-suited to the atomic-scale description of molecules and materials. Among the many different approaches that have been pursued, the description of local atomic environments in terms of their neighbor densities has been used widely and very succesfully. W… ▽ More Machine-learning models based on a point-cloud representation of a physical object are ubiquitous in scientific applications and particularly well-suited to the atomic-scale description of molecules and materials. Among the many different approaches that have been pursued, the description of local atomic environments in terms of their neighbor densities has been used widely and very succesfully. We propose a novel density-based method which involves computing ``Wigner kernels''. These are fully equivariant and body-ordered kernels that can be computed iteratively with a cost that is independent of the radial-chemical basis and grows only linearly with the maximum body-order considered. This is in marked contrast to feature-space models, which comprise an exponentially-growing number of terms with increasing order of correlations. We present several examples of the accuracy of models based on Wigner kernels in chemical applications, for both scalar and tensorial targets, reaching state-of-the-art accuracy on the popular QM9 benchmark dataset, and we discuss the broader relevance of these ideas to equivariant geometric machine-learning. △ Less

Submitted 7 March, 2023; originally announced March 2023.

arXiv:2302.14770 [pdf, other]

doi 10.1063/5.0160740

Completeness of Atomic Structure Representations

Authors: Jigyasa Nigam, Sergey N. Pozdnyakov, Kevin K. Huguenin-Dumittan, Michele Ceriotti

Abstract: In this paper, we address the challenge of obtaining a comprehensive and symmetric representation of point particle groups, such as atoms in a molecule, which is crucial in physics and theoretical chemistry. The problem has become even more important with the widespread adoption of machine-learning techniques in science, as it underpins the capacity of models to accurately reproduce physical relat… ▽ More In this paper, we address the challenge of obtaining a comprehensive and symmetric representation of point particle groups, such as atoms in a molecule, which is crucial in physics and theoretical chemistry. The problem has become even more important with the widespread adoption of machine-learning techniques in science, as it underpins the capacity of models to accurately reproduce physical relationships while being consistent with fundamental symmetries and conservation laws. However, some of the descriptors that are commonly used to represent point clouds -- most notably those based on discretized correlations of the neighbor density, that underpin most of the existing ML models of matter at the atomic scale -- are unable to distinguish between special arrangements of particles in three dimensions. This makes it impossible to machine learn their properties. Atom-density correlations are provably complete in the limit in which they simultaneously describe the mutual relationship between all atoms, which is impractical. We present a novel approach to construct descriptors of \emph{finite} correlations based on the relative arrangement of particle triplets, which can be employed to create symmetry-adapted models with universal approximation capabilities, which have the resolution of the neighbor discretization as the sole convergence parameter. Our strategy is demonstrated on a class of atomic arrangements that are specifically built to defy a broad class of conventional symmetric descriptors, showcasing its potential for addressing their limitations. △ Less

Submitted 30 December, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

Journal ref: APL Mach. Learn. 2, 016110 (2024)

arXiv:2302.08381 [pdf, other]

Fast evaluation of spherical harmonics with sphericart

Authors: Filippo Bigi, Guillaume Fraux, Nicholas J. Browning, Michele Ceriotti

Abstract: Spherical harmonics provide a smooth, orthogonal, and symmetry-adapted basis to expand functions on a sphere, and they are used routinely in physical and theoretical chemistry as well as in different fields of science and technology, from geology and atmospheric sciences to signal processing and computer graphics. More recently, they have become a key component of rotationally equivariant models i… ▽ More Spherical harmonics provide a smooth, orthogonal, and symmetry-adapted basis to expand functions on a sphere, and they are used routinely in physical and theoretical chemistry as well as in different fields of science and technology, from geology and atmospheric sciences to signal processing and computer graphics. More recently, they have become a key component of rotationally equivariant models in geometric machine learning, including applications to atomic-scale modeling of molecules and materials. We present an elegant and efficient algorithm for the evaluation of the real-valued spherical harmonics. Our construction features many of the desirable properties of existing schemes and allows to compute Cartesian derivatives in a numerically stable and computationally efficient manner. To facilitate usage, we implement this algorithm in sphericart, a fast C++ library which also provides C bindings, a Python API, and a PyTorch implementation that includes a GPU kernel. △ Less

Submitted 30 April, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

arXiv:2301.01297 [pdf]

Solar Sail Propulsion by 2050: An Enabling Capability for Heliophysics Missions

Authors: Les Johnson, Nathan Barnes, Matteo Ceriotti, Thomas Y. Chen, Artur Davoyan, Louis Friedman, Darren Garber, Roman Kezerashvili, Ken Kobayashi, Greg Matloff, Colin McInnes, Pat Mulligan, Grover Swartzlander, Slava G. Turyshev

Abstract: Solar sails enable missions to observe the solar environment from unique vantage points, such as sustained observations away from the Sun-Earth line; sub-L1 station kee**; high inclination solar orbits; Earth polar-sitting and polar-viewing observatories; fast transit missions to study heliosphere to interstellar medium transition, as well as missions of interest across a broad user community. R… ▽ More Solar sails enable missions to observe the solar environment from unique vantage points, such as sustained observations away from the Sun-Earth line; sub-L1 station kee**; high inclination solar orbits; Earth polar-sitting and polar-viewing observatories; fast transit missions to study heliosphere to interstellar medium transition, as well as missions of interest across a broad user community. Recent and planned demonstration missions make this technology ready for use on near-term science missions. △ Less

Submitted 2 January, 2023; originally announced January 2023.

Comments: Heliophysics 2050 White Paper

arXiv:2212.13254 [pdf, other]

Modeling high-entropy transition-metal alloys with alchemical compression

Authors: Nataliya Lopanitsyna, Guillaume Fraux, Maximilian A. Springer, Sandip De, Michele Ceriotti

Abstract: Alloys composed of several elements in roughly equimolar composition, often referred to as high-entropy alloys, have long been of interest for their thermodynamics and peculiar mechanical properties, and more recently for their potential application in catalysis. They are a considerable challenge to traditional atomistic modeling, and also to data-driven potentials that for the most part have memo… ▽ More Alloys composed of several elements in roughly equimolar composition, often referred to as high-entropy alloys, have long been of interest for their thermodynamics and peculiar mechanical properties, and more recently for their potential application in catalysis. They are a considerable challenge to traditional atomistic modeling, and also to data-driven potentials that for the most part have memory footprint, computational effort and data requirements which scale poorly with the number of elements included. We apply a recently proposed scheme to compress chemical information in a lower-dimensional space, which reduces dramatically the cost of the model with negligible loss of accuracy, to build a potential that can describe 25 d-block transition metals. The model shows semi-quantitative accuracy for prototypical alloys, and is remarkably stable when extrapolating to structures outside its training set. We use this framework to study element segregation in a computational experiment that simulates an equimolar alloy of all 25 elements, mimicking the seminal experiments by Cantor et al., and use our observations on the short-range order relations between the elements to define a data-driven set of Hume-Rothery rules that can serve as guidance for alloy design. We conclude with a study of three prototypical alloys, CoCrFeMnNi, CoCrFeMoNi and IrPdPtRhRu, determining their stability and the short-range order behavior of their constituents. △ Less

Submitted 7 April, 2023; v1 submitted 26 December, 2022; originally announced December 2022.

arXiv:2209.10709 [pdf, other]

A data-driven interpretation of the stability of molecular crystals

Authors: Rose K. Cersonsky, Maria Pakhnova, Edgar A. Engel, Michele Ceriotti

Abstract: Due to the subtle balance of intermolecular interactions that govern structure-property relations, predicting the stability of crystal structures formed from molecular building blocks is a highly non-trivial scientific problem. A particularly active and fruitful approach involves classifying the different combinations of interacting chemical moieties, as understanding the relative energetics of di… ▽ More Due to the subtle balance of intermolecular interactions that govern structure-property relations, predicting the stability of crystal structures formed from molecular building blocks is a highly non-trivial scientific problem. A particularly active and fruitful approach involves classifying the different combinations of interacting chemical moieties, as understanding the relative energetics of different interactions enables the design of molecular crystals and fine-tuning their stabilities. While this is usually performed based on the empirical observation of the most commonly encountered motifs in known crystal structures, we propose to apply a combination of supervised and unsupervised machine-learning techniques to automate the construction of an extensive library of molecular building blocks. We introduce a structural descriptor tailored to the prediction of the binding (lattice) energy and apply it to a curated dataset of organic crystals and exploit its atom-centered nature to obtain a data-driven assessment of the contribution of different chemical groups to the lattice energy of the crystal. We then interpret this library using a low-dimensional representation of the structure-energy landscape and discuss selected examples of the insights into crystal engineering that can be extracted from this analysis, providing a complete database to guide the design of molecular materials. △ Less

Submitted 22 December, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

arXiv:2209.01948 [pdf, other]

doi 10.1063/5.0124363

A smooth basis for atomistic machine learning

Authors: Filippo Bigi, Kevin Huguenin-Dumittan, Michele Ceriotti, David E. Manolopoulos

Abstract: Machine learning frameworks based on correlations of interatomic positions begin with a discretized description of the density of other atoms in the neighbourhood of each atom in the system. Symmetry considerations support the use of spherical harmonics to expand the angular dependence of this density, but there is as yet no clear rationale to choose one radial basis over another. Here we investig… ▽ More Machine learning frameworks based on correlations of interatomic positions begin with a discretized description of the density of other atoms in the neighbourhood of each atom in the system. Symmetry considerations support the use of spherical harmonics to expand the angular dependence of this density, but there is as yet no clear rationale to choose one radial basis over another. Here we investigate the basis that results from the solution of the Laplacian eigenvalue problem within a sphere around the atom of interest. We show that this generates the smoothest possible basis of a given size within the sphere, and that a tensor product of Laplacian eigenstates also provides the smoothest possible basis for expanding any higher-order correlation of the atomic density within the appropriate hypersphere. We consider several unsupervised metrics of the quality of a basis for a given dataset, and show that the Laplacian eigenstate basis has a performance that is much better than some widely used basis sets and is competitive with data-driven bases that numerically optimize each metric. In supervised machine learning tests, we find that the optimal function smoothness of the Laplacian eigenstates leads to comparable or better performance than can be obtained from a data-driven basis of a similar size that has been optimized to describe the atom-density correlation for the specific dataset. We conclude that the smoothness of the basis functions is a key and hitherto largely overlooked aspect of successful atomic density representations. △ Less

Submitted 23 May, 2023; v1 submitted 5 September, 2022; originally announced September 2022.

arXiv:2208.06139 [pdf, other]

Beyond potentials: integrated machine-learning models for materials

Authors: Michele Ceriotti

Abstract: Over the past decade inter-atomic potentials based on machine-learning (ML) techniques have become an indispensable tool in the atomic-scale modeling of materials. Trained on energies and forces obtained from electronic-structure calculations, they inherit their predictive accuracy, and extend greatly the length and time scales that are accessible to explicit atomistic simulations. Inexpensive pre… ▽ More Over the past decade inter-atomic potentials based on machine-learning (ML) techniques have become an indispensable tool in the atomic-scale modeling of materials. Trained on energies and forces obtained from electronic-structure calculations, they inherit their predictive accuracy, and extend greatly the length and time scales that are accessible to explicit atomistic simulations. Inexpensive predictions of the energetics of individual configurations have facilitated greatly the calculation of the thermodynamics of materials, including finite-temperature effects and disorder. More recently, machine-learning models have been closing the gap with first-principles calculations in another area: the prediction of arbitrarily complicated functional properties, from vibrational and optical spectroscopies to electronic excitations. The implementation of integrated machine-learning models, that combine energetic and functional predictions with statistical and dynamical sampling of atomic-scale properties is bringing the promise of predictive, uncompromising simulations of existing and novel materials closer to its full realisation. △ Less

Submitted 12 August, 2022; originally announced August 2022.

arXiv:2206.14087 [pdf, other]

Electronic-structure properties from atom-centered predictions of the electron density

Authors: Andrea Grisafi, Alan M. Lewis, Mariana Rossi, Michele Ceriotti

Abstract: The electron density of a molecule or material has recently received major attention as a target quantity of machine-learning models. A natural choice to construct a model that yields transferable and linear-scaling predictions is to represent the scalar field using a multi-centered atomic basis analogous to that routinely used in density fitting approximations. However, the non-orthogonality of t… ▽ More The electron density of a molecule or material has recently received major attention as a target quantity of machine-learning models. A natural choice to construct a model that yields transferable and linear-scaling predictions is to represent the scalar field using a multi-centered atomic basis analogous to that routinely used in density fitting approximations. However, the non-orthogonality of the basis poses challenges for the learning exercise, as it requires accounting for all the atomic density components at once. We devise a gradient-based approach to directly minimize the loss function of the regression problem in an optimized and highly sparse feature space. In so doing, we overcome the limitations associated with adopting an atom-centered model to learn the electron density over arbitrarily complex datasets, obtaining extremely accurate predictions. The enhanced framework is tested on 32-molecule periodic cells of liquid water, presenting enough complexity to require an optimal balance between accuracy and computational efficiency. We show that starting from the predicted density a single Kohn-Sham diagonalization step can be performed to access total energy components that carry an error of just 0.1 meV/atom with respect to the reference density functional calculations. Finally, we test our method on the highly heterogeneous QM9 benchmark dataset, showing that a small fraction of the training data is enough to derive ground-state total energies within chemical accuracy. △ Less

Submitted 28 June, 2022; originally announced June 2022.

arXiv:2205.05591 [pdf, other]

doi 10.1103/PhysRevB.106.L121116

Predicting hot-electron free energies from ground-state data

Authors: Chiheb Ben Mahmoud, Federico Grasselli, Michele Ceriotti

Abstract: Machine-learning potentials are usually trained on the ground-state, Born-Oppenheimer energy surface, which depends exclusively on the atomic positions and not on the simulation temperature. This disregards the effect of thermally-excited electrons, that is important in metals, and essential to the description of warm dense matter. An accurate physical description of these effects requires that th… ▽ More Machine-learning potentials are usually trained on the ground-state, Born-Oppenheimer energy surface, which depends exclusively on the atomic positions and not on the simulation temperature. This disregards the effect of thermally-excited electrons, that is important in metals, and essential to the description of warm dense matter. An accurate physical description of these effects requires that the nuclei move on a temperature-dependent electronic free energy. We propose a method to obtain machine-learning predictions of this free energy at an arbitrary electron temperature using exclusively training data from ground-state calculations, avoiding the need to train temperature-dependent potentials, and benchmark it on metallic liquid hydrogen at the conditions of the core of gas giants and brown dwarfs. This work demonstrates the advantages of hybrid schemes that use physical consideration to combine machine-learning predictions, providing a blueprint for the development of similar approaches that extend the reach of atomistic modelling by removing the barrier between physics and data-driven methodologies. △ Less

Submitted 28 September, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

Journal ref: Phys. Rev. B 106, L121116 (2022)

arXiv:2205.01062 [pdf, other]

doi 10.1063/5.0088404

Comment on "Manifolds of quasi-constant SOAP and ACSF fingerprints and the resulting failure to machine learn four body interactions"

Authors: Sergey N. Pozdnyakov, Michael J. Willatt, Albert P. Bartók, Christoph Ortner, Gábor Csányi, Michele Ceriotti

Abstract: The "quasi-constant" SOAP and ACSF fingerprint manifolds recently discovered by Parsaeifard and Goedecker are a direct consequence of the presence of degenerate pairs of configurations, a known shortcoming of all low-body-order atom-density correlation representations of molecular structures. Contrary to the configurations that are rigorously singular -- that we demonstrate can only occur in finit… ▽ More The "quasi-constant" SOAP and ACSF fingerprint manifolds recently discovered by Parsaeifard and Goedecker are a direct consequence of the presence of degenerate pairs of configurations, a known shortcoming of all low-body-order atom-density correlation representations of molecular structures. Contrary to the configurations that are rigorously singular -- that we demonstrate can only occur in finite, discrete sets -- the continuous "quasi-constant" manifolds exhibit low, but non-zero, sensitivity to atomic displacements. Thus, it is possible to build interpolative machine-learning models of high-order interactions along the manifold, even though the numerical instabilities associated with proximity to the exact singularities affect the accuracy and transferability of such models, to an extent that depends on numerical details of the implementation. △ Less

Submitted 2 May, 2022; originally announced May 2022.

Comments: Comment on arXiv:2102.06915

arXiv:2202.01566 [pdf, other]

doi 10.1063/5.0087042

Unified theory of atom-centered representations and message-passing machine-learning schemes

Authors: Jigyasa Nigam, Sergey Pozdnyakov, Guillaume Fraux, Michele Ceriotti

Abstract: Data-driven schemes that associate molecular and crystal structures with their microscopic properties share the need for a concise, effective description of the arrangement of their atomic constituents. Many types of models rely on descriptions of atom-centered environments, that are associated with an atomic property or with an atomic contribution to an extensive macroscopic quantity. Frameworks… ▽ More Data-driven schemes that associate molecular and crystal structures with their microscopic properties share the need for a concise, effective description of the arrangement of their atomic constituents. Many types of models rely on descriptions of atom-centered environments, that are associated with an atomic property or with an atomic contribution to an extensive macroscopic quantity. Frameworks in this class can be understood in terms of atom-centered density correlations (ACDC), that are used as a basis for a body-ordered, symmetry-adapted expansion of the targets. Several other schemes, that gather information on the relationship between neighboring atoms using "message-passing" ideas, cannot be directly mapped to correlations centered around a single atom. We generalize the ACDC framework to include multi-centered information, generating representations that provide a complete linear basis to regress symmetric functions of atomic coordinates, and provides a coherent foundation to systematize our understanding of both atom-centered and message-passing, invariant and equivariant machine-learning schemes. △ Less

Submitted 1 April, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

arXiv:2201.07136 [pdf, other]

Incompleteness of graph neural networks for points clouds in three dimensions

Authors: Sergey N. Pozdnyakov, Michele Ceriotti

Abstract: Graph neural networks (GNN) are very popular methods in machine learning and have been applied very successfully to the prediction of the properties of molecules and materials. First-order GNNs are well known to be incomplete, i.e., there exist graphs that are distinct but appear identical when seen through the lens of the GNN. More complicated schemes have thus been designed to increase their res… ▽ More Graph neural networks (GNN) are very popular methods in machine learning and have been applied very successfully to the prediction of the properties of molecules and materials. First-order GNNs are well known to be incomplete, i.e., there exist graphs that are distinct but appear identical when seen through the lens of the GNN. More complicated schemes have thus been designed to increase their resolving power. Applications to molecules (and more generally, point clouds), however, add a geometric dimension to the problem. The most straightforward and prevalent approach to construct graph representation for molecules regards atoms as vertices in a graph and draws a bond between each pair of atoms within a chosen cutoff. Bonds can be decorated with the distance between atoms, and the resulting "distance graph NNs" (dGNN) have empirically demonstrated excellent resolving power and are widely used in chemical ML, with all known indistinguishable configurations being resolved in the fully-connected limit, which is equivalent to infinite or sufficiently large cutoff. Here we present a counterexample that proves that dGNNs are not complete even for the restricted case of fully-connected graphs induced by 3D atom clouds. We construct pairs of distinct point clouds whose associated graphs are, for any cutoff radius, equivalent based on a first-order Weisfeiler-Lehman test. This class of degenerate structures includes chemically-plausible configurations, both for isolated structures and for infinite structures that are periodic in 1, 2, and 3 dimensions. The existence of indistinguishable configurations sets an ultimate limit to the expressive power of some of the well-established GNN architectures for atomistic machine learning. Models that explicitly use angular or directional information in the description of atomic environments can resolve this class of degeneracies. △ Less

Submitted 7 November, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

arXiv:2111.05129 [pdf, other]

doi 10.1038/s41524-022-00845-0

Thermodynamics and dielectric response of $\text{BaTiO}_3$ by data-driven modeling

Authors: Lorenzo Gigli, Max Veit, Michele Kotiuga, Giovanni Pizzi, Nicola Marzari, Michele Ceriotti

Abstract: Modeling ferroelectric materials from first principles is one of the successes of density-functional theory, and the driver of much development effort, requiring an accurate description of the electronic processes and the thermodynamic equilibrium that drive the spontaneous symmetry breaking and the emergence of macroscopic polarization. We demonstrate the development and application of an integra… ▽ More Modeling ferroelectric materials from first principles is one of the successes of density-functional theory, and the driver of much development effort, requiring an accurate description of the electronic processes and the thermodynamic equilibrium that drive the spontaneous symmetry breaking and the emergence of macroscopic polarization. We demonstrate the development and application of an integrated machine learning model that describes on the same footing structural, energetic and functional properties of barium titanate ($\text{BaTiO}_3$), a prototypical ferroelectric. The model uses ab initio calculations as reference and achieves accurate yet inexpensive predictions of energy and polarization on time and length scales that are not accessible to direct ab initio modeling. These predictions allow us to assess the microscopic mechanism of the ferroelectric transition. The presence of an order-disorder transition for the Ti off-centered states is the main driver of the ferroelectric transition, even though the coupling between symmetry breaking and cell distortions determines the presence of intermediate, partly-ordered phases. Moreover, we thoroughly probe the static and dynamical behavior of $\text{BaTiO}_3$ across its phase diagram, without the need to introduce a coarse-grained description of the ferroelectric transition. Finally, we apply the polarization model to calculate dielectric response properties of the material in a fully ab initio manner, again reproducing the correct qualitative experimental behaviour. △ Less

Submitted 12 September, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

Comments: 26 pages, 14 figures

Journal ref: npj Computational Materials 8, 209 (2022)

arXiv:2110.13764 [pdf, other]

Ranking the Synthesizability of Hypothetical Zeolites with the Sorting Hat

Authors: Benjamin A. Helfrecht, Giovanni Pireddu, Rocio Semino, Scott M. Auerbach, Michele Ceriotti

Abstract: Zeolites are nanoporous alumino-silicate frameworks widely used as catalysts and adsorbents. Even though millions of distinct siliceous networks can be generated by computer-aided searches, no new hypothetical framework has yet been synthesized. The needle-in-a-haystack problem of finding promising candidates among large databases of predicted structures has intrigued materials scientists for deca… ▽ More Zeolites are nanoporous alumino-silicate frameworks widely used as catalysts and adsorbents. Even though millions of distinct siliceous networks can be generated by computer-aided searches, no new hypothetical framework has yet been synthesized. The needle-in-a-haystack problem of finding promising candidates among large databases of predicted structures has intrigued materials scientists for decades; most work to date on the zeolite problem has been limited to intuitive structural descriptors. Here, we tackle this problem through a rigorous data science scheme-the "zeolite sorting hat"-that exploits interatomic correlations to produce a 95% real versus theoretical zeolites classification accuracy. The hypothetical frameworks that are grouped together with known zeolites are promising candidates for synthesis, that can be further ranked by estimating their thermodynamic stability. A critical analysis of the classifier reveals the decisive structural features. Further partitioning into compositional classes provides guidance in the design of synthetic strategies. △ Less

Submitted 26 October, 2021; originally announced October 2021.

arXiv:2109.12083 [pdf, other]

doi 10.1063/5.0072784

Equivariant representations for molecular Hamiltonians and N-center atomic-scale properties

Authors: Jigyasa Nigam, Michael Willatt, Michele Ceriotti

Abstract: Symmetry considerations are at the core of the major frameworks used to provide an effective mathematical representation of atomic configurations that is then used in machine-learning models to predict the properties associated with each structure. In most cases, the models rely on a description of atom-centered environments, and are suitable to learn atomic properties, or global observables that… ▽ More Symmetry considerations are at the core of the major frameworks used to provide an effective mathematical representation of atomic configurations that is then used in machine-learning models to predict the properties associated with each structure. In most cases, the models rely on a description of atom-centered environments, and are suitable to learn atomic properties, or global observables that can be decomposed into atomic contributions. Many quantities that are relevant for quantum mechanical calculations, however -- most notably the single-particle Hamiltonian matrix when written in an atomic-orbital basis -- are not associated with a single center, but with two (or more) atoms in the structure. We discuss a family of structural descriptors that generalize the very successful atom-centered density correlation features to the N-centers case, and show in particular how this construction can be applied to efficiently learn the matrix elements of the (effective) single-particle Hamiltonian written in an atom-centered orbital basis. These N-centers features are fully equivariant -- not only in terms of translations and rotations, but also in terms of permutations of the indices associated with the atoms -- and are suitable to construct symmetry-adapted machine-learning models of new classes of properties of molecules and materials. △ Less

Submitted 20 December, 2021; v1 submitted 24 September, 2021; originally announced September 2021.

arXiv:2109.11440 [pdf, other]

Local invertibility and sensitivity of atomic structure-feature map**s

Authors: Sergey N. Pozdnyakov, Liwei Zhang, Christoph Ortner, Gábor Csányi, Michele Ceriotti

Abstract: The increasingly common applications of machine-learning schemes to atomic-scale simulations have triggered efforts to better understand the mathematical properties of the map** between the Cartesian coordinates of the atoms and the variety of representations that can be used to convert them into a finite set of symmetric descriptors or features. Here, we analyze the sensitivity of the map** t… ▽ More The increasingly common applications of machine-learning schemes to atomic-scale simulations have triggered efforts to better understand the mathematical properties of the map** between the Cartesian coordinates of the atoms and the variety of representations that can be used to convert them into a finite set of symmetric descriptors or features. Here, we analyze the sensitivity of the map** to atomic displacements, showing that the combination of symmetry and smoothness leads to map**s that have singular points at which the Jacobian has one or more null singular values (besides those corresponding to infinitesimal translations and rotations). This is in fact desirable, because it enforces physical symmetry constraints on the values predicted by regression models constructed using such representations. However, besides these symmetry-induced singularities, there are also spurious singular points, that we find to be linked to the incompleteness of the map**, i.e. the fact that, for certain classes of representations, structurally distinct configurations are not guaranteed to be mapped onto different feature vectors. Additional singularities can be introduced by a too aggressive truncation of the infinite basis set that is used to discretize the representations. △ Less

Submitted 23 September, 2021; originally announced September 2021.

arXiv:2106.14171 [pdf, other]

doi 10.1021/acs.jpclett.1c01987

The importance of nuclear quantum effects for NMR crystallography

Authors: Edgar A. Engel, Venkat Kapil, Michele Ceriotti

Abstract: The resolving power of solid-state nuclear magnetic resonance (NMR) crystallography depends heavily on the accuracy of computational predictions of NMR chemical shieldings of candidate structures, which are usually taken to be local minima in the potential energy. To test the limits of this approximation, we systematically study the importance of finite-temperature and quantum nuclear fluctuations… ▽ More The resolving power of solid-state nuclear magnetic resonance (NMR) crystallography depends heavily on the accuracy of computational predictions of NMR chemical shieldings of candidate structures, which are usually taken to be local minima in the potential energy. To test the limits of this approximation, we systematically study the importance of finite-temperature and quantum nuclear fluctuations for $^1$H, $^{13}$C, and $^{15}$N shieldings in polymorphs of three paradigmatic molecular crystals -- benzene, glycine, and succinic acid. The effect of quantum fluctuations is comparable to the typical errors of shielding predictions for static nuclei with respect to experiments, and their inclusion to improve the agreement with measurements, translating to more reliable assignment of the NMR spectra to the correct candidate structure. The use of integrated machine-learning models, trained on first-principles energies and shieldings, renders rigorous sampling of nuclear fluctuations affordable, setting a new standard for the calculations underlying NMR structure determinations. △ Less

Submitted 9 January, 2022; v1 submitted 27 June, 2021; originally announced June 2021.

Journal ref: J. Phys. Chem. Lett. 12(32), 7701-7707 (2021)

arXiv:2106.05364 [pdf, other]

doi 10.1021/acs.jctc.1c00576

Learning electron densities in the condensed phase

Authors: Alan M. Lewis, Andrea Grisafi, Michele Ceriotti, Mariana Rossi

Abstract: We introduce a local machine-learning method for predicting the electron densities of periodic systems. The framework is based on a numerical, atom-centred auxiliary basis, which enables an accurate expansion of the all-electron density in a form suitable for learning isolated and periodic systems alike. We show that using this formulation the electron densities of metals, semiconductors and molec… ▽ More We introduce a local machine-learning method for predicting the electron densities of periodic systems. The framework is based on a numerical, atom-centred auxiliary basis, which enables an accurate expansion of the all-electron density in a form suitable for learning isolated and periodic systems alike. We show that using this formulation the electron densities of metals, semiconductors and molecular crystals can all be accurately predicted using symmetry-adapted Gaussian process regression models, properly adjusted for the non-orthogonal nature of the basis. These predicted densities enable the efficient calculation of electronic properties which present errors on the order of tens of meV/atom when compared to ab initio density-functional calculations. We demonstrate the key power of this approach by using a model trained on ice unit cells containing only 4 water molecules to predict the electron densities of cells containing up to 512 molecules, and see no increase in the magnitude of the errors of derived electronic properties when increasing the system size. Indeed, we find that these extrapolated derived energies are more accurate than those predicted using a direct machine-learning model. Finally, on heterogeneous datasets SALTED can predict electron densities with errors below 4%. △ Less

Submitted 9 November, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

Comments: 13 pages, 7 figures; accepted version, SI added

Journal ref: Journal of Chemical Theory and Computation 2021, 17, 11, 7203-7214

arXiv:2105.08717 [pdf, other]

doi 10.1063/5.0057229

Optimal radial basis for density-based atomic representations

Authors: Alexander Goscinski, Félix Musil, Sergey Pozdnyakov, Michele Ceriotti

Abstract: The input of almost every machine learning algorithm targeting the properties of matter at the atomic scale involves a transformation of the list of Cartesian atomic coordinates into a more symmetric representation. Many of the most popular representations can be seen as an expansion of the symmetrized correlations of the atom density, and differ mainly by the choice of basis. Considerable effort… ▽ More The input of almost every machine learning algorithm targeting the properties of matter at the atomic scale involves a transformation of the list of Cartesian atomic coordinates into a more symmetric representation. Many of the most popular representations can be seen as an expansion of the symmetrized correlations of the atom density, and differ mainly by the choice of basis. Considerable effort has been dedicated to the optimization of the basis set, typically driven by heuristic considerations on the behavior of the regression target. Here we take a different, unsupervised viewpoint, aiming to determine the basis that encodes in the most compact way possible the structural information that is relevant for the dataset at hand. For each training dataset and number of basis functions, one can determine a unique basis that is optimal in this sense, and can be computed at no additional cost with respect to the primitive basis by approximating it with splines. We demonstrate that this construction yields representations that are accurate and computationally efficient, particularly when constructing representations that correspond to high-body order correlations. We present examples that involve both molecular and condensed-phase machine-learning models. △ Less

Submitted 10 January, 2022; v1 submitted 18 May, 2021; originally announced May 2021.

Journal ref: The Journal of Chemical Physics 155(10), 104106 (2021)

arXiv:2104.11065 [pdf, other]

doi 10.1103/PhysRevMaterials.5.L070801

Quantum vibronic effects on the electronic properties of solid and molecular carbon

Authors: Arpan Kundu, Marco Govoni, Han Yang, Michele Ceriotti, Francois Gygi, Giulia Galli

Abstract: We study the effect of quantum vibronic coupling on the electronic properties of carbon allotropes, including molecules and solids, by combining path integral first principles molecular dynamics (FPMD) with a colored noise thermostat. In addition to avoiding several approximations commonly adopted in calculations of electron-phonon coupling, our approach only adds a moderate computational cost to… ▽ More We study the effect of quantum vibronic coupling on the electronic properties of carbon allotropes, including molecules and solids, by combining path integral first principles molecular dynamics (FPMD) with a colored noise thermostat. In addition to avoiding several approximations commonly adopted in calculations of electron-phonon coupling, our approach only adds a moderate computational cost to FPMD simulations and hence it is applicable to large supercells, such as those required to describe amorphous solids. We predict the effect of electron-phonon coupling on the fundamental gap of amorphous carbon, and we show that in diamond the zero-phonon renormalization of the band gap is larger than previously reported. △ Less

Submitted 22 April, 2021; originally announced April 2021.

Comments: 24 pages, 13 figures

Journal ref: Phys. Rev. Materials 5, 070801 (2021)

arXiv:2103.09041 [pdf, other]

doi 10.1103/PhysRevMaterials.5.063804

Modeling the Ga/As binary system across temperaturesand compositions from first principles

Authors: Giulio imbalzano, Michele Ceriotti

Abstract: Materials composed of elements from the third and fifth columns of the periodic table display a very rich behavior, with the phase diagram usually containing a metallic liquid phase and a polar semiconducting solid. As a consequence, it is very hard to achieve transferable empirical models of interactions between the atoms that can reliably predict their behavior across the temperature and composi… ▽ More Materials composed of elements from the third and fifth columns of the periodic table display a very rich behavior, with the phase diagram usually containing a metallic liquid phase and a polar semiconducting solid. As a consequence, it is very hard to achieve transferable empirical models of interactions between the atoms that can reliably predict their behavior across the temperature and composition range that is relevant to the study of the synthesis and properties of III/V nanostructures and devices. We present a machine-learning potential trained on density functional theory reference data that provides a general-purpose model for the Ga$_x$As$_{1-x}$ system. We provide a series of stringent tests that showcase the accuracy of the potential, and its applicability across the whole binary phase space, computing with ab initio accuracy a large number of finite-temperature properties as well as the location of phase boundaries. We also show how a committe model can be used to reliably determine the uncertainty induced by the limitations of the ML model on its predictions, to identify regions of phase space that are predicted with insufficient accuracy, and to iteratively refine the training set to achieve consistent, reliable modeling. △ Less

Submitted 12 January, 2022; v1 submitted 16 March, 2021; originally announced March 2021.

Journal ref: G.Imbalzano, M. Ceriotti, Phys. Rev. Materials 5, 063804 (2021)

arXiv:2101.08814 [pdf, other]

doi 10.1063/5.0044689

Efficient implementation of atom-density representations

Authors: Félix Musil, Max Veit, Alexander Goscinski, Guillaume Fraux, Michael J. Willatt, Markus Stricker, Till Junge, Michele Ceriotti

Abstract: Physically-motivated and mathematically robust atom-centred representations of molecular structures are key to the success of modern atomistic machine learning (ML) methods. They lie at the foundation of a wide range of methods to predict the properties of both materials and molecules as well as to explore and visualize the chemical compound and configuration space. Recently, it has become clear t… ▽ More Physically-motivated and mathematically robust atom-centred representations of molecular structures are key to the success of modern atomistic machine learning (ML) methods. They lie at the foundation of a wide range of methods to predict the properties of both materials and molecules as well as to explore and visualize the chemical compound and configuration space. Recently, it has become clear that many of the most effective representations share a fundamental formal connection: that they can all be expressed as a discretization of N-body correlation functions of the local atom density, suggesting the opportunity of standardizing and, more importantly, optimizing the calculation of such representations. We present an implementation, named librascal, whose modular design lends itself both to develo** refinements to the density-based formalism and to rapid prototy** for new developments of rotationally equivariant atomistic representations. As an example, we discuss SOAP features, perhaps the most widely used member of this family of representations, to show how the expansion of the local density can be optimized for any choice of radial basis set. We discuss the representation in the context of a kernel ridge regression model, commonly used with SOAP features, and analyze how the computational effort scales for each of the individual steps of the calculation. By applying data reduction techniques in feature space, we show how to further reduce the total computational cost by at up to a factor of 4 or 5 without affecting the model's symmetry properties and without significantly impacting its accuracy. △ Less

Submitted 21 January, 2021; originally announced January 2021.

arXiv:2101.04673 [pdf, other]

doi 10.1021/acs.chemrev.1c00021

Physics-inspired structural representations for molecules and materials

Authors: Felix Musil, Andrea Grisafi, Albert P. Bartók, Christoph Ortner, Gábor Csányi, Michele Ceriotti

Abstract: The first step in the construction of a regression model or a data-driven analysis, aiming to predict or elucidate the relationship between the atomic scale structure of matter and its properties, involves transforming the Cartesian coordinates of the atoms into a suitable representation. The development of atomic-scale representations has played, and continues to play, a central role in the succe… ▽ More The first step in the construction of a regression model or a data-driven analysis, aiming to predict or elucidate the relationship between the atomic scale structure of matter and its properties, involves transforming the Cartesian coordinates of the atoms into a suitable representation. The development of atomic-scale representations has played, and continues to play, a central role in the success of machine-learning methods for chemistry and materials science. This review summarizes the current understanding of the nature and characteristics of the most commonly used structural and chemical descriptions of atomistic structures, highlighting the deep underlying connections between different frameworks, and the ideas that lead to computationally efficient and universally applicable models. It emphasizes the link between properties, structures, their physical chemistry and their mathematical description, provides examples of recent applications to a diverse set of chemical and materials science problems, and outlines the open questions and the most promising research directions in the field. △ Less

Submitted 4 August, 2021; v1 submitted 12 January, 2021; originally announced January 2021.

arXiv:2012.12253 [pdf, other]

Improving Sample and Feature Selection with Principal Covariates Regression

Authors: Rose K. Cersonsky, Benjamin A. Helfrecht, Edgar A. Engel, Michele Ceriotti

Abstract: Selecting the most relevant features and samples out of a large set of candidates is a task that occurs very often in the context of automated data analysis, where it can be used to improve the computational performance, and also often the transferability, of a model. Here we focus on two popular sub-selection schemes which have been applied to this end: CUR decomposition, that is based on a low-r… ▽ More Selecting the most relevant features and samples out of a large set of candidates is a task that occurs very often in the context of automated data analysis, where it can be used to improve the computational performance, and also often the transferability, of a model. Here we focus on two popular sub-selection schemes which have been applied to this end: CUR decomposition, that is based on a low-rank approximation of the feature matrix and Farthest Point Sampling, that relies on the iterative identification of the most diverse samples and discriminating features. We modify these unsupervised approaches, incorporating a supervised component following the same spirit as the Principal Covariates Regression (PCovR) method. We show that incorporating target information provides selections that perform better in supervised tasks, which we demonstrate with ridge regression, kernel ridge regression, and sparse kernel regression. We also show that incorporating aspects of simple supervised learning models can improve the accuracy of more complex models, such as feed-forward neural networks. We present adjustments to minimize the impact that any subselection may incur when performing unsupervised tasks. We demonstrate the significant improvements associated with the use of PCov-CUR and PCov-FPS selections for applications to chemistry and materials science, typically reducing by a factor of two the number of features and samples which are required to achieve a given level of regression accuracy. △ Less

Submitted 22 December, 2020; originally announced December 2020.

arXiv:2012.04616 [pdf, other]

doi 10.2533/chimia.2019.972

Machine learning at the atomic-scale

Authors: Félix Musil, Michele Ceriotti

Abstract: Statistical learning algorithms are finding more and more applications in science and technology. Atomic-scale modeling is no exception, with machine learning becoming commonplace as a tool to predict energy, forces and properties of molecules and condensed-phase systems. This short review summarizes recent progress in the field, focusing in particular on the problem of representing an atomic conf… ▽ More Statistical learning algorithms are finding more and more applications in science and technology. Atomic-scale modeling is no exception, with machine learning becoming commonplace as a tool to predict energy, forces and properties of molecules and condensed-phase systems. This short review summarizes recent progress in the field, focusing in particular on the problem of representing an atomic configuration in a mathematically robust and computationally efficient way. We also discuss some of the regression algorithms that have been used to construct surrogate models of atomic-scale properties. We then show examples of how the optimization of the machine-learning models can both incorporate and reveal insights onto the physical phenomena that underlie structure-property relations. △ Less

Submitted 8 December, 2020; originally announced December 2020.

Journal ref: Chimia (Aarau). 73, 972-982 (2019)

arXiv:2011.08828 [pdf, other]

doi 10.1063/5.0036522

Uncertainty estimation for molecular dynamics and sampling

Authors: Giulio Imbalzano, Yongbin Zhuang, Venkat Kapil, Kevin Rossi, Edgar A. Engel, Federico Grasselli, Michele Ceriotti

Abstract: Machine learning models have emerged as a very effective strategy to sidestep time-consuming electronic-structure calculations, enabling accurate simulations of greater size, time scale and complexity. Given the interpolative nature of these models, the reliability of predictions depends on the position in phase space, and it is crucial to obtain an estimate of the error that derives from the fini… ▽ More Machine learning models have emerged as a very effective strategy to sidestep time-consuming electronic-structure calculations, enabling accurate simulations of greater size, time scale and complexity. Given the interpolative nature of these models, the reliability of predictions depends on the position in phase space, and it is crucial to obtain an estimate of the error that derives from the finite number of reference structures included during the training of the model. When using a machine-learning potential to sample a finite-temperature ensemble, the uncertainty on individual configurations translates into an error on thermodynamic averages, and provides an indication for the loss of accuracy when the simulation enters a previously unexplored region. Here we discuss how uncertainty quantification can be used, together with a baseline energy model, or a more robust although less accurate interatomic potential, to obtain more resilient simulations and to support active-learning strategies. Furthermore, we introduce an on-the-fly reweighing scheme that makes it possible to estimate the uncertainty in the thermodynamic averages extracted from long trajectories. We present examples covering different types of structural and thermodynamic properties, and systems as diverse as water and liquid gallium. △ Less

Submitted 14 January, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

Comments: 17 pages, 9 figures

arXiv:2011.07987 [pdf, other]

doi 10.1021/acs.jctc.0c01177

Global free energy landscapes as a smoothly joined collection of local maps

Authors: F. Giberti, G. A. Tribello, M. Ceriotti

Abstract: Enhanced sampling techniques have become an essential tool in computational chemistry and physics, where they are applied to sample activated processes that occur on a time scale that is inaccessible to conventional simulations. Despite their popularity, it is well known that they have constraints that hinder their applications to complex problems. The core issue lies in the need to describe the s… ▽ More Enhanced sampling techniques have become an essential tool in computational chemistry and physics, where they are applied to sample activated processes that occur on a time scale that is inaccessible to conventional simulations. Despite their popularity, it is well known that they have constraints that hinder their applications to complex problems. The core issue lies in the need to describe the system using a small number of collective variables (CVs). Any slow degree of freedom that is not properly described by the chosen CVs will hinder sampling efficiency. However, exploration of configuration space is also hampered by including variables that are not relevant to describe the activated process under study. This paper presents the Adaptive Topography of Landscape for Accelerated Sampling (ATLAS), a new biasing method capable of working with many CVs. The root idea of ATLAS is to apply a divide-and-conquer strategy where the high-dimensional CVs space is divided into basins, each of which is described by an automatically-determined, low-dimensional set of variables. A well-tempered metadynamics-like bias is constructed as a function of these local variables. Indicator functions associated with the basins switch on and off the local biases, so that the sampling is performed on a collection of low-dimensional CV spaces, that are smoothly combined to generate an effectively high-dimensional bias. The unbiased Boltzmann distribution is recovered through reweighing, making the evaluation of conformational and thermodynamic properties straightforward. The decomposition of the free-energy landscape in local basins can be updated iteratively as the simulation discovers new (meta)stable states. △ Less

Submitted 12 January, 2022; v1 submitted 16 November, 2020; originally announced November 2020.

Journal ref: F. Giberti, G. A. Tribello, and M. Ceriotti, J. Chem. Theory Comput. 17(6), 3292-3308 (2021)

arXiv:2011.03874 [pdf, other]

doi 10.1103/PhysRevMaterials.5.043802

Finite-temperature materials modeling from the quantum nuclei to the hot electrons regime

Authors: Nataliya Lopanitsyna, Chiheb Ben Mahmoud, Michele Ceriotti

Abstract: Atomistic simulations provide insights into structure-property relations on an atomic size and length scale, that are complementary to the macroscopic observables that can be obtained from experiments. Quantitative predictions, however, are usually hindered by the need to strike a balance between the accuracy of the calculation of the interatomic potential and the modelling of realistic thermodyna… ▽ More Atomistic simulations provide insights into structure-property relations on an atomic size and length scale, that are complementary to the macroscopic observables that can be obtained from experiments. Quantitative predictions, however, are usually hindered by the need to strike a balance between the accuracy of the calculation of the interatomic potential and the modelling of realistic thermodynamic conditions. Machine-learning techniques make it possible to efficiently approximate the outcome of accurate electronic-structure calculations, that can therefore be combined with extensive thermodynamic sampling. We take elemental nickel as a prototypical material, whose alloys have applications from cryogenic temperatures up to close to their melting point, and use it to demonstrate how a combination of machine-learning models of electronic properties and statistical sampling methods makes it possible to compute accurate finite-temperature properties at an affordable cost. We demonstrate the calculation of a broad array of bulk, interfacial and defect properties over a temperature range from 100 to 2500 K, modeling also, when needed, the impact of nuclear quantum fluctuations and electronic entropy. The framework we demonstrate here can be easily generalized to more complex alloys and different classes of materials. △ Less

Submitted 7 November, 2020; originally announced November 2020.

Journal ref: Phys. Rev. Materials 5, 043802 (2021)

arXiv:2009.02741 [pdf, other]

The role of feature space in atomistic learning

Authors: Alexander Goscinski, Guillaume Fraux, Giulio Imbalzano, Michele Ceriotti

Abstract: Eficient, physically-inspired descriptors of the structure and composition of molecules and materials play a key role in the application of machine-learning techniques to atomistic simulations. The proliferation of approaches, as well as the fact that each choice of features can lead to very different behavior depending on how they are used, e.g. by introducing non-linear kernels and non-Euclidean… ▽ More Eficient, physically-inspired descriptors of the structure and composition of molecules and materials play a key role in the application of machine-learning techniques to atomistic simulations. The proliferation of approaches, as well as the fact that each choice of features can lead to very different behavior depending on how they are used, e.g. by introducing non-linear kernels and non-Euclidean metrics to manipulate them, makes it difficult to objectively compare different methods, and to address fundamental questions on how one feature space is related to another. In this work we introduce a framework to compare different sets of descriptors, and different ways of transforming them by means of metrics and kernels, in terms of the structure of the feature space that they induce. We define diagnostic tools to determine whether alternative feature spaces contain equivalent amounts of information, and whether the common information is substantially distorted when going from one feature space to another. We compare, in particular, representations that are built in terms of n-body correlations of the atom density, quantitatively assessing the information loss associated with the use of low-order features. We also investigate the impact of different choices of basis functions and hyperparameters of the widely used SOAP and Behler-Parrinello features, and investigate how the use of non-linear kernels, and of a Wasserstein-type metric, change the structure of the feature space in comparison to a simpler linear feature space. △ Less

Submitted 10 December, 2020; v1 submitted 6 September, 2020; originally announced September 2020.

arXiv:2008.12122 [pdf, other]

Multi-scale approach for the prediction of atomic scale properties

Authors: Andrea Grisafi, Jigyasa Nigam, Michele Ceriotti

Abstract: Electronic nearsightedness is one of the fundamental principles governing the behavior of condensed matter and supporting its description in terms of local entities such as chemical bonds. Locality also underlies the tremendous success of machine-learning schemes that predict quantum mechanical observables -- such as the cohesive energy, the electron density, or a variety of response properties --… ▽ More Electronic nearsightedness is one of the fundamental principles governing the behavior of condensed matter and supporting its description in terms of local entities such as chemical bonds. Locality also underlies the tremendous success of machine-learning schemes that predict quantum mechanical observables -- such as the cohesive energy, the electron density, or a variety of response properties -- as a sum of atom-centred contributions, based on a short-range representation of atomic environments. One of the main shortcomings of these approaches is their inability to capture physical effects, ranging from electrostatic interactions to quantum delocalization, which have a long-range nature. Here we show how to build a multi-scale scheme that combines in the same framework local and non-local information, overcoming such limitations. We show that the simplest version of such features can be put in formal correspondence with a multipole expansion of permanent electrostatics. The data-driven nature of the model construction, however, makes this simple form suitable to tackle also different types of delocalized and collective effects. We present several examples that range from molecular physics, to surface science and biophysics, demonstrating the ability of this multi-scale approach to model interactions driven by electrostatics, polarization and dispersion, as well as the cooperative behavior of dielectric response functions. △ Less

Submitted 31 August, 2020; v1 submitted 27 August, 2020; originally announced August 2020.

arXiv:2007.03407 [pdf, other]

doi 10.1063/5.0021116

Recursive evaluation and iterative contraction of $N$-body equivariant features

Authors: Jigyasa Nigam, Sergey Pozdnyakov, Michele Ceriotti

Abstract: Map** an atomistic configuration to an $N$-point correlation of a field associated with the atomic positions (e.g. an atomic density) has emerged as an elegant and effective solution to represent structures as the input of machine-learning algorithms. While it has become clear that low-order density correlations do not provide a complete representation of an atomic environment, the exponential i… ▽ More Map** an atomistic configuration to an $N$-point correlation of a field associated with the atomic positions (e.g. an atomic density) has emerged as an elegant and effective solution to represent structures as the input of machine-learning algorithms. While it has become clear that low-order density correlations do not provide a complete representation of an atomic environment, the exponential increase in the number of possible $N$-body invariants makes it difficult to design a concise and effective representation. We discuss how to exploit recursion relations between equivariant features of different orders (generalizations of $N$-body invariants that provide a complete representation of the symmetries of improper rotations) to compute high-order terms efficiently. In combination with the automatic selection of the most expressive combination of features at each order, this approach provides a conceptual and practical framework to generate systematically-improvable, symmetry adapted representations for atomistic machine learning. △ Less

Submitted 7 July, 2020; originally announced July 2020.

Journal ref: J. Chem. Phys. 153, 121101 (2020)

arXiv:2006.12597 [pdf, other]

doi 10.1021/acs.jctc.0c00362

Simulating solvation and acidity in complex mixtures with first-principles accuracy: the case of CH$_3$SO$_3$H and H$_2$O$_2$ in phenol

Authors: Kevin Rossi, Veronika Juraskova, Raphael Wischert, Laurent Garel, Clemence Corminboeuf, Michele Ceriotti

Abstract: We present a generally-applicable computational framework for the efficient and accurate characterization of molecular structural patterns and acid properties in explicit solvent using H$_2$O$_2$ and CH$_3$SO$_3$H in phenol as an example. In order to address the challenges posed by the complexity of the problem, we resort to a set of data-driven methods and enhanced sampling algorithms. The synerg… ▽ More We present a generally-applicable computational framework for the efficient and accurate characterization of molecular structural patterns and acid properties in explicit solvent using H$_2$O$_2$ and CH$_3$SO$_3$H in phenol as an example. In order to address the challenges posed by the complexity of the problem, we resort to a set of data-driven methods and enhanced sampling algorithms. The synergistic application of these techniques makes the first-principle estimation of the chemical properties feasible without renouncing to the use of explicit solvation, involving extensive statistical sampling. Ensembles of neural network potentials are trained on a set of configurations carefully selected out of preliminary simulations performed at a low-cost density-functional tight-binding (DFTB) level. The energy and forces of these configurations are then recomputed at the hybrid density functional theory (DFT) level and used to train the neural networks. The stability of the NN model is enhanced by using DFTB energetics as a baseline, but the efficiency of the direct NN (i.e., baseline-free) is exploited via a multiple-time step integrator. The neural network potentials are combined with enhanced sampling techniques, such as replica exchange and metadynamics, and used to characterize the relevant protonated species and dominant non-covalent interactions in the mixture, also considering nuclear quantum effects. △ Less

Submitted 22 June, 2020; originally announced June 2020.

Journal ref: J. Chem. Theory Comput. 16(8), 5139-5149 (2020)

arXiv:2006.11803 [pdf, other]

doi 10.1103/PhysRevB.102.235130

Learning the electronic density of states in condensed matter

Authors: Chiheb Ben Mahmoud, Andrea Anelli, Gábor Csányi, Michele Ceriotti

Abstract: The electronic density of states (DOS) quantifies the distribution of the energy levels that can be occupied by electrons in a quasiparticle picture, and is central to modern electronic structure theory. It also underpins the computation and interpretation of experimentally observable material properties such as optical absorption and electrical conductivity. We discuss the challenges inherent in… ▽ More The electronic density of states (DOS) quantifies the distribution of the energy levels that can be occupied by electrons in a quasiparticle picture, and is central to modern electronic structure theory. It also underpins the computation and interpretation of experimentally observable material properties such as optical absorption and electrical conductivity. We discuss the challenges inherent in the construction of a machine-learning (ML) framework aimed at predicting the DOS as a combination of local contributions that depend in turn on the geometric configuration of neighbours around each atom, using quasiparticle energy levels from density functional theory as training data. We present a challenging case study that includes configurations of silicon spanning a broad set of thermodynamic conditions, ranging from bulk structures to clusters, and from semiconducting to metallic behavior. We compare different approaches to represent the DOS, and the accuracy of predicting quantities such as the Fermi level, the DOS at the Fermi level, or the band energy, either directly or as a side-product of the evaluation of the DOS. The performance of the model depends crucially on the smoothening of the DOS, and there is a tradeoff to be made between the systematic error associated with the smoothening and the error in the ML model for a specific structure. We demonstrate the usefulness of this approach by computing the density of states of a large amorphous silicon sample, for which it would be prohibitively expensive to compute the DOS by direct electronic structure calculations, and show how the atom-centred decomposition of the DOS that is obtained through our model can be used to extract physical insights into the connections between structural and electronic features. △ Less

Submitted 12 November, 2020; v1 submitted 21 June, 2020; originally announced June 2020.

Journal ref: Phys. Rev. B 102, 235130 (2020)

arXiv:2004.06950 [pdf, other]

Machine learning force fields and coarse-grained variables in molecular dynamics: application to materials and biological systems

Authors: Paraskevi Gkeka, Gabriel Stoltz, Amir Barati Farimani, Zineb Belkacemi, Michele Ceriotti, John Chodera, Aaron R. Dinner, Andrew Ferguson, Jean-Bernard Maillet, Hervé Minoux, Christine Peter, Fabio Pietrucci, Ana Silveira, Alexandre Tkatchenko, Zofia Trstanova, Rafal Wiewiora, Tony Leliévre

Abstract: Machine learning encompasses a set of tools and algorithms which are now becoming popular in almost all scientific and technological fields. This is true for molecular dynamics as well, where machine learning offers promises of extracting valuable information from the enormous amounts of data generated by simulation of complex systems. We provide here a review of our current understanding of goals… ▽ More Machine learning encompasses a set of tools and algorithms which are now becoming popular in almost all scientific and technological fields. This is true for molecular dynamics as well, where machine learning offers promises of extracting valuable information from the enormous amounts of data generated by simulation of complex systems. We provide here a review of our current understanding of goals, benefits, and limitations of machine learning techniques for computational studies on atomistic systems, focusing on the construction of empirical force fields from ab-initio databases and the determination of reaction coordinates for free energy computation and enhanced sampling. △ Less

Submitted 15 April, 2020; originally announced April 2020.

Showing 1–50 of 136 results for author: Ceriotti, M