-
PolyGET: Accelerating Polymer Simulations by Accurate and Generalizable Forcefield with Equivariant Transformer
Authors:
Rui Feng,
Huan Tran,
Aubrey Toland,
Binghong Chen,
Qi Zhu,
Rampi Ramprasad,
Chao Zhang
Abstract:
Polymer simulation with both accuracy and efficiency is a challenging task. Machine learning (ML) forcefields have been developed to achieve both the accuracy of ab initio methods and the efficiency of empirical force fields. However, existing ML force fields are usually limited to single-molecule settings, and their simulations are not robust enough. In this paper, we present PolyGET, a new framework for Polymer Forcefields with Generalizable Equivariant Transformers. PolyGET is designed to capture complex quantum interactions between atoms and generalize across various polymer families, using a deep learning model called Equivariant Transformers. We propose a new training paradigm that focuses exclusively on optimizing forces, which is different from existing methods that jointly optimize forces and energy. This simple force-centric objective function avoids competing objectives between energy and forces, thereby allowing for learning a unified forcefield ML model over different polymer families. We evaluated PolyGET on a large-scale dataset of 24 distinct polymer types and demonstrated state-of-the-art performance in force accuracy and robust MD simulations. Furthermore, PolyGET can simulate large polymers with high fidelity to the reference ab initio DFT method while being able to generalize to unseen polymers.
Submitted 1 September, 2023;
originally announced September 2023.
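
The force-only objective described in the PolyGET abstract can be contrasted with a joint energy-and-force objective in a few lines. This is a minimal sketch, not the authors' implementation: the function names, the mean-squared-error form, and the `energy_weight` parameter are illustrative assumptions that do not come from the abstract.

```python
# Hypothetical sketch: force-only loss vs. joint energy+force loss.
# Forces are lists of per-atom [fx, fy, fz] vectors; all names are illustrative.

def force_mse(pred_forces, ref_forces):
    """Mean squared error over every atom and Cartesian component."""
    total, count = 0.0, 0
    for pred_atom, ref_atom in zip(pred_forces, ref_forces):
        for p, r in zip(pred_atom, ref_atom):
            total += (p - r) ** 2
            count += 1
    return total / count


def joint_loss(pred_energy, ref_energy, pred_forces, ref_forces, energy_weight=0.01):
    """Assumed joint objective: weighted squared energy error plus force MSE."""
    energy_term = energy_weight * (pred_energy - ref_energy) ** 2
    return energy_term + force_mse(pred_forces, ref_forces)


# Two-atom toy example with made-up numbers.
ref_forces = [[0.0, 0.0, 1.0], [0.0, 0.0, -1.0]]
pred_forces = [[0.0, 0.0, 0.5], [0.0, 0.0, -0.5]]

force_only = force_mse(pred_forces, ref_forces)
joint = joint_loss(1.0, 3.0, pred_forces, ref_forces)
```

In the force-only case the energy term is absent, so there is no weight to balance between the two targets.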
-
May the Force be with You: Unified Force-Centric Pre-Training for 3D Molecular Conformations
Authors:
Rui Feng,
Qi Zhu,
Huan Tran,
Binghong Chen,
Aubrey Toland,
Rampi Ramprasad,
Chao Zhang
Abstract:
Recent works have shown the promise of learning pre-trained models for 3D molecular representation. However, existing pre-training models focus predominantly on equilibrium data and largely overlook off-equilibrium conformations. It is challenging to extend these methods to off-equilibrium data because their training objective relies on the assumption that conformations are local energy minima. We address this gap by proposing a force-centric pre-training model for 3D molecular conformations covering both equilibrium and off-equilibrium data. For off-equilibrium data, our model learns directly from their atomic forces. For equilibrium data, we introduce zero-force regularization and force-based denoising techniques to approximate near-equilibrium forces. We obtain a unified pre-trained model for 3D molecular representation with over 15 million diverse conformations. Experiments show that, with our pre-training objective, we increase force accuracy by around 3 times compared to the Equivariant Transformer model without pre-training. By incorporating regularizations on equilibrium data, we solved the problem of unstable MD simulations in vanilla Equivariant Transformers, achieving state-of-the-art simulation performance with 2.45 times faster inference than NequIP. As a powerful molecular encoder, our pre-trained model achieves performance on par with the state of the art on property prediction tasks.
Submitted 23 August, 2023;
originally announced August 2023.
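
One plausible reading of the zero-force regularization mentioned in the abstract above: an equilibrium conformation sits at a local energy minimum, so its reference forces are zero and any predicted force can be penalized directly. The sketch below assumes a mean-squared penalty; the function name and the exact form are illustrative and are not taken from the paper.

```python
# Hypothetical sketch of a zero-force penalty for equilibrium conformations.
# pred_forces is a list of per-atom [fx, fy, fz] vectors predicted by a model.

def zero_force_penalty(pred_forces):
    """Mean squared predicted force; zero only when every component vanishes."""
    n_components = sum(len(atom) for atom in pred_forces)
    squared_sum = sum(f * f for atom in pred_forces for f in atom)
    return squared_sum / n_components


# Made-up predictions for a two-atom equilibrium structure.
penalty_at_rest = zero_force_penalty([[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]])
penalty_off = zero_force_penalty([[1.0, 0.0, 0.0], [0.0, 2.0, 0.0]])
```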
-
Polymer informatics at-scale with multitask graph neural networks
Authors:
Rishi Gurnani,
Christopher Kuenneth,
Aubrey Toland,
Rampi Ramprasad
Abstract:
Artificial intelligence-based methods are becoming increasingly effective at screening libraries of polymers down to a selection that is manageable for experimental inquiry. The vast majority of presently adopted approaches for polymer screening rely on handcrafted chemostructural features extracted from polymer repeat units -- a burdensome task as polymer libraries, which approximate the polymer chemical search space, progressively grow over time. Here, we demonstrate that directly "machine-learning" important features from a polymer repeat unit is a cheap and viable alternative to extracting expensive features by hand. Our approach -- based on graph neural networks, multitask learning, and other advanced deep learning techniques -- speeds up feature extraction by one to two orders of magnitude relative to presently adopted handcrafted methods without compromising model accuracy for a variety of polymer property prediction tasks. We anticipate that our approach, which unlocks the screening of truly massive polymer libraries at scale, will enable more sophisticated and large scale screening technologies in the field of polymer informatics.
Submitted 17 January, 2023; v1 submitted 27 September, 2022;
originally announced September 2022.
-
Database, Features, and Machine Learning Model to Identify Thermally Driven Metal-Insulator Transition Compounds
Authors:
Alexandru B. Georgescu,
Peiwen Ren,
Aubrey R. Toland,
Shengtong Zhang,
Kyle D. Miller,
Daniel W. Apley,
Elsa A. Olivetti,
Nicholas Wagner,
James M. Rondinelli
Abstract:
Metal-insulator transition (MIT) compounds are materials that may exhibit insulating or metallic behavior, depending on the physical conditions, and are of immense fundamental interest owing to their potential applications in emerging microelectronics. There is a dearth of thermally-driven MIT materials, however, which makes delineating these compounds from those that are exclusively insulating or metallic challenging. Here we report a material database comprising temperature-controlled MITs (and metals and insulators with similar chemical composition and stoichiometries to the MIT compounds) from high quality experimental literature, built through a combination of materials-domain knowledge and natural language processing. We featurize the dataset using compositional, structural, and energetic descriptors, including two MIT relevant energy scales, an estimated Hubbard interaction and the charge transfer energy, as well as the structure-bond-stress metric referred to as the global-instability index (GII). We then perform supervised classification, constructing three electronic-state classifiers: metal vs non-metal (M), insulator vs non-insulator (I), and MIT vs non-MIT (T). We identify two important descriptors that separate metals, insulators, and MIT materials in a 2D feature space: the average deviation of the covalent radius and the range of the Mendeleev number. We further elaborate on other important features (GII and Ewald energy), and examine how they affect classification of binary vanadium and titanium oxides. We discuss the relationship of these atomic features to the physical interactions underlying MITs in the rare-earth nickelate family. Last, we implement an online version of the classifiers, enabling quick probabilistic class predictions by uploading a crystallographic structure file.
Submitted 21 July, 2021; v1 submitted 25 October, 2020;
originally announced October 2020.
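
The three electronic-state classifiers named in the abstract above (M, I, and T) each answer a one-vs-rest question about the same compound. A minimal sketch of how a single three-way label could be turned into the three binary targets follows; the label strings and the function name are assumptions for illustration, not the authors' code.

```python
# Hypothetical sketch: derive binary targets for the M, I, and T classifiers
# from one three-way label per compound.

CLASSES = ("metal", "insulator", "MIT")


def binary_targets(label):
    """Return the target of each one-vs-rest classifier for a compound label."""
    if label not in CLASSES:
        raise ValueError(f"unknown label: {label!r}")
    return {
        "M": label == "metal",      # metal vs non-metal
        "I": label == "insulator",  # insulator vs non-insulator
        "T": label == "MIT",        # MIT vs non-MIT
    }


targets = binary_targets("MIT")
```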