-
Accelerated Design of Block Copolymers: An Unbiased Exploration Strategy via Fusion of Molecular Dynamics Simulations and Machine Learning
Authors:
Jan Michael Y. Carrillo,
Vijith P,
Tarak K. Patra,
Zhan Chen,
Thomas P. Russell,
Subramanian KRS Sankaranarayanan,
Bobby G. Sumpter,
Rohit Batra
Abstract:
Star block copolymers (s-BCPs) have potential applications as novel surfactants or amphiphiles for emulsification, compatbilization, chemical transformations and separations. s-BCPs are star-shaped macromolecules comprised of linear chains of different chemical blocks (e.g., solvophilic and solvophobic blocks) that are covalently joined at one junction point. Various parameters of these macromolec…
▽ More
Star block copolymers (s-BCPs) have potential applications as novel surfactants or amphiphiles for emulsification, compatbilization, chemical transformations and separations. s-BCPs are star-shaped macromolecules comprised of linear chains of different chemical blocks (e.g., solvophilic and solvophobic blocks) that are covalently joined at one junction point. Various parameters of these macromolecules can be tuned to obtain desired surface properties, including the number of arms, composition of the arms, and the degree-of-polymerization of the blocks (or the length of the arm). This makes identification of the optimal s-BCP design highly non-trivial as the total number of plausible s-BCPs architectures is experimentally or computationally intractable. In this work, we use molecular dynamics (MD) simulations coupled with reinforcement learning based Monte Carlo tree search (MCTS) to identify s-BCPs designs that minimize the interfacial tension between polar and non-polar solvents. We first validate the MCTS approach for design of small- and medium-sized s-BCPs, and then use it to efficiently identify sequences of copolymer blocks for large-sized s-BCPs. The structural origins of interfacial tension in these systems are also identified using the configurations obtained from MD simulations. Chemical insights on the arrangement of copolymer blocks that promote lower interfacial tension were mined using machine learning (ML) techniques. Overall, this work provides an efficient approach to solve design problems via fusion of simulations and ML and provide important groundwork for future experimental investigation of s-BCPs sequences for various applications.
△ Less
Submitted 16 August, 2023;
originally announced August 2023.
-
Millions of Co-purchases and Reviews Reveal the Spread of Polarization and Lifestyle Politics across Online Markets
Authors:
Alexander Ruch,
Ari Decter-Frain,
Raghav Batra
Abstract:
Polarization in America has reached a high point as markets are also becoming polarized. Existing research, however, focuses on specific market segments and products and has not evaluated this trend's full breadth. If such fault lines do spread into other segments that are not explicitly political, it would indicate the presence of lifestyle politics -- when ideas and behaviors not inherently poli…
▽ More
Polarization in America has reached a high point as markets are also becoming polarized. Existing research, however, focuses on specific market segments and products and has not evaluated this trend's full breadth. If such fault lines do spread into other segments that are not explicitly political, it would indicate the presence of lifestyle politics -- when ideas and behaviors not inherently political become politically aligned through their connections with explicitly political things. We study the pervasiveness of polarization and lifestyle politics over different product segments in a diverse market and test the extent to which consumer- and platform-level network effects and morality may explain lifestyle politics. Specifically, using graph and language data from Amazon (82.5M reviews of 9.5M products and product and category metadata from 1996-2014), we sample 234.6 million relations among 21.8 million market entities to find product categories that are most politically relevant, aligned, and polarized. We then extract moral values present in reviews' text and use these data and other reviewer-, product-, and category-level data to test whether individual- and platform- level network factors explain lifestyle politics better than products' implicit morality. We find pervasive lifestyle politics. Cultural products are 4 times more polarized than any other segment, products' political attributes have up to 3.7 times larger associations with lifestyle politics than author-level covariates, and morality has statistically significant but relatively small correlations with lifestyle politics. Examining lifestyle politics in these contexts helps us better understand the extent and root of partisan differences, why Americans may be so polarized, and how this polarization affects market systems.
△ Less
Submitted 17 January, 2022;
originally announced January 2022.
-
Screening of Therapeutic Agents for COVID-19 using Machine Learning and Ensemble Docking Simulations
Authors:
Rohit Batra,
Henry Chan,
Ganesh Kamath,
Rampi Ramprasad,
Mathew J. Cherukara,
Subramanian Sankaranarayanan
Abstract:
The world has witnessed unprecedented human and economic loss from the COVID-19 disease, caused by the novel coronavirus SARS-CoV-2. Extensive research is being conducted across the globe to identify therapeutic agents against the SARS-CoV-2. Here, we use a powerful and efficient computational strategy by combining machine learning (ML) based models and high-fidelity ensemble docking simulations t…
▽ More
The world has witnessed unprecedented human and economic loss from the COVID-19 disease, caused by the novel coronavirus SARS-CoV-2. Extensive research is being conducted across the globe to identify therapeutic agents against the SARS-CoV-2. Here, we use a powerful and efficient computational strategy by combining machine learning (ML) based models and high-fidelity ensemble docking simulations to enable rapid screening of possible therapeutic molecules (or ligands). Our screening is based on the binding affinity to either the isolated SARS-CoV-2 S-protein at its host receptor region or to the Sprotein-human ACE2 interface complex, thereby potentially limiting and/or disrupting the host-virus interactions. We first apply our screening strategy to two drug datasets (CureFFI and DrugCentral) to identify hundreds of ligands that bind strongly to the aforementioned two systems. Candidate ligands were then validated by all atom docking simulations. The validated ML models were subsequently used to screen a large bio-molecule dataset (with nearly a million entries) to provide a rank-ordered list of ~19,000 potentially useful compounds for further validation. Overall, this work not only expands our knowledge of small-molecule treatment against COVID-19, but also provides an efficient pathway to perform high-throughput computational drug screening by combining quick ML surrogate models with expensive high-fidelity simulations, for accelerating the therapeutic cure of diseases.
△ Less
Submitted 7 April, 2020;
originally announced April 2020.
-
Machine Learning for Multi-fidelity Scale Bridging and Dynamical Simulations of Materials
Authors:
Rohit Batra,
Subramanian Sankaranarayanan
Abstract:
Molecular dynamics (MD) is a powerful and popular tool for understanding the dynamical evolution of materials at the nano and mesoscopic scales. There are various flavors of MD ranging from the high fidelity albeit computationally expensive ab-initio MD to relatively lower fidelity but much more efficient classical MD such as atomistic and coarse-grained models. Each of these different flavors of…
▽ More
Molecular dynamics (MD) is a powerful and popular tool for understanding the dynamical evolution of materials at the nano and mesoscopic scales. There are various flavors of MD ranging from the high fidelity albeit computationally expensive ab-initio MD to relatively lower fidelity but much more efficient classical MD such as atomistic and coarse-grained models. Each of these different flavors of MD have been independently used by materials scientists to bring about breakthroughs in materials discovery and design. A significant gulf exists between the various MD flavors, each having varying levels of fidelity. The accuracy of DFT or ab-initio MD is generally much higher than that of classical atomistic simulations which is higher than that of coarse-grained models. Multi-fidelity scale bridging to combine the accuracy and flexibility of ab-initio MD with efficiency classical MD has been a longstanding goal. The advent of big-data analytics has brought to the forefront powerful machine learning methods that can be deployed to achieve this goal. Here, we provide our perspective on the challenges in multi-fidelity scale bridging and trace the developments leading up to the use of machine learning algorithms and data-science towards addressing this grand challenge.
△ Less
Submitted 1 April, 2020;
originally announced April 2020.
-
Machine Learning Models for the Lattice Thermal Conductivity Prediction of Inorganic Materials
Authors:
Lihua Chen,
Huan Tran,
Rohit Batra,
Chiho Kim,
Rampi Ramprasad
Abstract:
The lattice thermal conductivity ($κ_{\rm L} $) is a critical property of thermoelectrics, thermal barrier coating materials and semiconductors. While accurate empirical measurements of $κ_{\rm L} $ are extremely challenging, it is usually approximated through computational approaches, such as semi-empirical models, Green-Kubo formalism coupled with molecular dynamics simulations, and first-princi…
▽ More
The lattice thermal conductivity ($κ_{\rm L} $) is a critical property of thermoelectrics, thermal barrier coating materials and semiconductors. While accurate empirical measurements of $κ_{\rm L} $ are extremely challenging, it is usually approximated through computational approaches, such as semi-empirical models, Green-Kubo formalism coupled with molecular dynamics simulations, and first-principles based methods. However, these theoretical methods are not only limited in terms of their accuracy, but sometimes become computationally intractable owing to their cost. Thus, in this work, we build a machine learning (ML)-based model to accurately and instantly predict $κ_{\rm L}$ of inorganic materials, using a benchmark data set of experimentally measured $κ_{\rm L} $ of about 100 inorganic solids. We use advanced and universal feature engineering techniques along with the Gaussian process regression algorithm, and compare the performance of our ML model with past theoretical works. The trained ML model is not only helpful for rational design and screening of novel materials, but we also identify key features governing the thermal transport behavior in non-metals.
△ Less
Submitted 4 August, 2019; v1 submitted 14 June, 2019;
originally announced June 2019.