-
Completely Multipolar Model as a General Framework for Many-Body Interactions as Illustrated for Water
Authors:
Joseph P. Heindel,
Selim Sami,
Teresa Head-Gordon
Abstract:
We introduce a general framework for many-body force field models, the Completely Multipolar Model (CMM), that utilizes multipolar electrical moments modulated by exponential decay of electron density as a common functional form for all piecewise terms of an energy decomposition analysis of intermolecular interactions. With this common functional form the CMM model establishes well-formulated damped tensors that reach the correct asymptotes at both long- and short-range while formally ensuring no short-range catastrophes. The CMM describes the separable EDA terms of dispersion, exchange polarization, and Pauli repulsion with short-ranged anisotropy, polarization as intramolecular charge fluctuations and induced dipoles, while charge transfer describes explicit movement of charge between molecules, and naturally describes many-body charge transfer by coupling into the polarization equations. We also utilize a new one-body potential that accounts for intramolecular polarization by including an electric field-dependent correction to the Morse potential to ensure that the CMM reproduces all physically relevant monomer properties including the dipole moment, molecular polarizability, and dipole and polarizability derivatives. The quality of the CMM is illustrated through agreement of individual terms of the EDA and excellent extrapolation to energies and geometries of an extensive validation set of water cluster data.
Submitted 22 June, 2024;
originally announced June 2024.
-
A Curated Rotamer Library for Common Post-Translational Modifications of Proteins
Authors:
Oufan Zhang,
Shubhankar A. Naik,
Zi Hao Liu,
Julie Forman-Kay,
Teresa Head-Gordon
Abstract:
Sidechain rotamer libraries of the common amino acids of a protein are useful for folded protein structure determination and for generating ensembles of intrinsically disordered proteins (IDPs). However, much of protein function is modulated beyond the translated sequence through the introduction of post-translational modifications (PTMs). In this work we have provided a curated set of side chain rotamers for the most common PTMs derived from the RCSB PDB database, including phosphorylated, methylated, and acetylated sidechains. Our rotamer libraries improve upon existing methods such as SIDEpro and Rosetta in predicting the experimental structures for PTMs in folded proteins. In addition, we showcase our PTM libraries in full use by generating ensembles with the Monte Carlo Side Chain Entropy (MCSCE) method for folded proteins, and combining MCSCE with the Local Disordered Region Sampling algorithms within IDPConformerGenerator for proteins with intrinsically disordered regions.
Submitted 5 May, 2024;
originally announced May 2024.
-
Deep Learning of ab initio Hessians for Transition State Optimization
Authors:
Eric C. -Y. Yuan,
Anup Kumar,
Xingyi Guan,
Eric D. Hermes,
Andrew S. Rosen,
Judit Zádor,
Teresa Head-Gordon,
Samuel M. Blau
Abstract:
Identifying transition states -- saddle points on the potential energy surface connecting reactant and product minima -- is central to predicting kinetic barriers and understanding chemical reaction mechanisms. In this work, we train an equivariant neural network potential, NewtonNet, on an ab initio dataset of thousands of organic reactions from which we derive the analytical Hessians from the fully differentiable machine learning (ML) model. By reducing the computational cost by several orders of magnitude relative to the Density Functional Theory (DFT) ab initio source, we can afford to use the learned Hessians at every step for the saddle point optimizations. We have implemented our ML Hessian algorithm in Sella, an open source software package designed to optimize atomic systems to find saddle point structures, in order to compare transition state optimization against quasi-Newton Hessian updates using DFT or the ML model. We show that the full ML Hessian robustly finds the transition states of 240 unseen organic reactions, even when the quality of the initial guess structures is degraded, while reducing the number of optimization steps to convergence by 2--3$\times$ compared to the quasi-Newton DFT and ML methods. All data generation, NewtonNet model, and ML transition state finding methods are available in an automated workflow.
Submitted 3 May, 2024;
originally announced May 2024.
-
Water Structure and Electric Fields at the Interface of Oil Droplets
Authors:
Lixue Shi,
R. Allen LaCour,
Xiaoqi Lang,
Joseph P. Heindel,
Teresa Head-Gordon,
Wei Min
Abstract:
Mesoscale water-hydrophobic interfaces are of fundamental importance in multiple disciplines, but their molecular properties have remained elusive for decades due to experimental complications and alternate theoretical explanations. Surface-specific spectroscopies, such as vibrational sum-frequency techniques, suffer from either sample preparation issues or the need for complex spectral corrections. Here, we report on a robust "in solution" interface-selective Raman spectroscopy approach using multivariate curve resolution to probe hexadecane in water emulsions. Computationally, we use the recently developed monomer field model for Raman spectroscopy to help interpret the interfacial spectra. Unlike with vibrational sum-frequency techniques, our interfacial spectra are readily comparable to the spectra of bulk water, yielding new insights. The combination of experiment and theory shows that the interface leads to reduced tetrahedral order and weaker hydrogen bonding, giving rise to a substantial water population with dangling OH at the interface. Additionally, the stretching mode of these free OH groups experiences a ~80 cm$^{-1}$ red-shift due to a strong electric field, which we attribute to the negative zeta potential that is general to oil droplets. These findings are either opposite to, or absent in, the molecular hydrophobic interface formed by small solutes. Together, water structural disorder and enhanced electrostatics are emergent features at the mesoscale interface of oil-water emulsions, with an estimated interfacial electric field of ~35-70 MV/cm that is important for chemical reactivity.
Submitted 3 May, 2024;
originally announced May 2024.
-
The Role of Charge in Microdroplet Redox Chemistry
Authors:
Joseph P. Heindel,
R. Allen LaCour,
Teresa Head-Gordon
Abstract:
In charged water microdroplets, which occur in nature or in the lab upon ultrasonication or in electrospray processes, the thermodynamics for reactive chemistry can be dramatically altered relative to the bulk phase. Here, we provide a theoretical basis for the observation of accelerated chemistry by simulating water droplets of increasing charge imbalance to create redox agents such as hydroxyl and hydrogen radicals and solvated electrons. We compute the hydration enthalpy of OH^- and H^+ that controls the electron transfer process, and the corresponding changes in vertical ionization energy and vertical electron affinity of the ions, to create OH* and H* reactive species. We find that at ~20-50% of the Rayleigh limit of droplet charge the hydration enthalpy of both OH^- and H^+ has decreased by >50 kcal/mol such that electron transfer becomes thermodynamically favorable, in correspondence with the more favorable vertical electron affinity of H^+ and the lowered vertical ionization energy of OH^-. We provide scaling arguments that show that the nanoscale calculations and conclusions extend to the experimental microdroplet length scale. The relevance of the droplet charge for chemical reactivity is illustrated for the formation of H2O2, and has clear implications for other redox reactions observed to occur with enhanced rates in microdroplets.
Submitted 2 May, 2024;
originally announced May 2024.
-
Leak Proof PDBBind: A Reorganized Dataset of Protein-Ligand Complexes for More Generalizable Binding Affinity Prediction
Authors:
Jie Li,
Xingyi Guan,
Oufan Zhang,
Kunyang Sun,
Yingze Wang,
Dorian Bagni,
Teresa Head-Gordon
Abstract:
Many physics-based and machine-learned scoring functions (SFs) used to predict protein-ligand binding free energies have been trained on the PDBBind dataset. However, it is controversial as to whether new SFs are actually improving since the general, refined, and core datasets of PDBBind are cross-contaminated with proteins and ligands with high similarity, and hence they may not perform comparably well in binding prediction of new protein-ligand complexes. In this work we have carefully prepared a cleaned PDBBind data set of non-covalent binders that are split into training, validation, and test datasets to control for data leakage, defined as proteins and ligands with high sequence and structural similarity. The resulting leak-proof (LP)-PDBBind data is used to retrain four popular SFs: AutoDock Vina, Random Forest (RF)-Score, InteractionGraphNet (IGN), and DeepDTA, to better test their capabilities when applied to new protein-ligand complexes. In particular we have formulated a new independent data set, BDB2020+, by matching high quality binding free energies from BindingDB with co-crystalized ligand-protein complexes from the PDB that have been deposited since 2020. Based on all the benchmark results, the retrained models using LP-PDBBind consistently perform better, with IGN especially being recommended for scoring and ranking applications for new protein-ligand systems.
Submitted 2 May, 2024; v1 submitted 18 August, 2023;
originally announced August 2023.
-
Greater Transferability and Accuracy of Norm-conserving Pseudopotentials using Nonlinear Core Corrections
Authors:
Wan-Lu Li,
Kaixuan Chen,
Elliot Rossomme,
Martin Head-Gordon,
Teresa Head-Gordon
Abstract:
We present an investigation into the transferability of pseudopotentials (PPs) with a nonlinear core correction (NLCC) using the Goedecker, Teter, and Hutter (GTH) protocol across a range of pure GGA, meta-GGA and hybrid functionals, and their impact on thermochemical and non-thermochemical properties. The GTH-NLCC PP for the PBE density functional demonstrates remarkable transferability to the PBE0 and $ω$B97X-V exchange-correlation functionals, and relative to no NLCC, improves agreement significantly for thermochemical benchmarks compared to all-electron calculations. On the other hand, the B97M-rV meta-GGA functional performs poorly with the PBE-derived GTH-NLCC PP, which is mitigated by reoptimizing the NLCC parameters for this specific functional. The findings reveal that atomization energies exhibit the greatest improvements from use of the NLCC, which thus provides an important correction needed for covalent interactions relevant to applications involving chemical reactivity. Finally we test the NLCC-GTH PPs when combined with medium-size TZV2P molecularly optimized (MOLOPT) basis sets which are typically utilized in condensed phase simulations, and show that they lead to consistently good results when compared to all-electron calculations for atomization energies, ionization potentials, barrier heights, and non-covalent interactions, but lead to somewhat larger errors for electron affinities.
Submitted 18 July, 2023;
originally announced July 2023.
-
Beyond potential energy surface benchmarking: a complete application of machine learning to chemical reactivity
Authors:
Xingyi Guan,
Joseph Heindel,
Taehee Ko,
Chao Yang,
Teresa Head-Gordon
Abstract:
We train an equivariant machine learning model to predict energies and forces for a real-world study of hydrogen combustion under conditions of finite temperature and pressure. This challenging case for reactive chemistry illustrates that ML potential energy surfaces (PESs) are always incomplete as they are overly reliant on chemical intuition of what data is important for training, i.e. stable or metastable energy states. Instead we show here that a negative design data acquisition strategy is necessary to create a more complete ML model of the PES, since it must also learn avoidance of unforeseen high energy intermediates or even unphysical energy configurations. Because this type of data is unintuitive to create, we introduce an active learning workflow based on metadynamics that samples a lower dimensional manifold within collective variables that efficiently creates highly variable energy configurations for further ML training. This strategy more rapidly completes the ML PES such that deviations among query-by-committee ML models now help to signal occasional calls to the external ab initio data source to advance the molecular dynamics in time without need for retraining the ML model. With the hybrid ML-physics model we predict the change in transition state and/or reaction mechanism at finite temperature and pressure for hydrogen combustion, thereby delivering on the promise of real application work using ML trained models of an ab initio PES with two orders of magnitude reduction in cost.
Submitted 14 June, 2023;
originally announced June 2023.
-
Highly Accurate Prediction of NMR Chemical Shifts from Low-Level Quantum Mechanics Calculations Using Machine Learning
Authors:
Jie Li,
Jiashu Liang,
Zhe Wang,
Aleksandra L. Ptaszek,
Xiao Liu,
Brad Ganoe,
Martin Head-Gordon,
Teresa Head-Gordon
Abstract:
Theoretical predictions of NMR chemical shifts from first-principles can greatly facilitate experimental interpretation and structure identification. However, accurate prediction of chemical shifts using the best coupled cluster methods can be prohibitively expensive for systems larger than ten to twenty non-hydrogen atoms on today's computers. By contrast machine learning methods offer inexpensive alternatives but are hampered by generalization to molecules outside the original training set. Here we propose a novel machine learning feature representation informed by intermediate calculations of atomic chemical shielding tensors within a molecular environment using an inexpensive quantum mechanics method, and training it to predict NMR chemical shieldings of a high-level composite theory that is comparable to CCSD(T) in the complete basis set limit. The inexpensive shift machine learning (iShiftML) algorithm is trained through a new progressive active learning workflow that reduces the total number of expensive calculations required when constructing the dataset, while allowing the model to continuously improve on data it has never seen. Furthermore, we show that the error estimations from our model correlate quite well with actual errors to provide confidence values on new predictions. We illustrate the predictive capacity of iShiftML across gas phase experimental chemical shifts for small organic molecules and much larger and more complex natural products in which we can accurately differentiate between subtle diastereomers based on chemical shift assignments.
Submitted 14 June, 2023;
originally announced June 2023.
-
Using Diffusion Maps to Analyze Reaction Dynamics for a Hydrogen Combustion Benchmark Dataset
Authors:
Taehee Ko,
Joseph Heindel,
Xingyi Guan,
Teresa Head-Gordon,
David Williams-Young,
Chao Yang
Abstract:
We use local diffusion maps to assess the quality of two types of collective variables (CVs) for a recently published hydrogen combustion benchmark dataset~\cite{guan2022benchmark} that contains ab initio molecular dynamics trajectories and normal modes along minimum energy paths. This approach was recently advocated in~\cite{tlldiffmap20} for assessing CVs and analyzing reactions modeled by classical molecular dynamics simulations. We report the effectiveness of this approach to molecular systems modeled by quantum ab initio molecular dynamics. In addition to assessing the quality of CVs, we also use global diffusion maps to perform committor analysis as proposed in~\cite{tlldiffmap20}. We show that the committor function obtained from the global diffusion map allows us to identify transition regions of interest in several hydrogen combustion reaction channels.
Submitted 18 April, 2023;
originally announced April 2023.
-
Efficient Calculation of NMR Shielding Constants Using Composite Method Approximations and Locally Dense Basis Sets
Authors:
Jiashu Liang,
Zhe Wang,
Jie Li,
Jonathan Wong,
Xiao Liu,
Brad Ganoe,
Teresa Head-Gordon,
Martin Head-Gordon
Abstract:
This paper presents a systematic study of applying composite method approximations with locally dense basis sets (LDBS) to efficiently calculate NMR shielding constants in small and medium-sized molecules. The pcSseg-n series of basis sets is shown to have similar accuracy to the pcS-n series when $n \geq 1$ and can slightly reduce compute costs. We identify two different LDBS partition schemes that perform very effectively for density functional calculations. We select a large subset of the recent NS372 database containing 290 H, C, N, and O shielding values evaluated by reference methods on 106 molecules to carefully assess methods of high, medium, and low compute cost and make practical recommendations. Our assessment covers conventional electronic structure methods (DFT and wavefunction) with global basis calculations, as well as their use in one of the satisfactory LDBS approaches, and a range of composite approaches, also with and without LDBS. Altogether 99 methods are evaluated. On this basis, we recommend different methods to reach three different levels of accuracy and time requirements across the four nuclei considered.
Submitted 11 November, 2022; v1 submitted 9 September, 2022;
originally announced September 2022.
-
Learning to Evolve Structural Ensembles of Unfolded and Disordered Proteins Using Experimental Solution Data
Authors:
Oufan Zhang,
Mojtaba Haghighatlari,
Jie Li,
Joao Miguel Correia Teixeira,
Ashley Namini,
Zi-Hao Liu,
Julie D Forman-Kay,
Teresa Head-Gordon
Abstract:
We have developed a Generative Recurrent Neural Network (GRNN) that learns the probability of the next residue torsions $X_{i+1}=\ [φ_{i+1},ψ_{i+1},ω_{i+1}, χ_{i+1}]$ from the previous residue in the sequence $X_i$ to generate new IDP conformations. In addition, we couple the GRNN with a Bayesian model, X-EISD, in a reinforcement learning step that biases the probability distributions of torsions to take advantage of experimental data types such as J-couplings, NOEs and PREs. We show that updating the generative model parameters according to the reward feedback on the basis of the agreement between structures and data improves upon existing approaches that simply reweight static structural pools for disordered proteins. Instead the GRNN "DynamICE" model learns to physically change the conformations of the underlying pool to those that better agree with experiment.
Submitted 24 July, 2022; v1 submitted 25 June, 2022;
originally announced June 2022.
-
Learning Correlations between Internal Coordinates to improve 3D Cartesian Coordinates for Proteins
Authors:
Jie Li,
Oufan Zhang,
Seokyoung Lee,
Ashley Namini,
Zi Hao Liu,
João Miguel Correia Teixeira,
Julie D Forman-Kay,
Teresa Head-Gordon
Abstract:
We consider a generic representation problem of internal coordinates (bond lengths, valence angles, and dihedral angles) and their transformation to 3-dimensional Cartesian coordinates of a biomolecule. We show that the internal-to-Cartesian process relies on correctly predicting chemically subtle correlations among the internal coordinates themselves, and learning these correlations increases the fidelity of the Cartesian representation. This general problem has been solved with machine learning for proteins, but with appropriately formulated data is extensible to any type of chain biomolecule including RNA, DNA, and lipids. We developed a machine learning algorithm, Int2Cart, to predict bond lengths and bond angles from backbone torsion angles and residue types of a protein, which allows reconstruction of protein structures better than using fixed bond lengths and bond angles, or a static library method that relies on backbone torsion angles and residue types of a single residue. The Int2Cart algorithm has been implemented as an individual Python package at https://github.com/THGLab/int2cart.
Submitted 11 May, 2022; v1 submitted 10 May, 2022;
originally announced May 2022.
-
Mining for Potent Inhibitors through Artificial Intelligence and Physics: A Unified Methodology for Ligand Based and Structure Based Drug Design
Authors:
Jie Li,
Oufan Zhang,
Yingze Wang,
Kunyang Sun,
Xingyi Guan,
Dorian Bagni,
Mojtaba Haghighatlari,
Fiona L. Kearns,
Conor Parks,
Rommie E. Amaro,
Teresa Head-Gordon
Abstract:
Determining the viability of a new drug molecule is a time- and resource-intensive task that makes computer-aided assessments a vital approach to rapid drug discovery. Here we develop a machine learning algorithm, iMiner, that generates novel inhibitor molecules for target proteins by combining deep reinforcement learning with real-time 3D molecular docking using AutoDock Vina, thereby simultaneously creating chemical novelty while constraining molecules for shape and molecular compatibility with target active sites. Moreover, through the use of various types of reward functions, we can generate new molecules that are chemically similar to a target ligand, which can be grown from known protein-bound fragments, as well as create molecules that enforce interactions with target residues in the protein active site. The iMiner algorithm is embedded in a composite workflow that filters out Pan-assay interference compounds, Lipinski rule violations, and poor synthetic accessibility, with options for cross-validation against other docking scoring functions and automation of a molecular dynamics simulation to measure pose stability. Because our approach only relies on the structure of the target protein, iMiner can be easily adapted for future development of other inhibitors or small molecule therapeutics of any target protein.
Submitted 10 January, 2024; v1 submitted 4 October, 2021;
originally announced October 2021.
-
NewtonNet: A Newtonian message passing network for deep learning of interatomic potentials and forces
Authors:
Mojtaba Haghighatlari,
Jie Li,
Xingyi Guan,
Oufan Zhang,
Akshaya Das,
Christopher J. Stein,
Farnaz Heidar-Zadeh,
Meili Liu,
Martin Head-Gordon,
Luke Bertels,
Hongxia Hao,
Itai Leven,
Teresa Head-Gordon
Abstract:
We report a new deep learning message passing network that takes inspiration from Newton's equations of motion to learn interatomic potentials and forces. With the advantage of directional information from trainable latent force vectors, and physics-infused operators that are inspired by Newtonian physics, the entire model remains rotationally equivariant, and many-body interactions are inferred by more interpretable physical features. We test NewtonNet on the prediction of several reactive and non-reactive high quality ab initio data sets including single small molecule dynamics, a large set of chemically diverse molecules, and methane and hydrogen combustion reactions, achieving state-of-the-art test performance on energies and forces with far greater data and computational efficiency than other deep learning models.
Submitted 5 August, 2021;
originally announced August 2021.
-
Configurational Entropy of Folded Proteins and its Importance for Intrinsically Disordered Proteins
Authors:
Meili Liu,
Akshaya K. Das,
James Lincoff,
Sukanya Sasmal,
Sara Y. Cheng,
Robert Vernon,
Julie Forman-Kay,
Teresa Head-Gordon
Abstract:
Many pairwise additive force fields are in active use for intrinsically disordered proteins (IDPs) and regions (IDRs), some of which modify energetic terms to improve description of IDPs/IDRs, but are largely in disagreement with solution experiments for the disordered states. We have evaluated representative pairwise and many-body protein and water force fields against experimental data on representative IDPs and IDRs, a peptide that undergoes a disorder-to-order transition, and for seven globular proteins ranging in size from 130-266 amino acids. We find that force fields with the largest statistical fluctuations consistent with the radius of gyration and universal Lindemann values for folded states simultaneously better describe IDPs and IDRs and disorder to order transitions. Hence the crux of what a force field should exhibit to well describe IDRs/IDPs is not just the balance between protein and water energetics, but the balance between energetic effects and configurational entropy of folded states of globular proteins.
Submitted 11 November, 2020; v1 submitted 12 July, 2020;
originally announced July 2020.
-
The Critical Role of Thermal Fluctuations for Electrocatalytic Metal Surface Properties and CO Binding Trends
Authors:
Wan-Lu Li,
Christianna N. Lininger,
Valerie Vaissier Welborn,
Elliot Rossomme,
Alexis T. Bell,
Martin Head-Gordon,
Teresa Head-Gordon
Abstract:
This work addresses a longstanding theoretical discrepancy using Density Functional Theory (DFT) with experimental observations of CO binding trends on electrocatalytically relevant metals for the CO2 reduction reaction (CO2RR). By introducing thermal fluctuations using appropriate statistical mechanical NVT and NPT ensembles, we show that DFT with universal dispersion interactions yields qualitatively better metal surface strain trends and CO binding energetics, consistently predicts the correct site preference for all metals due to thermally induced surface distortions that preferentially expose the undercoordinated atop site for Cu(111) and Pt(111), and for the weak binding Ag(111) and Au(111) surfaces at finite temperatures shows CO-metal interactions that are a mixture of chemisorbed and physisorbed species. This study better places theory as an equal partner to experimental heterogeneous catalysis by demonstrating the need to fully account for finite temperature fluctuations to make contact with surface science experiments.
Submitted 11 November, 2020; v1 submitted 6 June, 2020;
originally announced June 2020.
-
Stochastic Constrained Extended System Dynamics for Solving Charge Equilibration Models
Authors:
Songchen Tan,
Itai Leven,
Dong An,
Lin Lin,
Teresa Head-Gordon
Abstract:
We present a new stochastic extended Lagrangian solution to charge equilibration that eliminates self-consistent field (SCF) calculations, thereby removing the computational bottleneck in solving the many-body solution with standard SCF solvers. By formulating both charges and chemical potential as latent variables, and introducing a holonomic constraint that satisfies charge conservation, the SC-XLMD method accurately reproduces structural, thermodynamic, and dynamical properties using ReaxFF, and shows excellent weak- and strong-scaling performance in the LAMMPS molecular simulation package.
Submitted 21 May, 2020;
originally announced May 2020.
-
An Isolated Water Droplet in the Aqueous Solution of a Supramolecular Tetrahedral Cage
Authors:
Federico Sebastiani,
Trandon A. Bender,
Simone Pezzotti,
Wan-Lu Li,
Gerhard Schwaab,
Robert G. Bergman,
Kenneth N. Raymond,
F. Dean Toste,
Teresa Head-Gordon,
Martina Havenith
Abstract:
Water under nanoconfinement at ambient conditions has exhibited low-dimensional ice formation and liquid-solid phase transitions, but with structural and dynamical signatures which map onto known regions of water's phase diagram. Using THz absorption spectroscopy and ab initio molecular dynamics, we have investigated the ambient water confined in a supramolecular tetrahedral assembly, and determined that a distinct network of 9-10 water molecules is present within the nanocavity of the host. The low-frequency absorption spectrum and theoretical analysis of the water in the Ga$_4$L$_6^{12-}$ host demonstrate that the structure and dynamics of the encapsulated droplet are distinct from any known phase of water. A further inference is that the release of the highly unusual encapsulated water droplet creates a strong thermodynamic driver for the high affinity binding of guests in aqueous solution for the Ga$_4$L$_6^{12-}$ supramolecular construct.
Submitted 12 July, 2020; v1 submitted 20 April, 2020;
originally announced April 2020.
-
Learning to Make Chemical Predictions: the Interplay of Feature Representation, Data, and Machine Learning Algorithms
Authors:
Mojtaba Haghighatlari,
Jie Li,
Farnaz Heidar-Zadeh,
Yuchen Liu,
Xingyi Guan,
Teresa Head-Gordon
Abstract:
Recently supervised machine learning has been ascending in providing new predictive approaches for chemical, biological and materials sciences applications. In this Perspective we focus on the interplay of machine learning algorithm with the chemically motivated descriptors and the size and type of data sets needed for molecular property prediction. Using Nuclear Magnetic Resonance chemical shift prediction as an example, we demonstrate that success is predicated on the choice of feature extracted or real-space representations of chemical structures, whether the molecular property data is abundant and/or experimentally or computationally derived, and how these together will influence the correct choice of popular machine learning algorithms drawn from deep learning, random forests, or kernel methods.
Submitted 28 February, 2020;
originally announced March 2020.
-
Extended Experimental Inferential Structure Determination Method for Evaluating the Structural Ensembles of Disordered Protein States
Authors:
James Lincoff,
Mickael Krzeminski,
Mojtaba Haghighatlari,
João M. C. Teixeira,
Gregory-Neal W. Gomes,
Claudiu C. Gradinaru,
Julie D. Forman-Kay,
Teresa Head-Gordon
Abstract:
Characterization of proteins with intrinsic or unfolded state disorder comprises a new frontier in structural biology, requiring the characterization of diverse and dynamic structural ensembles. We introduce a comprehensive Bayesian framework, the Extended Experimental Inferential Structure Determination (X-EISD) method, that calculates the maximum log-likelihood of a protein structural ensemble by accounting for the uncertainties of a wide range of experimental data and back-calculation models from structures, including NMR chemical shifts, J-couplings, Nuclear Overhauser Effects, paramagnetic relaxation enhancements, residual dipolar couplings, hydrodynamic radii, single molecule fluorescence Förster resonance energy transfer efficiencies, and small angle X-ray scattering intensity curves. We apply X-EISD to the drkN SH3 unfolded state domain and show that certain experimental data types are more influential than others for eliminating structural ensemble models, while also finding equally probable disordered ensembles that have alternative structural properties that will stimulate further experiments to discriminate between them.
Submitted 29 December, 2019;
originally announced December 2019.
-
Accurate Prediction of Chemical Shifts for Aqueous Protein Structure for "Real World" Cases using Machine Learning
Authors:
Jie Li,
Kochise C. Bennett,
Yuchen Liu,
Michael V. Martin,
Teresa Head-Gordon
Abstract:
Accurate prediction of NMR chemical shifts can in principle help refine aqueous solution structure of proteins to the quality of X-ray structures. We report a new machine learning algorithm for protein chemical shift prediction that outperforms existing chemical shift calculators on realistic NMR solution data. Our UCBShift predictor implements two modules: a transfer prediction module that employs both sequence and structural alignment to select reference candidates for experimental chemical shift replication, and a redesigned machine learning module based on random forest regression which utilizes more, and more carefully curated, feature extracted data. When combined together, this new predictor achieves state of the art accuracy for predicting chemical shifts on a "real-world" dataset, with root-mean-square errors of 0.31 ppm for amide hydrogens, 0.19 ppm for Halpha, 0.87 ppm for C, 0.81 ppm for Calpha, 1.01 ppm for Cbeta, and 1.83 ppm for N, exceeding current prediction accuracy of popular chemical shift predictors such as SPARTA+ and SHIFTX2.
Submitted 5 December, 2019;
originally announced December 2019.
-
Diels-Alder Reactions in Water are Determined by Microsolvation
Authors:
Luis Ruiz Pestana,
Hongxia Hao,
Teresa Head-Gordon
Abstract:
Nanoconfined aqueous environments and the recent advent of accelerated chemistry in microdroplets are increasingly being investigated for catalysis. The mechanisms underlying the enhanced reactivity in alternate solvent environments, and whether the enhanced reactivity due to nanoconfinement is a universal phenomenon, are not fully understood. Here, we use ab initio molecular dynamics simulations to characterize the free energy of a retro-Diels-Alder reaction in bulk water at very different densities and in water nanoconfined by parallel graphene sheets. We find that the broadly different global solvation environments accelerate the reactions to a similar degree with respect to the gas phase reaction, with activation free energies that do not differ by more than $k_BT$ from each other. The reason for the same acceleration factor in the extremely different solvation environments is that it is the microsolvation of the dienophile's carbonyl group that governs the transition state stabilization and mechanism, which is not significantly disrupted by either the lower density in bulk water or the strong nanoconfinement conditions used here. Our results also suggest that significant acceleration of Diels-Alder reactions in microdroplets or on-water conditions can't arise from local microsolvation when water is present, but instead must come from highly altered reaction environments that drastically change the reaction mechanisms.
Submitted 5 December, 2019; v1 submitted 9 November, 2019;
originally announced November 2019.
-
Interplay of Water and a Supramolecular Capsule for Catalysis of Reductive Elimination Reaction from Gold
Authors:
Valerie Vaissier Welborn,
Wan-Lu Li,
Teresa Head-Gordon
Abstract:
Supramolecular assemblies have gained tremendous attention due to their apparent ability to catalyze reactions with the efficiencies of natural enzymes. Using Born-Oppenheimer molecular dynamics and density functional theory, we identify the origin of the catalytic power of the supramolecular assembly Ga$_4$L$_6^{12-}$ on the reductive elimination reaction from gold complexes and their similarity to enzymes. By comparing the catalyzed and uncatalyzed reaction in explicit solvent to identify the reaction free energies of the reactants, transition states, and products, we determine that a catalytic moiety -- an encapsulated water molecule -- generates electric fields that contribute a significant reduction in the activation free energy. Although this is unlike the biomimetic scenario of catalysis through direct host-guest interactions, the nanocage host preconditions the transition state for greater sensitivity to electric field projections onto the breaking carbon bonds to complete the reductive elimination reaction with greater catalytic efficiency. However, it is also shown that the nanocage poorly organizes the interfacial water, which in turn creates electric fields that misalign with the breaking bonds of the substrate, thus identifying new opportunities for catalytic design improvements in nanocage assemblies.
Submitted 5 December, 2019; v1 submitted 28 August, 2019;
originally announced August 2019.
-
A Multi-Resolution 3D-DenseNet for Chemical Shift Prediction in NMR Crystallography
Authors:
Shuai Liu,
Jie Li,
Kochise C. Bennett,
Brad Ganoe,
Tim Stauch,
Martin Head-Gordon,
Alexander Hexemer,
Daniela Ushizima,
Teresa Head-Gordon
Abstract:
We have developed a deep learning algorithm for chemical shift prediction for atoms in molecular crystals that utilizes an atom-centered Gaussian density model for the 3D data representation of a molecule. We define multiple channels that describe different spatial resolutions for each atom type and that utilize cropping, pooling, and concatenation to create a multi-resolution 3D-DenseNet architecture (MR-3D-DenseNet). Because the training and testing time scale linearly with the number of samples, the MR-3D-DenseNet can exploit data augmentation that takes into account the property of rotational invariance of the chemical shifts, thereby also increasing the size of the training dataset by an order of magnitude without additional cost. We obtain very good agreement for 13C, 15N, and 17O chemical shifts, with the highest accuracy found for 1H chemical shifts that is equivalent to the best predictions using ab initio quantum chemistry methods.
Submitted 31 May, 2019;
originally announced June 2019.
-
Inertial Extended-Lagrangian Scheme for Solving Charge Equilibration Models
Authors:
Itai Leven,
Teresa Head-Gordon
Abstract:
The inertial extended Lagrangian/self-consistent field scheme (iEL-SCF) has been adopted for solving charge equilibration in LAMMPS as part of the reactive force field ReaxFF, which due to the charge conservation constraint requires solving two linear systems of equations for the new charges at each molecular dynamics time-step. Therefore, the extended Lagrangian for charge equilibration is comprised of two auxiliary variables for the intermediate charges which serve as an initial guess for the real charges. We show that the iEL-SCF is able to reduce the number of SCF cycles by 50-80% of the original conjugate gradient self-consistent field solver as tested across diverse systems including water, ferric hydroxide, nitramine RDX, and hexanitrostilbene.
Submitted 27 May, 2019; v1 submitted 26 May, 2019;
originally announced May 2019.
-
Development of an Advanced Force Field for Water using Variational Energy Decomposition Analysis
Authors:
A. K. Das,
L. Urban,
I. Leven,
M. Loipersberger,
A. Aldossary,
M. Head-Gordon,
T. Head-Gordon
Abstract:
Given the piecewise approach to modeling intermolecular interactions for force fields, they can be difficult to parameterize since they are fit to data like total energies that only indirectly connect to their separable functional forms. Furthermore, by neglecting certain types of molecular interactions such as charge penetration and charge transfer, most classical force fields must rely on, but do not always demonstrate, how cancellation of errors occurs among the remaining molecular interactions accounted for such as exchange repulsion, electrostatics, and polarization. In this work we present the first generation of the (many-body) MB-UCB force field that explicitly accounts for the decomposed molecular interactions commensurate with a variational energy decomposition analysis, including charge transfer, with force field design choices that reduce the computational expense of the MB-UCB potential while remaining accurate. We optimize parameters using only single water molecule and water cluster data up through pentamers, with no fitting to condensed phase data, and we demonstrate that high accuracy is maintained when the force field is subsequently validated against conformational energies of larger water cluster data sets, radial distribution functions of the liquid phase, and the temperature dependence of thermodynamic and transport water properties. We conclude that MB-UCB is comparable in performance to MB-Pol, but is less expensive and more transferable by eliminating the need to represent short-ranged interactions through large parameter fits to high order polynomials.
Submitted 27 July, 2019; v1 submitted 19 May, 2019;
originally announced May 2019.
-
Fluctuations of Electric Fields in the Active Site of the Enzyme Ketosteroid Isomerase
Authors:
Valerie Vaissier Welborn,
Teresa Head-Gordon
Abstract:
We report the effect of conformational dynamics on the fluctuations of electric fields in the active site of the enzyme Ketosteroid Isomerase (KSI). While KSI is considered rigid with little conformational variation to support different stages of the catalytic cycle, we show that KSI utilizes cooperative side chain motions of the entire protein scaffold outside the active site, which contribute negligibly to the electric fields on the substrate, by progressively stabilizing electric field contributions by particular active site residues at different timescales. The design of synthetic enzymes could benefit from strategies that can take advantage of the dynamics by using electric field fluctuations as a guide.
Submitted 17 May, 2019;
originally announced May 2019.
-
Water is not a Dynamic Polydisperse Branched Polymer
Authors:
Teresa Head-Gordon,
Francesco Paesani
Abstract:
The contributed paper by Naserifar and Goddard reports that their RexPoN water model under ambient conditions simulates liquid water as a dynamic polydisperse branched polymer, which they speculate explains the existence of the liquid-liquid critical point (LLCP) in the supercooled region. Our work addresses several serious factual errors and needless speculation in their paper about their interpretation of their model and its implication for the LLCP in supercooled water.
Submitted 23 May, 2019; v1 submitted 10 May, 2019;
originally announced May 2019.
-
Convergence of Stochastic-extended Lagrangian molecular dynamics method for polarizable force field simulation
Authors:
Dong An,
Sara Y. Cheng,
Teresa Head-Gordon,
Lin Lin,
Jianfeng Lu
Abstract:
Extended Lagrangian molecular dynamics (XLMD) is a general method for performing molecular dynamics simulations using quantum and classical many-body potentials. Recently several new XLMD schemes have been proposed and tested on several classes of many-body polarization models such as induced dipoles or Drude charges, by creating an auxiliary set of these same degrees of freedom that are reversibly integrated through time. This gives rise to a singularly perturbed Hamiltonian system that provides a good approximation to the time evolution of the real mutual polarization field. To further improve upon the accuracy of the XLMD dynamics, and to potentially extend it to other many-body potentials, we introduce a stochastic modification which leads to a set of singularly perturbed Langevin equations with degenerate noise. We prove that the resulting Stochastic-XLMD converges to the accurate dynamics, and the convergence rate is both optimal and is independent of the accuracy of the initial polarization field. We carefully study the scaling of the damping factor and numerical noise for efficient numerical simulation for Stochastic-XLMD, and we demonstrate the effectiveness of the method for model polarizable force field systems.
Submitted 27 February, 2020; v1 submitted 26 April, 2019;
originally announced April 2019.
-
Strong Anisotropy in Liquid Water upon Librational Excitation using Terahertz Laser Fields
Authors:
Fabio Novelli,
Luis Ruiz Pestana,
Kochise C. Bennett,
Federico Sebastiani,
Ellen M. Adams,
Nikolas Stavrias,
Thorsten Ockelmann,
Alejandro Colchero,
Claudius Hoberg,
Gerhard Schwaab,
Teresa Head-Gordon,
Martina Havenith
Abstract:
Tracking the excitation of water molecules in the homogeneous liquid is challenging due to the ultrafast dissipation of rotational excitation energy through the hydrogen-bonded network. Here we demonstrate strong transient anisotropy of liquid water through librational excitation using single-color pump-probe experiments at 12.3 THz. We deduce a third order response of $\chi^{(3)}$ exceeding previously reported values in the optical range by three orders of magnitude. Using a theory that replaces the nonlinear response with a material response property amenable to molecular dynamics simulation, we show that the rotationally damped motion of water molecules in the librational band is resonantly driven at this frequency, which could explain the enhancement of the anisotropy in the liquid by the external Terahertz field. By addition of salt (MgSO4), the hydration water is instead dominated by the local electric field of the ions, resulting in reduction of water molecules that can be dynamically perturbed by THz pulses.
Submitted 9 June, 2020; v1 submitted 12 September, 2018;
originally announced September 2018.
-
Improvements to the APBS biomolecular solvation software suite
Authors:
Elizabeth Jurrus,
Dave Engel,
Keith Star,
Kyle Monson,
Juan Brandi,
Lisa E. Felberg,
David H. Brookes,
Leighton Wilson,
Jiahui Chen,
Karina Liles,
Minju Chun,
Peter Li,
David W. Gohara,
Todd Dolinsky,
Robert Konecny,
David R. Koes,
Jens Erik Nielsen,
Teresa Head-Gordon,
Weihua Geng,
Robert Krasny,
Guo Wei Wei,
Michael J. Holst,
J. Andrew McCammon,
Nathan A. Baker
Abstract:
The Adaptive Poisson-Boltzmann Solver (APBS) software was developed to solve the equations of continuum electrostatics for large biomolecular assemblages and has provided impact in the study of a broad range of chemical, biological, and biomedical applications. APBS addresses three key technology challenges for understanding solvation and electrostatics in biomedical applications: accurate and efficient models for biomolecular solvation and electrostatics, robust and scalable software for applying those theories to biomolecular systems, and mechanisms for sharing and analyzing biomolecular electrostatics data in the scientific community. To address new research applications and advancing computational capabilities, we have continually updated APBS and its suite of accompanying software since its release in 2001. In this manuscript, we discuss the models and capabilities that have recently been implemented within the APBS software package including: a Poisson-Boltzmann analytical and a semi-analytical solver, an optimized boundary element solver, a geometry-based geometric flow solvation model, a graph theory based algorithm for determining p$K_a$ values, and an improved web-based visualization tool for viewing electrostatics.
Submitted 21 August, 2017; v1 submitted 30 June, 2017;
originally announced July 2017.
-
PB-AM: An Open-Source, Fully Analytical Linear Poisson-Boltzmann Solver
Authors:
Lisa E. Feldberg,
David H. Brookes,
Eng-Hui Yap,
Elizabeth Jurrus,
Nathan Baker,
Teresa Head-Gordon
Abstract:
We present the open source distributed software package Poisson-Boltzmann Analytical Method (PB-AM), a fully analytical solution to the linearized Poisson Boltzmann equation, for molecules represented as non-overlapping spherical cavities. The PB-AM software package includes the generation of output files appropriate for visualization using VMD, a Brownian dynamics scheme that uses periodic boundary conditions to simulate dynamics, the ability to specify docking criteria, and offers two different kinetics schemes to evaluate biomolecular association rate constants. Given that PB-AM defines mutual polarization completely and accurately, it can be refactored as a many-body expansion to explore 2- and 3-body polarization. Additionally, the software has been integrated into the Adaptive Poisson-Boltzmann Solver (APBS) software package to make it more accessible to a larger group of scientists, educators and students that are more familiar with the APBS framework.
Submitted 23 September, 2016; v1 submitted 13 August, 2016;
originally announced August 2016.
-
Representability problems for coarse-grained water potentials
Authors:
Margaret E. Johnson,
Teresa Head-Gordon,
Ard A. Louis
Abstract:
The use of an effective intermolecular potential often involves a compromise between more accurate, complex functional forms and more tractable simple representations. To study this choice in detail, we systematically derive coarse-grained isotropic pair potentials that accurately reproduce the oxygen-oxygen radial distribution function of the TIP4P-Ew water model at state points over density ranges from 0.88-1.30g/cc and temperature ranges from 235K-310K. Although by construction these effective potentials correctly represent the isothermal compressibility of TIP4P-Ew water, they do not accurately resolve other thermodynamic properties such as the virial pressure, the internal energy or thermodynamic anomalies. Because at a given state point the pair potential that reproduces the pair structure is unique, we have therefore explicitly demonstrated that it is impossible to simultaneously represent the pair-structure and several key equilibrium thermodynamic properties of water with state-point dependent radially symmetric pair potentials. We argue that such representability problems are related to, but different from, more widely acknowledged transferability problems, and discuss in detail the implications this has for the modeling of water and other liquids by coarse-grained potentials. Nevertheless, regardless of thermodynamic inconsistencies, the state-point dependent effective potentials for water do generate structural and dynamical anomalies.
Submitted 22 February, 2007;
originally announced February 2007.
-
Hydration Water Dynamics and Instigation of Protein Structural Relaxation
Authors:
Daniela Russo,
Greg Hura,
Teresa Head-Gordon
Abstract:
The molecular mechanism of the solvent motion that is required to instigate the protein structural relaxation above a critical hydration level or transition temperature has yet to be determined. In this work we use quasi-elastic neutron scattering (QENS) and molecular dynamics simulation to investigate hydration water dynamics near a greatly simplified protein surface. We consider the hydration water dynamics near the completely deuterated N-acetyl-leucine-methylamide (NALMA) solute, a hydrophobic amino acid side chain attached to a polar blocked polypeptide backbone, as a function of concentration between 0.5M-2.0M, under ambient conditions. In this Communication, we focus our results of hydration dynamics near a model protein surface on the issue of how enzymatic activity is restored once a critical hydration level is reached, and provide a hypothesis for the molecular mechanism of the solvent motion that is required to trigger protein structural relaxation when above the hydration transition.
Submitted 10 October, 2003;
originally announced October 2003.