Search | arXiv e-print repository

Augmenting Human Expertise in Weighted Ensemble Simulations through Deep Learning based Information Bottleneck

Abstract: The weighted ensemble (WE) method stands out as a widely used segment-based sampling technique renowned for its rigorous treatment of kinetics. The WE framework typically involves initially map** the configuration space onto a low-dimensional collective variable (CV) space and then partitioning it into bins. The efficacy of WE simulations heavily depends on the selection of CVs and binning schem… ▽ More The weighted ensemble (WE) method stands out as a widely used segment-based sampling technique renowned for its rigorous treatment of kinetics. The WE framework typically involves initially map** the configuration space onto a low-dimensional collective variable (CV) space and then partitioning it into bins. The efficacy of WE simulations heavily depends on the selection of CVs and binning schemes. The recently proposed State Predictive Information Bottleneck (SPIB) method has emerged as a promising tool for automatically constructing CVs from data and guiding enhanced sampling through an iterative manner. In this work, we advance this data-driven pipeline by incorporating prior expert knowledge. Our hybrid approach combines SPIB-learned CVs to enhance sampling in explored regions with expert-based CVs to guide exploration in regions of interest, synergizing the strengths of both methods. Through benchmarking on alanine dipeptide and chignoin systems, we demonstrate that our hybrid approach effectively guides WE simulations to sample states of interest, and reduces run-to-run variances. Moreover, our integration of the SPIB model also enhances the analysis and interpretation of WE simulation data by effectively identifying metastable states and pathways, and offering direct visualization of dynamics. △ Less

Submitted 20 June, 2024; originally announced June 2024.

arXiv:2404.17722 [pdf, other]

Simulating Crystallization in a Colloidal System Using State Predictive Information Bottleneck based Enhanced Sampling

Authors: Vanessa J. Meraz, Ziyue Zou, Pratyush Tiwary

Abstract: We investigate crystal nucleation in supersaturated colloid suspensions using enhanced molecular dynamics simulations augmented with machine learning techniques. The simulations reveal that crystallization in the model colloidal system studied here, with particles interacting through a repulsive screened Coulomb Yukawa potential, proceeds from vapor to dense liquid droplet to crystalline phases ac… ▽ More We investigate crystal nucleation in supersaturated colloid suspensions using enhanced molecular dynamics simulations augmented with machine learning techniques. The simulations reveal that crystallization in the model colloidal system studied here, with particles interacting through a repulsive screened Coulomb Yukawa potential, proceeds from vapor to dense liquid droplet to crystalline phases across multiple high barriers. Employing a one-dimensional reaction coordinate derived from the State Predictive Information Bottleneck framework, our simulations capture backand-forth phase transitions across multiple barriers effectively in biased metadynamics simulations. We obtain relative free energy differences between different phases and also quantify the roles of different molecular level features in driving the phase changes. △ Less

Submitted 26 April, 2024; originally announced April 2024.

arXiv:2404.07102 [pdf, other]

doi 10.7554/eLife.99702.1

Empowering AlphaFold2 for protein conformation selective drug discovery with AlphaFold2-RAVE

Authors: Xinyu Gu, Akashnathan Aranganathan, Pratyush Tiwary

Abstract: Small molecule drug design hinges on obtaining co-crystallized ligand-protein structures. Despite AlphaFold2's strides in protein native structure prediction, its focus on apo structures overlooks ligands and associated holo structures. Moreover, designing selective drugs often benefits from the targeting of diverse metastable conformations. Therefore, direct application of AlphaFold2 models in vi… ▽ More Small molecule drug design hinges on obtaining co-crystallized ligand-protein structures. Despite AlphaFold2's strides in protein native structure prediction, its focus on apo structures overlooks ligands and associated holo structures. Moreover, designing selective drugs often benefits from the targeting of diverse metastable conformations. Therefore, direct application of AlphaFold2 models in virtual screening and drug discovery remains tentative. Here, we demonstrate an AlphaFold2 based framework combined with all-atom enhanced sampling molecular dynamics and induced fit docking, named AF2RAVE-Glide, to conduct computational model based small molecule binding of metastable protein kinase conformations, initiated from protein sequences. We demonstrate the AF2RAVE-Glide workflow on three different protein kinases and their type I and II inhibitors, with special emphasis on binding of known type II kinase inhibitors which target the metastable classical DFG-out state. These states are not easy to sample from AlphaFold2. Here we demonstrate how with AF2RAVE these metastable conformations can be sampled for different kinases with high enough accuracy to enable subsequent docking of known type II kinase inhibitors with more than 50% success rates across docking calculations. We believe the protocol should be deployable for other kinases and more proteins generally. △ Less

Submitted 4 July, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

Comments: added revised version and DOI link to eLife version

arXiv:2404.02856 [pdf, other]

An Information Bottleneck Approach for Markov Model Construction

Authors: Dedi Wang, Yunrui Qiu, Eric Beyerle, Xuhui Huang, Pratyush Tiwary

Abstract: Markov state models (MSMs) are valuable for studying dynamics of protein conformational changes via statistical analysis of molecular dynamics (MD) simulations. In MSMs, the complex configuration space is coarse-grained into conformational states, with the dynamics modeled by a series of Markovian transitions among these states at discrete lag times. Constructing the Markovian model at a specific… ▽ More Markov state models (MSMs) are valuable for studying dynamics of protein conformational changes via statistical analysis of molecular dynamics (MD) simulations. In MSMs, the complex configuration space is coarse-grained into conformational states, with the dynamics modeled by a series of Markovian transitions among these states at discrete lag times. Constructing the Markovian model at a specific lag time requires state defined without significant internal energy barriers, enabling internal dynamics relaxation within the lag time. This process coarse grains time and space, integrating out rapid motions within metastable states. This work introduces a continuous embedding approach for molecular conformations using the state predictive information bottleneck (SPIB), which unifies dimensionality reduction and state space partitioning via a continuous, machine learned basis set. Without explicit optimization of VAMP-based scores, SPIB demonstrates state-of-the-art performance in identifying slow dynamical processes and constructing predictive multi-resolution Markovian models. When applied to mini-proteins trajectories, SPIB showcases unique advantages compared to competing methods. It automatically adjusts the number of metastable states based on a specified minimal time resolution, eliminating the need for manual tuning. While maintaining efficacy in dynamical properties, SPIB excels in accurately distinguishing metastable states and capturing numerous well-populated macrostates. Furthermore, SPIB's ability to learn a low-dimensional continuous embedding of the underlying MSMs enhances the interpretation of dynamic pathways. Accordingly, we propose SPIB as an easy-to-implement methodology for end-to-end MSM construction. △ Less

Submitted 10 June, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

Comments: 19 pages, 7 figures

arXiv:2403.00925 [pdf, ps, other]

Atomic scale insights into NaCl nucleation in nanoconfined environments

Authors: Ruiyu Wang, Pratyush Tiwary

Abstract: In this work we examine the nucleation from NaCl aqueous solutions within nano-confined environments, employing enhanced sampling molecular dynamics simulations integrated with machine learning-derived reaction coordinates. Through our simulations, we successfully induce phase transitions between solid, liquid, and a hydrated phase, typically observed at lower temperatures in bulk environments. In… ▽ More In this work we examine the nucleation from NaCl aqueous solutions within nano-confined environments, employing enhanced sampling molecular dynamics simulations integrated with machine learning-derived reaction coordinates. Through our simulations, we successfully induce phase transitions between solid, liquid, and a hydrated phase, typically observed at lower temperatures in bulk environments. Interestingly, nano-confinement serves to stabilize the solid phase and elevate melting points. Our simulations explain these findings by underscoring the significant role of water, alongside ion aggregation and subtle, anistropic dielectric behavior, in driving nucleation within nano-constrained environments. This letter thus provides a framework for sampling, analyzing and understanding nucleation processes under nano-confinement. △ Less

Submitted 1 March, 2024; originally announced March 2024.

arXiv:2310.07927 [pdf, other]

Enhanced sampling of Crystal Nucleation with Graph Representation Learnt Variables

Authors: Ziyue Zou, Pratyush Tiwary

Abstract: In this study, we present a graph neural network-based learning approach using an autoencoder setup to derive low-dimensional variables from features observed in experimental crystal structures. These variables are then biased in enhanced sampling to observe state-to-state transitions and reliable thermodynamic weights. Our approach uses simple convolution and pooling methods. To verify the effect… ▽ More In this study, we present a graph neural network-based learning approach using an autoencoder setup to derive low-dimensional variables from features observed in experimental crystal structures. These variables are then biased in enhanced sampling to observe state-to-state transitions and reliable thermodynamic weights. Our approach uses simple convolution and pooling methods. To verify the effectiveness of our protocol, we examined the nucleation of various allotropes and polymorphs of iron and glycine from their molten states. Our graph latent variables when biased in well-tempered metadynamics consistently show transitions between states and achieve accurate free energy calculations in agreement with experiments, both of which are indicators of dependable sampling. This underscores the strength and promise of our graph neural net variables for improved sampling. The protocol shown here should be applicable for other systems and with other sampling methods. △ Less

Submitted 11 October, 2023; originally announced October 2023.

arXiv:2310.03819 [pdf, other]

doi 10.1021/acs.jpcb.3c08304

Thermodynamically Optimized Machine-learned Reaction Coordinates for Hydrophobic Ligand Dissociation

Authors: Eric Beyerle, Pratyush Tiwary

Abstract: Ligand unbinding is mediated by the free energy change, which has intertwined contributions from both energy and entropy. It is important but not easy to quantify their individual contributions. We model hydrophobic ligand unbinding for two systems, a methane particle and a C60 fullerene, both unbinding from hydrophobic pockets in all-atom water. By using a modified deep learning framework, we lea… ▽ More Ligand unbinding is mediated by the free energy change, which has intertwined contributions from both energy and entropy. It is important but not easy to quantify their individual contributions. We model hydrophobic ligand unbinding for two systems, a methane particle and a C60 fullerene, both unbinding from hydrophobic pockets in all-atom water. By using a modified deep learning framework, we learn a thermodynamically optimized reaction coordinate to describe hydrophobic ligand dissociation for both systems. Interpretation of these reaction coordinates reveals the roles of entropic and enthalpic forces as ligand and pocket sizes change. Irrespective of the contrasting roles of energy and entropy, we also find that for both the systems the transition from the bound to unbound states is driven primarily by solvation of the pocket and ligand, independent of ligand size. Our framework thus gives useful thermodynamic insight into hydrophobic ligand dissociation problems that are otherwise difficult to glean. △ Less

Submitted 5 October, 2023; originally announced October 2023.

Comments: 27 pages; 5 figures

arXiv:2309.14054 [pdf, other]

Adapt then Unlearn: Exploiting Parameter Space Semantics for Unlearning in Generative Adversarial Networks

Authors: Piyush Tiwary, Atri Guha, Subhodip Panda, Prathosh A. P

Abstract: The increased attention to regulating the outputs of deep generative models, driven by growing concerns about privacy and regulatory compliance, has highlighted the need for effective control over these models. This necessity arises from instances where generative models produce outputs containing undesirable, offensive, or potentially harmful content. To tackle this challenge, the concept of mach… ▽ More The increased attention to regulating the outputs of deep generative models, driven by growing concerns about privacy and regulatory compliance, has highlighted the need for effective control over these models. This necessity arises from instances where generative models produce outputs containing undesirable, offensive, or potentially harmful content. To tackle this challenge, the concept of machine unlearning has emerged, aiming to forget specific learned information or to erase the influence of undesired data subsets from a trained model. The objective of this work is to prevent the generation of outputs containing undesired features from a pre-trained GAN where the underlying training data set is inaccessible. Our approach is inspired by a crucial observation: the parameter space of GANs exhibits meaningful directions that can be leveraged to suppress specific undesired features. However, such directions usually result in the degradation of the quality of generated samples. Our proposed method, known as 'Adapt-then-Unlearn,' excels at unlearning such undesirable features while also maintaining the quality of generated samples. This method unfolds in two stages: in the initial stage, we adapt the pre-trained GAN using negative samples provided by the user, while in the subsequent stage, we focus on unlearning the undesired feature. During the latter phase, we train the pre-trained GAN using positive samples, incorporating a repulsion regularizer. This regularizer encourages the model's parameters to be away from the parameters associated with the adapted model from the first stage while also maintaining the quality of generated samples. To the best of our knowledge, our approach stands as first method addressing unlearning in GANs. We validate the effectiveness of our method through comprehensive experiments. △ Less

Submitted 25 September, 2023; originally announced September 2023.

Comments: 15 pages, 12 figures

arXiv:2309.09284 [pdf, other]

doi 10.1021/acs.jpcb.3c06735

Is the Local Ion Density Sufficient to Drive NaCl Nucleation from the Melt and Aqueous Solution?

Authors: Ruiyu Wang, Shams Mehdi, Ziyue Zou, Pratyush Tiwary

Abstract: Even though nucleation is ubiquitous in different science and engineering problems, investigating nucleation is extremely difficult due to the complicated ranges of time and length scales involved. In this work, we simulate NaCl nucleation in both molten and aqueous environments using enhanced sampling all-atom molecular dynamics with deep learning-based estimation of reaction coordinates. By inco… ▽ More Even though nucleation is ubiquitous in different science and engineering problems, investigating nucleation is extremely difficult due to the complicated ranges of time and length scales involved. In this work, we simulate NaCl nucleation in both molten and aqueous environments using enhanced sampling all-atom molecular dynamics with deep learning-based estimation of reaction coordinates. By incorporating various structural order parameters and learning the reaction coordinate as a function thereof, we achieve significantly improved sampling relative to traditional ad hoc descriptions of what drives nucleation, particularly in the aqueous medium. Our results reveal a one-step nucleation mechanism in both environments, with reaction coordinate analysis highlighting the importance of local ion density in distinguishing solid and liquid states. However, while fluctuations in the local ion density are necessary to drive nucleation, they are not sufficient. Our analysis shows that near the transition states, descriptors such as enthalpy and local structure become crucial. Our protocol proposed here enables robust nucleation analysis and phase sampling, and could offer insights into nucleation mechanisms for generic small molecules in different environments. △ Less

Submitted 27 December, 2023; v1 submitted 17 September, 2023; originally announced September 2023.

arXiv:2309.03649 [pdf, other]

Exploring kinase DFG loop conformational stability with AlphaFold2-RAVE

Authors: Bodhi P. Vani, Akashnathan Aranganathan, Pratyush Tiwary

Abstract: Kinases compose one of the largest fractions of the human proteome, and their misfunction is implicated in many diseases, in particular cancers. The ubiquitousness and structural similarities of kinases makes specific and effective drug design difficult. In particular, conformational variability due to the evolutionarily conserved DFG motif adopting in and out conformations and the relative stabil… ▽ More Kinases compose one of the largest fractions of the human proteome, and their misfunction is implicated in many diseases, in particular cancers. The ubiquitousness and structural similarities of kinases makes specific and effective drug design difficult. In particular, conformational variability due to the evolutionarily conserved DFG motif adopting in and out conformations and the relative stabilities thereof are key in structure-based drug design for ATP competitive drugs. These relative conformational stabilities are extremely sensitive to small changes in sequence, and provide an important problem for sampling method development. Since the invention of AlphaFold2, the world of structure-based drug design has noticably changed. In spite of it being limited to crystal-like structure prediction, several methods have also leveraged its underlying architecture to improve dynamics and enhanced sampling of conformational ensembles, including AlphaFold2-RAVE. Here, we extend AlphaFold2-RAVE and apply it to a set of kinases: the wild type DDR1 sequence and three mutants with single point mutations that are known to behave drastically differently. We show that AlphaFold2-RAVE is able to efficiently recover the changes in relative stability using transferable learnt order parameters and potentials, thereby supplementing AlphaFold2 as a tool for exploration of Boltzmann-weighted protein conformations. △ Less

Submitted 7 September, 2023; originally announced September 2023.

arXiv:2308.14885 [pdf, other]

Inferring phase transitions and critical exponents from limited observations with Thermodynamic Maps

Authors: Lukas Herron, Kinjal Mondal, John S. Schneekloth, Pratyush Tiwary

Abstract: Phase transitions are ubiquitous across life, yet hard to quantify and describe accurately. In this work, we develop an approach for characterizing generic attributes of phase transitions from very limited observations made deep within different phases' domains of stability. Our approach is called Thermodynamic Maps, which combines statistical mechanics and molecular simulations with score-based g… ▽ More Phase transitions are ubiquitous across life, yet hard to quantify and describe accurately. In this work, we develop an approach for characterizing generic attributes of phase transitions from very limited observations made deep within different phases' domains of stability. Our approach is called Thermodynamic Maps, which combines statistical mechanics and molecular simulations with score-based generative models. Thermodynamic Maps enable learning the temperature dependence of arbitrary thermodynamic observables across a wide range of temperatures. We show its usefulness by calculating phase transition attributes such as melting temperature, temperature-dependent heat capacities, and critical exponents. For instance, we demonstrate the ability of thermodynamic maps to infer the ferromagnetic phase transition of the Ising model, including temperature-dependent heat capacity and critical exponents, despite never having seen samples from the transition region. In addition, we efficiently characterize the temperature-dependent conformational ensemble and compute melting curves of the two RNA systems GCAA tetraloop and HIV-TAR, which are notoriously hard to sample due to glassy-like landscapes. △ Less

Submitted 28 August, 2023; originally announced August 2023.

arXiv:2307.13189 [pdf, other]

Quantifying the relevance of long-range forces for crystal nucleation in water

Authors: Renjie Zhao, Ziyue Zou, John D. Weeks, Pratyush Tiwary

Abstract: Understanding nucleation from aqueous solutions is of fundamental importance in a multitude of fields, ranging from materials science to biophysics. The complex solvent-mediated interactions in aqueous solutions hamper the development of a simple physical picture elucidating the roles of different interactions in nucleation processes. In this work we make use of three complementary techniques to d… ▽ More Understanding nucleation from aqueous solutions is of fundamental importance in a multitude of fields, ranging from materials science to biophysics. The complex solvent-mediated interactions in aqueous solutions hamper the development of a simple physical picture elucidating the roles of different interactions in nucleation processes. In this work we make use of three complementary techniques to disentangle the role played by short and long-range interactions in solvent mediated nucleation. Specifically, the first approach we utilize is the local molecular field (LMF) theory to renormalize long-range Coulomb electrostatics. Secondly, we use well-tempered metadynamics to speed up rare events governed by short-range interactions. Thirdly, deep learning-based State Predictive Information Bottleneck approach is employed in analyzing the reaction coordinate of the nucleation processes obtained from LMF treatment coupled with well-tempered metadynamics. We find that the two-step nucleation mechanism can largely be captured by the short-range interactions, while the long-range interactions further contribute to the stability of the primary crystal state at ambient conditions. Furthermore, by analyzing the reaction coordinate obtained from combined LMF-metadynamics treatment, we discern the fluctuations on different time scales, highlighting the need for long-range interactions when accounting for metastability. △ Less

Submitted 24 August, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

arXiv:2306.14977 [pdf]

Heat Conductance of the Quantum Hall Bulk

Authors: Ron Aharon Melcer, Avigail Gil, Arup-Kumar Paul, Priya Tiwary, Vladimir Umansky, Moty Heiblum, Yuval Oreg, Ady Stern, Erez Berg

Abstract: The Quantum Hall Effect (QHE) is a prototypical realization of a topological state of matter. It emerges from a subtle interplay between topology, interactions, and disorder. The disorder enables the formation of localized states in the bulk that stabilize the quantum Hall states with respect to the magnetic field and carrier density. Still, the details of the localized states and their contributi… ▽ More The Quantum Hall Effect (QHE) is a prototypical realization of a topological state of matter. It emerges from a subtle interplay between topology, interactions, and disorder. The disorder enables the formation of localized states in the bulk that stabilize the quantum Hall states with respect to the magnetic field and carrier density. Still, the details of the localized states and their contribution to transport remain beyond the reach of most experimental techniques. Here, we describe an extensive study of the bulk's heat conductance. Using a novel 'multi-terminal' short device (on a scale of $10 μm$), we separate the longitudinal thermal conductance, $κ_{xx}T$ (due to bulk's contribution), from the topological transverse value $κ_{xy}T$, by eliminating the contribution of the edge modes. When the magnetic field is tuned away from the conductance plateau center, the localized states in the bulk conduct heat efficiently ($κ_{xx}T \propto T$), while the bulk remains electrically insulating. Fractional states in the first excited Landau level, such as the $ν=7/3$ and $ν=5/2$, conduct heat throughout the plateau with a finite $κ_{xx} T$. We propose a theoretical model that identifies the localized states as the cause of the finite heat conductance, agreeing qualitatively with our experimental findings. △ Less

Submitted 15 September, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

Comments: 30 pages 11 figures

arXiv:2306.11688 [pdf, other]

doi 10.1038/s41524-024-01259-w

JARVIS-Leaderboard: A Large Scale Benchmark of Materials Design Methods

Authors: Kamal Choudhary, Daniel Wines, Kangming Li, Kevin F. Garrity, Vishu Gupta, Aldo H. Romero, Jaron T. Krogel, Kayahan Saritas, Addis Fuhr, Panchapakesan Ganesh, Paul R. C. Kent, Keqiang Yan, Yuchao Lin, Shuiwang Ji, Ben Blaiszik, Patrick Reiser, Pascal Friederich, Ankit Agrawal, Pratyush Tiwary, Eric Beyerle, Peter Minch, Trevor David Rhone, Ichiro Takeuchi, Robert B. Wexler, Arun Mannodi-Kanakkithodi , et al. (13 additional authors not shown)

Abstract: Lack of rigorous reproducibility and validation are major hurdles for scientific development across many fields. Materials science in particular encompasses a variety of experimental and theoretical approaches that require careful benchmarking. Leaderboard efforts have been developed previously to mitigate these issues. However, a comprehensive comparison and benchmarking on an integrated platform… ▽ More Lack of rigorous reproducibility and validation are major hurdles for scientific development across many fields. Materials science in particular encompasses a variety of experimental and theoretical approaches that require careful benchmarking. Leaderboard efforts have been developed previously to mitigate these issues. However, a comprehensive comparison and benchmarking on an integrated platform with multiple data modalities with both perfect and defect materials data is still lacking. This work introduces JARVIS-Leaderboard, an open-source and community-driven platform that facilitates benchmarking and enhances reproducibility. The platform allows users to set up benchmarks with custom tasks and enables contributions in the form of dataset, code, and meta-data submissions. We cover the following materials design categories: Artificial Intelligence (AI), Electronic Structure (ES), Force-fields (FF), Quantum Computation (QC) and Experiments (EXP). For AI, we cover several types of input data, including atomic structures, atomistic images, spectra, and text. For ES, we consider multiple ES approaches, software packages, pseudopotentials, materials, and properties, comparing results to experiment. For FF, we compare multiple approaches for material property predictions. For QC, we benchmark Hamiltonian simulations using various quantum algorithms and circuits. Finally, for experiments, we use the inter-laboratory approach to establish benchmarks. There are 1281 contributions to 274 benchmarks using 152 methods with more than 8 million data-points, and the leaderboard is continuously expanding. The JARVIS-Leaderboard is available at the website: https://pages.nist.gov/jarvis_leaderboard △ Less

Submitted 26 March, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

arXiv:2306.09111 [pdf, other]

Enhanced Sampling with Machine Learning: A Review

Authors: Shams Mehdi, Zachary Smith, Lukas Herron, Ziyue Zou, Pratyush Tiwary

Abstract: Molecular dynamics (MD) enables the study of physical systems with excellent spatiotemporal resolution but suffers from severe time-scale limitations. To address this, enhanced sampling methods have been developed to improve exploration of configurational space. However, implementing these is challenging and requires domain expertise. In recent years, integration of machine learning (ML) technique… ▽ More Molecular dynamics (MD) enables the study of physical systems with excellent spatiotemporal resolution but suffers from severe time-scale limitations. To address this, enhanced sampling methods have been developed to improve exploration of configurational space. However, implementing these is challenging and requires domain expertise. In recent years, integration of machine learning (ML) techniques in different domains has shown promise, prompting their adoption in enhanced sampling as well. Although ML is often employed in various fields primarily due to its data-driven nature, its integration with enhanced sampling is more natural with many common underlying synergies. This review explores the merging of ML and enhanced MD by presenting different shared viewpoints. It offers a comprehensive overview of this rapidly evolving field, which can be difficult to stay updated on. We highlight successful strategies like dimensionality reduction, reinforcement learning, and flow-based methods. Finally, we discuss open problems at the exciting ML-enhanced MD interface. △ Less

Submitted 16 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

Comments: Submitted as invited article to Annual Review of Physical Chemistry vol 75; updated formatting issues

arXiv:2304.13815 [pdf, other]

doi 10.1016/j.cossms.2023.101093

Recent advances in describing and driving crystal nucleation using machine learning and artificial intelligence

Authors: Eric R. Beyerle, Ziyue Zou, Pratyush Tiwary

Abstract: With the advent of faster computer processors and especially graphics processing units (GPUs) over the last few decades, the use of data-intensive machine learning (ML) and artificial intelligence (AI) has increased greatly, and the study of crystal nucleation has been one of the beneficiaries. In this review, we outline how ML and AI have been applied to address four outstanding difficulties of c… ▽ More With the advent of faster computer processors and especially graphics processing units (GPUs) over the last few decades, the use of data-intensive machine learning (ML) and artificial intelligence (AI) has increased greatly, and the study of crystal nucleation has been one of the beneficiaries. In this review, we outline how ML and AI have been applied to address four outstanding difficulties of crystal nucleation: how to discover better reaction coordinates (RCs) for describing accurately non-classical nucleation situations; the development of more accurate force fields for describing the nucleation of multiple polymorphs or phases for a single system; more robust identification methods for determining crystal phases and structures; and as a method to yield improved course-grained models for studying nucleation. △ Less

Submitted 1 May, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

Comments: 15 pages; 1 figure

arXiv:2303.11278 [pdf, other]

Bayesian Pseudo-Coresets via Contrastive Divergence

Authors: Piyush Tiwary, Kumar Shubham, Vivek V. Kashyap, Prathosh A. P

Abstract: Bayesian methods provide an elegant framework for estimating parameter posteriors and quantification of uncertainty associated with probabilistic models. However, they often suffer from slow inference times. To address this challenge, Bayesian Pseudo-Coresets (BPC) have emerged as a promising solution. BPC methods aim to create a small synthetic dataset, known as pseudo-coresets, that approximates… ▽ More Bayesian methods provide an elegant framework for estimating parameter posteriors and quantification of uncertainty associated with probabilistic models. However, they often suffer from slow inference times. To address this challenge, Bayesian Pseudo-Coresets (BPC) have emerged as a promising solution. BPC methods aim to create a small synthetic dataset, known as pseudo-coresets, that approximates the posterior inference achieved with the original dataset. This approximation is achieved by optimizing a divergence measure between the true posterior and the pseudo-coreset posterior. Various divergence measures have been proposed for constructing pseudo-coresets, with forward Kullback-Leibler (KL) divergence being the most successful. However, using forward KL divergence necessitates sampling from the pseudo-coreset posterior, often accomplished through approximate Gaussian variational distributions. Alternatively, one could employ Markov Chain Monte Carlo (MCMC) methods for sampling, but this becomes challenging in high-dimensional parameter spaces due to slow mixing. In this study, we introduce a novel approach for constructing pseudo-coresets by utilizing contrastive divergence. Importantly, optimizing contrastive divergence eliminates the need for approximations in the pseudo-coreset construction process. Furthermore, it enables the use of finite-step MCMC methods, alleviating the requirement for extensive mixing to reach a stationary distribution. To validate our method's effectiveness, we conduct extensive experiments on multiple datasets, demonstrating its superiority over existing BPC techniques. △ Less

Submitted 8 May, 2024; v1 submitted 20 March, 2023; originally announced March 2023.

Comments: Accepted at UAI 2024

arXiv:2210.04822 [pdf, other]

doi 10.1073/pnas.2216099120

Driving and characterizing nucleation of urea and glycine polymorphs in water

Authors: Ziyue Zou, Eric Beyerle, Sun-Ting Tsai, Pratyush Tiwary

Abstract: Crystal nucleation is relevant across the domains of fundamental and applied sciences. However, in many cases its mechanism remains unclear due to a lack of temporal or spatial resolution. To gain insights to the molecular details of nucleation, some form of molecular dynamics simulations is typically performed; these simulations, in turn, are limited by their ability to run long enough to sample… ▽ More Crystal nucleation is relevant across the domains of fundamental and applied sciences. However, in many cases its mechanism remains unclear due to a lack of temporal or spatial resolution. To gain insights to the molecular details of nucleation, some form of molecular dynamics simulations is typically performed; these simulations, in turn, are limited by their ability to run long enough to sample the nucleation event thoroughly. To overcome the timescale limits in typical molecular dynamics simulations in a manner free of prior human bias, here we employ the machine learning augmented molecular dynamics framework ``Reweighted Autoencoded Variational Bayes for enhanced sampling (RAVE)". We study two molecular systems, urea and glycine in explicit all-atom water, due to their enrichment in polymorphic structures and common utility in commercial applications. From our simulations, we observe multiple back-and-forth liquid-solid transitions of different polymorphs and from these trajectories calculate the polymorph stability relative to the dissolved liquid state. We further observe that the obtained reaction coordinates and transitions are highly non-classical. △ Less

Submitted 2 December, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

Comments: 12 pages, 7 figures

arXiv:2209.00905 [pdf, other]

From latent dynamics to meaningful representations

Authors: Dedi Wang, Yihang Wang, Luke Evans, Pratyush Tiwary

Abstract: While representation learning has been central to the rise of machine learning and artificial intelligence, a key problem remains in making the learned representations meaningful. For this, the typical approach is to regularize the learned representation through prior probability distributions. However, such priors are usually unavailable or are ad hoc. To deal with this, recent efforts have shift… ▽ More While representation learning has been central to the rise of machine learning and artificial intelligence, a key problem remains in making the learned representations meaningful. For this, the typical approach is to regularize the learned representation through prior probability distributions. However, such priors are usually unavailable or are ad hoc. To deal with this, recent efforts have shifted towards leveraging the insights from physical principles to guide the learning process. In this spirit, we propose a purely dynamics-constrained representation learning framework. Instead of relying on predefined probabilities, we restrict the latent representation to follow overdamped Langevin dynamics with a learnable transition density - a prior driven by statistical mechanics. We show this is a more natural constraint for representation learning in stochastic dynamical systems, with the crucial ability to uniquely identify the ground truth representation. We validate our framework for different systems including a real-world fluorescent DNA movie dataset. We show that our algorithm can uniquely identify orthogonal, isometric and meaningful latent representations. △ Less

Submitted 9 April, 2024; v1 submitted 2 September, 2022; originally announced September 2022.

arXiv:2208.13772 [pdf, other]

doi 10.1063/5.0122990

Computing committors via Mahalanobis diffusion maps with enhanced sampling data

Authors: Luke Evans, Maria K. Cameron, Pratyush Tiwary

Abstract: The study of phenomena such as protein folding and conformational changes in molecules is a central theme in chemical physics. Molecular dynamics (MD) simulation is the primary tool for the study of transition processes in biomolecules, but it is hampered by a huge timescale gap between the processes of interest and atomic vibrations which dictate the time step size. Therefore, it is imperative to… ▽ More The study of phenomena such as protein folding and conformational changes in molecules is a central theme in chemical physics. Molecular dynamics (MD) simulation is the primary tool for the study of transition processes in biomolecules, but it is hampered by a huge timescale gap between the processes of interest and atomic vibrations which dictate the time step size. Therefore, it is imperative to combine MD simulations with other techniques in order to quantify the transition processes taking place on large timescales. In this work, the diffusion map with Mahalanobis kernel, a meshless approach for approximating the Backward Kolmogorov Operator (BKO) in collective variables, is upgraded to incorporate standard enhanced sampling techniques such as metadynamics. The resulting algorithm, which we call the "target measure Mahalanobis diffusion map" (tm-mmap), is suitable for a moderate number of collective variables in which one can approximate the diffusion tensor and free energy. Imposing appropriate boundary conditions allows use of the approximated BKO to solve for the committor function and utilization of transition path theory to find the reactive current delineating the transition channels and the transition rate. The proposed algorithm, tm-mmap, is tested on the two-dimensional Moro-Cardin two-well system with position-dependent diffusion coefficient and on alanine dipeptide in two collective variables where the committor, the reactive current, and the transition rate are compared to those computed by the finite element method (FEM). Finally, tm-mmap is applied to alanine dipeptide in four collective variables where the use of finite elements is infeasible. △ Less

Submitted 2 November, 2022; v1 submitted 26 August, 2022; originally announced August 2022.

Comments: Restructured introduction, improved explanation of key algorithms and formulas (Section II.C and III.B,C). Streamlined presentation and proof of Theorem 1

arXiv:2206.13475 [pdf, other]

Thermodynamics-inspired Explanations of Artificial Intelligence

Authors: Shams Mehdi, Pratyush Tiwary

Abstract: In recent years, predictive machine learning methods have gained prominence in various scientific domains. However, due to their black-box nature, it is essential to establish trust in these models before accepting them as accurate. One promising strategy for assigning trust involves employing explanation techniques that elucidate the rationale behind a black-box model's predictions in a manner th… ▽ More In recent years, predictive machine learning methods have gained prominence in various scientific domains. However, due to their black-box nature, it is essential to establish trust in these models before accepting them as accurate. One promising strategy for assigning trust involves employing explanation techniques that elucidate the rationale behind a black-box model's predictions in a manner that humans can understand. However, assessing the degree of human interpretability of the rationale generated by such methods is a nontrivial challenge. In this work, we introduce interpretation entropy as a universal solution for assessing the degree of human interpretability associated with any linear model. Using this concept and drawing inspiration from classical thermodynamics, we present Thermodynamics-inspired Explainable Representations of AI and other black-box Paradigms (TERP), a method for generating accurate, and human-interpretable explanations for black-box predictions in a model-agnostic manner. To demonstrate the wide-ranging applicability of TERP, we successfully employ it to explain various black-box model architectures, including deep learning Autoencoders, Recurrent Neural Networks, and Convolutional Neural Networks, across diverse domains such as molecular simulations, text, and image classification. △ Less

Submitted 8 April, 2024; v1 submitted 27 June, 2022; originally announced June 2022.

Comments: revised theory and examples

arXiv:2203.07560 [pdf, other]

Quantifying Energetic and Entropic Pathways in Molecular Systems

Authors: E. R. Beyerle, Shams Mehdi, Pratyush Tiwary

Abstract: When examining dynamics occurring at non-zero temperatures, both energy and entropy must be taken into account while describing activated barrier crossing events. Furthermore, good reaction coordinates need to be constructed to describe different metastable states and the transition mechanisms between them. Here we use a physics-based machine learning method called the State Predictive Information… ▽ More When examining dynamics occurring at non-zero temperatures, both energy and entropy must be taken into account while describing activated barrier crossing events. Furthermore, good reaction coordinates need to be constructed to describe different metastable states and the transition mechanisms between them. Here we use a physics-based machine learning method called the State Predictive Information Bottleneck (SPIB) to find non-linear reaction coordinates for three systems of varying complexity. The SPIB is able to predict correctly an entropic bottleneck for an analytical flat-energy double-well system and identify the entropy- and energy-dominated pathways for an analytical four-well system. Finally, for a simulation of benzoic acid permeation through a lipid bilayer, SPIB is able to discover the entropic and energetic barriers to the permeation process. Given these results, we thus establish that SPIB is a reasonable and robust method for finding the important entropy and energy/enthalpy barriers in physical systems, which can then be used for enhanced understanding and sampling of different activated mechanisms. △ Less

Submitted 14 March, 2022; originally announced March 2022.

arXiv:2203.00597 [pdf, other]

doi 10.1038/s41467-022-34780-x

Path sampling of recurrent neural networks by incorporating known physics

Authors: Sun-Ting Tsai, Eric Fields, Yijia Xu, En-Jui Kuo, Pratyush Tiwary

Abstract: Recurrent neural networks have seen widespread use in modeling dynamical systems in varied domains such as weather prediction, text prediction and several others. Often one wishes to supplement the experimentally observed dynamics with prior knowledge or intuition about the system. While the recurrent nature of these networks allows them to model arbitrarily long memories in the time series used i… ▽ More Recurrent neural networks have seen widespread use in modeling dynamical systems in varied domains such as weather prediction, text prediction and several others. Often one wishes to supplement the experimentally observed dynamics with prior knowledge or intuition about the system. While the recurrent nature of these networks allows them to model arbitrarily long memories in the time series used in training, it makes it harder to impose prior knowledge or intuition through generic constraints. In this work, we present a path sampling approach based on principle of Maximum Caliber that allows us to include generic thermodynamic or kinetic constraints into recurrent neural networks. We show the method here for a widely used type of recurrent neural network known as long short-term memory network in the context of supplementing time series collected from different application domains. These include classical Molecular Dynamics of a protein and Monte Carlo simulations of an open quantum system continuously losing photons to the environment and displaying Rabi oscillations. Our method can be easily generalized to other generative artificial intelligence models and to generic time series in different areas of physical and social sciences, where one wishes to supplement limited data with intuition or theory based corrections. △ Less

Submitted 20 April, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

Comments: Added results for open quantum system with dissipative photon dynamics

arXiv:2112.11201 [pdf, other]

Accelerating all-atom simulations and gaining mechanistic understanding of biophysical systems through State Predictive Information Bottleneck

Authors: Shams Mehdi, Dedi Wang, Shashank Pant, Pratyush Tiwary

Abstract: An effective implementation of enhanced sampling algorithms for molecular dynamics simulations requires a priori knowledge of the approximate reaction coordinate describing the relevant mechanisms in the system. Here we demonstrate how the artificial intelligence based recent State Predictive Information Bottleneck (SPIB) approach can learn such a reaction coordinate as a deep neural network even… ▽ More An effective implementation of enhanced sampling algorithms for molecular dynamics simulations requires a priori knowledge of the approximate reaction coordinate describing the relevant mechanisms in the system. Here we demonstrate how the artificial intelligence based recent State Predictive Information Bottleneck (SPIB) approach can learn such a reaction coordinate as a deep neural network even from under-sampled trajectories. We demonstrate its usefulness by achieving more than 40 magnitudes of acceleration in simulating two test-piece biophysical systems through well-tempered metadynamics performed by biasing along the SPIB learned reaction coordinate. These include left- to right- handed chirality transitions in a synthetic protein (Aib)_9, and permeation of a small, asymmetric molecule benzoic acid through a synthetic, symmetric phospholipid bilayer. In addition to significantly accelerating the dynamics and achieving back-and-forth movement between different metastable states, the SPIB based reaction coordinate gives mechanistic insight into the processes driving these two important problems. △ Less

Submitted 21 December, 2021; originally announced December 2021.

arXiv:2110.05646 [pdf, other]

Influence of long range forces on the transition states and dynamics of NaCl ion-pair dissociation in water

Authors: Dedi Wang, Renjie Zhao, John D. Weeks, Pratyush Tiwary

Abstract: We study NaCl ion-pair dissociation in a dilute aqueous solution using computer simulations both for the full system with long range Coulomb interactions and for a well chosen reference system with short range intermolecular interactions. Analyzing results using concepts from Local Molecular Field (LMF) theory and the recently proposed AI-based analysis tool "State predictive information bottlenec… ▽ More We study NaCl ion-pair dissociation in a dilute aqueous solution using computer simulations both for the full system with long range Coulomb interactions and for a well chosen reference system with short range intermolecular interactions. Analyzing results using concepts from Local Molecular Field (LMF) theory and the recently proposed AI-based analysis tool "State predictive information bottleneck" (SPIB) we show that the system with short range interactions can accurately reproduce the transition rate for the dissociation process, the dynamics for moving between the underlying metastable states, and the transition state ensemble. Contributions from long range interactions can be largely neglected for these processes because long range forces from the direct interionic Coulomb interactions are almost completely canceled ($>90\%$) by those from solvent interactions over the length scale where the transition takes place. Thus for this important monovalent ion-pair system, short range forces alone are able to capture detailed consequences of the collective solvent motion, allowing the use of physically suggestive and computationally efficient short range models for the disassociation event. We believe that the framework here should be applicable to disentangling mechanisms for more complex processes such as multivalent ion disassociation, where previous work has suggested that long range contributions may be more important. △ Less

Submitted 11 October, 2021; originally announced October 2021.

arXiv:2108.10459 [pdf, other]

Towards automated sampling of polymorph nucleation and free energies with SGOOP and metadynamics

Authors: Ziyue Zou, Sun-Ting Tsai, Pratyush Tiwary

Abstract: Understanding the driving forces behind the nucleation of different polymorphs is of great importance for material sciences and the pharmaceutical industry. This includes understanding the reaction coordinate that governs the nucleation process as well as correctly calculating the relative free energies of different polymorphs. Here we demonstrate, for the prototypical case of urea nucleation from… ▽ More Understanding the driving forces behind the nucleation of different polymorphs is of great importance for material sciences and the pharmaceutical industry. This includes understanding the reaction coordinate that governs the nucleation process as well as correctly calculating the relative free energies of different polymorphs. Here we demonstrate, for the prototypical case of urea nucleation from melt, how one can learn such a 1-dimensional reaction coordinate as a function of pre-specified order parameters, and use it to perform efficient biased all-atom molecular dynamics simulations. The reaction coordinate is learnt as a function of generic thermodynamic and structural order parameters using the "Spectral Gap Optimization of Order Parameters (SGOOP)" approach [P. Tiwary and B. J. Berne, Proc. Natl. Acad. Sci. (2016)], and is biased using well-tempered metadynamics simulations. The reaction coordinate gives insight into the role played by different structural and thermodynamics order parameters, and the biased simulations obtain accurate relative free energies for different polymorphs. This includes accurate prediction of the approximate pressure at which urea undergoes a phase transition and one of the metastable polymorphs becomes the most stable conformation. We believe the ideas demonstrated in thus work will facilitate efficient sampling of nucleation in complex, generic systems. △ Less

Submitted 23 August, 2021; originally announced August 2021.

arXiv:2108.08979 [pdf, other]

Computing committors in collective variables via Mahalanobis diffusion maps

Authors: Luke Evans, Maria K. Cameron, Pratyush Tiwary

Abstract: The study of rare events in molecular and atomic systems such as conformal changes and cluster rearrangements has been one of the most important research themes in chemical physics. Key challenges are associated with long waiting times rendering molecular simulations inefficient, high dimensionality impeding the use of PDE-based approaches, and the complexity or breadth of transition processes lim… ▽ More The study of rare events in molecular and atomic systems such as conformal changes and cluster rearrangements has been one of the most important research themes in chemical physics. Key challenges are associated with long waiting times rendering molecular simulations inefficient, high dimensionality impeding the use of PDE-based approaches, and the complexity or breadth of transition processes limiting the predictive power of asymptotic methods. Diffusion maps are promising algorithms to avoid or mitigate all these issues. We adapt the diffusion map with Mahalanobis kernel proposed by Singer and Coifman (2008) for the SDE describing molecular dynamics in collective variables in which the diffusion matrix is position-dependent and, unlike the case considered by Singer and Coifman, is not associated with a diffeomorphism. We offer an elementary proof showing that one can approximate the generator for this SDE discretized to a point cloud via the Mahalanobis diffusion map. We use it to calculate the committor functions in collective variables for two benchmark systems: alanine dipeptide, and Lennard-Jones-7 in 2D. For validating our committor results, we compare our committor functions to the finite-difference solution or by conducting a "committor analysis" as used by molecular dynamics practitioners. We contrast the outputs of the Mahalanobis diffusion map with those of the standard diffusion map with isotropic kernel and show that the former gives significantly more accurate estimates for the committors than the latter. △ Less

Submitted 2 October, 2022; v1 submitted 19 August, 2021; originally announced August 2021.

Comments: Restructured introduction, additional Theorem 3.1 and Appendix A, B

arXiv:2107.07369 [pdf, other]

doi 10.1073/pnas.2203656119

From data to noise to data: mixing physics across temperatures with generative artificial intelligence

Authors: Yihang Wang, Lukas Herron, Pratyush Tiwary

Abstract: Using simulations or experiments performed at some set of temperatures to learn about the physics or chemistry at some other arbitrary temperature is a problem of immense practical and theoretical relevance. Here we develop a framework based on statistical mechanics and generative Artificial Intelligence that allows solving this problem. Specifically, we work with denoising diffusion probabilistic… ▽ More Using simulations or experiments performed at some set of temperatures to learn about the physics or chemistry at some other arbitrary temperature is a problem of immense practical and theoretical relevance. Here we develop a framework based on statistical mechanics and generative Artificial Intelligence that allows solving this problem. Specifically, we work with denoising diffusion probabilistic models, and show how these models in combination with replica exchange molecular dynamics achieve superior sampling of the biomolecular energy landscape at temperatures that were never even simulated without assuming any particular slow degrees of freedom. The key idea is to treat the temperature as a fluctuating random variable and not a control parameter as is usually done. This allows us to directly sample from the joint probability distribution in configuration and temperature space. The results here are demonstrated for a chirally symmetric peptide and single-strand ribonucleic acid undergoing conformational transitions in all-atom water. We demonstrate how we can discover transition states and metastable states that were previously unseen at the temperature of interest, and even bypass the need to perform further simulations for wide range of temperatures. At the same time, any unphysical states are easily identifiable through very low Boltzmann weights. The procedure while shown here for a class of molecular simulations should be more generally applicable to mixing information across simulations and experiments with varying control parameters. △ Less

Submitted 2 March, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

Comments: Added new system (RNA nucleotide) and more detailed analysis including comparison with direct reweighting

arXiv:2104.13560 [pdf, other]

SGOOP-d: Estimating kinetic distances and reaction coordinate dimensionality for rare event systems from biased/unbiased simulations

Authors: Sun-Ting Tsai, Zachary Smith, Pratyush Tiwary

Abstract: Understanding kinetics including reaction pathways and associated transition rates is an important yet difficult problem in numerous chemical and biological systems especially in situations with multiple competing pathways. When these high-dimensional systems are projected on low-dimensional coordinates, which are often needed for enhanced sampling or for interpretation of simulations and experime… ▽ More Understanding kinetics including reaction pathways and associated transition rates is an important yet difficult problem in numerous chemical and biological systems especially in situations with multiple competing pathways. When these high-dimensional systems are projected on low-dimensional coordinates, which are often needed for enhanced sampling or for interpretation of simulations and experiments, one can end up losing the kinetic connectivity of the underlying high-dimensional landscape. Thus in the low-dimensional projection metastable states might appear closer or further than they actually are. To deal with this issue, in this work we develop a formalism that learns a multi-dimensional yet minimally complex reaction coordinate (RC) for generic high-dimensional systems. When projected along this RC, all possible kinetically relevant pathways can be demarcated and the true high-dimensional connectivity is maintained. One of the defining attributes of our method lies in that it can work on long unbiased simulations as well as biased simulations often needed for rare event systems. We demonstrate the utility of the method by studying a range of model systems including conformational transitions in a small peptide Ace-Ala$_3$-Nme, where we show how two-dimensional and three-dimensional reaction coordinate found by our previously published spectral gap optimization method "SGOOP" [P. Tiwary and B. J. Berne, Proc. Natl. Acad. Sci. 113, 2839 (2016)] can capture the kinetics for 23 and all 28 out of the 28 dominant state-to-state transitions respectively. △ Less

Submitted 21 September, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

Comments: 10 pages, 4 figures, 2 tables

arXiv:2104.01301 [pdf, other]

Multimedia Technology Applications and Algorithms: A Survey

Authors: Palak Tiwary, Sanjida Ahmed

Abstract: Multimedia related research and development has evolved rapidly in the last few years with advancements in hardware, software and network infrastructures. As a result, multimedia has been integrated into domains like Healthcare and Medicine, Human facial feature extraction and tracking, pose recognition, disparity estimation, etc. This survey gives an overview of the various multimedia technologie… ▽ More Multimedia related research and development has evolved rapidly in the last few years with advancements in hardware, software and network infrastructures. As a result, multimedia has been integrated into domains like Healthcare and Medicine, Human facial feature extraction and tracking, pose recognition, disparity estimation, etc. This survey gives an overview of the various multimedia technologies and algorithms developed in the domains mentioned. △ Less

Submitted 2 April, 2021; originally announced April 2021.

arXiv:2011.10127 [pdf, ps, other]

doi 10.1063/5.0038198

State Predictive Information Bottleneck

Authors: Dedi Wang, Pratyush Tiwary

Abstract: The ability to make sense of the massive amounts of high-dimensional data generated from molecular dynamics (MD) simulations is heavily dependent on the knowledge of a low dimensional manifold (parameterized by a reaction coordinate or RC) that typically distinguishes between relevant metastable states and which captures the relevant slow dynamics of interest. Methods based on machine learning and… ▽ More The ability to make sense of the massive amounts of high-dimensional data generated from molecular dynamics (MD) simulations is heavily dependent on the knowledge of a low dimensional manifold (parameterized by a reaction coordinate or RC) that typically distinguishes between relevant metastable states and which captures the relevant slow dynamics of interest. Methods based on machine learning and artificial intelligence have been proposed over the years to deal with learning such low-dimensional manifolds, but they are often criticized for a disconnect from more traditional and physically interpretable approaches. To deal with such concerns, in this work, we propose a deep learning based State Predictive Information Bottleneck (SPIB) approach to learn the RC from high dimensional molecular simulation trajectories. We demonstrate analytically and numerically how the RC learnt in this approach is deeply connected to the committor in chemical physics, and can be used to accurately identify transition states. A crucial hyperparameter in this approach is the time-delay, or how far into the future the algorithm should make predictions about. Through careful comparisons for benchmark systems, we demonstrate that this hyperparameter choice gives useful control over how coarse-grained we want the metastable state classification of the system to be. We thus believe that this work represents a step forward in systematic application of deep learning based ideas to molecular simulations in a way that bridges the gap between artificial intelligence and traditional chemical physics. △ Less

Submitted 12 February, 2021; v1 submitted 19 November, 2020; originally announced November 2020.

Comments: 11 pages, 13 figures

arXiv:2004.12360 [pdf, other]

doi 10.1038/s41467-020-18959-8

Learning Molecular Dynamics with Simple Language Model built upon Long Short-Term Memory Neural Network

Authors: Sun-Ting Tsai, En-Jui Kuo, Pratyush Tiwary

Abstract: Recurrent neural networks (RNNs) have led to breakthroughs in natural language processing and speech recognition, wherein hundreds of millions of people use such tools on a daily basis through smartphones, email servers and other avenues. In this work, we show such RNNs, specifically Long Short-Term Memory (LSTM) neural networks can also be applied to capturing the temporal evolution of typical tr… ▽ More Recurrent neural networks (RNNs) have led to breakthroughs in natural language processing and speech recognition, wherein hundreds of millions of people use such tools on a daily basis through smartphones, email servers and other avenues. In this work, we show such RNNs, specifically Long Short-Term Memory (LSTM) neural networks can also be applied to capturing the temporal evolution of typical trajectories arising in chemical and biological physics. Specifically, we use a character-level language model based on LSTM. This learns a probabilistic model from 1-dimensional stochastic trajectories generated from molecular dynamics simulations of a higher dimensional system. We show that the model can not only capture the Boltzmann statistics of the system but it also reproduce kinetics at a large spectrum of timescales. We demonstrate how the embedding layer, introduced originally for representing the contextual meaning of words or characters, exhibits here a nontrivial connectivity between different metastable states in the underlying physical system. We demonstrate the reliability of our model and interpretations through different benchmark systems and a single molecule force spectroscopy trajectory for multi-state riboswitch. We anticipate that our work represents a step** stone in the understanding and use of RNNs for modeling and predicting dynamics of complex stochastic molecular systems. △ Less

Submitted 4 August, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

arXiv:2002.06099 [pdf, other]

doi 10.1063/5.0004838

Understanding the role of predictive time delay and biased propagator in RAVE

Authors: Yihang Wang, Pratyush Tiwary

Abstract: In this work, we revisit our recent iterative machine learning (ML) -- molecular dynamics (MD) technique "Reweighted autoencoded variational Bayes for enhanced sampling (RAVE)" (Ribeiro, Bravo, Wang, Tiwary, J. Chem. Phys. 149 072301 (2018) and Wang, Ribeiro, Tiwary, Nature Commun. 10 3573 (2019)) and analyze as well as formalize some of its approximations. These including: (a) the choice of a pre… ▽ More In this work, we revisit our recent iterative machine learning (ML) -- molecular dynamics (MD) technique "Reweighted autoencoded variational Bayes for enhanced sampling (RAVE)" (Ribeiro, Bravo, Wang, Tiwary, J. Chem. Phys. 149 072301 (2018) and Wang, Ribeiro, Tiwary, Nature Commun. 10 3573 (2019)) and analyze as well as formalize some of its approximations. These including: (a) the choice of a predictive time-delay, or how far into the future should the ML try to predict the state of a given system output from MD, and (b) for short time-delays, how much of an error is made in approximating the biased propagator for the dynamics as the unbiased propagator. We demonstrate through a master equation framework as to why the exact choice of time-delay is irrelevant as long as a small non-zero value is adopted. We also derive a correction to reweight the biased propagator, and somewhat to our dissatisfaction but also to our reassurance, find that it barely makes a difference to the intuitive picture we had previously derived and used. △ Less

Submitted 14 February, 2020; originally announced February 2020.

arXiv:1909.11748 [pdf, other]

Machine learning approaches for analyzing and enhancing molecular dynamics simulations

Authors: Yihang Wang, Joao Marcelo Lamim Ribeiro, Pratyush Tiwary

Abstract: Molecular dynamics (MD) has become a powerful tool for studying biophysical systems, due to increasing computational power and availability of software. Although MD has made many contributions to better understanding these complex biophysical systems, there remain methodological difficulties to be surmounted. First, how to make the deluge of data generated in running even a microsecond long MD sim… ▽ More Molecular dynamics (MD) has become a powerful tool for studying biophysical systems, due to increasing computational power and availability of software. Although MD has made many contributions to better understanding these complex biophysical systems, there remain methodological difficulties to be surmounted. First, how to make the deluge of data generated in running even a microsecond long MD simulation human comprehensible. Second, how to efficiently sample the underlying free energy surface and kinetics. In this short perspective, we summarize machine learning based ideas that are solving both of these limitations, with a focus on their key theoretical underpinnings and remaining challenges. △ Less

Submitted 25 September, 2019; originally announced September 2019.

arXiv:1908.04846 [pdf, other]

doi 10.1063/1.5124385

Reaction coordinates and rate constants for liquid droplet nucleation: quantifying the interplay between driving force and memory

Authors: Sun-Ting Tsai, Zachary Smith, Pratyush Tiwary

Abstract: In this work we revisit the classic problem of homogeneous nucleation of a liquid droplet in a supersaturated vapor phase. We consider this at different extents of the driving force, which here is the extent of supersaturation, and calculate a reaction coordinate (RC) for nucleation as the driving force is varied. The RC is constructed as a linear combination of three order parameters, where one a… ▽ More In this work we revisit the classic problem of homogeneous nucleation of a liquid droplet in a supersaturated vapor phase. We consider this at different extents of the driving force, which here is the extent of supersaturation, and calculate a reaction coordinate (RC) for nucleation as the driving force is varied. The RC is constructed as a linear combination of three order parameters, where one accounts for the number of liquid-like atoms, and the other two for local density fluctuations. The RC is calculated from all-atom biased and unbiased molecular dynamics (MD) simulations using the spectral gap optimization approach "SGOOP" [P. Tiwary and B. J. Berne, Proc. Natl. Acad. Sci. U. S. A. 113, 2839 (2016)]. Our key finding is that as the supersaturation decreases, the RC ceases to simply be the number of liquid-like atoms, and instead it becomes important to explicitly consider local density fluctuations that correlate with shape and density variations in the nucleus. All three order parameters are found to have similar barriers in their respective potentials of mean force, however, as the supersaturation decreases the density fluctuations decorrelate slower and thus carry longer memory. Thus at lower supersaturations density fluctuations are non-Markovian and can not be simply ignored from the RC by virtue of being noise. Finally, we use this optimized RC to calculate nucleation rates in the infrequent metadynamics framework, and show it leads to more accurate estimate of the nucleation rate with four orders of magnitude acceleration relative to unbiased MD. △ Less

Submitted 13 August, 2019; originally announced August 2019.

Comments: 10 pages, 5 figures

arXiv:1809.04540 [pdf, ps, other]

Ligand dissociation mechanisms from all-atom simulations: Are we there yet?

Authors: Joao Marcelo Lamim Ribeiro, Sun-Ting Tsai, Debabrata Pramanik, Yihang Wang, Pratyush Tiwary

Abstract: Large parallel gains in the development of both computational resources as well as sampling methods have now made it possible to simulate dissociation events in ligand-protein complexes with all--atom resolution. Such encouraging progress, together with the inherent spatiotemporal resolution associated with molecular simulations, has left their use for investigating dissociation processes brimming… ▽ More Large parallel gains in the development of both computational resources as well as sampling methods have now made it possible to simulate dissociation events in ligand-protein complexes with all--atom resolution. Such encouraging progress, together with the inherent spatiotemporal resolution associated with molecular simulations, has left their use for investigating dissociation processes brimming with potential, both in rational drug design, where it can be an invaluable tool for determining the mechanistic driving forces behind dissociation rate constants, as well as in force-field development, where it can provide a catalog of transient molecular structures on which to refine force-fields. Although much progress has been made in making force-fields more accurate, reducing their error for transient structures along a transition path could yet prove to be a critical development hel** to make kinetic predictions much more accurate. In what follows we will provide a state-of-the-art compilation of the molecular dynamics (MD) methods used to investigate the kinetics and mechanisms of ligand-protein dissociation processes. Due to the timescales of such processes being slower than what is accessible using straightforward MD simulations, several ingenious schemes are being devised at a rapid rate to overcome this obstacle. Here we provide an up-to-date compendium of such methods and their achievements/shortcomings in extracting mechanistic insight into ligand-protein dissociation. We conclude with a critical and provocative appraisal attempting to answer the title of this review. △ Less

Submitted 12 September, 2018; originally announced September 2018.

arXiv:1802.04182 [pdf, other]

doi 10.1063/1.5024679

Frequency adaptive metadynamics for the calculation of rare-event kinetics

Authors: Yong Wang, Omar Valsson, Pratyush Tiwary, Michele Parrinello, Kresten Lindorff-Larsen

Abstract: The ability to predict accurate thermodynamic and kinetic properties in biomolecular systems is of both scientific and practical utility. While both remain very difficult, predictions of kinetics are particularly difficult because rates, in contrast to free energies, depend on the route taken and are thus not amenable to all enhanced sampling methods. It has recently been demonstrated that it is p… ▽ More The ability to predict accurate thermodynamic and kinetic properties in biomolecular systems is of both scientific and practical utility. While both remain very difficult, predictions of kinetics are particularly difficult because rates, in contrast to free energies, depend on the route taken and are thus not amenable to all enhanced sampling methods. It has recently been demonstrated that it is possible to recover kinetics through so called `infrequent metadynamics' simulations, where the simulations are biased in a way that minimally corrupts the dynamics of moving between metastable states. This method, however, requires the bias to be added slowly, thus hampering applications to processes with only modest separations of timescales. Here we present a frequency-adaptive strategy which bridges normal and infrequent metadynamics. We show that this strategy can improve the precision and accuracy of rate calculations at fixed computational cost, and should be able to extend rate calculations for much slower kinetic processes. △ Less

Submitted 12 February, 2018; originally announced February 2018.

Comments: 15 pages, 2 figures, 2 tables

arXiv:1802.03420 [pdf, other]

Reweighted Autoencoded Variational Bayes for Enhanced Sampling (RAVE)

Authors: Joao Marcelo Lamim Ribeiro, Pablo Bravo Collado, Yihang Wang, Pratyush Tiwary

Abstract: Here we propose the Reweighted Autoencoded Variational Bayes for Enhanced Sampling (RAVE) method, a new iterative scheme that uses the deep learning framework of variational autoencoders to enhance sampling in molecular simulations. RAVE involves iterations between molecular simulations and deep learning in order to produce an increasingly accurate probability distribution along a low-dimensional… ▽ More Here we propose the Reweighted Autoencoded Variational Bayes for Enhanced Sampling (RAVE) method, a new iterative scheme that uses the deep learning framework of variational autoencoders to enhance sampling in molecular simulations. RAVE involves iterations between molecular simulations and deep learning in order to produce an increasingly accurate probability distribution along a low-dimensional latent space that captures the key features of the molecular simulation trajectory. Using the Kullback-Leibler divergence between this latent space distribution and the distribution of various trial reaction coordinates sampled from the molecular simulation, RAVE determines an optimum, yet nonetheless physically interpretable, reaction coordinate and optimum probability distribution. Both then directly serve as the biasing protocol for a new biased simulation, which is once again fed into the deep learning module with appropriate weights accounting for the bias, the procedure continuing until estimates of desirable thermodynamic observables are converged. Unlike recent methods using deep learning for enhanced sampling purposes, RAVE stands out in that (a) it naturally produces a physically interpretable reaction coordinate, (b) is independent of existing enhanced sampling protocols to enhance the fluctuations along the latent space identified via deep learning, and (c) it provides the ability to easily filter out spurious solutions learned by the deep learning procedure. The usefulness and reliability of RAVE is demonstrated by applying it to model potentials of increasing complexity, including computation of the binding free energy profile for a hydrophobic ligand-substrate system in explicit water with dissociation time of more than three minutes, in computer time at least twenty times less than that needed for umbrella sampling or metadynamics. △ Less

Submitted 9 February, 2018; originally announced February 2018.

arXiv:1704.03912 [pdf, ps, other]

doi 10.1063/1.4983727

Predicting reaction coordinates in energy landscapes with diffusion anisotropy

Authors: Pratyush Tiwary, B. J. Berne

Abstract: We consider a range of model potentials with metastable states undergoing molecular dynamics coupled to a thermal bath in the high friction regime, and consider how the optimal reaction coordinate depends on the diffusion anisotropy. For this we use our recently proposed method 'Spectral gap optimization of order parameters (SGOOP)' (Tiwary and Berne, Proc. Natl. Acad. Sci. 113 2839 2016). We show… ▽ More We consider a range of model potentials with metastable states undergoing molecular dynamics coupled to a thermal bath in the high friction regime, and consider how the optimal reaction coordinate depends on the diffusion anisotropy. For this we use our recently proposed method 'Spectral gap optimization of order parameters (SGOOP)' (Tiwary and Berne, Proc. Natl. Acad. Sci. 113 2839 2016). We show how available information about dynamical observables in addition to static information can be incorporated into SGOOP, which can then be used to accurately determine the 'best' reaction coordinate for arbitrary anisotropies. We compare our results with transmission coefficient calculations and published benchmarks where applicable or available respectively. △ Less

Submitted 1 May, 2017; v1 submitted 12 April, 2017; originally announced April 2017.

Journal ref: J. Chem. Phys. 147, 152701 (2017)

arXiv:1609.06012 [pdf, other]

A Novel Approach to Implement Message Level Security in RESTful Web Services

Authors: Gyan Prakash Tiwary, Abhishek Srivastava

Abstract: The world is rapidly adopting RESTful web services for most of its tasks. The once popular SOAP-based web services are fast losing ground owing to this. RESTful web services are light weight services without strict message formats. RESTful web services, unlike SOAP, are capable of message transfer in any format be it XML, JSON, plain text. However, in spite of these positives, ensuring message lev… ▽ More The world is rapidly adopting RESTful web services for most of its tasks. The once popular SOAP-based web services are fast losing ground owing to this. RESTful web services are light weight services without strict message formats. RESTful web services, unlike SOAP, are capable of message transfer in any format be it XML, JSON, plain text. However, in spite of these positives, ensuring message level security in REST is a challenge. Security in RESTful web services is still largely dependent upon transport layer security. There has been some work recently towards message level security in such environments wherein the transfer of message level security metadata is done through utilising new HTTP headers. We feel, however, that any method that compromises the generality of the HTTP protocol should be avoided. In this paper, therefore, we propose two new ways of encryption that promise to ensure message level security in RESTful web services without the need for special HTTP headers. This approach works seamlessly on most famous content-types of RESTful web services: XML, JSON, HTML, plain-text and various ASCII printable content types. Further, the proposed approach removes the need for content negotiation in cases where the content comprises XML, JSON, HTML, plain-text, and ASCII printable content types and also removes the need for XML or JSON canonicalization. △ Less

Submitted 20 September, 2016; originally announced September 2016.

arXiv:1605.07090 [pdf, other]

doi 10.1063/1.4959969

How wet should be the reaction coordinate for ligand unbinding?

Authors: Pratyush Tiwary, B. J. Berne

Abstract: We use a recently proposed method called Spectral Gap Optimization of Order Parameters (SGOOP) (Tiwary and Berne, Proc. Natl. Acad. Sci 2016, 113, 2839 (2016)), to determine an optimal 1-dimensional reaction coordinate (RC) for the unbinding of a bucky-ball from a pocket in explicit water. This RC is estimated as a linear combination of the multiple available order parameters that collectively can… ▽ More We use a recently proposed method called Spectral Gap Optimization of Order Parameters (SGOOP) (Tiwary and Berne, Proc. Natl. Acad. Sci 2016, 113, 2839 (2016)), to determine an optimal 1-dimensional reaction coordinate (RC) for the unbinding of a bucky-ball from a pocket in explicit water. This RC is estimated as a linear combination of the multiple available order parameters that collectively can be used to distinguish the various stable states relevant for unbinding. We pay special attention to determining and quantifying the degree to which water molecules should be included in the RC. Using SGOOP with under-sampled biased simulations, we predict that water plays a distinct role in the reaction coordinate for unbinding in the case when the ligand is sterically constrained to move along an axis of symmetry. This prediction is validated through extensive calculations of the unbinding times through metadynamics, and by comparison through detailed balance with unbiased molecular dynamics estimate of the binding time. However when the steric constraint is removed, we find that the role of water in the reaction coordinate diminishes. Here instead SGOOP identifies a good one-dimensional RC involving various motional degrees of freedom. △ Less

Submitted 23 May, 2016; originally announced May 2016.

Comments: 7 pages, 5 figures

arXiv:1602.06588 [pdf, ps, other]

doi 10.1063/1.4944577

Kramers turnover: from energy diffusion to spatial diffusion using metadynamics

Authors: Pratyush Tiwary, B. J. Berne

Abstract: We consider the rate of transition for a particle between two metastable states coupled to a thermal environment for various magnitudes of the coupling strength, using the recently proposed infrequent metadynamics approach (Tiwary and Parrinello, Phys. Rev. Lett. 111, 230602 (2013)). We are interested in understanding how this approach for obtaining rate constants performs as the dynamics regime c… ▽ More We consider the rate of transition for a particle between two metastable states coupled to a thermal environment for various magnitudes of the coupling strength, using the recently proposed infrequent metadynamics approach (Tiwary and Parrinello, Phys. Rev. Lett. 111, 230602 (2013)). We are interested in understanding how this approach for obtaining rate constants performs as the dynamics regime changes from energy diffusion to spatial diffusion. Reassuringly, we find that the approach works remarkably well for various coupling strengths in the strong coupling regime, and to some extent even in the weak coupling regime. △ Less

Submitted 21 February, 2016; originally announced February 2016.

Comments: 3 pages, 1 figure, submitted to J. Chem. Phys

arXiv:1510.01649 [pdf, ps, other]

doi 10.1063/1.4937945

A perturbative solution to metadynamics ordinary differential equation

Authors: Pratyush Tiwary, James F. Dama, Michele Parrinello

Abstract: Metadynamics is a popular enhanced sampling scheme wherein by periodic application of a repulsive bias, one can surmount high free energy barriers and explore complex landscapes. Recently metadynamics was shown to be mathematically well founded, in the sense that the biasing procedure is guaranteed to converge to the true free energy surface in the long time limit irrespective of the precise choic… ▽ More Metadynamics is a popular enhanced sampling scheme wherein by periodic application of a repulsive bias, one can surmount high free energy barriers and explore complex landscapes. Recently metadynamics was shown to be mathematically well founded, in the sense that the biasing procedure is guaranteed to converge to the true free energy surface in the long time limit irrespective of the precise choice of biasing parameters. A differential equation governing the post-transient convergence behavior of metadynamics was also derived. In this short communication, we revisit this differential equation, expressing it in a convenient and elegant Riccati-like form. A perturbative solution scheme is then developed for solving this differential equation, which is valid for any generic biasing kernel. The solution clearly demonstrates the robustness of metadynamics to choice of biasing parameters and gives further confidence in the widely used method. △ Less

Submitted 6 October, 2015; originally announced October 2015.

Comments: submitted to J. Chem. Phys

arXiv:1509.06145 [pdf, other]

doi 10.1073/pnas.1600917113

Caliber based spectral gap optimization of order parameters (SGOOP) for sampling complex molecular systems

Authors: Pratyush Tiwary, B. J. Berne

Abstract: In modern day simulations of many-body systems much of the computational complexity is shifted to the identification of slowly changing molecular order parameters called collective variables (CV) or reaction coordinates. A vast array of enhanced sampling methods are based on the identification and biasing of these low-dimensional order parameters, whose fluctuations are important in driving rare e… ▽ More In modern day simulations of many-body systems much of the computational complexity is shifted to the identification of slowly changing molecular order parameters called collective variables (CV) or reaction coordinates. A vast array of enhanced sampling methods are based on the identification and biasing of these low-dimensional order parameters, whose fluctuations are important in driving rare events of interest. Here describe a new algorithm for finding optimal low-dimensional collective variables for use in enhanced sampling biasing methods like umbrella sampling, metadynamics and related methods, when limited prior static and dynamic information is known about the system, and a much larger set of candidate CVs is specified. The algorithm involves estimating the best combination of these candidate CVs, as quantified by a maximum path entropy estimate of the spectral gap for dynamics viewed as a function of that CV. Through multiple practical examples, we show how this post-processing procedure can lead to optimization of CV and several orders of magnitude improvement in the convergence of the free energy calculated through metadynamics, essentially giving the ability to extract useful information even from unsuccessful metadynamics runs. △ Less

Submitted 8 November, 2015; v1 submitted 21 September, 2015; originally announced September 2015.

Comments: 7 pages, 4 figures; corrected missing figure number and added a reference

arXiv:1508.01642 [pdf, other]

doi 10.1063/1.4966265

Overcoming timescale and finite-size limitations to compute nucleation rates from small scale Well Tempered Metadynamics simulations

Authors: Matteo Salvalaglio, Pratyush Tiwary, Giovanni Maria Maggioni, Marco Mazzotti, Michele Parrinello

Abstract: Condensation of a liquid droplet from a supersaturated vapour phase is initiated by a prototypical nucleation event. As such it is challenging to compute its rate from atomistic molecular dynamics simulations. In fact at realistic supersaturation conditions condensation occurs on time scales that far exceed what can be reached with conventional molecular dynamics methods. Another known problem in… ▽ More Condensation of a liquid droplet from a supersaturated vapour phase is initiated by a prototypical nucleation event. As such it is challenging to compute its rate from atomistic molecular dynamics simulations. In fact at realistic supersaturation conditions condensation occurs on time scales that far exceed what can be reached with conventional molecular dynamics methods. Another known problem in this context is the distortion of the free energy profile associated to nucleation due to the small, finite size of typical simulation boxes. In this work the problem of time scale is addressed with a recently developed enhanced sampling method while contextually correcting for finite size effects. We demonstrate our approach by studying the condensation of argon, and showing that characteristic nucleation times of the order of magnitude of hours can be reliably calculated, approaching realistic supersaturation conditions, thus bridging the gap between what standard molecular dynamics simulations can do and real physical systems. △ Less

Submitted 16 May, 2016; v1 submitted 7 August, 2015; originally announced August 2015.

Comments: 9 pages, 7 figures, additional figures and data provided as supplementary information. Submitted to the Journal of Chemical Physiscs

arXiv:1507.02985 [pdf, other]

doi 10.1073/pnas.1516652112

The role of water and steric constraints in the kinetics of cavity-ligand unbinding

Authors: Pratyush Tiwary, Jagannath Mondal, Joseph A. Morrone, B. J. Berne

Abstract: A key factor influencing a drug's efficacy is its residence time in the binding pocket of the host protein. Using atomistic computer simulation to predict this residence time and the associated dissociation process is a desirable but extremely difficult task due to the long timescales involved. This gets further complicated by the presence of biophysical factors such as steric and solvation effect… ▽ More A key factor influencing a drug's efficacy is its residence time in the binding pocket of the host protein. Using atomistic computer simulation to predict this residence time and the associated dissociation process is a desirable but extremely difficult task due to the long timescales involved. This gets further complicated by the presence of biophysical factors such as steric and solvation effects. In this work, we perform molecular dynamics (MD) simulations of the unbinding of a popular prototypical hydrophobic cavity-ligand system using a metadynamics based approach that allows direct assessment of kinetic pathways and parameters. When constrained to move in an axial manner, we find the unbinding time to be on the order of 4000 sec. In accordance with previous studies, we find that the ligand must pass through a region of sharp dewetting transition manifested by sudden and high fluctuations in solvent density in the cavity. When we remove the steric constraints on ligand, the unbinding happens predominantly by an alternate pathway, where the unbinding becomes 20 times faster, and the sharp dewetting transition instead becomes continuous. We validate the unbinding timescales from metadynamics through a Poisson analysis, and by comparison through detailed balance to binding timescale estimates from unbiased MD. This work demonstrates that enhanced sampling can be used to perform explicit solvent molecular dynamics studies at timescales previously unattainable, obtaining direct and reliable pictures of the underlying physio-chemical factors including free energies and rate constants. △ Less

Submitted 10 July, 2015; originally announced July 2015.

Comments: 7 pages, 4 figures, supplementary PDF file, submitted

arXiv:1506.02545 [pdf, other]

doi 10.1103/PhysRevLett.115.070601

Variationally Optimized Free Energy Flooding for Rate Calculation

Authors: James McCarty, Omar Valsson, Pratyush Tiwary, Michele Parrinello

Abstract: We propose a new method to obtain kinetic properties of infrequent events from molecular dynamics simulation. The procedure employs a recently introduced variational approach [Valsson and Parrinello, Phys. Rev. Lett. 113, 090601 (2014)] to construct a bias potential as a function of several collective variables that is designed to flood only the associated free energy surface up to a predefined le… ▽ More We propose a new method to obtain kinetic properties of infrequent events from molecular dynamics simulation. The procedure employs a recently introduced variational approach [Valsson and Parrinello, Phys. Rev. Lett. 113, 090601 (2014)] to construct a bias potential as a function of several collective variables that is designed to flood only the associated free energy surface up to a predefined level. The resulting bias potential effectively accelerates transitions between metastable free energy minima while ensuring bias-free transition states, thus allowing accurate kinetic rates to be obtained. We test the method on a few illustrative systems for which we obtain an order of magnitude improvement in efficiency relative to previous approaches, and several orders of magnitude relative to unbiased molecular dynamics. We expect an even larger improvement in more complex systems. This and the ability of the variational approach to deal efficiently with a large number of collective variables will greatly enhance the scope of these calculations. This work is a vindication of the potential that the variational principle has if applied in innovative ways △ Less

Submitted 18 June, 2015; v1 submitted 8 June, 2015; originally announced June 2015.

Comments: 6 pages, 3 figures, Supplemental Information

Journal ref: Phys. Rev. Lett. 115, 070601 (2015)

arXiv:1309.5323 [pdf, other]

doi 10.1103/PhysRevLett.111.230602

From Metadynamics to Dynamics

Authors: Pratyush Tiwary, Michele Parrinello

Abstract: Metadynamics is a commonly used and successful enhanced sampling method. By the introduction of a history dependent bias which depends on a restricted number of collective variables(CVs) it can explore complex free energy surfaces characterized by several metastable states separated by large free energy barriers. Here we extend its scope by introducing a simple yet powerful method for calculating… ▽ More Metadynamics is a commonly used and successful enhanced sampling method. By the introduction of a history dependent bias which depends on a restricted number of collective variables(CVs) it can explore complex free energy surfaces characterized by several metastable states separated by large free energy barriers. Here we extend its scope by introducing a simple yet powerful method for calculating the rates of transition between different metastable states. The method does not rely on a previous knowledge of the transition states or reaction co-ordinates, as long as CVs are known that can distinguish between the various stable minima in free energy space. We demonstrate that our method recovers the correct escape rates out of these stable states and also preserves the correct sequence of state-to-state transitions, with minimal extra computational effort needed over ordinary metadynamics. We apply the formalism to three different problems and in each case find excellent agreement with the results of long unbiased molecular dynamics runs. △ Less

Submitted 5 December, 2013; v1 submitted 20 September, 2013; originally announced September 2013.

Comments: 4 pages, 2 figures, 1 supplemental file

Journal ref: Phys. Rev. Lett. 111 (2013) 230602-230606

arXiv:1301.0168 [pdf, ps, other]

doi 10.1103/PhysRevB.89.184101

Ab initio calculation of anisotropic interfacial excess free energies

Authors: Axel van de Walle, Chirranjeevi Balaji Gopal, Steve Demers, Qijun Hong, Adam Kowalski, Ljubomir Miljacic, Gregory Pomrehn, Pratyush Tiwary

Abstract: We describe a simple method to determine, from ab initio calculations, the complete orientation-dependence of interfacial free energies in solid-state crystalline systems. We illustrate the method with an application to precipitates in the Al-Ti alloy system. The method combines the cluster expansion formalism in its most general form (to model the system's energetics) with the inversion of the we… ▽ More We describe a simple method to determine, from ab initio calculations, the complete orientation-dependence of interfacial free energies in solid-state crystalline systems. We illustrate the method with an application to precipitates in the Al-Ti alloy system. The method combines the cluster expansion formalism in its most general form (to model the system's energetics) with the inversion of the well-known Wulff construction (to recover interfacial energies from equilibrium precipitate shapes). Although the inverse Wulff construction only provides the relative magnitude of the various interfacial free energies, absolute free energies can be recovered from a calculation of a single, conveniently chosen, planar interface. The method is able to account for essentially all sources of entropy (arising from phonons, bulk point defects, as well as interface roughness) and is thus able to transparently handle both atomically smooth and rough interfaces. The approach expresses the resulting orientation-dependence of the interfacial properties using symmetry-adapted bases for general orientation-dependent quantities. As a by-product, this paper thus provides a simple and general method to generate such basis functions, which prove useful in a variety of other applications, for instance to represent the anisotropy of the so-called constituent strain elastic energy. △ Less

Submitted 22 April, 2014; v1 submitted 1 January, 2013; originally announced January 2013.

Comments: 17 pages, 9 figures

arXiv:1212.6649 [pdf, other]

Accelerated Molecular Dynamics through stochastic iterations to strengthen yield of path hop** over upper states (SISYPHUS)

Authors: Pratyush Tiwary, Axel van de Walle

Abstract: We present a new method, called SISYPHUS (Stochastic Iterations to Strengthen Yield of Path Hop** over Upper States), for extending accessible time-scales in atomistic simulations. The method proceeds by separating phase space into basins, and transition regions between the basins based on a general collective variable (CV) criterion. The transition regions are treated via traditional molecular… ▽ More We present a new method, called SISYPHUS (Stochastic Iterations to Strengthen Yield of Path Hop** over Upper States), for extending accessible time-scales in atomistic simulations. The method proceeds by separating phase space into basins, and transition regions between the basins based on a general collective variable (CV) criterion. The transition regions are treated via traditional molecular dynamics (MD) while Monte Carlo (MC) methods are used to (i) estimate the expected time spent in each basin and (ii) thermalize the system between two MD episodes. In particular, an efficient adiabatic switching based scheme is used to estimate the time spent inside the basins. The method offers various advantages over existing approaches in terms of (i) providing an accurate real time scale, (ii) avoiding reliance on harmonic transition state theory and (iii) avoiding the need to enumerate all possible transition events. Applications of SISYPHUS to low temperature vacancy diffusion in BCC Ta and adatom island ripening in FCC Al are presented. A new CV appropriate for such condensed phases, especially for transitions involving collective motions of several atoms, is also introduced. △ Less

Submitted 2 January, 2013; v1 submitted 29 December, 2012; originally announced December 2012.

Comments: 5 pages, 4 figures, see ancillary material as well (includes 6 movies and a PDF document)

Showing 1–50 of 54 results for author: Tiwary, P