-
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO
Authors:
Sebastian Dittert,
Vincent Moens,
Gianni De Fabritiis
Abstract:
We present BricksRL, a platform designed to democratize access to robotics for reinforcement learning research and education. BricksRL facilitates the creation, design, and training of custom LEGO robots in the real world by interfacing them with the TorchRL library for reinforcement learning agents. The integration of TorchRL with the LEGO hubs, via Bluetooth bidirectional communication, enables…
▽ More
We present BricksRL, a platform designed to democratize access to robotics for reinforcement learning research and education. BricksRL facilitates the creation, design, and training of custom LEGO robots in the real world by interfacing them with the TorchRL library for reinforcement learning agents. The integration of TorchRL with the LEGO hubs, via Bluetooth bidirectional communication, enables state-of-the-art reinforcement learning training on GPUs for a wide variety of LEGO builds. This offers a flexible and cost-efficient approach for scaling and also provides a robust infrastructure for robot-environment-algorithm communication. We present various experiments across tasks and robot configurations, providing built plans and training results. Furthermore, we demonstrate that inexpensive LEGO robots can be trained end-to-end in the real world to achieve simple tasks, with training times typically under 120 minutes on a normal laptop. Moreover, we show how users can extend the capabilities, exemplified by the successful integration of non-LEGO sensors. By enhancing accessibility to both robotics and reinforcement learning, BricksRL establishes a strong foundation for democratized robotic learning in research and educational settings.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
ACEGEN: Reinforcement learning of generative chemical agents for drug discovery
Authors:
Albert Bou,
Morgan Thomas,
Sebastian Dittert,
Carles Navarro Ramírez,
Maciej Majewski,
Ye Wang,
Shivam Patel,
Gary Tresadern,
Mazen Ahmad,
Vincent Moens,
Woody Sherman,
Simone Sciabola,
Gianni De Fabritiis
Abstract:
In recent years, reinforcement learning (RL) has emerged as a valuable tool in drug design, offering the potential to propose and optimize molecules with desired properties. However, striking a balance between capabilities, flexibility, reliability, and efficiency remains challenging due to the complexity of advanced RL algorithms and the significant reliance on specialized code. In this work, we…
▽ More
In recent years, reinforcement learning (RL) has emerged as a valuable tool in drug design, offering the potential to propose and optimize molecules with desired properties. However, striking a balance between capabilities, flexibility, reliability, and efficiency remains challenging due to the complexity of advanced RL algorithms and the significant reliance on specialized code. In this work, we introduce ACEGEN, a comprehensive and streamlined toolkit tailored for generative drug design, built using TorchRL, a modern RL library that offers thoroughly tested reusable components. We validate ACEGEN by benchmarking against other published generative modeling algorithms and show comparable or improved performance. We also show examples of ACEGEN applied in multiple drug discovery case studies. ACEGEN is accessible at \url{https://github.com/acellera/acegen-open} and available for use under the MIT license.
△ Less
Submitted 3 June, 2024; v1 submitted 7 May, 2024;
originally announced May 2024.
-
On the Inclusion of Charge and Spin States in Cartesian Tensor Neural Network Potentials
Authors:
Guillem Simeon,
Antonio Mirarchi,
Raul P. Pelaez,
Raimondas Galvelis,
Gianni De Fabritiis
Abstract:
In this letter, we present an extension to TensorNet, a state-of-the-art equivariant Cartesian tensor neural network potential, allowing it to handle charged molecules and spin states without architectural changes or increased costs. By incorporating these attributes, we address input degeneracy issues, enhancing the model's predictive accuracy across diverse chemical systems. This advancement sig…
▽ More
In this letter, we present an extension to TensorNet, a state-of-the-art equivariant Cartesian tensor neural network potential, allowing it to handle charged molecules and spin states without architectural changes or increased costs. By incorporating these attributes, we address input degeneracy issues, enhancing the model's predictive accuracy across diverse chemical systems. This advancement significantly broadens TensorNet's applicability, maintaining its efficiency and accuracy.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
TorchMD-Net 2.0: Fast Neural Network Potentials for Molecular Simulations
Authors:
Raul P. Pelaez,
Guillem Simeon,
Raimondas Galvelis,
Antonio Mirarchi,
Peter Eastman,
Stefan Doerr,
Philipp Thölke,
Thomas E. Markland,
Gianni De Fabritiis
Abstract:
Achieving a balance between computational speed, prediction accuracy, and universal applicability in molecular simulations has been a persistent challenge. This paper presents substantial advancements in the TorchMD-Net software, a pivotal step forward in the shift from conventional force fields to neural network-based potentials. The evolution of TorchMD-Net into a more comprehensive and versatil…
▽ More
Achieving a balance between computational speed, prediction accuracy, and universal applicability in molecular simulations has been a persistent challenge. This paper presents substantial advancements in the TorchMD-Net software, a pivotal step forward in the shift from conventional force fields to neural network-based potentials. The evolution of TorchMD-Net into a more comprehensive and versatile framework is highlighted, incorporating cutting-edge architectures such as TensorNet. This transformation is achieved through a modular design approach, encouraging customized applications within the scientific community. The most notable enhancement is a significant improvement in computational efficiency, achieving a very remarkable acceleration in the computation of energy and forces for TensorNet models, with performance gains ranging from 2-fold to 10-fold over previous iterations. Other enhancements include highly optimized neighbor search algorithms that support periodic boundary conditions and the smooth integration with existing molecular dynamics frameworks. Additionally, the updated version introduces the capability to integrate physical priors, further enriching its application spectrum and utility in research. The software is available at https://github.com/torchmd/torchmd-net.
△ Less
Submitted 23 May, 2024; v1 submitted 27 February, 2024;
originally announced February 2024.
-
Enhancing Protein-Ligand Binding Affinity Predictions using Neural Network Potentials
Authors:
Francesc Sabanes Zariquiey,
Raimondas Galvelis,
Emilio Gallicchio,
John D. Chodera,
Thomas E. Markland,
Gianni de Fabritiis
Abstract:
This letter gives results on improving protein-ligand binding affinity predictions based on molecular dynamics simulations using machine learning potentials with a hybrid neural network potential and molecular mechanics methodology (NNP/MM). We compute relative binding free energies (RBFE) with the Alchemical Transfer Method (ATM) and validate its performance against established benchmarks and fin…
▽ More
This letter gives results on improving protein-ligand binding affinity predictions based on molecular dynamics simulations using machine learning potentials with a hybrid neural network potential and molecular mechanics methodology (NNP/MM). We compute relative binding free energies (RBFE) with the Alchemical Transfer Method (ATM) and validate its performance against established benchmarks and find significant enhancements compared to conventional MM force fields like GAFF2.
△ Less
Submitted 14 February, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
-
PlayMolecule Viewer: a toolkit for the visualization of molecules and other data
Authors:
Mariona Torrens-Fontanals,
Panagiotis Tourlas,
Stefan Doerr,
Gianni De Fabritiis
Abstract:
PlayMolecule Viewer is a web-based data visualization toolkit designed to streamline the exploration of data resulting from structural bioinformatics or computer-aided drug design efforts. By harnessing state-of-the-art web technologies such as WebAssembly, PlayMolecule Viewer integrates powerful Python libraries directly within the browser environment, which enhances its capabilities of managing…
▽ More
PlayMolecule Viewer is a web-based data visualization toolkit designed to streamline the exploration of data resulting from structural bioinformatics or computer-aided drug design efforts. By harnessing state-of-the-art web technologies such as WebAssembly, PlayMolecule Viewer integrates powerful Python libraries directly within the browser environment, which enhances its capabilities of managing multiple types of molecular data. With its intuitive interface, it allows users to easily upload, visualize, select, and manipulate molecular structures and associated data. The toolkit supports a wide range of common structural file formats and offers a variety of molecular representations to cater to different visualization needs. PlayMolecule Viewer is freely accessible at open.playmolecule.org, ensuring accessibility and availability to the scientific community and beyond.
△ Less
Submitted 22 December, 2023;
originally announced December 2023.
-
Navigating protein landscapes with a machine-learned transferable coarse-grained model
Authors:
Nicholas E. Charron,
Felix Musil,
Andrea Guljas,
Yaoyi Chen,
Klara Bonneau,
Aldo S. Pasos-Trejo,
Jacopo Venturin,
Daria Gusew,
Iryna Zaporozhets,
Andreas Krämer,
Clark Templeton,
Atharva Kelkar,
Aleksander E. P. Durumeric,
Simon Olsson,
Adrià Pérez,
Maciej Majewski,
Brooke E. Husic,
Ankit Patel,
Gianni De Fabritiis,
Frank Noé,
Cecilia Clementi
Abstract:
The most popular and universally predictive protein simulation models employ all-atom molecular dynamics (MD), but they come at extreme computational cost. The development of a universal, computationally efficient coarse-grained (CG) model with similar prediction performance has been a long-standing challenge. By combining recent deep learning methods with a large and diverse training set of all-a…
▽ More
The most popular and universally predictive protein simulation models employ all-atom molecular dynamics (MD), but they come at extreme computational cost. The development of a universal, computationally efficient coarse-grained (CG) model with similar prediction performance has been a long-standing challenge. By combining recent deep learning methods with a large and diverse training set of all-atom protein simulations, we here develop a bottom-up CG force field with chemical transferability, which can be used for extrapolative molecular dynamics on new sequences not used during model parametrization. We demonstrate that the model successfully predicts folded structures, intermediates, metastable folded and unfolded basins, and the fluctuations of intrinsically disordered proteins while it is several orders of magnitude faster than an all-atom model. This showcases the feasibility of a universal and computationally efficient machine-learned CG model for proteins.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
OpenMM 8: Molecular Dynamics Simulation with Machine Learning Potentials
Authors:
Peter Eastman,
Raimondas Galvelis,
Raúl P. Peláez,
Charlles R. A. Abreu,
Stephen E. Farr,
Emilio Gallicchio,
Anton Gorenko,
Michael M. Henry,
Frank Hu,
**g Huang,
Andreas Krämer,
Julien Michel,
Joshua A. Mitchell,
Vijay S. Pande,
João PGLM Rodrigues,
Jaime Rodriguez-Guerra,
Andrew C. Simmonett,
Sukrit Singh,
Jason Swails,
Philip Turner,
Yuanqing Wang,
Ivy Zhang,
John D. Chodera,
Gianni De Fabritiis,
Thomas E. Markland
Abstract:
Machine learning plays an important and growing role in molecular simulation. The newest version of the OpenMM molecular dynamics toolkit introduces new features to support the use of machine learning potentials. Arbitrary PyTorch models can be added to a simulation and used to compute forces and energy. A higher-level interface allows users to easily model their molecules of interest with general…
▽ More
Machine learning plays an important and growing role in molecular simulation. The newest version of the OpenMM molecular dynamics toolkit introduces new features to support the use of machine learning potentials. Arbitrary PyTorch models can be added to a simulation and used to compute forces and energy. A higher-level interface allows users to easily model their molecules of interest with general purpose, pretrained potential functions. A collection of optimized CUDA kernels and custom PyTorch operations greatly improves the speed of simulations. We demonstrate these features on simulations of cyclin-dependent kinase 8 (CDK8) and the green fluorescent protein (GFP) chromophore in water. Taken together, these features make it practical to use machine learning to improve the accuracy of simulations at only a modest increase in cost.
△ Less
Submitted 29 November, 2023; v1 submitted 4 October, 2023;
originally announced October 2023.
-
A High-Throughput Steered Molecular Dynamics Study on the Free Energy Profile of Ion Permeation through Gramicidin A
Authors:
Toni Giorgino,
Gianni De Fabritiis
Abstract:
Steered molecular dynamics (SMD) simulations for the calculation of free energies are well suited for high-throughput molecular simulations on a distributed infrastructure due to the simplicity of the setup and parallel granularity of the runs. However, so far, the computational cost limited the estimation of the free energy typically over just a few pullings, thus impeding the evaluation of stati…
▽ More
Steered molecular dynamics (SMD) simulations for the calculation of free energies are well suited for high-throughput molecular simulations on a distributed infrastructure due to the simplicity of the setup and parallel granularity of the runs. However, so far, the computational cost limited the estimation of the free energy typically over just a few pullings, thus impeding the evaluation of statistical uncertainties involved. In this work, we performed two thousand pulls for the permeation of a potassium ion in the gramicidin A pore by all-atom molecular dynamics in order to assess the bidirectional SMD protocol with a proper amount of sampling. The estimated free energy profile still shows a statistical error of several kcal/mol, while the work distributions are estimated to be non-Gaussian at pulling speeds of 10 Å/ns. We discuss the methodology and the confidence intervals in relation to increasing amounts of computed trajectories and how different permeation pathways for the potassium ion, knock-on and sideways, affect the sampling and the free energy estimation.
△ Less
Submitted 21 September, 2023;
originally announced September 2023.
-
Machine Learning Small Molecule Properties in Drug Discovery
Authors:
Nikolai Schapin,
Maciej Majewski,
Alejandro Varela,
Carlos Arroniz,
Gianni De Fabritiis
Abstract:
Machine learning (ML) is a promising approach for predicting small molecule properties in drug discovery. Here, we provide a comprehensive overview of various ML methods introduced for this purpose in recent years. We review a wide range of properties, including binding affinities, solubility, and ADMET (Absorption, Distribution, Metabolism, Excretion, and Toxicity). We discuss existing popular da…
▽ More
Machine learning (ML) is a promising approach for predicting small molecule properties in drug discovery. Here, we provide a comprehensive overview of various ML methods introduced for this purpose in recent years. We review a wide range of properties, including binding affinities, solubility, and ADMET (Absorption, Distribution, Metabolism, Excretion, and Toxicity). We discuss existing popular datasets and molecular descriptors and embeddings, such as chemical fingerprints and graph-based neural networks. We highlight also challenges of predicting and optimizing multiple properties during hit-to-lead and lead optimization stages of drug discovery and explore briefly possible multi-objective optimization techniques that can be used to balance diverse properties while optimizing lead candidates. Finally, techniques to provide an understanding of model predictions, especially for critical decision-making in drug discovery are assessed. Overall, this review provides insights into the landscape of ML models for small molecule property predictions in drug discovery. So far, there are multiple diverse approaches, but their performances are often comparable. Neural networks, while more flexible, do not always outperform simpler models. This shows that the availability of high-quality training data remains crucial for training accurate models and there is a need for standardized benchmarks, additional performance metrics, and best practices to enable richer comparisons between the different techniques and models that can shed a better light on the differences between the many techniques.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
Top-down machine learning of coarse-grained protein force-fields
Authors:
Carles Navarro,
Maciej Majewski,
Gianni de Fabritiis
Abstract:
Develo** accurate and efficient coarse-grained representations of proteins is crucial for understanding their folding, function, and interactions over extended timescales. Our methodology involves simulating proteins with molecular dynamics and utilizing the resulting trajectories to train a neural network potential through differentiable trajectory reweighting. Remarkably, this method requires…
▽ More
Develo** accurate and efficient coarse-grained representations of proteins is crucial for understanding their folding, function, and interactions over extended timescales. Our methodology involves simulating proteins with molecular dynamics and utilizing the resulting trajectories to train a neural network potential through differentiable trajectory reweighting. Remarkably, this method requires only the native conformation of proteins, eliminating the need for labeled data derived from extensive simulations or memory-intensive end-to-end differentiable simulations. Once trained, the model can be employed to run parallel molecular dynamics simulations and sample folding events for proteins both within and beyond the training distribution, showcasing its extrapolation capabilities. By applying Markov State Models, native-like conformations of the simulated proteins can be predicted from the coarse-grained simulations. Owing to its theoretical transferability and ability to use solely experimental static structures as training data, we anticipate that this approach will prove advantageous for develo** new protein force fields and further advancing the study of protein dynamics, folding, and interactions.
△ Less
Submitted 10 October, 2023; v1 submitted 20 June, 2023;
originally announced June 2023.
-
TensorNet: Cartesian Tensor Representations for Efficient Learning of Molecular Potentials
Authors:
Guillem Simeon,
Gianni de Fabritiis
Abstract:
The development of efficient machine learning models for molecular systems representation is becoming crucial in scientific research. We introduce TensorNet, an innovative O(3)-equivariant message-passing neural network architecture that leverages Cartesian tensor representations. By using Cartesian tensor atomic embeddings, feature mixing is simplified through matrix product operations. Furthermo…
▽ More
The development of efficient machine learning models for molecular systems representation is becoming crucial in scientific research. We introduce TensorNet, an innovative O(3)-equivariant message-passing neural network architecture that leverages Cartesian tensor representations. By using Cartesian tensor atomic embeddings, feature mixing is simplified through matrix product operations. Furthermore, the cost-effective decomposition of these tensors into rotation group irreducible representations allows for the separate processing of scalars, vectors, and tensors when necessary. Compared to higher-rank spherical tensor models, TensorNet demonstrates state-of-the-art performance with significantly fewer parameters. For small molecule potential energies, this can be achieved even with a single interaction layer. As a result of all these properties, the model's computational cost is substantially decreased. Moreover, the accurate prediction of vector and tensor molecular quantities on top of potential energies and forces is possible. In summary, TensorNet's framework opens up a new space for the design of state-of-the-art equivariant models.
△ Less
Submitted 30 October, 2023; v1 submitted 10 June, 2023;
originally announced June 2023.
-
TorchRL: A data-driven decision-making library for PyTorch
Authors:
Albert Bou,
Matteo Bettini,
Sebastian Dittert,
Vikash Kumar,
Shagun Sodhani,
Xiaomeng Yang,
Gianni De Fabritiis,
Vincent Moens
Abstract:
PyTorch has ascended as a premier machine learning framework, yet it lacks a native and comprehensive library for decision and control tasks suitable for large development teams dealing with complex real-world data and environments. To address this issue, we propose TorchRL, a generalistic control library for PyTorch that provides well-integrated, yet standalone components. We introduce a new and…
▽ More
PyTorch has ascended as a premier machine learning framework, yet it lacks a native and comprehensive library for decision and control tasks suitable for large development teams dealing with complex real-world data and environments. To address this issue, we propose TorchRL, a generalistic control library for PyTorch that provides well-integrated, yet standalone components. We introduce a new and flexible PyTorch primitive, the TensorDict, which facilitates streamlined algorithm development across the many branches of Reinforcement Learning (RL) and control. We provide a detailed description of the building blocks and an extensive overview of the library across domains and tasks. Finally, we experimentally demonstrate its reliability and flexibility and show comparative benchmarks to demonstrate its computational efficiency. TorchRL fosters long-term support and is publicly available on GitHub for greater reproducibility and collaboration within the research community. The code is open-sourced on GitHub.
△ Less
Submitted 27 November, 2023; v1 submitted 1 June, 2023;
originally announced June 2023.
-
Validation of the Alchemical Transfer Method for the Estimation of Relative Binding Affinities of Molecular Series
Authors:
Francesc Sabanés Zariquiey,
Adrià Pérez,
Maciej Majewski,
Emilio Gallicchio,
Gianni De Fabritiis
Abstract:
The accurate prediction of protein-ligand binding affinities is crucial for drug discovery. Alchemical free energy calculations have become a popular tool for this purpose. However, the accuracy and reliability of these methods can vary depending on the methodology. In this study, we evaluate the performance of a relative binding free energy protocol based on the alchemical transfer method (ATM),…
▽ More
The accurate prediction of protein-ligand binding affinities is crucial for drug discovery. Alchemical free energy calculations have become a popular tool for this purpose. However, the accuracy and reliability of these methods can vary depending on the methodology. In this study, we evaluate the performance of a relative binding free energy protocol based on the alchemical transfer method (ATM), a novel approach based on a coordinate transformation that swaps the positions of two ligands. The results show that ATM matches the performance of more complex free energy perturbation (FEP) methods in terms of Pearson correlation, but with marginally higher mean absolute errors. This study shows that the ATM method is competitive compared to more traditional methods in speed and accuracy and offers the advantage of being applicable with any potential energy function.
△ Less
Submitted 20 March, 2023;
originally announced March 2023.
-
Binding-and-folding recognition of an intrinsically disordered protein using online learning molecular dynamics
Authors:
Pablo Herrera-Nieto,
Adrià Pérez,
Gianni De Fabritiis
Abstract:
Intrinsically disordered proteins participate in many biological processes by folding upon binding with other proteins. However, coupled folding and binding processes are not well understood from an atomistic point of view. One of the main questions is whether folding occurs prior to or after binding. Here we use a novel unbiased high-throughput adaptive sampling approach to reconstruct the bindin…
▽ More
Intrinsically disordered proteins participate in many biological processes by folding upon binding with other proteins. However, coupled folding and binding processes are not well understood from an atomistic point of view. One of the main questions is whether folding occurs prior to or after binding. Here we use a novel unbiased high-throughput adaptive sampling approach to reconstruct the binding and folding between the disordered transactivation domain of \mbox{c-Myb} and the KIX domain of the CREB-binding protein. The reconstructed long-term dynamical process highlights the binding of a short stretch of amino acids on \mbox{c-Myb} as a folded $α$-helix. Leucine residues, specially Leu298 to Leu302, establish initial native contacts that prime the binding and folding of the rest of the peptide, with a mixture of conformational selection on the N-terminal region with an induced fit of the C-terminal.
△ Less
Submitted 20 February, 2023;
originally announced February 2023.
-
Machine Learning Coarse-Grained Potentials of Protein Thermodynamics
Authors:
Maciej Majewski,
Adrià Pérez,
Philipp Thölke,
Stefan Doerr,
Nicholas E. Charron,
Toni Giorgino,
Brooke E. Husic,
Cecilia Clementi,
Frank Noé,
Gianni De Fabritiis
Abstract:
A generalized understanding of protein dynamics is an unsolved scientific problem, the solution of which is critical to the interpretation of the structure-function relationships that govern essential biological processes. Here, we approach this problem by constructing coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics. For training, we bu…
▽ More
A generalized understanding of protein dynamics is an unsolved scientific problem, the solution of which is critical to the interpretation of the structure-function relationships that govern essential biological processes. Here, we approach this problem by constructing coarse-grained molecular potentials based on artificial neural networks and grounded in statistical mechanics. For training, we build a unique dataset of unbiased all-atom molecular dynamics simulations of approximately 9 ms for twelve different proteins with multiple secondary structure arrangements. The coarse-grained models are capable of accelerating the dynamics by more than three orders of magnitude while preserving the thermodynamics of the systems. Coarse-grained simulations identify relevant structural states in the ensemble with comparable energetics to the all-atom systems. Furthermore, we show that a single coarse-grained potential can integrate all twelve proteins and can capture experimental structural features of mutated proteins. These results indicate that machine learning coarse-grained potentials could provide a feasible approach to simulate and understand protein dynamics.
△ Less
Submitted 14 December, 2022;
originally announced December 2022.
-
SPICE, A Dataset of Drug-like Molecules and Peptides for Training Machine Learning Potentials
Authors:
Peter Eastman,
Pavan Kumar Behara,
David L. Dotson,
Raimondas Galvelis,
John E. Herr,
Josh T. Horton,
Yuezhi Mao,
John D. Chodera,
Benjamin P. Pritchard,
Yuanqing Wang,
Gianni De Fabritiis,
Thomas E. Markland
Abstract:
Machine learning potentials are an important tool for molecular simulation, but their development is held back by a shortage of high quality datasets to train them on. We describe the SPICE dataset, a new quantum chemistry dataset for training potentials relevant to simulating drug-like small molecules interacting with proteins. It contains over 1.1 million conformations for a diverse set of small…
▽ More
Machine learning potentials are an important tool for molecular simulation, but their development is held back by a shortage of high quality datasets to train them on. We describe the SPICE dataset, a new quantum chemistry dataset for training potentials relevant to simulating drug-like small molecules interacting with proteins. It contains over 1.1 million conformations for a diverse set of small molecules, dimers, dipeptides, and solvated amino acids. It includes 15 elements, charged and uncharged molecules, and a wide range of covalent and non-covalent interactions. It provides both forces and energies calculated at the ωB97M-D3(BJ)/def2-TZVPPD level of theory, along with other useful quantities such as multipole moments and bond orders. We train a set of machine learning potentials on it and demonstrate that they can achieve chemical accuracy across a broad region of chemical space. It can serve as a valuable resource for the creation of transferable, ready to use potential functions for use in molecular simulations.
△ Less
Submitted 23 November, 2022; v1 submitted 21 September, 2022;
originally announced September 2022.
-
TorchMD-NET: Equivariant Transformers for Neural Network based Molecular Potentials
Authors:
Philipp Thölke,
Gianni De Fabritiis
Abstract:
The prediction of quantum mechanical properties is historically plagued by a trade-off between accuracy and speed. Machine learning potentials have previously shown great success in this domain, reaching increasingly better accuracy while maintaining computational efficiency comparable with classical force fields. In this work we propose TorchMD-NET, a novel equivariant transformer (ET) architectu…
▽ More
The prediction of quantum mechanical properties is historically plagued by a trade-off between accuracy and speed. Machine learning potentials have previously shown great success in this domain, reaching increasingly better accuracy while maintaining computational efficiency comparable with classical force fields. In this work we propose TorchMD-NET, a novel equivariant transformer (ET) architecture, outperforming state-of-the-art on MD17, ANI-1, and many QM9 targets in both accuracy and computational efficiency. Through an extensive attention weight analysis, we gain valuable insights into the black box predictor and show differences in the learned representation of conformers versus conformations sampled from molecular dynamics or normal modes. Furthermore, we highlight the importance of datasets including off-equilibrium conformations for the evaluation of molecular potentials.
△ Less
Submitted 23 April, 2022; v1 submitted 5 February, 2022;
originally announced February 2022.
-
NNP/MM: Accelerating molecular dynamics simulations with machine learning potentials and molecular mechanic
Authors:
Raimondas Galvelis,
Alejandro Varela-Rial,
Stefan Doerr,
Roberto Fino,
Peter Eastman,
Thomas E. Markland,
John D. Chodera,
Gianni De Fabritiis
Abstract:
Machine learning potentials have emerged as a means to enhance the accuracy of biomolecular simulations. However, their application is constrained by the significant computational cost arising from the vast number of parameters compared to traditional molecular mechanics. To tackle this issue, we introduce an optimized implementation of the hybrid method (NNP/MM), which combines neural network pot…
▽ More
Machine learning potentials have emerged as a means to enhance the accuracy of biomolecular simulations. However, their application is constrained by the significant computational cost arising from the vast number of parameters compared to traditional molecular mechanics. To tackle this issue, we introduce an optimized implementation of the hybrid method (NNP/MM), which combines neural network potentials (NNP) and molecular mechanics (MM). This approach models a portion of the system, such as a small molecule, using NNP while employing MM for the remaining system to boost efficiency. By conducting molecular dynamics (MD) simulations on various protein-ligand complexes and metadynamics (MTD) simulations on a ligand, we showcase the capabilities of our implementation of NNP/MM. It has enabled us to increase the simulation speed by 5 times and achieve a combined sampling of one microsecond for each complex, marking the longest simulations ever reported for this class of simulation.
△ Less
Submitted 28 August, 2023; v1 submitted 20 January, 2022;
originally announced January 2022.
-
TorchMD: A deep learning framework for molecular simulations
Authors:
Stefan Doerr,
Maciej Majewsk,
Adrià Pérez,
Andreas Krämer,
Cecilia Clementi,
Frank Noe,
Toni Giorgino,
Gianni De Fabritiis
Abstract:
Molecular dynamics simulations provide a mechanistic description of molecules by relying on empirical potentials. The quality and transferability of such potentials can be improved leveraging data-driven models derived with machine learning approaches. Here, we present TorchMD, a framework for molecular simulations with mixed classical and machine learning potentials. All of force computations inc…
▽ More
Molecular dynamics simulations provide a mechanistic description of molecules by relying on empirical potentials. The quality and transferability of such potentials can be improved leveraging data-driven models derived with machine learning approaches. Here, we present TorchMD, a framework for molecular simulations with mixed classical and machine learning potentials. All of force computations including bond, angle, dihedral, Lennard-Jones and Coulomb interactions are expressed as PyTorch arrays and operations. Moreover, TorchMD enables learning and simulating neural network potentials. We validate it using standard Amber all-atom simulations, learning an ab-initio potential, performing an end-to-end training and finally learning and simulating a coarse-grained model for protein folding. We believe that TorchMD provides a useful tool-set to support molecular simulations of machine learning potentials. Code and data are freely available at \url{github.com/torchmd}.
△ Less
Submitted 22 December, 2020;
originally announced December 2020.
-
Coarse Graining Molecular Dynamics with Graph Neural Networks
Authors:
Brooke E. Husic,
Nicholas E. Charron,
Dominik Lemm,
Jiang Wang,
Adrià Pérez,
Maciej Majewski,
Andreas Krämer,
Yaoyi Chen,
Simon Olsson,
Gianni de Fabritiis,
Frank Noé,
Cecilia Clementi
Abstract:
Coarse graining enables the investigation of molecular dynamics for larger systems and at longer timescales than is possible at atomic resolution. However, a coarse graining model must be formulated such that the conclusions we draw from it are consistent with the conclusions we would draw from a model at a finer level of detail. It has been proven that a force matching scheme defines a thermodyna…
▽ More
Coarse graining enables the investigation of molecular dynamics for larger systems and at longer timescales than is possible at atomic resolution. However, a coarse graining model must be formulated such that the conclusions we draw from it are consistent with the conclusions we would draw from a model at a finer level of detail. It has been proven that a force matching scheme defines a thermodynamically consistent coarse-grained model for an atomistic system in the variational limit. Wang et al. [ACS Cent. Sci. 5, 755 (2019)] demonstrated that the existence of such a variational limit enables the use of a supervised machine learning framework to generate a coarse-grained force field, which can then be used for simulation in the coarse-grained space. Their framework, however, requires the manual input of molecular features upon which to machine learn the force field. In the present contribution, we build upon the advance of Wang et al.and introduce a hybrid architecture for the machine learning of coarse-grained force fields that learns their own features via a subnetwork that leverages continuous filter convolutions on a graph neural network architecture. We demonstrate that this framework succeeds at reproducing the thermodynamics for small biomolecular systems. Since the learned molecular representations are inherently transferable, the architecture presented here sets the stage for the development of machine-learned, coarse-grained force fields that are transferable across molecular systems.
△ Less
Submitted 6 November, 2020; v1 submitted 22 July, 2020;
originally announced July 2020.
-
Guided Exploration with Proximal Policy Optimization using a Single Demonstration
Authors:
Gabriele Libardi,
Gianni De Fabritiis
Abstract:
Solving sparse reward tasks through exploration is one of the major challenges in deep reinforcement learning, especially in three-dimensional, partially-observable environments. Critically, the algorithm proposed in this article uses a single human demonstration to solve hard-exploration problems. We train an agent on a combination of demonstrations and own experience to solve problems with varia…
▽ More
Solving sparse reward tasks through exploration is one of the major challenges in deep reinforcement learning, especially in three-dimensional, partially-observable environments. Critically, the algorithm proposed in this article uses a single human demonstration to solve hard-exploration problems. We train an agent on a combination of demonstrations and own experience to solve problems with variable initial conditions. We adapt this idea and integrate it with the proximal policy optimization (PPO). The agent is able to increase its performance and to tackle harder problems by replaying its own past trajectories prioritizing them based on the obtained reward and the maximum value of the trajectory. We compare different variations of this algorithm to behavioral cloning on a set of hard-exploration tasks in the Animal-AI Olympics environment. To the best of our knowledge, learning a task in a three-dimensional environment with comparable difficulty has never been considered before using only one human demonstration.
△ Less
Submitted 16 June, 2021; v1 submitted 7 July, 2020;
originally announced July 2020.
-
Integrating Distributed Architectures in Highly Modular RL Libraries
Authors:
Albert Bou,
Sebastian Dittert,
Gianni De Fabritiis
Abstract:
Advancing reinforcement learning (RL) requires tools that are flexible enough to easily prototype new methods while avoiding impractically slow experimental turnaround times. To match the first requirement, the most popular RL libraries advocate for highly modular agent composability, which facilitates experimentation and development. To solve challenging environments within reasonable time frames…
▽ More
Advancing reinforcement learning (RL) requires tools that are flexible enough to easily prototype new methods while avoiding impractically slow experimental turnaround times. To match the first requirement, the most popular RL libraries advocate for highly modular agent composability, which facilitates experimentation and development. To solve challenging environments within reasonable time frames, scaling RL to large sampling and computing resources has proved a successful strategy. However, this capability has been so far difficult to combine with modularity. In this work, we explore design choices to allow agent composability both at a local and distributed level of execution. We propose a versatile approach that allows the definition of RL agents at different scales through independent reusable components. We demonstrate experimentally that our design choices allow us to reproduce classical benchmarks, explore multiple distributed architectures, and solve novel and complex environments while giving full control to the user in the agent definition and training scheme definition. We believe this work can provide useful insights to the next generation of RL libraries.
△ Less
Submitted 12 June, 2023; v1 submitted 6 July, 2020;
originally announced July 2020.
-
SkeleDock: A Web Application for Scaffold Docking in PlayMolecule
Authors:
Alejandro Varela-Rial,
Maciej Majewski,
Alberto Cuzzolin,
Gerard Martínez-Rosell,
Gianni De Fabritiis
Abstract:
SkeleDock is a scaffold docking algorithm which uses the structure of a protein-ligand complex as a template to model the binding mode of a chemically similar system. This algorithm was evaluated in the D3R Grand Challenge 4 pose prediction challenge, where it achieved competitive performance. Furthermore, we show that, if crystallized fragments of the target ligand are available, SkeleDock can ou…
▽ More
SkeleDock is a scaffold docking algorithm which uses the structure of a protein-ligand complex as a template to model the binding mode of a chemically similar system. This algorithm was evaluated in the D3R Grand Challenge 4 pose prediction challenge, where it achieved competitive performance. Furthermore, we show that, if crystallized fragments of the target ligand are available, SkeleDock can outperform rDock docking software at predicting the binding mode. This article also addresses the capacity of this algorithm to model macrocycles and deal with scaffold hop**. SkeleDock can be accessed at https://playmolecule.org/SkeleDock/.
△ Less
Submitted 12 May, 2020;
originally announced May 2020.
-
AdaptiveBandit: A multi-armed bandit framework for adaptive sampling in molecular simulations
Authors:
Adrià Pérez,
Pablo Herrera-Nieto,
Stefan Doerr,
Gianni De Fabritiis
Abstract:
Sampling from the equilibrium distribution has always been a major problem in molecular simulations due to the very high dimensionality of conformational space. Over several decades, many approaches have been used to overcome the problem. In particular, we focus on unbiased simulation methods such as parallel and adaptive sampling. Here, we recast adaptive sampling schemes on the basis of multi-ar…
▽ More
Sampling from the equilibrium distribution has always been a major problem in molecular simulations due to the very high dimensionality of conformational space. Over several decades, many approaches have been used to overcome the problem. In particular, we focus on unbiased simulation methods such as parallel and adaptive sampling. Here, we recast adaptive sampling schemes on the basis of multi-armed bandits and develop a novel adaptive sampling algorithm under this framework, \UCB. We test it on multiple simplified potentials and in a protein folding scenario. We find that this framework performs similarly or better in every type of test potentials compared to previous methods. Furthermore, it provides a novel framework to develop new sampling algorithms with better asymptotic characteristics.
△ Less
Submitted 28 February, 2020;
originally announced February 2020.
-
Machine learning for protein folding and dynamics
Authors:
Frank Noé,
Gianni De Fabritiis,
Cecilia Clementi
Abstract:
Many aspects of the study of protein folding and dynamics have been affected by the recent advances in machine learning. Methods for the prediction of protein structures from their sequences are now heavily based on machine learning tools. The way simulations are performed to explore the energy landscape of protein systems is also changing as force-fields are started to be designed by means of mac…
▽ More
Many aspects of the study of protein folding and dynamics have been affected by the recent advances in machine learning. Methods for the prediction of protein structures from their sequences are now heavily based on machine learning tools. The way simulations are performed to explore the energy landscape of protein systems is also changing as force-fields are started to be designed by means of machine learning methods. These methods are also used to extract the essential information from large simulation datasets and to enhance the sampling of rare events such as folding/unfolding transitions. While significant challenges still need to be tackled, we expect these methods to play an important role on the study of protein folding and dynamics in the near future. We discuss here the recent advances on all these fronts and the questions that need to be addressed for machine learning approaches to become mainstream in protein simulation.
△ Less
Submitted 21 November, 2019;
originally announced November 2019.
-
A Scalable Molecular Force Field Parameterization Method Based on Density Functional Theory and Quantum-Level Machine Learning
Authors:
Raimondas Galvelis,
Stefan Doerr,
Joao M. Damas,
Matt J. Harvey,
Gianni De Fabritiis
Abstract:
Fast and accurate molecular force field (FF) parameterization is still an unsolved problem. Accurate FFs are not generally available for all molecules, like novel druglike molecules. While methods based on quantum mechanics (QM) exist to parameterize them with better accuracy, they are computationally expensive and slow, which limits applicability to a small number of molecules. Here, we present a…
▽ More
Fast and accurate molecular force field (FF) parameterization is still an unsolved problem. Accurate FFs are not generally available for all molecules, like novel druglike molecules. While methods based on quantum mechanics (QM) exist to parameterize them with better accuracy, they are computationally expensive and slow, which limits applicability to a small number of molecules. Here, we present an automated FF parameterization method which can utilize either DFT calculations or approximate QM energies produced by different neural network potentials (NNPs), to obtain improved parameters for molecules. We demonstrate that for the case of torchani-ANI-1x NNP, we can parameterize small molecules in a fraction of time compared with an equivalent parameterization using DFT QM calculations while producing more accurate parameters than FF (GAFF2). We expect our method to be of critical importance in computational structure-based drug discovery. The current version is available at PlayMolecule (www.playmolecule.org) and implemented in HTMD, allowing to parameterize molecules with different QM and NNP options.
△ Less
Submitted 3 August, 2019; v1 submitted 16 July, 2019;
originally announced July 2019.
-
Machine Learning of coarse-grained Molecular Dynamics Force Fields
Authors:
Jiang Wang,
Simon Olsson,
Christoph Wehmeyer,
Adria Perez,
Nicholas E. Charron,
Gianni de Fabritiis,
Frank Noe,
Cecilia Clementi
Abstract:
Atomistic or ab-initio molecular dynamics simulations are widely used to predict thermodynamics and kinetics and relate them to molecular structure. A common approach to go beyond the time- and length-scales accessible with such computationally expensive simulations is the definition of coarse-grained molecular models. Existing coarse-graining approaches define an effective interaction potential t…
▽ More
Atomistic or ab-initio molecular dynamics simulations are widely used to predict thermodynamics and kinetics and relate them to molecular structure. A common approach to go beyond the time- and length-scales accessible with such computationally expensive simulations is the definition of coarse-grained molecular models. Existing coarse-graining approaches define an effective interaction potential to match defined properties of high-resolution models or experimental data. In this paper, we reformulate coarse-graining as a supervised machine learning problem. We use statistical learning theory to decompose the coarse-graining error and cross-validation to select and compare the performance of different models. We introduce CGnets, a deep learning approach, that learns coarse-grained free energy functions and can be trained by a force matching scheme. CGnets maintain all physically relevant invariances and allow one to incorporate prior physics knowledge to avoid sampling of unphysical structures. We show that CGnets can capture all-atom explicit-solvent free energy surfaces with models using only a few coarse-grained beads and no solvent, while classical coarse-graining methods fail to capture crucial features of the free energy surface. Thus, CGnets are able to capture multi-body terms that emerge from the dimensionality reduction.
△ Less
Submitted 3 April, 2019; v1 submitted 4 December, 2018;
originally announced December 2018.
-
Simulations meet Machine Learning in Structural Biology
Authors:
Adrià Pérez,
Gerard Martínez-Rosell,
Gianni De Fabritiis
Abstract:
Classical molecular dynamics (MD) simulations will be able to reach sampling in the second timescale within five years, producing petabytes of simulation data at current force field accuracy. Notwithstanding this, MD will still be in the regime of low-throughput, high-latency predictions with average accuracy. We envisage that machine learning (ML) will be able to solve both the accuracy and time-…
▽ More
Classical molecular dynamics (MD) simulations will be able to reach sampling in the second timescale within five years, producing petabytes of simulation data at current force field accuracy. Notwithstanding this, MD will still be in the regime of low-throughput, high-latency predictions with average accuracy. We envisage that machine learning (ML) will be able to solve both the accuracy and time-to-prediction problem by learning predictive models using expensive simulation data. The synergies between classical, quantum simulations and ML methods, such as artificial neural networks, have the potential to drastically reshape the way we make predictions in computational structural biology and drug discovery.
△ Less
Submitted 19 October, 2018;
originally announced October 2018.
-
Dimensionality reduction methods for molecular simulations
Authors:
Stefan Doerr,
Igor Ariz-Extreme,
Matthew J. Harvey,
Gianni De Fabritiis
Abstract:
Molecular simulations produce very high-dimensional data-sets with millions of data points. As analysis methods are often unable to cope with so many dimensions, it is common to use dimensionality reduction and clustering methods to reach a reduced representation of the data. Yet these methods often fail to capture the most important features necessary for the construction of a Markov model. Here…
▽ More
Molecular simulations produce very high-dimensional data-sets with millions of data points. As analysis methods are often unable to cope with so many dimensions, it is common to use dimensionality reduction and clustering methods to reach a reduced representation of the data. Yet these methods often fail to capture the most important features necessary for the construction of a Markov model. Here we demonstrate the results of various dimensionality reduction methods on two simulation data-sets, one of protein folding and another of protein-ligand binding. The methods tested include a k-means clustering variant, a non-linear auto encoder, principal component analysis and tICA. The dimension-reduced data is then used to estimate the implied timescales of the slowest process by a Markov state model analysis to assess the quality of the projection. The projected dimensions learned from the data are visualized to demonstrate which conformations the various methods choose to represent the molecular process.
△ Less
Submitted 2 November, 2017; v1 submitted 29 October, 2017;
originally announced October 2017.
-
Identification of slow molecular order parameters for Markov model construction
Authors:
Guillermo Perez-Hernandez,
Fabian Paul,
Toni Giorgino,
Gianni de Fabritiis,
Frank Noé
Abstract:
A goal in the kinetic characterization of a macromolecular system is the description of its slow relaxation processes, involving (i) identification of the structural changes involved in these processes, and (ii) estimation of the rates or timescales at which these slow processes occur. Most of the approaches to this task, including Markov models, Master-equation models, and kinetic network models,…
▽ More
A goal in the kinetic characterization of a macromolecular system is the description of its slow relaxation processes, involving (i) identification of the structural changes involved in these processes, and (ii) estimation of the rates or timescales at which these slow processes occur. Most of the approaches to this task, including Markov models, Master-equation models, and kinetic network models, start by discretizing the high-dimensional state space and then characterize relaxation processes in terms of the eigenvectors and eigenvalues of a discrete transition matrix. The practical success of such an approach depends very much on the ability to finely discretize the slow order parameters. How can this task be achieved in a high-dimensional configuration space without relying on subjective guesses of the slow order parameters? In this paper, we use the variational principle of conformation dynamics to derive an optimal way of identifying the "slow subspace" of a large set of prior order parameters - either generic internal coordinates (distances and dihedral angles), or a user-defined set of parameters. It is shown that a method to identify this slow subspace exists in statistics: the time-lagged independent component analysis (TICA). Furthermore, optimal indicators-order parameters indicating the progress of the slow transitions and thus may serve as reaction coordinates-are readily identified. We demonstrate that the slow subspace is well suited to construct accurate kinetic models of two sets of molecular dynamics simulations, the 6-residue fluorescent peptide MR121-GSGSW and the 30-residue natively disordered peptide KID. The identified optimal indicators reveal the structural changes associated with the slow processes of the molecular system under analysis.
△ Less
Submitted 26 February, 2013;
originally announced February 2013.
-
Statistical Analysis of Global Connectivity and Activity Distributions in Cellular Networks
Authors:
Adrián López García de Lomana,
Qasim K. Beg,
G. de Fabritiis,
Jordi Villà-Freixa
Abstract:
Various molecular interaction networks have been claimed to follow power-law decay for their global connectivity distribution. It has been proposed that there may be underlying generative models that explain this heavy-tailed behavior by self-reinforcement processes such as classical or hierarchical scale-free network models. Here we analyze a comprehensive data set of protein-protein and transcri…
▽ More
Various molecular interaction networks have been claimed to follow power-law decay for their global connectivity distribution. It has been proposed that there may be underlying generative models that explain this heavy-tailed behavior by self-reinforcement processes such as classical or hierarchical scale-free network models. Here we analyze a comprehensive data set of protein-protein and transcriptional regulatory interaction networks in yeast, an E. coli metabolic network, and gene activity profiles for different metabolic states in both organisms. We show that in all cases the networks have a heavy-tailed distribution, but most of them present significant differences from a power-law model according to a stringent statistical test. Those few data sets that have a statistically significant fit with a power-law model follow other distributions equally well. Thus, while our analysis supports that both global connectivity interaction networks and activity distributions are heavy-tailed, they are not generally described by any specific distribution model, leaving space for further inferences on generative models.
△ Less
Submitted 19 April, 2010;
originally announced April 2010.
-
ACEMD: Accelerating bio-molecular dynamics in the microsecond time-scale
Authors:
M. J. Harvey,
G. Giupponi,
G. De Fabritiis
Abstract:
The high arithmetic performance and intrinsic parallelism of recent graphical processing units (GPUs) can offer a technological edge for molecular dynamics simulations. ACEMD is a production-class bio-molecular dynamics (MD) simulation program designed specifically for GPUs which is able to achieve supercomputing scale performance of 40 nanoseconds/day for all-atom protein systems with over 23,0…
▽ More
The high arithmetic performance and intrinsic parallelism of recent graphical processing units (GPUs) can offer a technological edge for molecular dynamics simulations. ACEMD is a production-class bio-molecular dynamics (MD) simulation program designed specifically for GPUs which is able to achieve supercomputing scale performance of 40 nanoseconds/day for all-atom protein systems with over 23,000 atoms. We illustrate the characteristics of the code, its validation and performance. We also run a microsecond-long trajectory for an all-atom molecular system in explicit TIP3P water on a single workstation computer equipped with just 3 GPUs. This performance on cost effective hardware allows ACEMD to reach microsecond timescales routinely with important implications in terms of scientific applications.
△ Less
Submitted 4 February, 2009;
originally announced February 2009.
-
A hybrid method coupling fluctuating hydrodynamics and molecular dynamics for the simulation of macromolecules
Authors:
G. Giupponi,
G. De Fabritiis,
P. V. Coveney
Abstract:
We present a hybrid computational method for simulating the dynamics of macromolecules in solution which couples a mesoscale solver for the fluctuating hydrodynamics (FH) equations with molecular dynamics to describe the macromolecule. The two models interact through a dissipative Stokesian term first introduced by Ahlrichs and Dünweg [J. Chem. Phys. {\bf 111}, 8225 (1999)]. We show that our met…
▽ More
We present a hybrid computational method for simulating the dynamics of macromolecules in solution which couples a mesoscale solver for the fluctuating hydrodynamics (FH) equations with molecular dynamics to describe the macromolecule. The two models interact through a dissipative Stokesian term first introduced by Ahlrichs and Dünweg [J. Chem. Phys. {\bf 111}, 8225 (1999)]. We show that our method correctly captures the static and dynamical properties of polymer chains as predicted by the Zimm model. In particular, we show that the static conformations are best described when the ratio $\fracσ{b}=0.6$, where $σ$ is the Lennard-Jones length parameter and $b$ is the monomer bond length. We also find that the decay of the Rouse modes' autocorrelation function is better described with an analytical correction suggested by Ahlrichs and Dünweg. Our FH solver permits us to treat the fluid equation of state and transport parameters as direct simulation parameters. The expected independence of the chain dynamics on various choices of fluid equation of state and bulk viscosity is recovered, while excellent agreement is found for the temperature and shear viscosity dependence of centre of mass diffusion between simulation results and predictions of the Zimm model. We find that Zimm model approximations start to fail when the Schmidt number $Sc \lessapprox 30$. Finally, we investigate the importance of fluid fluctuations and show that using the preaveraged approximation for the hydrodynamic tensor leads to around 3% error in the diffusion coefficient for a polymer chain when the fluid discretization size is greater than $50Å$.
△ Less
Submitted 4 March, 2007;
originally announced March 2007.
-
Fluctuating hydrodynamic modelling of fluids at the nanoscale
Authors:
G. De Fabritiis,
M. Serrano,
R. Delgado-Buscalioni,
P. V. Coveney
Abstract:
A good representation of mesoscopic fluids is required to combine with molecular simulations at larger length and time scales (De Fabritiis {\it et. al}, Phys. Rev. Lett. 97, 134501 (2006)). However, accurate computational models of the hydrodynamics of nanoscale molecular assemblies are lacking, at least in part because of the stochastic character of the underlying fluctuating hydrodynamic equa…
▽ More
A good representation of mesoscopic fluids is required to combine with molecular simulations at larger length and time scales (De Fabritiis {\it et. al}, Phys. Rev. Lett. 97, 134501 (2006)). However, accurate computational models of the hydrodynamics of nanoscale molecular assemblies are lacking, at least in part because of the stochastic character of the underlying fluctuating hydrodynamic equations. Here we derive a finite volume discretization of the compressible isothermal fluctuating hydrodynamic equations over a regular grid in the Eulerian reference system. We apply it to fluids such as argon at arbitrary densities and water under ambient conditions. To that end, molecular dynamics simulations are used to derive the required fluid properties. The equilibrium state of the model is shown to be thermodynamically consistent and correctly reproduces linear hydrodynamics including relaxation of sound and shear modes. We also consider non-equilibrium states involving diffusion and convection in cavities with no-slip boundary conditions.
△ Less
Submitted 23 December, 2006;
originally announced December 2006.
-
Performance of the Cell processor for biomolecular simulations
Authors:
G. De Fabritiis
Abstract:
The new Cell processor represents a turning point for computing intensive applications. Here, I show that for molecular dynamics it is possible to reach an impressive sustained performance in excess of 30 Gflops with a peak of 45 Gflops for the non-bonded force calculations, over one order of magnitude faster than a single core standard processor.
The new Cell processor represents a turning point for computing intensive applications. Here, I show that for molecular dynamics it is possible to reach an impressive sustained performance in excess of 30 Gflops with a peak of 45 Gflops for the non-bonded force calculations, over one order of magnitude faster than a single core standard processor.
△ Less
Submitted 1 March, 2007; v1 submitted 21 November, 2006;
originally announced November 2006.
-
Multiscale modelling of liquids with molecular specificity
Authors:
G. De Fabritiis,
R. Delgado-Buscalioni,
P. V. Coveney
Abstract:
The separation between molecular and mesoscopic length and time scales poses a severe limit to molecular simulations of mesoscale phenomena. We describe a hybrid multiscale computational technique which address this problem by kee** the full molecular nature of the system where it is of interest and coarse-graining it elsewhere. This is made possible by coupling molecular dynamics with a mesos…
▽ More
The separation between molecular and mesoscopic length and time scales poses a severe limit to molecular simulations of mesoscale phenomena. We describe a hybrid multiscale computational technique which address this problem by kee** the full molecular nature of the system where it is of interest and coarse-graining it elsewhere. This is made possible by coupling molecular dynamics with a mesoscopic description of realistic liquids based on Landau's fluctuating hydrodynamics. We show that our scheme correctly couples hydrodynamics and that fluctuations, at both the molecular and continuum levels, are thermodynamically consistent. Hybrid simulations of sound waves in bulk water and reflected by a lipid monolayer are presented as illustrations of the scheme.
△ Less
Submitted 23 August, 2006;
originally announced August 2006.
-
Coupled applications on distributed resources
Authors:
P. V. Coveney,
G. De Fabritiis,
M. J. Harvey,
S. M. Pickles,
A. R. Porter
Abstract:
Coupled models are set to become increasingly important in all aspects of science and engineering as tools with which to study complex systems in an integrated manner. Such coupled, hybrid simulations typically communicate data between the component models of which they are comprised relatively infrequently, and so a Grid is expected to present an ideal architecture on which to run them. In the…
▽ More
Coupled models are set to become increasingly important in all aspects of science and engineering as tools with which to study complex systems in an integrated manner. Such coupled, hybrid simulations typically communicate data between the component models of which they are comprised relatively infrequently, and so a Grid is expected to present an ideal architecture on which to run them. In the present paper, we describe a simple, flexible and extensible architecture for a two-component hybrid molecular-continuum coupled model (hybrid MD). We discuss its deployment on distributed resources and the extensions to the RealityGrid computational-steering system to handle coupled models.
△ Less
Submitted 19 May, 2006;
originally announced May 2006.
-
A stochastic Trotter integration scheme for dissipative particle dynamics
Authors:
M. Serrano,
G. De Fabritiis,
P. Español,
P. V. Coveney
Abstract:
In this article we show in details the derivation of an integration scheme for the dissipative particle dynamic model (DPD) using the stochastic Trotter formula [De Fabritiis et al., Physica A, 361, 429 (2006)]. We explain some subtleties due to the stochastic character of the equations and exploit analyticity in some interesting parts of the dynamics. The DPD-Trotter integrator demonstrates the…
▽ More
In this article we show in details the derivation of an integration scheme for the dissipative particle dynamic model (DPD) using the stochastic Trotter formula [De Fabritiis et al., Physica A, 361, 429 (2006)]. We explain some subtleties due to the stochastic character of the equations and exploit analyticity in some interesting parts of the dynamics. The DPD-Trotter integrator demonstrates the inexistence of spurious spatial correlations in the radial distribution function for an ideal gas equation of state. We also compare our numerical integrator to other available DPD integration schemes.
△ Less
Submitted 22 February, 2006;
originally announced February 2006.
-
Determination of the chemical potential using energy-biased sampling
Authors:
R. Delgado-Buscalioni,
G. De Fabritiis,
P. V. Coveney
Abstract:
An energy-biased method to evaluate ensemble averages requiring test-particle insertion is presented. The method is based on biasing the sampling within the subdomains of the test-particle configurational space with energies smaller than a given value freely assigned. These energy-wells are located via unbiased random insertion over the whole configurational space and are sampled using the so ca…
▽ More
An energy-biased method to evaluate ensemble averages requiring test-particle insertion is presented. The method is based on biasing the sampling within the subdomains of the test-particle configurational space with energies smaller than a given value freely assigned. These energy-wells are located via unbiased random insertion over the whole configurational space and are sampled using the so called Hit&Run algorithm, which uniformly samples compact regions of any shape immersed in a space of arbitrary dimensions. Because the bias is defined in terms of the energy landscape it can be exactly corrected to obtain the unbiased distribution. The test-particle energy distribution is then combined with the Bennett relation for the evaluation of the chemical potential. We apply this protocol to a system with relatively small probability of low-energy test-particle insertion, liquid argon at high density and low temperature, and show that the energy-biased Bennett method is around five times more efficient than the standard Bennett method. A similar performance gain is observed in the reconstruction of the energy distribution.
△ Less
Submitted 17 June, 2005;
originally announced June 2005.
-
Efficient numerical integrators for stochastic models
Authors:
G. De Fabritiis,
M. Serrano,
P. Español,
P. V. Coveney
Abstract:
The efficient simulation of models defined in terms of stochastic differential equations (SDEs) depends critically on an efficient integration scheme. In this article, we investigate under which conditions the integration schemes for general SDEs can be derived using the Trotter expansion. It follows that, in the stochastic case, some care is required in splitting the stochastic generator. We te…
▽ More
The efficient simulation of models defined in terms of stochastic differential equations (SDEs) depends critically on an efficient integration scheme. In this article, we investigate under which conditions the integration schemes for general SDEs can be derived using the Trotter expansion. It follows that, in the stochastic case, some care is required in splitting the stochastic generator. We test the Trotter integrators on an energy-conserving Brownian model and derive a new numerical scheme for dissipative particle dynamics. We find that the stochastic Trotter scheme provides a mathematically correct and easy-to-use method which should find wide applicability.
△ Less
Submitted 18 October, 2005; v1 submitted 11 February, 2005;
originally announced February 2005.
-
Energy controlled insertion of polar molecules in dense fluids
Authors:
Gianni De Fabritiis,
Rafael Delgado-Buscalioni,
Peter V. Coveney
Abstract:
We present a method to search low energy configurations of polar molecules in the complex potential energy surfaces associated with dense fluids. The search is done in the configurational space of the translational and rotational degrees of freedom of the molecule, combining steepest-descent and Newton-Raphson steps which embed information on the average sizes of the potential energy wells obtai…
▽ More
We present a method to search low energy configurations of polar molecules in the complex potential energy surfaces associated with dense fluids. The search is done in the configurational space of the translational and rotational degrees of freedom of the molecule, combining steepest-descent and Newton-Raphson steps which embed information on the average sizes of the potential energy wells obtained from prior inspection of the liquid structure. We perform a molecular dynamics simulation of a liquid water shell which demonstrates that the method enables fast and energy-controlled water molecule insertion in aqueous environments. The algorithm finds low energy configurations of incoming water molecules around three orders of magnitude faster than direct random insertion.
This method is an important step towards dynamic simulations of open systems and it may also prove useful for energy-biased ensemble average calculations of the chemical potential.
△ Less
Submitted 2 November, 2004;
originally announced November 2004.
-
On size and growth of business firms
Authors:
G. De Fabritiis,
F. Pammolli,
M. Riccaboni
Abstract:
We study size and growth distributions of products and business firms in the context of a given industry. Firm size growth is analyzed in terms of two basic mechanisms, i.e. the increase of the number of new elementary business units and their size growth. We find a power-law relationship between size and the variance of growth rates for both firms and products, with an exponent between -0.17 an…
▽ More
We study size and growth distributions of products and business firms in the context of a given industry. Firm size growth is analyzed in terms of two basic mechanisms, i.e. the increase of the number of new elementary business units and their size growth. We find a power-law relationship between size and the variance of growth rates for both firms and products, with an exponent between -0.17 and -0.15, with a remarkable stability upon aggregation. We then introduce a simple and general model of proportional growth for both the number of firm independent constituent units and their size, which conveys a good representation of the empirical evidences. This general and plausible generative process can account for the observed scaling in a wide variety of economic and industrial systems. Our findings contribute to shed light on the mechanisms that sustain economic growth in terms of the relationships between the size of economic entities and the number and size distribution of their elementary components.
△ Less
Submitted 3 July, 2003;
originally announced July 2003.
-
Dynamical geometry for multiscale dissipative particle dynamics
Authors:
G. De Fabritiis,
P. V. Coveney
Abstract:
In this paper, we review the computational aspects of a multiscale dissipative particle dynamics model for complex fluid simulations based on the feature-rich geometry of the Voronoi tessellation. The geometrical features of the model are critical since the mesh is directly connected to the physics by the interpretation of the Voronoi volumes of the tessellation as coarse-grained fluid clusters.…
▽ More
In this paper, we review the computational aspects of a multiscale dissipative particle dynamics model for complex fluid simulations based on the feature-rich geometry of the Voronoi tessellation. The geometrical features of the model are critical since the mesh is directly connected to the physics by the interpretation of the Voronoi volumes of the tessellation as coarse-grained fluid clusters. The Voronoi tessellation is maintained dynamically in time to model the fluid in the Lagrangian frame of reference, including imposition of periodic boundary conditions. Several algorithms to construct and maintain the periodic Voronoi tessellations are reviewed in two and three spatial dimensions and their parallel performance discussed. The insertion of polymers and colloidal particles in the fluctuating hydrodynamic solvent is described using surface boundaries.
△ Less
Submitted 21 January, 2003;
originally announced January 2003.
-
Foundations of Dissipative Particle Dynamics
Authors:
Eirik G. Flekkoy,
Peter V. Coveney,
Gianni De Fabritiis
Abstract:
We derive a mesoscopic modeling and simulation technique that is very close to the technique known as dissipative particle dynamics. The model is derived from molecular dynamics by means of a systematic coarse-graining procedure. Thus the rules governing our new form of dissipative particle dynamics reflect the underlying molecular dynamics; in particular all the underlying conservation laws car…
▽ More
We derive a mesoscopic modeling and simulation technique that is very close to the technique known as dissipative particle dynamics. The model is derived from molecular dynamics by means of a systematic coarse-graining procedure. Thus the rules governing our new form of dissipative particle dynamics reflect the underlying molecular dynamics; in particular all the underlying conservation laws carry over from the microscopic to the mesoscopic descriptions. Whereas previously the dissipative particles were spheres of fixed size and mass, now they are defined as cells on a Voronoi lattice with variable masses and sizes. This Voronoi lattice arises naturally from the coarse-graining procedure which may be applied iteratively and thus represents a form of renormalisation-group map**. It enables us to select any desired local scale for the mesoscopic description of a given problem. Indeed, the method may be used to deal with situations in which several different length scales are simultaneously present. Simulations carried out with the present scheme show good agreement with theoretical predictions for the equilibrium behavior.
△ Less
Submitted 11 February, 2000;
originally announced February 2000.
-
Discrete random walk models for symmetric Levy-Feller diffusion processes
Authors:
Rudolf Gorenflo,
Gianni De Fabritiis,
Francesco Mainardi
Abstract:
We propose a variety of models of random walk, discrete in space and time, suitable for simulating stable random variables of arbitrary index $α$ ($0< α\le 2$), in the symmetric case. We show that by properly scaled transition to vanishing space and time steps our random walk models converge to the corresponding continuous Markovian stochastic processes, that we refer to as Levy-Feller diffusion…
▽ More
We propose a variety of models of random walk, discrete in space and time, suitable for simulating stable random variables of arbitrary index $α$ ($0< α\le 2$), in the symmetric case. We show that by properly scaled transition to vanishing space and time steps our random walk models converge to the corresponding continuous Markovian stochastic processes, that we refer to as Levy-Feller diffusion processes.
△ Less
Submitted 17 March, 1999;
originally announced March 1999.