-
Enabling Inverse Design in Chemical Compound Space: Mapping Quantum Properties to Structures for Small Organic Molecules
Authors:
Alessio Fallani,
Leonardo Medrano Sandonas,
Alexandre Tkatchenko
Abstract:
Computer-driven molecular design combines the principles of chemistry, physics, and artificial intelligence to identify novel chemical compounds and materials with desired properties for a specific application. In particular, quantum-mechanical (QM) methods combined with machine learning (ML) techniques have accelerated the estimation of accurate molecular properties, providing a direct mapping from 3D molecular structures to their properties. However, the development of reliable and efficient methodologies to enable \emph{inverse mapping} in chemical space is a long-standing challenge that has not been accomplished yet. Here, we address this challenge by demonstrating the possibility of parametrizing a given chemical space with a finite set of extensive and intensive QM properties. In doing so, we develop a proof-of-concept implementation that combines a Variational Auto-Encoder (VAE) trained on molecular structures with a property encoder designed to learn the latent representation from a set of QM properties. The result of this joint architecture is a common latent space representation for both structures and properties, which enables property-to-structure mapping for small drug-like molecules contained in the QM7-X dataset. We illustrate the capabilities of our approach by conditional generation of \emph{de novo} molecular structures with targeted properties, transition path interpolation for chemical reactions as well as insights into property-structure relationships. Our findings thus provide a proof-of-principle demonstration aiming to enable the inverse property-to-structure design in diverse chemical spaces.
Submitted 1 September, 2023;
originally announced September 2023.
-
Molecules in Environments: Towards Systematic Quantum Embedding of Electrons and Drude Oscillators
Authors:
Matej Ditte,
Matteo Barborini,
Leonardo Medrano Sandonas,
Alexandre Tkatchenko
Abstract:
We develop a quantum embedding method that enables accurate and efficient treatment of interactions between molecules and an environment, while explicitly including many-body correlations. The molecule is composed of classical nuclei and quantum electrons, whereas the environment is modeled via charged quantum harmonic oscillators. We construct a general Hamiltonian and introduce a variational ansatz for the correlated ground state of the fully interacting molecule/environment system. This wavefunction is optimized via variational Monte Carlo and the ground state energy is subsequently estimated through diffusion Monte Carlo. The proposed scheme allows an explicit many-body treatment of electrostatic, polarization, and dispersion interactions between the molecule and the environment. We study solvation energies and excitation energies of benzene derivatives, obtaining excellent agreement with explicit ab initio calculations and experiment.
Submitted 28 March, 2023;
originally announced March 2023.
-
Accurate Machine Learned Quantum-Mechanical Force Fields for Biomolecular Simulations
Authors:
Oliver T. Unke,
Martin Stöhr,
Stefan Ganscha,
Thomas Unterthiner,
Hartmut Maennel,
Sergii Kashubin,
Daniel Ahlin,
Michael Gastegger,
Leonardo Medrano Sandonas,
Alexandre Tkatchenko,
Klaus-Robert Müller
Abstract:
Molecular dynamics (MD) simulations allow atomistic insights into chemical and biological processes. Accurate MD simulations require computationally demanding quantum-mechanical calculations, being practically limited to short timescales and few atoms. For larger systems, efficient, but much less reliable empirical force fields are used. Recently, machine learned force fields (MLFFs) emerged as an alternative means to execute MD simulations, offering similar accuracy as ab initio methods at orders-of-magnitude speedup. Until now, MLFFs mainly capture short-range interactions in small molecules or periodic materials, due to the increased complexity of constructing models and obtaining reliable reference data for large molecules, where long-ranged many-body effects become important. This work proposes a general approach to constructing accurate MLFFs for large-scale molecular simulations (GEMS) by training on "bottom-up" and "top-down" molecular fragments of varying size, from which the relevant physicochemical interactions can be learned. GEMS is applied to study the dynamics of alanine-based peptides and the 46-residue protein crambin in aqueous solution, allowing nanosecond-scale MD simulations of >25k atoms at essentially ab initio quality. Our findings suggest that structural motifs in peptides and proteins are more flexible than previously thought, indicating that simulations at ab initio accuracy might be necessary to understand dynamic biomolecular processes such as protein (mis)folding, drug-protein binding, or allosteric regulation.
Submitted 17 May, 2022;
originally announced May 2022.
-
QM7-X: A comprehensive dataset of quantum-mechanical properties spanning the chemical space of small organic molecules
Authors:
Johannes Hoja,
Leonardo Medrano Sandonas,
Brian G. Ernst,
Alvaro Vazquez-Mayagoitia,
Robert A. DiStasio Jr.,
Alexandre Tkatchenko
Abstract:
We introduce QM7-X, a comprehensive dataset of 42 physicochemical properties for $\approx$ 4.2 M equilibrium and non-equilibrium structures of small organic molecules with up to seven non-hydrogen (C, N, O, S, Cl) atoms. To span this fundamentally important region of chemical compound space (CCS), QM7-X includes an exhaustive sampling of (meta-)stable equilibrium structures - comprised of constitutional/structural isomers and stereoisomers, e.g., enantiomers and diastereomers (including cis-/trans- and conformational isomers) - as well as 100 non-equilibrium structural variations thereof to reach a total of $\approx$ 4.2 M molecular structures. Computed at the tightly converged quantum-mechanical PBE0+MBD level of theory, QM7-X contains global (molecular) and local (atom-in-a-molecule) properties ranging from ground state quantities (such as atomization energies and dipole moments) to response quantities (such as polarizability tensors and dispersion coefficients). By providing a systematic, extensive, and tightly-converged dataset of quantum-mechanically computed physicochemical properties, we expect that QM7-X will play a critical role in the development of next-generation machine-learning based models for exploring greater swaths of CCS and performing in silico design of molecules with targeted properties.
Submitted 26 June, 2020;
originally announced June 2020.
-
Accurate Many-Body Repulsive Potentials for Density-Functional Tight-Binding from Deep Tensor Neural Networks
Authors:
Martin Stöhr,
Leonardo Medrano Sandonas,
Alexandre Tkatchenko
Abstract:
We combine density-functional tight-binding (DFTB) with deep tensor neural networks (DTNN) to maximize the strengths of both approaches in predicting structural, energetic, and vibrational molecular properties. The DTNN is used to learn a non-linear model for the localized many-body interatomic repulsive energy, which so far has been treated in an atom-pairwise manner in DFTB. Substantially improving upon standard DFTB and DTNN, the resulting DFTB-NN$_{\sf{rep}}$ model yields accurate predictions of atomization and isomerization energies, equilibrium geometries, vibrational frequencies and dihedral rotation profiles for a large variety of organic molecules compared to the hybrid DFT-PBE0 functional. Our results highlight the high potential of combining semi-empirical electronic-structure methods with physically-motivated machine learning approaches for predicting localized many-body interactions. We conclude by discussing future advancements of the DFTB-NN$_{\sf{rep}}$ approach that could enable chemically accurate electronic-structure calculations for systems with tens of thousands of atoms.
Submitted 18 June, 2020;
originally announced June 2020.
-
Electron transport through self-assembled monolayers of tripeptides
Authors:
E. Mervinetsky,
I. Alshanski,
S. Lenfant,
D. Guerin,
L. Medrano Sandonas,
A. Dianat,
R. Gutierrez,
G. Cuniberti,
M. Hurevich,
S. Yitzchaik,
D. Vuillaume
Abstract:
We report how the electron transport through a solid-state metal/Gly-Gly-His tripeptide (GGH) monolayer/metal junction and the metal/GGH work function are modified by the GGH complexation with Cu2+ ions. Conducting AFM is used to measure the current-voltage histograms. The work function is characterized by combining macroscopic Kelvin probe and Kelvin probe force microscopy at the nanoscale. We observe that the Cu2+ ion complexation with the GGH monolayer is highly dependent on the molecular surface density and results in opposite trends. In the case of a high-density monolayer, the conformational changes are hindered by the proximity of the neighboring peptides, hence forming an insulating layer in response to copper complexation, whereas the slightly lower-density monolayers allow for the conformational change to a looped peptide wrapping the Cu ion, which results in a more conductive monolayer. Copper-ion complexation to the high- and low-density monolayers systematically induces an increase of the work functions. Copper-ion complexation to the low-density monolayer induces an increase of electron transport efficiency, while the copper-ion complexation to the high-density monolayer results in a slight decrease of electron transport. Both of the observed trends are in agreement with first-principles calculations. Copper complexed to the low-density GGH monolayer induces a new gap state slightly above the Au Fermi energy that is absent in the high-density monolayer.
Submitted 9 April, 2019;
originally announced April 2019.
-
Molecular and Ionic Dipole Effects on the Electronic Properties of Si/SiO2 Grafted Alkylamine Monolayers
Authors:
Alina Gankin,
Ruthy Sfez,
Evgeniy Mervinetsky,
Jörg Buchwald,
Arezoo Dianat,
Leonardo Medrano Sandonas,
Rafael Gutierrez,
Gianaurelio Cuniberti,
Shlomo Yitzchaik
Abstract:
In this work, we demonstrate the tunability of the electronic properties of Si/SiO2 substrates by molecular and ionic surface modifications. The changes in electronic properties, such as the work function (WF) and electron affinity (EA), were experimentally measured by the contact potential difference (CPD) technique and theoretically supported by DFT calculations. We attribute these molecular electronic effects mainly to the variations of molecular and surface dipoles of the ionic and neutral species. We have previously shown that for the alkylhalide monolayers, changing the tail group from Cl to I decreased the work function of the substrate. Here we report on the opposite trend of WF changes, i.e., an increase of the WF, obtained by using the anions of those halides from Cl$^{-}$ to I$^{-}$. This trend was observed on substrates modified with self-assembled alkylammonium halide (-NH3$^{+}$ X$^{-}$, where X$^{-}$=Cl$^{-}$, Br$^{-}$, I$^{-}$) monolayers. The monolayer formation was supported by ellipsometry measurements, X-ray photoelectron spectroscopy, and atomic force microscopy. Comparison of the theoretical and experimental data suggests that the ionic surface dipole depends mainly on the polarizability and the position of the counter halide anion along with the organization and packaging of the layer. The described ionic modification can be easily used for facile tailoring and design of the electronic properties of Si/SiO2 substrates for various device applications.
Submitted 1 October, 2018;
originally announced October 2018.