Search | arXiv e-print repository

arXiv:2405.14925 [pdf, other]

PILOT: Equivariant diffusion for pocket conditioned de novo ligand generation with multi-objective guidance via importance sampling

Authors: Julian Cremer, Tuan Le, Frank Noé, Djork-Arné Clevert, Kristof T. Schütt

Abstract: The generation of ligands that both are tailored to a given protein pocket and exhibit a range of desired chemical properties is a major challenge in structure-based drug design. Here, we propose an in-silico approach for the $\textit{de novo}$ generation of 3D ligand structures using the equivariant diffusion model PILOT, combining pocket conditioning with a large-scale pre-training and property… ▽ More The generation of ligands that both are tailored to a given protein pocket and exhibit a range of desired chemical properties is a major challenge in structure-based drug design. Here, we propose an in-silico approach for the $\textit{de novo}$ generation of 3D ligand structures using the equivariant diffusion model PILOT, combining pocket conditioning with a large-scale pre-training and property guidance. Its multi-objective trajectory-based importance sampling strategy is designed to direct the model towards molecules that not only exhibit desired characteristics such as increased binding affinity for a given protein pocket but also maintains high synthetic accessibility. This ensures the practicality of sampled molecules, thus maximizing their potential for the drug discovery pipeline. PILOT significantly outperforms existing methods across various metrics on the common benchmark dataset CrossDocked2020. Moreover, we employ PILOT to generate novel ligands for unseen protein pockets from the Kinodata-3D dataset, which encompasses a substantial portion of the human kinome. The generated structures exhibit predicted $IC_{50}$ values indicative of potent biological activity, which highlights the potential of PILOT as a powerful tool for structure-based drug design. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2401.13040 [pdf, other]

A new "temperature inversion" estimator to detect CMB patchy screening by large-scale structure

Authors: Theo Schutt, Abhishek S. Maniyar, Emmanuel Schaan, William R. Coulton, Nishant Mishra

Abstract: Thomson scattering of cosmic microwave background (CMB) photons imprints various properties of the baryons around galaxies on the CMB. One such imprint, called patchy screening, is a direct probe of the gas density profile around galaxies. It usefully complements the information from the kinematic and thermal Sunyaev-Zel'dovich effects and does not require individual redshifts. In this paper, we d… ▽ More Thomson scattering of cosmic microwave background (CMB) photons imprints various properties of the baryons around galaxies on the CMB. One such imprint, called patchy screening, is a direct probe of the gas density profile around galaxies. It usefully complements the information from the kinematic and thermal Sunyaev-Zel'dovich effects and does not require individual redshifts. In this paper, we derive new estimators of patchy screening called the "temperature inversion" (TI) and "signed" estimators, analogous to the gradient inversion estimator of CMB lensing. Pedagogically, we clarify the relation between these estimators and the standard patchy screening quadratic estimator (QE). The new estimators trade optimality for robustness to biases caused by the dominant CMB lensing and foreground contaminants, allowing the use of smaller angular scales. We perform a simulated analysis to realistically forecast the expected precision of patchy screening measurements from four CMB experiments, ACT, SPT, Simons Observatory (SO) and CMB-S4, cross-correlated with three galaxy samples from BOSS, unWISE and the simulated Rubin LSST Data Challenge 2 catalog. Our results give further confidence in the first detection of this effect from the ACT$\times$unWISE data in the companion paper and show patchy screening will be a powerful observable for future surveys like SO, CMB-S4 and LSST. Implementations of the patchy screening QE and the TI and signed estimators are publicly available in our LensQuEst and ThumbStack software packages, available at https://github.com/EmmanuelSchaan/LensQuEst and https://github.com/EmmanuelSchaan/ThumbStack , respectively. △ Less

Submitted 23 January, 2024; originally announced January 2024.

Comments: 20 pages, 7 figures, submitted to PRD

arXiv:2401.13033 [pdf, other]

The Atacama Cosmology Telescope: Detection of Patchy Screening of the Cosmic Microwave Background

Authors: William R. Coulton, Theo Schutt, Abhishek S. Maniyar, Emmanuel Schaan, Rui An, Zachary Atkins, Nicholas Battaglia, J Richard Bond, Erminia Calabrese, Steve K. Choi, Mark J. Devlin, Adriaan J. Duivenvoorden, Jo Dunkley, Simone Ferraro, Vera Gluscevic, J. Colin Hill, Matt Hilton, Adam D. Hincks, Arthur Kosowsky, Darby Kramer, Aleksandra Kusiak, Adrien La Posta, Thibaut Louis, Mathew S. Madhavacheril, Gabriela A. Marques , et al. (15 additional authors not shown)

Abstract: Spatial variations in the cosmic electron density after reionization generate cosmic microwave background anisotropies via Thomson scattering, a process known as the ``patchy screening" effect. In this paper, we propose a new estimator for the patchy screening effect that is designed to mitigate biases from the dominant foreground signals. We use it to measure the cross-correlation between \textit… ▽ More Spatial variations in the cosmic electron density after reionization generate cosmic microwave background anisotropies via Thomson scattering, a process known as the ``patchy screening" effect. In this paper, we propose a new estimator for the patchy screening effect that is designed to mitigate biases from the dominant foreground signals. We use it to measure the cross-correlation between \textit{unWISE} galaxies and patchy screening, the latter measured by the Atacama Cosmology Telescope and \textit{Planck} satellite. We report the first detection of the patchy screening effect, with the statistical significance of the cross-correlation exceeding $7σ$. This measurement directly probes the distribution of electrons around these galaxies and provides strong evidence that gas is more extended than the underlying dark matter. By comparing our measurements to electron profiles extracted from simulations, we demonstrate the power of these observations to constrain galaxy evolution models. Requiring only the 2D positions of objects and no individual redshifts or velocity estimates, this approach is complementary to existing gas probes, such as those based on the kinetic Sunyaev-Zeldovich effect. △ Less

Submitted 23 January, 2024; originally announced January 2024.

Comments: See Schutt et al for a detailed comparison of patchy screening estimators. 17 pages with 8 figures

arXiv:2308.00919 [pdf, other]

Photometry, Centroid and Point-Spread Function Measurements in the LSST Camera Focal Plane Using Artificial Stars

Authors: Johnny H. Esteves, Yousuke Utsumi, Adam Snyder, Theo Schutt, Alex Broughton, Bahrudin Trbalic, Sidney Mau, Andrew Rasmussen, Andrés A. Plazas Malagón, Andrew Bradshaw, Stuart Marshall, Seth Digel, James Chiang, Marcelle Soares-Santos, Aaron Roodman

Abstract: The Vera C. Rubin Observatory's LSST Camera (LSSTCam) pixel response has been characterized using laboratory measurements with a grid of artificial stars. We quantify the contributions to photometry, centroid, point-spread function size, and shape measurement errors due to small anomalies in the LSSTCam CCDs. The main sources of those anomalies are quantum efficiency variations and pixel area vari… ▽ More The Vera C. Rubin Observatory's LSST Camera (LSSTCam) pixel response has been characterized using laboratory measurements with a grid of artificial stars. We quantify the contributions to photometry, centroid, point-spread function size, and shape measurement errors due to small anomalies in the LSSTCam CCDs. The main sources of those anomalies are quantum efficiency variations and pixel area variations induced by the amplifier segmentation boundaries and "tree-rings" - circular variations in silicon do** concentration. This laboratory study using artificial stars projected on the sensors shows overall small effects. The residual effects on point-spread function (PSF) size and shape are below $0.1\%$, meeting the ten-year LSST survey science requirements. However, the CCD mid-line presents distortions that can have a moderate impact on PSF measurements. This feature can be avoided by masking the affected regions. Effects of tree-rings are observed on centroids and PSFs of the artificial stars and the nature of the effect is confirmed by a study of the flat-field response. Nevertheless, further studies of the full-focal plane with stellar data should more completely probe variations and might reveal new features, e.g. wavelength-dependent effects. The results of this study can be used as a guide for the on-sky operation of LSSTCam. △ Less

Submitted 3 November, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: accepted for publication, PASP

arXiv:2212.05517 [pdf, other]

doi 10.1063/5.0138367

SchNetPack 2.0: A neural network toolbox for atomistic machine learning

Authors: Kristof T. Schütt, Stefaan S. P. Hessmann, Niklas W. A. Gebauer, Jonas Lederer, Michael Gastegger

Abstract: SchNetPack is a versatile neural networks toolbox that addresses both the requirements of method development and application of atomistic machine learning. Version 2.0 comes with an improved data pipeline, modules for equivariant neural networks as well as a PyTorch implementation of molecular dynamics. An optional integration with PyTorch Lightning and the Hydra configuration framework powers a f… ▽ More SchNetPack is a versatile neural networks toolbox that addresses both the requirements of method development and application of atomistic machine learning. Version 2.0 comes with an improved data pipeline, modules for equivariant neural networks as well as a PyTorch implementation of molecular dynamics. An optional integration with PyTorch Lightning and the Hydra configuration framework powers a flexible command-line interface. This makes SchNetPack 2.0 easily extendable with custom code and ready for complex training task such as generation of 3d molecular structures. △ Less

Submitted 11 December, 2022; originally announced December 2022.

arXiv:2204.09812 [pdf, ps, other]

Comment on "On the recurrence times of neutron star X-ray binary transients and the nature of the Galactic Center quiescent X-ray binaries"

Authors: Kaya Mori, Shifra Mandel, Charles J. Hailey, Theo Y. E. Schutt, Keri Heuer, Jonathan E. Grindlay, Jaesub Hong, John A. Tomsick

Abstract: In 2018, we reported our discovery of a dozen quiescent X-ray binaries in the central parsec (pc) of the Galaxy (Hailey et al. 2018). In a recent follow-up paper (Mori et al. 2021), we published an extended analysis of these sources and other X-ray binaries (XRBs) in the central pc and beyond, showing that most if not all of the 12 non-thermal sources are likely black hole low-mass X-ray binary (B… ▽ More In 2018, we reported our discovery of a dozen quiescent X-ray binaries in the central parsec (pc) of the Galaxy (Hailey et al. 2018). In a recent follow-up paper (Mori et al. 2021), we published an extended analysis of these sources and other X-ray binaries (XRBs) in the central pc and beyond, showing that most if not all of the 12 non-thermal sources are likely black hole low-mass X-ray binary (BH-LMXB) candidates. In response, Maccarone et al. 2022 (TM22 hereafter) argued, primarily on the claim that neutron star low-mass X-ray binaries (NS-LMXBs) often do not have short outburst recurrence times (<~ 10 yr), that they cannot be excluded as a designation for the 12 quiescent X-ray binary sources. TM22 cites three main factors in their study: (1) X-ray outburst data of NS transients detected by RXTE and MAXI, (2) the Galactic population of NS-LMXBs, and (3) (persistently) quiescent NS-LMXBs in globular clusters. We address these arguments of TM22 and correct their misunderstandings of our work and the literature, even though most of these points have already been thoroughly addressed by Mori et al. 2021. We also correct TM22's assertion that our arguments are based solely on NS transients' recurrence times. △ Less

Submitted 20 April, 2022; originally announced April 2022.

Comments: 11 pages, 2 tables. Comments are welcome and should be sent to the corresponding author (K. Mori)

arXiv:2203.16205 [pdf, other]

Automatic Identification of Chemical Moieties

Authors: Jonas Lederer, Michael Gastegger, Kristof T. Schütt, Michael Kampffmeyer, Klaus-Robert Müller, Oliver T. Unke

Abstract: In recent years, the prediction of quantum mechanical observables with machine learning methods has become increasingly popular. Message-passing neural networks (MPNNs) solve this task by constructing atomic representations, from which the properties of interest are predicted. Here, we introduce a method to automatically identify chemical moieties (molecular building blocks) from such representati… ▽ More In recent years, the prediction of quantum mechanical observables with machine learning methods has become increasingly popular. Message-passing neural networks (MPNNs) solve this task by constructing atomic representations, from which the properties of interest are predicted. Here, we introduce a method to automatically identify chemical moieties (molecular building blocks) from such representations, enabling a variety of applications beyond property prediction, which otherwise rely on expert knowledge. The required representation can either be provided by a pretrained MPNN, or learned from scratch using only structural information. Beyond the data-driven design of molecular fingerprints, the versatility of our approach is demonstrated by enabling the selection of representative entries in chemical databases, the automatic construction of coarse-grained force fields, as well as the identification of reaction coordinates. △ Less

Submitted 27 April, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

arXiv:2109.04824 [pdf, other]

doi 10.1038/s41467-022-28526-y

Inverse design of 3d molecular structures with conditional generative neural networks

Authors: Niklas W. A. Gebauer, Michael Gastegger, Stefaan S. P. Hessmann, Klaus-Robert Müller, Kristof T. Schütt

Abstract: The rational design of molecules with desired properties is a long-standing challenge in chemistry. Generative neural networks have emerged as a powerful approach to sample novel molecules from a learned distribution. Here, we propose a conditional generative neural network for 3d molecular structures with specified chemical and structural properties. This approach is agnostic to chemical bonding… ▽ More The rational design of molecules with desired properties is a long-standing challenge in chemistry. Generative neural networks have emerged as a powerful approach to sample novel molecules from a learned distribution. Here, we propose a conditional generative neural network for 3d molecular structures with specified chemical and structural properties. This approach is agnostic to chemical bonding and enables targeted sampling of novel molecules from conditional distributions, even in domains where reference calculations are sparse. We demonstrate the utility of our method for inverse design by generating molecules with specified motifs or composition, discovering particularly stable molecules, and jointly targeting multiple electronic properties beyond the training regime. △ Less

Submitted 22 December, 2021; v1 submitted 10 September, 2021; originally announced September 2021.

Journal ref: Nature Communications 13, 973 (2022)

arXiv:2108.07312 [pdf, other]

doi 10.3847/1538-4357/ac1da5

The X-ray binary population in the Galactic Center revealed through multi-decade observations

Authors: Kaya Mori, Charles J. Hailey, Theo Y. E. Schutt, Shifra Mandel, Keri Heuer, Jonathan E. Grindlay, Jaesub Hong, Gabriele Ponti, John A. Tomsick

Abstract: We present an investigation of the quiescent and transient X-ray binaries (XRBs) of the Galactic Center (GC). We extended our Chandra analysis of the non-thermal X-ray sources, located in the central parsec, from Hailey et al. (2018), using an additional 4.6 Msec of ACIS-S data obtained in 2012-2018. The individual Chandra spectra of the 12 sources fit to an absorbed power-law model with a mean ph… ▽ More We present an investigation of the quiescent and transient X-ray binaries (XRBs) of the Galactic Center (GC). We extended our Chandra analysis of the non-thermal X-ray sources, located in the central parsec, from Hailey et al. (2018), using an additional 4.6 Msec of ACIS-S data obtained in 2012-2018. The individual Chandra spectra of the 12 sources fit to an absorbed power-law model with a mean photon index $Γ$~2 and show no Fe emission lines. Long-term variability was detected from nine of them, confirming that a majority are quiescent XRBs. Frequent X-ray monitoring of the GC revealed that the 12 non-thermal X-ray sources, as well as four X-ray transients have shown at most a single outburst over the last two decades. They are distinct from the six known neutron star LMXBs in the GC, which have all undergone multiple outbursts with <~ 5 year recurrence time on average. Based on the outburst history data of the broader population of X-ray transients, we conclude that the 16 sources represent a population of ~240-630 tightly-bound BH-LMXBs with ~4-12 hour orbital periods, consistent with the stellar/binary dynamics modelling in the vicinity of Sgr A*. The distribution of the 16 BH-LMXB candidates is disk-like (at 87% CL) and aligned with the nuclear star cluster. Our results have implications for XRB formation and the rate of gravitational wave events in other galactic nuclei. △ Less

Submitted 19 October, 2021; v1 submitted 16 August, 2021; originally announced August 2021.

Comments: Typos are fixed and new references are added

arXiv:2105.00304 [pdf, other]

doi 10.1038/s41467-021-27504-0

SpookyNet: Learning Force Fields with Electronic Degrees of Freedom and Nonlocal Effects

Authors: Oliver T. Unke, Stefan Chmiela, Michael Gastegger, Kristof T. Schütt, Huziel E. Sauceda, Klaus-Robert Müller

Abstract: Machine-learned force fields (ML-FFs) combine the accuracy of ab initio methods with the efficiency of conventional force fields. However, current ML-FFs typically ignore electronic degrees of freedom, such as the total charge or spin state, and assume chemical locality, which is problematic when molecules have inconsistent electronic states, or when nonlocal effects play a significant role. This… ▽ More Machine-learned force fields (ML-FFs) combine the accuracy of ab initio methods with the efficiency of conventional force fields. However, current ML-FFs typically ignore electronic degrees of freedom, such as the total charge or spin state, and assume chemical locality, which is problematic when molecules have inconsistent electronic states, or when nonlocal effects play a significant role. This work introduces SpookyNet, a deep neural network for constructing ML-FFs with explicit treatment of electronic degrees of freedom and quantum nonlocality. Chemically meaningful inductive biases and analytical corrections built into the network architecture allow it to properly model physical limits. SpookyNet improves upon the current state-of-the-art (or achieves similar performance) on popular quantum chemistry data sets. Notably, it is able to generalize across chemical and conformational space and can leverage the learned chemical insights, e.g. by predicting unknown spin states, thus hel** to close a further important remaining gap for today's machine learning models in quantum chemistry. △ Less

Submitted 20 July, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

arXiv:2102.08435 [pdf, other]

doi 10.1063/5.0047760

Perspective on integrating machine learning into computational chemistry and materials science

Authors: Julia Westermayr, Michael Gastegger, Kristof T. Schütt, Reinhard J. Maurer

Abstract: Machine learning (ML) methods are being used in almost every conceivable area of electronic structure theory and molecular simulation. In particular, ML has become firmly established in the construction of high-dimensional interatomic potentials. Not a day goes by without another proof of principle being published on how ML methods can represent and predict quantum mechanical properties - be they… ▽ More Machine learning (ML) methods are being used in almost every conceivable area of electronic structure theory and molecular simulation. In particular, ML has become firmly established in the construction of high-dimensional interatomic potentials. Not a day goes by without another proof of principle being published on how ML methods can represent and predict quantum mechanical properties - be they observable, such as molecular polarizabilities, or not, such as atomic charges. As ML is becoming pervasive in electronic structure theory and molecular simulation, we provide an overview of how atomistic computational modeling is being transformed by the incorporation of ML approaches. From the perspective of the practitioner in the field, we assess how common workflows to predict structure, dynamics, and spectroscopy are affected by ML. Finally, we discuss how a tighter and lasting integration of ML methods with computational chemistry and materials science can be achieved and what it will mean for research practice, software development, and postgraduate training. △ Less

Submitted 21 June, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

Comments: 22 pages, 5 figures

arXiv:2102.03150 [pdf, other]

Equivariant message passing for the prediction of tensorial properties and molecular spectra

Authors: Kristof T. Schütt, Oliver T. Unke, Michael Gastegger

Abstract: Message passing neural networks have become a method of choice for learning on graphs, in particular the prediction of chemical properties and the acceleration of molecular dynamics studies. While they readily scale to large training data sets, previous approaches have proven to be less data efficient than kernel methods. We identify limitations of invariant representations as a major reason and e… ▽ More Message passing neural networks have become a method of choice for learning on graphs, in particular the prediction of chemical properties and the acceleration of molecular dynamics studies. While they readily scale to large training data sets, previous approaches have proven to be less data efficient than kernel methods. We identify limitations of invariant representations as a major reason and extend the message passing formulation to rotationally equivariant representations. On this basis, we propose the polarizable atom interaction neural network (PaiNN) and improve on common molecule benchmarks over previous networks, while reducing model size and inference time. We leverage the equivariant atomwise representations obtained by PaiNN for the prediction of tensorial properties. Finally, we apply this to the simulation of molecular spectra, achieving speedups of 4-5 orders of magnitude compared to the electronic structure reference. △ Less

Submitted 7 June, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

Comments: Accepted at ICML 2021

arXiv:2010.14942 [pdf, other]

Machine learning of solvent effects on molecular spectra and reactions

Authors: Michael Gastegger, Kristof T. Schütt, Klaus-Robert Müller

Abstract: Fast and accurate simulation of complex chemical systems in environments such as solutions is a long standing challenge in theoretical chemistry. In recent years, machine learning has extended the boundaries of quantum chemistry by providing highly accurate and efficient surrogate models of electronic structure theory, which previously have been out of reach for conventional approaches. Those mode… ▽ More Fast and accurate simulation of complex chemical systems in environments such as solutions is a long standing challenge in theoretical chemistry. In recent years, machine learning has extended the boundaries of quantum chemistry by providing highly accurate and efficient surrogate models of electronic structure theory, which previously have been out of reach for conventional approaches. Those models have long been restricted to closed molecular systems without accounting for environmental influences, such as external electric and magnetic fields or solvent effects. Here, we introduce the deep neural network FieldSchNet for modeling the interaction of molecules with arbitrary external fields. FieldSchNet offers access to a wealth of molecular response properties, enabling it to simulate a wide range of molecular spectra, such as infrared, Raman and nuclear magnetic resonance. Beyond that, it is able to describe implicit and explicit molecular environments, operating as a polarizable continuum model for solvation or in a quantum mechanics / molecular mechanics setup. We employ FieldSchNet to study the influence of solvent effects on molecular spectra and a Claisen rearrangement reaction. Based on these results, we use FieldSchNet to design an external environment capable of lowering the activation barrier of the rearrangement reaction significantly, demonstrating promising venues for inverse chemical design. △ Less

Submitted 4 November, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

Comments: 16 pages, 5 figures

arXiv:2010.07067 [pdf, other]

doi 10.1021/acs.chemrev.0c01111

Machine Learning Force Fields

Authors: Oliver T. Unke, Stefan Chmiela, Huziel E. Sauceda, Michael Gastegger, Igor Poltavsky, Kristof T. Schütt, Alexandre Tkatchenko, Klaus-Robert Müller

Abstract: In recent years, the use of Machine Learning (ML) in computational chemistry has enabled numerous advances previously out of reach due to the computational complexity of traditional electronic-structure methods. One of the most promising applications is the construction of ML-based force fields (FFs), with the aim to narrow the gap between the accuracy of ab initio methods and the efficiency of cl… ▽ More In recent years, the use of Machine Learning (ML) in computational chemistry has enabled numerous advances previously out of reach due to the computational complexity of traditional electronic-structure methods. One of the most promising applications is the construction of ML-based force fields (FFs), with the aim to narrow the gap between the accuracy of ab initio methods and the efficiency of classical FFs. The key idea is to learn the statistical relation between chemical structure and potential energy without relying on a preconceived notion of fixed chemical bonds or knowledge about the relevant interactions. Such universal ML approximations are in principle only limited by the quality and quantity of the reference data used to train them. This review gives an overview of applications of ML-FFs and the chemical insights that can be obtained from them. The core concepts underlying ML-FFs are described in detail and a step-by-step guide for constructing and testing them from scratch is given. The text concludes with a discussion of the challenges that remain to be overcome by the next generation of ML-FFs. △ Less

Submitted 12 January, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

Journal ref: Chem. Rev. 2021, 121, 16, 10142-10186

arXiv:2006.16284 [pdf, other]

Transactions on Red-black and AVL trees in NVRAM

Authors: Thorsten Schütt, Florian Schintke, Jan Skrzypczak

Abstract: Byte-addressable non-volatile memory (NVRAM) supports persistent storage with low latency and high bandwidth. Complex data structures in it ought to be updated transactionally, so that they remain recoverable at all times. Traditional database technologies such as kee** a separate log, a journal, or shadow data work on a coarse-grained level, where the whole transaction is made visible using a f… ▽ More Byte-addressable non-volatile memory (NVRAM) supports persistent storage with low latency and high bandwidth. Complex data structures in it ought to be updated transactionally, so that they remain recoverable at all times. Traditional database technologies such as kee** a separate log, a journal, or shadow data work on a coarse-grained level, where the whole transaction is made visible using a final atomic update operation. These methods typically need significant additional space overhead and induce non-trivial overhead for log pruning, state maintenance, and resource (de-)allocation. Thus, they are not necessarily the best choice for NVRAM, which supports fine-grained, byte-addressable access. We present a generic transaction mechanism to update dynamic complex data structures `in-place' with a constant memory overhead. It is independent of the size of the data structure. We demonstrate and evaluate our approach on Red-Black Trees and AVL Trees with a redo log of constant size (4 resp. 2 cache lines). The redo log guarantees that each accepted (started) transaction is executed eventually despite arbitrary many system crashes and recoveries in the meantime. We update complex data structures in local and remote NVRAM providing exactly once semantics and durable linearizability for multi-reader single-writer access. To persist data, we use the available processor instructions for NVRAM in the local case and remote direct memory access (RDMA) combined with a software agent in the remote case. △ Less

Submitted 29 June, 2020; originally announced June 2020.

arXiv:2006.03589 [pdf, other]

doi 10.1109/TPAMI.2021.3115452

Higher-Order Explanations of Graph Neural Networks via Relevant Walks

Authors: Thomas Schnake, Oliver Eberle, Jonas Lederer, Shinichi Nakajima, Kristof T. Schütt, Klaus-Robert Müller, Grégoire Montavon

Abstract: Graph Neural Networks (GNNs) are a popular approach for predicting graph structured data. As GNNs tightly entangle the input graph into the neural network structure, common explainable AI approaches are not applicable. To a large extent, GNNs have remained black-boxes for the user so far. In this paper, we show that GNNs can in fact be naturally explained using higher-order expansions, i.e. by ide… ▽ More Graph Neural Networks (GNNs) are a popular approach for predicting graph structured data. As GNNs tightly entangle the input graph into the neural network structure, common explainable AI approaches are not applicable. To a large extent, GNNs have remained black-boxes for the user so far. In this paper, we show that GNNs can in fact be naturally explained using higher-order expansions, i.e. by identifying groups of edges that jointly contribute to the prediction. Practically, we find that such explanations can be extracted using a nested attribution scheme, where existing techniques such as layer-wise relevance propagation (LRP) can be applied at each step. The output is a collection of walks into the input graph that are relevant for the prediction. Our novel explanation method, which we denote by GNN-LRP, is applicable to a broad range of graph neural networks and lets us extract practically relevant insights on sentiment analysis of text data, structure-property relationships in quantum chemistry, and image classification. △ Less

Submitted 26 November, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

Comments: 14 pages + 6 pages supplement

arXiv:2005.06979 [pdf, other]

doi 10.1063/5.0012911

A deep neural network for molecular wave functions in quasi-atomic minimal basis representation

Authors: M. Gastegger, A. McSloy, M. Luya, K. T. Schütt, R. J. Maurer

Abstract: The emergence of machine learning methods in quantum chemistry provides new methods to revisit an old problem: Can the predictive accuracy of electronic structure calculations be decoupled from their numerical bottlenecks? Previous attempts to answer this question have, among other methods, given rise to semi-empirical quantum chemistry in minimal basis representation. We present an adaptation of… ▽ More The emergence of machine learning methods in quantum chemistry provides new methods to revisit an old problem: Can the predictive accuracy of electronic structure calculations be decoupled from their numerical bottlenecks? Previous attempts to answer this question have, among other methods, given rise to semi-empirical quantum chemistry in minimal basis representation. We present an adaptation of the recently proposed SchNet for Orbitals (SchNOrb) deep convolutional neural network model [Nature Commun. 10, 5024 (2019)] for electronic wave functions in an optimised quasi-atomic minimal basis representation. For five organic molecules ranging from 5 to 13 heavy atoms, the model accurately predicts molecular orbital energies and wavefunctions and provides access to derived properties for chemical bonding analysis. Particularly for larger molecules, the model outperforms the original atomic-orbital-based SchNOrb method in terms of accuracy and scaling. We conclude by discussing the future potential of this approach in quantum chemical workflows. △ Less

Submitted 6 July, 2020; v1 submitted 11 May, 2020; originally announced May 2020.

Comments: 15 pages, 9 figures

arXiv:2002.11952 [pdf, other]

doi 10.1126/sciadv.abb6987

Autonomous robotic nanofabrication with reinforcement learning

Authors: Philipp Leinen, Malte Esders, Kristof T. Schütt, Christian Wagner, Klaus-Robert Müller, F. Stefan Tautz

Abstract: The ability to handle single molecules as effectively as macroscopic building-blocks would enable the construction of complex supramolecular structures inaccessible to self-assembly. The fundamental challenges obstructing this goal are the uncontrolled variability and poor observability of atomic-scale conformations. Here, we present a strategy to work around both obstacles, and demonstrate autono… ▽ More The ability to handle single molecules as effectively as macroscopic building-blocks would enable the construction of complex supramolecular structures inaccessible to self-assembly. The fundamental challenges obstructing this goal are the uncontrolled variability and poor observability of atomic-scale conformations. Here, we present a strategy to work around both obstacles, and demonstrate autonomous robotic nanofabrication by manipulating single molecules. Our approach employs reinforcement learning (RL), which finds solution strategies even in the face of large uncertainty and sparse feedback. We demonstrate the potential of our RL approach by removing molecules autonomously with a scanning probe microscope from a supramolecular structure -- an exemplary task of subtractive manufacturing at the nanoscale. Our RL agent reaches an excellent performance, enabling us to automate a task which previously had to be performed by a human. We anticipate that our work opens the way towards autonomous agents for the robotic construction of functional supramolecular structures with speed, precision and perseverance beyond our current capabilities. △ Less

Submitted 1 October, 2020; v1 submitted 27 February, 2020; originally announced February 2020.

Comments: 3 figures

Journal ref: Sci. Adv. 6, eabb6987 (2020)

arXiv:2001.03362 [pdf, other]

doi 10.1109/TPDS.2020.2981891

RMWPaxos: Fault-Tolerant In-Place Consensus Sequences

Authors: Jan Skrzypczak, Florian Schintke, Thorsten Schütt

Abstract: Building consensus sequences based on distributed, fault-tolerant consensus, as used for replicated state machines, typically requires a separate distributed state for every new consensus instance. Allocating and maintaining this state causes significant overhead. In particular, freeing the distributed, outdated states in a fault-tolerant way is not trivial and adds further complexity and cost to… ▽ More Building consensus sequences based on distributed, fault-tolerant consensus, as used for replicated state machines, typically requires a separate distributed state for every new consensus instance. Allocating and maintaining this state causes significant overhead. In particular, freeing the distributed, outdated states in a fault-tolerant way is not trivial and adds further complexity and cost to the system. In this paper, we propose an extension to the single-decree Paxos protocol that can learn a sequence of consensus decisions 'in-place', i.e. with a single set of distributed states. Our protocol does not require dynamic log structures and hence has no need for distributed log pruning, snapshotting, compaction, or dynamic resource allocation. The protocol builds a fault-tolerant atomic register that supports arbitrary read-modify-write operations. We use the concept of consistent quorums to detect whether the previous consensus still needs to be consolidated or is already finished so that the next consensus value can be safely proposed. Reading a consolidated consensus is done without state modifications and is thereby free of concurrency control and demand for serialisation. A proposer that is not interrupted reaches agreement on consecutive consensus decisions within a single message round-trip per decision by preparing the acceptors eagerly with the previous request. △ Less

Submitted 1 April, 2020; v1 submitted 10 January, 2020; originally announced January 2020.

arXiv:1906.10033 [pdf, other]

Unifying machine learning and quantum chemistry -- a deep neural network for molecular wavefunctions

Authors: K. T. Schütt, M. Gastegger, A. Tkatchenko, K. -R. Müller, R. J. Maurer

Abstract: Machine learning advances chemistry and materials science by enabling large-scale exploration of chemical space based on quantum chemical calculations. While these models supply fast and accurate predictions of atomistic chemical properties, they do not explicitly capture the electronic degrees of freedom of a molecule, which limits their applicability for reactive chemistry and chemical analysis.… ▽ More Machine learning advances chemistry and materials science by enabling large-scale exploration of chemical space based on quantum chemical calculations. While these models supply fast and accurate predictions of atomistic chemical properties, they do not explicitly capture the electronic degrees of freedom of a molecule, which limits their applicability for reactive chemistry and chemical analysis. Here we present a deep learning framework for the prediction of the quantum mechanical wavefunction in a local basis of atomic orbitals from which all other ground-state properties can be derived. This approach retains full access to the electronic structure via the wavefunction at force field-like efficiency and captures quantum mechanics in an analytically differentiable representation. On several examples, we demonstrate that this opens promising avenues to perform inverse design of molecular structures for target electronic property optimisation and a clear path towards increased synergy of machine learning and quantum chemistry. △ Less

Submitted 24 June, 2019; originally announced June 2019.

arXiv:1906.00957 [pdf, other]

Symmetry-adapted generation of 3d point sets for the targeted discovery of molecules

Authors: Niklas W. A. Gebauer, Michael Gastegger, Kristof T. Schütt

Abstract: Deep learning has proven to yield fast and accurate predictions of quantum-chemical properties to accelerate the discovery of novel molecules and materials. As an exhaustive exploration of the vast chemical space is still infeasible, we require generative models that guide our search towards systems with desired properties. While graph-based models have previously been proposed, they are restricte… ▽ More Deep learning has proven to yield fast and accurate predictions of quantum-chemical properties to accelerate the discovery of novel molecules and materials. As an exhaustive exploration of the vast chemical space is still infeasible, we require generative models that guide our search towards systems with desired properties. While graph-based models have previously been proposed, they are restricted by a lack of spatial information such that they are unable to recognize spatial isomerism and non-bonded interactions. Here, we introduce a generative neural network for 3d point sets that respects the rotational invariance of the targeted structures. We apply it to the generation of molecules and demonstrate its ability to approximate the distribution of equilibrium structures using spatial metrics as well as established measures from chemoinformatics. As our model is able to capture the complex relationship between 3d geometry and electronic properties, we bias the distribution of the generator towards molecules with a small HOMO-LUMO gap - an important property for the design of organic solar cells. △ Less

Submitted 9 January, 2020; v1 submitted 2 June, 2019; originally announced June 2019.

arXiv:1905.08733 [pdf, other]

Linearizable State Machine Replication of State-Based CRDTs without Logs

Authors: Jan Skrzypczak, Florian Schintke, Thorsten Schütt

Abstract: General solutions of state machine replication have to ensure that all replicas apply the same commands in the same order, even in the presence of failures. Such strict ordering incurs high synchronization costs caused by distributed consensus or by the use of a leader. This paper presents a protocol for linearizable state machine replication of conflict-free replicated data types (CRDTs) that n… ▽ More General solutions of state machine replication have to ensure that all replicas apply the same commands in the same order, even in the presence of failures. Such strict ordering incurs high synchronization costs caused by distributed consensus or by the use of a leader. This paper presents a protocol for linearizable state machine replication of conflict-free replicated data types (CRDTs) that neither requires consensus nor a leader. By leveraging the properties of state-based CRDTs - in particular, the monotonic growth of a join semilattice - synchronization overhead is greatly reduced. As a result, updates only need a single round trip and modify the state 'in-place' without the need for a log. Furthermore, the message size overhead for coordination consists of a single counter per message. For queries, we guarantee finite writes termination. We show in an experimental evaluation that more than 99 % of queries can be handled in one to three round trips under highly concurrent accesses. Our protocol achieves high throughput without auxiliary processes such as command log management or leader election. Thus, it is well suited for practical scenarios that need linearizable access to CRDT data on a fine-granular scale. △ Less

Submitted 24 July, 2020; v1 submitted 21 May, 2019; originally announced May 2019.

arXiv:1812.04690 [pdf, other]

Learning representations of molecules and materials with atomistic neural networks

Authors: Kristof T. Schütt, Alexandre Tkatchenko, Klaus-Robert Müller

Abstract: Deep Learning has been shown to learn efficient representations for structured data such as image, text or audio. In this chapter, we present neural network architectures that are able to learn efficient representations of molecules and materials. In particular, the continuous-filter convolutional network SchNet accurately predicts chemical properties across compositional and configurational space… ▽ More Deep Learning has been shown to learn efficient representations for structured data such as image, text or audio. In this chapter, we present neural network architectures that are able to learn efficient representations of molecules and materials. In particular, the continuous-filter convolutional network SchNet accurately predicts chemical properties across compositional and configurational space on a variety of datasets. Beyond that, we analyze the obtained representations to find evidence that their spatial and chemical properties agree with chemical intuition. △ Less

Submitted 11 December, 2018; originally announced December 2018.

arXiv:1810.11347 [pdf, other]

Generating equilibrium molecules with deep neural networks

Authors: Niklas W. A. Gebauer, Michael Gastegger, Kristof T. Schütt

Abstract: Discovery of atomistic systems with desirable properties is a major challenge in chemistry and material science. Here we introduce a novel, autoregressive, convolutional deep neural network architecture that generates molecular equilibrium structures by sequentially placing atoms in three-dimensional space. The model estimates the joint probability over molecular configurations with tractable cond… ▽ More Discovery of atomistic systems with desirable properties is a major challenge in chemistry and material science. Here we introduce a novel, autoregressive, convolutional deep neural network architecture that generates molecular equilibrium structures by sequentially placing atoms in three-dimensional space. The model estimates the joint probability over molecular configurations with tractable conditional probabilities which only depend on distances between atoms and their nuclear charges. It combines concepts from state-of-the-art atomistic neural networks with auto-regressive generative models for images and speech. We demonstrate that the architecture is capable of generating molecules close to equilibrium for constitutional isomers of C$_7$O$_2$H$_{10}$. △ Less

Submitted 26 October, 2018; originally announced October 2018.

arXiv:1810.09751 [pdf, other]

Analysis of Atomistic Representations Using Weighted Skip-Connections

Authors: Kim A. Nicoli, Pan Kessel, Michael Gastegger, Kristof T. Schütt

Abstract: In this work, we extend the SchNet architecture by using weighted skip connections to assemble the final representation. This enables us to study the relative importance of each interaction block for property prediction. We demonstrate on both the QM9 and MD17 dataset that their relative weighting depends strongly on the chemical composition and configurational degrees of freedom of the molecules… ▽ More In this work, we extend the SchNet architecture by using weighted skip connections to assemble the final representation. This enables us to study the relative importance of each interaction block for property prediction. We demonstrate on both the QM9 and MD17 dataset that their relative weighting depends strongly on the chemical composition and configurational degrees of freedom of the molecules which opens the path towards a more detailed understanding of machine learning models for molecules. △ Less

Submitted 14 November, 2018; v1 submitted 23 October, 2018; originally announced October 2018.

Comments: NIPS 2018 Workshop: Machine Learning for Molecules and Materials

arXiv:1809.01072 [pdf, other]

doi 10.1021/acs.jctc.8b00908

SchNetPack: A Deep Learning Toolbox For Atomistic Systems

Authors: K. T. Schütt, P. Kessel, M. Gastegger, K. Nicoli, A. Tkatchenko, K. -R. Müller

Abstract: SchNetPack is a toolbox for the development and application of deep neural networks to the prediction of potential energy surfaces and other quantum-chemical properties of molecules and materials. It contains basic building blocks of atomistic neural networks, manages their training and provides simple access to common benchmark datasets. This allows for an easy implementation and evaluation of ne… ▽ More SchNetPack is a toolbox for the development and application of deep neural networks to the prediction of potential energy surfaces and other quantum-chemical properties of molecules and materials. It contains basic building blocks of atomistic neural networks, manages their training and provides simple access to common benchmark datasets. This allows for an easy implementation and evaluation of new models. For now, SchNetPack includes implementations of (weighted) atomcentered symmetry functions and the deep tensor neural network SchNet as well as ready-to-use scripts that allow to train these models on molecule and material datasets. Based upon the PyTorch deep learning framework, SchNetPack allows to efficiently apply the neural networks to large datasets with millions of reference calculations as well as parallelize the model across multiple GPUs. Finally, SchNetPack provides an interface to the Atomic Simulation Environment in order to make trained models easily accessible to researchers that are not yet familiar with neural networks. △ Less

Submitted 4 September, 2018; originally announced September 2018.

arXiv:1808.04260 [pdf, other]

iNNvestigate neural networks!

Authors: Maximilian Alber, Sebastian Lapuschkin, Philipp Seegerer, Miriam Hägele, Kristof T. Schütt, Grégoire Montavon, Wojciech Samek, Klaus-Robert Müller, Sven Dähne, Pieter-Jan Kindermans

Abstract: In recent years, deep neural networks have revolutionized many application domains of machine learning and are key components of many critical decision or predictive processes. Therefore, it is crucial that domain specialists can understand and analyze actions and pre- dictions, even of the most complex neural network architectures. Despite these arguments neural networks are often treated as blac… ▽ More In recent years, deep neural networks have revolutionized many application domains of machine learning and are key components of many critical decision or predictive processes. Therefore, it is crucial that domain specialists can understand and analyze actions and pre- dictions, even of the most complex neural network architectures. Despite these arguments neural networks are often treated as black boxes. In the attempt to alleviate this short- coming many analysis methods were proposed, yet the lack of reference implementations often makes a systematic comparison between the methods a major effort. The presented library iNNvestigate addresses this by providing a common interface and out-of-the- box implementation for many analysis methods, including the reference implementation for PatternNet and PatternAttribution as well as for LRP-methods. To demonstrate the versatility of iNNvestigate, we provide an analysis of image classifications for variety of state-of-the-art neural network architectures. △ Less

Submitted 13 August, 2018; originally announced August 2018.

arXiv:1806.10349 [pdf, other]

Quantum-chemical insights from interpretable atomistic neural networks

Authors: Kristof T. Schütt, Michael Gastegger, Alexandre Tkatchenko, Klaus-Robert Müller

Abstract: With the rise of deep neural networks for quantum chemistry applications, there is a pressing need for architectures that, beyond delivering accurate predictions of chemical properties, are readily interpretable by researchers. Here, we describe interpretation techniques for atomistic neural networks on the example of Behler-Parrinello networks as well as the end-to-end model SchNet. Both models o… ▽ More With the rise of deep neural networks for quantum chemistry applications, there is a pressing need for architectures that, beyond delivering accurate predictions of chemical properties, are readily interpretable by researchers. Here, we describe interpretation techniques for atomistic neural networks on the example of Behler-Parrinello networks as well as the end-to-end model SchNet. Both models obtain predictions of chemical properties by aggregating atom-wise contributions. These latent variables can serve as local explanations of a prediction and are obtained during training without additional cost. Due to their correspondence to well-known chemical concepts such as atomic energies and partial charges, these atom-wise explanations enable insights not only about the model but more importantly about the underlying quantum-chemical regularities. We generalize from atomistic explanations to 3d space, thus obtaining spatially resolved visualizations which further improve interpretability. Finally, we analyze learned embeddings of chemical elements that exhibit a partial ordering that resembles the order of the periodic table. As the examined neural networks show excellent agreement with chemical knowledge, the presented techniques open up new venues for data-driven research in chemistry, physics and materials science. △ Less

Submitted 27 June, 2018; originally announced June 2018.

arXiv:1712.06113 [pdf, other]

doi 10.1063/1.5019779

SchNet - a deep learning architecture for molecules and materials

Authors: Kristof T. Schütt, Huziel E. Sauceda, Pieter-Jan Kindermans, Alexandre Tkatchenko, Klaus-Robert Müller

Abstract: Deep learning has led to a paradigm shift in artificial intelligence, including web, text and image search, speech recognition, as well as bioinformatics, with growing impact in chemical physics. Machine learning in general and deep learning in particular is ideally suited for representing quantum-mechanical interactions, enabling to model nonlinear potential-energy surfaces or enhancing the explo… ▽ More Deep learning has led to a paradigm shift in artificial intelligence, including web, text and image search, speech recognition, as well as bioinformatics, with growing impact in chemical physics. Machine learning in general and deep learning in particular is ideally suited for representing quantum-mechanical interactions, enabling to model nonlinear potential-energy surfaces or enhancing the exploration of chemical compound space. Here we present the deep learning architecture SchNet that is specifically designed to model atomistic systems by making use of continuous-filter convolutional layers. We demonstrate the capabilities of SchNet by accurately predicting a range of properties across chemical space for \emph{molecules and materials} where our model learns chemically plausible embeddings of atom types across the periodic table. Finally, we employ SchNet to predict potential-energy surfaces and energy-conserving force fields for molecular dynamics simulations of small molecules and perform an exemplary study of the quantum-mechanical properties of C$_{20}$-fullerene that would have been infeasible with regular ab initio molecular dynamics. △ Less

Submitted 22 March, 2018; v1 submitted 17 December, 2017; originally announced December 2017.

arXiv:1711.00867 [pdf, other]

The (Un)reliability of saliency methods

Authors: Pieter-Jan Kindermans, Sara Hooker, Julius Adebayo, Maximilian Alber, Kristof T. Schütt, Sven Dähne, Dumitru Erhan, Been Kim

Abstract: Saliency methods aim to explain the predictions of deep neural networks. These methods lack reliability when the explanation is sensitive to factors that do not contribute to the model prediction. We use a simple and common pre-processing step ---adding a constant shift to the input data--- to show that a transformation with no effect on the model can cause numerous methods to incorrectly attribut… ▽ More Saliency methods aim to explain the predictions of deep neural networks. These methods lack reliability when the explanation is sensitive to factors that do not contribute to the model prediction. We use a simple and common pre-processing step ---adding a constant shift to the input data--- to show that a transformation with no effect on the model can cause numerous methods to incorrectly attribute. In order to guarantee reliability, we posit that methods should fulfill input invariance, the requirement that a saliency method mirror the sensitivity of the model with respect to transformations of the input. We show, through several examples, that saliency methods that do not satisfy input invariance result in misleading attribution. △ Less

Submitted 2 November, 2017; originally announced November 2017.

arXiv:1706.08566 [pdf, other]

SchNet: A continuous-filter convolutional neural network for modeling quantum interactions

Authors: Kristof T. Schütt, Pieter-Jan Kindermans, Huziel E. Sauceda, Stefan Chmiela, Alexandre Tkatchenko, Klaus-Robert Müller

Abstract: Deep learning has the potential to revolutionize quantum chemistry as it is ideally suited to learn representations for structured data and speed up the exploration of chemical space. While convolutional neural networks have proven to be the first choice for images, audio and video data, the atoms in molecules are not restricted to a grid. Instead, their precise locations contain essential physica… ▽ More Deep learning has the potential to revolutionize quantum chemistry as it is ideally suited to learn representations for structured data and speed up the exploration of chemical space. While convolutional neural networks have proven to be the first choice for images, audio and video data, the atoms in molecules are not restricted to a grid. Instead, their precise locations contain essential physical information, that would get lost if discretized. Thus, we propose to use continuous-filter convolutional layers to be able to model local correlations without requiring the data to lie on a grid. We apply those layers in SchNet: a novel deep learning architecture modeling quantum interactions in molecules. We obtain a joint model for the total energy and interatomic forces that follows fundamental quantum-chemical principles. This includes rotationally invariant energy predictions and a smooth, differentiable potential energy surface. Our architecture achieves state-of-the-art performance for benchmarks of equilibrium molecules and molecular dynamics trajectories. Finally, we introduce a more challenging benchmark with chemical and structural variations that suggests the path for further work. △ Less

Submitted 19 December, 2017; v1 submitted 26 June, 2017; originally announced June 2017.

Journal ref: Advances in Neural Information Processing Systems 30 (2017), pp. 992-1002

arXiv:1705.05598 [pdf, other]

Learning how to explain neural networks: PatternNet and PatternAttribution

Authors: Pieter-Jan Kindermans, Kristof T. Schütt, Maximilian Alber, Klaus-Robert Müller, Dumitru Erhan, Been Kim, Sven Dähne

Abstract: DeConvNet, Guided BackProp, LRP, were invented to better understand deep neural networks. We show that these methods do not produce the theoretically correct explanation for a linear model. Yet they are used on multi-layer networks with millions of parameters. This is a cause for concern since linear models are simple neural networks. We argue that explanation methods for neural nets should work r… ▽ More DeConvNet, Guided BackProp, LRP, were invented to better understand deep neural networks. We show that these methods do not produce the theoretically correct explanation for a linear model. Yet they are used on multi-layer networks with millions of parameters. This is a cause for concern since linear models are simple neural networks. We argue that explanation methods for neural nets should work reliably in the limit of simplicity, the linear models. Based on our analysis of linear models we propose a generalization that yields two explanation techniques (PatternNet and PatternAttribution) that are theoretically sound for linear models and produce improved explanations for deep networks. △ Less

Submitted 24 October, 2017; v1 submitted 16 May, 2017; originally announced May 2017.

arXiv:1611.04678 [pdf, other]

doi 10.1126/sciadv.1603015

Machine Learning of Accurate Energy-Conserving Molecular Force Fields

Authors: Stefan Chmiela, Alexandre Tkatchenko, Huziel E. Sauceda, Igor Poltavsky, Kristof T. Schütt, Klaus-Robert Müller

Abstract: Using conservation of energy - a fundamental property of closed classical and quantum mechanical systems - we develop an efficient gradient-domain machine learning (GDML) approach to construct accurate molecular force fields using a restricted number of samples from ab initio molecular dynamics (AIMD) trajectories. The GDML implementation is able to reproduce global potential energy surfaces of in… ▽ More Using conservation of energy - a fundamental property of closed classical and quantum mechanical systems - we develop an efficient gradient-domain machine learning (GDML) approach to construct accurate molecular force fields using a restricted number of samples from ab initio molecular dynamics (AIMD) trajectories. The GDML implementation is able to reproduce global potential energy surfaces of intermediate-sized molecules with an accuracy of 0.3 kcal $\text{mol}^{-1}$ for energies and 1 kcal $\text{mol}^{-1}$ $\textÅ^{-1}$ for atomic forces using only 1000 conformational geometries for training. We demonstrate this accuracy for AIMD trajectories of molecules, including benzene, toluene, naphthalene, ethanol, uracil, and aspirin. The challenge of constructing conservative force fields is accomplished in our work by learning in a Hilbert space of vector-valued functions that obey the law of energy conservation. The GDML approach enables quantitative molecular dynamics simulations for molecules at a fraction of cost of explicit AIMD calculations, thereby allowing the construction of efficient force fields with the accuracy and transferability of high-level ab initio methods. △ Less

Submitted 8 May, 2017; v1 submitted 14 November, 2016; originally announced November 2016.

Journal ref: Science Advances 3(5):e1603015 (2017)

arXiv:1609.08259 [pdf, other]

doi 10.1038/ncomms13890

Quantum-Chemical Insights from Deep Tensor Neural Networks

Authors: Kristof T. Schütt, Farhad Arbabzadah, Stefan Chmiela, Klaus R. Müller, Alexandre Tkatchenko

Abstract: Learning from data has led to paradigm shifts in a multitude of disciplines, including web, text, and image search, speech recognition, as well as bioinformatics. Can machine learning enable similar breakthroughs in understanding quantum many-body systems? Here we develop an efficient deep learning approach that enables spatially and chemically resolved insights into quantum-mechanical observables… ▽ More Learning from data has led to paradigm shifts in a multitude of disciplines, including web, text, and image search, speech recognition, as well as bioinformatics. Can machine learning enable similar breakthroughs in understanding quantum many-body systems? Here we develop an efficient deep learning approach that enables spatially and chemically resolved insights into quantum-mechanical observables of molecular systems. We unify concepts from many-body Hamiltonians with purpose-designed deep tensor neural networks (DTNN), which leads to size-extensive and uniformly accurate (1 kcal/mol) predictions in compositional and configurational chemical space for molecules of intermediate size. As an example of chemical relevance, the DTNN model reveals a classification of aromatic rings with respect to their stability -- a useful property that is not contained as such in the training dataset. Further applications of DTNN for predicting atomic energies and local chemical potentials in molecules, reliable isomer energies, and molecules with peculiar electronic structure demonstrate the high potential of machine learning for revealing novel insights into complex quantum-chemical systems. △ Less

Submitted 7 November, 2016; v1 submitted 27 September, 2016; originally announced September 2016.

Journal ref: Nature Comm. 8, 13890 (2017)

arXiv:1307.1266 [pdf, other]

doi 10.1103/PhysRevB.89.205118

How to represent crystal structures for machine learning: towards fast prediction of electronic properties

Authors: K. T. Schütt, H. Glawe, F. Brockherde, A. Sanna, K. R. Müller, E. K. U. Gross

Abstract: High-throughput density-functional calculations of solids are extremely time consuming. As an alternative, we here propose a machine learning approach for the fast prediction of solid-state properties. To achieve this, LSDA calculations are used as training set. We focus on predicting metallic vs. insulating behavior, and on predicting the value of the density of electronic states at the Fermi ene… ▽ More High-throughput density-functional calculations of solids are extremely time consuming. As an alternative, we here propose a machine learning approach for the fast prediction of solid-state properties. To achieve this, LSDA calculations are used as training set. We focus on predicting metallic vs. insulating behavior, and on predicting the value of the density of electronic states at the Fermi energy. We find that conventional representations of the input data, such as the Coulomb matrix, are not suitable for the training of learning machines in the case of periodic solids. We propose a novel crystal structure representation for which learning and competitive prediction accuracies become possible within an unrestricted class of spd systems. Due to magnetic phenomena learning on d systems is found more difficult than in pure sp systems. △ Less

Submitted 22 May, 2014; v1 submitted 4 July, 2013; originally announced July 2013.

Journal ref: Phys. Rev. B 89, 205118 (2014)

Showing 1–35 of 35 results for author: Schütt, T