-
ReLiCADA -- Reservoir Computing using Linear Cellular Automata Design Algorithm
Authors:
Jonas Kantic,
Fabian C. Legl,
Walter Stechele,
Jakob Hermann
Abstract:
In this paper, we present a novel algorithm to optimize the design of Reservoir Computing using Cellular Automata models for time series applications. Besides selecting the models' hyperparameters, the proposed algorithm particularly solves the open problem of linear Cellular Automaton rule selection. The selection method pre-selects only a few promising candidate rules out of an exponentially gro…
▽ More
In this paper, we present a novel algorithm to optimize the design of Reservoir Computing using Cellular Automata models for time series applications. Besides selecting the models' hyperparameters, the proposed algorithm particularly solves the open problem of linear Cellular Automaton rule selection. The selection method pre-selects only a few promising candidate rules out of an exponentially growing rule space. When applied to relevant benchmark datasets, the selected rules achieve low errors, with the best rules being among the top 5% of the overall rule space. The algorithm was developed based on mathematical analysis of linear Cellular Automaton properties and is backed by almost one million experiments, adding up to a computational runtime of nearly one year. Comparisons to other state-of-the-art time series models show that the proposed Reservoir Computing using Cellular Automata models have lower computational complexity, at the same time, achieve lower errors. Hence, our approach reduces the time needed for training and hyperparameter optimization by up to several orders of magnitude.
△ Less
Submitted 22 August, 2023;
originally announced August 2023.
-
libMBD: A general-purpose package for scalable quantum many-body dispersion calculations
Authors:
Jan Hermann,
Martin Stöhr,
Szabolcs Góger,
Shayantan Chaudhuri,
Bálint Aradi,
Reinhard J. Maurer,
Alexandre Tkatchenko
Abstract:
Many-body dispersion (MBD) is a powerful framework to treat van der Waals (vdW) dispersion interactions in density-functional theory and related atomistic modeling methods. Several independent implementations of MBD with varying degree of functionality exist across a number of electronic structure codes, which both limits the current users of those codes and complicates dissemination of new varian…
▽ More
Many-body dispersion (MBD) is a powerful framework to treat van der Waals (vdW) dispersion interactions in density-functional theory and related atomistic modeling methods. Several independent implementations of MBD with varying degree of functionality exist across a number of electronic structure codes, which both limits the current users of those codes and complicates dissemination of new variants of MBD. Here, we develop and document libMBD, a library implementation of MBD that is functionally complete, efficient, easy to integrate with any electronic structure code, and already integrated in FHI-aims, DFTB+, VASP, Q-Chem, CASTEP, and Quantum ESPRESSO. libMBD is written in modern Fortran with bindings to C and Python, uses MPI/ScaLAPACK for parallelization, and implements MBD for both finite and periodic systems, with analytical gradients with respect to all input parameters. The computational cost has asymptotic cubic scaling with system size, and evaluation of gradients only changes the prefactor of the scaling law, with libMBD exhibiting strong scaling up to 256 processor cores. Other MBD properties beyond energy and gradients can be calculated with libMBD, such as the charge-density polarization, first-order Coulomb correction, the dielectric function, or the order-by-order expansion of the energy in the dipole interaction. Calculations on supramolecular complexes with MBD-corrected electronic structure methods and a meta-review of previous applications of MBD demonstrate the broad applicability of the libMBD package to treat vdW interactions.
△ Less
Submitted 6 August, 2023;
originally announced August 2023.
-
DeepQMC: an open-source software suite for variational optimization of deep-learning molecular wave functions
Authors:
Zeno Schätzle,
Bernát Szabó,
Matĕj Mezera,
Jan Hermann,
Frank Noé
Abstract:
Computing accurate yet efficient approximations to the solutions of the electronic Schrödinger equation has been a paramount challenge of computational chemistry for decades. Quantum Monte Carlo methods are a promising avenue of development as their core algorithm exhibits a number of favorable properties: it is highly parallel, and scales favorably with the considered system size, with an accurac…
▽ More
Computing accurate yet efficient approximations to the solutions of the electronic Schrödinger equation has been a paramount challenge of computational chemistry for decades. Quantum Monte Carlo methods are a promising avenue of development as their core algorithm exhibits a number of favorable properties: it is highly parallel, and scales favorably with the considered system size, with an accuracy that is limited only by the choice of the wave function ansatz. The recently introduced machine-learned parametrizations of quantum Monte Carlo ansatzes rely on the efficiency of neural networks as universal function approximators to achieve state of the art accuracy on a variety of molecular systems. With interest in the field growing rapidly, there is a clear need for easy to use, modular, and extendable software libraries facilitating the development and adoption of this new class of methods. In this contribution, the DeepQMC program package is introduced, in an attempt to provide a common framework for future investigations by unifying many of the currently available deep-learning quantum Monte Carlo architectures. Furthermore, the manuscript provides a brief introduction to the methodology of variational quantum Monte Carlo in real space, highlights some technical challenges of optimizing neural network wave functions, and presents example black-box applications of the program package. We thereby intend to make this novel field accessible to a broader class of practitioners both from the quantum chemistry as well as the machine learning communities.
△ Less
Submitted 22 September, 2023; v1 submitted 26 July, 2023;
originally announced July 2023.
-
Variational principle to regularize machine-learned density functionals: the non-interacting kinetic-energy functional
Authors:
P. del Mazo-Sevillano,
J. Hermann
Abstract:
Practical density functional theory (DFT) owes its success to the groundbreaking work of Kohn and Sham that introduced the exact calculation of the non-interacting kinetic energy of the electrons using an auxiliary mean-field system. However, the full power of DFT will not be unleashed until the exact relationship between the electron density and the non-interacting kinetic energy is found. Variou…
▽ More
Practical density functional theory (DFT) owes its success to the groundbreaking work of Kohn and Sham that introduced the exact calculation of the non-interacting kinetic energy of the electrons using an auxiliary mean-field system. However, the full power of DFT will not be unleashed until the exact relationship between the electron density and the non-interacting kinetic energy is found. Various attempts have been made to approximate this functional, similar to the exchange--correlation functional, with much less success due to the larger contribution of kinetic energy and its more non-local nature. In this work we propose a new and efficient regularization method to train density functionals based on deep neural networks, with particular interest in the kinetic-energy functional. The method is tested on (effectively) one-dimensional systems, including the hydrogen chain, non-interacting electrons, and atoms of the first two periods, with excellent results. For the atomic systems, the generalizability of the regularization method is demonstrated by training also an exchange--correlation functional, and the contrasting nature of the two functionals is discussed from a machine-learning perspective.
△ Less
Submitted 30 June, 2023;
originally announced June 2023.
-
Extending the coherence time of spin defects in hBN enables advanced qubit control and quantum sensing
Authors:
Roberto Rizzato,
Martin Schalk,
Stephan Mohr,
Joachim P. Leibold,
Jens C. Hermann,
Fleming Bruckmaier,
Peirui Ji,
Georgy V. Astakhov,
Ulrich Kentsch,
Manfred Helm,
Andreas V. Stier,
Jonathan J. Finley,
Dominik B. Bucher
Abstract:
Spin defects in hexagonal Boron Nitride (hBN) attract increasing interest for quantum technology since they represent optically-addressable qubits in a van der Waals material. In particular, negatively-charged boron vacancy centers (${V_B}^-$) in hBN have shown promise as sensors of temperature, pressure, and static magnetic fields. However, the short spin coherence time of this defect currently l…
▽ More
Spin defects in hexagonal Boron Nitride (hBN) attract increasing interest for quantum technology since they represent optically-addressable qubits in a van der Waals material. In particular, negatively-charged boron vacancy centers (${V_B}^-$) in hBN have shown promise as sensors of temperature, pressure, and static magnetic fields. However, the short spin coherence time of this defect currently limits its scope for quantum technology. Here, we apply dynamical decoupling techniques to suppress magnetic noise and extend the spin coherence time by nearly two orders of magnitude, approaching the fundamental $T_1$ relaxation limit. Based on this improvement, we demonstrate advanced spin control and a set of quantum sensing protocols to detect electromagnetic signals in the MHz range with sub-Hz resolution. This work lays the foundation for nanoscale sensing using spin defects in an exfoliable material and opens a promising path to quantum sensors and quantum networks integrated into ultra-thin structures.
△ Less
Submitted 24 December, 2022;
originally announced December 2022.
-
NLO QCD corrections and off-shell effects for $t\bar{t}H$ production in the Higgs characterisation model
Authors:
Jonathan Hermann
Abstract:
In these proceedings we discuss $t\bar{t}H$ production at the Large Hadron Collider in the Higgs characterisation model. We demonstrate the importance of off-shell effects and higher-order QCD corrections for this process and highlight the importance of single-resonant contributions for the production of a $\mathcal{CP}$-odd Higgs boson.
In these proceedings we discuss $t\bar{t}H$ production at the Large Hadron Collider in the Higgs characterisation model. We demonstrate the importance of off-shell effects and higher-order QCD corrections for this process and highlight the importance of single-resonant contributions for the production of a $\mathcal{CP}$-odd Higgs boson.
△ Less
Submitted 1 December, 2022;
originally announced December 2022.
-
A Reference Model for Common Understanding of Capabilities and Skills in Manufacturing
Authors:
Aljosha Köcher,
Alexander Belyaev,
Jesko Hermann,
Jürgen Bock,
Kristof Meixner,
Magnus Volkmann,
Michael Winter,
Patrick Zimmermann,
Stephan Grimm,
Christian Diedrich
Abstract:
In manufacturing, many use cases of Industry 4.0 require vendor-neutral and machine-readable information models to describe, implement and execute resource functions. Such models have been researched under the terms capabilities and skills. Standardization of such models is required, but currently not available. This paper presents a reference model developed jointly by members of various organiza…
▽ More
In manufacturing, many use cases of Industry 4.0 require vendor-neutral and machine-readable information models to describe, implement and execute resource functions. Such models have been researched under the terms capabilities and skills. Standardization of such models is required, but currently not available. This paper presents a reference model developed jointly by members of various organizations in a working group of the Plattform Industrie 4.0. This model covers definitions of most important aspects of capabilities and skills. It can be seen as a basis for further standardization efforts.
△ Less
Submitted 15 September, 2022;
originally announced September 2022.
-
Ab-initio quantum chemistry with neural-network wavefunctions
Authors:
Jan Hermann,
James Spencer,
Kenny Choo,
Antonio Mezzacapo,
W. M. C. Foulkes,
David Pfau,
Giuseppe Carleo,
Frank Noé
Abstract:
Machine learning and specifically deep-learning methods have outperformed human capabilities in many pattern recognition and data processing problems, in game playing, and now also play an increasingly important role in scientific discovery. A key application of machine learning in the molecular sciences is to learn potential energy surfaces or force fields from ab-initio solutions of the electron…
▽ More
Machine learning and specifically deep-learning methods have outperformed human capabilities in many pattern recognition and data processing problems, in game playing, and now also play an increasingly important role in scientific discovery. A key application of machine learning in the molecular sciences is to learn potential energy surfaces or force fields from ab-initio solutions of the electronic Schrödinger equation using datasets obtained with density functional theory, coupled cluster, or other quantum chemistry methods. Here we review a recent and complementary approach: using machine learning to aid the direct solution of quantum chemistry problems from first principles. Specifically, we focus on quantum Monte Carlo (QMC) methods that use neural network ansatz functions in order to solve the electronic Schrödinger equation, both in first and second quantization, computing ground and excited states, and generalizing over multiple nuclear configurations. Compared to existing quantum chemistry methods, these new deep QMC methods have the potential to generate highly accurate solutions of the Schrödinger equation at relatively modest computational cost.
△ Less
Submitted 26 August, 2022;
originally announced August 2022.
-
${\cal CP}$ structure of the top-quark Yukawa interaction: NLO QCD corrections and off-shell effects
Authors:
Jonathan Hermann,
Daniel Stremmer,
Malgorzata Worek
Abstract:
Since its discovery at the Large Hadron Collider in 2012 the Higgs boson has arguably become the most famous of the Standard Model particles and many measurements have been performed in order to assess its properties. Among others, these include measurements of the Higgs boson's ${\cal CP}$ state which is predicted to be ${\cal CP}$-even. Even though a pure ${\cal CP}$-odd state has been ruled out…
▽ More
Since its discovery at the Large Hadron Collider in 2012 the Higgs boson has arguably become the most famous of the Standard Model particles and many measurements have been performed in order to assess its properties. Among others, these include measurements of the Higgs boson's ${\cal CP}$ state which is predicted to be ${\cal CP}$-even. Even though a pure ${\cal CP}$-odd state has been ruled out, a possible admixture of a ${\cal CP}$-odd Higgs state has yet to be excluded. In this work we present predictions for the associated production of a leptonically decaying top quark pair and a stable Higgs boson $p p \to e^+ ν_e\, μ^- \barν_μ\,b\bar{b}\,H$ with possible mixing between ${\cal CP}$-even and ${\cal CP}$-odd states at NLO in QCD for the LHC with $\sqrt{s} = 13$ TeV. Finite top-quark and gauge boson width effects as well as all double-, single- and non-resonant Feynman diagrams including their interference effects are taken into account. We compare the behaviour of the ${\cal CP}$-even, -odd and -mixed scenarios for the integrated fiducial cross-sections as well as several key differential distributions. In addition, we show that both NLO corrections and off-shell effects play an important role even at the level of integrated fiducial cross-sections and that these are further enhanced in differential distributions. Even though we focus here on the Standard Model Higgs boson, the calculations could be straightforwardly applied to models that have an extended Higgs-boson sector and predict the existence of ${\cal CP}$-odd Higgs-like particles, such as the two-Higgs-doublet model.
△ Less
Submitted 1 September, 2022; v1 submitted 20 May, 2022;
originally announced May 2022.
-
Electronic excited states in deep variational Monte Carlo
Authors:
Mike Entwistle,
Zeno Schätzle,
Paolo A. Erdman,
Jan Hermann,
Frank Noé
Abstract:
Obtaining accurate ground and low-lying excited states of electronic systems is crucial in a multitude of important applications. One ab initio method for solving the Schrödinger equation that scales favorably for large systems is variational quantum Monte Carlo (QMC). The recently introduced deep QMC approach uses ansatzes represented by deep neural networks and generates nearly exact ground-stat…
▽ More
Obtaining accurate ground and low-lying excited states of electronic systems is crucial in a multitude of important applications. One ab initio method for solving the Schrödinger equation that scales favorably for large systems is variational quantum Monte Carlo (QMC). The recently introduced deep QMC approach uses ansatzes represented by deep neural networks and generates nearly exact ground-state solutions for molecules containing up to a few dozen electrons, with the potential to scale to much larger systems where other highly accurate methods are not feasible. In this paper, we extend one such ansatz (PauliNet) to compute electronic excited states. We demonstrate our method on various small atoms and molecules and consistently achieve high accuracy for low-lying states. To highlight the method's potential, we compute the first excited state of the much larger benzene molecule, as well as the conical intersection of ethylene, with PauliNet matching results of more expensive high-level methods.
△ Less
Submitted 18 January, 2023; v1 submitted 17 March, 2022;
originally announced March 2022.
-
On the exclusion limits in $t\bar{t}+$DM searches at the LHC
Authors:
Jonathan Hermann
Abstract:
In these proceedings we discuss the importance of higher-order corrections and off-shell effects for the calculation of signal strength exclusion limits in $t\bar{t}+$DM searches at the Large Hadron Collider. We present limits for the spin-0 s-channel mediator model using state-of-the-art NLO QCD predictions with full off-shell effects for the relevant $t\bar{t}$ and $t\bar{t}Z$ background process…
▽ More
In these proceedings we discuss the importance of higher-order corrections and off-shell effects for the calculation of signal strength exclusion limits in $t\bar{t}+$DM searches at the Large Hadron Collider. We present limits for the spin-0 s-channel mediator model using state-of-the-art NLO QCD predictions with full off-shell effects for the relevant $t\bar{t}$ and $t\bar{t}Z$ background processes. These are then compared to less sophisticated predictions using either the narrow-width approximation or LO calculations in order to asses the respective effects.
△ Less
Submitted 7 January, 2022;
originally announced January 2022.
-
The impact of top-quark modelling on the exclusion limits in $\boldsymbol{t\bar{t}}+\text{DM}$ searches at the LHC
Authors:
Jonathan Hermann,
Malgorzata Worek
Abstract:
New Physics searches at the LHC rely very heavily on the precision and accuracy of Standard Model background predictions. Applying the spin-0 $s$-channel mediator model, we assess the importance of properly modelling such backgrounds in $t\bar{t}$ associated Dark Matter production. Specifically, we discuss higher-order corrections and off-shell effects for the two dominant background processes…
▽ More
New Physics searches at the LHC rely very heavily on the precision and accuracy of Standard Model background predictions. Applying the spin-0 $s$-channel mediator model, we assess the importance of properly modelling such backgrounds in $t\bar{t}$ associated Dark Matter production. Specifically, we discuss higher-order corrections and off-shell effects for the two dominant background processes $t\bar{t}$ and $t\bar{t}Z$ in the presence of extremely exclusive cuts. Exclusion limits are calculated for state-of-the-art NLO full off-shell $t\bar{t}$ and $t\bar{t}Z$ predictions and compared to those computed with backgrounds in the NWA and / or at LO. We perform the same comparison for several new-physics sensitive observables and evaluate which of them are affected by the top-quark modelling. Additionally, we make suggestions as to which observables should be used to obtain the most stringent limits assuming integrated luminosities of $300$ fb$^{-1}$ and $3000$ fb$^{-1}$.
△ Less
Submitted 15 November, 2021; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Convergence to the fixed-node limit in deep variational Monte Carlo
Authors:
Zeno Schätzle,
Jan Hermann,
Frank Noé
Abstract:
Variational quantum Monte Carlo (QMC) is an ab-initio method for solving the electronic Schrödinger equation that is exact in principle, but limited by the flexibility of the available ansatzes in practice. The recently introduced deep QMC approach, specifically two deep-neural-network ansatzes PauliNet and FermiNet, allows variational QMC to reach the accuracy of diffusion QMC, but little is unde…
▽ More
Variational quantum Monte Carlo (QMC) is an ab-initio method for solving the electronic Schrödinger equation that is exact in principle, but limited by the flexibility of the available ansatzes in practice. The recently introduced deep QMC approach, specifically two deep-neural-network ansatzes PauliNet and FermiNet, allows variational QMC to reach the accuracy of diffusion QMC, but little is understood about the convergence behavior of such ansatzes. Here, we analyze how deep variational QMC approaches the fixed-node limit with increasing network size. First, we demonstrate that a deep neural network can overcome the limitations of a small basis set and reach the mean-field complete-basis-set limit. Moving to electron correlation, we then perform an extensive hyperparameter scan of a deep Jastrow factor for LiH and H$_4$ and find that variational energies at the fixed-node limit can be obtained with a sufficiently large network. Finally, we benchmark mean-field and many-body ansatzes on H$_2$O, increasing the fraction of recovered fixed-node correlation energy of single-determinant Slater--Jastrow-type ansatzes by half an order of magnitude compared to previous variational QMC results and demonstrate that a single-determinant Slater--Jastrow--backflow version of the ansatz overcomes the fixed-node limitations. This analysis helps understanding the superb accuracy of deep variational ansatzes in comparison to the traditional trial wavefunctions at the respective level of theory, and will guide future improvements of the neural network architectures in deep QMC.
△ Less
Submitted 25 March, 2021; v1 submitted 11 October, 2020;
originally announced October 2020.
-
Coulomb Interactions between Dipolar Quantum Fluctuations in van der Waals Bound Molecules and Materials
Authors:
Martin Stöhr,
Mainak Sadhukhan,
Yasmine S. Al-Hamdani,
Jan Hermann,
Alexandre Tkatchenko
Abstract:
Mutual Coulomb interactions between electrons lead to a plethora of interesting physical and chemical effects, especially if those interactions involve many fluctuating electrons over large spatial scales. Here, we identify and study in detail the Coulomb interaction between dipolar quantum fluctuations in the context of van der Waals complexes and materials. Up to now, the interaction arising fro…
▽ More
Mutual Coulomb interactions between electrons lead to a plethora of interesting physical and chemical effects, especially if those interactions involve many fluctuating electrons over large spatial scales. Here, we identify and study in detail the Coulomb interaction between dipolar quantum fluctuations in the context of van der Waals complexes and materials. Up to now, the interaction arising from the modification of the electron density due to quantum van der Waals interactions was considered to be vanishingly small. We demonstrate that in supramolecular systems and for molecules embedded in nanostructures, such contributions can amount to up to 6 kJ/mol and can even lead to qualitative changes in the long-range vdW interaction. Taking into account these broad implications, we advocate for the systematic assessment of so-called Coulomb singles in large molecular systems and discuss their relevance for explaining several recent puzzling experimental observations of collective behavior in nanostructured materials.
△ Less
Submitted 24 July, 2020;
originally announced July 2020.
-
Fluctuational Electrodynamics in Atomic and Macroscopic Systems: van der Waals Interactions and Radiative Heat Transfer
Authors:
Prashanth S. Venkataram,
Jan Hermann,
Alexandre Tkatchenko,
Alejandro W. Rodriguez
Abstract:
We present an approach to describing fluctuational electrodynamic (FED) interactions, particularly van der Waals (vdW) interactions as well as radiative heat transfer (RHT), between material bodies of vastly different length scales, allowing for going between atomistic and continuum treatments of the response of each of these bodies as desired. Any local continuum description of electromagnetic (E…
▽ More
We present an approach to describing fluctuational electrodynamic (FED) interactions, particularly van der Waals (vdW) interactions as well as radiative heat transfer (RHT), between material bodies of vastly different length scales, allowing for going between atomistic and continuum treatments of the response of each of these bodies as desired. Any local continuum description of electromagnetic (EM) response is compatible with our approach, while atomistic descriptions in our approach are based on effective electronic and nuclear oscillator degrees of freedom, encapsulating dissipation, short-range electronic correlations, and collective nuclear vibrations (phonons). While our previous works using this approach have focused on presenting novel results, this work focuses on the derivations underlying these methods. First, we show how the distinction between "atomic" and "macroscopic" bodies is ultimately somewhat arbitrary, as formulas for vdW free energies and RHT look very similar regardless of how the distinction is drawn. Next, we demonstrate that the atomistic description of material response in our approach yields EM interaction matrix elements which are expressed in terms of analytical formulas for compact bodies or semianalytical formulas based on Ewald summation for periodic media; we use this to compute vdW interaction free energies as well as RHT powers among small biological molecules in the presence of a metallic plate as well as between parallel graphene sheets in vacuum, showing strong deviations from conventional macroscopic theories due to the confluence of geometry, phonons, and EM retardation effects. Finally, we propose formulas for efficient computation of FED interactions among material bodies in which those that are treated atomistically as well as those treated through continuum methods may have arbitrary shapes, extending previous surface-integral techniques.
△ Less
Submitted 8 May, 2020;
originally announced May 2020.
-
Recent developments in the PySCF program package
Authors:
Qiming Sun,
Xing Zhang,
Samragni Banerjee,
Peng Bao,
Marc Barbry,
Nick S. Blunt,
Nikolay A. Bogdanov,
George H. Booth,
Jia Chen,
Zhi-Hao Cui,
Janus Juul Eriksen,
Yang Gao,
Sheng Guo,
Jan Hermann,
Matthew R. Hermes,
Kevin Koh,
Peter Koval,
Susi Lehtola,
Zhendong Li,
Junzi Liu,
Narbe Mardirossian,
James D. McClain,
Mario Motta,
Bastien Mussard,
Hung Q. Pham
, et al. (24 additional authors not shown)
Abstract:
PYSCF is a Python-based general-purpose electronic structure platform that both supports first-principles simulations of molecules and solids, as well as accelerates the development of new methodology and complex computational workflows. The present paper explains the design and philosophy behind PYSCF that enables it to meet these twin objectives. With several case studies, we show how users can…
▽ More
PYSCF is a Python-based general-purpose electronic structure platform that both supports first-principles simulations of molecules and solids, as well as accelerates the development of new methodology and complex computational workflows. The present paper explains the design and philosophy behind PYSCF that enables it to meet these twin objectives. With several case studies, we show how users can easily implement their own methods using PYSCF as a development environment. We then summarize the capabilities of PYSCF for molecular and solid-state simulations. Finally, we describe the growing ecosystem of projects that use PYSCF across the domains of quantum chemistry, materials science, machine learning and quantum information science.
△ Less
Submitted 10 July, 2020; v1 submitted 27 February, 2020;
originally announced February 2020.
-
Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory
Authors:
Julien Herrmann,
Olivier Beaumont,
Lionel Eyraud-Dubois,
Julien Hermann,
Alexis Joly,
Alena Shilova
Abstract:
This paper introduces a new activation checkpointing method which allows to significantly decrease memory usage when training Deep Neural Networks with the back-propagation algorithm. Similarly to checkpoint-ing techniques coming from the literature on Automatic Differentiation, it consists in dynamically selecting the forward activations that are saved during the training phase, and then automati…
▽ More
This paper introduces a new activation checkpointing method which allows to significantly decrease memory usage when training Deep Neural Networks with the back-propagation algorithm. Similarly to checkpoint-ing techniques coming from the literature on Automatic Differentiation, it consists in dynamically selecting the forward activations that are saved during the training phase, and then automatically recomputing missing activations from those previously recorded. We propose an original computation model that combines two types of activation savings: either only storing the layer inputs, or recording the complete history of operations that produced the outputs (this uses more memory, but requires fewer recomputations in the backward phase), and we provide an algorithm to compute the optimal computation sequence for this model. This paper also describes a PyTorch implementation that processes the entire chain, dealing with any sequential DNN whose internal layers may be arbitrarily complex and automatically executing it according to the optimal checkpointing strategy computed given a memory limit. Through extensive experiments, we show that our implementation consistently outperforms existing checkpoint-ing approaches for a large class of networks, image sizes and batch sizes.
△ Less
Submitted 27 November, 2019;
originally announced November 2019.
-
Density-functional model for van der Waals interactions: Unifying many-body atomic approaches with nonlocal functionals
Authors:
Jan Hermann,
Alexandre Tkatchenko
Abstract:
Noncovalent van der Waals (vdW) interactions are responsible for a wide range of phenomena in matter. Popular density-functional methods that treat vdW interactions use disparate physical models for these intricate forces, and as a result the applicability of these methods is often restricted to a subset of relevant molecules and materials. Aiming towards a general-purpose density functional model…
▽ More
Noncovalent van der Waals (vdW) interactions are responsible for a wide range of phenomena in matter. Popular density-functional methods that treat vdW interactions use disparate physical models for these intricate forces, and as a result the applicability of these methods is often restricted to a subset of relevant molecules and materials. Aiming towards a general-purpose density functional model of vdW interactions, here we unify two complementary approaches: nonlocal vdW functionals for polarization and interatomic methods for many-body interactions. The developed nonlocal many-body dispersion method (MBD-NL) increases the accuracy and efficiency of existing vdW functionals and is shown to be broadly applicable to molecules, soft and hard materials including ionic and metallic compounds, as well as organic/inorganic interfaces.
△ Less
Submitted 10 March, 2020; v1 submitted 7 October, 2019;
originally announced October 2019.
-
Deep neural network solution of the electronic Schrödinger equation
Authors:
Jan Hermann,
Zeno Schätzle,
Frank Noé
Abstract:
[New and updated results were published in Nature Chemistry, doi:10.1038/s41557-020-0544-y.] The electronic Schrödinger equation describes fundamental properties of molecules and materials, but can only be solved analytically for the hydrogen atom. The numerically exact full configuration-interaction method is exponentially expensive in the number of electrons. Quantum Monte Carlo is a possible wa…
▽ More
[New and updated results were published in Nature Chemistry, doi:10.1038/s41557-020-0544-y.] The electronic Schrödinger equation describes fundamental properties of molecules and materials, but can only be solved analytically for the hydrogen atom. The numerically exact full configuration-interaction method is exponentially expensive in the number of electrons. Quantum Monte Carlo is a possible way out: it scales well to large molecules, can be parallelized, and its accuracy has, as yet, only been limited by the flexibility of the used wave function ansatz. Here we propose PauliNet, a deep-learning wave function ansatz that achieves nearly exact solutions of the electronic Schrödinger equation. PauliNet has a multireference Hartree-Fock solution built in as a baseline, incorporates the physics of valid wave functions, and is trained using variational quantum Monte Carlo (VMC). PauliNet outperforms comparable state-of-the-art VMC ansatzes for atoms, diatomic molecules and a strongly-correlated hydrogen chain by a margin and is yet computationally efficient. We anticipate that thanks to the favourable scaling with system size, this method may become a new leading method for highly accurate electronic-strucutre calculations on medium-sized molecular systems.
△ Less
Submitted 23 September, 2020; v1 submitted 16 September, 2019;
originally announced September 2019.
-
Impact of nuclear vibrations on van der Waals and Casimir interactions at zero and finite temperature
Authors:
Prashanth S. Venkataram,
Jan Hermann,
Teerit J. Vongkovit,
Alexandre Tkatchenko,
Alejandro W. Rodriguez
Abstract:
Van der Waals (vdW) and Casimir interactions depend crucially on material properties and geometry, especially at molecular scales, and temperature can produce noticeable relative shifts in interaction characteristics. Despite this, common treatments of these interactions ignore electromagnetic retardation, atomism, or contributions of collective mechanical vibrations (phonons) to the infrared resp…
▽ More
Van der Waals (vdW) and Casimir interactions depend crucially on material properties and geometry, especially at molecular scales, and temperature can produce noticeable relative shifts in interaction characteristics. Despite this, common treatments of these interactions ignore electromagnetic retardation, atomism, or contributions of collective mechanical vibrations (phonons) to the infrared response, which can interplay with temperature in nontrivial ways. We present a theoretical framework for computing electromagnetic interactions among molecular structures, accounting for their geometry, electronic delocalization, short-range interatomic correlations, dissipation, and phonons at atomic scales, along with long-range electromagnetic interactions among themselves or in the vicinity of continuous macroscopic bodies. We find that in carbon allotropes, particularly fullerenes, carbyne wires, and graphene sheets, phonons can couple strongly with long-range electromagnetic fields, especially at mesoscopic scales (nanometers), to create delocalized phonon polaritons that significantly modify the infrared molecular response. These polaritons especially depend on the molecular dimensionality and dissipation, and in turn affect the vdW interaction free energies of these bodies above a macroscopic gold surface, producing nonmonotonic power laws and nontrivial temperature variations at nanometer separations that are within the reach of current Casimir force experiments.
△ Less
Submitted 8 October, 2018;
originally announced October 2018.
-
Phonon-polariton mediated thermal radiation and heat transfer among molecules and macroscopic bodies: nonlocal electromagnetic response at mesoscopic scales
Authors:
Prashanth S. Venkataram,
Jan Hermann,
Alexandre Tkatchenko,
Alejandro W. Rodriguez
Abstract:
Thermal radiative phenomena can be strongly influenced by the coupling of phonons and long-range electromagnetic fields at infrared frequencies. Typically employed macroscopic descriptions of thermal fluctuations tend to ignore atomistic effects that become relevant at nanometric scales, whereas purely microscopic treatments ignore long-range, geometry-dependent electromagnetic effects. We describ…
▽ More
Thermal radiative phenomena can be strongly influenced by the coupling of phonons and long-range electromagnetic fields at infrared frequencies. Typically employed macroscopic descriptions of thermal fluctuations tend to ignore atomistic effects that become relevant at nanometric scales, whereas purely microscopic treatments ignore long-range, geometry-dependent electromagnetic effects. We describe a mesoscopic framework for modeling thermal fluctuation phenomena among molecules in the vicinity of macroscopic bodies, conjoining atomistic treatments of electronic and vibrational fluctuations obtained from ab-initio density functional theory in the former with continuum descriptions of electromagnetic scattering in the latter. The interplay of these effects becomes particularly important at mesoscopic scales, where phonon polaritons can be strongly influenced by the finite sizes, shapes, and non-local/many-body response of the bodies to electromagnetic fluctuations. We show that even in small but especially in elongated low-dimensional molecular systems, such effects can modify thermal emission and heat transfer by orders of magnitude and produce qualitatively different behavior compared to predictions based on local, dipolar, or pairwise approximations valid only in dilute media.
△ Less
Submitted 19 March, 2018;
originally announced March 2018.
-
Unifying microscopic and continuum treatments of van der Waals and Casimir interactions
Authors:
Prashanth S. Venkataram,
Jan Hermann,
Alexandre Tkatchenko,
Alejandro W. Rodriguez
Abstract:
We present an approach for computing long-range van der Waals (vdW) interactions between complex molecular systems and arbitrarily shaped macroscopic bodies, melding atomistic treatments of electronic fluctuations based on density functional theory in the former, with continuum descriptions of strongly shape-dependent electromagnetic fields in the latter, thus capturing many-body and multiple scat…
▽ More
We present an approach for computing long-range van der Waals (vdW) interactions between complex molecular systems and arbitrarily shaped macroscopic bodies, melding atomistic treatments of electronic fluctuations based on density functional theory in the former, with continuum descriptions of strongly shape-dependent electromagnetic fields in the latter, thus capturing many-body and multiple scattering effects to all orders. Such a theory is especially important when considering vdW interactions at mesoscopic scales, i.e. between molecules and structured surfaces with features on the scale of molecular sizes, in which case the finite sizes, complex shapes, and resulting nonlocal electronic excitations of molecules are strongly influenced by electromagnetic retardation and wave effects that depend crucially on the shapes of surrounding macroscopic bodies. We show that these effects together can modify vdW interactions by orders of magnitude compared to previous treatments based on Casimir--Polder or non-retarded approximations, which are valid only at macroscopically large or atomic-scale separations, respectively.
△ Less
Submitted 25 January, 2017;
originally announced January 2017.
-
Analyses of femtosecond laser ablation of Ti, Zr, Hf
Authors:
D. Grojo,
J. Hermann,
S. Bruneau,
T. Itina
Abstract:
Femtosecond laser ablation of Ti, Zr and Hf has been investigated by means of in-situ plasma diagnostics. Fast plasma imaging with the aid of an intensified charged coupled device (ICCD) camera was used to characterise the plasma plume expansion on a nanosecond time scale. Time- and spaceresolved optical emission spectroscopy was employed to perform time-of-flight measurements of ions and neutra…
▽ More
Femtosecond laser ablation of Ti, Zr and Hf has been investigated by means of in-situ plasma diagnostics. Fast plasma imaging with the aid of an intensified charged coupled device (ICCD) camera was used to characterise the plasma plume expansion on a nanosecond time scale. Time- and spaceresolved optical emission spectroscopy was employed to perform time-of-flight measurements of ions and neutral atoms. It is shown that two plasma components with different expansion velocities are generated by the ultra-short laser ablation process. The expansion behaviour of these two components has been analysed as a function of laser fluence and target material. The results are discussed in terms of mechanisms responsible for ultra-short laser ablation.
△ Less
Submitted 31 October, 2006;
originally announced October 2006.