-
Physics-Informed Bayesian Optimization of Variational Quantum Circuits
Authors:
Kim A. Nicoli,
Christopher J. Anders,
Lena Funcke,
Tobias Hartung,
Karl Jansen,
Stefan Kühn,
Klaus-Robert Müller,
Paolo Stornati,
Pan Kessel,
Shinichi Nakajima
Abstract:
In this paper, we propose a novel and powerful method to harness Bayesian optimization for Variational Quantum Eigensolvers (VQEs), a hybrid quantum-classical protocol used to approximate the ground state of a quantum Hamiltonian. Specifically, we derive a VQE-kernel which incorporates important prior information about quantum circuits: the kernel feature map of the VQE-kernel exactly matches the known functional form of the VQE's objective function and thereby significantly reduces the posterior uncertainty. Moreover, we propose a novel acquisition function for Bayesian optimization called Expected Maximum Improvement over Confident Regions (EMICoRe) which can actively exploit the inductive bias of the VQE-kernel by treating regions with low predictive uncertainty as indirectly "observed". As a result, observations at as few as three points in the search domain are sufficient to determine the complete objective function along an entire one-dimensional subspace of the optimization landscape. Our numerical experiments demonstrate that our approach improves over state-of-the-art baselines.
Submitted 10 June, 2024;
originally announced June 2024.
-
Detecting and Mitigating Mode-Collapse for Flow-based Sampling of Lattice Field Theories
Authors:
Kim A. Nicoli,
Christopher J. Anders,
Tobias Hartung,
Karl Jansen,
Pan Kessel,
Shinichi Nakajima
Abstract:
We study the consequences of mode-collapse of normalizing flows in the context of lattice field theory. Normalizing flows allow for independent sampling. For this reason, it is hoped that they can avoid the tunneling problem of local-update MCMC algorithms for multi-modal distributions. In this work, we first point out that the tunneling problem is also present for normalizing flows but is shifted from the sampling to the training phase of the algorithm. Specifically, normalizing flows often suffer from mode-collapse, in which the training process assigns vanishingly low probability mass to relevant modes of the physical distribution. This may result in a significant bias when the flow is used as a sampler in a Markov chain or with importance sampling. We propose a metric to quantify the degree of mode-collapse and derive a bound on the resulting bias. Furthermore, we propose various mitigation strategies, in particular in the context of estimating thermodynamic observables such as the free energy.
Submitted 3 November, 2023; v1 submitted 27 February, 2023;
originally announced February 2023.
-
Gradients should stay on Path: Better Estimators of the Reverse- and Forward KL Divergence for Normalizing Flows
Authors:
Lorenz Vaitl,
Kim A. Nicoli,
Shinichi Nakajima,
Pan Kessel
Abstract:
We propose an algorithm to estimate the path-gradient of both the reverse and forward Kullback-Leibler divergence for an arbitrary manifestly invertible normalizing flow. The resulting path-gradient estimators are straightforward to implement, have lower variance, and lead not only to faster convergence of training but also to better overall approximation results compared to standard total gradient estimators. We also demonstrate that path-gradient training is less susceptible to mode-collapse. In light of our results, we expect that path-gradient estimators will become the new standard method to train normalizing flows for variational inference.
Submitted 17 July, 2022;
originally announced July 2022.
-
Path-Gradient Estimators for Continuous Normalizing Flows
Authors:
Lorenz Vaitl,
Kim A. Nicoli,
Shinichi Nakajima,
Pan Kessel
Abstract:
Recent work has established a path-gradient estimator for simple variational Gaussian distributions and has argued that the path-gradient is particularly beneficial in the regime in which the variational distribution approaches the exact target distribution. In many applications, however, this regime cannot be reached by a simple Gaussian variational distribution. In this work, we overcome this crucial limitation by proposing a path-gradient estimator for the considerably more expressive variational family of continuous normalizing flows. We outline an efficient algorithm to calculate this estimator and establish its superior performance empirically.
Submitted 17 June, 2022;
originally announced June 2022.
-
Modern applications of machine learning in quantum sciences
Authors:
Anna Dawid,
Julian Arnold,
Borja Requena,
Alexander Gresch,
Marcin Płodzień,
Kaelan Donatella,
Kim A. Nicoli,
Paolo Stornati,
Rouven Koch,
Miriam Büttner,
Robert Okuła,
Gorka Muñoz-Gil,
Rodrigo A. Vargas-Hernández,
Alba Cervera-Lierta,
Juan Carrasquilla,
Vedran Dunjko,
Marylou Gabrié,
Patrick Huembeli,
Evert van Nieuwenburg,
Filippo Vicentini,
Lei Wang,
Sebastian J. Wetzel,
Giuseppe Carleo,
Eliška Greplová,
Roman Krems
, et al. (4 additional authors not shown)
Abstract:
In this book, we provide a comprehensive introduction to the most recent advances in the application of machine learning methods in quantum sciences. We cover the use of deep learning and kernel methods in supervised, unsupervised, and reinforcement learning algorithms for phase classification, representation of many-body quantum states, quantum feedback control, and quantum circuits optimization. Moreover, we introduce and discuss more specialized topics such as differentiable programming, generative models, statistical approach to machine learning, and quantum machine learning.
Submitted 15 November, 2023; v1 submitted 8 April, 2022;
originally announced April 2022.
-
Machine Learning of Thermodynamic Observables in the Presence of Mode Collapse
Authors:
Kim A. Nicoli,
Christopher Anders,
Lena Funcke,
Tobias Hartung,
Karl Jansen,
Pan Kessel,
Shinichi Nakajima,
Paolo Stornati
Abstract:
Estimating the free energy, as well as other thermodynamic observables, is a key task in lattice field theories. Recently, it has been pointed out that deep generative models can be used in this context [1]. Crucially, these models allow for the direct estimation of the free energy at a given point in parameter space. This is in contrast to existing methods based on Markov chains, which generically require integration through parameter space. In this contribution, we review this novel machine-learning-based estimation method. We discuss the issue of mode collapse in detail and outline mitigation techniques which are particularly suited for applications at finite temperature.
Submitted 30 November, 2021; v1 submitted 22 November, 2021;
originally announced November 2021.
-
Estimation of Thermodynamic Observables in Lattice Field Theories with Deep Generative Models
Authors:
Kim A. Nicoli,
Christopher J. Anders,
Lena Funcke,
Tobias Hartung,
Karl Jansen,
Pan Kessel,
Shinichi Nakajima,
Paolo Stornati
Abstract:
In this work, we demonstrate that applying deep generative machine learning models for lattice field theory is a promising route for solving problems where Markov Chain Monte Carlo (MCMC) methods are problematic. More specifically, we show that generative models can be used to estimate the absolute value of the free energy, in contrast to existing MCMC-based methods, which are limited to estimating only free energy differences. We demonstrate the effectiveness of the proposed method for two-dimensional $\phi^4$ theory and compare it to MCMC-based methods in detailed numerical experiments.
Submitted 5 January, 2021; v1 submitted 14 July, 2020;
originally announced July 2020.
-
Asymptotically unbiased estimation of physical observables with neural samplers
Authors:
Kim A. Nicoli,
Shinichi Nakajima,
Nils Strodthoff,
Wojciech Samek,
Klaus-Robert Müller,
Pan Kessel
Abstract:
We propose a general framework for the estimation of observables with generative neural samplers focusing on modern deep generative neural networks that provide an exact sampling probability. In this framework, we present asymptotically unbiased estimators for generic observables, including those that explicitly depend on the partition function such as free energy or entropy, and derive corresponding variance estimators. We demonstrate their practical applicability by numerical experiments for the 2d Ising model which highlight the superiority over existing methods. Our approach greatly enhances the applicability of generative neural samplers to real-world physical systems.
Submitted 13 February, 2020; v1 submitted 29 October, 2019;
originally announced October 2019.
-
Comment on "Solving Statistical Mechanics Using VANs": Introducing saVANt - VANs Enhanced by Importance and MCMC Sampling
Authors:
Kim Nicoli,
Pan Kessel,
Nils Strodthoff,
Wojciech Samek,
Klaus-Robert Müller,
Shinichi Nakajima
Abstract:
In this comment on "Solving Statistical Mechanics Using Variational Autoregressive Networks" by Wu et al., we propose a subtle yet powerful modification of their approach. We show that the inherent sampling error of their method can be corrected by using neural network-based MCMC or importance sampling which leads to asymptotically unbiased estimators for physical quantities. This modification is possible due to a singular property of VANs, namely that they provide the exact sample probability. With these modifications, we believe that their method could have a substantially greater impact on various important fields of physics, including strongly-interacting field theories and statistical physics.
Submitted 26 March, 2019;
originally announced March 2019.
-
Analysis of Atomistic Representations Using Weighted Skip-Connections
Authors:
Kim A. Nicoli,
Pan Kessel,
Michael Gastegger,
Kristof T. Schütt
Abstract:
In this work, we extend the SchNet architecture by using weighted skip connections to assemble the final representation. This enables us to study the relative importance of each interaction block for property prediction. We demonstrate on both the QM9 and MD17 datasets that their relative weighting depends strongly on the chemical composition and configurational degrees of freedom of the molecules, which opens the path towards a more detailed understanding of machine learning models for molecules.
Submitted 14 November, 2018; v1 submitted 23 October, 2018;
originally announced October 2018.
-
SchNetPack: A Deep Learning Toolbox For Atomistic Systems
Authors:
K. T. Schütt,
P. Kessel,
M. Gastegger,
K. Nicoli,
A. Tkatchenko,
K. -R. Müller
Abstract:
SchNetPack is a toolbox for the development and application of deep neural networks to the prediction of potential energy surfaces and other quantum-chemical properties of molecules and materials. It contains basic building blocks of atomistic neural networks, manages their training, and provides simple access to common benchmark datasets. This allows for an easy implementation and evaluation of new models. For now, SchNetPack includes implementations of (weighted) atom-centered symmetry functions and the deep tensor neural network SchNet, as well as ready-to-use scripts for training these models on molecule and material datasets. Based upon the PyTorch deep learning framework, SchNetPack makes it possible to apply the neural networks efficiently to large datasets with millions of reference calculations and to parallelize the model across multiple GPUs. Finally, SchNetPack provides an interface to the Atomic Simulation Environment in order to make trained models easily accessible to researchers who are not yet familiar with neural networks.
Submitted 4 September, 2018;
originally announced September 2018.