-
Towards equilibrium molecular conformation generation with GFlowNets
Authors:
Alexandra Volokhova,
Michał Koziarski,
Alex Hernández-García,
Cheng-Hao Liu,
Santiago Miret,
Pablo Lemos,
Luca Thiede,
Zichao Yan,
Alán Aspuru-Guzik,
Yoshua Bengio
Abstract:
Sampling diverse, thermodynamically feasible molecular conformations plays a crucial role in predicting properties of a molecule. In this paper we propose to use GFlowNet for sampling conformations of small molecules from the Boltzmann distribution, as determined by the molecule's energy. The proposed approach can be used in combination with energy estimation methods of different fidelity and discovers a diverse set of low-energy conformations for highly flexible drug-like molecules. We demonstrate that GFlowNet can reproduce molecular potential energy surfaces by sampling proportionally to the Boltzmann distribution.
Submitted 20 October, 2023;
originally announced October 2023.
-
Crystal-GFN: sampling crystals with desirable properties and constraints
Authors:
Mila AI4Science,
Alex Hernandez-Garcia,
Alexandre Duval,
Alexandra Volokhova,
Yoshua Bengio,
Divya Sharma,
Pierre Luc Carrier,
Yasmine Benabed,
Michał Koziarski,
Victor Schmidt
Abstract:
Accelerating material discovery holds the potential to greatly help mitigate the climate crisis. Discovering new solid-state materials such as electrocatalysts, super-ionic conductors or photovoltaic materials can have a crucial impact, for instance, in improving the efficiency of renewable energy production and storage. In this paper, we introduce Crystal-GFN, a generative model of crystal structures that sequentially samples structural properties of crystalline materials, namely the space group, composition and lattice parameters. This domain-inspired approach enables the flexible incorporation of physical and structural hard constraints, as well as the use of any available predictive model of a desired physicochemical property as an objective function. To design stable materials, one must target the candidates with the lowest formation energy. Here, we use as objective the formation energy per atom of a crystal structure predicted by a new proxy machine learning model trained on MatBench. The results demonstrate that Crystal-GFN is able to sample highly diverse crystals with low (median -3.1 eV/atom) predicted formation energy.
Submitted 13 December, 2023; v1 submitted 7 October, 2023;
originally announced October 2023.
-
A theory of continuous generative flow networks
Authors:
Salem Lahlou,
Tristan Deleu,
Pablo Lemos,
Dinghuai Zhang,
Alexandra Volokhova,
Alex Hernández-García,
Léna Néhale Ezzine,
Yoshua Bengio,
Nikolay Malkin
Abstract:
Generative flow networks (GFlowNets) are amortized variational inference algorithms that are trained to sample from unnormalized target distributions over compositional objects. A key limitation of GFlowNets until this time has been that they are restricted to discrete spaces. We present a theory for generalized GFlowNets, which encompasses both existing discrete GFlowNets and ones with continuous or hybrid state spaces, and perform experiments with two goals in mind. First, we illustrate critical points of the theory and the importance of various assumptions. Second, we empirically demonstrate how observations about discrete GFlowNets transfer to the continuous case and show strong results compared to non-GFlowNet baselines on several previously studied tasks. This work greatly widens the perspectives for the application of GFlowNets in probabilistic inference and various modeling settings.
Submitted 25 May, 2023; v1 submitted 29 January, 2023;
originally announced January 2023.
-
Generative Flow Networks for Discrete Probabilistic Modeling
Authors:
Dinghuai Zhang,
Nikolay Malkin,
Zhen Liu,
Alexandra Volokhova,
Aaron Courville,
Yoshua Bengio
Abstract:
We present energy-based generative flow networks (EB-GFN), a novel probabilistic modeling algorithm for high-dimensional discrete data. Building upon the theory of generative flow networks (GFlowNets), we model the generation process by a stochastic data construction policy and thus amortize expensive MCMC exploration into a fixed number of actions sampled from a GFlowNet. We show how GFlowNets can approximately perform large-block Gibbs sampling to mix between modes. We propose a framework to jointly train a GFlowNet with an energy function, so that the GFlowNet learns to sample from the energy distribution, while the energy learns with an approximate MLE objective with negative samples from the GFlowNet. We demonstrate EB-GFN's effectiveness on various probabilistic modeling tasks. Code is publicly available at https://github.com/zdhNarsil/EB_GFN.
Submitted 8 June, 2022; v1 submitted 2 February, 2022;
originally announced February 2022.
-
Stochasticity in Neural ODEs: An Empirical Study
Authors:
Viktor Oganesyan,
Alexandra Volokhova,
Dmitry Vetrov
Abstract:
Stochastic regularization of neural networks (e.g. dropout) is a widespread technique in deep learning that allows for better generalization. Despite its success, continuous-time models, such as neural ordinary differential equations (ODEs), usually rely on a completely deterministic feed-forward operation. This work provides an empirical study of stochastically regularized neural ODEs on several image-classification tasks (CIFAR-10, CIFAR-100, TinyImageNet). Building upon the formalism of stochastic differential equations (SDEs), we demonstrate that a neural SDE is able to outperform its deterministic counterpart. Further, we show that data augmentation during training improves the performance of both deterministic and stochastic versions of the same model. However, the improvements obtained from data augmentation completely eliminate the empirical gains of the stochastic regularization, making the difference in performance between neural ODEs and neural SDEs negligible.
Submitted 26 June, 2020; v1 submitted 22 February, 2020;
originally announced February 2020.
-
Semi-Conditional Normalizing Flows for Semi-Supervised Learning
Authors:
Andrei Atanov,
Alexandra Volokhova,
Arsenii Ashukha,
Ivan Sosnovik,
Dmitry Vetrov
Abstract:
This paper proposes a semi-conditional normalizing flow model for semi-supervised learning. The model uses both labeled and unlabeled data to learn an explicit model of the joint distribution over objects and labels. The semi-conditional architecture of the model allows us to efficiently compute the value and gradients of the marginal likelihood for unlabeled objects. The conditional part of the model is based on a proposed conditional coupling layer. We demonstrate the performance of the model on semi-supervised classification problems across different datasets. The model outperforms the baseline approach based on variational auto-encoders on the MNIST dataset.
Submitted 22 June, 2020; v1 submitted 1 May, 2019;
originally announced May 2019.
-
Cherenkov Detectors Fast Simulation Using Neural Networks
Authors:
Denis Derkach,
Nikita Kazeev,
Fedor Ratnikov,
Andrey Ustyuzhanin,
Alexandra Volokhova
Abstract:
We propose a way to simulate the Cherenkov detector response using a generative adversarial neural network to bypass low-level details. This network is trained to reproduce high-level features of the simulated detector events based on input observables of incident particles. This allows a dramatic increase in simulation speed. We demonstrate that this approach provides simulation precision that is consistent with the baseline and discuss possible implications of these results.
Submitted 28 March, 2019;
originally announced March 2019.
-
Polaron Model of the Formation of Hydrated Electron States
Authors:
V. D. Lakhno,
A. V. Volokhova,
E. V. Zemlyanaya,
I. V. Amirkhanov,
I. V. Puzynin,
T. P. Puzynina
Abstract:
A computer simulation of the formation of photoexcited electrons in water is performed within the framework of a dynamic model. The obtained results are discussed in comparison with experimental data and theoretical estimates.
Submitted 30 January, 2015;
originally announced January 2015.