-
Guided Multi-objective Generative AI to Enhance Structure-based Drug Design
Authors:
Amit Kadan,
Kevin Ryczko,
Adrian Roitberg,
Takeshi Yamazaki
Abstract:
Generative AI has the potential to revolutionize drug discovery. Yet, despite recent advances in machine learning, existing models cannot generate molecules that satisfy all desired physicochemical properties. Herein, we describe IDOLpro, a novel generative chemistry AI combining deep diffusion with multi-objective optimization for structure-based drug design. The latent variables of the diffusion…
▽ More
Generative AI has the potential to revolutionize drug discovery. Yet, despite recent advances in machine learning, existing models cannot generate molecules that satisfy all desired physicochemical properties. Herein, we describe IDOLpro, a novel generative chemistry AI combining deep diffusion with multi-objective optimization for structure-based drug design. The latent variables of the diffusion model are guided by differentiable scoring functions to explore uncharted chemical space and generate novel ligands in silico, optimizing a plurality of target physicochemical properties. We demonstrate its effectiveness by generating ligands with optimized binding affinity and synthetic accessibility on two benchmark sets. IDOLpro produces ligands with binding affinities over 10% higher than the next best state-of-the-art on each test set. On a test set of experimental complexes, IDOLpro is the first to surpass the performance of experimentally observed ligands. IDOLpro can accommodate other scoring functions (e.g. ADME-Tox) to accelerate hit-finding, hit-to-lead, and lead optimization for drug discovery.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Optical properties and dynamics of direct and spatially and momentum indirect excitons in AlGaAs/AlAs quantum wells
Authors:
Dąbrówka Biegańska,
Maciej Pieczarka,
Krzysztof Ryczko,
Maciej Kubisa,
Sebastian Klembt,
Sven Höfling,
Christian Schneider,
Marcin Syperek
Abstract:
We present an experimental study on optical properties and dynamics of direct and spatially and momentum indirect excitons in AlGaAs/AlAs quantum wells near the crossover between $\varGamma$- and $X$-valley confined electron states. The time-integrated photoluminescence experiment at $T=$4.8 K revealed three simultaneously observed optical transitions resulting from (a) a direct exciton recombinat…
▽ More
We present an experimental study on optical properties and dynamics of direct and spatially and momentum indirect excitons in AlGaAs/AlAs quantum wells near the crossover between $\varGamma$- and $X$-valley confined electron states. The time-integrated photoluminescence experiment at $T=$4.8 K revealed three simultaneously observed optical transitions resulting from (a) a direct exciton recombination, involving an electron and a hole states both located in the $\varGamma$-valley in the quantum well layer, and (b) two spatially and momentum indirect excitons, comprising of the confined electron states in the $X$-valley in the AlAs barrier with different effective masses and quantum well holes in the $\varGamma$-valley. The measured spatial extent, density dependence and temperature dependence of the structure photoluminescence revealed characteristics necessary to pinpoint the states' nature and provided their characterization crucial in future device design. Temporal dynamics observed in the time-resolved photoluminescence measurement showed the complexity of the capture and recombination dynamics, largely affected by nonradiative processes, what is additionally critical in the system's use. This solid state platform hosting both direct and indirect excitons in a highly tunable monolithic system can benefit and underline the operation principles of novel electronic and photonic devices.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Accelerated Organic Crystal Structure Prediction with Genetic Algorithms and Machine Learning
Authors:
Amit Kadan,
Kevin Ryczko,
Andrew Wildman,
Rodrigo Wang,
Adrian Roitberg,
Takeshi Yamazaki
Abstract:
We present a high-throughput, end-to-end pipeline for organic crystal structure prediction (CSP) -- the problem of identifying the stable crystal structures that will form from a given molecule based only on its molecular composition. Our tool uses Neural Network Potentials (NNPs) to allow for efficient screening and structural relaxations of generated crystal candidates. Our pipeline consists of…
▽ More
We present a high-throughput, end-to-end pipeline for organic crystal structure prediction (CSP) -- the problem of identifying the stable crystal structures that will form from a given molecule based only on its molecular composition. Our tool uses Neural Network Potentials (NNPs) to allow for efficient screening and structural relaxations of generated crystal candidates. Our pipeline consists of two distinct stages -- random search, whereby crystal candidates are randomly generated and screened, and optimization, where a genetic algorithm (GA) optimizes this screened population. We assess the performance of each stage of our pipeline on 21 molecules taken from the Cambridge Crystallographic Data Centre's CSP blind tests. We show that random search alone yields matches for $\approx 50\%$ of targets. We then validate the potential of our full pipeline, making use of the GA to optimize the Root Mean-Squared Deviation (RMSD) between crystal candidates and the experimentally derived structure. With this approach, we are able to find matches for $\approx80\%$ of candidates with 10-100 times smaller initial population sizes than when using random search. Lastly, we run our full pipeline with an ANI model that is trained on a small dataset of molecules extracted from crystal structures in the Cambridge Structural Database, generating $\approx 60\%$ of targets. By leveraging ML models trained to predict energies at the DFT level, our pipeline has the potential to approach the accuracy of \emph{ab initio} methods and the efficiency of empirical force-fields.
△ Less
Submitted 11 December, 2023; v1 submitted 3 August, 2023;
originally announced August 2023.
-
Machine Learning Diffusion Monte Carlo Energies
Authors:
Kevin Ryczko,
Jaron T. Krogel,
Isaac Tamblyn
Abstract:
We present two machine learning methodologies that are capable of predicting diffusion Monte Carlo (DMC) energies with small datasets (~60 DMC calculations in total). The first uses voxel deep neural networks (VDNNs) to predict DMC energy densities using Kohn-Sham density functional theory (DFT) electron densities as input. The second uses kernel ridge regression (KRR) to predict atomic contributi…
▽ More
We present two machine learning methodologies that are capable of predicting diffusion Monte Carlo (DMC) energies with small datasets (~60 DMC calculations in total). The first uses voxel deep neural networks (VDNNs) to predict DMC energy densities using Kohn-Sham density functional theory (DFT) electron densities as input. The second uses kernel ridge regression (KRR) to predict atomic contributions to the DMC total energy using atomic environment vectors as input (we used atom centred symmetry functions, atomic environment vectors from the ANI models, and smooth overlap of atomic positions). We first compare the methodologies on pristine graphene lattices, where we find the KRR methodology performs best in comparison to gradient boosted decision trees, random forest, gaussian process regression, and multilayer perceptrons. In addition, KRR outperforms VDNNs by an order of magnitude. Afterwards, we study the generalizability of KRR to predict the energy barrier associated with a Stone-Wales defect. Lastly, we move from 2D to 3D materials and use KRR to predict total energies of liquid water. In all cases, we find that the KRR models are more accurate than Kohn-Sham DFT and all mean absolute errors are less than chemical accuracy.
△ Less
Submitted 5 October, 2022; v1 submitted 9 May, 2022;
originally announced May 2022.
-
Electronic Response Quantities of Solids and Deep Learning
Authors:
Kevin Ryczko,
Olivier Malenfant-Thuot,
Michel Côté,
Isaac Tamblyn
Abstract:
We introduce a deep neural network (DNN) framework called the \textbf{r}eal-space \textbf{a}tomic \textbf{d}ecomposition \textbf{net}work (\textsc{radnet}), which is capable of making accurate polarization and static dielectric function predictions for solids. We use these predictions to calculate Born-effective charges, longitudinal optical transverse optical (LO-TO) splitting frequencies, and Ra…
▽ More
We introduce a deep neural network (DNN) framework called the \textbf{r}eal-space \textbf{a}tomic \textbf{d}ecomposition \textbf{net}work (\textsc{radnet}), which is capable of making accurate polarization and static dielectric function predictions for solids. We use these predictions to calculate Born-effective charges, longitudinal optical transverse optical (LO-TO) splitting frequencies, and Raman tensors for two prototypical examples: GaAs and BN. We then compute the Raman spectra, and find excellent agreement with \textit{ab initio} techniques. \textsc{radnet} is as good or better than current methodologies. Lastly, we discuss how \textsc{radnet} scales to larger systems, paving the way for predictions of response functions on meso-scale structures with \textit{ab initio} accuracy.
△ Less
Submitted 17 August, 2021;
originally announced August 2021.
-
Toward Orbital-Free Density Functional Theory with Small Data Sets and Deep Learning
Authors:
Kevin Ryczko,
Sebastian J. Wetzel,
Roger G. Melko,
Isaac Tamblyn
Abstract:
We use voxel deep neural networks to predict energy densities and functional derivatives of electron kinetic energies for the Thomas-Fermi model and Kohn-Sham density functional theory calculations. We show that the ground-state electron density can be found via direct minimization for a graphene lattice without any projection scheme using a voxel deep neural network trained with the Thomas-Fermi…
▽ More
We use voxel deep neural networks to predict energy densities and functional derivatives of electron kinetic energies for the Thomas-Fermi model and Kohn-Sham density functional theory calculations. We show that the ground-state electron density can be found via direct minimization for a graphene lattice without any projection scheme using a voxel deep neural network trained with the Thomas-Fermi model. Additionally, we predict the kinetic energy of a graphene lattice within chemical accuracy after training from only 2 Kohn-Sham density functional theory (DFT) calculations. We identify an important sampling issue inherent in Kohn-Sham DFT calculations and propose future work to rectify this problem. Furthermore, we demonstrate an alternative, functional derivative-free, Monte Carlo based orbital free density functional theory algorithm to calculate an accurate 2-electron density in a double inverted Gaussian potential with a machine-learned kinetic energy functional.
△ Less
Submitted 20 January, 2022; v1 submitted 12 April, 2021;
originally announced April 2021.
-
Neural evolution structure generation: High Entropy Alloys
Authors:
Conrard Giresse Tetsassi Feugmo,
Kevin Ryczko,
Abu Anand,
Chandra Veer Singh,
Isaac Tamblyn
Abstract:
We propose a method of neural evolution structures (NESs) combining artificial neural networks (ANNs) and evolutionary algorithms (EAs) to generate High Entropy Alloys (HEAs) structures. Our inverse design approach is based on pair distribution functions and atomic properties and allows one to train a model on smaller unit cells and then generate a larger cell. With a speed-up factor of approximat…
▽ More
We propose a method of neural evolution structures (NESs) combining artificial neural networks (ANNs) and evolutionary algorithms (EAs) to generate High Entropy Alloys (HEAs) structures. Our inverse design approach is based on pair distribution functions and atomic properties and allows one to train a model on smaller unit cells and then generate a larger cell. With a speed-up factor of approximately 1000 with respect to the SQSs, the NESs dramatically reduces computational costs and time, making possible the generation of very large structures (over 40,000 atoms) in few hours. Additionally, unlike the SQSs, the same model can be used to generate multiple structures with the same fractional composition.
△ Less
Submitted 1 March, 2021;
originally announced March 2021.
-
Twin Neural Network Regression
Authors:
Sebastian J. Wetzel,
Kevin Ryczko,
Roger G. Melko,
Isaac Tamblyn
Abstract:
We introduce twin neural network (TNN) regression. This method predicts differences between the target values of two different data points rather than the targets themselves. The solution of a traditional regression problem is then obtained by averaging over an ensemble of all predicted differences between the targets of an unseen data point and all training data points. Whereas ensembles are norm…
▽ More
We introduce twin neural network (TNN) regression. This method predicts differences between the target values of two different data points rather than the targets themselves. The solution of a traditional regression problem is then obtained by averaging over an ensemble of all predicted differences between the targets of an unseen data point and all training data points. Whereas ensembles are normally costly to produce, TNN regression intrinsically creates an ensemble of predictions of twice the size of the training set while only training a single neural network. Since ensembles have been shown to be more accurate than single models this property naturally transfers to TNN regression. We show that TNNs are able to compete or yield more accurate predictions for different data sets, compared to other state-of-the-art methods. Furthermore, TNN regression is constrained by self-consistency conditions. We find that the violation of these conditions provides an estimate for the prediction uncertainty.
△ Less
Submitted 29 December, 2020;
originally announced December 2020.
-
Inverse Design of a Graphene-Based Quantum Transducer via Neuroevolution
Authors:
Kevin Ryczko,
Pierre Darancet,
Isaac Tamblyn
Abstract:
We introduce an inverse design framework based on artificial neural networks, genetic algorithms, and tight-binding calculations, capable to optimize the very large configuration space of nanoelectronic devices. Our non-linear optimization procedure operates on trial Hamiltonians through superoperators controlling growth policies of regions of distinct do**. We demonstrate that our algorithm opt…
▽ More
We introduce an inverse design framework based on artificial neural networks, genetic algorithms, and tight-binding calculations, capable to optimize the very large configuration space of nanoelectronic devices. Our non-linear optimization procedure operates on trial Hamiltonians through superoperators controlling growth policies of regions of distinct do**. We demonstrate that our algorithm optimizes the do** of graphene-based three-terminal devices for valleytronics applications, monotonously converging to synthesizable devices with high merit functions in a few thousand evaluations (out of $\simeq 2^{3800}$ possible configurations). The best-performing device allowed for a terminal-specific separation of valley currents with $\simeq 96$\% ($\simeq 94\%)$ $K$ ($K'$) valley purity. Importantly, the devices found through our non-linear optimization procedure have both higher merit function and higher robustness to defects than the ones obtained through geometry optimization.
△ Less
Submitted 24 February, 2021; v1 submitted 14 July, 2020;
originally announced July 2020.
-
Deep Learning and Density Functional Theory
Authors:
Kevin Ryczko,
David Strubbe,
Isaac Tamblyn
Abstract:
We show that deep neural networks can be integrated into, or fully replace, the Kohn-Sham density functional theory scheme for multi-electron systems in simple harmonic oscillator and random external potentials with no feature engineering. We first show that self-consistent charge densities calculated with different exchange-correlation functionals can be used as input to an extensive deep neural…
▽ More
We show that deep neural networks can be integrated into, or fully replace, the Kohn-Sham density functional theory scheme for multi-electron systems in simple harmonic oscillator and random external potentials with no feature engineering. We first show that self-consistent charge densities calculated with different exchange-correlation functionals can be used as input to an extensive deep neural network to make predictions for correlation, exchange, external, kinetic and total energies simultaneously. Additionally, we show that one can also make all of the same predictions with the external potential rather than the self-consistent charge density, which allows one to circumvent the Kohn-Sham scheme altogether. We then show that a self-consistent charge density found from a non-local exchange-correlation functional can be used to make energy predictions for a semi-local exchange-correlation functional. Lastly, we use a deep convolutional inverse graphics network to predict the charge density given an external potential for different exchange-correlation functionals and assess the viability of the predicted charge densities. This work shows that extensive deep neural networks are generalizable and transferable given the variability of the potentials (maximum total energy range $\approx100$ Ha), because they require no feature engineering, and because they can scale to an arbitrary system size with an $\mathcal{O}(N)$ computational cost.
△ Less
Submitted 24 February, 2021; v1 submitted 21 November, 2018;
originally announced November 2018.
-
Extensive deep neural networks for transferring small scale learning to large scale systems
Authors:
Kyle Mills,
Kevin Ryczko,
Iryna Luchak,
Adam Domurad,
Chris Beeler,
Isaac Tamblyn
Abstract:
We present a physically-motivated topology of a deep neural network that can efficiently infer extensive parameters (such as energy, entropy, or number of particles) of arbitrarily large systems, doing so with O(N) scaling. We use a form of domain decomposition for training and inference, where each sub-domain (tile) is comprised of a non-overlap** focus region surrounded by an overlap** conte…
▽ More
We present a physically-motivated topology of a deep neural network that can efficiently infer extensive parameters (such as energy, entropy, or number of particles) of arbitrarily large systems, doing so with O(N) scaling. We use a form of domain decomposition for training and inference, where each sub-domain (tile) is comprised of a non-overlap** focus region surrounded by an overlap** context region. The size of these regions is motivated by the physical interaction length scales of the problem. We demonstrate the application of EDNNs to three physical systems: the Ising model and two hexagonal/graphene-like datasets. In the latter, an EDNN was able to make total energy predictions of a 60 atoms system, with comparable accuracy to density functional theory (DFT), in 57 milliseconds. Additionally EDNNs are well suited for massively parallel evaluation, as no communication is necessary during neural network evaluation. We demonstrate that EDNNs can be used to make an energy prediction of a two-dimensional 35.2 million atom system, over 1 square micrometer of material, at an accuracy comparable to DFT, in under 25 minutes. Such a system exists on a length scale visible with optical microscopy and larger than some living organisms.
△ Less
Submitted 11 April, 2019; v1 submitted 17 August, 2017;
originally announced August 2017.
-
Convolutional neural networks for atomistic systems
Authors:
Kevin Ryczko,
Kyle Mills,
Iryna Luchak,
Christa Homenick,
Isaac Tamblyn
Abstract:
We introduce a new method, called CNNAS (convolutional neural networks for atomistic systems), for calculating the total energy of atomic systems which rivals the computational cost of empirical potentials while maintaining the accuracy of \emph{ab initio} calculations. This method uses deep convolutional neural networks (CNNs), where the input to these networks are simple representations of the a…
▽ More
We introduce a new method, called CNNAS (convolutional neural networks for atomistic systems), for calculating the total energy of atomic systems which rivals the computational cost of empirical potentials while maintaining the accuracy of \emph{ab initio} calculations. This method uses deep convolutional neural networks (CNNs), where the input to these networks are simple representations of the atomic structure. We use this approach to predict energies obtained using density functional theory (DFT) for 2D hexagonal lattices of various types. Using a dataset consisting of graphene, hexagonal boron nitride (hBN), and graphene-hBN heterostructures, with and without defects, we trained a deep CNN that is capable of predicting DFT energies to an extremely high accuracy, with a mean absolute error (MAE) of 0.198 meV / atom (maximum absolute error of 16.1 meV / atom). To explore our new methodology, we investigate the ability of a deep neural network (DNN) in predicting a Lennard-Jones energy and separation distance for a dataset of dimer molecules in both two and three dimensions. In addition, we systematically investigate the flexibility of the deep learning models by performing interpolation and extrapolation tests.
△ Less
Submitted 20 March, 2018; v1 submitted 28 June, 2017;
originally announced June 2017.
-
Structural characterizations of water-metal interfaces
Authors:
Kevin Ryczko,
Isaac Tamblyn
Abstract:
We analyze and compare the structural, dynamical, and electronic properties of liquid water next to prototypical metals including Pt, graphite, and graphene. Our results are built on Born-Oppenheimer molecular dynamics (BOMD) generated using density functional theory (DFT) which explicitly include van der Waals (vdW) interactions within a first principles approach. All calculations reported use la…
▽ More
We analyze and compare the structural, dynamical, and electronic properties of liquid water next to prototypical metals including Pt, graphite, and graphene. Our results are built on Born-Oppenheimer molecular dynamics (BOMD) generated using density functional theory (DFT) which explicitly include van der Waals (vdW) interactions within a first principles approach. All calculations reported use large simulation cells, allowing for an accurate treatment of the water-electrode interfaces. We have included vdW interactions through the use of the optB86b-vdW exchange correlation functional. Comparisons with the Perdew-Burke-Ernzerhof (PBE) exchange correlation functional are also shown. We find an initial peak, due to chemisorption, in the density profile of the liquid water-Pt interface not seen in the liquid water-graphite interface, liquid water-graphene interface, nor interfaces studied previously. To further investigate this chemisorption peak, we also report differences in the electronic structure of single water molecules on both Pt and graphite surfaces. We find that a covalent bond forms between the single water molecule and the Pt surface, but not between the single water molecule and the graphite surface. We also discuss the effects that defects and dopants in the graphite and graphene surfaces have on the structure and dynamics of liquid water.
△ Less
Submitted 24 June, 2017; v1 submitted 7 November, 2016;
originally announced November 2016.
-
Hashkat: Large-scale simulations of online social networks
Authors:
Kevin Ryczko,
Adam Domurad,
Nicholas Buhagiar,
Isaac Tamblyn
Abstract:
Hashkat (http://hashkat.org) is a free, open source, agent based simulation software package designed to simulate large-scale online social networks (e.g. Twitter, Facebook, LinkedIn, etc). It allows for dynamic agent generation, edge creation, and information propagation. The purpose of hashkat is to study the growth of online social networks and how information flows within them. Like real life…
▽ More
Hashkat (http://hashkat.org) is a free, open source, agent based simulation software package designed to simulate large-scale online social networks (e.g. Twitter, Facebook, LinkedIn, etc). It allows for dynamic agent generation, edge creation, and information propagation. The purpose of hashkat is to study the growth of online social networks and how information flows within them. Like real life online social networks, hashkat incorporates user relationships, information diffusion, and trending topics. Hashkat was implemented in C++, and was designed with extensibility in mind. The software includes Shell and Python scripts for easy installation and usability. In this report, we describe all of the algorithms and features integrated into hashkat before moving on to example use cases. In general, hashkat can be used to understand the underlying topology of social networks, validate sampling methods of such networks, develop business strategy for advertising on online social networks, and test new features of an online social network before going into production.
△ Less
Submitted 24 October, 2016;
originally announced October 2016.
-
Conduction electrons localized by charged magneto-acceptors A$^{2-}$ in GaAs/GaAlAs quantum wells
Authors:
M. Kubisa,
K. Ryczko,
I. Bisotto,
C. Chaubet,
A. Raymond,
W. Zawadzki
Abstract:
A variational theory is presented of A$^{1-}$ and A$^{2-}$ centers, i.e. of a negative acceptor ion localizing one and two conduction electrons, respectively, in a GaAs/GaAlAs quantum well in the presence of a magnetic field parallel to the growth direction. A combined effect of the well and magnetic field confines conduction electrons to the proximity of the ion, resulting in discrete repulsive e…
▽ More
A variational theory is presented of A$^{1-}$ and A$^{2-}$ centers, i.e. of a negative acceptor ion localizing one and two conduction electrons, respectively, in a GaAs/GaAlAs quantum well in the presence of a magnetic field parallel to the growth direction. A combined effect of the well and magnetic field confines conduction electrons to the proximity of the ion, resulting in discrete repulsive energies above the corresponding Landau levels. The theory is motivated by our experimental magneto-transport results which indicate that, in a heterostructure doped in the GaAs well with Be acceptors, one observes a boil-off effect in which the conduction electrons in the crossed-field configuration are pushed by the Hall electric field from the delocalized Landau states to the localized acceptor states and cease to conduct. A detailed analysis of the transport data shows that, at high magnetic fields, there are almost no conducting electrons left in the sample. It is concluded that one negative acceptor ion localizes up to four conduction electrons.
△ Less
Submitted 9 April, 2015;
originally announced April 2015.
-
Photoluminescence investigations of 2D hole Landau levels in p-type single Al_{x}Ga_{1-x}As/GaAs heterostructures
Authors:
M. Kubisa,
L. Bryja,
K. Ryczko,
J. Misiewicz,
C. Bardot,
M. Potemski,
G. Ortner,
M. Bayer,
A. Forchel,
C. B. Sorensen
Abstract:
We study the energy structure of two-dimensional holes in p-type single Al_{1-x}Ga_{x}As/GaAs heterojunctions under a perpendicular magnetic field. Photoluminescence measurments with low densities of excitation power reveal rich spectra containing both free and bound-carrier transitions. The experimental results are compared with energies of valence-subband Landau levels calculated using a new n…
▽ More
We study the energy structure of two-dimensional holes in p-type single Al_{1-x}Ga_{x}As/GaAs heterojunctions under a perpendicular magnetic field. Photoluminescence measurments with low densities of excitation power reveal rich spectra containing both free and bound-carrier transitions. The experimental results are compared with energies of valence-subband Landau levels calculated using a new numerical procedure and a good agreement is achieved. Additional lines observed in the energy range of free-carrier recombinations are attributed to excitonic transitions. We also consider the role of many-body effects in photoluminescence spectra.
△ Less
Submitted 26 November, 2002;
originally announced November 2002.