Search | arXiv e-print repository

Pushing the Pareto front of band gap and permittivity: ML-guided search for dielectric materials

Authors: Janosh Riebesell, T. Wesley Surta, Rhys Goodall, Michael Gaultois, Alpha A Lee

Abstract: Materials with high-dielectric constant easily polarize under external electric fields, allowing them to perform essential functions in many modern electronic devices. Their practical utility is determined by two conflicting properties: high dielectric constants tend to occur in materials with narrow band gaps, limiting the operating voltage before dielectric breakdown. We present a high-throughpu… ▽ More Materials with high-dielectric constant easily polarize under external electric fields, allowing them to perform essential functions in many modern electronic devices. Their practical utility is determined by two conflicting properties: high dielectric constants tend to occur in materials with narrow band gaps, limiting the operating voltage before dielectric breakdown. We present a high-throughput workflow that combines element substitution, ML pre-screening, ab initio simulation and human expert intuition to efficiently explore the vast space of unknown materials for potential dielectrics, leading to the synthesis and characterization of two novel dielectric materials, CsTaTeO6 and Bi2Zr2O7. Our key idea is to deploy ML in a multi-objective optimization setting with concave Pareto front. While usually considered more challenging than single-objective optimization, we argue and show preliminary evidence that the $1/x$-correlation between band gap and permittivity in fact makes the task more amenable to ML methods by allowing separate models for band gap and permittivity to each operate in regions of good training support while still predicting materials of exceptional merit. To our knowledge, this is the first instance of successful ML-guided multi-objective materials optimization achieving experimental synthesis and characterization. CsTaTeO6 is a structure generated via element substitution not present in our reference data sources, thus exemplifying successful de-novo materials design. Meanwhile, we report the first high-purity synthesis and dielectric characterization of Bi2Zr2O7 with a band gap of 2.27 eV and a permittivity of 20.5, meeting all target metrics of our multi-objective search. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 27 pages, 11 figures, 5 authors

arXiv:2308.14920 [pdf, other]

Matbench Discovery -- A framework to evaluate machine learning crystal stability predictions

Authors: Janosh Riebesell, Rhys E. A. Goodall, Philipp Benner, Yuan Chiang, Bowen Deng, Alpha A. Lee, Anubhav Jain, Kristin A. Persson

Abstract: Matbench Discovery simulates the deployment of machine learning (ML) energy models in a high-throughput search for stable inorganic crystals. We address the disconnect between (i) thermodynamic stability and formation energy and (ii) in-domain vs out-of-distribution performance. Alongside this paper, we publish a Python package to aid with future model submissions and a growing online leaderboard… ▽ More Matbench Discovery simulates the deployment of machine learning (ML) energy models in a high-throughput search for stable inorganic crystals. We address the disconnect between (i) thermodynamic stability and formation energy and (ii) in-domain vs out-of-distribution performance. Alongside this paper, we publish a Python package to aid with future model submissions and a growing online leaderboard with further insights into trade-offs between various performance metrics. To answer the question which ML methodology performs best at materials discovery, our initial release explores a variety of models including random forests, graph neural networks (GNN), one-shot predictors, iterative Bayesian optimizers and universal interatomic potentials (UIP). Ranked best-to-worst by their test set F1 score on thermodynamic stability prediction, we find CHGNet > M3GNet > MACE > ALIGNN > MEGNet > CGCNN > CGCNN+P > Wrenformer > BOWSR > Voronoi tessellation fingerprints with random forest. The top 3 models are UIPs, the winning methodology for ML-guided materials discovery, achieving F1 scores of ~0.6 for crystal stability classification and discovery acceleration factors (DAF) of up to 5x on the first 10k most stable predictions compared to dummy selection from our test set. We also highlight a sharp disconnect between commonly used global regression metrics and more task-relevant classification metrics. Accurate regressors are susceptible to unexpectedly high false-positive rates if those accurate predictions lie close to the decision boundary at 0 eV/atom above the convex hull where most materials are. Our results highlight the need to focus on classification metrics that actually correlate with improved stability hit rate. △ Less

Submitted 4 February, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

Comments: 31 pages, 18 figures, 4 tables

arXiv:2212.04450 [pdf, other]

GAUCHE: A Library for Gaussian Processes in Chemistry

Authors: Ryan-Rhys Griffiths, Leo Klarner, Henry B. Moss, Aditya Ravuri, Sang Truong, Samuel Stanton, Gary Tom, Bojana Rankovic, Yuanqi Du, Arian Jamasb, Aryan Deshwal, Julius Schwartz, Austin Tripp, Gregory Kell, Simon Frieder, Anthony Bourached, Alex Chan, Jacob Moss, Chengzhi Guo, Johannes Durholt, Saudamini Chaurasia, Felix Strieth-Kalthoff, Alpha A. Lee, Bingqing Cheng, Alán Aspuru-Guzik , et al. (2 additional authors not shown)

Abstract: We introduce GAUCHE, a library for GAUssian processes in CHEmistry. Gaussian processes have long been a cornerstone of probabilistic machine learning, affording particular advantages for uncertainty quantification and Bayesian optimisation. Extending Gaussian processes to chemical representations, however, is nontrivial, necessitating kernels defined over structured inputs such as graphs, strings… ▽ More We introduce GAUCHE, a library for GAUssian processes in CHEmistry. Gaussian processes have long been a cornerstone of probabilistic machine learning, affording particular advantages for uncertainty quantification and Bayesian optimisation. Extending Gaussian processes to chemical representations, however, is nontrivial, necessitating kernels defined over structured inputs such as graphs, strings and bit vectors. By defining such kernels in GAUCHE, we seek to open the door to powerful tools for uncertainty quantification and Bayesian optimisation in chemistry. Motivated by scenarios frequently encountered in experimental chemistry, we showcase applications for GAUCHE in molecular discovery and chemical reaction optimisation. The codebase is made available at https://github.com/leojklarner/gauche △ Less

Submitted 21 February, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

arXiv:2208.03182 [pdf, other]

Inferring global dynamics from local structure in liquid electrolytes

Authors: Penelope K. Jones, Kara D. Fong, Kristin A. Persson, Alpha A. Lee

Abstract: Ion transport in concentrated electrolytes plays a fundamental role in electrochemical systems such as lithium ion batteries. Nonetheless, the mechanism of transport amid strong ion-ion interactions remains enigmatic. A key question is whether the dynamics of ion transport can be predicted by the local static structure alone, and if so what are the key structural motifs that determine transport. I… ▽ More Ion transport in concentrated electrolytes plays a fundamental role in electrochemical systems such as lithium ion batteries. Nonetheless, the mechanism of transport amid strong ion-ion interactions remains enigmatic. A key question is whether the dynamics of ion transport can be predicted by the local static structure alone, and if so what are the key structural motifs that determine transport. In this paper, we show that machine learning can successfully decompose global conductivity into the spatio-temporal average of local, instantaneous ionic contributions, and relate this ``local molar conductivity" field to the local ionic environment. Our machine learning model accurately predicts the molar conductivity of electrolyte systems that were not part of the training set, suggesting that the dynamics of ion transport is predictable from local static structure. Further, through analysing this machine-learned local conductivity field, we observe that fluctuations in local conductivity at high concentration are negatively correlated with total molar conductivity. Surprisingly, these fluctuations arise due to a long tail distribution of low conductivity ions, rather than distinct ion pairs, and are spatially correlated through both like- and unlike-charge interactions. More broadly, our approach shows how machine learning can aid the understanding of complex soft matter systems, by learning a function that attributes global collective properties to local, atomistic contributions. △ Less

Submitted 5 August, 2022; originally announced August 2022.

Comments: 20 pages, 8 figures

arXiv:2106.11132 [pdf, other]

Rapid Discovery of Stable Materials by Coordinate-free Coarse Graining

Authors: Rhys E. A. Goodall, Abhijith S. Parackal, Felix A. Faber, Rickard Armiento, Alpha A. Lee

Abstract: A fundamental challenge in materials science pertains to elucidating the relationship between stoichiometry, stability, structure, and property. Recent advances have shown that machine learning can be used to learn such relationships, allowing the stability and functional properties of materials to be accurately predicted. However, most of these approaches use atomic coordinates as input and are t… ▽ More A fundamental challenge in materials science pertains to elucidating the relationship between stoichiometry, stability, structure, and property. Recent advances have shown that machine learning can be used to learn such relationships, allowing the stability and functional properties of materials to be accurately predicted. However, most of these approaches use atomic coordinates as input and are thus bottle-necked by crystal structure identification when investigating novel materials. Our approach solves this bottleneck by coarse-graining the infinite search space of atomic coordinates into a combinatorially enumerable search space. The key idea is to use Wyckoff representations -- coordinate-free sets of symmetry-related positions in a crystal -- as the input to a machine learning model. Our model demonstrates exceptionally high precision in discovering new theoretically stable materials, identifying 1,569 materials that lie below the known convex hull of previously calculated materials from just 5,675 ab-initio calculations. Our approach opens up fundamental advances in computational materials discovery. △ Less

Submitted 15 March, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

Comments: Code Available: https://github.com/CompRhys/aviary

arXiv:2105.02637 [pdf, other]

Dataset Bias in the Natural Sciences: A Case Study in Chemical Reaction Prediction and Synthesis Design

Authors: Ryan-Rhys Griffiths, Philippe Schwaller, Alpha A. Lee

Abstract: Datasets in the Natural Sciences are often curated with the goal of aiding scientific understanding and hence may not always be in a form that facilitates the application of machine learning. In this paper, we identify three trends within the fields of chemical reaction prediction and synthesis design that require a change in direction. First, the manner in which reaction datasets are split into r… ▽ More Datasets in the Natural Sciences are often curated with the goal of aiding scientific understanding and hence may not always be in a form that facilitates the application of machine learning. In this paper, we identify three trends within the fields of chemical reaction prediction and synthesis design that require a change in direction. First, the manner in which reaction datasets are split into reactants and reagents encourages testing models in an unrealistically generous manner. Second, we highlight the prevalence of mislabelled data, and suggest that the focus should be on outlier removal rather than data fitting only. Lastly, we discuss the problem of reagent prediction, in addition to reactant prediction, in order to solve the full synthesis design problem, highlighting the mismatch between what machine learning solves and what a lab chemist would need. Our critiques are also relevant to the burgeoning field of using machine learning to accelerate progress in experimental Natural Sciences, where datasets are often split in a biased way, are highly noisy, and contextual variables that are not evident from the data strongly influence the outcome of experiments. △ Less

Submitted 6 May, 2021; originally announced May 2021.

Comments: Presented at the 2018 NeurIPS Workshop on Machine Learning for Molecules and Materials

arXiv:2103.06838 [pdf, other]

doi 10.3847/1538-4357/abfa9f

Modelling the Multiwavelength Variability of Mrk 335 using Gaussian Processes

Authors: Ryan-Rhys Griffiths, Jiachen Jiang, Douglas J. K. Buisson, Dan R. Wilkins, Luigi C. Gallo, Adam Ingram, Alpha A. Lee, Dirk Grupe, Erin Kara, Michael L. Parker, William Alston, Anthony Bourached, George Cann, Andrew Young, S. Komossa

Abstract: The optical and UV variability of the majority of AGN may be related to the reprocessing of rapidly-changing X-ray emission from a more compact region near the central black hole. Such a reprocessing model would be characterised by lags between X-ray and optical/UV emission due to differences in light travel time. Observationally however, such lag features have been difficult to detect due to gaps… ▽ More The optical and UV variability of the majority of AGN may be related to the reprocessing of rapidly-changing X-ray emission from a more compact region near the central black hole. Such a reprocessing model would be characterised by lags between X-ray and optical/UV emission due to differences in light travel time. Observationally however, such lag features have been difficult to detect due to gaps in the lightcurves introduced through factors such as source visibility or limited telescope time. In this work, Gaussian process regression is employed to interpolate the gaps in the Swift X-ray and UV lightcurves of the narrow-line Seyfert 1 galaxy Mrk 335. In a simulation study of five commonly-employed analytic Gaussian process kernels, we conclude that the Matern 1/2 and rational quadratic kernels yield the most well-specified models for the X-ray and UVW2 bands of Mrk 335. In analysing the structure functions of the Gaussian process lightcurves, we obtain a broken power law with a break point at 125 days in the UVW2 band. In the X-ray band, the structure function of the Gaussian process lightcurve is consistent with a power law in the case of the rational quadratic kernel whilst a broken power law with a breakpoint at 66 days is obtained from the Matern 1/2 kernel. The subsequent cross-correlation analysis is consistent with previous studies and furthermore, shows tentative evidence for a broad X-ray-UV lag feature of up to 30 days in the lag-frequency spectrum where the significance of the lag depends on the choice of Gaussian process kernel. △ Less

Submitted 29 June, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

Comments: 24 pages, 9 figures, 2 tables. Accepted for publication in ApJ. Code available at https://github.com/Ryan-Rhys/Mrk_335

arXiv:2012.10694 [pdf, other]

doi 10.1063/5.0039617

Bayesian unsupervised learning reveals hidden structure in concentrated electrolytes

Authors: Penelope Jones, Fabian Coupette, Andreas Härtel, Alpha A. Lee

Abstract: Electrolytes play an important role in a plethora of applications ranging from energy storage to biomaterials. Notwithstanding this, the structure of concentrated electrolytes remains enigmatic. Many theoretical approaches attempt to model the concentrated electrolytes by introducing the idea of ion pairs, with ions either being tightly `paired' with a counter-ion, or `free' to screen charge. In t… ▽ More Electrolytes play an important role in a plethora of applications ranging from energy storage to biomaterials. Notwithstanding this, the structure of concentrated electrolytes remains enigmatic. Many theoretical approaches attempt to model the concentrated electrolytes by introducing the idea of ion pairs, with ions either being tightly `paired' with a counter-ion, or `free' to screen charge. In this study we reframe the problem into the language of computational statistics, and test the null hypothesis that all ions share the same local environment. Applying the framework to molecular dynamics simulations, we show that this null hypothesis is not supported by data. Our statistical technique suggests the presence of distinct local ionic environments; surprisingly, these differences arise in like charge correlations rather than unlike charge attraction. The resulting fraction of particles in non-aggregated environments shows a universal scaling behaviour across different background dielectric constants and ionic concentrations. △ Less

Submitted 19 December, 2020; originally announced December 2020.

Comments: 15 pages, 4 figures

arXiv:2010.12857 [pdf, other]

Investigating 3D Atomic Environments for Enhanced QSAR

Authors: William McCorkindale, Carl Poelking, Alpha A. Lee

Abstract: Predicting bioactivity and physical properties of molecules is a longstanding challenge in drug design. Most approaches use molecular descriptors based on a 2D representation of molecules as a graph of atoms and bonds, abstracting away the molecular shape. A difficulty in accounting for 3D shape is in designing molecular descriptors can precisely capture molecular shape while remaining invariant t… ▽ More Predicting bioactivity and physical properties of molecules is a longstanding challenge in drug design. Most approaches use molecular descriptors based on a 2D representation of molecules as a graph of atoms and bonds, abstracting away the molecular shape. A difficulty in accounting for 3D shape is in designing molecular descriptors can precisely capture molecular shape while remaining invariant to rotations/translations. We describe a novel alignment-free 3D QSAR method using Smooth Overlap of Atomic Positions (SOAP), a well-established formalism developed for interpolating potential energy surfaces. We show that this approach rigorously describes local 3D atomic environments to compare molecular shapes in a principled manner. This method performs competitively with traditional fingerprint-based approaches as well as state-of-the-art graph neural networks on pIC$_{50}$ ligand-binding prediction in both random and scaffold split scenarios. We illustrate the utility of SOAP descriptors by showing that its inclusion in ensembling diverse representations statistically improves performance, demonstrating that incorporating 3D atomic environments could lead to enhanced QSAR for cheminformatics. △ Less

Submitted 24 October, 2020; originally announced October 2020.

arXiv:2008.03226

Data-Driven Discovery of Molecular Photoswitches with Multioutput Gaussian Processes

Authors: Ryan-Rhys Griffiths, Jake L. Greenfield, Aditya R. Thawani, Arian R. Jamasb, Henry B. Moss, Anthony Bourached, Penelope Jones, William McCorkindale, Alexander A. Aldrick, Matthew J. Fuchter Alpha A. Lee

Abstract: Photoswitchable molecules display two or more isomeric forms that may be accessed using light. Separating the electronic absorption bands of these isomers is key to selectively addressing a specific isomer and achieving high photostationary states whilst overall red-shifting the absorption bands serves to limit material damage due to UV-exposure and increases penetration depth in photopharmacologi… ▽ More Photoswitchable molecules display two or more isomeric forms that may be accessed using light. Separating the electronic absorption bands of these isomers is key to selectively addressing a specific isomer and achieving high photostationary states whilst overall red-shifting the absorption bands serves to limit material damage due to UV-exposure and increases penetration depth in photopharmacological applications. Engineering these properties into a system through synthetic design however, remains a challenge. Here, we present a data-driven discovery pipeline for molecular photoswitches underpinned by dataset curation and multitask learning with Gaussian processes. In the prediction of electronic transition wavelengths, we demonstrate that a multioutput Gaussian process (MOGP) trained using labels from four photoswitch transition wavelengths yields the strongest predictive performance relative to single-task models as well as operationally outperforming time-dependent density functional theory (TD-DFT) in terms of the wall-clock time for prediction. We validate our proposed approach experimentally by screening a library of commercially available photoswitchable molecules. Through this screen, we identified several motifs that displayed separated electronic absorption bands of their isomers, exhibited red-shifted absorptions, and are suited for information transfer and photopharmacological applications. Our curated dataset, code, as well as all models are made available at https://github.com/Ryan-Rhys/The-Photoswitch-Dataset △ Less

Submitted 7 August, 2022; v1 submitted 28 June, 2020; originally announced August 2020.

Comments: Authors still in discussion about authorship ordering

arXiv:2007.15752 [pdf, other]

doi 10.1021/acs.chemmater.0c03885

Materials Graph Transformer predicts the outcomes of inorganic reactions with reliable uncertainties

Authors: Shreshth A. Malik, Rhys E. A. Goodall, Alpha A. Lee

Abstract: A common bottleneck for materials discovery is synthesis. While recent methodological advances have resulted in major improvements in the ability to predicatively design novel materials, researchers often still rely on trial-and-error approaches for determining synthesis procedures. In this work, we develop a model that predicts the major product of solid-state reactions. The cardinal feature of t… ▽ More A common bottleneck for materials discovery is synthesis. While recent methodological advances have resulted in major improvements in the ability to predicatively design novel materials, researchers often still rely on trial-and-error approaches for determining synthesis procedures. In this work, we develop a model that predicts the major product of solid-state reactions. The cardinal feature of this approach is the construction of fixed-length, learned representations of reactions. Precursors are represented as nodes on a `reaction graph', and message-passing operations between nodes are used to embody the interactions between precursors in the reaction mixture. Through an ablation study, it is shown that this framework not only outperforms less physically-motivated baseline methods but also more reliably assesses the uncertainty in its predictions. △ Less

Submitted 3 September, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

Journal ref: Chemistry of Materials 2021 33 (2), 616-624

arXiv:2004.09114 [pdf, other]

Machine learnt approximations to the bridge function yield improved closures for the Ornstein-Zernike equation

Authors: Rhys E. A. Goodall, Alpha A. Lee

Abstract: A key challenge for soft materials design and coarse-graining simulations is determining interaction potentials between components that give rise to desired condensed-phase structures. In theory, the Ornstein-Zernike equation provides an elegant framework for solving this inverse problem. Pioneering work in liquid state theory derived analytical closures for the framework. However, these analytica… ▽ More A key challenge for soft materials design and coarse-graining simulations is determining interaction potentials between components that give rise to desired condensed-phase structures. In theory, the Ornstein-Zernike equation provides an elegant framework for solving this inverse problem. Pioneering work in liquid state theory derived analytical closures for the framework. However, these analytical closures are approximations, valid only for specific classes of interaction potentials. In this work, we combine the physics of liquid state theory with machine learning to infer a closure directly from simulation data. The resulting closure is more accurate than commonly used closures across a broad range of interaction potentials. We show for two examples of a prototypical inverse design problem, fitting a coarse-grained simulation potential, that our approach leads to improved one-step inversion. △ Less

Submitted 19 February, 2021; v1 submitted 20 April, 2020; originally announced April 2020.

arXiv:2004.00925 [pdf, other]

Materials Informatics Reveals Unexplored Structure Space in Cuprate Superconductors

Authors: Rhys E. A. Goodall, Bonan Zhu, Judith L. MacManus-Driscoll, Alpha A. Lee

Abstract: High-temperature superconducting cuprates have the potential to be transformative in a wide range of energy applications. In this work we analyse the corpus of historical data about cuprates using materials informatics and re-examine how their structures are related to their critical temperatures (Tc). The available data is highly clustered and no single database contains all the features of inter… ▽ More High-temperature superconducting cuprates have the potential to be transformative in a wide range of energy applications. In this work we analyse the corpus of historical data about cuprates using materials informatics and re-examine how their structures are related to their critical temperatures (Tc). The available data is highly clustered and no single database contains all the features of interest to properly examine trends. To work around these issues we employ a linear calibration approach that allows us to utilise multiple data sources -- combining fine resolution data for which the Tc is unknown with coarse resolution data where it is known. The hybrid data set constructed enables us to explore the trends in Tc with the apical and in-plane copper-oxygen distances. We show that large regions of the materials space have yet to be explored and highlight how novel experiments relying on nano-engineering of the crystal structure may enable us to explore such new regions. Based on the trends identified we propose that single layer Bi-based cuprates are good candidate systems for such experiments. △ Less

Submitted 7 September, 2021; v1 submitted 2 April, 2020; originally announced April 2020.

arXiv:1910.07779 [pdf, ps, other]

doi 10.1088/2632-2153/ac298c

Achieving Robustness to Aleatoric Uncertainty with Heteroscedastic Bayesian Optimisation

Authors: Ryan-Rhys Griffiths, Alexander A. Aldrick, Miguel Garcia-Ortegon, Vidhi R. Lalchand, Alpha A. Lee

Abstract: Bayesian optimisation is a sample-efficient search methodology that holds great promise for accelerating drug and materials discovery programs. A frequently-overlooked modelling consideration in Bayesian optimisation strategies however, is the representation of heteroscedastic aleatoric uncertainty. In many practical applications it is desirable to identify inputs with low aleatoric noise, an exam… ▽ More Bayesian optimisation is a sample-efficient search methodology that holds great promise for accelerating drug and materials discovery programs. A frequently-overlooked modelling consideration in Bayesian optimisation strategies however, is the representation of heteroscedastic aleatoric uncertainty. In many practical applications it is desirable to identify inputs with low aleatoric noise, an example of which might be a material composition which consistently displays robust properties in response to a noisy fabrication process. In this paper, we propose a heteroscedastic Bayesian optimisation scheme capable of representing and minimising aleatoric noise across the input space. Our scheme employs a heteroscedastic Gaussian process (GP) surrogate model in conjunction with two straightforward adaptations of existing acquisition functions. First, we extend the augmented expected improvement (AEI) heuristic to the heteroscedastic setting and second, we introduce the aleatoric noise-penalised expected improvement (ANPEI) heuristic. Both methodologies are capable of penalising aleatoric noise in the suggestions and yield improved performance relative to homoscedastic Bayesian optimisation and random sampling on toy problems as well as on two real-world scientific datasets. Code is available at: \url{https://github.com/Ryan-Rhys/Heteroscedastic-BO} △ Less

Submitted 20 May, 2022; v1 submitted 17 October, 2019; originally announced October 2019.

Comments: Published in Machine Learning: Science and Technology 2021 (https://iopscience.iop.org/article/10.1088/2632-2153/ac298c) Earlier version accepted to the 2019 NeurIPS Workshop on Safety and Robustness in Decision Making

Journal ref: Mach. Learn.: Sci. Technol. 3 015004 (2022)

arXiv:1910.00617 [pdf, other]

doi 10.1038/s41467-020-19964-7

Predicting materials properties without crystal structure: Deep representation learning from stoichiometry

Authors: Rhys E. A. Goodall, Alpha A. Lee

Abstract: Machine learning has the potential to accelerate materials discovery by accurately predicting materials properties at a low computational cost. However, the model inputs remain a key stumbling block. Current methods typically use descriptors constructed from knowledge of either the full crystal structure -- therefore only applicable to materials with already characterised structures -- or structur… ▽ More Machine learning has the potential to accelerate materials discovery by accurately predicting materials properties at a low computational cost. However, the model inputs remain a key stumbling block. Current methods typically use descriptors constructed from knowledge of either the full crystal structure -- therefore only applicable to materials with already characterised structures -- or structure-agnostic fixed-length representations hand-engineered from the stoichiometry. We develop a machine learning approach that takes only the stoichiometry as input and automatically learns appropriate and systematically improvable descriptors from data. Our key insight is to treat the stoichiometric formula as a dense weighted graph between elements. Compared to the state of the art for structure-agnostic methods, our approach achieves lower errors with less data. △ Less

Submitted 23 September, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

Comments: A working implementation of our model is available at https://github.com/CompRhys/roost

arXiv:1905.11681 [pdf, other]

doi 10.1007/s10822-019-00274-0

Validating the Validation: Reanalyzing a large-scale comparison of Deep Learning and Machine Learning models for bioactivity prediction

Authors: Matthew C. Robinson, Robert C. Glen, Alpha A. Lee

Abstract: Machine learning methods may have the potential to significantly accelerate drug discovery. However, the increasing rate of new methodological approaches being published in the literature raises the fundamental question of how models should be benchmarked and validated. We reanalyze the data generated by a recently published large-scale comparison of machine learning models for bioactivity predict… ▽ More Machine learning methods may have the potential to significantly accelerate drug discovery. However, the increasing rate of new methodological approaches being published in the literature raises the fundamental question of how models should be benchmarked and validated. We reanalyze the data generated by a recently published large-scale comparison of machine learning models for bioactivity prediction and arrive at a somewhat different conclusion. We show that the performance of support vector machines is competitive with that of deep learning methods. Additionally, using a series of numerical experiments, we question the relevance of area under the receiver operating characteristic curve as a metric in virtual screening, and instead suggest that area under the precision-recall curve should be used in conjunction with the receiver operating characteristic. Our numerical experiments also highlight challenges in estimating the uncertainty in model performance via scaffold-split nested cross validation. △ Less

Submitted 9 June, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

Comments: Code available on GitHub: https://github.com/mc-robinson/validating_validation_supp_info

arXiv:1902.00925 [pdf, ps, other]

doi 10.1039/C9SC00616H

Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning

Authors: Yao Zhang, Alpha A. Lee

Abstract: Predicting bioactivity and physical properties of small molecules is a central challenge in drug discovery. Deep learning is becoming the method of choice but studies to date focus on mean accuracy as the main metric. However, to replace costly and mission-critical experiments by models, a high mean accuracy is not enough: Outliers can derail a discovery campaign, thus models need reliably predict… ▽ More Predicting bioactivity and physical properties of small molecules is a central challenge in drug discovery. Deep learning is becoming the method of choice but studies to date focus on mean accuracy as the main metric. However, to replace costly and mission-critical experiments by models, a high mean accuracy is not enough: Outliers can derail a discovery campaign, thus models need reliably predict when it will fail, even when the training data is biased; experiments are expensive, thus models need to be data-efficient and suggest informative training sets using active learning. We show that uncertainty quantification and active learning can be achieved by Bayesian semi-supervised graph convolutional neural networks. The Bayesian approach estimates uncertainty in a statistically principled way through sampling from the posterior distribution. Semi-supervised learning disentangles representation learning and regression, kee** uncertainty estimates accurate in the low data limit and allowing the model to start active learning from a small initial pool of training data. Our study highlights the promise of Bayesian deep learning for chemistry. △ Less

Submitted 25 July, 2019; v1 submitted 3 February, 2019; originally announced February 2019.

Comments: Chemical Science, in press

arXiv:1811.02633 [pdf, other]

doi 10.1021/acscentsci.9b00576

Molecular Transformer - A Model for Uncertainty-Calibrated Chemical Reaction Prediction

Authors: Philippe Schwaller, Teodoro Laino, Théophile Gaudin, Peter Bolgar, Costas Bekas, Alpha A Lee

Abstract: Organic synthesis is one of the key stumbling blocks in medicinal chemistry. A necessary yet unsolved step in planning synthesis is solving the forward problem: given reactants and reagents, predict the products. Similar to other work, we treat reaction prediction as a machine translation problem between SMILES strings of reactants-reagents and the products. We show that a multi-head attention Mol… ▽ More Organic synthesis is one of the key stumbling blocks in medicinal chemistry. A necessary yet unsolved step in planning synthesis is solving the forward problem: given reactants and reagents, predict the products. Similar to other work, we treat reaction prediction as a machine translation problem between SMILES strings of reactants-reagents and the products. We show that a multi-head attention Molecular Transformer model outperforms all algorithms in the literature, achieving a top-1 accuracy above 90% on a common benchmark dataset. Our algorithm requires no handcrafted rules, and accurately predicts subtle chemical transformations. Crucially, our model can accurately estimate its own uncertainty, with an uncertainty score that is 89% accurate in terms of classifying whether a prediction is correct. Furthermore, we show that the model is able to handle inputs without reactant-reagent split and including stereochemistry, which makes our method universally applicable. △ Less

Submitted 30 May, 2019; v1 submitted 6 November, 2018; originally announced November 2018.

Comments: Machine Learning for Molecules and Materials workshop, NeurIPS 2018 / Platform: https://rxn.res.ibm.com

Journal ref: ACS Central Science, 2019

arXiv:1808.00408 [pdf, other]

doi 10.1103/PhysRevLett.124.108301

Geometry of energy landscapes and the optimizability of deep neural networks

Authors: Simon Becker, Yao Zhang, Alpha A. Lee

Abstract: Deep neural networks are workhorse models in machine learning with multiple layers of non-linear functions composed in series. Their loss function is highly non-convex, yet empirically even gradient descent minimisation is sufficient to arrive at accurate and predictive models. It is hitherto unknown why are deep neural networks easily optimizable. We analyze the energy landscape of a spin glass m… ▽ More Deep neural networks are workhorse models in machine learning with multiple layers of non-linear functions composed in series. Their loss function is highly non-convex, yet empirically even gradient descent minimisation is sufficient to arrive at accurate and predictive models. It is hitherto unknown why are deep neural networks easily optimizable. We analyze the energy landscape of a spin glass model of deep neural networks using random matrix theory and algebraic geometry. We analytically show that the multilayered structure holds the key to optimizability: Fixing the number of parameters and increasing network depth, the number of stationary points in the loss function decreases, minima become more clustered in parameter space, and the tradeoff between the depth and width of minima becomes less severe. Our analytical results are numerically verified through comparison with neural networks trained on a set of classical benchmark datasets. Our model uncovers generic design principles of machine learning models. △ Less

Submitted 1 August, 2018; originally announced August 2018.

Journal ref: Phys. Rev. Lett. 124, 108301 (2020)

arXiv:1805.00975 [pdf, other]

Fluctuation-induced forces in homogeneous isotropic turbulence

Authors: Vamsi Spandan, Daniel Putt, Rodolfo Ostilla-Mónico, Alpha Albert Lee

Abstract: Understanding force generation in non-equilibrium systems is a significant challenge in statistical physics. We uncover a surprising fluctuation-induced force between two plates immersed in homogeneous isotropic turbulence using Direct Numerical Simulation. The force is a non-monotonic function of plate separation. The mechanism of force generation reveals an intriguing analogy with fluctuation-in… ▽ More Understanding force generation in non-equilibrium systems is a significant challenge in statistical physics. We uncover a surprising fluctuation-induced force between two plates immersed in homogeneous isotropic turbulence using Direct Numerical Simulation. The force is a non-monotonic function of plate separation. The mechanism of force generation reveals an intriguing analogy with fluctuation-induced forces: energy in the fluid is localised in regions of high vorticity, or "worms", which have a characteristic length scale. The magnitude of the force depends on the packing of worms inside the plates, with the maximal force attained when the plate separation is comparable to the characteristic worm length. A key implication of our study is that the length scale-dependent partition of energy in an active or non-equilibrium system determines force generation. △ Less

Submitted 17 November, 2018; v1 submitted 2 May, 2018; originally announced May 2018.

arXiv:1803.10596 [pdf, other]

doi 10.1103/PhysRevLett.121.075501

Screening Lengths in Ionic Fluids

Authors: Fabian Coupette, Alpha A. Lee, Andreas Härtel

Abstract: The decay of correlations in ionic fluids is a classical problem in soft matter physics that underpins applications ranging from controlling colloidal self-assembly to batteries and supercapacitors. The conventional wisdom, based on analyzing a solvent-free electrolyte model, suggests that all correlation functions between species decay with a common decay length in the asymptotic far field limit.… ▽ More The decay of correlations in ionic fluids is a classical problem in soft matter physics that underpins applications ranging from controlling colloidal self-assembly to batteries and supercapacitors. The conventional wisdom, based on analyzing a solvent-free electrolyte model, suggests that all correlation functions between species decay with a common decay length in the asymptotic far field limit. Nonetheless, a solvent is present in many electrolyte systems. We show using an analytical theory and molecular dynamics simulations that multiple decay lengths can coexist in the asymptotic limit as well as at intermediate distances once a hard sphere solvent is considered. Our analysis provides an explanation for the recently observed discontinuous change in the structural force across a thin film of ionic liquid-solvent mixtures as the composition is varied, as well as reframes recent debates in the literature about the screening length in concentrated electrolytes. △ Less

Submitted 17 August, 2018; v1 submitted 28 March, 2018; originally announced March 2018.

Journal ref: Phys. Rev. Lett. 121, 075501 (2018)

arXiv:1803.01927 [pdf, other]

doi 10.1080/00268976.2018.1483535

Energy-entropy competition and the effectiveness of stochastic gradient descent in machine learning

Authors: Yao Zhang, Andrew M. Saxe, Madhu S. Advani, Alpha A. Lee

Abstract: Finding parameters that minimise a loss function is at the core of many machine learning methods. The Stochastic Gradient Descent algorithm is widely used and delivers state of the art results for many problems. Nonetheless, Stochastic Gradient Descent typically cannot find the global minimum, thus its empirical effectiveness is hitherto mysterious. We derive a correspondence between parameter inf… ▽ More Finding parameters that minimise a loss function is at the core of many machine learning methods. The Stochastic Gradient Descent algorithm is widely used and delivers state of the art results for many problems. Nonetheless, Stochastic Gradient Descent typically cannot find the global minimum, thus its empirical effectiveness is hitherto mysterious. We derive a correspondence between parameter inference and free energy minimisation in statistical physics. The degree of undersampling plays the role of temperature. Analogous to the energy-entropy competition in statistical physics, wide but shallow minima can be optimal if the system is undersampled, as is typical in many applications. Moreover, we show that the stochasticity in the algorithm has a non-trivial correlation structure which systematically biases it towards wide minima. We illustrate our argument with two prototypical models: image classification using deep learning, and a linear neural network where we can analytically reveal the relationship between entropy and out-of-sample error. △ Less

Submitted 5 March, 2018; originally announced March 2018.

arXiv:1803.00071 [pdf, other]

doi 10.1080/00268976.2018.1478137

Casimir force in dense confined electrolytes

Authors: Alpha A. Lee, Jean-Pierre Hansen, Olivier Bernard, Benjamin Rotenberg

Abstract: Understanding the force between charged surfaces immersed in an electrolyte solution is a classic problem in soft matter and liquid-state theory. Recent experiments showed that the force decays exponentially but the characteristic decay length in a concentrated electrolyte is significantly larger than what liquid-state theories predict based on analysing correlation functions in the bulk electroly… ▽ More Understanding the force between charged surfaces immersed in an electrolyte solution is a classic problem in soft matter and liquid-state theory. Recent experiments showed that the force decays exponentially but the characteristic decay length in a concentrated electrolyte is significantly larger than what liquid-state theories predict based on analysing correlation functions in the bulk electrolyte. Inspired by the classical Casimir effect, we consider an alternative mechanism for force generation, namely the confinement of density fluctuations in the electrolyte by the walls. We show analytically within the random phase approximation, which assumes the ions to be point charges, that this fluctuation-induced force is attractive and also decays exponentially, albeit with a decay length that is half of the bulk correlation length. These predictions change dramatically when excluded volume effects are accounted for within the mean spherical approximation. At high ion concentrations the Casimir force is found to be exponentially damped oscillatory as a function of the distance between the confining surfaces. Our analysis does not resolve the riddle of the anomalously long screening length observed in experiments, but suggests that the Casimir force due to mode restriction in density fluctuations could be an hitherto under-appreciated source of surface-surface interaction. △ Less

Submitted 28 February, 2018; originally announced March 2018.

arXiv:1801.05635 [pdf, other]

doi 10.1021/acs.jpcb.7b11398

Controlling polyelectrolyte adsorption onto carbon nanotubes by tuning ion-image interactions

Authors: Alpha A. Lee, Sarah V. Kostinski, Michael P. Brenner

Abstract: Understanding and controlling polyelectrolyte adsorption onto carbon nanotubes is a fundamen- tal challenge in nanotechology. Polyelectrolytes have been shown to stabilise nanotube suspensions through adsorbing onto the nanotube surface, and polyelectrolyte-coated nanotubes are emerging as building blocks for complex and addressable self-assembly. The conventional wisdom suggests that polyelectrol… ▽ More Understanding and controlling polyelectrolyte adsorption onto carbon nanotubes is a fundamen- tal challenge in nanotechology. Polyelectrolytes have been shown to stabilise nanotube suspensions through adsorbing onto the nanotube surface, and polyelectrolyte-coated nanotubes are emerging as building blocks for complex and addressable self-assembly. The conventional wisdom suggests that polyelectrolyte adsorption onto nanotubes is driven by specific chemical or van der Waals interac- tions. We develop a simple mean-field model and show that ion-image attraction is a significant effect for adsorption onto conducting nanotubes at low salt concentrations. Our theory suggests a simple strategy to selectively and reversibly functionalize carbon nanotubes based on their electronic structure which in turn modifies the ion-image attraction. △ Less

Submitted 17 January, 2018; originally announced January 2018.

Comments: This document is the unedited Author's version of a Submitted Work that was subsequently accepted for publication in Journal of Physical Chemistry B. To access the final edited and published work see http://www.pubs.acs.org/doi/10.1021/acs.jpcb.7b11398

arXiv:1706.08466 [pdf, other]

Inverse Ising inference by combining Ornstein-Zernike theory with deep learning

Authors: Soma Turi, Alpha A. Lee

Abstract: Inferring a generative model from data is a fundamental problem in machine learning. It is well-known that the Ising model is the maximum entropy model for binary variables which reproduces the sample mean and pairwise correlations. Learning the parameters of the Ising model from data is the challenge. We establish an analogy between the inverse Ising problem and the Ornstein-Zernike formalism in… ▽ More Inferring a generative model from data is a fundamental problem in machine learning. It is well-known that the Ising model is the maximum entropy model for binary variables which reproduces the sample mean and pairwise correlations. Learning the parameters of the Ising model from data is the challenge. We establish an analogy between the inverse Ising problem and the Ornstein-Zernike formalism in liquid state physics. Rather than analytically deriving the closure relation, we use a deep neural network to learn the closure from simulations of the Ising model. We show, using simulations as well as biochemical datasets, that the deep neural network model outperforms systematic field-theoretic expansions, is more data-efficient than the pseudolikelihood method, and can generalize well beyond the parameter regime of the training data. The neural network is able to learn from synthetic data, which can be generated with relative ease, to give accurate predictions on real world datasets. △ Less

Submitted 17 June, 2018; v1 submitted 26 June, 2017; originally announced June 2017.

arXiv:1706.02221 [pdf, other]

doi 10.1103/PhysRevLett.119.026002

Scaling analysis of the screening length in concentrated electrolytes

Authors: Alpha A. Lee, Carla Perez-Martinez, Alexander M. Smith, Susan Perkin

Abstract: The interaction between charged objects in an electrolyte solution is a fundamental question in soft matter physics. It is well-known that the electrostatic contribution to the interaction energy decays exponentially with object separation. Recent measurements reveal that, contrary to the conventional wisdom given by classic Poisson-Boltzmann theory, the decay length increases with ion concentrati… ▽ More The interaction between charged objects in an electrolyte solution is a fundamental question in soft matter physics. It is well-known that the electrostatic contribution to the interaction energy decays exponentially with object separation. Recent measurements reveal that, contrary to the conventional wisdom given by classic Poisson-Boltzmann theory, the decay length increases with ion concentration for concentrated electrolytes and can be an order of magnitude larger than the ion diameter in ionic liquids. We derive a simple scaling theory that explains this anomalous dependence of the decay length on ion concentration. Our theory successfully collapses the decay lengths of a wide class of salts onto a single curve. A novel prediction of our theory is that the decay length increases linearly with the Bjerrum length, which we experimentally verify by surface force measurements. Moreover, we quantitatively relate the measured decay length to classic measurements of the activity coefficient in concentrated electrolytes, thus showing that the measured decay length is indeed a bulk property of the concentrated electrolyte as well as contributing a mechanistic insight into empirical activity coefficients. △ Less

Submitted 7 June, 2017; originally announced June 2017.

Comments: To appear in Physical Review Letters

arXiv:1702.06001 [pdf, other]

doi 10.1103/PhysRevLett.119.208101

Optimal design of experiments by combining coarse and fine measurements

Authors: Alpha A. Lee, Michael P. Brenner, Lucy J. Colwell

Abstract: In many contexts it is extremely costly to perform enough high quality experimental measurements to accurately parameterize a predictive quantitative model. However, it is often much easier to carry out large numbers of experiments that indicate whether each sample is above or below a given threshold. Can many such categorical or "coarse" measurements be combined with a much smaller number of high… ▽ More In many contexts it is extremely costly to perform enough high quality experimental measurements to accurately parameterize a predictive quantitative model. However, it is often much easier to carry out large numbers of experiments that indicate whether each sample is above or below a given threshold. Can many such categorical or "coarse" measurements be combined with a much smaller number of high resolution or "fine" measurements to yield accurate models? Here, we demonstrate an intuitive strategy, inspired by statistical physics, wherein the coarse measurements are used to identify the salient features of the data, while the fine measurements determine the relative importance of these features. A linear model is inferred from the fine measurements, augmented by a quadratic term that captures the correlation structure of the coarse data. We illustrate our strategy by considering the problems of predicting the antimalarial potency and aqueous solubility of small organic molecules from their 2D molecular structure. △ Less

Submitted 16 October, 2017; v1 submitted 31 January, 2017; originally announced February 2017.

Comments: To appear in Physical Review Letters

Journal ref: Phys. Rev. Lett. 119, 208101 (2017)

arXiv:1701.08151 [pdf, other]

doi 10.1039/C6FD00250A

Underscreening in concentrated electrolytes

Authors: Alpha A. Lee, Carla Perez-Martinez, Alexander M. Smith, Susan Perkin

Abstract: Screening of a surface charge by electrolyte and the resulting interaction energy between charged objects is of fundamental importance in scenarios from bio-molecular interactions to energy storage. The conventional wisdom is that the interaction energy decays exponentially with object separation and the decay length is a decreasing function of ion concentration; the interaction is thus negligible… ▽ More Screening of a surface charge by electrolyte and the resulting interaction energy between charged objects is of fundamental importance in scenarios from bio-molecular interactions to energy storage. The conventional wisdom is that the interaction energy decays exponentially with object separation and the decay length is a decreasing function of ion concentration; the interaction is thus negligible in a concentrated electrolyte. Contrary to this conventional wisdom, we have shown by surface force measurements that the decay length is an increasing function of ion concentration and Bjerrum length for concentrated electrolytes. In this paper we report surface force measurements to test directly the scaling of the screening length with Bjerrum length. Furthermore, we identify a relationship between the concentration dependence of this screening length and empirical measurements of activity coefficient and differential capacitance. The dependence of the screening length on the ion concentration and the Bjerrum length can be explained by a simple scaling conjecture based on the physical intuition that solvent molecules, rather than ions, are charge carriers in a concentrated electrolyte. △ Less

Submitted 27 January, 2017; originally announced January 2017.

Comments: Accepted as a conference paper for "Chemical Physics of Electroactive Materials: Faraday Discussion" (10-12 April 2017, Cambridge, UK)

arXiv:1701.02005 [pdf, other]

doi 10.1039/C6FD00247A

Controlling turbulent drag across electrolytes using electric fields

Authors: Rodolfo Ostilla-Mónico, Alpha A. Lee

Abstract: Reversible in operando control of friction is an unsolved challenge crucial to industrial tribology. Recent studies show that at low sliding velocities, this control can be achieved by applying an electric field across electrolyte lubricants. However, the phenomenology at high sliding velocities is yet unknown. In this paper, we investigate the hydrodynamic friction across electrolytes under shear… ▽ More Reversible in operando control of friction is an unsolved challenge crucial to industrial tribology. Recent studies show that at low sliding velocities, this control can be achieved by applying an electric field across electrolyte lubricants. However, the phenomenology at high sliding velocities is yet unknown. In this paper, we investigate the hydrodynamic friction across electrolytes under shear beyond the transition to turbulence. We develop a novel, highly parallelised, numerical method for solving the coupled Navier-Stokes Poisson-Nernest-Planck equation. Our results show that turbulent drag cannot be controlled across dilute electrolyte using static electric fields alone. The limitations of the Poisson-Nernst-Planck formalism hints at ways in which turbulent drag could be controlled using electric fields. △ Less

Submitted 9 January, 2017; v1 submitted 8 January, 2017; originally announced January 2017.

Comments: Accepted by the Faraday Discussions on Chemical Physics of Electroactive Materials

arXiv:1611.02234 [pdf, other]

doi 10.1103/PhysRevFluids.2.043103

Hot Particles Attract in a Cold Bath

Authors: Hidenori Tanaka, Alpha A. Lee, Michael P. Brenner

Abstract: Controlling interactions out of thermodynamic equilibrium is crucial for designing addressable and functional self-organizing structures. These active interactions also underpin collective behavior in biological systems. Here we study a general setting of active particles in a bath of passive particles, and demonstrate a novel mechanism for long range attraction between active particles. The mecha… ▽ More Controlling interactions out of thermodynamic equilibrium is crucial for designing addressable and functional self-organizing structures. These active interactions also underpin collective behavior in biological systems. Here we study a general setting of active particles in a bath of passive particles, and demonstrate a novel mechanism for long range attraction between active particles. The mechanism operates when the translational persistence length of the active particle motion is smaller than the particle diameter. In this limit, the system reduces to particles of higher diffusivity ("hot" particles) in a bath of particles with lower diffusivity ("cold" particles). This attractive interaction arises as a hot particle pushes cold particles away to create a large hole around itself, and the holes interact via a depletion-like attraction. Strikingly, the interaction range is more than an order of magnitude larger than the particle radius, well beyond the range of conventional depletion force. Although the mechanism occurs outside the parameter regime of typical biological swimmers, the mechanism could be realized in the laboratory. △ Less

Submitted 30 March, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

Journal ref: Phys. Rev. Fluids, 2, 043103 ()2017

arXiv:1607.03926 [pdf]

doi 10.1021/acs.jpclett.6b00867

The Electrostatic Screening Length in Concentrated Electrolytes Increases with Concentration

Authors: Alexander M. Smith, Alpha A. Lee, Susan Perkin

Abstract: According to classical electrolyte theories interactions in dilute (low ion density) electrolytes decay exponentially with distance, with the Debye screening length the characteristic length-scale. This decay length decreases monotonically with increasing ion concentration, due to effective screening of charges over short distances. Thus within the Debye model no long-range forces are expected in… ▽ More According to classical electrolyte theories interactions in dilute (low ion density) electrolytes decay exponentially with distance, with the Debye screening length the characteristic length-scale. This decay length decreases monotonically with increasing ion concentration, due to effective screening of charges over short distances. Thus within the Debye model no long-range forces are expected in concentrated electrolytes. Here we reveal, using experimental detection of the interaction between two planar charged surfaces across a wide range of electrolytes, that beyond the dilute (Debye-Huuckel) regime the screening length increases with increasing concentration. The screening lengths for all electrolytes studied - including aqueous NaCl solutions, ionic liquids diluted with propylene carbonate, and pure ionic liquids - collapse onto a single curve when scaled by the dielectric constant. This non-monotonic variation of the screening length with concentration, and its generality across ionic liquids and aqueous salt solutions, demonstrates an important characteristic of concentrated electrolytes of substantial relevance from biology to energy storage. △ Less

Submitted 13 July, 2016; originally announced July 2016.

Comments: This document is the unedited authors' version of a Submitted Work that was subsequently accepted for publication in the Journal of Physical Chemistry Letters, copyright American Chemical Society, after peer review. To access the final edited and published work see http://pubsdc3.acs.org/articlesonrequest/AOR-EW6FuIC6wIh6D9qqEeHD

Journal ref: Journal of Physical Chemistry Letters 2016, 7, 2157-2163

arXiv:1606.05922 [pdf, ps, other]

doi 10.1209/0295-5075/113/38005

Quantum Capacitance Modifies Interionic Interactions in Semiconducting Nanopores

Authors: Alpha A. Lee, Dominic Vella, Alain Goriely

Abstract: Nanopores made with low dimensional semiconducting materials, such as carbon nanotubes and graphene slit pores, are used in supercapacitors. In theories and simulations of their operation, it is often assumed that such pores screen ion-ion interactions like metallic pores, i.e. that screening leads to an exponential decay of the interaction potential with ion separation. By introducing a quantum c… ▽ More Nanopores made with low dimensional semiconducting materials, such as carbon nanotubes and graphene slit pores, are used in supercapacitors. In theories and simulations of their operation, it is often assumed that such pores screen ion-ion interactions like metallic pores, i.e. that screening leads to an exponential decay of the interaction potential with ion separation. By introducing a quantum capacitance that accounts for the density of states in the material, we show that ion-ion interactions in carbon nanotubes and graphene slit pores actually decay algebraically with ion separation. This result suggests a new avenue of capacitance optimization based on tuning the electronic structure of a pore: a marked enhancement in capacitance might be achieved by develo** nanopores made with metallic materials or bulk semimetallic materials. △ Less

Submitted 19 June, 2016; originally announced June 2016.

Journal ref: Europhysics Letters, 113, 38005 (2016)

arXiv:1510.06354 [pdf, ps, other]

doi 10.1039/C6SM01927G

Microscopic mechanism of thermomolecular orientation and polarization

Authors: Alpha A. Lee

Abstract: Recent molecular dynamics simulations show that thermal gradients can induce electric fields in water that are comparable in magnitude to electric fields seen in ionic thin films and biomembranes. This surprising non-equilibrium phenomenon of thermomolecular orientation is also observed more generally in simulations of polar and non-polar size-asymmetric dumbbell fluids. However, a microscopic the… ▽ More Recent molecular dynamics simulations show that thermal gradients can induce electric fields in water that are comparable in magnitude to electric fields seen in ionic thin films and biomembranes. This surprising non-equilibrium phenomenon of thermomolecular orientation is also observed more generally in simulations of polar and non-polar size-asymmetric dumbbell fluids. However, a microscopic theory linking thermomolecular orientation and polarization to molecular properties is yet unknown. Here, we formulate an analytically solvable microscopic model of size-asymmetric dumbbell molecules in a temperature gradient using a mean-field, local equilibrium approach. Our theory reveals the relationship between the extent of thermomolecular orientation and polarization, and molecular volume, size anisotropy and dipole moment. Predictions of the theory agree quantitatively with molecular dynamics simulations. Crucially, our framework shows how thermomolecular orientation can be controlled and maximized by tuning microscopic molecular properties. △ Less

Submitted 7 December, 2016; v1 submitted 21 October, 2015; originally announced October 2015.

Journal ref: Soft Matter, 12, 8661 (2016)

arXiv:1510.05595 [pdf, ps, other]

doi 10.1103/PhysRevX.6.021034

Capacitance-Power-Hysteresis Trilemma in Nanoporous Supercapacitors

Authors: Alpha A Lee, Dominic Vella, Alain Goriely, Svyatoslav Kondrat

Abstract: Nanoporous supercapacitors are an important player in the field of energy storage that fill the gap between dielectric capacitors and batteries. The key challenge in the development of supercapacitors is the perceived trade-off between capacitance and power delivery. Current efforts to boost the capacitance of nanoporous supercapacitors focus on reducing the pore size so that they can only accommo… ▽ More Nanoporous supercapacitors are an important player in the field of energy storage that fill the gap between dielectric capacitors and batteries. The key challenge in the development of supercapacitors is the perceived trade-off between capacitance and power delivery. Current efforts to boost the capacitance of nanoporous supercapacitors focus on reducing the pore size so that they can only accommodate a single layer of ions. However, this tight packing compromises the charging dynamics and hence power density. We show via an analytical theory and Monte Carlo simulations that charging is sensitively dependent on the affinity of ions to the pores, and that high capacitances can be obtained for ionophobic pores of widths significantly larger than the ion diameter. Our theory also predicts that charging can be hysteretic with a significant energy loss per cycle for intermediate ionophilicities. We use these observations to explore the parameter regimes in which a capacitance-power-hysteresis trilemma may be avoided. △ Less

Submitted 15 June, 2016; v1 submitted 19 October, 2015; originally announced October 2015.

Journal ref: Phys. Rev. X 6, 021034 (2016)

arXiv:1508.05380 [pdf, other]

doi 10.1016/j.eml.2015.08.003

The role of extensibility in the birth of a ruck in a rug

Authors: Alpha A. Lee, Clément Le Gouellec, Dominic Vella

Abstract: Everyday experience suggests that a `ruck' forms when the two ends of a heavy carpet or rug are brought closer together. Classical analysis, however, shows that the horizontal compressive force needed to create such a ruck should be infinite. We show that this apparent paradox is due to the assumption of inextensibility of the rug. By accounting for a finite extensibility, we show that rucks appea… ▽ More Everyday experience suggests that a `ruck' forms when the two ends of a heavy carpet or rug are brought closer together. Classical analysis, however, shows that the horizontal compressive force needed to create such a ruck should be infinite. We show that this apparent paradox is due to the assumption of inextensibility of the rug. By accounting for a finite extensibility, we show that rucks appear with a finite, non-zero end-shortening and confirm our theoretical results with simple experiments. Finally, we note that the appropriate measure of extensibility, the stretchability, is in this case not determined purely by geometry, but incorporates the mechanics of the sheet. △ Less

Submitted 11 December, 2015; v1 submitted 21 August, 2015; originally announced August 2015.

Comments: Revised version - small typos corrected

Journal ref: Extr. Mech. Lett. 5, 81-87 (2015)

arXiv:1507.02410 [pdf, ps, other]

Sharp Interface Limits of the Cahn-Hilliard Equation with Degenerate Mobility

Authors: Alpha Albert Lee, Andreas Münch, Endre Süli

Abstract: In this work, the sharp interface limit of the degenerate Cahn-Hilliard equation (in two space dimensions) with a polynomial double well free energy and a quadratic mobility is derived via a matched asymptotic analysis involving exponentially large and small terms and multiple inner layers. In contrast to some results found in the literature, our analysis reveals that the interface motion is drive… ▽ More In this work, the sharp interface limit of the degenerate Cahn-Hilliard equation (in two space dimensions) with a polynomial double well free energy and a quadratic mobility is derived via a matched asymptotic analysis involving exponentially large and small terms and multiple inner layers. In contrast to some results found in the literature, our analysis reveals that the interface motion is driven by a combination of surface diffusion flux proportional to the surface Laplacian of the interface curvature and an additional contribution from nonlinear, porous-medium type bulk diffusion, For higher degenerate mobilities, bulk diffusion is subdominant. The sharp interface models are corroborated by comparing relaxation rates of perturbations to a radially symmetric stationary state with those obtained by the phase field model. △ Less

Submitted 9 July, 2015; originally announced July 2015.

Comments: 27 pages, 2 figures

arXiv:1505.06876 [pdf, other]

doi 10.1073/pnas.1701739114

Fluctuation Spectra and Force Generation in Non-equilibrium Systems

Authors: Alpha A. Lee, Dominic Vella, John S. Wettlaufer

Abstract: Many biological systems are appropriately viewed as passive inclusions immersed in an active bath: from proteins on active membranes to microscopic swimmers confined by boundaries. The non-equilibrium forces exerted by the active bath on the inclusions or boundaries often regulate function, and such forces may also be exploited in artificial active materials. Nonetheless, the general phenomenology… ▽ More Many biological systems are appropriately viewed as passive inclusions immersed in an active bath: from proteins on active membranes to microscopic swimmers confined by boundaries. The non-equilibrium forces exerted by the active bath on the inclusions or boundaries often regulate function, and such forces may also be exploited in artificial active materials. Nonetheless, the general phenomenology of these active forces remains elusive. We show that the fluctuation spectrum of the active medium, the partitioning of energy as a function of wavenumber, controls the phenomenology of force generation. We find that for a narrow, unimodal spectrum, the force exerted by a non-equilibrium system on two embedded walls depends on the width and the position of the peak in the fluctuation spectrum, and oscillates between repulsion and attraction as a function of wall separation. We examine two apparently disparate examples: the Maritime Casimir effect and recent simulations of active Brownian particles. A key implication of our work is that important non-equilibrium interactions are encoded within the fluctuation spectrum. In this sense the noise becomes the signal. △ Less

Submitted 24 June, 2017; v1 submitted 26 May, 2015; originally announced May 2015.

Journal ref: Proc. Nat. Acad. Sci. USA, vol. 114 (35), 9255-60 (2017)

arXiv:1505.06381 [pdf, ps, other]

doi 10.1063/1.4929696

Degenerate Mobilities in Phase Field Models are Insufficient to Capture Surface Diffusion

Authors: Alpha A Lee, Andreas Münch, Endre Süli

Abstract: Phase field models frequently provide insight to phase transitions, and are robust numerical tools to solve free boundary problems corresponding to the motion of interfaces. A body of prior literature suggests that interface motion via surface diffusion is the long-time, sharp interface limit of microscopic phase field models such as the Cahn-Hilliard equation with a degenerate mobility function.… ▽ More Phase field models frequently provide insight to phase transitions, and are robust numerical tools to solve free boundary problems corresponding to the motion of interfaces. A body of prior literature suggests that interface motion via surface diffusion is the long-time, sharp interface limit of microscopic phase field models such as the Cahn-Hilliard equation with a degenerate mobility function. Contrary to this conventional wisdom, we show that the long-time behaviour of degenerate Cahn-Hilliard equation with a polynomial free energy undergoes coarsening, reflecting the presence of bulk diffusion, rather than pure surface diffusion. This reveals an important limitation of phase field models that are frequently used to model surface diffusion. △ Less

Submitted 23 May, 2015; originally announced May 2015.

arXiv:1502.01276 [pdf, ps, other]

doi 10.1103/PhysRevLett.115.106101

Dynamics of Ion Transport in Ionic Liquids

Authors: Alpha A. Lee, Svyatoslav Kondrat, Dominic Vella, Alain Goriely

Abstract: A gap in understanding the link between continuum theories of ion transport in ionic liquids and the underlying microscopic dynamics has hindered the development of frameworks for transport phenomena in these concentrated electrolytes. Here, we construct a continuum theory for ion transport in ionic liquids by coarse graining a simple exclusion process of interacting particles on a lattice. The re… ▽ More A gap in understanding the link between continuum theories of ion transport in ionic liquids and the underlying microscopic dynamics has hindered the development of frameworks for transport phenomena in these concentrated electrolytes. Here, we construct a continuum theory for ion transport in ionic liquids by coarse graining a simple exclusion process of interacting particles on a lattice. The resulting dynamical equations can be written as a gradient flow with a mobility matrix that vanishes at high densities. This form of the mobility matrix gives rise to a charging behaviour that is different to the one known for electrolytic solutions, but which agrees qualitatively with the phenomenology observed in experiments and simulations. △ Less

Submitted 11 August, 2015; v1 submitted 4 February, 2015; originally announced February 2015.

Comments: To appear in PRL

Journal ref: Phys. Rev. Lett. 115, 106101 (2015)

arXiv:1412.7887 [pdf, other]

doi 10.1021/jz502250z

Are Room Temperature Ionic Liquids Dilute Electrolytes?

Authors: Alpha A Lee, Dominic Vella, Susan Perkin, Alain Goriely

Abstract: An important question in understanding the structure of ionic liquids is whether ions are truly "free" and mobile which would correspond to a concentrated ionic melt, or are rather "bound" in ion pairs, that is a liquid of ion pairs with a small concentration of free ions. Recent surface force balance experiments from different groups have given conflicting answers to this question. We propose a s… ▽ More An important question in understanding the structure of ionic liquids is whether ions are truly "free" and mobile which would correspond to a concentrated ionic melt, or are rather "bound" in ion pairs, that is a liquid of ion pairs with a small concentration of free ions. Recent surface force balance experiments from different groups have given conflicting answers to this question. We propose a simple model for the thermodynamics and kinetics of ion pairing in ionic liquids. Our model takes into account screened ion-ion, dipole-dipole and dipole-ion interactions in the mean field limit. The results of this model suggest that almost two thirds of the ions are free at any instant, and ion pairs have a short lifetime comparable to the characteristic timescale for diffusion. These results suggest that there is no particular thermodynamic or kinetic preference for ions residing in pairs. We therefore conclude that ionic liquids are concentrated, rather than dilute, electrolytes. △ Less

Submitted 25 December, 2014; originally announced December 2014.

Journal ref: Journal of Physical Chemistry Letters, 2015, 6, 159-163

arXiv:1405.5448 [pdf, ps, other]

doi 10.1063/1.4893714

Unravelling Nanoconfined Films of Ionic Liquids

Authors: Alpha A Lee, Dominic Vella, Susan Perkin, Alain Goriely

Abstract: The confinement of an ionic liquid between charged solid surfaces is treated using an exactly solvable 1D Coulomb gas model. The theory highlights the importance of two dimensionless parameters: the fugacity of the ionic liquid, and the electrostatic interaction energy of ions at closest approach relative to thermal energy, in determining how the disjoining pressure exerted on the walls depends on… ▽ More The confinement of an ionic liquid between charged solid surfaces is treated using an exactly solvable 1D Coulomb gas model. The theory highlights the importance of two dimensionless parameters: the fugacity of the ionic liquid, and the electrostatic interaction energy of ions at closest approach relative to thermal energy, in determining how the disjoining pressure exerted on the walls depends on the geometrical confinement. Our theory reveals that thermodynamic fluctuations play a vital role in the "squeezing out" of charged layers as the confinement is increased. The model shows good qualitative agreement with previous experimental data, with all parameters independently estimated without fitting. △ Less

Submitted 1 August, 2014; v1 submitted 21 May, 2014; originally announced May 2014.

Journal ref: J. Chem. Phys. 141, 094904 (2014)

arXiv:1212.2148 [pdf, ps, other]

Electroactuation with Single Charge Carrier Ionomers

Authors: Alpha A. Lee, Ralph H. Colby, Alexei A. Kornyshev

Abstract: A simple theory of electromechanical transduction for single-charge-carrier double-layer electroactuators is developed, in which the ion distribution and curvature are mutually coupled. The obtained expressions for the dependence of curvature and charge accumulation on the applied voltage, as well as the electroactuation dynamics, are compared with literature data. The mechanical- or sensor- perfo… ▽ More A simple theory of electromechanical transduction for single-charge-carrier double-layer electroactuators is developed, in which the ion distribution and curvature are mutually coupled. The obtained expressions for the dependence of curvature and charge accumulation on the applied voltage, as well as the electroactuation dynamics, are compared with literature data. The mechanical- or sensor- performance of such electroactuators appears to be determined by just three cumulative parameters, with all of their constituents measurable, permitting a scaling approach to their design. △ Less

Submitted 10 December, 2012; originally announced December 2012.

Showing 1–42 of 42 results for author: Lee, A A