-
Accelerating Defect Predictions in Semiconductors Using Graph Neural Networks
Authors:
Md Habibur Rahman,
Prince Gollapalli,
Panayotis Manganaris,
Satyesh Kumar Yadav,
Ghanshyam Pilania,
Brian DeCost,
Kamal Choudhary,
Arun Mannodi-Kanakkithodi
Abstract:
Here, we develop a framework for the prediction and screening of native defects and functional impurities in a chemical space of Group IV, III-V, and II-VI zinc blende (ZB) semiconductors, powered by crystal Graph-based Neural Networks (GNNs) trained on high-throughput density functional theory (DFT) data. Using an innovative approach of sampling partially optimized defect configurations from DFT…
▽ More
Here, we develop a framework for the prediction and screening of native defects and functional impurities in a chemical space of Group IV, III-V, and II-VI zinc blende (ZB) semiconductors, powered by crystal Graph-based Neural Networks (GNNs) trained on high-throughput density functional theory (DFT) data. Using an innovative approach of sampling partially optimized defect configurations from DFT calculations, we generate one of the largest computational defect datasets to date, containing many types of vacancies, self-interstitials, anti-site substitutions, impurity interstitials and substitutions, as well as some defect complexes. We applied three types of established GNN techniques, namely Crystal Graph Convolutional Neural Network (CGCNN), Materials Graph Network (MEGNET), and Atomistic Line Graph Neural Network (ALIGNN), to rigorously train models for predicting defect formation energy (DFE) in multiple charge states and chemical potential conditions. We find that ALIGNN yields the best DFE predictions with root mean square errors around 0.3 eV, which represents a prediction accuracy of 98 % given the range of values within the dataset, improving significantly on the state-of-the-art. Models are tested for different defect types as well as for defect charge transition levels. We further show that GNN-based defective structure optimization can take us close to DFT-optimized geometries at a fraction of the cost of full DFT. DFT-GNN models enable prediction and screening across thousands of hypothetical defects based on both unoptimized and partially-optimized defective structures, hel** identify electronically active defects in technologically-important semiconductors.
△ Less
Submitted 13 September, 2023; v1 submitted 12 September, 2023;
originally announced September 2023.
-
Emulating Expert Insight: A Robust Strategy for Optimal Experimental Design
Authors:
Matthew R. Carbone,
Hyeong ** Kim,
Chandima Fernando,
Shinjae Yoo,
Daniel Olds,
Howie Joress,
Brian DeCost,
Bruce Ravel,
Yugang Zhang,
Phillip M. Maffettone
Abstract:
The challenge of optimal design of experiments (DOE) pervades materials science, physics, chemistry, and biology. Bayesian optimization has been used to address this challenge in vast sample spaces, although it requires framing experimental campaigns through the lens of maximizing some observable. This framing is insufficient for epistemic research goals that seek to comprehensively analyze a samp…
▽ More
The challenge of optimal design of experiments (DOE) pervades materials science, physics, chemistry, and biology. Bayesian optimization has been used to address this challenge in vast sample spaces, although it requires framing experimental campaigns through the lens of maximizing some observable. This framing is insufficient for epistemic research goals that seek to comprehensively analyze a sample space, without an explicit scalar objective (e.g., the characterization of a wafer or sample library). In this work, we propose a flexible formulation of scientific value that recasts a dataset of input conditions and higher-dimensional observable data into a continuous, scalar metric. Intuitively, the scientific value function measures where observables change significantly, emulating the perspective of experts driving an experiment, and can be used in collaborative analysis tools or as an objective for optimization techniques. We demonstrate this technique by exploring simulated phase boundaries from different observables, autonomously driving a variable temperature measurement of a ferroelectric material, and providing feedback from a nanoparticle synthesis campaign. The method is seamlessly compatible with existing optimization tools, can be extended to multi-modal and multi-fidelity experiments, and can integrate existing models of an experimental system. Because of its flexibility, it can be deployed in a range of experimental settings for autonomous or accelerated experiments.
△ Less
Submitted 25 July, 2023;
originally announced July 2023.
-
Reproducible Sorbent Materials Foundry for Carbon Capture at Scale
Authors:
Austin McDannald,
Howie Joress,
Brian DeCost,
Avery E. Baumann,
A. Gilad Kusne,
Kamal Choudhary,
Taner Yildirim,
Daniel W. Siderius,
Winnie Wong-Ng,
Andrew J. Allen,
Christopher M. Stafford,
Diana Ortiz-Montalvo
Abstract:
We envision an autonomous sorbent materials foundry (SMF) for rapidly evaluating materials for direct air capture of carbon dioxide (CO2), specifically targeting novel metal organic framework materials. Our proposed SMF is hierarchical, simultaneously addressing the most critical gaps in the inter-related space of sorbent material synthesis, processing, properties, and performance. The ability to…
▽ More
We envision an autonomous sorbent materials foundry (SMF) for rapidly evaluating materials for direct air capture of carbon dioxide (CO2), specifically targeting novel metal organic framework materials. Our proposed SMF is hierarchical, simultaneously addressing the most critical gaps in the inter-related space of sorbent material synthesis, processing, properties, and performance. The ability to collect these critical data streams in an agile, coordinated, and automated fashion will enable efficient end-to-end sorbent materials design through machine learning driven research framework.
△ Less
Submitted 25 July, 2022;
originally announced July 2022.
-
Development of an automated millifluidic platform and data-analysis pipeline for rapid electrochemical corrosion measurements: a pH study on Zn-Ni
Authors:
Howie Joress,
Brian DeCost,
Najlaa Hassan,
Trevor M. Braun,
Justin M. Gorham,
Jason Hattrick-Simpers
Abstract:
We describe the development of a millifluidic based scanning droplet cell platform for rapid and automated corrosion. This system allows for measurement of corrosion properties (e.g., open circuit potential, corrosion current through Tafel and linear polarization resistance measurements, and cyclic voltammograms) on a localized section of a planar sample. Our system is highly automated and flexibl…
▽ More
We describe the development of a millifluidic based scanning droplet cell platform for rapid and automated corrosion. This system allows for measurement of corrosion properties (e.g., open circuit potential, corrosion current through Tafel and linear polarization resistance measurements, and cyclic voltammograms) on a localized section of a planar sample. Our system is highly automated and flexible, allowing for scripted changing and mixing of solutions and point-to-point motion on the sample. We have also created an automated data analysis pipeline. Here we demonstrate this tool by corroding a plate of electroplated Zn$_{85}$Ni$_{15}$ alloy over a range of pH values and correlate our results with XPS measurements and literature.
△ Less
Submitted 1 April, 2022;
originally announced April 2022.
-
Recent Advances and Applications of Deep Learning Methods in Materials Science
Authors:
Kamal Choudhary,
Brian DeCost,
Chi Chen,
Anubhav Jain,
Francesca Tavazza,
Ryan Cohn,
Cheol WooPark,
Alok Choudhary,
Ankit Agrawal,
Simon J. L. Billinge,
Elizabeth Holm,
Shyue ** Ong,
Chris Wolverton
Abstract:
Deep learning (DL) is one of the fastest growing topics in materials data science, with rapidly emerging applications spanning atomistic, image-based, spectral, and textual data modalities. DL allows analysis of unstructured data and automated identification of features. Recent development of large materials databases has fueled the application of DL methods in atomistic prediction in particular.…
▽ More
Deep learning (DL) is one of the fastest growing topics in materials data science, with rapidly emerging applications spanning atomistic, image-based, spectral, and textual data modalities. DL allows analysis of unstructured data and automated identification of features. Recent development of large materials databases has fueled the application of DL methods in atomistic prediction in particular. In contrast, advances in image and spectral data have largely leveraged synthetic data enabled by high quality forward models as well as by generative unsupervised DL methods. In this article, we present a high-level overview of deep-learning methods followed by a detailed discussion of recent developments of deep learning in atomistic simulation, materials imaging, spectral analysis, and natural language processing. For each modality we discuss applications involving both theoretical and experimental data, typical modeling approaches with their strengths and limitations, and relevant publicly available software and datasets. We conclude the review with a discussion of recent cross-cutting work related to uncertainty quantification in this field and a brief perspective on limitations, challenges, and potential growth areas for DL methods in materials science. The application of DL methods in materials science presents an exciting avenue for future materials discovery and design.
△ Less
Submitted 27 October, 2021;
originally announced October 2021.
-
The Joint Automated Repository for Various Integrated Simulations (JARVIS) for data-driven materials design
Authors:
Kamal Choudhary,
Kevin F. Garrity,
Andrew C. E. Reid,
Brian DeCost,
Adam J. Biacchi,
Angela R. Hight Walker,
Zachary Trautt,
Jason Hattrick-Simpers,
A. Gilad Kusne,
Andrea Centrone,
Albert Davydov,
Jie Jiang,
Ruth Pachter,
Gowoon Cheon,
Evan Reed,
Ankit Agrawal,
Xiaofeng Qian,
Vinit Sharma,
Houlong Zhuang,
Sergei V. Kalinin,
Bobby G. Sumpter,
Ghanshyam Pilania,
Pinar Acar,
Subhasish Mandal,
Kristjan Haule
, et al. (3 additional authors not shown)
Abstract:
The Joint Automated Repository for Various Integrated Simulations (JARVIS) is an integrated infrastructure to accelerate materials discovery and design using density functional theory (DFT), classical force-fields (FF), and machine learning (ML) techniques. JARVIS is motivated by the Materials Genome Initiative (MGI) principles of develo** open-access databases and tools to reduce the cost and d…
▽ More
The Joint Automated Repository for Various Integrated Simulations (JARVIS) is an integrated infrastructure to accelerate materials discovery and design using density functional theory (DFT), classical force-fields (FF), and machine learning (ML) techniques. JARVIS is motivated by the Materials Genome Initiative (MGI) principles of develo** open-access databases and tools to reduce the cost and development time of materials discovery, optimization, and deployment. The major features of JARVIS are: JARVIS-DFT, JARVIS-FF, JARVIS-ML, and JARVIS-Tools. To date, JARVIS consists of 40,000 materials and 1 million calculated properties in JARVIS-DFT, 1,500 materials and 110 force-fields in JARVIS-FF, and 25 ML models for material-property predictions in JARVIS-ML, all of which are continuously expanding. JARVIS-Tools provides scripts and workflows for running and analyzing various simulations. We compare our computational data to experiments or high-fidelity computational methods wherever applicable to evaluate error/uncertainty in predictions. In addition to the existing workflows, the infrastructure can support a wide variety of other technologically important applications as part of the data-driven materials design paradigm. The JARVIS datasets and tools are publicly available at the website: https://jarvis.nist.gov .
△ Less
Submitted 11 July, 2021; v1 submitted 3 July, 2020;
originally announced July 2020.
-
Scientific AI in materials science: a path to a sustainable and scalable paradigm
Authors:
Brian DeCost,
Jason Hattrick-Simpers,
Zachary Trautt,
Aaron Kusne,
Eva Campo,
Martin Green
Abstract:
Recently there has been an ever-increasing trend in the use of machine learning (ML) and artificial intelligence (AI) methods by the materials science, condensed matter physics, and chemistry communities. This perspective article identifies key scientific, technical, and social opportunities that the materials community must prioritize to consistently develop and leverage Scientific AI to provide…
▽ More
Recently there has been an ever-increasing trend in the use of machine learning (ML) and artificial intelligence (AI) methods by the materials science, condensed matter physics, and chemistry communities. This perspective article identifies key scientific, technical, and social opportunities that the materials community must prioritize to consistently develop and leverage Scientific AI to provide a credible path towards the advancement of current materials-limited technologies. Here we highlight the intersections of these opportunities with a series of proposed paths forward. The opportunities are roughly sorted from scientific/technical (e.g., development of robust, physically meaningful multiscale material representations) to social (e.g., promoting an AI-ready workforce). The proposed paths forward range from develo** new infrastructure and capabilities to deploying them in industry and academia. We provide a brief introduction to AI in materials science and engineering, followed by detailed discussions of each of the opportunities and paths forward.
△ Less
Submitted 18 March, 2020;
originally announced March 2020.
-
Accelerating Photovoltaic Materials Development via High-Throughput Experiments and Machine-Learning-Assisted Diagnosis
Authors:
Shi**g Sun,
Noor T. P. Hartono,
Zekun D. Ren,
Felipe Oviedo,
Antonio M. Buscemi,
Mariya Layurova,
De Xin Chen,
Tofunmi Ogunfunmi,
Janak Thapa,
Savitha Ramasamy,
Charles Settens,
Brian L. DeCost,
Aaron Gilad Kusne,
Zhe Liu,
Siyu I. P. Tian,
I. Marius Peters,
Juan-Pablo Correa-Baena,
Tonio Buonassisi
Abstract:
Accelerating the experimental cycle for new materials development is vital for addressing the grand energy challenges of the 21st century. We fabricate and characterize 75 unique halide perovskite-inspired solution-based thin-film materials within a two-month period, with 87% exhibiting band gaps between 1.2 eV and 2.4 eV that are of interest for energy-harvesting applications. This increased thro…
▽ More
Accelerating the experimental cycle for new materials development is vital for addressing the grand energy challenges of the 21st century. We fabricate and characterize 75 unique halide perovskite-inspired solution-based thin-film materials within a two-month period, with 87% exhibiting band gaps between 1.2 eV and 2.4 eV that are of interest for energy-harvesting applications. This increased throughput is enabled by streamlining experimental workflows, develo** a set of precursors amenable to high-throughput synthesis, and develo** machine-learning assisted diagnosis. We utilize a deep neural network to classify compounds based on experimental X-ray diffraction data into 0D, 2D, and 3D structures more than 10 times faster than human analysis and with 90% accuracy. We validate our methods using lead-halide perovskites and extend the application to novel lead-free compositions. The wider synthesis window and faster cycle of learning enables three noteworthy scientific findings: (1) we realize four inorganic layered perovskites, A3B2Br9 (A = Cs, Rb; B = Bi, Sb) in thin-film form via one-step liquid deposition; (2) we report a multi-site lead-free alloy series that was not previously described in literature, Cs3(Bi1-xSbx)2(I1-xBrx)9; and (3) we reveal the effect on bandgap (reduction to <2 eV) and structure upon simultaneous alloying on the B-site and X-site of Cs3Bi2I9 with Sb and Br. This study demonstrates that combining an accelerated experimental cycle of learning and machine-learning based diagnosis represents an important step toward realizing fully-automated laboratories for materials discovery and development.
△ Less
Submitted 25 November, 2018;
originally announced December 2018.
-
Fast and interpretable classification of small X-ray diffraction datasets using data augmentation and deep neural networks
Authors:
Felipe Oviedo,
Zekun Ren,
Shi**g Sun,
Charlie Settens,
Zhe Liu,
Noor Titan Putri Hartono,
Ramasamy Savitha,
Brian L. DeCost,
Siyu I. P. Tian,
Giuseppe Romano,
Aaron Gilad Kusne,
Tonio Buonassisi
Abstract:
X-ray diffraction (XRD) data acquisition and analysis is among the most time-consuming steps in the development cycle of novel thin-film materials. We propose a machine-learning-enabled approach to predict crystallographic dimensionality and space group from a limited number of thin-film XRD patterns. We overcome the scarce-data problem intrinsic to novel materials development by coupling a superv…
▽ More
X-ray diffraction (XRD) data acquisition and analysis is among the most time-consuming steps in the development cycle of novel thin-film materials. We propose a machine-learning-enabled approach to predict crystallographic dimensionality and space group from a limited number of thin-film XRD patterns. We overcome the scarce-data problem intrinsic to novel materials development by coupling a supervised machine learning approach with a model agnostic, physics-informed data augmentation strategy using simulated data from the Inorganic Crystal Structure Database (ICSD) and experimental data. As a test case, 115 thin-film metal halides spanning 3 dimensionalities and 7 space-groups are synthesized and classified. After testing various algorithms, we develop and implement an all convolutional neural network, with cross validated accuracies for dimensionality and space-group classification of 93% and 89%, respectively. We propose average class activation maps, computed from a global average pooling layer, to allow high model interpretability by human experimentalists, elucidating the root causes of misclassification. Finally, we systematically evaluate the maximum XRD pattern step size (data acquisition rate) before loss of predictive accuracy occurs, and determine it to be 0.16°, which enables an XRD pattern to be obtained and classified in 5.5 minutes or less.
△ Less
Submitted 23 April, 2019; v1 submitted 20 November, 2018;
originally announced November 2018.