-
Discovering Novel Halide Perovskite Alloys using Multi-Fidelity Machine Learning and Genetic Algorithm
Authors:
Jiaqi Yang,
Panayotis Manganaris,
Arun Mannodi-Kanakkithodi
Abstract:
Expanding the pool of stable halide perovskites with attractive optoelectronic properties is crucial to addressing current limitations in their performance as photovoltaic (PV) absorbers. In this article, we demonstrate how a high-throughput density functional theory (DFT) dataset of halide perovskite alloys can be used to train accurate surrogate models for property prediction and subsequently pe…
▽ More
Expanding the pool of stable halide perovskites with attractive optoelectronic properties is crucial to addressing current limitations in their performance as photovoltaic (PV) absorbers. In this article, we demonstrate how a high-throughput density functional theory (DFT) dataset of halide perovskite alloys can be used to train accurate surrogate models for property prediction and subsequently perform inverse design using genetic algorithm (GA). Our dataset consists of decomposition energies, band gaps, and photovoltaic efficiencies of nearly 800 pure and mixed composition ABX$_3$ compounds from both the GGA-PBE and HSE06 functionals, and are combined with ~ 100 experimental data points collected from the literature. Multi-fidelity random forest regression models are trained on the DFT + experimental dataset for each property using descriptors that one-hot encode composition, phase, and fidelity, and additionally include well-known elemental or molecular properties of species at the A, B, and X sites. Rigorously optimized models are deployed for experiment-level prediction over > 150,000 hypothetical compounds, leading to thousands of promising materials with low decomposition energy, band gap between 1 and 2 eV, and efficiency > 15%. Surrogate models are further combined with GA using an objective function to maintain chemical feasibility, minimize decomposition energy, maximize PV efficiency, and keep band gap between 1 and 2 eV; hundreds more optimal compositions and phases are thus discovered. We present an analysis of the screened and inverse-designed materials, visualize ternary phase diagrams generated for many systems of interest using ML predictions, and suggest strategies for further improvement and expansion in the future.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
Accelerating Defect Predictions in Semiconductors Using Graph Neural Networks
Authors:
Md Habibur Rahman,
Prince Gollapalli,
Panayotis Manganaris,
Satyesh Kumar Yadav,
Ghanshyam Pilania,
Brian DeCost,
Kamal Choudhary,
Arun Mannodi-Kanakkithodi
Abstract:
Here, we develop a framework for the prediction and screening of native defects and functional impurities in a chemical space of Group IV, III-V, and II-VI zinc blende (ZB) semiconductors, powered by crystal Graph-based Neural Networks (GNNs) trained on high-throughput density functional theory (DFT) data. Using an innovative approach of sampling partially optimized defect configurations from DFT…
▽ More
Here, we develop a framework for the prediction and screening of native defects and functional impurities in a chemical space of Group IV, III-V, and II-VI zinc blende (ZB) semiconductors, powered by crystal Graph-based Neural Networks (GNNs) trained on high-throughput density functional theory (DFT) data. Using an innovative approach of sampling partially optimized defect configurations from DFT calculations, we generate one of the largest computational defect datasets to date, containing many types of vacancies, self-interstitials, anti-site substitutions, impurity interstitials and substitutions, as well as some defect complexes. We applied three types of established GNN techniques, namely Crystal Graph Convolutional Neural Network (CGCNN), Materials Graph Network (MEGNET), and Atomistic Line Graph Neural Network (ALIGNN), to rigorously train models for predicting defect formation energy (DFE) in multiple charge states and chemical potential conditions. We find that ALIGNN yields the best DFE predictions with root mean square errors around 0.3 eV, which represents a prediction accuracy of 98 % given the range of values within the dataset, improving significantly on the state-of-the-art. Models are tested for different defect types as well as for defect charge transition levels. We further show that GNN-based defective structure optimization can take us close to DFT-optimized geometries at a fraction of the cost of full DFT. DFT-GNN models enable prediction and screening across thousands of hypothetical defects based on both unoptimized and partially-optimized defective structures, hel** identify electronically active defects in technologically-important semiconductors.
△ Less
Submitted 13 September, 2023; v1 submitted 12 September, 2023;
originally announced September 2023.
-
A High-Throughput Computational Dataset of Halide Perovskite Alloys
Authors:
Jiaqi Yang,
Panayotis Manganaris,
Arun Mannodi-Kanakkithodi
Abstract:
Novel halide perovskites with improved stability and optoelectronic properties can be designed via composition engineering at cation and/or anion sites. Data-driven methods, especially high-throughput first principles computations and subsequent analysis based on unique materials descriptors, are key to achieving this goal. In this work, we report a density functional theory (DFT) based dataset of…
▽ More
Novel halide perovskites with improved stability and optoelectronic properties can be designed via composition engineering at cation and/or anion sites. Data-driven methods, especially high-throughput first principles computations and subsequent analysis based on unique materials descriptors, are key to achieving this goal. In this work, we report a density functional theory (DFT) based dataset of 495 $ABX_3$ halide perovskite compounds, with various atomic and molecular species considered at A, B and X sites, and different amounts of mixing applied at each site using the special quasirandom structures (SQS) approach for alloys. We perform GGA-PBE calculations on all 495 pseudo-cubic perovskite structures and around 250 calculations using the HSE06 functional, with and without spin-orbit coupling, both including geometry optimization and static calculations on PBE optimized structures. Lattice constants, decomposition energy, band gap, and theoretical photovoltaic efficiency, are computed using each level of theory, and comparisons are made with collected experimental values. Trends in the data are unraveled in terms of the effects of mixing at different sites, fractions of specific elemental or molecular species present in the compound, and averaged physical properties of species at different sites. We perform screening across the perovskite dataset based on multiple definitions of tolerance factors, deviation from cubicity, computed stability and optoelectronic properties, leading to a list of promising compositions and design principles for achieving multiple desired properties. Our multi-objective, multi-fidelity, computational halide perovskite alloy dataset, one of the most comprehensive to date, is available open-source, and currently being used to train predictive and optimization models for accelerating the design of novel compositions for superior optoelectronic applications.
△ Less
Submitted 9 February, 2023;
originally announced February 2023.