-
A case study of multi-modal, multi-institutional data management for the combinatorial materials science community
Authors:
Sarah I. Allec,
Eric S. Muckley,
Nathan S. Johnson,
Christopher K. H. Borg,
Dylan J. Kirsch,
Joshua Martin,
Rohit Pant,
Ichiro Takeuchi,
Andrew S. Lee,
James E. Saal,
Logan Ward,
Apurva Mehta
Abstract:
Although the convergence of high-performance computing, automation, and machine learning has significantly altered the materials design timeline, transformative advances in functional materials and acceleration of their design will require addressing the deficiencies that currently exist in materials informatics, particularly a lack of standardized experimental data management. The challenges asso…
▽ More
Although the convergence of high-performance computing, automation, and machine learning has significantly altered the materials design timeline, transformative advances in functional materials and acceleration of their design will require addressing the deficiencies that currently exist in materials informatics, particularly a lack of standardized experimental data management. The challenges associated with experimental data management are especially true for combinatorial materials science, where advancements in automation of experimental workflows have produced datasets that are often too large and too complex for human reasoning. The data management challenge is further compounded by the multi-modal and multi-institutional nature of these datasets, as they tend to be distributed across multiple institutions and can vary substantially in format, size, and content. To adequately map a materials design space from such datasets, an ideal materials data infrastructure would contain data and metadata describing i) synthesis and processing conditions, ii) characterization results, and iii) property and performance measurements. Here, we present a case study for the low-barrier development of such a dashboard that enables standardized organization, analysis, and visualization of a large data lake consisting of combinatorial datasets of synthesis and processing conditions, X-ray diffraction patterns, and materials property measurements generated at several different institutions. While this dashboard was developed specifically for data-driven thermoelectric materials discovery, we envision the adaptation of this prototype to other materials applications, and, more ambitiously, future integration into an all-encompassing materials data management infrastructure.
△ Less
Submitted 6 February, 2024; v1 submitted 16 November, 2023;
originally announced November 2023.
-
Quantifying the performance of machine learning models in materials discovery
Authors:
Christopher K. H. Borg,
Eric S. Muckley,
Clara Nyby,
James E. Saal,
Logan Ward,
Apurva Mehta,
Bryce Meredig
Abstract:
The predictive capabilities of machine learning (ML) models used in materials discovery are typically measured using simple statistics such as the root-mean-square error (RMSE) or the coefficient of determination ($r^2$) between ML-predicted materials property values and their known values. A tempting assumption is that models with low error should be effective at guiding materials discovery, and…
▽ More
The predictive capabilities of machine learning (ML) models used in materials discovery are typically measured using simple statistics such as the root-mean-square error (RMSE) or the coefficient of determination ($r^2$) between ML-predicted materials property values and their known values. A tempting assumption is that models with low error should be effective at guiding materials discovery, and conversely, models with high error should give poor discovery performance. However, we observe that no clear connection exists between a "static" quantity averaged across an entire training set, such as RMSE, and an ML property model's ability to dynamically guide the iterative (and often extrapolative) discovery of novel materials with targeted properties. In this work, we simulate a sequential learning (SL)-guided materials discovery process and demonstrate a decoupling between traditional model error metrics and model performance in guiding materials discoveries. We show that model performance in materials discovery depends strongly on (1) the target range within the property distribution (e.g., whether a 1st or 10th decile material is desired); (2) the incorporation of uncertainty estimates in the SL acquisition function; (3) whether the scientist is interested in one discovery or many targets; and (4) how many SL iterations are allowed. To overcome the limitations of static metrics and robustly capture SL performance, we recommend metrics such as Discovery Yield ($DY$), a measure of how many high-performing materials were discovered during SL, and Discovery Probability ($DP$), a measure of likelihood of discovering high-performing materials at any point in the SL process.
△ Less
Submitted 24 October, 2022;
originally announced October 2022.
-
Map** Thermoelectric Transport in a Multicomponent Alloy Space
Authors:
Ramya Gurunathan,
Suchismita Sarker,
Christopher K. H. Borg,
James Saal,
Logan Ward,
Apurva Mehta,
G. Jeffrey Snyder
Abstract:
Interest in high entropy alloy thermoelectric materials is predicated on achieving ultralow lattice thermal conductivity $κ\sub{L}$ through large compositional disorder. However, here we show that for a given mechanism, such as mass contrast phonon scattering, $κ\sub{L}$ will be minimized along the binary alloy with the highest mass contrast, such that adding an intermediate-mass atom to increase…
▽ More
Interest in high entropy alloy thermoelectric materials is predicated on achieving ultralow lattice thermal conductivity $κ\sub{L}$ through large compositional disorder. However, here we show that for a given mechanism, such as mass contrast phonon scattering, $κ\sub{L}$ will be minimized along the binary alloy with the highest mass contrast, such that adding an intermediate-mass atom to increase atomic disorder can increase thermal conductivity. Only when each component adds an independent scattering mechanism (such as adding strain fluctuation to an existing mass fluctuation) is there a benefit. In addition, both charge carriers and heat-carrying phonons are known to experience scattering due to alloying effects, leading to a trade-off in thermoelectric performance. We apply analytic transport models, based on perturbation and effective medium theories, to predict how alloy scattering will affect the thermal and electronic transport across the full compositional range of several pseudo-ternary and pseudo-quaternary alloy systems. To do so, we demonstrate a multicomponent extension to both thermal and electronic binary alloy scattering models based on the virtual crystal approximation. Finally, we show that common functional forms used in computational thermodynamics can be applied to this problem to further generalize the scattering behavior that is modeled.
△ Less
Submitted 3 May, 2022;
originally announced May 2022.
-
Quantifying uncertainty in high-throughput density functional theory: a comparison of AFLOW, Materials Project, and OQMD
Authors:
Vinay I. Hegde,
Christopher K. H. Borg,
Zachary del Rosario,
Yoolhee Kim,
Maxwell Hutchinson,
Erin Antono,
Julia Ling,
Paul Saxe,
James E. Saal,
Bryce Meredig
Abstract:
A central challenge in high throughput density functional theory (HT-DFT) calculations is selecting a combination of input parameters and post-processing techniques that can be used across all materials classes, while also managing accuracy-cost tradeoffs. To investigate the effects of these parameter choices, we consolidate three large HT-DFT databases: Automatic-FLOW (AFLOW), the Materials Proje…
▽ More
A central challenge in high throughput density functional theory (HT-DFT) calculations is selecting a combination of input parameters and post-processing techniques that can be used across all materials classes, while also managing accuracy-cost tradeoffs. To investigate the effects of these parameter choices, we consolidate three large HT-DFT databases: Automatic-FLOW (AFLOW), the Materials Project (MP), and the Open Quantum Materials Database (OQMD), and compare reported properties across each pair of databases for materials calculated using the same initial crystal structure. We find that HT-DFT formation energies and volumes are generally more reproducible than band gaps and total magnetizations; for instance, a notable fraction of records disagree on whether a material is metallic (up to 7%) or magnetic (up to 15%). The variance between calculated properties is as high as 0.105 eV/atom (median relative absolute difference, or MRAD, of 6%) for formation energy, 0.65 Å$^3$/atom (MRAD of 4%) for volume, 0.21 eV (MRAD of 9%) for band gap, and 0.15 $μ_{\rm B}$/formula unit (MRAD of 8%) for total magnetization, comparable to the differences between DFT and experiment. We trace some of the larger discrepancies to choices involving pseudopotentials, the DFT+U formalism, and elemental reference states, and argue that further standardization of HT-DFT would be beneficial to reproducibility.
△ Less
Submitted 5 November, 2022; v1 submitted 3 July, 2020;
originally announced July 2020.
-
The preparation and phase diagrams of (${^{7}}$Li${_{1-x}}$Fe${_{x}}$OD)FeSe and (Li${_{1-x}}$Fe${_{x}}$OH)FeSe superconductors
Authors:
Xiuquan Zhou,
Christopher K. H. Borg,
Jeffrey W. Lynn,
Shanta R. Saha,
Johnpierre Paglione,
Efrain E. Rodriguez
Abstract:
We report the phase diagram for the superconducting system (${^{7}}$Li${_{1-x}}$Fe${_{x}}$OD)FeSe and contrast it with that of (Li${_{1-x}}$Fe${_{x}}$OH)FeSe both in single crystal and powder forms. Samples were prepared via hydrothermal methods and characterized with laboratory and synchrotron X-ray diffraction, high-resolution neutron powder diffraction (NPD), and high intensity NPD. We find a c…
▽ More
We report the phase diagram for the superconducting system (${^{7}}$Li${_{1-x}}$Fe${_{x}}$OD)FeSe and contrast it with that of (Li${_{1-x}}$Fe${_{x}}$OH)FeSe both in single crystal and powder forms. Samples were prepared via hydrothermal methods and characterized with laboratory and synchrotron X-ray diffraction, high-resolution neutron powder diffraction (NPD), and high intensity NPD. We find a correlation between the tetragonality of the unit cell parameters and the critical temperature, $T_{c}$, which is indicative of the effects of charge do** on the lattice and formation of iron vacancies in the FeSe layer. We observe no appreciable isotope effect on the maximum $T_{c}$ in substituting H by by D. The NPD measurements definitively rule out an antiferromagnetic ordering in the non-superconducting (Li${_{1-x}}$Fe${_{x}}$OD)FeSe samples below 120 K, which has been reported in non-superconducting (Li${_{1-x}}$Fe${_{x}}$OH)FeSe.$^{1}$ A likely explanation for the observed antiferromagnetic transition in (Li${_{1-x}}$Fe${_{x}}$OH)FeSe samples is the formation of impurities during their preparation such as Fe${_{3}}$O${_{4}}$ and LixFeO2, which express a charge ordering transition known as the Verwey transition near 120 K. The concentration of these oxide impurities is found to be dependent on the concentration of the lithium hydroxide reagent and the use of H${_{2}}$O vs. D${_{2}}$O as the solvent during synthesis. We also describe the reaction conditions that lead to some of our superconducting samples to exhibit ferromagnetism below $T_{c}$.
△ Less
Submitted 10 December, 2015;
originally announced December 2015.
-
Strong anisotropy in nearly ideal-tetrahedral superconducting FeS single crystals
Authors:
Christopher K. H. Borg,
Xiuquan Zhou,
Christopher Eckberg,
Daniel J. Campbell,
Shanta R. Saha,
Johnpierre Paglione,
Efrain E. Rodriguez
Abstract:
We report the novel preparation of single crystals of tetragonal iron sulfide, FeS, which exhibits a nearly ideal tetrahedral geometry with S--Fe--S bond angles of 110.2(2) $^\circ$ and 108.1(2) $^\circ$. Grown via hydrothermal de-intercalation of K${_x}$Fe${_{2-y}}$S${_2}$ crystals under basic and reducing conditions, the silver, plate-like crystals of FeS remain stable up to 200 $^\circ$C under…
▽ More
We report the novel preparation of single crystals of tetragonal iron sulfide, FeS, which exhibits a nearly ideal tetrahedral geometry with S--Fe--S bond angles of 110.2(2) $^\circ$ and 108.1(2) $^\circ$. Grown via hydrothermal de-intercalation of K${_x}$Fe${_{2-y}}$S${_2}$ crystals under basic and reducing conditions, the silver, plate-like crystals of FeS remain stable up to 200 $^\circ$C under air and 250 $^\circ$C under inert conditions, even though the mineral "mackinawite" (FeS) is known to be metastable. FeS single crystals exhibit a superconducting state below $T_c=4$ K as determined by electrical resistivity, magnetic susceptibility, and heat capacity measurements, confirming the presence of a bulk superconducting state. Normal state measurements yield an electronic specific heat of 5~mJ/mol-K$^2$, and paramagnetic, metallic behavior with a low residual resistivity of 250~$μΩ\cdot$cm. Magnetoresistance measurements performed as a function of magnetic field angle tilted toward both transverse and longitudinal orientations with respect to the applied current reveal remarkable two-dimensional behavior. This is paralleled in the superconducting state, which exhibits the largest known upper critical field $H_{c2}$ anisotropy of all iron-based superconductors, with $H_{c2}^{||ab}(0) / H_{c2}^{||c}(0)=$(2.75~T)/(0.275~T)=10. Comparisons to theoretical models for 2D and anisotropic-3D superconductors, however, suggest that FeS is the latter case with a large effective mass anisotropy. We place FeS in context to other closely related iron-based superconductors and discuss the role of structural parameters such as anion height on superconductivity.
△ Less
Submitted 13 January, 2016; v1 submitted 3 December, 2015;
originally announced December 2015.
-
Neutron investigation of the magnetic scattering in an iron-based ferromagnetic superconductor
Authors:
Jeffrey W. Lynn,
Xiuquan Zhou,
Christopher K. H. Borg,
Shanta R. Saha,
Johnpierre Paglione,
Efrain E. Rodriguez
Abstract:
Neutron diffraction and small angle scattering experiments have been carried out on the double-isotopic polycrystalline sample (7Li0.82Fe0.18OD)FeSe. Profile refinements of the diffraction data establish the composition and reveal an essentially single phase material with lattice parameters of a= 3.7827 Å and c= 9.1277 Å at 4 K, in the ferromagnetic-superconductor regime, with a bulk superconducti…
▽ More
Neutron diffraction and small angle scattering experiments have been carried out on the double-isotopic polycrystalline sample (7Li0.82Fe0.18OD)FeSe. Profile refinements of the diffraction data establish the composition and reveal an essentially single phase material with lattice parameters of a= 3.7827 Å and c= 9.1277 Å at 4 K, in the ferromagnetic-superconductor regime, with a bulk superconducting transition of TC = 18 K. Small angle neutron scattering (SANS) measurements in zero applied field reveal the onset of ferromagnetic order below TF ~ 12.5 K, with a wave vector and temperature dependence consistent with an inhomogeneous ferromagnet of spontaneous vortices or domains in a mixed state. No oscillatory long range ordered magnetic state is observed. Field dependent measurements establish a separate component of magnetic scattering from the vortex lattice, which occurs at the expected wave vector. The temperature dependence of the vortex scattering does not indicate any contribution from the ferromagnetism, consistent with diffraction data that indicate that the ordered ferromagnetic moment is quite small.
△ Less
Submitted 15 July, 2015;
originally announced July 2015.