-
Using graph neural networks to reconstruct charged pion showers in the CMS High Granularity Calorimeter
Authors:
M. Aamir,
B. Acar,
G. Adamov,
T. Adams,
C. Adloff,
S. Afanasiev,
C. Agrawal,
C. Agrawal,
A. Ahmad,
H. A. Ahmed,
S. Akbar,
N. Akchurin,
B. Akgul,
B. Akgun,
R. O. Akpinar,
E. Aktas,
A. AlKadhim,
V. Alexakhin,
J. Alimena,
J. Alison,
A. Alpana,
W. Alshehri,
P. Alvarez Dominguez,
M. Alyari,
C. Amendola
, et al. (550 additional authors not shown)
Abstract:
A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadr…
▽ More
A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadronic section. The shower reconstruction method is based on graph neural networks and it makes use of a dynamic reduction network architecture. It is shown that the algorithm is able to capture and mitigate the main effects that normally hinder the reconstruction of hadronic showers using classical reconstruction methods, by compensating for fluctuations in the multiplicity, energy, and spatial distributions of the shower's constituents. The performance of the algorithm is evaluated using test beam data collected in 2018 prototype of the CMS HGCAL accompanied by a section of the CALICE AHCAL prototype. The capability of the method to mitigate the impact of energy leakage from the calorimeter is also demonstrated.
△ Less
Submitted 30 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Towards Informatics-Driven Design of Nuclear Waste Forms
Authors:
Vinay I. Hegde,
Miroslava Peterson,
Sarah I. Allec,
Xiaonan Lu,
Thiruvillamalai Mahadevan,
Thanh Nguyen,
Jayani Kalahe,
Jared Oshiro,
Robert J. Seffens,
Ethan K. Nickerson,
**cheng Du,
Brian J. Riley,
John D. Vienna,
James E. Saal
Abstract:
Informatics-driven approaches, such as machine learning and sequential experimental design, have shown the potential to drastically impact next-generation materials discovery and design. In this perspective, we present a few guiding principles for applying informatics-based methods towards the design of novel nuclear waste forms. We advocate for adopting a system design approach, and describe the…
▽ More
Informatics-driven approaches, such as machine learning and sequential experimental design, have shown the potential to drastically impact next-generation materials discovery and design. In this perspective, we present a few guiding principles for applying informatics-based methods towards the design of novel nuclear waste forms. We advocate for adopting a system design approach, and describe the effective usage of data-driven methods in every stage of such a design process. We demonstrate how this approach can optimally leverage physics-based simulations, machine learning surrogates, and experimental synthesis and characterization, within a feedback-driven closed-loop sequential learning framework. We discuss the importance of incorporating domain knowledge into the representation of materials, the construction and curation of datasets, the development of predictive property models, and the design and execution of experiments. We illustrate the application of this approach by successfully designing and validating Na- and Nd-containing phosphate-based ceramic waste forms. Finally, we discuss open challenges in such informatics-driven workflows and present an outlook for their widespread application for the cleanup of nuclear wastes.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Evaluation of GlassNet for physics-informed machine learning of glass stability and glass-forming ability
Authors:
Sarah I. Allec,
Xiaonan Lu,
Daniel R. Cassar,
Xuan T. Nguyen,
Vinay I. Hegde,
Thiruvillamalai Mahadevan,
Miroslava Peterson,
**cheng Du,
Brian J. Riley,
John D. Vienna,
James E. Saal
Abstract:
Glasses form the basis of many modern applications and also hold great potential for future medical and environmental applications. However, their structural complexity and large composition space make design and optimization challenging for certain applications. Of particular importance for glass processing is an estimate of a given composition's glass-forming ability (GFA). However, there remain…
▽ More
Glasses form the basis of many modern applications and also hold great potential for future medical and environmental applications. However, their structural complexity and large composition space make design and optimization challenging for certain applications. Of particular importance for glass processing is an estimate of a given composition's glass-forming ability (GFA). However, there remain many open questions regarding the physical mechanisms of glass formation, especially in oxide glasses. It is apparent that a proxy for GFA would be highly useful in glass processing and design, but identifying such a surrogate property has proven itself to be difficult. Here, we explore the application of an open-source pre-trained NN model, GlassNet, that can predict the characteristic temperatures necessary to compute glass stability (GS) and assess the feasibility of using these physics-informed ML (PIML)-predicted GS parameters to estimate GFA. In doing so, we track the uncertainties at each step of the computation - from the original ML prediction errors, to the compounding of errors during GS estimation, and finally to the final estimation of GFA. While GlassNet exhibits reasonable accuracy on all individual properties, we observe a large compounding of error in the combination of these individual predictions for the prediction of GS, finding that random forest models offer similar accuracy to GlassNet. We also breakdown the ML performance on different glass families and find that the error in GS prediction is correlated with the error in crystallization peak temperature prediction. Lastly, we utilize this finding to assess the relationship between top-performing GS parameters and GFA for two ternary glass systems: sodium borosilicate and sodium iron phosphate glasses. We conclude that to obtain true ML predictive capability of GFA, significantly more data needs to be collected.
△ Less
Submitted 19 March, 2024; v1 submitted 15 March, 2024;
originally announced March 2024.
-
Performance Characterization of Containerized DNN Training and Inference on Edge Accelerators
Authors:
Prashanthi S. K.,
Vinayaka Hegde,
Keerthana Patchava,
Ankita Das,
Yogesh Simmhan
Abstract:
Edge devices have typically been used for DNN inferencing. The increase in the compute power of accelerated edges is leading to their use in DNN training also. As privacy becomes a concern on multi-tenant edge devices, Docker containers provide a lightweight virtualization mechanism to sandbox models. But their overheads for edge devices are not yet explored. In this work, we study the impact of c…
▽ More
Edge devices have typically been used for DNN inferencing. The increase in the compute power of accelerated edges is leading to their use in DNN training also. As privacy becomes a concern on multi-tenant edge devices, Docker containers provide a lightweight virtualization mechanism to sandbox models. But their overheads for edge devices are not yet explored. In this work, we study the impact of containerized DNN inference and training workloads on an NVIDIA AGX Orin edge device and contrast it against bare metal execution on running time, CPU, GPU and memory utilization, and energy consumption. Our analysis provides several interesting insights on these overheads.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Rotation of a Stealth CME on 2012 October 5 Observed in the Inner Heliosphere
Authors:
Sandeep Kumar,
Dinesha V. Hegde,
Nandita Srivastava,
Nikolai V. Pogorelov,
Nat Gopalswamy,
Seiji Yashiro
Abstract:
Coronal Mass Ejections (CMEs) are subject to changes in their direction of propagation, tilt, and other properties. This is because CMEs interact with the ambient solar wind and other large-scale magnetic field structures. In this work, we report on the observations of the 2012 October 5 stealth CME using coronagraphic and heliospheric images. We find clear evidence of a continuous rotation of the…
▽ More
Coronal Mass Ejections (CMEs) are subject to changes in their direction of propagation, tilt, and other properties. This is because CMEs interact with the ambient solar wind and other large-scale magnetic field structures. In this work, we report on the observations of the 2012 October 5 stealth CME using coronagraphic and heliospheric images. We find clear evidence of a continuous rotation of the CME, i.e., an increase in the tilt angle, estimated using the Graduated Cylindrical Shell (GCS) reconstruction at different heliocentric distances, up to 58 solar radii. We find a further increase in the tilt at L1 estimated from the toroidal and cylindrical flux rope fitting on the in situ observations of IMF and solar wind parameters. This study highlights the importance of observations of Heliospheric Imager (HI), onboard the Solar TErrestrial RElations Observatory (STEREO). In particular, the GCS reconstruction of CMEs in HI field-of-view promises to bridge the gap between the near-Sun and in-situ observations at the L1. The changes in the CME tilt has significant implications for the space weather impact of stealth CMEs.
△ Less
Submitted 6 October, 2023;
originally announced October 2023.
-
Element similarity in high-dimensional materials representations
Authors:
Anthony Onwuli,
Ashish V. Hegde,
Kevin Nguyen,
Keith T. Butler,
Aron Walsh
Abstract:
The traditional display of elements in the periodic table is convenient for the study of chemistry and physics. However, the atomic number alone is insufficient for training statistical machine learning models to describe and extract composition-structure-property relationships. Here, we assess the similarity and correlations contained within high-dimensional local and distributed representations…
▽ More
The traditional display of elements in the periodic table is convenient for the study of chemistry and physics. However, the atomic number alone is insufficient for training statistical machine learning models to describe and extract composition-structure-property relationships. Here, we assess the similarity and correlations contained within high-dimensional local and distributed representations of the chemical elements, as implemented in an open-source Python package ElementEmbeddings. These include element vectors of up to 200 dimensions derived from known physical properties, crystal structure analysis, natural language processing, and deep learning models. A range of distance measures are compared and a clustering of elements into familiar groups is found using dimensionality reduction techniques. The cosine similarity is used to assess the utility of these metrics for crystal structure prediction, showing that they can outperform the traditional radius ratio rules for the structural classification of AB binary solids.
△ Less
Submitted 24 August, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
By how much can closed-loop frameworks accelerate computational materials discovery?
Authors:
Lance Kavalsky,
Vinay I. Hegde,
Eric Muckley,
Matthew S. Johnson,
Bryce Meredig,
Venkatasubramanian Viswanathan
Abstract:
The implementation of automation and machine learning surrogatization within closed-loop computational workflows is an increasingly popular approach to accelerate materials discovery. However, the scale of the speedup associated with this paradigm shift from traditional manual approaches remains an open question. In this work, we rigorously quantify the acceleration from each of the components wit…
▽ More
The implementation of automation and machine learning surrogatization within closed-loop computational workflows is an increasingly popular approach to accelerate materials discovery. However, the scale of the speedup associated with this paradigm shift from traditional manual approaches remains an open question. In this work, we rigorously quantify the acceleration from each of the components within a closed-loop framework for material hypothesis evaluation by identifying four distinct sources of speedup: (1) task automation, (2) calculation runtime improvements, (3) sequential learning-driven design space search, and (4) surrogatization of expensive simulations with machine learning models. This is done using a time-kee** ledger to record runs of automated software and corresponding manual computational experiments within the context of electrocatalysis. From a combination of the first three sources of acceleration, we estimate that overall hypothesis evaluation time can be reduced by over 90%, i.e., achieving a speedup of $\sim$$10\times$. Further, by introducing surrogatization into the loop, we estimate that the design time can be reduced by over 95%, i.e., achieving a speedup of $\sim$$15$-$20\times$. Our findings present a clear value proposition for utilizing closed-loop approaches for accelerating materials discovery.
△ Less
Submitted 23 November, 2022; v1 submitted 18 November, 2022;
originally announced November 2022.
-
Performance of the CMS High Granularity Calorimeter prototype to charged pion beams of 20$-$300 GeV/c
Authors:
B. Acar,
G. Adamov,
C. Adloff,
S. Afanasiev,
N. Akchurin,
B. Akgün,
M. Alhusseini,
J. Alison,
J. P. Figueiredo de sa Sousa de Almeida,
P. G. Dias de Almeida,
A. Alpana,
M. Alyari,
I. Andreev,
U. Aras,
P. Aspell,
I. O. Atakisi,
O. Bach,
A. Baden,
G. Bakas,
A. Bakshi,
S. Banerjee,
P. DeBarbaro,
P. Bargassa,
D. Barney,
F. Beaudette
, et al. (435 additional authors not shown)
Abstract:
The upgrade of the CMS experiment for the high luminosity operation of the LHC comprises the replacement of the current endcap calorimeter by a high granularity sampling calorimeter (HGCAL). The electromagnetic section of the HGCAL is based on silicon sensors interspersed between lead and copper (or copper tungsten) absorbers. The hadronic section uses layers of stainless steel as an absorbing med…
▽ More
The upgrade of the CMS experiment for the high luminosity operation of the LHC comprises the replacement of the current endcap calorimeter by a high granularity sampling calorimeter (HGCAL). The electromagnetic section of the HGCAL is based on silicon sensors interspersed between lead and copper (or copper tungsten) absorbers. The hadronic section uses layers of stainless steel as an absorbing medium and silicon sensors as an active medium in the regions of high radiation exposure, and scintillator tiles directly readout by silicon photomultipliers in the remaining regions. As part of the development of the detector and its readout electronic components, a section of a silicon-based HGCAL prototype detector along with a section of the CALICE AHCAL prototype was exposed to muons, electrons and charged pions in beam test experiments at the H2 beamline at the CERN SPS in October 2018. The AHCAL uses the same technology as foreseen for the HGCAL but with much finer longitudinal segmentation. The performance of the calorimeters in terms of energy response and resolution, longitudinal and transverse shower profiles is studied using negatively charged pions, and is compared to GEANT4 predictions. This is the first report summarizing results of hadronic showers measured by the HGCAL prototype using beam test data.
△ Less
Submitted 27 May, 2023; v1 submitted 9 November, 2022;
originally announced November 2022.
-
Current Status and Future Prospects for the Light Dark Matter eXperiment
Authors:
Torsten Åkesson,
Nikita Blinov,
Lukas Brand-Baugher,
Cameron Bravo,
Lene Kristian Bryngemark,
Pierfrancesco Butti,
Caterina Doglioni,
Craig Dukes,
Valentina Dutta,
Bertrand Echenard,
Ralf Ehrlich,
Thomas Eichlersmith,
Andrew Furmanski,
Chloe Greenstein,
Craig Group,
Niramay Gogate,
Vinay Hegde,
Christian Herwig,
David G. Hitlin,
Duc Hoang,
Tyler Horoho,
Joseph Incandela,
Wesley Ketchum,
Gordan Krnjaic,
Amina Li
, et al. (23 additional authors not shown)
Abstract:
The constituents of dark matter are still unknown, and the viable possibilities span a vast range of masses. The physics community has established searching for sub-GeV dark matter as a high priority and identified accelerator-based experiments as an essential facet of this search strategy. A key goal of the accelerator-based dark matter program is testing the broad idea of thermally produced sub-…
▽ More
The constituents of dark matter are still unknown, and the viable possibilities span a vast range of masses. The physics community has established searching for sub-GeV dark matter as a high priority and identified accelerator-based experiments as an essential facet of this search strategy. A key goal of the accelerator-based dark matter program is testing the broad idea of thermally produced sub-GeV dark matter through experiments designed to directly produce dark matter particles. The most sensitive way to search for the production of light dark matter is to use a primary electron beam to produce it in fixed-target collisions. The Light Dark Matter eXperiment (LDMX) is an electron-beam fixed-target missing-momentum experiment that realizes this approach and provides unique sensitivity to light dark matter in the sub-GeV range. This contribution provides an overview of the theoretical motivation, the main experimental challenges, how LDMX addresses these challenges, and projected sensitivities. We further describe the capabilities of LDMX to explore other interesting new and standard physics, such as visibly-decaying axion and vector mediators or rare meson decays, and to provide timely electronuclear scattering measurements that will inform the modeling of neutrino-nucleus scattering for DUNE.
△ Less
Submitted 21 August, 2023; v1 submitted 15 March, 2022;
originally announced March 2022.
-
Online Estimation and Optimization of Utility-Based Shortfall Risk
Authors:
Vishwajit Hegde,
Arvind S. Menon,
L. A. Prashanth,
Krishna Jagannathan
Abstract:
Utility-Based Shortfall Risk (UBSR) is a risk metric that is increasingly popular in financial applications, owing to certain desirable properties that it enjoys. We consider the problem of estimating UBSR in a recursive setting, where samples from the underlying loss distribution are available one-at-a-time. We cast the UBSR estimation problem as a root finding problem, and propose stochastic app…
▽ More
Utility-Based Shortfall Risk (UBSR) is a risk metric that is increasingly popular in financial applications, owing to certain desirable properties that it enjoys. We consider the problem of estimating UBSR in a recursive setting, where samples from the underlying loss distribution are available one-at-a-time. We cast the UBSR estimation problem as a root finding problem, and propose stochastic approximation-based estimations schemes. We derive non-asymptotic bounds on the estimation error in the number of samples. We also consider the problem of UBSR optimization within a parameterized class of random variables. We propose a stochastic gradient descent based algorithm for UBSR optimization, and derive non-asymptotic bounds on its convergence.
△ Less
Submitted 27 November, 2023; v1 submitted 16 November, 2021;
originally announced November 2021.
-
Response of a CMS HGCAL silicon-pad electromagnetic calorimeter prototype to 20-300 GeV positrons
Authors:
B. Acar,
G. Adamov,
C. Adloff,
S. Afanasiev,
N. Akchurin,
B. Akgün,
F. Alam Khan,
M. Alhusseini,
J. Alison,
A. Alpana,
G. Altopp,
M. Alyari,
S. An,
S. Anagul,
I. Andreev,
P. Aspell,
I. O. Atakisi,
O. Bach,
A. Baden,
G. Bakas,
A. Bakshi,
S. Bannerjee,
P. Bargassa,
D. Barney,
F. Beaudette
, et al. (364 additional authors not shown)
Abstract:
The Compact Muon Solenoid Collaboration is designing a new high-granularity endcap calorimeter, HGCAL, to be installed later this decade. As part of this development work, a prototype system was built, with an electromagnetic section consisting of 14 double-sided structures, providing 28 sampling layers. Each sampling layer has an hexagonal module, where a multipad large-area silicon sensor is glu…
▽ More
The Compact Muon Solenoid Collaboration is designing a new high-granularity endcap calorimeter, HGCAL, to be installed later this decade. As part of this development work, a prototype system was built, with an electromagnetic section consisting of 14 double-sided structures, providing 28 sampling layers. Each sampling layer has an hexagonal module, where a multipad large-area silicon sensor is glued between an electronics circuit board and a metal baseplate. The sensor pads of approximately 1 cm$^2$ are wire-bonded to the circuit board and are readout by custom integrated circuits. The prototype was extensively tested with beams at CERN's Super Proton Synchrotron in 2018. Based on the data collected with beams of positrons, with energies ranging from 20 to 300 GeV, measurements of the energy resolution and linearity, the position and angular resolutions, and the shower shapes are presented and compared to a detailed Geant4 simulation.
△ Less
Submitted 31 March, 2022; v1 submitted 12 November, 2021;
originally announced November 2021.
-
OPTIMADE, an API for exchanging materials data
Authors:
Casper W. Andersen,
Rickard Armiento,
Evgeny Blokhin,
Gareth J. Conduit,
Shyam Dwaraknath,
Matthew L. Evans,
Ádám Fekete,
Abhijith Gopakumar,
Saulius Gražulis,
Andrius Merkys,
Fawzi Mohamed,
Corey Oses,
Giovanni Pizzi,
Gian-Marco Rignanese,
Markus Scheidgen,
Leopold Talirz,
Cormac Toher,
Donald Winston,
Rossella Aversa,
Kamal Choudhary,
Pauline Colinet,
Stefano Curtarolo,
Davide Di Stefano,
Claudia Draxl,
Suleyman Er
, et al. (31 additional authors not shown)
Abstract:
The Open Databases Integration for Materials Design (OPTIMADE) consortium has designed a universal application programming interface (API) to make materials databases accessible and interoperable. We outline the first stable release of the specification, v1.0, which is already supported by many leading databases and several software packages. We illustrate the advantages of the OPTIMADE API throug…
▽ More
The Open Databases Integration for Materials Design (OPTIMADE) consortium has designed a universal application programming interface (API) to make materials databases accessible and interoperable. We outline the first stable release of the specification, v1.0, which is already supported by many leading databases and several software packages. We illustrate the advantages of the OPTIMADE API through worked examples on each of the public materials databases that support the full API specification.
△ Less
Submitted 25 August, 2021; v1 submitted 2 March, 2021;
originally announced March 2021.
-
Construction and commissioning of CMS CE prototype silicon modules
Authors:
B. Acar,
G. Adamov,
C. Adloff,
S. Afanasiev,
N. Akchurin,
B. Akgün,
M. Alhusseini,
J. Alison,
G. Altopp,
M. Alyari,
S. An,
S. Anagul,
I. Andreev,
M. Andrews,
P. Aspell,
I. A. Atakisi,
O. Bach,
A. Baden,
G. Bakas,
A. Bakshi,
P. Bargassa,
D. Barney,
E. Becheva,
P. Behera,
A. Belloni
, et al. (307 additional authors not shown)
Abstract:
As part of its HL-LHC upgrade program, the CMS Collaboration is develo** a High Granularity Calorimeter (CE) to replace the existing endcap calorimeters. The CE is a sampling calorimeter with unprecedented transverse and longitudinal readout for both electromagnetic (CE-E) and hadronic (CE-H) compartments. The calorimeter will be built with $\sim$30,000 hexagonal silicon modules. Prototype modul…
▽ More
As part of its HL-LHC upgrade program, the CMS Collaboration is develo** a High Granularity Calorimeter (CE) to replace the existing endcap calorimeters. The CE is a sampling calorimeter with unprecedented transverse and longitudinal readout for both electromagnetic (CE-E) and hadronic (CE-H) compartments. The calorimeter will be built with $\sim$30,000 hexagonal silicon modules. Prototype modules have been constructed with 6-inch hexagonal silicon sensors with cell areas of 1.1~$cm^2$, and the SKIROC2-CMS readout ASIC. Beam tests of different sampling configurations were conducted with the prototype modules at DESY and CERN in 2017 and 2018. This paper describes the construction and commissioning of the CE calorimeter prototype, the silicon modules used in the construction, their basic performance, and the methods used for their calibration.
△ Less
Submitted 10 December, 2020;
originally announced December 2020.
-
The DAQ system of the 12,000 Channel CMS High Granularity Calorimeter Prototype
Authors:
B. Acar,
G. Adamov,
C. Adloff,
S. Afanasiev,
N. Akchurin,
B. Akgün,
M. Alhusseini,
J. Alison,
G. Altopp,
M. Alyari,
S. An,
S. Anagul,
I. Andreev,
M. Andrews,
P. Aspell,
I. A. Atakisi,
O. Bach,
A. Baden,
G. Bakas,
A. Bakshi,
P. Bargassa,
D. Barney,
E. Becheva,
P. Behera,
A. Belloni
, et al. (307 additional authors not shown)
Abstract:
The CMS experiment at the CERN LHC will be upgraded to accommodate the 5-fold increase in the instantaneous luminosity expected at the High-Luminosity LHC (HL-LHC). Concomitant with this increase will be an increase in the number of interactions in each bunch crossing and a significant increase in the total ionising dose and fluence. One part of this upgrade is the replacement of the current endca…
▽ More
The CMS experiment at the CERN LHC will be upgraded to accommodate the 5-fold increase in the instantaneous luminosity expected at the High-Luminosity LHC (HL-LHC). Concomitant with this increase will be an increase in the number of interactions in each bunch crossing and a significant increase in the total ionising dose and fluence. One part of this upgrade is the replacement of the current endcap calorimeters with a high granularity sampling calorimeter equipped with silicon sensors, designed to manage the high collision rates. As part of the development of this calorimeter, a series of beam tests have been conducted with different sampling configurations using prototype segmented silicon detectors. In the most recent of these tests, conducted in late 2018 at the CERN SPS, the performance of a prototype calorimeter equipped with ${\approx}12,000\rm{~channels}$ of silicon sensors was studied with beams of high-energy electrons, pions and muons. This paper describes the custom-built scalable data acquisition system that was built with readily available FPGA mezzanines and low-cost Raspberry PI computers.
△ Less
Submitted 8 December, 2020; v1 submitted 7 December, 2020;
originally announced December 2020.
-
AutoMat: Accelerated Computational Electrochemical systems Discovery
Authors:
Emil Annevelink,
Rachel Kurchin,
Eric Muckley,
Lance Kavalsky,
Vinay I. Hegde,
Valentin Sulzer,
Shang Zhu,
Jiankun Pu,
David Farina,
Matthew Johnson,
Dhairya Gandhi,
Adarsh Dave,
Hongyi Lin,
Alan Edelman,
Bharath Ramsundar,
James Saal,
Christopher Rackauckas,
Viral Shah,
Bryce Meredig,
Venkatasubramanian Viswanathan
Abstract:
Large-scale electrification is vital to addressing the climate crisis, but several scientific and technological challenges remain to fully electrify both the chemical industry and transportation. In both of these areas, new electrochemical materials will be critical, but their development currently relies heavily on human-time-intensive experimental trial and error and computationally expensive fi…
▽ More
Large-scale electrification is vital to addressing the climate crisis, but several scientific and technological challenges remain to fully electrify both the chemical industry and transportation. In both of these areas, new electrochemical materials will be critical, but their development currently relies heavily on human-time-intensive experimental trial and error and computationally expensive first-principles, meso-scale and continuum simulations. We present an automated workflow, AutoMat, that accelerates these computational steps by introducing both automated input generation and management of simulations across scales from first principles to continuum device modeling. Furthermore, we show how to seamlessly integrate multi-fidelity predictions such as machine learning surrogates or automated robotic experiments "in-the-loop". The automated framework is implemented with design space search techniques to dramatically accelerate the overall materials discovery pipeline by implicitly learning design features that optimize device performance across several metrics. We discuss the benefits of AutoMat using examples in electrocatalysis and energy storage and highlight lessons learned.
△ Less
Submitted 13 May, 2022; v1 submitted 3 November, 2020;
originally announced November 2020.
-
Quantifying uncertainty in high-throughput density functional theory: a comparison of AFLOW, Materials Project, and OQMD
Authors:
Vinay I. Hegde,
Christopher K. H. Borg,
Zachary del Rosario,
Yoolhee Kim,
Maxwell Hutchinson,
Erin Antono,
Julia Ling,
Paul Saxe,
James E. Saal,
Bryce Meredig
Abstract:
A central challenge in high throughput density functional theory (HT-DFT) calculations is selecting a combination of input parameters and post-processing techniques that can be used across all materials classes, while also managing accuracy-cost tradeoffs. To investigate the effects of these parameter choices, we consolidate three large HT-DFT databases: Automatic-FLOW (AFLOW), the Materials Proje…
▽ More
A central challenge in high throughput density functional theory (HT-DFT) calculations is selecting a combination of input parameters and post-processing techniques that can be used across all materials classes, while also managing accuracy-cost tradeoffs. To investigate the effects of these parameter choices, we consolidate three large HT-DFT databases: Automatic-FLOW (AFLOW), the Materials Project (MP), and the Open Quantum Materials Database (OQMD), and compare reported properties across each pair of databases for materials calculated using the same initial crystal structure. We find that HT-DFT formation energies and volumes are generally more reproducible than band gaps and total magnetizations; for instance, a notable fraction of records disagree on whether a material is metallic (up to 7%) or magnetic (up to 15%). The variance between calculated properties is as high as 0.105 eV/atom (median relative absolute difference, or MRAD, of 6%) for formation energy, 0.65 Å$^3$/atom (MRAD of 4%) for volume, 0.21 eV (MRAD of 9%) for band gap, and 0.15 $μ_{\rm B}$/formula unit (MRAD of 8%) for total magnetization, comparable to the differences between DFT and experiment. We trace some of the larger discrepancies to choices involving pseudopotentials, the DFT+U formalism, and elemental reference states, and argue that further standardization of HT-DFT would be beneficial to reproducibility.
△ Less
Submitted 5 November, 2022; v1 submitted 3 July, 2020;
originally announced July 2020.
-
Knee Cartilage Segmentation Using Diffusion-Weighted MRI
Authors:
Alejandra Duarte,
Chaitra V. Hegde,
Aakash Kaku,
Sreyas Mohan,
José G. Raya
Abstract:
The integrity of articular cartilage is a crucial aspect in the early diagnosis of osteoarthritis (OA). Many novel MRI techniques have the potential to assess compositional changes of the cartilage extracellular matrix. Among these techniques, diffusion tensor imaging (DTI) of cartilage provides a simultaneous assessment of the two principal components of the solid matrix: collagen structure and p…
▽ More
The integrity of articular cartilage is a crucial aspect in the early diagnosis of osteoarthritis (OA). Many novel MRI techniques have the potential to assess compositional changes of the cartilage extracellular matrix. Among these techniques, diffusion tensor imaging (DTI) of cartilage provides a simultaneous assessment of the two principal components of the solid matrix: collagen structure and proteoglycan concentration. DTI, as for any other compositional MRI technique, require a human expert to perform segmentation manually. The manual segmentation is error-prone and time-consuming ($\sim$ few hours per subject). We use an ensemble of modified U-Nets to automate this segmentation task. We benchmark our model against a human expert test-retest segmentation and conclude that our model is superior for Patellar and Tibial cartilage using dice score as the comparison metric. In the end, we do a perturbation analysis to understand the sensitivity of our model to the different components of our input. We also provide confidence maps for the predictions so that radiologists can tweak the model predictions as required. The model has been deployed in practice. In conclusion, cartilage segmentation on DW-MRI images with modified U-Nets achieves accuracy that outperforms the human segmenter. Code is available at https://github.com/aakashrkaku/knee-cartilage-segmentation
△ Less
Submitted 4 December, 2019;
originally announced December 2019.
-
DARTS: DenseUnet-based Automatic Rapid Tool for brain Segmentation
Authors:
Aakash Kaku,
Chaitra V. Hegde,
Jeffrey Huang,
Sohae Chung,
Xiuyuan Wang,
Matthew Young,
Alireza Radmanesh,
Yvonne W. Lui,
Narges Razavian
Abstract:
Quantitative, volumetric analysis of Magnetic Resonance Imaging (MRI) is a fundamental way researchers study the brain in a host of neurological conditions including normal maturation and aging. Despite the availability of open-source brain segmentation software, widespread clinical adoption of volumetric analysis has been hindered due to processing times and reliance on manual corrections. Here,…
▽ More
Quantitative, volumetric analysis of Magnetic Resonance Imaging (MRI) is a fundamental way researchers study the brain in a host of neurological conditions including normal maturation and aging. Despite the availability of open-source brain segmentation software, widespread clinical adoption of volumetric analysis has been hindered due to processing times and reliance on manual corrections. Here, we extend the use of deep learning models from proof-of-concept, as previously reported, to present a comprehensive segmentation of cortical and deep gray matter brain structures matching the standard regions of aseg+aparc included in the commonly used open-source tool, Freesurfer. The work presented here provides a real-life, rapid deep learning-based brain segmentation tool to enable clinical translation as well as research application of quantitative brain segmentation. The advantages of the presented tool include short (~1 minute) processing time and improved segmentation quality. This is the first study to perform quick and accurate segmentation of 102 brain regions based on the surface-based protocol (DMK protocol), widely used by experts in the field. This is also the first work to include an expert reader study to assess the quality of the segmentation obtained using a deep-learning-based model. We show the superior performance of our deep-learning-based models over the traditional segmentation tool, Freesurfer. We refer to the proposed deep learning-based tool as DARTS (DenseUnet-based Automatic Rapid Tool for brain Segmentation). Our tool and trained models are available at https://github.com/NYUMedML/DARTS
△ Less
Submitted 14 November, 2019; v1 submitted 13 November, 2019;
originally announced November 2019.
-
Ternary mixed-anion semiconductors with tunable band gaps from machine-learning and crystal structure prediction
Authors:
Maximilian Amsler,
Logan Ward,
Vinay I. Hegde,
Maarten G. Goesten,
Xia Yi,
Chris Wolverton
Abstract:
We report the computational investigation of a series of ternary X$_4$Y$_2$Z and X$_5$Y$_2$Z$_2$ compounds with X={Mg, Ca, Sr, Ba}, Y={P, As, Sb, Bi}, and Z={S, Se, Te}. The compositions for these materials were predicted through a search guided by machine learning, while the structures were resolved using the minima hop** crystal structure prediction method. Based on $\textit{ab initio}$ calcul…
▽ More
We report the computational investigation of a series of ternary X$_4$Y$_2$Z and X$_5$Y$_2$Z$_2$ compounds with X={Mg, Ca, Sr, Ba}, Y={P, As, Sb, Bi}, and Z={S, Se, Te}. The compositions for these materials were predicted through a search guided by machine learning, while the structures were resolved using the minima hop** crystal structure prediction method. Based on $\textit{ab initio}$ calculations, we predict that many of these compounds are thermodynamically stable. In particular, 21 of the X$_4$Y$_2$Z compounds crystallize in a tetragonal structure with $\textit{I-42d}$ symmetry, and exhibit band gaps in the range of 0.3 and 1.8 eV, well suited for various energy applications. We show that several candidate compounds (in particular X$_4$Y$_2$Te and X$_4$Sb$_2$Se) exhibit good photo absorption in the visible range, while others (e.g., Ba$_4$Sb$_2$Se) show excellent thermoelectric performance due to a high power factor and extremely low lattice thermal conductivities.
△ Less
Submitted 6 December, 2018;
originally announced December 2018.
-
The Phase Stability Network of all Inorganic Materials
Authors:
Vinay I. Hegde,
Muratahan Aykol,
Scott Kirklin,
Chris Wolverton
Abstract:
One of the holy grails of materials science, unlocking structure-property relationships, has largely been pursued via bottom-up investigations of how the arrangement of atoms and interatomic bonding in a material determine its macroscopic behavior. Here we consider a complementary approach, a top-down study of the organizational structure of networks of materials, based on the interaction between…
▽ More
One of the holy grails of materials science, unlocking structure-property relationships, has largely been pursued via bottom-up investigations of how the arrangement of atoms and interatomic bonding in a material determine its macroscopic behavior. Here we consider a complementary approach, a top-down study of the organizational structure of networks of materials, based on the interaction between materials themselves. We unravel the complete "phase stability network of all inorganic materials" as a densely-connected complex network of 21,000 thermodynamically stable compounds (nodes) interlinked by 41 million tie-lines (edges) defining their two-phase equilibria, as computed by high-throughput density functional theory. We find that the node connectivity in the materials network has a lognormal distribution, and the connectivity decreases with the number of elemental constituents in a material. Analyzing the topology of this network of materials has the potential to uncover new knowledge inaccessible from traditional atoms-to-materials paradigms. Using the connectivity of nodes in the phase stability network, we derive a rational, data-driven metric for material reactivity, the "nobility index", and quantitatively identify the noblest materials in nature.
△ Less
Submitted 5 November, 2022; v1 submitted 31 August, 2018;
originally announced August 2018.
-
Network analysis of synthesizable materials discovery
Authors:
Muratahan Aykol,
Vinay I. Hegde,
Linda Hung,
Santosh Suram,
Patrick Herring,
Chris Wolverton,
Jens S. Hummelshøj
Abstract:
Assessing the synthesizability of inorganic materials is a grand challenge for accelerating their discovery using computations. Synthesis of a material is a complex process that depends not only on its thermodynamic stability with respect to others, but also on factors from kinetics, to advances in synthesis techniques, to the availability of precursors. This complexity makes the development of a…
▽ More
Assessing the synthesizability of inorganic materials is a grand challenge for accelerating their discovery using computations. Synthesis of a material is a complex process that depends not only on its thermodynamic stability with respect to others, but also on factors from kinetics, to advances in synthesis techniques, to the availability of precursors. This complexity makes the development of a general theory or first-principles approach to synthesizability currently impractical. Here we show how an alternative pathway to predicting synthesizability emerges from the dynamics of the materials stability network: a scale-free network constructed by combining the convex free-energy surface of inorganic materials computed by high-throughput density functional theory and their experimental discovery timelines extracted from citations. The time-evolution of the underlying network properties allows us to use machine-learning to predict the likelihood that hypothetical, computer-generated materials will be amenable to successful experimental synthesis.
△ Less
Submitted 2 May, 2019; v1 submitted 13 June, 2018;
originally announced June 2018.
-
Exploring the high-pressure materials genome
Authors:
Maximilian Amsler,
Vinay I. Hegde,
Steven D. Jacobsen,
Chris Wolverton
Abstract:
A thorough in situ characterization of materials at extreme conditions is challenging, and computational tools such as crystal structural search methods in combination with ab initio calculations are widely used to guide experiments by predicting the composition, structure, and properties of high-pressure compounds. However, such techniques are usually computationally expensive and not suitable fo…
▽ More
A thorough in situ characterization of materials at extreme conditions is challenging, and computational tools such as crystal structural search methods in combination with ab initio calculations are widely used to guide experiments by predicting the composition, structure, and properties of high-pressure compounds. However, such techniques are usually computationally expensive and not suitable for large-scale combinatorial exploration. On the other hand, data-driven computational approaches using large materials databases are useful for the analysis of energetics and stability of hundreds of thousands of compounds, but their utility for materials discovery is largely limited to idealized conditions of zero temperature and pressure. Here, we present a novel framework combining the two computational approaches, using a simple linear approximation to the enthalpy of a compound in conjunction with ambient-conditions data currently available in high-throughput databases of calculated materials properties. We demonstrate its utility by explaining the occurrence of phases in nature that are not ground states at ambient conditions and estimating the pressures at which such ambient-metastable phases become thermodynamically accessible, as well as guiding the exploration of ambient-immiscible binary systems via sophisticated structural search methods to discover new stable high-pressure phases.
△ Less
Submitted 19 February, 2018;
originally announced February 2018.
-
Designing and discovering a new family of semiconducting quaternary Heusler compounds based on the 18-electron rule
Authors:
Jiangang He,
S. Shahab Naghavi,
Vinay I. Hegde,
Maximilian Amsler,
Chris Wolverton
Abstract:
Intermetallic compounds with sizable band gaps are attractive for their unusual properties but rare. Here, we present a new family of stable semiconducting quaternary Heusler compounds, designed and discovered by means of high-throughput \textit{ab initio} calculations based on the 18-electron rule. The 99 new semiconductors reported here adopt the ordered quaternary Heusler structure with the pro…
▽ More
Intermetallic compounds with sizable band gaps are attractive for their unusual properties but rare. Here, we present a new family of stable semiconducting quaternary Heusler compounds, designed and discovered by means of high-throughput \textit{ab initio} calculations based on the 18-electron rule. The 99 new semiconductors reported here adopt the ordered quaternary Heusler structure with the prototype of LiMgSnPd (F$\bar{\mathbf{4}}$3m, No.\,216) and contain 18 valence electrons per formula unit. They are realized by filling the void in the half Heusler structure with a small and electropositive atom, i.e., lithium. These new stable quaternary Heusler semiconductors possess a range of band gaps from 0.3 to 2.5\,eV, and exhibit some unusual properties different from conventional semiconductors, such as strong optical absorption, giant dielectric screening, and high Seebeck coefficient, which suggest these semiconductors have potential applications as photovoltaic and thermoelectric materials. While this study opens up avenues for further exploration of this novel class of semiconducting quaternary Heuslers, the design strategy used herein is broadly applicable across a potentially wide array of chemistries to discover new stable materials.
△ Less
Submitted 13 February, 2018;
originally announced February 2018.
-
Computational Investigation of Half-Heusler Compounds for Spintronics Applications
Authors:
Jianhua Ma,
Vinay I. Hegde,
Kamaram Munira,
Yunkun Xie,
Sahar Keshavarz,
David T. Mildebrath,
C. Wolverton,
Avik W. Ghosh,
W. H. Butler
Abstract:
We present first-principles density functional calculations of the electronic structure, magnetism, and structural stability of 378 $\textit{XYZ}$ half-Heusler compounds (with $X=$ Cr, Mn, Fe, Co, Ni, Ru, Rh, $Y=$ Ti, V, Cr, Mn, Fe, Ni, $Z=$ Al, Ga, In, Si, Ge, Sn, P, As, Sb). We find that a "Slater-Pauling density of states" with a gap or pseudogap at three states per atom below the gap in at lea…
▽ More
We present first-principles density functional calculations of the electronic structure, magnetism, and structural stability of 378 $\textit{XYZ}$ half-Heusler compounds (with $X=$ Cr, Mn, Fe, Co, Ni, Ru, Rh, $Y=$ Ti, V, Cr, Mn, Fe, Ni, $Z=$ Al, Ga, In, Si, Ge, Sn, P, As, Sb). We find that a "Slater-Pauling density of states" with a gap or pseudogap at three states per atom below the gap in at least one spin channel is a common feature in half-Heusler compounds. We find that the presence of such a gap at the Fermi energy in one or both spin channels contributes greatly to the stability of a half-Heusler compound. We calculate the formation energy of each compound and systematically investigate its stability against all other phases in the Open Quantum Materials Database (OQMD). We represent the thermodynamic phase stability of each compound as its distance from the convex hull of stable phases in the respective chemical space and show that the hull distance of a compound is a good measure of the likelihood of its experimental synthesis. We identify 26 18-electron semiconductors, 45 half-metals, and 34 near half-metals with negative formation energy, that follow the Slater-Pauling rule of three electrons per atom. Our calculations predict new thermodynamically stable semiconducting phases NiScAs, RhTiP, and RuVAs, which merit further experimental exploration. Further, two interesting zero-moment half-metals, CrMnAs and MnCrAs, are calculated to have negative formation energy. In addition, our calculations predict a number of new, hitherto unreported, semiconducting (e.g., CoVGe, FeVAs), half-metallic (e.g., RhVSb), near half-metallic (e.g., CoFeSb, CoVP) half-Heusler compounds to lie close to the respective convex hull of stable phases, and thus may be experimentally realized under suitable synthesis conditions, resulting in potential candidates for various spintronics applications.
△ Less
Submitted 8 December, 2016; v1 submitted 7 October, 2016;
originally announced October 2016.
-
FusionNet: 3D Object Classification Using Multiple Data Representations
Authors:
Vishakh Hegde,
Reza Zadeh
Abstract:
High-quality 3D object recognition is an important component of many vision and robotics systems. We tackle the object recognition problem using two data representations, to achieve leading results on the Princeton ModelNet challenge. The two representations: 1. Volumetric representation: the 3D object is discretized spatially as binary voxels - $1$ if the voxel is occupied and $0$ otherwise. 2. P…
▽ More
High-quality 3D object recognition is an important component of many vision and robotics systems. We tackle the object recognition problem using two data representations, to achieve leading results on the Princeton ModelNet challenge. The two representations: 1. Volumetric representation: the 3D object is discretized spatially as binary voxels - $1$ if the voxel is occupied and $0$ otherwise. 2. Pixel representation: the 3D object is represented as a set of projected 2D pixel images. Current leading submissions to the ModelNet Challenge use Convolutional Neural Networks (CNNs) on pixel representations. However, we diverge from this trend and additionally, use Volumetric CNNs to bridge the gap between the efficiency of the above two representations. We combine both representations and exploit them to learn new features, which yield a significantly better classifier than using either of the representations in isolation. To do this, we introduce new Volumetric CNN (V-CNN) architectures.
△ Less
Submitted 26 November, 2016; v1 submitted 19 July, 2016;
originally announced July 2016.
-
Ultralow Thermal Conductivity in Full-Heusler Semiconductors
Authors:
Jiangang He,
Maximilian Amsler,
Yi Xia,
S. Shahab Naghavi,
Vinay I. Hegde,
Shiqiang Hao,
Stefan Goedecker,
Vidvuds Ozoliņš,
Chris Wolverton
Abstract:
Semiconducting half- and, to a lesser extent, full-Heusler compounds are promising thermoelectric materials due to their compelling electronic properties with large power factors. However, intrinsically high thermal conductivity resulting in a limited thermoelectric efficiency has so far impeded their widespread use in practical applications. Here, we report the computational discovery of a class…
▽ More
Semiconducting half- and, to a lesser extent, full-Heusler compounds are promising thermoelectric materials due to their compelling electronic properties with large power factors. However, intrinsically high thermal conductivity resulting in a limited thermoelectric efficiency has so far impeded their widespread use in practical applications. Here, we report the computational discovery of a class of hitherto unknown stable semiconducting full-Heusler compounds with ten valence electrons ($X_2YZ$, $X$=Ca, Sr, and Ba; $Y$= Au and Hg; $Z$=Sn, Pb, As, Sb, and Bi) through high-throughput $ab-initio$ screening. These new compounds exhibit ultralow lattice thermal conductivity $κ_{\text{L}}$ close to the theoretical minimum due to strong anharmonic rattling of the heavy noble metals, while preserving high power factors, thus resulting in excellent phonon-glass electron-crystal materials.
△ Less
Submitted 14 April, 2016; v1 submitted 13 April, 2016;
originally announced April 2016.
-
Unextendible mutually unbiased bases in prime-squared dimensions
Authors:
Vishakh Hegde,
Prabha Mandayam
Abstract:
A set of mutually unbiased bases (MUBs) is said to be unextendible if there does not exist another basis that is unbiased with respect to the given set. Here, we prove the existence of smaller sets of MUBs in prime-squared dimensions ($d=p^{2}$) that cannot be extended to a complete set using the generalized Pauli operators. We further observe an interesting connection between the existence of une…
▽ More
A set of mutually unbiased bases (MUBs) is said to be unextendible if there does not exist another basis that is unbiased with respect to the given set. Here, we prove the existence of smaller sets of MUBs in prime-squared dimensions ($d=p^{2}$) that cannot be extended to a complete set using the generalized Pauli operators. We further observe an interesting connection between the existence of unextendible sets and the tightness of entropic uncertainty relations (EURs) in these dimensions. In particular, we show that our construction of unextendible sets of MUBs naturally leads to sets of $p+1$ MUBs that saturate both a Shannon ($H_{1}$) and a collision ($H_{2}$) entropic lower bound. Such an identification of smaller sets of MUBs satisfying tight EURs is crucial for cryptographic applications as well as constructing optimal entanglement witnesses for higher dimensional systems.
△ Less
Submitted 24 August, 2015;
originally announced August 2015.
-
Virtual Location-Based Services: Merging the Physical and Virtual World
Authors:
Christian von der Weth,
Vinod Hegde,
Manfred Hauswirth
Abstract:
Location-based services gained much popularity through providing users with helpful information with respect to their current location. The search and recommendation of nearby locations or places, and the navigation to a specific location are some of the most prominent location-based services. As a recent trend, virtual location-based services consider webpages or sites associated with a location…
▽ More
Location-based services gained much popularity through providing users with helpful information with respect to their current location. The search and recommendation of nearby locations or places, and the navigation to a specific location are some of the most prominent location-based services. As a recent trend, virtual location-based services consider webpages or sites associated with a location as 'virtual locations' that online users can visit in spite of not being physically present at the location. The presence of links between virtual locations and the corresponding physical locations (e.g., geo-location information of a restaurant linked to its website), allows for novel types of services and applications which constitute virtual location-based services (VLBS). The quality and potential benefits of such services largely depends on the existence of websites referring to physical locations. In this paper, we investigate the usefulness of linking virtual and physical locations. For this, we analyze the presence and distribution of virtual locations, i.e., websites referring to places, for two Irish cities. Using simulated tracks based on a user movement model, we investigate how mobile users move through the Web as virtual space. Our results show that virtual locations are omnipresent in urban areas, and that the situation that a user is close to even several such locations at any time is rather the normal case instead of the exception.
△ Less
Submitted 10 October, 2013;
originally announced October 2013.
-
Web Pages Clustering: A New Approach
Authors:
Jeevan H E,
Prashanth P P,
Punith Kumar S N,
Vinay Hegde
Abstract:
The rapid growth of web has resulted in vast volume of information. Information availability at a rapid speed to the user is vital. English language (or any for that matter) has lot of ambiguity in the usage of words. So there is no guarantee that a keyword based search engine will provide the required results. This paper introduces the use of dictionary (standardised) to obtain the context with w…
▽ More
The rapid growth of web has resulted in vast volume of information. Information availability at a rapid speed to the user is vital. English language (or any for that matter) has lot of ambiguity in the usage of words. So there is no guarantee that a keyword based search engine will provide the required results. This paper introduces the use of dictionary (standardised) to obtain the context with which a keyword is used and in turn cluster the results based on this context. These ideas can be merged with a metasearch engine to enhance the search efficiency.
△ Less
Submitted 26 August, 2011;
originally announced August 2011.