Search | arXiv e-print repository

Scalable Training of Graph Foundation Models for Atomistic Materials Modeling: A Case Study with HydraGNN

Authors: Massimiliano Lupo Pasini, Jong Youl Choi, Kshitij Mehta, Pei Zhang, David Rogers, Jonghyun Bae, Khaled Z. Ibrahim, Ashwin M. Aji, Karl W. Schulz, Jorda Polo, Prasanna Balaprakash

Abstract: We present our work on developing and training scalable graph foundation models (GFM) using HydraGNN, a multi-headed graph convolutional neural network architecture. HydraGNN expands the boundaries of graph neural network (GNN) in both training scale and data diversity. It abstracts over message passing algorithms, allowing both reproduction of and comparison across algorithmic innovations that define convolution in GNNs. This work discusses a series of optimizations that have allowed scaling up the GFM training to tens of thousands of GPUs on datasets that consist of hundreds of millions of graphs. Our GFMs use multi-task learning (MTL) to simultaneously learn graph-level and node-level properties of atomistic structures, such as the total energy and atomic forces. Using over 150 million atomistic structures for training, we illustrate the performance of our approach along with the lessons learned on two United States Department of Energy (US-DOE) supercomputers, namely the Perlmutter petascale system at the National Energy Research Scientific Computing Center and the Frontier exascale system at Oak Ridge National Laboratory. The HydraGNN architecture enables the GFM to achieve near-linear strong scaling performance using more than 2,000 GPUs on Perlmutter and 16,000 GPUs on Frontier. Hyperparameter optimization (HPO) was performed on over 64,000 GPUs on Frontier to select GFM architectures with high accuracy. Early stopping was applied on each GFM architecture for energy awareness in performing such an extreme-scale task. The training of an ensemble of highest-ranked GFM architectures continued until convergence to establish uncertainty quantification (UQ) capabilities with ensemble learning. Our contribution opens the door for rapidly developing, training, and deploying GFMs using large-scale computational resources to enable AI-accelerated materials discovery and design.

Submitted 28 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

Comments: 16 pages, 13 figures

MSC Class: 68T07; 68T09 ACM Class: C.2.4; I.2.11

arXiv:2406.01963 [pdf]

Diamond molecular balance: Revolutionizing high-resolution mass spectrometry from MDa to TDa at room temperature

Authors: Donggeun Lee, Seung-Woo Jeon, Chang-Hwan Yi, Yang-Hee Kim, Yeeun Choi, Sang-Hun Lee, Jinwoong Cha, Seung-Bo Shim, Junho Suh, Il-Young Kim, Dongyeon Daniel Kang, Hojoong Jung, Cherlhyun Jeong, Jae-pyoung Ahn, Hee Chul Park, Sang-Wook Han, Chulki Kim

Abstract: The significance of mass spectrometry lies in its unparalleled ability to accurately identify and quantify molecules in complex samples, providing invaluable insights into molecular structures and interactions. Here, we leverage diamond nanostructures as highly sensitive mass sensors by utilizing a self-excitation mechanism under an electron beam in a conventional scanning electron microscope (SEM). The diamond molecular balance (DMB) exhibits exceptional mass resolution of a few MDa and an extensive dynamic range from MDa to TDa, positioning itself as a forefront molecular balance operating at room temperature. Notably, the DMB measures the mass of a single bacteriophage T4, achieving a mass resolution of 4.7 MDa for an analyte at 184 MDa, while precisely determining their positional information on the device. These findings highlight the groundbreaking potential of the DMB as a revolutionary tool for mass analysis at room temperature.

Submitted 6 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

Comments: 15 pages, 4 figures

arXiv:2404.10994 [pdf, other]

Quantum plasmonic sensing by Hong-Ou-Mandel interferometry

Authors: Seungjin Yoon, Yu Sung Choi, Mark Tame, Jae Woong Yoon, Sergey V. Polyakov, Changhyoup Lee

Abstract: We propose a quantum plasmonic sensor using Hong-Ou-Mandel (HOM) interferometry that measures the refractive index of an analyte, embedded in a plasmonic beam splitter composed of a dual-Kretschmann configuration, which serves as a frustrated total internal reflection beamsplitter. The sensing performance of the HOM interferometry, combined with single-photon detectors, is evaluated through Fisher information for estimation of the refractive index of the analyte. This is subsequently compared with the classical benchmark that considers the injection of a coherent state of light into the plasmonic beamsplitter. By varying the wavelength of the single photons and the refractive index of the analyte, we identify a wide range where a 50% quantum enhancement is achieved and discuss the observed behaviors in comparison with the classical benchmark. We expect this study to provide a useful insight into the advancement of quantum-enhanced sensing technologies, with direct implications for a wide range of nanophotonic beamsplitter structures.

Submitted 23 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

arXiv:2404.04153 [pdf, other]

Evaluation of the performance of the event reconstruction algorithms in the JSNS$^2$ experiment using a $^{252}$Cf calibration source

Authors: D. H. Lee, M. K. Cheoun, J. H. Choi, J. Y. Choi, T. Dodo, J. Goh, K. Haga, M. Harada, S. Hasegawa, W. Hwang, T. Iida, H. I. Jang, J. S. Jang, K. K. Joo, D. E. Jung, S. K. Kang, Y. Kasugai, T. Kawasaki, E. J. Kim, J. Y. Kim, S. B. Kim, W. Kim, H. Kinoshita, T. Konno, I. T. Lim, et al. (28 additional authors not shown)

Abstract: JSNS$^2$ searches for short baseline neutrino oscillations with a baseline of 24 meters and a target of 17 tonnes of Gd-loaded liquid scintillator. A correct event reconstruction algorithm, which determines the position and energy of neutrino interactions in the detector, is essential for the physics analysis of the data from the experiment. Therefore, the performance of the event reconstruction is carefully checked with calibrations using a $^{252}$Cf source. This manuscript describes the methodology and the performance of the event reconstruction.

Submitted 5 April, 2024; originally announced April 2024.

arXiv:2404.03679 [pdf, other]

Pulse Shape Discrimination in JSNS$^2$

Authors: T. Dodo, M. K. Cheoun, J. H. Choi, J. Y. Choi, J. Goh, K. Haga, M. Harada, S. Hasegawa, W. Hwang, T. Iida, H. I. Jang, J. S. Jang, K. K. Joo, D. E. Jung, S. K. Kang, Y. Kasugai, T. Kawasaki, E. J. Kim, J. Y. Kim, S. B. Kim, W. Kim, H. Kinoshita, T. Konno, D. H. Lee, I. T. Lim, et al. (29 additional authors not shown)

Abstract: JSNS$^2$ (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment that is searching for sterile neutrinos via the observation of $\bar{\nu}_\mu \rightarrow \bar{\nu}_e$ appearance oscillations using neutrinos with muon decay-at-rest. For this search, rejecting cosmic-ray-induced neutron events by Pulse Shape Discrimination (PSD) is essential because the JSNS$^2$ detector is located above ground, on the third floor of the building. We have achieved 95% rejection of neutron events while keeping 90% of signal, electron-like events using a data driven likelihood method.

Submitted 28 March, 2024; originally announced April 2024.

Comments: arXiv admin note: text overlap with arXiv:2111.07482, arXiv:2308.02722

arXiv:2403.10555 [pdf, other]

KARINA: An Efficient Deep Learning Model for Global Weather Forecast

Authors: Minjong Cheon, Yo-Hwan Choi, Seon-Yu Kang, Yumi Choi, Jeong-Gil Lee, Daehyun Kang

Abstract: Deep learning-based, data-driven models are gaining prevalence in climate research, particularly for global weather prediction. However, training the global weather data at high resolution requires massive computational resources. Therefore, we present a new model named KARINA to overcome the substantial computational demands typical of this field. This model achieves forecasting accuracy comparable to higher-resolution counterparts with significantly less computational resources, requiring only 4 NVIDIA A100 GPUs and less than 12 hours of training. KARINA combines ConvNext, SENet, and Geocyclic Padding to enhance weather forecasting at a 2.5° resolution, which could filter out high-frequency noise. Geocyclic Padding preserves pixels at the lateral boundary of the input image, thereby maintaining atmospheric flow continuity in the spherical Earth. SENet dynamically improves feature response, advancing atmospheric process modeling, particularly in the vertical column process as numerous channels. In this vein, KARINA sets new benchmarks in weather forecasting accuracy, surpassing existing models like the ECMWF S2S reforecasts at a lead time of up to 7 days. Remarkably, KARINA achieved competitive performance even when compared to the recently developed models (Pangu-Weather, GraphCast, ClimaX, and FourCastNet) trained with high-resolution data having 100 times larger pixels. Conclusively, KARINA significantly advances global weather forecasting by efficiently modeling Earth's atmosphere with improved accuracy and resource efficiency.

Submitted 13 March, 2024; originally announced March 2024.
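
As a rough illustration of the lateral-boundary idea described in the KARINA abstract above, the following sketch wraps columns of a latitude-by-longitude grid around the east-west edge so that a convolution would see continuous data across the boundary. It is a toy under stated assumptions, not the authors' implementation: the function name `pad_longitude_circular` and the list-of-lists grid layout are hypothetical, and any special handling of the poles is left out.

```python
def pad_longitude_circular(grid, pad):
    """Wrap `pad` columns around the east-west edge of a lat x lon grid.

    `grid` is a list of rows (latitudes); each row is a list of values
    ordered by longitude. Hypothetical helper for illustration only.
    """
    if pad < 0:
        raise ValueError("pad must be non-negative")
    if grid and pad > len(grid[0]):
        raise ValueError("pad cannot exceed the number of longitude columns")
    if pad == 0:
        # Slicing with -0 would copy the whole row, so handle this case apart.
        return [list(row) for row in grid]
    # Easternmost columns go on the left, westernmost columns on the right.
    return [list(row[-pad:]) + list(row) + list(row[:pad]) for row in grid]


if __name__ == "__main__":
    field = [[1, 2, 3, 4],
             [5, 6, 7, 8]]
    for row in pad_longitude_circular(field, 1):
        print(row)
```

Padding by copying from the opposite edge, instead of filling with zeros, is what keeps values adjacent on the sphere adjacent in the padded array.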

arXiv:2403.08952 [pdf, other]

Characterisation of analogue Monolithic Active Pixel Sensor test structures implemented in a 65 nm CMOS imaging process

Authors: Gianluca Aglieri Rinella, Giacomo Alocco, Matias Antonelli, Roberto Baccomi, Stefania Maria Beole, Mihail Bogdan Blidaru, Bent Benedikt Buttwill, Eric Buschmann, Paolo Camerini, Francesca Carnesecchi, Marielle Chartier, Yongjun Choi, Manuel Colocci, Giacomo Contin, Dominik Dannheim, Daniele De Gruttola, Manuel Del Rio Viera, Andrea Dubla, Antonello di Mauro, Maurice Calvin Donner, Gregor Hieronymus Eberwein, Jan Egger, Laura Fabbietti, Finn Feindt, Kunal Gautam, et al. (69 additional authors not shown)

Abstract: Analogue test structures were fabricated using the Tower Partners Semiconductor Co. CMOS 65 nm ISC process. The purpose was to characterise and qualify this process and to optimise the sensor for the next generation of Monolithic Active Pixels Sensors for high-energy physics. The technology was explored in several variants which differed by: doping levels, pixel geometries and pixel pitches (10-25 $\mu$m). These variants have been tested following exposure to varying levels of irradiation up to 3 MGy and $10^{16}$ 1 MeV n$_\text{eq}$ cm$^{-2}$. Here the results from prototypes that feature direct analogue output of a 4$\times$4 pixel matrix are reported, allowing the systematic and detailed study of charge collection properties. Measurements were taken both using $^{55}$Fe X-ray sources and in beam tests using minimum ionizing particles. The results not only demonstrate the feasibility of using this technology for particle detection but also serve as a reference for future applications and optimisations.

Submitted 13 March, 2024; originally announced March 2024.

arXiv:2402.08185 [pdf, other]

Advancing Data-driven Weather Forecasting: Time-Sliding Data Augmentation of ERA5

Authors: Minjong Cheon, Daehyun Kang, Yo-Hwan Choi, Seon-Yu Kang

Abstract: Modern deep learning techniques, which mimic traditional numerical weather prediction (NWP) models and are derived from global atmospheric reanalysis data, have caused a significant revolution within a few years. In this new paradigm, our research introduces a novel strategy that deviates from the common dependence on high-resolution data, which is often constrained by computational resources, and instead utilizes low-resolution data (2.5 degrees) for global weather prediction and climate data analysis. Our main focus is evaluating data-driven weather prediction (DDWP) frameworks, specifically addressing sample size adequacy, structural improvements to the model, and the ability of climate data to represent current climatic trends. By using the Adaptive Fourier Neural Operator (AFNO) model via FourCastNet and a proposed time-sliding method to inflate the dataset of the ECMWF Reanalysis v5 (ERA5), this paper improves on conventional approaches by adding more variables and a novel approach to data augmentation and processing. Our findings reveal that despite the lower resolution, the proposed approach demonstrates considerable accuracy in predicting atmospheric conditions, effectively rivaling higher-resolution models. Furthermore, the study confirms the model's proficiency in reflecting current climate trends and its potential in predicting future climatic events, underscoring its utility in climate change strategies. This research marks a pivotal step in the realm of meteorological forecasting, showcasing the feasibility of lower-resolution data in producing reliable predictions and opening avenues for more accessible and inclusive climate modeling. The insights gleaned from this study not only contribute to the advancement of climate science but also lay the groundwork for future innovations in the field.

Submitted 12 February, 2024; originally announced February 2024.

arXiv:2401.13695 [pdf, other]

Inverse analysis of granular flows using differentiable graph neural network simulator

Authors: Yongjin Choi, Krishna Kumar

Abstract: Inverse problems in granular flows, such as landslides and debris flows, involve estimating material parameters or boundary conditions based on target runout profile. Traditional high-fidelity simulators for these inverse problems are computationally demanding, restricting the number of simulations possible. Additionally, their non-differentiable nature makes gradient-based optimization methods, known for their efficiency in high-dimensional problems, inapplicable. While machine learning-based surrogate models offer computational efficiency and differentiability, they often struggle to generalize beyond their training data due to their reliance on low-dimensional input-output mappings that fail to capture the complete physics of granular flows. We propose a novel differentiable graph neural network simulator (GNS) by combining reverse mode automatic differentiation of graph neural networks with gradient-based optimization for solving inverse problems. GNS learns the dynamics of granular flow by representing the system as a graph and predicts the evolution of the graph at the next time step, given the current state. The differentiable GNS shows optimization capabilities beyond the training data. We demonstrate the effectiveness of our method for inverse estimation across single and multi-parameter optimization problems, including evaluating material properties and boundary conditions for a target runout distance and designing baffle locations to limit a landslide runout. Our proposed differentiable GNS framework offers an orders of magnitude faster solution to these inverse problems than the conventional finite difference approach to gradient-based optimization.

Submitted 26 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

ACM Class: I.6.8

arXiv:2401.10245 [pdf, other]

Train Small, Model Big: Scalable Physics Simulators via Reduced Order Modeling and Domain Decomposition

Authors: Seung Whan Chung, Youngsoo Choi, Pratanu Roy, Thomas Moore, Thomas Roy, Tiras Y. Lin, Du Y. Nguyen, Christopher Hahn, Eric B. Duoss, Sarah E. Baker

Abstract: Numerous cutting-edge scientific technologies originate at the laboratory scale, but transitioning them to practical industry applications is a formidable challenge. Traditional pilot projects at intermediate scales are costly and time-consuming. An alternative, the E-pilot, relies on high-fidelity numerical simulations, but even these simulations can be computationally prohibitive at larger scales. To overcome these limitations, we propose a scalable, physics-constrained reduced order model (ROM) method. ROM identifies critical physics modes from small-scale unit components, projecting governing equations onto these modes to create a reduced model that retains essential physics details. We also employ Discontinuous Galerkin Domain Decomposition (DG-DD) to apply ROM to unit components and interfaces, enabling the construction of large-scale global systems without data at such large scales. This method is demonstrated on the Poisson and Stokes flow equations, showing that it can solve equations about 15 to 40 times faster with only $\sim 1\%$ relative error. Furthermore, ROM takes one order of magnitude less memory than the full order model, enabling larger scale predictions at a given memory limitation.

Submitted 5 December, 2023; originally announced January 2024.

Comments: 40 pages, 12 figures. Submitted to Computer Methods in Applied Mechanics and Engineering

Report number: LLNL-JRNL-857774 MSC Class: 65F55; 65N55 (primary) 76D07 (secondary)

arXiv:2312.07902 [pdf, other]

doi 10.1016/j.cma.2024.116978

Gappy AE: A Nonlinear Approach for Gappy Data Reconstruction using Auto-Encoder

Authors: Youngkyu Kim, Youngsoo Choi, Byounghyun Yoo

Abstract: We introduce a novel data reconstruction algorithm known as Gappy auto-encoder (Gappy AE) to address the limitations associated with Gappy proper orthogonal decomposition (Gappy POD), a widely used method for data reconstruction when dealing with sparse measurements or missing data. Gappy POD has inherent constraints in accurately representing solutions characterized by slowly decaying Kolmogorov N-widths, primarily due to its reliance on linear subspaces for data prediction. In contrast, Gappy AE leverages the power of nonlinear manifold representations to address data reconstruction challenges of conventional Gappy POD. It excels at real-time state prediction in scenarios where only sparsely measured data is available, filling in the gaps effectively. This capability makes Gappy AE particularly valuable, such as for digital twin and image correction applications. To demonstrate the superior data reconstruction performance of Gappy AE with sparse measurements, we provide several numerical examples, including scenarios like 2D diffusion, 2D radial advection, and 2D wave equation problems. Additionally, we assess the impact of four distinct sampling algorithms - discrete empirical interpolation method, the S-OPT algorithm, Latin hypercube sampling, and uniformly distributed sampling - on data reconstruction accuracy. Our findings conclusively show that Gappy AE outperforms Gappy POD in data reconstruction when sparse measurements are given.

Submitted 31 March, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

Journal ref: Computer Methods in Applied Mechanics and Engineering 426 (2024)

arXiv:2311.18715 [pdf, other]

Accelerating Flow Simulations using Online Dynamic Mode Decomposition

Authors: Seung Won Suh, Seung Whan Chung, Peer-Timo Bremer, Youngsoo Choi

Abstract: We develop an on-the-fly reduced-order model (ROM) integrated with a flow simulation, gradually replacing a corresponding full-order model (FOM) of a physics solver. Unlike offline methods requiring a separate FOM-only simulation prior to model reduction, our approach constructs a ROM dynamically during the simulation, replacing the FOM when deemed credible. Dynamic mode decomposition (DMD) is employed for online ROM construction, with a single snapshot vector used for rank-1 updates in each iteration. Demonstrated on a flow over a cylinder with Re = 100, our hybrid FOM/ROM simulation is verified in terms of the Strouhal number, resulting in a 4.4 times speedup compared to the FOM solver.

Submitted 30 November, 2023; originally announced November 2023.

Comments: Presented at Machine Learning and the Physical Sciences Workshop, NeurIPS 2023

arXiv:2311.16410 [pdf, other]

Reduced-order modeling for parameterized PDEs via implicit neural representations

Authors: Tianshu Wen, Kookjin Lee, Youngsoo Choi

Abstract: We present a new data-driven reduced-order modeling approach to efficiently solve parametrized partial differential equations (PDEs) for many-query problems. This work is inspired by the concept of implicit neural representation (INR), which models physics signals in a continuous manner and independent of spatial/temporal discretization. The proposed framework encodes PDE and utilizes a parametrized neural ODE (PNODE) to learn latent dynamics characterized by multiple PDE parameters. PNODE can be inferred by a hypernetwork to reduce the potential difficulties in learning PNODE due to a complex multilayer perceptron (MLP). The framework uses an INR to decode the latent dynamics and reconstruct accurate PDE solutions. Further, a physics-informed loss is also introduced to correct the prediction of unseen parameter instances. Incorporating the physics-informed loss also enables the model to be fine-tuned in an unsupervised manner on unseen PDE parameters. A numerical experiment is performed on a two-dimensional Burgers equation with a large variation of PDE parameters. We evaluate the proposed method at a large Reynolds number and obtain a speedup of up to O(10^3) and ~1% relative error to the ground truth values.

Submitted 27 November, 2023; originally announced November 2023.

Comments: 9 pages, 5 figures, Machine Learning and the Physical Sciences Workshop, NeurIPS 2023

arXiv:2311.07416 [pdf]

Three-dimensional granular flow simulation using graph neural network-based learned simulator

Authors: Yongjin Choi, Krishna Kumar

Abstract: Reliable evaluations of geotechnical hazards like landslides and debris flow require accurate simulation of granular flow dynamics. Traditional numerical methods can simulate the complex behaviors of such flows that involve solid-like to fluid-like transitions, but they are computationally intractable when simulating large-scale systems. Surrogate models based on statistical or machine learning methods are a viable alternative, but they are typically empirical and rely on a confined set of parameters in evaluating associated risks. Due to their permutation-dependent learning, conventional machine learning models require an unreasonably large amount of training data for building generalizable surrogate models. We employ a graph neural network (GNN), a novel deep learning technique, to develop a GNN-based simulator (GNS) for granular flows to address these issues. Graphs represent the state of granular flows and interactions, like the exchange of energy and momentum between grains, and GNN learns the local interaction law. GNS takes the current state of the granular flow and estimates the next state using Euler explicit integration. We train GNS on a limited set of granular flow trajectories and evaluate its performance in a three-dimensional granular column collapse domain. GNS successfully reproduces the overall behaviors of column collapses with various aspect ratios that were not encountered during training. The computation speed of GNS outperforms high-fidelity numerical simulators by 300 times.

Submitted 13 November, 2023; originally announced November 2023.

ACM Class: I.6.8
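
The explicit Euler update mentioned in the abstract above can be sketched as follows. This is a minimal illustration, assuming the learned model supplies one acceleration vector per particle; the function name `euler_step` and the plain-list data layout are hypothetical and are not taken from the authors' code.

```python
def euler_step(positions, velocities, accelerations, dt):
    """Advance particle states by one explicit (forward) Euler step.

    Each argument except `dt` is a list of per-particle coordinate lists.
    Positions use the current velocities, and velocities use the current
    accelerations (assumed here to come from a learned model).
    """
    new_positions = [
        [x + v * dt for x, v in zip(pos, vel)]
        for pos, vel in zip(positions, velocities)
    ]
    new_velocities = [
        [v + a * dt for v, a in zip(vel, acc)]
        for vel, acc in zip(velocities, accelerations)
    ]
    return new_positions, new_velocities


if __name__ == "__main__":
    # One particle moving in x while accelerating downward in z.
    pos, vel = euler_step([[0.0, 1.0, 2.0]], [[1.0, 0.0, -1.0]],
                          [[0.0, 0.0, -10.0]], dt=0.5)
    print(pos, vel)
```

In a learned simulator the step would be applied repeatedly, feeding each new state back into the model to roll a trajectory forward.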

arXiv:2311.05407 [pdf]

Data Distillation for Neural Network Potentials toward Foundational Dataset

Authors: Gang Seob Jung, Sangkeun Lee, Jong Youl Choi

Abstract: Machine learning (ML) techniques and atomistic modeling have rapidly transformed materials design and discovery. Specifically, generative models can swiftly propose promising materials for targeted applications. However, the predicted properties of materials through the generative models often do not match with calculated properties through ab initio calculations. This discrepancy can arise because the generated coordinates are not fully relaxed, whereas many properties are derived from relaxed structures. Neural network-based potentials (NNPs) can expedite the process by providing relaxed structures from the initially generated ones. Nevertheless, acquiring data to train NNPs for this purpose can be extremely challenging as it needs to encompass previously unknown structures. This study utilized extended ensemble molecular dynamics (MD) to secure a broad range of liquid- and solid-phase configurations in one of the metallic systems, nickel. Then, we could significantly reduce them through active learning without losing much accuracy. We found that the NNP trained from the distilled data could predict different energy-minimized close-packed crystal structures even though those structures were not explicitly part of the initial data. Furthermore, the data can be translated to other metallic systems (aluminum and niobium), without repeating the sampling and distillation processes. Our approach to data acquisition and distillation has demonstrated the potential to expedite NNP development and enhance materials design and discovery by integrating generative models.

Submitted 9 November, 2023; originally announced November 2023.

arXiv:2310.18493 [pdf, other]

Accelerating Kinetic Simulations of Electrostatic Plasmas with Reduced-Order Modeling

Authors: Ping-Hsuan Tsai, Seung Whan Chung, Debojyoti Ghosh, John Loffeld, Youngsoo Choi, Jonathan L. Belof

Abstract: Despite the advancements in high-performance computing and modern numerical algorithms, the cost remains prohibitive for multi-query kinetic plasma simulations. In this work, we develop data-driven reduced-order models (ROM) for collisionless electrostatic plasma dynamics, based on the kinetic Vlasov-Poisson equation. Our ROM approach projects the equation onto a linear subspace defined by principal proper orthogonal decomposition (POD) modes. We introduce an efficient tensorial method to update the nonlinear term using a precomputed third-order tensor. We capture multiscale behavior with a minimal number of POD modes by decomposing the solution into multiple time windows using a physical-time indicator and creating a temporally-local ROM. Applied to 1D-1V simulations, specifically the benchmark two-stream instability case, our time-windowed reduced-order model (TW-ROM) with the tensorial approach solves the equation approximately 280 times faster than Eulerian simulations while maintaining a maximum relative error of 4% for the training data and 13% for the testing data.

Submitted 27 November, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

Comments: 7 pages, 3 figures; typos corrected; references added; added one figure for predicted solution fields; fixed error in the legend of figure 1.b and caption; added rebox in figure 1.a to indicate training data; added timing for constructing the tensor offline; added one more paragraph in section 3

arXiv:2310.08874 [pdf, other]

Estimation of the Characteristic Wavelength Parameter in 1D Leray-Burgers Equation with PINN

Authors: Bong-Sik Kim, Yuncherl Choi, DooSeok Lee

Abstract: In this paper, we employ the Physics-Informed Neural Network (PINN) to estimate the practical range of the characteristic wavelength parameter (referred to as the smoothing parameter) $α$ in the Leray-Burgers equation. The Leray-Burgers equation, a regularization of the inviscid Burgers equation, incorporates a Helmholtz filter with a characteristic wavelength $α$ to replace the usual convective velocity, inducing a regularized convective velocity. The filter bends the equation's characteristics slightly so that they do not intersect each other, leading to a global solution in time. By conducting computational experiments with various initial conditions, we determine the practical range of $α>0$ that closely approximates the solutions of the inviscid Burgers equation. Our findings indicate that the value of $α$ depends on the initial data, with the practical range of $α$ being between 0.01 and 0.05 for continuous initial profiles and between 0.01 and 0.03 for discontinuous initial profiles. The Leray-Burgers equation captures shock and rarefaction waves within the temporal domain for which training data exists. However, as the temporal domain extends beyond the training interval, data-driven forward computation demonstrates that the predictions generated by the PINN start to deviate from the exact solutions. This study also highlights the effectiveness and efficiency of the Leray-Burgers equation in practical problems, specifically Traffic State Estimation.

Submitted 13 October, 2023; originally announced October 2023.

MSC Class: 35L60

arXiv:2309.13348 [pdf, other]

Accelerating Particle and Fluid Simulations with Differentiable Graph Networks for Solving Forward and Inverse Problems

Authors: Krishna Kumar, Yongjin Choi

Abstract: We leverage physics-embedded differentiable graph network simulators (GNS) to accelerate particulate and fluid simulations to solve forward and inverse problems. GNS represents the domain as a graph with particles as nodes and learned interactions as edges. Compared to modeling global dynamics, GNS enables learning local interaction laws through edge messages, improving its generalization to new environments. GNS achieves over 165x speedup for granular flow prediction compared to parallel CPU numerical simulations. We propose a novel hybrid GNS/Material Point Method (MPM) that interleaves MPM in GNS rollouts to satisfy conservation laws and reduce the error of a pure surrogate model, achieving a 24x speedup compared to pure numerical simulations. The differentiable GNS enables solving inverse problems through automatic differentiation, identifying material parameters that result in target runout distances. We demonstrate the ability of GNS to solve inverse problems by iteratively updating the friction angle (a material property) by computing the gradient of a loss function based on the final and target runouts, thereby identifying the friction angle that best matches the observed runout. The physics-embedded and differentiable simulators open an exciting new paradigm for AI-accelerated design, control, and optimization.

Submitted 23 September, 2023; originally announced September 2023.

Comments: 6 pages, The 4th workshop on Artificial Intelligence and Machine Learning for Scientific Applications, Super Computing '23

ACM Class: I.2; I.6.8

arXiv:2309.01887 [pdf, other]

doi 10.1088/1748-0221/18/12/T12001

The acrylic vessel for JSNS$^{2}$-II neutrino target

Authors: C. D. Shin, S. Ajimura, M. K. Cheoun, J. H. Choi, J. Y. Choi, T. Dodo, J. Goh, K. Haga, M. Harada, S. Hasegawa, T. Hiraiwa, W. Hwang, T. Iida, H. I. Jang, J. S. Jang, H. Jeon, S. Jeon, K. K. Joo, D. E. Jung, S. K. Kang, Y. Kasugai, T. Kawasaki, E. J. Kim, J. Y. Kim, S. B. Kim, et al. (35 additional authors not shown)

Abstract: The JSNS$^{2}$ (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment designed to search for sterile neutrinos. The experiment is currently in its second phase, named JSNS$^{2}$-II, with two detectors at near and far locations from the neutrino source. One of the key components of the experiment is an acrylic vessel that is used as the target volume for the detection of the anti-neutrinos. The specifications, design, and measured properties of the acrylic vessel are described.

Submitted 11 December, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

Journal ref: 2023 JINST 18 T12001

arXiv:2309.01089 [pdf, other]

doi 10.1103/PhysRevMaterials.8.013606

Study of vacancy ordering and the boson peak in metastable cubic Ge-Sb-Te using machine learning potentials

Authors: Young-Jae Choi, Minjae Ghim, Seung-Hoon Jhi

Abstract: The mechanism of the vacancy ordering in metastable cubic Ge-Sb-Te (c-GST) that underlies the ultrafast phase-change dynamics and prominent thermoelectric properties remains elusive. Achieving a comprehensive understanding of the vacancy-ordering process at an atomic level is challenging because of enormous computational demands required to simulate disordered structures on large temporal and spatial scales. In this study, we investigate the vacancy ordering in c-GST by performing large-scale molecular dynamics simulations using machine learning potentials. The initial c-GST structure with randomly distributed vacancies rearranges to develop a semi-ordered cubic structure with layer-like ordered vacancies after annealing at 700~K for 100~ns. The vacancy ordering significantly affects the lattice dynamical properties of c-GST. In the initial structure with fully disordered vacancies, we observe a boson peak, usually associated with amorphous solids, that consists of localized modes at $\sim$0.575~THz. The boson peak modes are highly localized around specific atomic arrangements of straight vacancy-Te-vacancy trios. As vacancies become ordered, the boson peak disappears and the Debye-Waller thermal \textit{B} factor of Te decreases substantially. This finding indicates that the c-GST undergoes a transition from amorphous-like to crystalline-like solid state by thermal annealing in low-frequency dynamics.

Submitted 4 January, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

Comments: 10 pages of main text, 1 Table of Contents figure, 8 main figures, Supplemental Material

Journal ref: Phys. Rev. Materials 8, 013606 (2024)

arXiv:2308.02722 [pdf, other]

doi 10.1140/epjc/s10052-024-12778-7

Study on the accidental background of the JSNS$^2$ experiment

Authors: D. H. Lee, S. Ajimura, M. K. Cheoun, J. H. Choi, J. Y. Choi, T. Dodo, J. Goh, K. Haga, M. Harada, S. Hasegawa, T. Hiraiwa, W. Hwang, H. I. Jang, J. S. Jang, H. Jeon, S. Jeon, K. K. Joo, D. E. Jung, S. K. Kang, Y. Kasugai, T. Kawasaki, E. J. Kim, J. Y. Kim, S. B. Kim, W. Kim, et al. (33 additional authors not shown)

Abstract: JSNS$^2$ (J-PARC Sterile Neutrino Search at J-PARC Spallation Neutron Source) is an experiment which searches for sterile neutrinos via the observation of $\bar{\nu}_{\mu} \to \bar{\nu}_{e}$ appearance oscillations using muon decay-at-rest neutrinos. Data taking at JSNS$^2$ has been performed since 2021. In this manuscript, a study of the accidental background is presented. The rate of the accidental background is $(9.29 \pm 0.39) \times 10^{-8}$ / spill with 0.75 MW beam power and is comparable to the number of signals searched for.

Submitted 22 April, 2024; v1 submitted 4 August, 2023; originally announced August 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2111.07482

Journal ref: Eur. Phys. J. C 84, 409 (2024)

arXiv:2308.02602 [pdf, other]

On stable wrapper-based parameter selection method for efficient ANN-based data-driven modeling of turbulent flows

Authors: Hyeongeun Yun, Yongcheol Choi, Youngjae Kim, Seongwon Kang

Abstract: To model complex turbulent flow and heat transfer phenomena, this study aims to analyze and develop a reduced modeling approach based on artificial neural network (ANN) and wrapper methods. This approach has an advantage over other methods such as the correlation-based filter method in terms of removing redundant or irrelevant parameters even under non-linearity among them. As a downside, the overfitting and randomness of ANN training may produce inconsistent subsets over selection trials especially in a higher physical dimension. This study analyzes a few existing ANN-based wrapper methods and develops a revised one based on the gradient-based subset selection indices to minimize the loss in the total derivative or the directional consistency at each elimination step. To examine parameter reduction performance and consistency-over-trials, we apply these methods to a manufactured subset selection problem, modeling of the bubble size in a turbulent bubbly flow, and modeling of the spatially varying turbulent Prandtl number in a duct flow. It is found that the gradient-based subset selection to minimize the total derivative loss results in improved consistency-over-trials compared to the other ANN-based wrapper methods, while removing unnecessary parameters successfully. For the reduced turbulent Prandtl number model, the gradient-based subset selection improves the prediction in the validation case over the other methods. Also, the reduced parameter subsets show a slight increase in the training speed compared to the others.

Submitted 4 August, 2023; originally announced August 2023.

Comments: 24 pages, 26 figures

arXiv:2306.00184 [pdf, other]

Data-scarce surrogate modeling of shock-induced pore collapse process

Authors: Siu Wun Cheung, Youngsoo Choi, H. Keo Springer, Teeratorn Kadeethum

Abstract: Understanding the mechanisms of shock-induced pore collapse is of great interest in various disciplines in sciences and engineering, including materials science, biological sciences, and geophysics. However, numerical modeling of the complex pore collapse processes can be costly. To this end, a strong need exists to develop surrogate models for generating economic predictions of pore collapse processes. In this work, we study the use of a data-driven reduced order model, namely dynamic mode decomposition, and a deep generative model, namely conditional generative adversarial networks, to resemble the numerical simulations of the pore collapse process at representative training shock pressures. Since the simulations are expensive, the training data are scarce, which makes training an accurate surrogate model challenging. To overcome the difficulties posed by the complex physics phenomena, we make several crucial treatments to the plain original form of the methods to increase the capability of approximating and predicting the dynamics. In particular, physics information is used as indicators or conditional inputs to guide the prediction. In realizing these methods, the training of each dynamic mode decomposition model takes only around 30 seconds on CPU. In contrast, training a generative adversarial network model takes 8 hours on GPU. Moreover, using dynamic mode decomposition, the final-time relative error is around 0.3% in the reproductive cases. We also demonstrate the predictive power of the methods at unseen testing shock pressures, where the error ranges from 1.3% to 5% in the interpolatory cases and 8% to 9% in extrapolatory cases.

Submitted 2 June, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

arXiv:2305.05218 [pdf, other]

Graph Neural Network-based surrogate model for granular flows

Authors: Yongjin Choi, Krishna Kumar

Abstract: Accurate simulation of granular flow dynamics is crucial for assessing various geotechnical risks, including landslides and debris flows. Granular flows involve a dynamic rearrangement of particles exhibiting complex transitions from solid-like to fluid-like responses. Traditional continuum and discrete numerical methods are limited by their computational cost in simulating large-scale systems. Statistical or machine learning-based models offer an alternative. Still, they are largely empirical, based on a limited set of parameters. Due to their permutation-dependent learning, traditional machine learning-based models require huge training data to generalize. To resolve these problems, we use a graph neural network, a state-of-the-art machine learning architecture that learns local interactions. Graphs represent the state of dynamically changing granular flows and the interaction laws, such as energy and momentum exchange between grains. We develop a graph neural network-based simulator (GNS) that takes the current state of granular flow and predicts the next state using Euler explicit integration by learning the local interaction laws. We train GNS on different granular trajectories. We then assess the performance of GNS by predicting granular column collapse. GNS accurately predicts flow dynamics for column collapses with different aspect ratios unseen during training. GNS is hundreds of times faster than high-fidelity numerical simulators. The model also generalizes to domains much larger than the training data, handling more than twice the number of particles it was trained on.

Submitted 12 December, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

ACM Class: I.6.8

arXiv:2304.12972 [pdf]

Automated Solubility Analysis System and Method Using Computer Vision and Machine Learning

Authors: Gahee Kim, Minwoo Jeon, Hyun Do Choi, Jun Ki Cho, Youn-Suk Choi, Hyoseok Hwang

Abstract: In this study, a novel active solubility sensing device using computer vision is proposed to improve separation purification performance and prevent malfunctions of separation equipment such as preparative liquid chromatographers and evaporators. The proposed device actively measures the solubility by transmitting a solution using a background image. The proposed system is a combination of a device that uses a background image and a method for estimating the dissolution and particle presence by changing the background image. The proposed device consists of four parts: camera, display, adjustment, and server units. The camera unit is made up of a rear image sensor on a mobile phone. The display unit comprises a tablet screen. The adjustment unit is composed of rotating and height-adjustment jigs. Finally, the server unit consists of a socket server for communication between the units and a PC, including an automated solubility analysis system implemented in Python. The dissolution status of the solution was divided into four categories and a case study was conducted. The algorithms were trained based on these results. Six organic materials and four organic solvents were combined in 202 tests to train the developed algorithm. As a result, the evaluation of the dissolution state exhibited an accuracy of 95%. In addition, for use in autonomous systems such as a synthetic automation system, the device and method still require a feedback function that can add a solvent or solute after dissolution detection, based on the solubility results. Finally, the diversification of the sensing method is expected to extend not only to the solution but also to the solubility and homogeneity analysis of the film.

Submitted 7 April, 2023; originally announced April 2023.

Comments: 20 pages, 6 figures, 3 tables

arXiv:2304.06954 [pdf]

doi 10.1021/acs.nanolett.2c04425

Electrical transport properties driven by unique bonding configuration in gamma-GeSe

Authors: Jeongsu Jang, Joonho Kim, Dongchul Sung, Jong Hyuk Kim, Joong-Eon Jung, Sol Lee, Jinsub Park, Chaewoon Lee, Heesun Bae, Seongil Im, Kibog Park, Young Jai Choi, Suklyun Hong, Kwanpyo Kim

Abstract: Group-IV monochalcogenides have recently shown great potential for their thermoelectric, ferroelectric, and other intriguing properties. The electrical properties of group-IV monochalcogenides exhibit a strong dependence on the chalcogen type. For example, GeTe exhibits high doping concentration, whereas S/Se-based chalcogenides are semiconductors with sizable bandgaps. Here, we investigate the electrical and thermoelectric properties of gamma-GeSe, a recently identified polymorph of GeSe. gamma-GeSe exhibits high electrical conductivity ($\sim 10^{6}$ S/m) and a relatively low Seebeck coefficient (9.4 μV/K at room temperature) owing to its high p-doping level ($5 \times 10^{21}$ cm$^{-3}$), which is in stark contrast to other known GeSe polymorphs. Elemental analysis and first-principles calculations confirm that the abundant formation of Ge vacancies leads to the high p-doping concentration. The magnetoresistance measurements also reveal weak-antilocalization because of spin-orbit coupling in the crystal. Our results demonstrate that gamma-GeSe is a unique polymorph in which the modified local bonding configuration leads to substantially different physical properties.

Submitted 14 April, 2023; originally announced April 2023.

arXiv:2303.17918 [pdf]

doi 10.1016/j.chaos.2023.113679

Counting statistics based on the analytic solutions of the differential-difference equation for birth-death processes

Authors: Seong Jun Park, M. Y. Choi

Abstract: Birth-death processes take place ubiquitously throughout the universe. In general, birth and death rates depend on the system size (corresponding to the number of products or customers undergoing the birth-death process) and thus vary every time birth or death occurs, which makes fluctuations in the rates inevitable. The differential-difference equation governing the time evolution of such a birth-death process is well established, but it resists solving for a non-asymptotic solution. In this work, we present the analytic solution of the differential-difference equation for birth-death processes without approximation. The time-dependent solution we obtain leads to an analytical expression for counting statistics of products (or customers). We further examine the relationship between the system size fluctuations and the birth and death rates, and find that statistical properties (variance subtracted by mean) of the system size are determined by the mean death rate as well as the covariance of the system size and the net growth rate (i.e., the birth rate minus the death rate). This work suggests a promising new direction for quantitative investigations into birth-death processes.

Submitted 4 April, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

arXiv:2302.02764 [pdf, other]

doi 10.1016/j.nima.2023.168449

Machine Learning based tool for CMS RPC currents quality monitoring

Authors: E. Shumka, A. Samalan, M. Tytgat, M. El Sawy, G. A. Alves, F. Marujo, E. A. Coelho, E. M. Da Costa, H. Nogima, A. Santoro, S. Fonseca De Souza, D. De Jesus Damiao, M. Thiel, K. Mota Amarilo, M. Barroso Ferreira Filho, A. Aleksandrov, R. Hadjiiska, P. Iaydjiev, M. Rodozov, M. Shopova, G. Soultanov, A. Dimitrov, L. Litov, B. Pavlov, P. Petkov, et al. (83 additional authors not shown)

Abstract: The muon system of the CERN Compact Muon Solenoid (CMS) experiment includes more than a thousand Resistive Plate Chambers (RPC). They are gaseous detectors operated in the hostile environment of the CMS underground cavern on the Large Hadron Collider where pp luminosities of up to $2\times 10^{34}$ $\text{cm}^{-2}\text{s}^{-1}$ are routinely achieved. The CMS RPC system performance is constantly monitored and the detector is regularly maintained to ensure stable operation. The main monitorable characteristics are dark current, efficiency for muon detection, noise rate, etc. Herein we describe an automated tool for CMS RPC current monitoring which uses Machine Learning techniques. We further elaborate on the previously proposed dedicated generalized linear model and add autoencoder models for self-consistent predictions as well as hybrid models to allow for RPC current predictions in the distant future.

Submitted 6 February, 2023; originally announced February 2023.

arXiv:2301.05374 [pdf]

Effect of Annealing Temperature on Minimum Domain Size of Ferroelectric Hafnia

Authors: Seokjung Yun, Hoon Kim, Myungsoo Seo, Min-Ho Kang, Taeho Kim, Seongwoo Cho, Min Hyuk Park, Sanghun Jeon, Yang-Kyu Choi, Seungbum Hong

Abstract: Here, we optimized the annealing temperature of HZO/TiN thin film heterostructure via multiscale analysis of remnant polarization, crystallographic phase, minimum ferroelectric domain size, and average grain size. We found that the remnant polarization was closely related to the relative amount of the orthorhombic phase whereas the minimum domain size was related to the relative amount of the monoclinic phase. The minimum domain size was obtained at the annealing temperature of 500$^\circ$C while the optimum remnant polarization and capacitance were obtained at the annealing temperature of 600$^\circ$C. We conclude that the minimum domain size is more important than the sheer magnitude of remnant polarization considering the retention and fatigue of switchable polarization in nanoscale ferroelectric devices. Our results are expected to contribute to the development of ultra-low-power logic transistors and next-generation non-volatile memory devices.

Submitted 12 January, 2023; originally announced January 2023.

Comments: 28 pages, 11 figures, 1 table

arXiv:2211.16591 [pdf, other]

doi 10.1016/j.nima.2023.168271

RPC based tracking system at CERN GIF++ facility

Authors: K. Mota Amarilo, A. Samalan, M. Tytgat, M. El Sawy, G. A. Alves, F. Marujo, E. A. Coelho, E. M. Da Costa, H. Nogima, A. Santoro, S. Fonseca De Souza, D. De Jesus Damiao, M. Thiel, M. Barroso Ferreira Filho, A. Aleksandrov, R. Hadjiiska, P. Iaydjiev, M. Rodozov, M. Shopova, G. Soultanov, A. Dimitrov, L. Litov, B. Pavlov, P. Petkov, A. Petrov, et al. (83 additional authors not shown)

Abstract: With the HL-LHC upgrade of the LHC machine, an increase of the instantaneous luminosity by a factor of five is expected and the current detection systems need to be validated for such working conditions to ensure stable data taking. At the CERN Gamma Irradiation Facility (GIF++) many muon detectors undergo such studies, but the high gamma background can pose a challenge to the muon trigger system, which is exposed to many fake hits from the gamma background. A tracking system using RPCs is implemented to clean the fake hits, taking advantage of the high muon efficiency of these chambers. This work presents the tracking system configuration, the detector analysis algorithm used, and results.

Submitted 29 November, 2022; originally announced November 2022.

Comments: 12 pages, 9 figures. Contribution to XVI Workshop on Resistive Plate Chambers and Related Detectors (RPC2022), September 26-30 2022. Submitted to Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment

arXiv:2211.13698 [pdf, other]

Certified data-driven physics-informed greedy auto-encoder simulator

Authors: Xiaolong He, Youngsoo Choi, William D. Fries, Jonathan L. Belof, Jiun-Shyan Chen

Abstract: A parametric adaptive greedy Latent Space Dynamics Identification (gLaSDI) framework is developed for accurate, efficient, and certified data-driven physics-informed greedy auto-encoder simulators of high-dimensional nonlinear dynamical systems. In the proposed framework, an auto-encoder and dynamics identification models are trained interactively to discover intrinsic and simple latent-space dynamics. To effectively explore the parameter space for optimal model performance, an adaptive greedy sampling algorithm integrated with a physics-informed error indicator is introduced to search for optimal training samples on the fly, outperforming the conventional predefined uniform sampling. Further, an efficient k-nearest neighbor convex interpolation scheme is employed to exploit local latent-space dynamics for improved predictability. Numerical results demonstrate that the proposed method achieves 121 to 2,658x speed-up with 1 to 5% relative errors for radial advection and 2D Burgers dynamical problems.

Submitted 24 November, 2022; originally announced November 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2204.12005

Report number: LLNL-CONF-835143

arXiv:2210.14519 [pdf]

Probabilistic Prime Factorization based on Virtually Connected Boltzmann Machine and Probabilistic Annealing

Authors: Hyundo Jung, Hyunjin Kim, Woojin Lee, Jinwoo Jeon, Yohan Choi, Taehyeong Park, Chulwoo Kim

Abstract: Probabilistic computing has been introduced to operate functional networks using a probabilistic bit (p-bit), generating 0 or 1 probabilistically from its electrical input. In contrast to quantum computers, probabilistic computing enables the operation of adiabatic algorithms even at room temperature, and is expected to broaden computational abilities in non-deterministic polynomial searching and learning problems. However, previous developments of probabilistic machines have focused on emulating the operation of quantum computers similarly, implementing every p-bit with large weight-sum matrix multiplication blocks or requiring tens of times more p-bits than semiprime bits. Furthermore, previous probabilistic machines adopted the graph model of quantum computers for updating the hardware connections, which further increased the number of sampling operations. Here we introduce a digitally accelerated prime factorization machine with a virtually connected Boltzmann machine and probabilistic annealing method, designed to reduce the complexity and number of sampling operations to below those of previous probabilistic factorization machines. 10-bit to 64-bit factorizations were performed to assess the effectiveness of the machine, and the machine offers a $1.2 \times 10^{8}$ times improvement in the number of sampling operations compared with previous factorization machines, with a 22-fold smaller hardware resource. This work shows that probabilistic machines can be implemented in a cost-effective manner using a field-programmable gate array, and hence we suggest that probabilistic computers can be employed for solving various large NP searching problems in the near future.

Submitted 26 October, 2022; originally announced October 2022.

Comments: 13 pages, 4 figures, 3 extended data figures and 1 table

MSC Class: 60G-08 ACM Class: B.0

arXiv:2209.08609 [pdf, other]

doi 10.1088/1748-0221/17/10/P10029

Neutron Tagging following Atmospheric Neutrino Events in a Water Cherenkov Detector

Authors: K. Abe, Y. Haga, Y. Hayato, K. Hiraide, K. Ieki, M. Ikeda, S. Imaizumi, K. Iyogi, J. Kameda, Y. Kanemura, Y. Kataoka, Y. Kato, Y. Kishimoto, S. Miki, S. Mine, M. Miura, T. Mochizuki, S. Moriyama, Y. Nagao, M. Nakahata, T. Nakajima, Y. Nakano, S. Nakayama, T. Okada, K. Okamoto, et al. (281 additional authors not shown)

Abstract: We present the development of neutron-tagging techniques in Super-Kamiokande IV using a neural network analysis. The detection efficiency of neutron capture on hydrogen is estimated to be 26%, with a mis-tag rate of 0.016 per neutrino event. The uncertainty of the tagging efficiency is estimated to be 9.0%. Measurement of the tagging efficiency with data from an Americium-Beryllium calibration agrees with this value within 10%. The tagging procedure was performed on 3,244.4 days of SK-IV atmospheric neutrino data, identifying 18,091 neutrons in 26,473 neutrino events. The fitted neutron capture lifetime was measured as 218 ± 9 μs.

Submitted 20 September, 2022; v1 submitted 18 September, 2022; originally announced September 2022.

Journal ref: JINST 17 P10029 (2022)

arXiv:2209.05033 [pdf]

doi 10.1103/PhysRevB.106.094311

Divergent phonon angular momentum driven by temperature and strain

Authors: Young-Jae Choi, Seung-Hoon Jhi

Abstract: The phonon angular momentum (PAM) may exhibit exotic temperature dependence as it is sensitive to the phonon lifetime. Constant phonon-lifetime approximation fails to depict such behavior. Here, we study the PAM of AlN, GaN, and graphene-like boron nitride (g-BN) monolayer with full consideration of phonon lifetime using first-principles calculations. We show that wurtzite AlN and GaN acquire divergent PAM at low temperatures from their lowest-lying phonon branches. The g-BN monolayer, on the other hand, does not have finite PAM at equilibrium structure. Rather it shows intriguing strain-dependence in PAM; the compressive strain greater than the critical size generates divergent PAM at low temperatures due to the divergent lifetime of TA phonons. As PAM couples with rotational excitations in solids associated with charge, spin, or electromagnetic fields, our study demonstrates a possibility of mechanical and thermal engineering of such excitations.

Submitted 12 September, 2022; originally announced September 2022.

Comments: 37 pages, 16 figures, to be published in Physical Review B

Journal ref: Phys. Rev. B 106, 094311 (2022)

arXiv:2208.11477 [pdf, other]

Using Conservation Laws to Infer Deep Learning Model Accuracy of Richtmyer-Meshkov Instabilities

Authors: Charles F. Jekel, Dane M. Sterbentz, Sylvie Aubry, Youngsoo Choi, Daniel A. White, Jonathan L. Belof

Abstract: Richtmyer-Meshkov Instability (RMI) is a complicated phenomenon that occurs when a shockwave passes through a perturbed interface. Over a thousand hydrodynamic simulations were performed to study the formation of RMI for a parameterized high velocity impact. Deep learning was used to learn the temporal mapping of initial geometric perturbations to the full-field hydrodynamic solutions of density and velocity. The continuity equation was used to include physical information in the loss function; however, this resulted in only very minor improvements at the cost of additional training complexity. Predictions from the deep learning model appear to accurately capture temporal RMI formations for a variety of geometric conditions within the domain. First principle physical laws were investigated to infer the accuracy of the model's predictive capability. While the continuity equation appeared to show no correlation with the accuracy of the model, conservation of mass and momentum were weakly correlated with accuracy. Since conservation laws can be quickly calculated from the deep learning model, they may be useful in applications where a relative accuracy measure is needed.

Submitted 18 July, 2022; originally announced August 2022.

Comments: Presented at ECCOMAS 2022

Report number: LLNL-CONF-837041

arXiv:2208.04671 [pdf]

Customising radiative decay dynamics of two-dimensional excitons via position- and polarisation-dependent vacuum-field interference

Authors: Sanghyeok Park, Dongha Kim, Yun-Seok Choi, Arthur Baucour, Donghyeong Kim, Sangho Yoon, Kenji Watanabe, Takashi Taniguchi, Jonghwa Shin, Jonghwan Kim, Min-Kyo Seo

Abstract: Embodying bosonic and electrically interactive characteristics in two-dimensional space, excitons in transition-metal dichalcogenides (TMDCs) have garnered considerable attention. The realisation and application of strong-correlation effects, long-range transport, and valley-dependent optoelectronic properties require customising exciton decay dynamics. Strains, defects, and electrostatic doping effectively control the decay dynamics but significantly disturb the intrinsic properties of TMDCs, such as electron band structure and exciton binding energy. Meanwhile, vacuum-field manipulation provides an optical alternative for engineering radiative decay dynamics. Planar mirrors and cavities have been employed to manage the light-matter interactions of two-dimensional excitons. However, the conventional flat platforms cannot customise the radiative decay landscape in the horizontal TMDC plane or independently control vacuum field interference at different pumping and emission frequencies. Here, we present a meta-mirror resolving the issues with more optical freedom. For neutral excitons of the monolayer MoSe2, the meta-mirror manipulated the radiative decay rate by two orders of magnitude, depending on its geometry. Moreover, we experimentally identified the correlation between emission intensity and spectral linewidth. The anisotropic meta-mirror demonstrated polarisation-dependent radiative decay control. We expect that the meta-mirror platform will be promising to tailor the two-dimensional distributions of lifetime, density, and diffusion of TMDC excitons in advanced opto-excitonic applications.

Submitted 9 August, 2022; originally announced August 2022.

arXiv:2207.11376 [pdf, other]

doi 10.1103/PhysRevB.107.L041110

Two-phonon scattering in non-polar semiconductors: a first-principles study of warm electron transport in Si

Authors: Benjamin Hatanpää, Alexander Y. Choi, Peishi S. Cheng, Austin J. Minnich

Abstract: The ab-initio theory of charge transport in semiconductors typically employs the lowest-order perturbation theory in which electrons interact with one phonon (1ph). This theory is accepted to be adequate to explain the low-field mobility of non-polar semiconductors but has not been tested extensively beyond the low-field regime. Here, we report first-principles calculations of the electric field-dependence of the electron mobility of Si as described by the warm electron coefficient, $β$. Although the 1ph theory overestimates the low-field mobility by only around 20%, it overestimates $β$ by over a factor of two over a range of temperatures and crystallographic axes. We show that the discrepancy in $β$ is reconciled by inclusion of on-shell iterated 2-phonon (2ph) scattering processes, indicating that scattering from higher-order electron-phonon interactions is non-negligible even in non-polar semiconductors. Further, a ~20% underestimate of the low-field mobility with 2ph scattering suggests that non-trivial cancellations may occur in the perturbative expansion of the electron-phonon interaction.

Submitted 22 July, 2022; originally announced July 2022.

Comments: 18 pages, 3 figures, submitted

arXiv:2207.11333 [pdf, other]

Scalable training of graph convolutional neural networks for fast and accurate predictions of HOMO-LUMO gap in molecules

Authors: Jong Youl Choi, Pei Zhang, Kshitij Mehta, Andrew Blanchard, Massimiliano Lupo Pasini

Abstract: Graph Convolutional Neural Network (GCNN) is a popular class of deep learning (DL) models in material science to predict material properties from the graph representation of molecular structures. Training an accurate and comprehensive GCNN surrogate for molecular design requires large-scale graph datasets and is usually a time-consuming process. Recent advances in GPUs and distributed computing open a path to reduce the computational cost for GCNN training effectively. However, efficient utilization of high performance computing (HPC) resources for training requires simultaneously optimizing large-scale data management and scalable stochastic batched optimization techniques. In this work, we focus on building GCNN models on HPC systems to predict material properties of millions of molecules. We use HydraGNN, our in-house library for large-scale GCNN training, leveraging distributed data parallelism in PyTorch. We use ADIOS, a high-performance data management framework for efficient storage and reading of large molecular graph data. We perform parallel training on two open-source large-scale graph datasets to build a GCNN predictor for an important quantum property known as the HOMO-LUMO gap. We measure the scalability, accuracy, and convergence of our approach on two DOE supercomputers: the Summit supercomputer at the Oak Ridge Leadership Computing Facility (OLCF) and the Perlmutter system at the National Energy Research Scientific Computing Center (NERSC). We present our experimental results with HydraGNN showing i) reduction of data loading time up to 4.2 times compared with a conventional method and ii) linear scaling performance for training up to 1,024 GPUs on both Summit and Perlmutter.

Submitted 22 July, 2022; originally announced July 2022.

Comments: 19 pages, 9 figures

MSC Class: 68Q85; 68M14; 68W15; 68W15 ACM Class: I.2.11

arXiv:2206.07780 [pdf]

A machine learning approach to predicting pore pressure response in liquefiable sands under cyclic loading

Authors: Yong** Choi, Krishna Kumar

Abstract: Shear stress history controls the pore pressure response in liquefiable soils. The excess pore pressure does not increase under cyclic loading when shear stress amplitude is lower than the peak prior amplitude -- the shielding effect. Many sophisticated constitutive models fail to capture the shielding effect observed in the cyclic liquefaction experiments. We develop a data-driven machine learning model based on the LSTM neural network to capture the liquefaction response of soils under cyclic loading. The LSTM model is trained on 12 laboratory cyclic simple shear tests on Nevada sand in loose and dense conditions subjected to different cyclic simple shear loading conditions. The LSTM model features include the relative density of soil and the previous stress history to predict the pore water pressure response. The LSTM model successfully replicates the pore pressure response for three cyclic simple test results considering the shielding and density effects.

Submitted 15 June, 2022; originally announced June 2022.

arXiv:2205.03975 [pdf, other]

doi 10.1063/5.0103156

Self-heating of cryogenic high-electron-mobility transistor amplifiers and the limits of microwave noise performance

Authors: Anthony J. Ardizzi, Alexander Y. Choi, Bekari Gabritchidze, Jacob Kooi, Kieran A. Cleary, Anthony C. Readhead, Austin J. Minnich

Abstract: The fundamental limits of the microwave noise performance of high electron mobility transistors (HEMTs) are of scientific and practical interest for applications in radio astronomy and quantum computing. Self-heating at cryogenic temperatures has been reported to be a limiting mechanism for the noise, but cryogenic cooling strategies to mitigate it, for instance using liquid cryogens, have not been evaluated. Here, we report microwave noise measurements of a packaged two-stage HEMT amplifier immersed in normal and superfluid $^4$He baths and in vacuum from 1.6 - 80 K. We find that these liquid cryogens are unable to mitigate the thermal noise associated with self-heating. Considering this finding, we examine the implications for the lower bounds of cryogenic noise performance in HEMTs. Our analysis supports the general design principle for cryogenic HEMTs of maximizing gain at the lowest possible power.

Submitted 4 August, 2022; v1 submitted 8 May, 2022; originally announced May 2022.

Comments: 36 pages (including 15 SI pages), 9 figures (including 3 SI figures), 2 tables (including 2 SI tables)

arXiv:2204.12006 [pdf, other]

doi 10.1016/j.jcp.2022.111852

Parametric Dynamic Mode Decomposition for Reduced Order Modeling

Authors: Quincy A. Huhn, Mauricio E. Tano, Jean C. Ragusa, Youngsoo Choi

Abstract: Dynamic Mode Decomposition (DMD) is a model-order reduction approach, whereby spatial modes of fixed temporal frequencies are extracted from numerical or experimental data sets. The DMD low-rank or reduced operator is typically obtained by singular value decomposition of the temporal data sets. For parameter-dependent models, as found in many multi-query applications such as uncertainty quantification or design optimization, the only parametric DMD technique developed was a stacked approach, in which data sets at multiple parameter values were aggregated together, increasing the computational work needed to devise low-rank dynamical reduced-order models. In this paper, we present two novel approaches to carry out parametric DMD: one based on the interpolation of the reduced-order DMD eigenpair and the other based on the interpolation of the reduced DMD (Koopman) operator. Numerical results are presented for diffusion-dominated nonlinear dynamical problems, including a multiphysics radiative transfer example. All three parametric DMD approaches are compared.

Submitted 25 April, 2022; originally announced April 2022.

Comments: 29 pages, 10 figures

arXiv:2204.12005 [pdf, other]

doi 10.1016/j.jcp.2023.112267

gLaSDI: Parametric Physics-informed Greedy Latent Space Dynamics Identification

Authors: Xiaolong He, Youngsoo Choi, William D. Fries, Jon Belof, Jiun-Shyan Chen

Abstract: A parametric adaptive physics-informed greedy Latent Space Dynamics Identification (gLaSDI) method is proposed for accurate, efficient, and robust data-driven reduced-order modeling of high-dimensional nonlinear dynamical systems. In the proposed gLaSDI framework, an autoencoder discovers intrinsic nonlinear latent representations of high-dimensional data, while dynamics identification (DI) models capture local latent-space dynamics. An interactive training algorithm is adopted for the autoencoder and local DI models, which enables identification of simple latent-space dynamics and enhances accuracy and efficiency of data-driven reduced-order modeling. To maximize and accelerate the exploration of the parameter space for the optimal model performance, an adaptive greedy sampling algorithm integrated with a physics-informed residual-based error indicator and random-subset evaluation is introduced to search for the optimal training samples on the fly. Further, to exploit local latent-space dynamics captured by the local DI models for an improved modeling accuracy with a minimum number of local DI models in the parameter space, a k-nearest neighbor convex interpolation scheme is employed. The effectiveness of the proposed framework is demonstrated by modeling various nonlinear dynamical problems, including Burgers equations, nonlinear heat conduction, and radial advection. The proposed adaptive greedy sampling outperforms the conventional predefined uniform sampling in terms of accuracy. Compared with the high-fidelity models, gLaSDI achieves 17 to 2,658x speed-up with 1 to 5% relative errors.

Submitted 18 May, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

arXiv:2204.05917 [pdf]

Spatiotemporal Estimation of TROPOMI NO2 Column with Depthwise Partial Convolutional Neural Network

Authors: Yannic Lops, Masoud Ghahremanloo, Arman Pouyaei, Yunsoo Choi, Jia Jung, Seyedali Mousavinezhad, Ahmed Khan Salman, Davyda Hammond

Abstract: Satellite-derived measurements are negatively impacted by cloud cover and surface reflectivity. These biases must be discarded and significantly increase the amount of missing data within remote sensing images. This paper expands the application of a partial convolutional neural network (PCNN) to incorporate depthwise convolution layers, conferring temporal dimensionality to the imputation process. The addition of a temporal dimension to the imputation process adds a state of successive existence within the dataset which spatial imputation cannot capture. The depthwise convolution process enables the PCNN to independently convolve the data for each channel. The deep learning system is trained with the Community Multiscale Air Quality model-simulated tropospheric column density of Nitrogen Dioxide (TCDNO2) to impute TROPOspheric Monitoring Instrument TCDNO2. The depthwise PCNN model achieves an index of agreement of 0.82 and outperforms the default PCNN models, with and without temporal dimensionality of data, and conventional data imputation methods such as inverse distance weighting by 3-11% and 8-15% in the index of agreement and correlation, respectively. The model demonstrates more consistency in the reconstruction of TROPOspheric Monitoring Instrument tropospheric column density of NO2 images. The model has also demonstrated the accurate imputation of remote sensing images with over 95% of the data missing. PCNN enables the accurate imputation of remote sensing data with large regions of missing data and will benefit future researchers conducting data assimilation for numerical models, emission studies, and human health impact analyses from air pollution.

Submitted 12 April, 2022; originally announced April 2022.

Comments: Keywords: Partial Convolution, Depthwise, TROPOMI, Kriging, Spatiotemporal Imputation, CMAQ. 12 pages & 6 figures

arXiv:2202.02335 [pdf]

doi 10.1038/s41586-022-05229-4

Spontaneous generation and active manipulation of real-space optical vortex

Authors: Dongha Kim, Arthur Baucour, Yun-Seok Choi, Jonghwa Shin, Min-Kyo Seo

Abstract: Optical vortices host the orbital nature of photons, which offers an extra degree of freedom in photonic applications. Unlike vortices in other physical entities, optical vortices require structural singularities, which restrict their abilities in terms of dynamic and interactive characteristics. In this study, we present the spontaneous generation and external magnetic field-induced manipulation of an optical vortex and antivortex. A gradient-thickness optical cavity (GTOC) consisting of an Al/SiO2/Ni/SiO2 multilayer structure realised the distinct transition between the trivial and non-trivial topological phases, depending on the magneto-optic effects of the Ni layer. In the non-trivial topological phase, the mathematical singularities generating the optical vortex and antivortex pair in the reflected light existed in the generalised parameter space of the thicknesses of the top and bottom SiO2 layers, which is bijective to the real space of the GTOC. Coupled with the magnetisation, the optical vortex and antivortex in the GTOC experienced an effective spin-orbit interaction and showed topology-dependent dynamics under external magnetic fields. We expect that field-induced engineering of optical vortices will pave the way for the study of topological photonic interactions and their applications.

Submitted 4 February, 2022; originally announced February 2022.

Comments: 22 pages, 4 figures

arXiv:2202.01954 [pdf, other]

doi 10.1088/2632-2153/ac6a51

Multi-task graph neural networks for simultaneous prediction of global and atomic properties in ferromagnetic systems

Authors: Massimiliano Lupo Pasini, Pei Zhang, Samuel Temple Reeve, Jong Youl Choi

Abstract: We introduce a multi-tasking graph convolutional neural network, HydraGNN, to simultaneously predict both global and atomic physical properties and demonstrate with ferromagnetic materials. We train HydraGNN on an open-source ab initio density functional theory (DFT) dataset for iron-platinum (FePt) with a fixed body centered tetragonal (BCT) lattice structure and fixed volume to simultaneously predict the mixing enthalpy (a global feature of the system), the atomic charge transfer, and the atomic magnetic moment across configurations that span the entire compositional range. By taking advantage of underlying physical correlations between material properties, multi-task learning (MTL) with HydraGNN provides effective training even with modest amounts of data. Moreover, this is achieved with just one architecture instead of three, as required by single-task learning (STL). The first convolutional layers of the HydraGNN architecture are shared by all learning tasks and extract features common to all material properties. The following layers discriminate the features of the different properties, the results of which are fed to the separate heads of the final layer to produce predictions. Numerical results show that HydraGNN effectively captures the relation between the configurational entropy and the material properties over the entire compositional range. Overall, the accuracy of simultaneous MTL predictions is comparable to the accuracy of the STL predictions. In addition, the computational cost of training HydraGNN for MTL is much lower than the original DFT calculations and also lower than training separate STL models for each property.

Submitted 3 February, 2022; originally announced February 2022.

Comments: 13 pages, 6 figures

Journal ref: Mach. Learn.: Sci. Technol. 3 025007 (2022)

arXiv:2201.11912 [pdf, other]

doi 10.1103/PhysRevB.106.245201

High-field transport and hot electron noise in GaAs from first principles: role of two-phonon scattering

Authors: Peishi S. Cheng, Jiace Sun, Shi-Ning Sun, Alexander Y. Choi, Austin J. Minnich

Abstract: High-field charge transport in semiconductors is of fundamental interest and practical importance. While the ab initio treatment of low-field transport is well-developed, the treatment of high-field transport is much less so, particularly for multi-phonon processes that are reported to be relevant in GaAs. Here, we report a calculation of the high-field transport properties and current power spectral density (PSD) of hot electrons in GaAs from first principles including on-shell two-phonon (2ph) scattering. The on-shell 2ph scattering rates are found to qualitatively alter the high-field distribution function by increasing both the momentum and energy relaxation rates as well as contributing markedly to intervalley scattering. This finding reconciles a long-standing discrepancy regarding the strength of intervalley scattering in GaAs as inferred from transport and optical studies. The characteristic non-monotonic trend of PSD with electric field is not predicted at this level of theory. Our work shows how ab initio calculations of high-field transport and noise may be used as a stringent test of the electron-phonon interaction in semiconductors.

Submitted 6 July, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

Comments: 34 pages, 6 figures, submitted to Physical Review B

arXiv:2201.08331 [pdf]

doi 10.1002/smll.202206604

Pseudo-hydrodynamic flow of quasiparticles in semimetal WTe2 at room temperature

Authors: Young-Gwan Choi, Manh-Ha Doan, Gyung-Min Choi, Maxim N. Chernodub

Abstract: Recently, much interest has emerged in fluid-like electric charge transport in various solid-state systems. The hydrodynamic behavior of the electronic fluid reveals itself as a decrease of the electrical resistance with increasing temperature (the Gurzhi effect) in narrow conducting channels, polynomial scaling of the resistance as a function of the channel width, substantial violation of the Wiedemann-Franz law supported by the emergence of the Poiseuille flow. Similarly to whirlpools in flowing water, the viscous electronic flow generates vortices, resulting in abnormal sign-changing electrical response driven by the backflow of electrical current. Experimentally, the presence of the hydrodynamic vortices was observed in low-temperature graphene as a negative voltage drop near the current-injecting contacts. However, the question of whether the long-ranged sign-changing electrical response can be produced by a mechanism other than hydrodynamics has not been addressed so far. Here we use polarization-sensitive laser microscopy to demonstrate the emergence of visually similar abnormal sign-alternating patterns in charge density in multilayer tungsten ditelluride at room temperature where this material does not exhibit true electronic hydrodynamics. We argue that this pseudo-hydrodynamic behavior appears due to a subtle interplay between the diffusive transport of electrons and holes. In particular, the sign-alternating charge accumulation in WTe2 is supported by the unexpected backflow of compressible neutral electron-hole current, which creates charge-neutral whirlpools in the bulk of this nearly compensated semimetal. We demonstrate that the exceptionally large spatial size of the charge domains is sustained by the long recombination time of electron-hole pairs.

Submitted 20 January, 2022; originally announced January 2022.

Comments: 14 pages, 3 figures + supplementary material

Journal ref: Small 2023, 2206604

arXiv:2109.14331

doi 10.1088/1748-0221/17/01/C01011

Upgrade of the CMS Resistive Plate Chambers for the High Luminosity LHC

Authors: A. Samalan, M. Tytgat, G. A. Alves, F. Marujo, F. Torres Da Silva De Araujo, E. M. DaCosta, D. De Jesus Damiao, H. Nogima, A. Santoro, S. Fonseca De Souza, A. Aleksandrov, R. Hadjiiska, P. Iaydjiev, M. Rodozov, M. Shopova, G. Soultanov, M. Bonchev, A. Dimitrov, L. Litov, B. Pavlov, P. Petkov, A. Petrov, S. J. Qian, C. Bernal, A. Cabrera , et al. (86 additional authors not shown)

Abstract: During the upcoming High Luminosity phase of the Large Hadron Collider (HL-LHC), the integrated luminosity of the accelerator will increase to 3000 fb$^{-1}$. The expected experimental conditions in that period in terms of background rates, event pileup, and the probable aging of the current detectors present a challenge for all the existing experiments at the LHC, including the Compact Muon Solenoid (CMS) experiment. To ensure a highly performing muon system for this period, several upgrades of the Resistive Plate Chamber (RPC) system of the CMS are currently being implemented. These include the replacement of the readout system for the present system, and the installation of two new RPC stations with improved chamber and front-end electronics designs. The current overall status of this CMS RPC upgrade project is presented.

Submitted 2 November, 2021; v1 submitted 29 September, 2021; originally announced September 2021.

arXiv:2108.03370

doi 10.1063/5.0069352

Theory of drain noise in high electron mobility transistors based on real-space transfer

Authors: Iretomiwa Esho, Alexander Y. Choi, Austin J. Minnich

Abstract: High electron mobility transistors are widely used as microwave amplifiers owing to their low microwave noise figure. Electronic noise in these devices is typically modeled by noise sources at the gate and drain. While consensus exists regarding the origin of the gate noise, that of drain noise is a topic of debate. Here, we report a theory of drain noise as a type of partition noise arising from real-space transfer of hot electrons from the channel to the barrier. The theory accounts for the magnitude and dependencies of the drain temperature and suggests strategies to realize devices with lower noise figure.

Submitted 7 August, 2021; originally announced August 2021.

arXiv:2108.02411

doi 10.3390/s21186255

Locking Multi-laser Frequencies to a Precision Wavelength Meter: Application to Cold Atoms

Authors: Junwoo Kim, Keumhyun Kim, Dowon Lee, Yongha Shin, Sungsam Kang, Jung-Ryul Kim, Youngwoon Choi, Kyungwon An, Moonjoo Lee

Abstract: We herein report a simultaneous frequency stabilization of two 780-nm external cavity diode lasers using a precision wavelength meter (WLM). The laser lock performance is characterized by the Allan deviation measurement in which we find $σ_{y}=10^{-12}$ at an averaging time of 1000 s. We also obtain spectral profiles through a heterodyne spectroscopy, identifying the contribution of white and flicker noises to the laser linewidth. The frequency drift of the WLM is measured to be about 2.0(4) MHz over 36 hours. Utilizing the two lasers as a cooling and repumping field, we demonstrate a magneto-optical trap of $^{87}$Rb atoms near a high-finesse optical cavity. Our laser stabilization technique operates at broad wavelength range without a radio frequency element.

Submitted 22 September, 2021; v1 submitted 5 August, 2021; originally announced August 2021.

Comments: 6 pages, 5 figures

Journal ref: Sensors 21, 6255 (2021)

Showing 1–50 of 119 results for author: Choi, Y