-
Improved limit on neutrinoless double beta decay of $^{100}$Mo from AMoRE-I
Authors:
A. Agrawal,
V. V. Alenkov,
P. Aryal,
J. Beyer,
B. Bhandari,
R. S. Boiko,
K. Boonin,
O. Buzanov,
C. R. Byeon,
N. Chanthima,
M. K. Cheoun,
J. S. Choe,
Seonho Choi,
S. Choudhury,
J. S. Chung,
F. A. Danevich,
M. Djamal,
D. Drung,
C. Enss,
A. Fleischmann,
A. M. Gangapshev,
L. Gastaldo,
Y. M. Gavrilyuk,
A. M. Gezhaev,
O. Gileva
, et al. (83 additional authors not shown)
Abstract:
AMoRE searches for the signature of neutrinoless double beta decay of $^{100}$Mo with a 100 kg sample of enriched $^{100}$Mo. Scintillating molybdate crystals coupled with a metallic magnetic calorimeter operate at milli-Kelvin temperatures to measure the energy of electrons emitted in the decay. As a demonstration of the full-scale AMoRE, we conducted AMoRE-I, a pre-experiment with 18 molybdate crystals, at the Yangyang Underground Laboratory for over two years. The exposure was 8.02 kg$\cdot$year (or 3.89 kg$_{\mathrm{^{100}Mo}}\cdot$year) and the total background rate near the Q-value was 0.025 $\pm$ 0.002 counts/keV/kg/year. We observed no indication of $0νββ$ decay and report a new lower limit of the half-life of $^{100}$Mo $0νββ$ decay as $ T^{0ν}_{1/2}>3.0\times10^{24}~\mathrm{years}$ at the 90% confidence level. The effective Majorana mass limit range is $m_{ββ}<$(210--610) meV using nuclear matrix elements estimated in the framework of different models, including the recent shell model calculations.
Submitted 8 July, 2024;
originally announced July 2024.
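The quoted exposure and half-life limit in the AMoRE-I abstract can be related by a rough counting estimate. The sketch below converts the 3.89 kg$\cdot$year $^{100}$Mo exposure into nucleus-years and computes how many decays would be expected if the half-life sat exactly at the quoted limit. The molar mass of $^{100}$Mo (about 99.907 g/mol) and the 100% detection efficiency are assumptions made here for illustration; they are not taken from the abstract, and this is not the collaboration's analysis method.

```python
import math

AVOGADRO = 6.02214076e23     # atoms per mole (exact SI value)
MOLAR_MASS_MO100 = 99.907    # g/mol for 100Mo; assumed, not stated in the abstract


def expected_decays(exposure_kg_yr, half_life_yr, efficiency=1.0):
    """Decays expected in an isotope exposure, valid when live time << half-life."""
    # Convert kg*year of isotope into (number of nuclei) * years.
    nuclei_years = exposure_kg_yr * 1000.0 / MOLAR_MASS_MO100 * AVOGADRO
    # For t << T_1/2 the decay probability per nucleus is ln(2) * t / T_1/2.
    return math.log(2) * nuclei_years * efficiency / half_life_yr


# Values from the abstract: 3.89 kg(100Mo)*year exposure, T_1/2 > 3.0e24 years.
n_at_limit = expected_decays(3.89, 3.0e24)
print(f"{n_at_limit:.1f} decays expected at the limit (100% efficiency assumed)")
```

Under these assumptions the limit corresponds to only a handful of decays over the whole exposure, which is why the background rate near the Q-value matters so much for the sensitivity.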
-
Graph Neural Network as Computationally Efficient Emulator of Ice-sheet and Sea-level System Model (ISSM)
Authors:
Younghyun Koo,
Maryam Rahnemoonfar
Abstract:
The Ice-sheet and Sea-level System Model (ISSM) provides solutions for Stokes equations relevant to ice sheet dynamics by employing finite element methods and fine mesh adaptation. However, since its finite element method is compatible only with Central Processing Units (CPUs), there is limited room to further reduce ISSM's computational time. Thus, by taking advantage of Graphics Processing Units (GPUs), we design a graph convolutional network (GCN) as a fast emulator for ISSM. The GCN is trained and tested using the 20-year transient ISSM simulations in the Pine Island Glacier (PIG). The GCN reproduces ice thickness and velocity with a correlation coefficient greater than 0.998, outperforming the traditional convolutional neural network (CNN). Additionally, the GCN runs 34 times faster than the CPU-based ISSM modeling. The GPU-based GCN emulator allows us to predict how the PIG will change in the future under different melting rate scenarios with high fidelity and much shorter computational time.
Submitted 26 June, 2024;
originally announced July 2024.
-
NAIST Simultaneous Speech Translation System for IWSLT 2024
Authors:
Yuka Ko,
Ryo Fukuda,
Yuta Nishikawa,
Yasumasa Kano,
Tomoya Yanagita,
Kosuke Doi,
Mana Makinae,
Haotian Tan,
Makoto Sakai,
Sakriani Sakti,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
This paper describes NAIST's submission to the simultaneous track of the IWSLT 2024 Evaluation Campaign: English-to-{German, Japanese, Chinese} speech-to-text translation and English-to-Japanese speech-to-speech translation. We develop a multilingual end-to-end speech-to-text translation model combining two pre-trained language models, HuBERT and mBART. We trained this model with two decoding policies, Local Agreement (LA) and AlignAtt. The submitted models employ the LA policy because it outperformed the AlignAtt policy in previous models. Our speech-to-speech translation method is a cascade of the above speech-to-text model and an incremental text-to-speech (TTS) module that incorporates a phoneme estimation model, a parallel acoustic model, and a parallel WaveGAN vocoder. We improved our incremental TTS by applying the Transformer architecture with the AlignAtt policy for the estimation model. The results show that our upgraded TTS module contributed to improving the system performance.
Submitted 30 June, 2024;
originally announced July 2024.
-
Graph Neural Networks for Emulation of Finite-Element Ice Dynamics in Greenland and Antarctic Ice Sheets
Authors:
Younghyun Koo,
Maryam Rahnemoonfar
Abstract:
Although numerical models provide accurate solutions for ice sheet dynamics based on physical laws, they incur heavy computational costs in solving partial differential equations. In recent years, convolutional neural networks (CNNs) have been widely used as statistical emulators for those numerical models. However, since CNNs operate on regular grids, they cannot represent the refined meshes and computational efficiency of finite-element numerical models. Therefore, instead of CNNs, this study adopts an equivariant graph convolutional network (EGCN) as an emulator for ice sheet dynamics modeling. EGCN reproduces ice thickness and velocity changes in the Helheim Glacier, Greenland, and Pine Island Glacier, Antarctica, with computation that is 260 times and 44 times faster, respectively. Compared to the traditional CNN and graph convolutional network, EGCN shows outstanding accuracy in thickness prediction near fast ice streams by preserving the equivariance to the translation and rotation of graphs.
Submitted 26 June, 2024;
originally announced June 2024.
-
Projected background and sensitivity of AMoRE-II
Authors:
A. Agrawal,
V. V. Alenkov,
P. Aryal,
J. Beyer,
B. Bhandari,
R. S. Boiko,
K. Boonin,
O. Buzanov,
C. R. Byeon,
N. Chanthima,
M. K. Cheoun,
J. S. Choe,
Seonho Choi,
S. Choudhury,
J. S. Chung,
F. A. Danevich,
M. Djamal,
D. Drung,
C. Enss,
A. Fleischmann,
A. M. Gangapshev,
L. Gastaldo,
Y. M. Gavrilyuk,
A. M. Gezhaev,
O. Gileva
, et al. (81 additional authors not shown)
Abstract:
AMoRE-II aims to search for neutrinoless double beta decay with an array of 423 Li$_2$$^{100}$MoO$_4$ crystals operating in the cryogenic system as the main phase of the Advanced Molybdenum-based Rare process Experiment (AMoRE). AMoRE has been planned to operate in three phases: AMoRE-pilot, AMoRE-I, and AMoRE-II. AMoRE-II is currently being installed at the Yemi Underground Laboratory, located approximately 1000 meters deep in Jeongseon, Korea. The goal of AMoRE-II is to reach up to $T^{0νββ}_{1/2}$ $\sim$ 6 $\times$ 10$^{26}$ years, corresponding to an effective Majorana mass of 15--29 meV, covering all the inverted mass hierarchy regions. To achieve this, the background level of the experimental configurations and possible background sources of gamma and beta events should be well understood. We have intensively performed Monte Carlo simulations using the GEANT4 toolkit in all the experimental configurations with potential sources. We report the estimated background level that meets the 10$^{-4}$ counts/(keV$\cdot$kg$\cdot$yr) requirement for AMoRE-II in the region of interest (ROI) and show the projected half-life sensitivity based on the simulation study.
Submitted 13 June, 2024;
originally announced June 2024.
-
Word Order in English-Japanese Simultaneous Interpretation: Analyses and Evaluation using Chunk-wise Monotonic Translation
Authors:
Kosuke Doi,
Yuka Ko,
Mana Makinae,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
This paper analyzes the features of monotonic translations, which follow the word order of the source language, in simultaneous interpreting (SI). The word order differences are one of the biggest challenges in SI, especially for language pairs with significant structural differences like English and Japanese. We analyzed the characteristics of monotonic translations using the NAIST English-to-Japanese Chunk-wise Monotonic Translation Evaluation Dataset and found some grammatical structures that make monotonic translation difficult in English-Japanese SI. We further investigated the features of monotonic translations through evaluating the output from the existing speech translation (ST) and simultaneous speech translation (simulST) models on NAIST English-to-Japanese Chunk-wise Monotonic Translation Evaluation Dataset as well as on existing test sets. The results suggest that the existing SI-based test set underestimates the model performance. We also found that the monotonic-translation-based dataset would better evaluate simulST models, while using an offline-based test set for evaluating simulST models underestimates the model performance.
Submitted 13 June, 2024;
originally announced June 2024.
-
MultiPragEval: Multilingual Pragmatic Evaluation of Large Language Models
Authors:
Dojun Park,
Jiwoo Lee,
Seohyun Park,
Hyeyun Jeong,
Youngeun Koo,
Soonha Hwang,
Seonwoo Park,
Sungeun Lee
Abstract:
As the capabilities of LLMs expand, it becomes increasingly important to evaluate them beyond basic knowledge assessment, focusing on higher-level language understanding. This study introduces MultiPragEval, a robust test suite designed for the multilingual pragmatic evaluation of LLMs across English, German, Korean, and Chinese. Comprising 1200 question units categorized according to Grice's Cooperative Principle and its four conversational maxims, MultiPragEval enables an in-depth assessment of LLMs' contextual awareness and their ability to infer implied meanings. Our findings demonstrate that Claude3-Opus significantly outperforms other models in all tested languages, establishing a state-of-the-art in the field. Among open-source models, Solar-10.7B and Qwen1.5-14B emerge as strong competitors. This study not only leads the way in the multilingual evaluation of LLMs in pragmatic inference but also provides valuable insights into the nuanced capabilities necessary for advanced language comprehension in AI systems.
Submitted 11 June, 2024;
originally announced June 2024.
-
Exclusion of the Cosmological Triangle in Reactor-Based Search for Axion-Like Particles
Authors:
Byung Ju Park,
Jae ** Choi,
Eunju Jeon,
**yu Kim,
Kyungwon Kim,
Sung Hyun Kim,
Sun Kee Kim,
Yeongduk Kim,
Young Ju Ko,
Byoung-Cheol Koh,
Chang Hyon Ha,
Seo Hyun Lee,
In Soo Lee,
Hyunseok Lee,
Hyun Su Lee,
Jaison Lee,
Yoomin Oh,
Doo** Kim
Abstract:
We report new constraints on axion-like particles (ALPs) using data corresponding to a sodium iodide target exposure of 3063 kg$\cdot$days from the Neutrino Elastic scattering Observation with NaI (NEON) experiment. A 16.7 kg thallium-doped sodium iodide target was located 23.7 meters from a 2.8 GW thermal power nuclear reactor. We searched for ALPs produced by high-flux photons by comparing the energy spectra of data collected during reactor-on (1596 kg$\cdot$days exposure) and reactor-off (1467 kg$\cdot$days exposure) periods. No signal consistent with ALP interaction was identified, allowing us to set exclusion limits at the 95% confidence level. Our limits cover previously unexplored regions for both photon couplings (${g_{aγ}}$) and electron couplings (${g_{ae}}$) for axion masses around 1 MeV/c$^2$. Notably, the NEON data exclude, for the first time, the region within the "cosmological triangle" that laboratory-based searches for photon couplings had left unconstrained. The observed 95% confidence level limits reach as low as ${g_{aγ}}$ of 4.33$\times$ 10$^{-8}$ GeV$^{-1}$ and ${g_{ae}}$ of 1.10$\times$ 10$^{-9}$ for axion masses of 1.7 MeV/c$^2$ and 1.0 MeV/c$^2$, respectively.
Submitted 11 June, 2024; v1 submitted 10 June, 2024;
originally announced June 2024.
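The exposure figures in the NEON ALP abstract can be cross-checked with simple arithmetic. The sketch below confirms that the reactor-on and reactor-off exposures add up to the quoted total and derives an approximate live time. The live-time figure assumes the full 16.7 kg target contributed throughout, which the abstract does not state, so treat it as an illustration only.

```python
# Exposure values quoted in the abstract, in kg*days.
REACTOR_ON_KG_DAYS = 1596
REACTOR_OFF_KG_DAYS = 1467
TARGET_MASS_KG = 16.7  # quoted target mass


def exposure_summary(on_kg_days, off_kg_days, mass_kg):
    """Return total exposure, reactor-on fraction, and implied live days."""
    total = on_kg_days + off_kg_days
    on_fraction = on_kg_days / total
    # Assumption: the whole target mass was live for the entire data set.
    live_days = total / mass_kg
    return total, on_fraction, live_days


total, on_fraction, live_days = exposure_summary(
    REACTOR_ON_KG_DAYS, REACTOR_OFF_KG_DAYS, TARGET_MASS_KG
)
print(f"total exposure: {total} kg*days")
print(f"reactor-on fraction: {on_fraction:.2f}")
print(f"implied live time: {live_days:.0f} days (full-mass assumption)")
```

The near-even split between reactor-on and reactor-off data is what makes the on/off spectral comparison described in the abstract possible.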
-
Recover as It is Designed to Be: Recovering from Compatibility Mobile App Crashes by Reusing User Flows
Authors:
Donghwi Kim,
Hyungjun Yoon,
Chang Min Park,
Su** Han,
Young** Kwon,
Steven Y. Ko,
Sung-Ju Lee
Abstract:
Android OS is severely fragmented by API updates and device vendors' OS customization, creating a market condition where vastly different OS versions coexist. This gives rise to compatibility crash problems where Android apps crash on certain Android versions but not on others. Although well-known, this problem is extremely challenging for app developers to overcome due to the sheer number of Android versions in the market that must be tested. We present RecoFlow, a framework for enabling app developers to automatically recover an app from a crash by programming user flows with our API and visual tools. RecoFlow tracks app feature usage with the user flows on user devices and recovers an app from a crash by replaying UI actions of the app feature disrupted by the crash. To prevent recurring compatibility crashes, RecoFlow executes a previously crashed app in compatibility mode that is enabled by our novel Android OS virtualization technique. Our evaluation with professional Android developers shows that our API and tools are easy to use and effective in recovering from compatibility crashes.
Submitted 3 June, 2024;
originally announced June 2024.
-
Physics-Informed Machine Learning On Polar Ice: A Survey
Authors:
Zesheng Liu,
YoungHyun Koo,
Maryam Rahnemoonfar
Abstract:
The mass loss of the polar ice sheets contributes considerably to ongoing sea-level rise and changing ocean circulation, leading to coastal flooding and risking the homes and livelihoods of tens of millions of people globally. To address the complex problem of ice behavior, physical models and data-driven models have been proposed in the literature. Although traditional physical models can guarantee physically meaningful results, they have limitations in producing high-resolution results. On the other hand, data-driven approaches require large amounts of high-quality and labeled data, which is rarely available in the polar regions. Hence, as a promising framework that leverages the advantages of physical models and data-driven methods, physics-informed machine learning (PIML) has been widely studied in recent years. In this paper, we review the existing algorithms of PIML, provide our own taxonomy based on the methods of combining physics and data-driven approaches, and analyze the advantages of PIML in the aspects of accuracy and efficiency. Further, our survey discusses some current challenges and highlights future opportunities, including PIML on sea ice studies, PIML with different combination methods and backbone networks, and neural operator methods.
Submitted 30 April, 2024;
originally announced April 2024.
-
Infrared resonance-lattice device technology
Authors:
Robert Magnusson,
Yeong H. Ko,
Kyu J. Lee,
Fairooz A. Simlan,
Pawarat Bootpakdeetam,
Renjie Chen,
Debra Wawro Weidanz,
Susanne Gimlin,
Soroush Ghaffari
Abstract:
We present subwavelength resonant lattices fashioned as nano- and microstructured films as a basis for a host of device concepts. Whereas the canonical physical properties are fully embodied in a one-dimensional periodic lattice, the final device constructs are often patterned in two-dimensionally-modulated films, in which case we may refer to them as photonic crystal slabs, metamaterials, or metasurfaces. These surfaces can support lateral modes and localized field signatures with propagative and evanescent diffraction channels critically controlling the response. The governing principle of guided-mode, or lattice, resonance enables diverse spectral expressions such that a single-layer component can behave as a sensor, reflector, filter, or polarizer. This structural sparsity contrasts strongly with the venerable field of multi-layer thin-film optics that is the basis for most optical components on the market today. The lattice resonance effect can be exploited in all major spectral regions with appropriate low-loss materials and fabrication resources. In this paper, we highlight resonant device technology and present our work on design, fabrication, and characterization of optical elements operating in the near-IR, mid-IR, and long-wave IR spectral regions. Examples of fabricated and tested devices include biological sensors, high-contrast-ratio polarizers, narrow-band notch filters, and wideband high reflectors.
Submitted 19 April, 2024;
originally announced April 2024.
-
Upgrade of NaI(Tl) crystal encapsulation for the NEON experiment
Authors:
J. J. Choi,
E. J. Jeon,
J. Y. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
Y. D. Kim,
Y. J. Ko,
B. C. Koh,
C. Ha,
B. J. Park,
S. H. Lee,
I. S. Lee,
H. Lee,
H. S. Lee,
J. Lee,
Y. M. Oh
Abstract:
The Neutrino Elastic-scattering Observation with NaI(Tl) experiment (NEON) aims to detect coherent elastic neutrino-nucleus scattering (CE$ν$NS) in a NaI(Tl) crystal using reactor electron antineutrinos at the Hanbit nuclear power plant complex. A total of 13.3 kg of NaI(Tl) crystals were initially installed in December 2020 at the tendon gallery, 23.7$\pm$0.3\,m away from the reactor core, which operates at a thermal power of 2.8\,GW. Initial engineering operation, performed from May 2021 to March 2022, revealed unexpected photomultiplier-induced noise and a decreased light yield, both caused by leakage of liquid scintillator into the detector due to weak detector encapsulation. We upgraded the detector encapsulation design to prevent the leakage of the liquid scintillator. Meanwhile, two small-sized detectors were replaced with larger ones, resulting in a total mass of 16.7\,kg. With this new design implementation, the detector system has been operating stably since April 2022 for over a year without a drop in detector gain. In this paper, we present an improved crystal encapsulation design and the stability of the NEON experiment.
Submitted 28 June, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
A Study of Vulnerability Repair in JavaScript Programs with Large Language Models
Authors:
Tan Khang Le,
Saba Alimadadi,
Steven Y. Ko
Abstract:
In recent years, JavaScript has become the most widely used programming language, especially in web development. However, writing secure JavaScript code is not trivial, and programmers often make mistakes that lead to security vulnerabilities in web applications. Large Language Models (LLMs) have demonstrated substantial advancements across multiple domains, and their evolving capabilities indicate their potential for automatic code generation based on a required specification, including automatic bug fixing. In this study, we explore the accuracy of LLMs, namely ChatGPT and Bard, in finding and fixing security vulnerabilities in JavaScript programs. We also investigate the impact of context in a prompt on directing LLMs to produce a correct patch of vulnerable JavaScript code. Our experiments on real-world software vulnerabilities show that while LLMs are promising in automatic program repair of JavaScript code, achieving a correct bug fix often requires an appropriate amount of context in the prompt.
Submitted 19 March, 2024;
originally announced March 2024.
-
HearHere: Mitigating Echo Chambers in News Consumption through an AI-based Web System
Authors:
Youngseung Jeon,
Jaehoon Kim,
Sohyun Park,
Yunyong Ko,
Seongeun Ryu,
Sang-Wook Kim,
Kyungsik Han
Abstract:
Considerable efforts are currently underway to mitigate the negative impacts of echo chambers, such as increased susceptibility to fake news and resistance towards accepting scientific evidence. Prior research has presented the development of computer systems that support the consumption of news information from diverse political perspectives to mitigate the echo chamber effect. However, existing studies still lack the ability to effectively support the key processes of news information consumption and quantitatively identify a political stance towards the information. In this paper, we present HearHere, an AI-based web system designed to help users accommodate information and opinions from diverse perspectives. HearHere facilitates the key processes of news information consumption through two visualizations. Visualization 1 provides political news with quantitative political stance information, derived from our graph-based political classification model, and users can experience diverse perspectives (Hear). Visualization 2 allows users to express their opinions on specific political issues in a comment form and observe the position of their own opinions relative to pro-liberal and pro-conservative comments presented on a map interface (Here). Through a user study with 94 participants, we demonstrate the feasibility of HearHere in supporting the consumption of information from various perspectives. Our findings highlight the importance of providing political stance information and quantifying users' political status as a means to mitigate political polarization. In addition, we propose design implications for system development, including the consideration of demographics such as political interest and providing users with initiatives.
Submitted 29 February, 2024; v1 submitted 28 February, 2024;
originally announced February 2024.
-
Waveform Simulation for Scintillation Characteristics of NaI(Tl) Crystal
Authors:
J. J. Choi,
C. Ha,
E. J. Jeon,
K. W. Kim,
S. K. Kim,
Y. D. Kim,
Y. J. Ko,
B. C. Koh,
H. S. Lee,
S. H. Lee,
S. M. Lee,
B. J. Park,
G. H. Yu
Abstract:
The lowering of the energy threshold in the NaI detector is crucial not only for comprehensive validation of DAMA/LIBRA but also for exploring new possibilities in the search for low-mass dark matter and observing coherent elastic scattering between neutrino and nucleus. Alongside hardware enhancements, extensive efforts have focused on refining event selection to discern noise, achieved through parameter development and the application of machine learning. Acquiring pure, unbiased datasets is crucial in this endeavor, for which a waveform simulation was developed. The simulation data were compared with the experimental data using several pulse shape discrimination parameters to test its performance in describing the experimental data. Additionally, we present the outcomes of multi-variable machine learning trained with simulation data as a scintillation signal sample. The distributions of outcomes for experimental and simulation data show a good agreement. As an application of the waveform simulation, we validate the trigger efficiency alongside estimations derived from the minimally biased measurement data.
Submitted 17 June, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
Measurements of low energy nuclear recoil quenching factors for Na and I recoils in the NaI(Tl) scintillator
Authors:
S. H. Lee,
H. W. Joo,
H. J. Kim,
K. W. Kim,
S. K. Kim,
Y. D. Kim,
Y. J. Ko,
H. S. Lee,
J. Y. Lee,
H. S. Park,
Y. S. Yoon
Abstract:
Elastic scattering off nuclei in target detectors, involving interactions with dark matter and coherent elastic neutrino nuclear recoil (CE$ν$NS), results in the deposition of low energy within the nuclei, dissipating rapidly through a combination of heat and ionization. The primary energy loss mechanism for nuclear recoil is heat, leading to consistently smaller measurable scintillation signals compared to electron recoils of the same energy. The nuclear recoil quenching factor (QF), representing the ratio of scintillation light yield produced by nuclear recoil to that of electron recoil at the same energy, is a critical parameter for understanding dark matter and neutrino interactions with nuclei. The low energy QF of NaI(Tl) crystals, commonly employed in dark matter searches and CE$ν$NS measurements, is of substantial importance. Previous low energy QF measurements were constrained by contamination from photomultiplier tube (PMT)-induced noise, resulting in an observed light yield of approximately 15 photoelectrons per keVee (kilo-electron-volt electron-equivalent energy) and nuclear recoil energy above 5 keVnr (kilo-electron-volt nuclear recoil energy). Through enhanced crystal encapsulation, an increased light yield of around 26 photoelectrons per keVee is achieved. This improvement enables the measurement of the nuclear recoil QF for sodium nuclei at an energy of 3.8 $\pm$ 0.6 keVnr with a QF of 11.2 $\pm$ 1.7%. Furthermore, a reevaluation of previously reported QF results is conducted, incorporating enhancements in low energy events based on waveform simulation. The outcomes are generally consistent with various recent QF measurements for sodium and iodine.
Submitted 28 February, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
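The quenching factor defined in the abstract above links nuclear recoil energy to the visible, electron-equivalent signal. The sketch below applies the quoted values (3.8 keVnr, QF of 11.2%, light yields of about 15 and 26 photoelectrons per keVee) to estimate how many photoelectrons such a recoil would produce. It assumes a linear light response so that the light-yield ratio can be applied directly as an energy ratio; that simplification is introduced here and is not taken from the paper.

```python
def visible_energy_keVee(recoil_energy_keVnr, quenching_factor):
    """Electron-equivalent energy of a nuclear recoil (linear response assumed)."""
    return recoil_energy_keVnr * quenching_factor


def photoelectrons(recoil_energy_keVnr, quenching_factor, light_yield_pe_per_keVee):
    """Mean number of photoelectrons expected from a nuclear recoil."""
    return visible_energy_keVee(recoil_energy_keVnr, quenching_factor) * light_yield_pe_per_keVee


# Central values from the abstract: sodium recoils at 3.8 keVnr with QF = 11.2%.
E_NR = 3.8
QF_NA = 0.112

e_vis = visible_energy_keVee(E_NR, QF_NA)
pe_old = photoelectrons(E_NR, QF_NA, 15.0)  # earlier encapsulation
pe_new = photoelectrons(E_NR, QF_NA, 26.0)  # enhanced encapsulation

print(f"visible energy: {e_vis:.2f} keVee")
print(f"photoelectrons at 15 PE/keVee: {pe_old:.1f}")
print(f"photoelectrons at 26 PE/keVee: {pe_new:.1f}")
```

With these central values the recoil deposits well under 1 keVee of visible energy, so the higher light yield roughly doubles an already small photoelectron count, consistent with the abstract's point that the improved encapsulation enables the lower-energy measurement.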
-
Assessing the Performance of the ADAPT and AFT Flux Transport Models Using In-Situ Measurements From Multiple Satellites
Authors:
Kalman J. Knizhnik,
Micah J. Weberg,
Elena Provornikova,
Harry P. Warren,
Mark G. Linton,
Shaheda Begum Shaik,
Yuan-Kuen Ko,
Samuel J. Schonfeld,
Ignacio Ugarte-Urra,
Lisa A. Upton
Abstract:
The launches of Parker Solar Probe (Parker) and Solar Orbiter (SolO) are enabling a new era of solar wind studies that track the solar wind from its origin at the photosphere, through the corona, to multiple vantage points in the inner heliosphere. A key ingredient for these models is the input photospheric magnetic field map that provides the boundary condition for the coronal portion of many heliospheric models. In this paper, we perform steady-state, data-driven magnetohydrodynamic (MHD) simulations of the solar wind during Carrington rotation 2258 with the GAMERA model. We use the ADAPT and AFT flux transport models and quantitatively assess how well each model matches in-situ measurements from Parker, SolO, and Earth. We find that both models reproduce the magnetic field components at Parker quantitatively well. At SolO and Earth, the magnetic field is reproduced relatively well, though not as well as at Parker, and the density is reproduced extremely poorly. The velocity is overpredicted at Parker, but not at SolO or Earth, hinting that the Wang-Sheeley-Arge (WSA) relation, fine-tuned for Earth, misses the deceleration of the solar wind near the Sun. We conclude that AFT performs quantitatively similarly to ADAPT in all cases and that both models are comparable to a purely WSA heliospheric treatment with no MHD component. Finally, we trace field lines from SolO back to an active region outflow that was observed by Hinode/EIS, and which shows evidence of elevated charge state ratios.
Submitted 15 February, 2024;
originally announced February 2024.
-
Graph Neural Networks as Fast and High-fidelity Emulators for Finite-Element Ice Sheet Modeling
Authors:
Maryam Rahnemoonfar,
Younghyun Koo
Abstract:
Although the finite element approach of the Ice-sheet and Sea-level System Model (ISSM) solves ice dynamics problems governed by Stokes equations quickly and accurately, such numerical modeling requires intensive computation on central processing units (CPUs). In this study, we develop graph neural networks (GNNs) as fast surrogate models that preserve the finite element structure of ISSM. Using the 20-year transient simulations in the Pine Island Glacier (PIG), we train and test three GNNs: graph convolutional network (GCN), graph attention network (GAT), and equivariant graph convolutional network (EGCN). These GNNs reproduce ice thickness and velocity with better accuracy than the classic convolutional neural network (CNN) and multi-layer perceptron (MLP). In particular, GNNs successfully capture the ice mass loss and acceleration induced by higher basal melting rates in the PIG. When our GNN emulators are implemented on graphics processing units (GPUs), they run up to 50 times faster than the CPU-based ISSM simulation.
Submitted 7 February, 2024;
originally announced February 2024.
-
Flash: A Hybrid Private Inference Protocol for Deep CNNs with High Accuracy and Low Latency on CPU
Authors:
Hyeri Roh,
Jinsu Yeo,
Yeongil Ko,
Gu-Yeon Wei,
David Brooks,
Woo-Seok Choi
Abstract:
This paper presents Flash, an optimized private inference (PI) hybrid protocol utilizing both homomorphic encryption (HE) and secure two-party computation (2PC), which can reduce the end-to-end PI latency for deep CNN models to less than 1 minute on CPU. To this end, first, Flash proposes a low-latency convolution algorithm built upon a fast slot rotation operation and a novel data encoding scheme, which results in 4-94x performance gain over the state-of-the-art. Second, to minimize the communication cost introduced by the standard nonlinear activation function ReLU, Flash replaces all ReLUs with the polynomial $x^2+x$ and trains deep CNN models with the new activation function. The trained models improve the inference accuracy for CIFAR-10/100 and TinyImageNet by 16% on average (up to 40% for ResNet-32) compared to prior art. Last, Flash proposes an efficient 2PC-based $x^2+x$ evaluation protocol that does not require any offline communication and that reduces the total communication cost to process the activation layer by 84-196x over the state-of-the-art. As a result, the end-to-end PI latency of Flash implemented on CPU is 0.02 minute for CIFAR-100 and 0.57 minute for TinyImageNet classification, while the total data communication is 0.07GB for CIFAR-100 and 0.22GB for TinyImageNet. Flash improves the state-of-the-art PI by 16-45x in latency and 84-196x in communication cost. Moreover, even for ImageNet, Flash can deliver a latency of less than 1 minute on CPU with a total communication of less than 1GB.
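As a plaintext illustration only, the polynomial activation $x^2+x$ named in the abstract can be compared with ReLU as sketched below. This is not the Flash protocol: it involves no homomorphic encryption or 2PC, and the function names `relu` and `poly_act` are hypothetical.

```python
def relu(x: float) -> float:
    """Standard ReLU: needs a comparison, which is costly under HE/2PC."""
    return x if x > 0.0 else 0.0


def poly_act(x: float) -> float:
    """Polynomial activation x^2 + x, computed as x * (x + 1).

    Written this way it uses one multiplication and one addition
    and no comparison. Plaintext sketch only (assumption: the
    encrypted evaluation in the paper is organised differently).
    """
    return x * (x + 1.0)


if __name__ == "__main__":
    for value in (-2.0, -1.0, 0.0, 0.5, 2.0):
        print(value, relu(value), poly_act(value))
```

Unlike ReLU, the polynomial is not zero for all negative inputs, which is consistent with the abstract's statement that the models are trained with the new activation rather than having it swapped in after training.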
Submitted 29 January, 2024;
originally announced January 2024.
-
Instruction Fine-Tuning: Does Prompt Loss Matter?
Authors:
Mathew Huerta-Enochian,
Seung Yong Ko
Abstract:
We present a novel study analyzing the effects of various prompt loss token weights (PLW) for supervised instruction fine-tuning (SIFT). While prompt-masking (PLW = 0) is common for SIFT, some fine-tuning APIs support fractional PLWs and suggest that using a small non-zero PLW can help stabilize learning when fine-tuning on short-completion data. However, there has never been a study confirming this claim, and OpenAI, a major cloud-based SIFT provider, recently removed this parameter from their fine-tuning API. We found that performance of models fine-tuned on short-completion data had a statistically-significant negative quadratic relationship with PLW. Using small values (0.01 - 0.5) of PLW produced better results on multiple-choice and short-generation benchmarks (outperforming models fine-tuned on long-completion data) while large values (~ 1.0) of PLW produced better results on long-generation benchmarks. We explained this effect and verified its importance through additional experiments. This research serves as a warning to API providers about the importance of providing a PLW parameter for SIFT.
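A minimal sketch of how a prompt loss weight could enter a fine-tuning loss, assuming per-token negative log-likelihoods are already available. The function name `weighted_sift_loss` and the choice to normalize by the sum of weights are assumptions made for illustration; the paper and individual fine-tuning APIs may normalize differently.

```python
from typing import Sequence


def weighted_sift_loss(
    token_nll: Sequence[float],
    is_prompt: Sequence[bool],
    plw: float,
) -> float:
    """Weighted mean of per-token losses.

    Prompt tokens get weight `plw`, completion tokens get weight 1.
    plw = 0 reproduces prompt masking; plw = 1 treats all tokens alike.
    Normalizing by the total weight is an assumption of this sketch.
    """
    if len(token_nll) != len(is_prompt):
        raise ValueError("token_nll and is_prompt must have equal length")
    weights = [plw if prompt else 1.0 for prompt in is_prompt]
    total_weight = sum(weights)
    if total_weight == 0.0:
        return 0.0
    return sum(w * nll for w, nll in zip(weights, token_nll)) / total_weight


if __name__ == "__main__":
    nll = [2.0, 2.0, 1.0, 1.0]            # two prompt tokens, two completion tokens
    mask = [True, True, False, False]
    for plw in (0.0, 0.1, 1.0):
        print(plw, weighted_sift_loss(nll, mask, plw))
```

With short completions, the completion tokens are few relative to the prompt, so even a small non-zero `plw` changes how much of the gradient signal comes from prompt tokens.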
Submitted 18 June, 2024; v1 submitted 24 January, 2024;
originally announced January 2024.
-
νOscillation: a software package for computation and simulation of neutrino propagation and interaction
Authors:
Seonghyeok Jang,
Eunju Jeon,
Eunil Won,
Young Ju Ko,
Kyungmin Lee
Abstract:
The behavior of neutrinos is the only phenomenon that cannot be explained by the standard model of particle physics. Because of these mysterious neutrino interactions observed in nature, there is growing interest in this field, and ongoing and planned neutrino experiments are actively seeking solutions to this mystery. The design of neutrino experiments and the analysis of neutrino data rely on precise computations of neutrino oscillations and scattering processes in general. Motivated by this, we developed a software package that calculates neutrino production and oscillation in nuclear reactors, neutrino-electron scattering of solar neutrinos, and the oscillation of neutrinos from radioactive isotopes for sterile neutrino searches. This software package is validated by reproducing the results of calculations and observations in other publications. We also demonstrate the feasibility of this package by calculating the sensitivity of a liquid scintillator detector, currently in planning, to sterile neutrinos. This work is expected to be used in the design of future neutrino experiments.
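For orientation, the kind of quantity such a package computes can be illustrated with the standard two-flavor vacuum survival probability, $P = 1 - \sin^2 2\theta \, \sin^2(1.267\, \Delta m^2 L / E)$, with $\Delta m^2$ in eV$^2$, $L$ in metres, and $E$ in MeV. This is a textbook approximation, not the νOscillation package's API; the function name and parameter names below are hypothetical.

```python
import math


def survival_probability(
    sin2_2theta: float,
    dm2_ev2: float,
    length_m: float,
    energy_mev: float,
) -> float:
    """Two-flavor vacuum survival probability.

    sin2_2theta : sin^2(2*theta), the mixing amplitude
    dm2_ev2     : mass-squared splitting in eV^2
    length_m    : baseline in metres
    energy_mev  : neutrino energy in MeV (must be non-zero)
    """
    phase = 1.267 * dm2_ev2 * length_m / energy_mev
    return 1.0 - sin2_2theta * math.sin(phase) ** 2


if __name__ == "__main__":
    # Illustrative values only (not taken from the paper).
    for baseline in (0.0, 1.0, 2.0, 5.0):
        print(baseline, survival_probability(0.1, 1.0, baseline, 3.0))
```

A full treatment, as the abstract indicates, also needs the source spectrum, detector response, and more than two flavors; the sketch covers only the oscillation factor.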
Submitted 18 June, 2024; v1 submitted 23 January, 2024;
originally announced January 2024.
-
Background study of the AMoRE-pilot experiment
Authors:
A. Agrawal,
V. V. Alenkov,
P. Aryal,
J. Beyer,
B. Bhandari,
R. S. Boiko,
K. Boonin,
O. Buzanov,
C. R. Byeon,
N. Chanthima,
M. K. Cheoun,
J. S. Choe,
Seonho Choi,
S. Choudhury,
J. S. Chung,
F. A. Danevich,
M. Djamal,
D. Drung,
C. Enss,
A. Fleischmann,
A. M. Gangapshev,
L. Gastaldo,
Yu. M. Gavrilyuk,
A. M. Gezhaev,
O. Gileva
, et al. (83 additional authors not shown)
Abstract:
We report a study on the background of the Advanced Molybdenum-Based Rare process Experiment (AMoRE), a search for neutrinoless double beta decay ($0νββ$) of $^{100}$Mo. The pilot stage of the experiment was conducted using $\sim$1.9 kg of $^{48\mathrm{depl}}$Ca$^{100}$MoO$_4$ crystals at the Yangyang Underground Laboratory, South Korea, from 2015 to 2018. We compared the measured $β/γ$ energy spectra in three experimental configurations with the results of Monte Carlo simulations and identified the background sources in each configuration. We replaced several detector components and enhanced the neutron shielding to lower the background level between configurations. A limit on the half-life of $0νββ$ decay of $^{100}$Mo was set at $T_{1/2}^{0ν} \ge 3.0\times 10^{23}$ years at 90\% confidence level, based on the measured background and its modeling. Further reduction of the background rate in AMoRE-I and AMoRE-II is discussed.
Submitted 7 April, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
Nonproportionality of NaI(Tl) Scintillation Detector for Dark Matter Search Experiments
Authors:
S. M. Lee,
G. Adhikari,
N. Carlin,
J. Y. Cho,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. França,
C. Ha,
I. S. Hahn,
S. J. Hollick,
E. J. Jeon,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
S. W. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim
, et al. (37 additional authors not shown)
Abstract:
We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced by decays supported by both long and short-lived isotopes. Analyzing peaks from decays supported only by short-lived isotopes presented a unique challenge due to their limited statistics and overlapping energies, which was overcome by long-term data collection and a time-dependent analysis. A key achievement is the direct measurement of the 0.87 keV light yield, resulting from the cascade following electron capture decay of $^{22}$Na from internal contamination. This measurement, previously accessible only indirectly, deepens our understanding of NaI(Tl) scintillator behavior in the region of interest for dark matter searches. This study holds substantial implications for background modeling and the interpretation of dark matter signals in NaI(Tl) experiments.
Submitted 10 May, 2024; v1 submitted 14 January, 2024;
originally announced January 2024.
-
BEC: Bit-Level Static Analysis for Reliability against Soft Errors
Authors:
Yousun Ko,
Bernd Burgstaller
Abstract:
Soft errors are a type of transient digital signal corruption that occurs in digital hardware components such as the internal flip-flops of CPU pipelines, the register file, memory cells, and even internal communication buses. Soft errors are caused by environmental radioactivity, magnetic interference, lasers, and temperature fluctuations, either unintentionally, or as part of a deliberate attempt to compromise a system and expose confidential data.
We propose a bit-level error coalescing (BEC) static program analysis and its two use cases to understand and improve program reliability against soft errors. The BEC analysis tracks each bit corruption in the register file and classifies the effect of the corruption by its semantics at compile time. The usefulness of the proposed analysis is demonstrated in two scenarios, fault injection campaign pruning, and reliability-aware program transformation. Experimental results show that bit-level analysis pruned up to 30.04 % of exhaustive fault injection campaigns (13.71 % on average), without loss of accuracy. Program vulnerability was reduced by up to 13.11 % (4.94 % on average) through bit-level vulnerability-aware instruction scheduling. The analysis has been implemented within LLVM and evaluated on the RISC-V architecture.
To the best of our knowledge, the proposed BEC analysis is the first bit-level compiler analysis for program reliability against soft errors. The proposed method is generic and not limited to a specific computer architecture.
Submitted 11 January, 2024;
originally announced January 2024.
-
Solar Energetic Particle Charge States and Abundances with Nonthermal Electrons
Authors:
Jin-Yi Lee,
Stephen Kahler,
John C. Raymond,
Yuan-Kuen Ko
Abstract:
An important aspect of solar energetic particle (SEP) events is their source populations. Elemental abundance enhancements of impulsive SEP events, originating in presumed coronal reconnection episodes, can be fitted to steep power laws of A/Q, where A and Q are the atomic mass and ionic charge. Since thermal electron energies are enhanced and nonthermal electron distributions arise in the reconnection process, we might expect that ionic charge states Q would be increased through ionization interactions with those electron populations during the acceleration process. The temperature estimated from the SEPs corresponds to the charge state during the acceleration process, while the actual charge state measured in situ may be modified as the SEPs pass through the corona. We examine whether the temperature estimated from A/Q differs for various kappa values in a kappa function representing a high-energy tail that deviates from a Maxwellian velocity distribution. We find that the differences in A/Q between a Maxwellian and an extreme kappa distribution are about 10-30%. We fit the power-law enhancement of element abundances as a function of their A/Q with various kappa values. We then find that the derived source region temperature is not significantly affected by whether or not the electron velocity distribution deviates from a Maxwellian, i.e., thermal, distribution. Assuming that electrons are heated in the acceleration region, the agreement of the SEP charge state during acceleration with typical active region temperatures suggests that SEPs are accelerated and leave the acceleration region in a shorter time than the ionization time scale.
Submitted 10 March, 2024; v1 submitted 3 January, 2024;
originally announced January 2024.
-
Gemini: A Family of Highly Capable Multimodal Models
Authors:
Gemini Team,
Rohan Anil,
Sebastian Borgeaud,
Jean-Baptiste Alayrac,
Jiahui Yu,
Radu Soricut,
Johan Schalkwyk,
Andrew M. Dai,
Anja Hauth,
Katie Millican,
David Silver,
Melvin Johnson,
Ioannis Antonoglou,
Julian Schrittwieser,
Amelia Glaese,
Jilin Chen,
Emily Pitler,
Timothy Lillicrap,
Angeliki Lazaridou,
Orhan Firat,
James Molloy,
Michael Isard,
Paul R. Barham,
Tom Hennigan,
Benjamin Lee
, et al. (1325 additional authors not shown)
Abstract:
This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI.
Submitted 17 June, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
Scintillation characteristics of an undoped CsI crystal at low-temperature for dark matter search
Authors:
W. K. Kim,
H. Y. Lee,
K. W. Kim,
Y. J. Ko,
J. A. Jeon,
H. J. Kim,
H. S. Lee
Abstract:
The scintillation characteristics of an undoped CsI crystal with dimensions of 5.8 mm $\times$ 5.9 mm $\times$ 7.0 mm, corresponding to a weight of 1.0 g, were studied by directly coupling two silicon photomultipliers (SiPMs) over a temperature range from room temperature (300 K) to a low temperature of 86 K. The scintillation decay time and light output were measured using x-ray (23 keV) and gamma-ray (88 keV) peaks from a $^{109}$Cd radioactive source. The decay time increased from 76 ns to 605 ns as the temperature decreased from room temperature to 86 K. Correspondingly, the light output increased as well, reaching 37.9 $\pm$ 1.5 photoelectrons per keV electron-equivalent at 86 K, which is approximately 18 times higher than the light yield at room temperature. Given the significantly enhanced scintillation light output of the undoped CsI crystal at low temperature, coupling it with SiPMs makes it a promising candidate for a future dark matter search detector, benefiting from the low threshold owing to the high light output. The odd proton numbers of both cesium and iodine provide an advantage for the WIMP-proton spin-dependent interaction. We evaluated the sensitivity to low-mass dark matter via the WIMP-proton spin-dependent interaction with the Migdal process, assuming 200 kg of undoped CsI crystals for the dark matter search. We conclude that undoped CsI crystal detectors exhibit world-competitive sensitivities for low-mass dark matter detection, particularly for the WIMP-proton spin-dependent interaction.
Submitted 13 December, 2023;
originally announced December 2023.
-
Explore, Select, Derive, and Recall: Augmenting LLM with Human-like Memory for Mobile Task Automation
Authors:
Sunjae Lee,
Junyoung Choi,
Jungjae Lee,
Munim Hasan Wasi,
Hojun Choi,
Steven Y. Ko,
Sangeun Oh,
Insik Shin
Abstract:
The advent of large language models (LLMs) has opened up new opportunities in the field of mobile task automation. Their superior language understanding and reasoning capabilities allow users to automate complex and repetitive tasks. However, due to the inherent unreliability and high operational cost of LLMs, their practical applicability is quite limited. To address these issues, this paper introduces MobileGPT, an innovative LLM-based mobile task automator equipped with a human-like app memory. MobileGPT emulates the cognitive process of humans interacting with a mobile app -- explore, select, derive, and recall. This approach allows for more precise and efficient learning of a task's procedure by breaking it down into smaller, modular sub-tasks that can be re-used, re-arranged, and adapted for various objectives. We implement MobileGPT using online LLM services (GPT-3.5 and GPT-4) and evaluate its performance on a dataset of 160 user instructions across 8 widely used mobile apps. The results indicate that MobileGPT can automate and learn new tasks with 82.5% accuracy, and is able to adapt them to different contexts with near perfect (98.75%) accuracy while reducing latency and cost by 62.5% and 68.8%, respectively, compared to the GPT-4 powered baseline.
Submitted 16 March, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
-
GLAD: Global-Local View Alignment and Background Debiasing for Unsupervised Video Domain Adaptation with Large Domain Gap
Authors:
Hyogun Lee,
Kyungho Bae,
Seong Jong Ha,
Yumin Ko,
Gyeong-Moon Park,
Jinwoo Choi
Abstract:
In this work, we tackle the challenging problem of unsupervised video domain adaptation (UVDA) for action recognition. We specifically focus on scenarios with a substantial domain gap, in contrast to existing works that primarily deal with small domain gaps between labeled source domains and unlabeled target domains. To establish a more realistic setting, we introduce a novel UVDA scenario, denoted as Kinetics->BABEL, with a more considerable domain gap in terms of both temporal dynamics and background shifts. To tackle the temporal shift, i.e., the difference in action duration between the source and target domains, we propose a global-local view alignment approach. To mitigate the background shift, we propose to learn temporal-order-sensitive representations through temporal order learning and background-invariant representations through background augmentation. We empirically validate that the proposed method shows significant improvement over the existing methods on the Kinetics->BABEL dataset with a large domain gap. The code is available at https://github.com/KHUVLL/GLAD.
Submitted 22 November, 2023; v1 submitted 21 November, 2023;
originally announced November 2023.
-
An evolutionary continuum from nucleated dwarf galaxies to star clusters
Authors:
Kaixiang Wang,
Eric W. Peng,
Chengze Liu,
J. Christopher Mihos,
Patrick Côté,
Laura Ferrarese,
Matthew A. Taylor,
John P. Blakeslee,
Jean-Charles Cuillandre,
Pierre-Alain Duc,
Puragra Guhathakurta,
Stephen Gwyn,
Youkyung Ko,
Ariane Lançon,
Sungsoon Lim,
Lauren A. MacArthur,
Thomas Puzia,
Joel Roediger,
Laura V. Sales,
Rubén Sánchez-Janssen,
Chelsea Spengler,
Elisa Toloba,
Hongxin Zhang,
Mingcheng Zhu
Abstract:
Systematic studies have revealed hundreds of ultra-compact dwarf galaxies (UCDs) in the nearby Universe. With half-light radii $r_h$ of approximately 10-100 parsecs and stellar masses $M_*$ $\approx$ $10^6-10^8$ solar masses, UCDs are among the densest known stellar systems. Although similar in appearance to massive globular clusters, the detection of extended stellar envelopes, complex star formation histories, elevated mass-to-light ratio, and supermassive black holes suggest that some UCDs are remnant nuclear star clusters of tidally-stripped dwarf galaxies, or even ancient compact galaxies. However, only a few objects have been found in the transient stage of tidal stripping, and this assumed evolutionary path has never been fully traced by observations. Here we show that 106 galaxies in the Virgo cluster have morphologies that are intermediate between normal, nucleated dwarf galaxies and single-component UCDs, revealing a continuum that fully maps this morphological transition, and fills the `size gap' between star clusters and galaxies. Their spatial distribution and redder color are also consistent with stripped satellite galaxies on their first few pericentric passages around massive galaxies. The `ultra-diffuse' tidal features around several of these galaxies directly show how UCDs are forming through tidal stripping, and that this evolutionary path can include an early phase as a nucleated ultra-diffuse galaxy (UDG). These UCDs represent substantial visible fossil remnants of ancient dwarf galaxies in galaxy clusters, and more low-mass remnants probably remain to be found.
Submitted 9 November, 2023;
originally announced November 2023.
-
Alpha backgrounds in NaI(Tl) crystals of COSINE-100
Authors:
G. Adhikari,
N. Carlin,
D. F. F. S. Cavalcante,
J. Y. Cho,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. Franca,
C. Ha,
I. S. Hahn,
S. J. Hollick,
E. J. Jeon,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
S. W. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim
, et al. (38 additional authors not shown)
Abstract:
COSINE-100 is a dark matter direct detection experiment with 106 kg NaI(Tl) as the target material. 210Pb and daughter isotopes are a dominant background in the WIMP region of interest and are detected via beta decay and alpha decay. Analysis of the alpha channel complements the background model as observed in the beta/gamma channel. We present the measurement of the quenching factors and Monte Carlo simulation results and activity quantification of the alpha decay components of the COSINE-100 NaI(Tl) crystals. The data strongly indicate that the alpha decays probabilistically undergo two possible quenching factors but require further investigation. The fitted results are consistent with independent measurements and improve the overall understanding of the COSINE-100 backgrounds. Furthermore, the half-life of 216Po has been measured to be 143.4 +/- 1.2 ms, which is consistent with and more precise than recent measurements.
Submitted 30 January, 2024; v1 submitted 8 November, 2023;
originally announced November 2023.
-
Multi-task Deep Convolutional Network to Predict Sea Ice Concentration and Drift in the Arctic Ocean
Authors:
Younghyun Koo,
Maryam Rahnemoonfar
Abstract:
Forecasting sea ice concentration (SIC) and sea ice drift (SID) in the Arctic Ocean is of great significance as the Arctic environment has been changed by the recent warming climate. Given that physical sea ice models require high computational costs with complex parameterization, deep learning techniques can effectively replace the physical model and improve the performance of sea ice prediction. This study proposes a novel multi-task fully convolutional network architecture named hierarchical information-sharing U-net (HIS-Unet) to predict daily SIC and SID. Instead of learning SIC and SID separately at each branch, we allow the SIC and SID layers to share their information and assist each other's prediction through the weighting attention modules (WAMs). Consequently, our HIS-Unet outperforms other statistical approaches, sea ice physical models, and neural networks without such information-sharing units. The improvement of HIS-Unet is clear for both SIC and SID prediction when and where sea ice conditions change seasonally, which implies that the information sharing through WAMs allows the model to learn the sudden changes of SIC and SID. The weight values of the WAMs imply that SIC information plays a more critical role in SID prediction than SID information does in SIC prediction, and that information sharing is more active at sea ice edges (seasonal sea ice) than in the central Arctic (multi-year sea ice).
Submitted 31 October, 2023;
originally announced November 2023.
-
CROWN: A Novel Approach to Comprehending Users' Preferences for Accurate Personalized News Recommendation
Authors:
Yunyong Ko,
Seongeun Ryu,
Sang-Wook Kim
Abstract:
Personalized news recommendation aims to assist users in finding news articles that align with their interests, which plays a pivotal role in mitigating users' information overload problem. Although many recent works have studied better personalized news recommendation, the following challenges deserve further exploration: (C1) Comprehending manifold intents coupled within a news article, (C2) Differentiating varying post-read preferences of news articles, and (C3) Addressing the cold-start user problem. To tackle these challenges together, in this paper, we propose a novel personalized news recommendation framework (CROWN) that employs (1) category-guided intent disentanglement for (C1), (2) consistency-based news representation for (C2), and (3) GNN-enhanced hybrid user representation for (C3). Furthermore, we incorporate category prediction into the training process of CROWN as an auxiliary task, which provides supplementary supervisory signals to enhance intent disentanglement. Extensive experiments on two real-world datasets reveal that (1) CROWN provides consistent performance improvements over ten state-of-the-art news recommendation methods and (2) the proposed strategies significantly improve the accuracy of CROWN.
Submitted 13 February, 2024; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Inspiral and Plunging Orbits in Kerr-Newman Spacetimes
Authors:
Yu-Chung Ko,
Da-Shin Lee,
Chi-Yong Lin
Abstract:
We present the analytical solutions for the trajectories of particles that spiral and plunge inward toward the event horizon along timelike geodesics following general non-equatorial paths within Kerr-Newman spacetimes. Our studies encompass both bound and unbound motions. The solutions can be written in terms of the elliptic integrals and the Jacobian elliptic functions of manifestly real functions of the Mino time. They reduce, respectively, to the solutions for Kerr, Reissner-Nordström, and Schwarzschild black holes in certain limits of the spin and charge of the black holes, and can be compared with the known ones restricted to equatorial motion. These explicit solutions may have some implications for the gravitational wave emission from extreme mass-ratio inspirals.
Submitted 26 April, 2024; v1 submitted 24 September, 2023;
originally announced September 2023.
-
Enhancing Hyperedge Prediction with Context-Aware Self-Supervised Learning
Authors:
Yunyong Ko,
Hanghang Tong,
Sang-Wook Kim
Abstract:
Hypergraphs can naturally model group-wise relations (e.g., a group of users who co-purchase an item) as hyperedges. Hyperedge prediction is to predict future or unobserved hyperedges, which is a fundamental task in many real-world applications (e.g., group recommendation). Despite the recent breakthrough of hyperedge prediction methods, the following challenges have been rarely studied: (C1) How to aggregate the nodes in each hyperedge candidate for accurate hyperedge prediction? and (C2) How to mitigate the inherent data sparsity problem in hyperedge prediction? To tackle both challenges together, in this paper, we propose a novel hyperedge prediction framework (CASH) that employs (1) context-aware node aggregation to precisely capture complex relations among nodes in each hyperedge for (C1) and (2) self-supervised contrastive learning in the context of hyperedge prediction to enhance hypergraph representations for (C2). Furthermore, as for (C2), we propose a hyperedge-aware augmentation method to fully exploit the latent semantics behind the original hypergraph and consider both node-level and group-level contrasts (i.e., dual contrasts) for better node and hyperedge representations. Extensive experiments on six real-world hypergraphs reveal that CASH consistently outperforms all competing methods in terms of the accuracy in hyperedge prediction and each of the proposed strategies is effective in improving the model accuracy of CASH. For the detailed information of CASH, we provide the code and datasets at: https://github.com/yy-ko/cash.
Submitted 11 September, 2023;
originally announced September 2023.
-
SAGE: A Storage-Based Approach for Scalable and Efficient Sparse Generalized Matrix-Matrix Multiplication
Authors:
Myung-Hwan Jang,
Yunyong Ko,
Hyuck-Moo Gwon,
Ikhyeon Jo,
Yongjun Park,
Sang-Wook Kim
Abstract:
Sparse generalized matrix-matrix multiplication (SpGEMM) is a fundamental operation for real-world network analysis. With the increasing size of real-world networks, the single-machine-based SpGEMM approach cannot perform SpGEMM on large-scale networks, exceeding the size of main memory (i.e., not scalable). Although the distributed-system-based approach could handle large-scale SpGEMM based on multiple machines, it suffers from severe inter-machine communication overhead to aggregate results of multiple machines (i.e., not efficient). To address this dilemma, in this paper, we propose a novel storage-based SpGEMM approach (SAGE) that stores given networks in storage (e.g., SSD) and loads only the necessary parts of the networks into main memory when they are required for processing via a 3-layer architecture. Furthermore, we point out three challenges that could degrade the overall performance of SAGE and propose three effective strategies to address them: (1) block-based workload allocation for balancing workloads across threads, (2) in-memory partial aggregation for reducing the amount of unnecessarily generated storage-memory I/Os, and (3) distribution-aware memory allocation for preventing unexpected buffer overflows in main memory. Via extensive evaluation, we verify the superiority of SAGE over existing SpGEMM methods in terms of scalability and efficiency.
Submitted 25 August, 2023;
originally announced August 2023.
-
The Number of Overlapping Customers in Erlang-A Queues: An Asymptotic Approach
Authors:
Young Myoung Ko,
Jamol Pender,
Jin Xu
Abstract:
In this paper, we investigate the number of customers that overlap or coincide with a virtual customer in an Erlang-A queue. Our study provides a novel approach that exploits fluid and diffusion limits for the queue to approximate the mean and variance of the number of overlapping customers. We conduct a detailed analysis of the fluid and diffusion limit differential equations to derive these approximations. We also construct new accurate approximations for the mean and variance of the waiting time in the Erlang-A queue by combining fluid limits with the polygamma function. Our findings have important implications for queueing theory and evaluating the overlap risk of more complicated service systems.
Submitted 8 August, 2023;
originally announced August 2023.
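As a rough illustration of what a fluid limit for an Erlang-A queue looks like, the sketch below integrates the standard textbook fluid equation dq/dt = lam - mu*min(q, c) - theta*max(q - c, 0) with a forward Euler scheme. The abstract does not state its equations, so this form, the function name `erlang_a_fluid`, and all parameter names are assumptions for illustration, not the authors' formulation.

```python
def erlang_a_fluid(lam, mu, theta, servers, q0=0.0, horizon=50.0, dt=0.01):
    """Forward-Euler integration of the textbook Erlang-A fluid equation.

    lam: arrival rate, mu: per-server service rate,
    theta: per-customer abandonment rate, servers: number of servers.
    All names are hypothetical; the paper's own model may differ.
    Returns the fluid queue content q(horizon).
    """
    q = q0
    steps = int(round(horizon / dt))
    for _ in range(steps):
        in_service = min(q, servers)      # fluid currently being served
        waiting = max(q - servers, 0.0)   # fluid waiting and able to abandon
        q += dt * (lam - mu * in_service - theta * waiting)
    return q


if __name__ == "__main__":
    # Overloaded case: lam exceeds the service capacity mu * servers.
    print(erlang_a_fluid(lam=10.0, mu=1.0, theta=0.5, servers=8))
```

In the overloaded case the fluid settles where abandonment absorbs the excess arrivals, at servers + (lam - mu*servers)/theta; in the underloaded case it settles at lam/mu.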
-
The Evolution of Ion Charge States in Coronal Mass Ejections
Authors:
J. Martin Laming,
Elena Provornikova,
Yuan-Kuen Ko
Abstract:
We model the observed charge states of the elements C, O, Mg, Si, and Fe in the coronal mass ejections (CMEs) ejecta. We concentrate on "halo" CMEs observed in situ by ACE/SWICS to measure ion charge states, and also remotely by STEREO when in near quadrature with Earth, so that the CME expansion can be accurately specified. Within this observed expansion, we integrate equations for the CME ejecta ionization balance, including electron heating parameterized as a fraction of the kinetic and gravitational energy gain of the CME. We also include the effects of non-Maxwellian electron distributions, characterized as a kappa function. Focusing first on the 2010 April 3 CME, we find a somewhat better match to observed charge states with kappa in the range 2-4, close to the theoretical minimum value of kappa = 3/2, implying a hard spectrum of non-thermal electrons. Similar, but more significant results come from the 2011 February 15 event, although it is quite different in terms of its evolution. We discuss the implications of these values, and of the heating required, in terms of the magnetic reconnection Lundquist number and anomalous resistivity associated with CME evolution close to the Sun.
Submitted 28 July, 2023;
originally announced July 2023.
-
The Critical Coronal Transition Region: A Physics-framed Strategy to Uncover the Genesis of the Solar Wind and Solar Eruptions
Authors:
Angelos Vourlidas,
Amir Caspi,
Yuan-Kuen Ko,
J. Martin Laming,
James P. Mason,
Mari Paz Miralles,
Nour-Eddine Raouafi,
John C. Raymond,
Daniel B. Seaton,
Leonard Strachan,
Nicholeen Viall,
Juliana Vievering,
Matthew J. West
Abstract:
Our current theoretical and observational understanding suggests that critical properties of the solar wind and Coronal Mass Ejections (CMEs) are imparted within 10 Rs, particularly below 4 Rs. This seemingly narrow spatial region encompasses the transition of coronal plasma processes through the entire range of physical regimes from fluid to kinetic, and from primarily closed to open magnetic field structures. From a physics perspective, therefore, it is more appropriate to refer to this region as the Critical Coronal Transition Region (CCTR) to emphasize its physical, rather than spatial, importance to key Heliophysics science.
This white paper argues that the comprehensive exploration of the CCTR will answer two of the most central Heliophysics questions, "How and where does the solar wind form?" and "How do eruptions form?", by unifying hardware/software/modeling development and seemingly disparate research communities and frameworks. We describe the outlines of a decadal-scale plan to achieve that by 2050.
Submitted 25 July, 2023;
originally announced July 2023.
-
Search for inelastic WIMP-iodine scattering with COSINE-100
Authors:
G. Adhikari,
N. Carlin,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. Franca,
C. Ha,
I. S. Hahn,
S. J. Hollick,
E. J. Jeon,
J. H. Jo,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim,
Y. J. Ko,
D. H. Lee
, et al. (34 additional authors not shown)
Abstract:
We report the results of a search for inelastic scattering of weakly interacting massive particles (WIMPs) off $^{127}$I nuclei using NaI(Tl) crystals with a data exposure of 97.7 kg$\cdot$years from the COSINE-100 experiment. The signature of inelastic WIMP-$^{127}$I scattering is a nuclear recoil accompanied by a 57.6 keV $γ$-ray from the prompt deexcitation, producing a more energetic signal compared to the typical WIMP nuclear recoil signal. We found no evidence for this inelastic scattering signature and set a 90 $\%$ confidence level upper limit on the WIMP-proton spin-dependent, inelastic scattering cross section of $1.2 \times 10^{-37} {\rm cm^{2}}$ at the WIMP mass 500 ${\rm GeV/c^{2}}$.
Submitted 30 October, 2023; v1 submitted 19 July, 2023;
originally announced July 2023.
-
Production of antihydrogen atoms by 6 keV antiprotons through a positronium cloud
Authors:
P. Adrich,
P. Blumer,
G. Caratsch,
M. Chung,
P. Cladé,
P. Comini,
P. Crivelli,
O. Dalkarov,
P. Debu,
A. Douillet,
D. Drapier,
P. Froelich,
N. Garroum,
S. Guellati-Khelifa,
J. Guyomard,
P-A. Hervieux,
L. Hilico,
P. Indelicato,
S. Jonsell,
J-P. Karr,
B. Kim,
S. Kim,
E-S. Kim,
Y. J. Ko,
T. Kosinski
, et al. (39 additional authors not shown)
Abstract:
We report on the first production of an antihydrogen beam by charge exchange of 6.1 keV antiprotons with a cloud of positronium in the GBAR experiment at CERN. The antiproton beam was delivered by the AD/ELENA facility. The positronium target was produced from a positron beam itself obtained from an electron linear accelerator. We observe an excess over background indicating antihydrogen production with a significance of 3-4 standard deviations.
Submitted 3 July, 2023; v1 submitted 27 June, 2023;
originally announced June 2023.
-
Distributionally Robust Stratified Sampling for Stochastic Simulations with Multiple Uncertain Input Models
Authors:
Seung Min Baik,
Eunshin Byon,
Young Myoung Ko
Abstract:
This paper presents a robust version of the stratified sampling method when multiple uncertain input models are considered for stochastic simulation. Various variance reduction techniques have demonstrated their superior performance in accelerating simulation processes. Nevertheless, they often use a single input model and further assume that the input model is exactly known and fixed. We consider more general cases in which it is necessary to assess a simulation's response to a variety of input models, such as when evaluating the reliability of wind turbines under nonstationary wind conditions or the operation of a service system when the distribution of customer inter-arrival time is heterogeneous at different times. Moreover, the estimation variance may be considerably impacted by uncertainty in input models. To address such nonstationary and uncertain input models, we offer a distributionally robust (DR) stratified sampling approach with the goal of minimizing the maximum of worst-case estimator variances among plausible but uncertain input models. Specifically, we devise a bi-level optimization framework for formulating DR stochastic problems with different ambiguity set designs, based on the $L_2$-norm, 1-Wasserstein distance, parametric family of distributions, and distribution moments. In order to cope with the non-convexity of objective function, we present a solution approach that uses Bayesian optimization. Numerical experiments and the wind turbine case study demonstrate the robustness of the proposed approach.
Submitted 15 June, 2023;
originally announced June 2023.
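For readers unfamiliar with the baseline technique, the sketch below shows plain stratified sampling with equal-width strata and equal allocation for a single, fully known input model (uniform on [0, 1)). It is not the distributionally robust method of the abstract; the function name `stratified_mean` and the choice of strata are assumptions made only to illustrate the estimator that the robust version builds on.

```python
import random


def stratified_mean(f, num_strata, samples_per_stratum, rng):
    """Estimate E[f(U)] for U ~ Uniform[0, 1) by stratified sampling.

    The unit interval is split into equal-width strata (an illustrative
    choice); each stratum mean is weighted by the stratum probability.
    """
    width = 1.0 / num_strata
    estimate = 0.0
    for k in range(num_strata):
        left = k * width
        # Draw uniformly inside stratum k only.
        draws = [f(left + width * rng.random()) for _ in range(samples_per_stratum)]
        estimate += width * (sum(draws) / samples_per_stratum)
    return estimate


if __name__ == "__main__":
    rng = random.Random(0)
    # E[U^2] = 1/3 for U uniform on [0, 1).
    print(stratified_mean(lambda x: x * x, 10, 100, rng))
```

The robust variant described in the abstract would additionally choose strata and allocations against the worst case over a set of plausible input distributions, which this sketch does not attempt.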
-
Tagged End-to-End Simultaneous Speech Translation Training using Simultaneous Interpretation Data
Authors:
Yuka Ko,
Ryo Fukuda,
Yuta Nishikawa,
Yasumasa Kano,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
Simultaneous speech translation (SimulST) translates partial speech inputs incrementally. Although the monotonic correspondence between input and output is preferable for smaller latency, it is not the case for distant language pairs such as English and Japanese. A prospective approach to this problem is to mimic simultaneous interpretation (SI) using SI data to train a SimulST model. However, the size of such SI data is limited, so the SI data should be used together with ordinary bilingual data whose translations are produced offline. In this paper, we propose an effective way to train a SimulST model using a mix of SI and offline data. The proposed method trains a single model using the mixed data with style tags that tell the model to generate SI- or offline-style outputs. Experiment results show improvements of BLEURT in different latency ranges, and our analyses revealed the proposed model generates SI-style outputs more than the baseline.
Submitted 14 June, 2023;
originally announced June 2023.
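The style-tag idea in the abstract can be pictured with a small data-preparation sketch. The abstract does not say where the tag is placed or how it is spelled, so the tag strings `<si>` and `<off>`, the choice to prepend the tag to the target text, and the helper names below are all assumptions for illustration; the text strings stand in for real speech inputs.

```python
# Hypothetical tag tokens; the paper's actual tag vocabulary is not given.
SI_TAG = "<si>"
OFFLINE_TAG = "<off>"


def tag_example(source, target, style):
    """Return (source, tagged_target) with a style tag prepended to the target."""
    tag = {"si": SI_TAG, "offline": OFFLINE_TAG}[style]
    return source, f"{tag} {target}"


def build_mixed_corpus(si_pairs, offline_pairs):
    """Merge SI and offline pairs into one tagged training set for a single model."""
    mixed = [tag_example(s, t, "si") for s, t in si_pairs]
    mixed += [tag_example(s, t, "offline") for s, t in offline_pairs]
    return mixed


if __name__ == "__main__":
    corpus = build_mixed_corpus(
        si_pairs=[("src utterance A", "si-style translation A")],
        offline_pairs=[("src utterance B", "offline translation B")],
    )
    for source, target in corpus:
        print(source, "->", target)
```

Under this assumed layout, the desired output style would be selected at inference time by forcing the corresponding tag as the first target token.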
-
Search for Boosted Dark Matter in COSINE-100
Authors:
G. Adhikari,
N. Carlin,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. Franca,
C. Ha,
I. S. Hahn,
S. J. Hollick,
E. J. Jeon,
J. H. Jo,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim,
Y. J. Ko,
D. H. Lee
, et al. (34 additional authors not shown)
Abstract:
We search for energetic electron recoil signals induced by boosted dark matter (BDM) from the galactic center using the COSINE-100 array of NaI(Tl) crystal detectors at the Yangyang Underground Laboratory. The signal would be an excess of events with energies above 4 MeV over the well-understood background. Because no excess of events is observed in a 97.7 kg$\cdot$years exposure, we set limits on BDM interactions under a variety of hypotheses. Notably, we explored the dark photon parameter space, leading to competitive limits compared to direct dark photon search experiments, particularly for dark photon masses below 4\,MeV and considering the invisible decay mode. Furthermore, by comparing our results with a previous BDM search conducted by the Super-Kamiokande experiment, we found that the COSINE-100 detector has advantages in searching for low-mass dark matter. This analysis demonstrates the potential of the COSINE-100 detector to search for MeV electron recoil signals produced by the dark sector particle interactions.
Submitted 30 October, 2023; v1 submitted 31 May, 2023;
originally announced June 2023.
-
Reliability Improvement of Circular k-out-of-n: G Balanced Systems through Center of Gravity
Authors:
Yongkyu Cho,
Seung Min Baik,
Young Myoung Ko
Abstract:
This paper considers a circular k-out-of-n: G balance system equipped with homogeneous and stationary units. Building on previous research by Endharta et al. (Reliability Engineering & System Safety, 2018), we propose a new balance definition in circular k-out-of-n: G balance systems based on the concept of center of gravity. According to this condition, a circular k-out-of-n: G balance system is considered balanced if its center of gravity is located at the origin. This new balance condition is not only simple but also advantageous as it covers the previous two balance conditions of symmetry and proportionality. To evaluate the system's reliability, we consider the minimum tie-sets, and extensive numerical studies verify the enhancement of system reliability resulting from the proposed balance definition.
Submitted 10 May, 2023;
originally announced May 2023.
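The center-of-gravity balance condition in the abstract can be checked numerically with a short sketch. It assumes the n units sit at equally spaced angles on a unit circle and carry equal weight, and it treats an empty working set as unbalanced; these modeling choices and the function names are assumptions for illustration, not taken from the paper.

```python
import math


def center_of_gravity(n, working):
    """Centroid of the working units among n equally spaced unit-circle positions."""
    working = list(working)
    xs = [math.cos(2 * math.pi * i / n) for i in working]
    ys = [math.sin(2 * math.pi * i / n) for i in working]
    return sum(xs) / len(working), sum(ys) / len(working)


def is_balanced(n, working, tol=1e-9):
    """True if the centroid of the working units lies at the origin (within tol)."""
    working = list(working)
    if not working:
        return False  # assumption: a system with no working units is not balanced
    cx, cy = center_of_gravity(n, working)
    return math.hypot(cx, cy) < tol


if __name__ == "__main__":
    print(is_balanced(6, [0, 3]))     # opposite pair
    print(is_balanced(6, [0, 2, 4]))  # every other unit
    print(is_balanced(6, [0, 1, 2]))  # three adjacent units
```

An opposite pair and an evenly spread triple both put the centroid at the origin, while three adjacent units do not, which matches the intuition that the condition subsumes symmetric configurations.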
-
NAIST-SIC-Aligned: an Aligned English-Japanese Simultaneous Interpretation Corpus
Authors:
Jinming Zhao,
Yuka Ko,
Kosuke Doi,
Ryo Fukuda,
Katsuhito Sudoh,
Satoshi Nakamura
Abstract:
It remains an open question how simultaneous interpretation (SI) data affects simultaneous machine translation (SiMT). Research has been limited due to the lack of a large-scale training corpus. In this work, we aim to fill in the gap by introducing NAIST-SIC-Aligned, which is an automatically-aligned parallel English-Japanese SI dataset. Starting with a non-aligned corpus NAIST-SIC, we propose a two-stage alignment approach to make the corpus parallel and thus suitable for model training. The first stage is coarse alignment where we perform a many-to-many mapping between source and target sentences, and the second stage is fine-grained alignment where we perform intra- and inter-sentence filtering to improve the quality of aligned pairs. To ensure the quality of the corpus, each step has been validated either quantitatively or qualitatively. This is the first open-sourced large-scale parallel SI dataset in the literature. We also manually curated a small test set for evaluation purposes. Our results show that models trained with SI data lead to significant improvement in translation quality and latency over baselines. We hope our work advances research on SI corpora construction and SiMT. Our data can be found at https://github.com/mingzi151/AHC-SI.
Submitted 31 March, 2024; v1 submitted 23 April, 2023;
originally announced April 2023.
-
Search for bosonic super-weakly interacting massive particles at COSINE-100
Authors:
G. Adhikari,
N. Carlin,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. Franca,
C. Ha,
I. S. Hahn,
S. J. Hollick,
E. J. Jeon,
J. H. Jo,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim,
Y. J. Ko,
D. H. Lee
, et al. (34 additional authors not shown)
Abstract:
We present results of a search for bosonic super-weakly interacting massive particles (BSW) as keV scale dark matter candidates that is based on an exposure of 97.7 kg$\cdot$year from the COSINE experiment. In this search, we employ, for the first time, Compton-like as well as absorption processes for pseudoscalar and vector BSWs. No evidence for BSWs is found in the mass range from 10 $\mathrm{keV/c}^2$ to 1 $\mathrm{MeV/c}^2$, and we present the exclusion limits on the dimensionless coupling constants to electrons $g_{ae}$ for pseudoscalar and $κ$ for vector BSWs at 90% confidence level. Our results show that these limits are improved by including the Compton-like process in masses of BSW, above $\mathcal{O}(100\,\mathrm{keV/c}^2)$.
Submitted 27 August, 2023; v1 submitted 3 April, 2023;
originally announced April 2023.
-
Toward Polar Sea-Ice Classification using Color-based Segmentation and Auto-labeling of Sentinel-2 Imagery to Train an Efficient Deep Learning Model
Authors:
Jurdana Masuma Iqrah,
Younghyun Koo,
Wei Wang,
Hongjie Xie,
Sushil Prasad
Abstract:
Global warming is an urgent issue that is generating catastrophic environmental changes, such as the melting of sea ice and glaciers, particularly in the polar regions. The melting pattern and retreat of polar sea ice cover is an essential indicator of global warming. The Sentinel-2 satellite (S2) captures high-resolution optical imagery over the polar regions. This research aims at developing a robust and effective system for classifying polar sea ice as thick or snow-covered, young or thin, or open water using S2 images. A key challenge is the lack of labeled S2 training data to serve as the ground truth. We demonstrate a method with high precision to segment and automatically label the S2 images based on suitably determined color thresholds and employ these auto-labeled data to train a U-Net machine model (a fully convolutional neural network), yielding good classification accuracy. Evaluation results over S2 data from the polar summer season in the Ross Sea region of the Antarctic show that the U-Net model trained on auto-labeled data has an accuracy of 90.18% over the original S2 images, whereas the U-Net model trained on manually labeled data has an accuracy of 91.39%. Filtering out the thin clouds and shadows from the S2 images further improves U-Net's accuracy, respectively, to 98.97% for auto-labeled and 98.40% for manually labeled training datasets.
Submitted 8 March, 2023;
originally announced March 2023.
-
KHAN: Knowledge-Aware Hierarchical Attention Networks for Accurate Political Stance Prediction
Authors:
Yunyong Ko,
Seongeun Ryu,
Soeun Han,
Youngseung Jeon,
Jaehoon Kim,
Sohyun Park,
Kyungsik Han,
Hanghang Tong,
Sang-Wook Kim
Abstract:
The political stance prediction for news articles has been widely studied to mitigate the echo chamber effect -- people fall into their thoughts and reinforce their pre-existing beliefs. The previous works for the political stance problem focus on (1) identifying political factors that could reflect the political stance of a news article and (2) capturing those factors effectively. Despite their empirical successes, they are not sufficiently justified in terms of how effective their identified factors are in the political stance prediction. Motivated by this, in this work, we conduct a user study to investigate important factors in political stance prediction, and observe that the context and tone of a news article (implicit) and external knowledge for real-world entities appearing in the article (explicit) are important in determining its political stance. Based on this observation, we propose a novel knowledge-aware approach to political stance prediction (KHAN), employing (1) hierarchical attention networks (HAN) to learn the relationships among words and sentences in three different levels and (2) knowledge encoding (KE) to incorporate external knowledge for real-world entities into the process of political stance prediction. Also, to take into account the subtle and important difference between opposite political stances, we build two independent political knowledge graphs (KG) (i.e., KG-lib and KG-con) by ourselves and learn to fuse the different political knowledge. Through extensive evaluations on three real-world datasets, we demonstrate the superiority of KHAN in terms of (1) accuracy, (2) efficiency, and (3) effectiveness.
Submitted 4 April, 2023; v1 submitted 23 February, 2023;
originally announced February 2023.
-
Search for solar bosonic dark matter annual modulation with COSINE-100
Authors:
G. Adhikari,
N. Carlin,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. França,
C. Ha,
I. S. Hahn,
S. J. Hollick,
E. J. Jeon,
J. H. Jo,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim,
Y. J. Ko,
D. H. Lee
, et al. (34 additional authors not shown)
Abstract:
We present results from a search for solar bosonic dark matter using the annual modulation method with the COSINE-100 experiment. The results were interpreted considering three dark sector boson models: solar dark photon; DFSZ and KSVZ solar axion; and Kaluza-Klein solar axion. No modulation signal compatible with that expected from the models was found in a data set of 2.82 yr, using 61.3 kg of NaI(Tl) crystals. Therefore, we set 90$\%$ confidence level upper limits for each of the three models studied. For the solar dark photon model, the most stringent mixing parameter upper limit is $1.61 \times 10^{-14}$ for dark photons with a mass of 215 eV. For the DFSZ and KSVZ solar axion, and the Kaluza-Klein axion models, the upper limits exclude axion-electron couplings, $g_{ae}$, above $1.61 \times 10^{-11}$ for axion mass below 0.2 keV; and axion-photon couplings, $g_{aγγ}$, above $1.83 \times 10^{-11}$ GeV$^{-1}$ for an axion number density of $4.07 \times 10^{13}$ cm$^{-3}$. This is the first experimental search for solar dark photons and DFSZ and KSVZ solar axions using the annual modulation method. The lower background, higher light yield and reduced threshold of NaI(Tl) crystals of the future COSINE-200 experiment are expected to enhance the sensitivity of the analysis shown in this paper. We show the sensitivities for the three models studied, considering the same search method with COSINE-200.
Submitted 20 February, 2023;
originally announced February 2023.