Search | arXiv e-print repository

arXiv:2406.19502 [pdf, other]

Investigating How Large Language Models Leverage Internal Knowledge to Perform Complex Reasoning

Authors: Miyoung Ko, Sue Hyun Park, Joonsuk Park, Minjoon Seo

Abstract: Despite significant advancements, there is a limited understanding of how large language models (LLMs) utilize knowledge for reasoning. To address this, we propose a method that deconstructs complex real-world questions into a graph, representing each question as a node with parent nodes of background knowledge needed to solve the question. We develop the DepthQA dataset, deconstructing questions… ▽ More Despite significant advancements, there is a limited understanding of how large language models (LLMs) utilize knowledge for reasoning. To address this, we propose a method that deconstructs complex real-world questions into a graph, representing each question as a node with parent nodes of background knowledge needed to solve the question. We develop the DepthQA dataset, deconstructing questions into three depths: (i) recalling conceptual knowledge, (ii) applying procedural knowledge, and (iii) analyzing strategic knowledge. Based on a hierarchical graph, we quantify forward discrepancy, discrepancies in LLMs' performance on simpler sub-problems versus complex questions. We also measure backward discrepancy, where LLMs answer complex questions but struggle with simpler ones. Our analysis shows that smaller models have more discrepancies than larger models. Additionally, guiding models from simpler to complex questions through multi-turn interactions improves performance across model sizes, highlighting the importance of structured intermediate steps in knowledge reasoning. This work enhances our understanding of LLM reasoning and suggests ways to improve their problem-solving abilities. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: Work in progress; code is available at https://github.com/kaistAI/knowledge-reasoning

arXiv:2406.07899 [pdf, other]

Josephson Parametric Amplifier based Quantum Noise Limited Amplifier Development for Axion Search Experiments in CAPP

Authors: Sergey V. Uchaikin, **myeong Kim, Caglar Kutlu, Boris I. Ivanov, **su Kim, Arjan F. van Loo, Yasunobu Nakamura, Saebyeok Ahn, Seonjeong Oh, Minsu Ko, Yannis K. Semertzidis

Abstract: This paper provides a comprehensive overview of the development of flux-driven Josephson Parametric Amplifiers (JPAs) as Quantum Noise Limited Amplifier for axion search experiments conducted at the Center for Axion and Precision Physics Research (CAPP) of the Institute for Basic Science. It focuses on the characterization, and optimization of JPAs, which are crucial for achieving the highest sens… ▽ More This paper provides a comprehensive overview of the development of flux-driven Josephson Parametric Amplifiers (JPAs) as Quantum Noise Limited Amplifier for axion search experiments conducted at the Center for Axion and Precision Physics Research (CAPP) of the Institute for Basic Science. It focuses on the characterization, and optimization of JPAs, which are crucial for achieving the highest sensitivity in axion particle detection. We discuss various characterization techniques, methods for improving bandwidth, and the attainment of ultra-low noise temperatures. JPAs have emerged as indispensable tools in CAPPs axion search endeavors, playing a significant role in advancing our understanding of fundamental physics and unraveling the mysteries of the universe. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 29 pages, 15 figures

arXiv:2406.05761 [pdf, other]

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Gui** Son, Ye** Cho, Sheikh Shafayat, **heon Baek, Sue Hyun Park, Hyeonbin Hwang, **kyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang , et al. (7 additional authors not shown)

Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec… ▽ More As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on specific capabilities such as instruction following, leading to coverage bias. To overcome these limitations, we introduce the BiGGen Bench, a principled generation benchmark designed to thoroughly evaluate nine distinct capabilities of LMs across 77 diverse tasks. A key feature of the BiGGen Bench is its use of instance-specific evaluation criteria, closely mirroring the nuanced discernment of human evaluation. We apply this benchmark to assess 103 frontier LMs using five evaluator LMs. Our code, data, and evaluation results are all publicly available at https://github.com/prometheus-eval/prometheus-eval/tree/main/BiGGen-Bench. △ Less

Submitted 9 June, 2024; originally announced June 2024.

Comments: Work in Progress

arXiv:2405.12015 [pdf, other]

Global Polarization of (Anti-)Hypertriton in Heavy-Ion Collisions

Authors: Kai-Jia Sun, Dai-Neng Liu, Yun-Peng Zhen, **-Hui Chen, Che Ming Ko, Yu-Gang Ma

Abstract: Particles of non-zero spin produced in non-central heavy-ion collisions are expected to be polarized along the direction perpendicular to the reaction plane because of their spin-orbit interactions in the produced matter, and this has indeed been observed for many hyperons and vector mesons. Here, we show that the hypertriton ($^3_Λ\text{H}$), which is the lightest hypernucleus, is also polarized… ▽ More Particles of non-zero spin produced in non-central heavy-ion collisions are expected to be polarized along the direction perpendicular to the reaction plane because of their spin-orbit interactions in the produced matter, and this has indeed been observed for many hyperons and vector mesons. Here, we show that the hypertriton ($^3_Λ\text{H}$), which is the lightest hypernucleus, is also polarized in these collisions. Using the coalescence model based on the kinetic freezeout baryons for light (hyper-)nuclei production, we find that the angular distribution of the decay product of polarized $^3_Λ\text{H}$ is highly sensitive to the spin configuration of its wavefunction, providing a novel way to determine its spin structure. We also predict the beam energy dependence of $^3_Λ\text{H}$ and ${^3_{\barΛ}}\overline{\rm H}$ polarizations in heavy-ion collisions from a few GeV to several TeV based on the measured $Λ$ and $\barΛ$ polarizations. We further discuss the possibility of studying the spin correlations among nucleons and $Λ$ hyperons in the produced hadronic matter from the measured $^3_Λ\text{H}$ polarization in non-central heavy-ion collisions. △ Less

Submitted 20 May, 2024; originally announced May 2024.

Comments: 6 pages, 2 figures

arXiv:2405.01974 [pdf, other]

Multitask Extension of Geometrically Aligned Transfer Encoder

Authors: Sung Moon Ko, Sumin Lee, Dae-Woong Jeong, Hyunseung Kim, Chanhui Lee, Soorin Yim, Sehui Han

Abstract: Molecular datasets often suffer from a lack of data. It is well-known that gathering data is difficult due to the complexity of experimentation or simulation involved. Here, we leverage mutual information across different tasks in molecular data to address this issue. We extend an algorithm that utilizes the geometric characteristics of the encoding space, known as the Geometrically Aligned Transf… ▽ More Molecular datasets often suffer from a lack of data. It is well-known that gathering data is difficult due to the complexity of experimentation or simulation involved. Here, we leverage mutual information across different tasks in molecular data to address this issue. We extend an algorithm that utilizes the geometric characteristics of the encoding space, known as the Geometrically Aligned Transfer Encoder (GATE), to a multi-task setup. Thus, we connect multiple molecular tasks by aligning the curved coordinates onto locally flat coordinates, ensuring the flow of information from source tasks to support performance on target data. △ Less

Submitted 3 May, 2024; originally announced May 2024.

Comments: 7 pages, 3 figures, 2 tables

arXiv:2404.15890 [pdf, other]

Hadronic effects on $Λ$ polarization in relativistic heavy ion collisions

Authors: Haesom Sung, Che Ming Ko, Su Houng Lee

Abstract: The $Λ$ hyperon spin flip and non-flip cross sections are calculated in a simple hadronic model by including both the $s$-channel process involving the spin 3/2, positive parity $Σ^*(1358)$ resonance and the $t$-channel process via the exchange of a scalar $σ$ meson. Although the $t$-channel process gives a thermally averaged cross section that is about a factor of 1.3 larger than that from the… ▽ More The $Λ$ hyperon spin flip and non-flip cross sections are calculated in a simple hadronic model by including both the $s$-channel process involving the spin 3/2, positive parity $Σ^*(1358)$ resonance and the $t$-channel process via the exchange of a scalar $σ$ meson. Although the $t$-channel process gives a thermally averaged cross section that is about a factor of 1.3 larger than that from the $s$-channel process, the $Λ$ spin flip to non-flip cross sections is negligibly small in the $t$-channel compared to the constant value of 1/3.5 in the $s$-channel process. With these cross sections included in a schematic kinetic model, the effects of hadronic scatterings on the $Λ$ spin polarization in Au-Au collisions at $\sqrt{s}_{NN}=7.7$ GeV are studied. It is found that the $Λ$ spin polarization only decreases by about 7\% during the hadronic stage of these collisions, which thus justifies the assumption in theoretical studies that compare the $Λ$ polarization calculated at the chemical freeze out to the measured one at the kinetic freeze out. △ Less

Submitted 24 April, 2024; originally announced April 2024.

Comments: 7 pages, 5 figures

arXiv:2404.13286 [pdf, other]

Track Role Prediction of Single-Instrumental Sequences

Authors: Changheon Han, Suhyun Lee, Minsam Ko

Abstract: In the composition process, selecting appropriate single-instrumental music sequences and assigning their track-role is an indispensable task. However, manually determining the track-role for a myriad of music samples can be time-consuming and labor-intensive. This study introduces a deep learning model designed to automatically predict the track-role of single-instrumental music sequences. Our ev… ▽ More In the composition process, selecting appropriate single-instrumental music sequences and assigning their track-role is an indispensable task. However, manually determining the track-role for a myriad of music samples can be time-consuming and labor-intensive. This study introduces a deep learning model designed to automatically predict the track-role of single-instrumental music sequences. Our evaluations show a prediction accuracy of 87% in the symbolic domain and 84% in the audio domain. The proposed track-role prediction methods hold promise for future applications in AI music generation and analysis. △ Less

Submitted 20 April, 2024; originally announced April 2024.

Comments: ISMIR LBD 2023

arXiv:2404.10966 [pdf, other]

Domain-Specific Block Selection and Paired-View Pseudo-Labeling for Online Test-Time Adaptation

Authors: Yeonguk Yu, Sungho Shin, Seunghyeok Back, Minhwan Ko, Sangjun Noh, Kyoobin Lee

Abstract: Test-time adaptation (TTA) aims to adapt a pre-trained model to a new test domain without access to source data after deployment. Existing approaches typically rely on self-training with pseudo-labels since ground-truth cannot be obtained from test data. Although the quality of pseudo labels is important for stable and accurate long-term adaptation, it has not been previously addressed. In this wo… ▽ More Test-time adaptation (TTA) aims to adapt a pre-trained model to a new test domain without access to source data after deployment. Existing approaches typically rely on self-training with pseudo-labels since ground-truth cannot be obtained from test data. Although the quality of pseudo labels is important for stable and accurate long-term adaptation, it has not been previously addressed. In this work, we propose DPLOT, a simple yet effective TTA framework that consists of two components: (1) domain-specific block selection and (2) pseudo-label generation using paired-view images. Specifically, we select blocks that involve domain-specific feature extraction and train these blocks by entropy minimization. After blocks are adjusted for current test domain, we generate pseudo-labels by averaging given test images and corresponding flipped counterparts. By simply using flip augmentation, we prevent a decrease in the quality of the pseudo-labels, which can be caused by the domain gap resulting from strong augmentation. Our experimental results demonstrate that DPLOT outperforms previous TTA methods in CIFAR10-C, CIFAR100-C, and ImageNet-C benchmarks, reducing error by up to 5.4%, 9.1%, and 2.9%, respectively. Also, we provide an extensive analysis to demonstrate effectiveness of our framework. Code is available at https://github.com/gist-ailab/domain-specific-block-selection-and-paired-view-pseudo-labeling-for-online-TTA. △ Less

Submitted 7 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

Comments: Accepted at CVPR 2024

arXiv:2404.02701 [pdf, ps, other]

Quantum Mechanical Softening of the Hypertriton Transverse Momentum Spectrum in Heavy-Ion Collisions

Authors: Dai-Neng Liu, Che Ming Ko, Yu-Gang Ma, Francesco Mazzaschi, Maximiliano Puccio, Qi-Ye Shou, Kai-Jia Sun, Yuan-Zhe Wang

Abstract: Understanding the properties of hypernuclei helps to constrain the interaction between hyperon and nucleon, which is known to play an essential role in determining the properties of neutron stars. Experimental measurements have suggested that the hypertriton ($^3_Λ\text{H}$), the lightest hypernucleus, exhibits a halo structure with a deuteron core encircled by a $Λ$ hyperon at a distance of about… ▽ More Understanding the properties of hypernuclei helps to constrain the interaction between hyperon and nucleon, which is known to play an essential role in determining the properties of neutron stars. Experimental measurements have suggested that the hypertriton ($^3_Λ\text{H}$), the lightest hypernucleus, exhibits a halo structure with a deuteron core encircled by a $Λ$ hyperon at a distance of about 10 fm. This large $Λ-d$ distance in $^3_Λ\text{H}$ wave function is found to cause a suppressed $^3_Λ\text{H}$ yield and a softening of its transverse momentum ($p_T$) spectrum in relativistic heavy-ion collisions. Within the coalescence model based on nucleons and $Λ$ hyperons from a microscopic hybrid hydro model with a hadronic afterburner for nuclear cluster production in Pb-Pb collisions at $\sqrt{s_{NN}}$= 5.02 TeV, we show how this softening of the hypertriton $p_T$ spectrum appears and leads to a significantly smaller mean $p_T$ for $^3_Λ\text{H}$ than for helium-3 ($^3$He). The latter is opposite to the predictions from the blast-wave model which assumes that $^3_Λ\text{H}$ and $^3$He are thermally produced at the kinetic freeze-out of heavy ion collisions. The discovered quantum mechanical softening of the (anti-)hypertriton spectrum can be experimentally tested in relativistic heavy-ion collisions at different collision energies and centralities and used to obtain valuable insights to the mechanisms for light (hyper-)nuclei production in these collisions. △ Less

Submitted 7 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

Comments: 6 pages, 4 figures

arXiv:2403.13203 [pdf, other]

Quadratic Point Estimate Method for Probabilistic Moments Computation

Authors: Minhyeok Ko, Konstantinos G. Papakonstantinou

Abstract: This paper presents in detail the originally developed Quadratic Point Estimate Method (QPEM), aimed at efficiently and accurately computing the first four output moments of probabilistic distributions, using 2n^2+1 sample (or sigma) points, with n, the number of input random variables. The proposed QPEM particularly offers an effective, superior, and practical alternative to existing sampling and… ▽ More This paper presents in detail the originally developed Quadratic Point Estimate Method (QPEM), aimed at efficiently and accurately computing the first four output moments of probabilistic distributions, using 2n^2+1 sample (or sigma) points, with n, the number of input random variables. The proposed QPEM particularly offers an effective, superior, and practical alternative to existing sampling and quadrature methods for low- and moderately-high-dimensional problems. Detailed theoretical derivations are provided proving that the proposed method can achieve a fifth or higher-order accuracy for symmetric input distributions. Various numerical examples, from simple polynomial functions to nonlinear finite element analyses with random field representations, support the theoretical findings and further showcase the validity, efficiency, and applicability of the QPEM, from low- to high-dimensional problems. △ Less

Submitted 19 March, 2024; originally announced March 2024.

arXiv:2402.18923 [pdf, other]

Inappropriate Pause Detection In Dysarthric Speech Using Large-Scale Speech Recognition

Authors: Jeehyun Lee, Yerin Choi, Tae-** Song, Myoung-Wan Koo

Abstract: Dysarthria, a common issue among stroke patients, severely impacts speech intelligibility. Inappropriate pauses are crucial indicators in severity assessment and speech-language therapy. We propose to extend a large-scale speech recognition model for inappropriate pause detection in dysarthric speech. To this end, we propose task design, labeling strategy, and a speech recognition model with an in… ▽ More Dysarthria, a common issue among stroke patients, severely impacts speech intelligibility. Inappropriate pauses are crucial indicators in severity assessment and speech-language therapy. We propose to extend a large-scale speech recognition model for inappropriate pause detection in dysarthric speech. To this end, we propose task design, labeling strategy, and a speech recognition model with an inappropriate pause prediction layer. First, we treat pause detection as speech recognition, using an automatic speech recognition (ASR) model to convert speech into text with pause tags. According to the newly designed task, we label pause locations at the text level and their appropriateness. We collaborate with speech-language pathologists to establish labeling criteria, ensuring high-quality annotated data. Finally, we extend the ASR model with an inappropriate pause prediction layer for end-to-end inappropriate pause detection. Moreover, we propose a task-tailored metric for evaluating inappropriate pause detection independent of ASR performance. Our experiments show that the proposed method better detects inappropriate pauses in dysarthric speech than baselines. (Inappropriate Pause Error Rate: 14.47%) △ Less

Submitted 29 February, 2024; originally announced February 2024.

Comments: Accepted to ICASSP 2024

arXiv:2402.08922 [pdf, other]

The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes

Authors: Myeongseob Ko, Feiyang Kang, Weiyan Shi, Ming **, Zhou Yu, Ruoxi Jia

Abstract: Large-scale black-box models have become ubiquitous across numerous applications. Understanding the influence of individual training data sources on predictions made by these models is crucial for improving their trustworthiness. Current influence estimation techniques involve computing gradients for every training point or repeated training on different subsets. These approaches face obvious comp… ▽ More Large-scale black-box models have become ubiquitous across numerous applications. Understanding the influence of individual training data sources on predictions made by these models is crucial for improving their trustworthiness. Current influence estimation techniques involve computing gradients for every training point or repeated training on different subsets. These approaches face obvious computational challenges when scaled up to large datasets and models. In this paper, we introduce and explore the Mirrored Influence Hypothesis, highlighting a reciprocal nature of influence between training and test data. Specifically, it suggests that evaluating the influence of training data on test predictions can be reformulated as an equivalent, yet inverse problem: assessing how the predictions for training samples would be altered if the model were trained on specific test samples. Through both empirical and theoretical validations, we demonstrate the wide applicability of our hypothesis. Inspired by this, we introduce a new method for estimating the influence of training data, which requires calculating gradients for specific test samples, paired with a forward pass for each training point. This approach can capitalize on the common asymmetry in scenarios where the number of test samples under concurrent examination is much smaller than the scale of the training dataset, thus gaining a significant improvement in efficiency compared to existing approaches. We demonstrate the applicability of our method across a range of scenarios, including data attribution in diffusion models, data leakage detection, analysis of memorization, mislabeled data detection, and tracing behavior in language models. Our code will be made available at https://github.com/ruoxi-jia-group/Forward-INF. △ Less

Submitted 19 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

Journal ref: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024

arXiv:2401.14635 [pdf, other]

Signing in Four Public Software Package Registries: Quantity, Quality, and Influencing Factors

Authors: Taylor R Schorlemmer, Kelechi G Kalu, Luke Chigges, Kyung Myung Ko, Eman Abu Isghair, Saurabh Baghi, Santiago Torres-Arias, James C Davis

Abstract: Many software applications incorporate open-source third-party packages distributed by public package registries. Guaranteeing authorship along this supply chain is a challenge. Package maintainers can guarantee package authorship through software signing. However, it is unclear how common this practice is, and whether the resulting signatures are created properly. Prior work has provided raw data… ▽ More Many software applications incorporate open-source third-party packages distributed by public package registries. Guaranteeing authorship along this supply chain is a challenge. Package maintainers can guarantee package authorship through software signing. However, it is unclear how common this practice is, and whether the resulting signatures are created properly. Prior work has provided raw data on registry signing practices, but only measured single platforms, did not consider quality, did not consider time, and did not assess factors that may influence signing. We do not have up-to-date measurements of signing practices nor do we know the quality of existing signatures. Furthermore, we lack a comprehensive understanding of factors that influence signing adoption. This study addresses this gap. We provide measurements across three kinds of package registries: traditional software (Maven, PyPI), container images (DockerHub), and machine learning models (Hugging Face). For each registry, we describe the nature of the signed artifacts as well as the current quantity and quality of signatures. Then, we examine longitudinal trends in signing practices. Finally, we use a quasi-experiment to estimate the effect that various factors had on software signing practices. To summarize our findings: (1) mandating signature adoption improves the quantity of signatures; (2) providing dedicated tooling improves the quality of signing; (3) getting started is the hard part -- once a maintainer begins to sign, they tend to continue doing so; and (4) although many supply chain attacks are mitigable via signing, signing adoption is primarily affected by registry policy rather than by public knowledge of attacks, new engineering standards, etc. These findings highlight the importance of software package registry managers and signing infrastructure. △ Less

Submitted 14 April, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

Comments: Accepted at IEEE Security & Privacy 2024 (S&P'24)

arXiv:2312.02531 [pdf, other]

PolyFit: A Peg-in-hole Assembly Framework for Unseen Polygon Shapes via Sim-to-real Adaptation

Authors: Geonhyup Lee, Joosoon Lee, Sangjun Noh, Minhwan Ko, Kangmin Kim, Kyoobin Lee

Abstract: The study addresses the foundational and challenging task of peg-in-hole assembly in robotics, where misalignments caused by sensor inaccuracies and mechanical errors often result in insertion failures or jamming. This research introduces PolyFit, representing a paradigm shift by transitioning from a reinforcement learning approach to a supervised learning methodology. PolyFit is a Force/Torque (F… ▽ More The study addresses the foundational and challenging task of peg-in-hole assembly in robotics, where misalignments caused by sensor inaccuracies and mechanical errors often result in insertion failures or jamming. This research introduces PolyFit, representing a paradigm shift by transitioning from a reinforcement learning approach to a supervised learning methodology. PolyFit is a Force/Torque (F/T)-based supervised learning framework designed for 5-DoF peg-in-hole assembly. It utilizes F/T data for accurate extrinsic pose estimation and adjusts the peg pose to rectify misalignments. Extensive training in a simulated environment involves a dataset encompassing a diverse range of peg-hole shapes, extrinsic poses, and their corresponding contact F/T readings. To enhance extrinsic pose estimation, a multi-point contact strategy is integrated into the model input, recognizing that identical F/T readings can indicate different poses. The study proposes a sim-to-real adaptation method for real-world application, using a sim-real paired dataset to enable effective generalization to complex and unseen polygon shapes. PolyFit achieves impressive peg-in-hole success rates of 97.3% and 96.3% for seen and unseen shapes in simulations, respectively. Real-world evaluations further demonstrate substantial success rates of 86.7% and 85.0%, highlighting the robustness and adaptability of the proposed method. △ Less

Submitted 5 December, 2023; originally announced December 2023.

Comments: 8 pages, 8 figures, 3 tables

arXiv:2311.08329 [pdf, other]

KTRL+F: Knowledge-Augmented In-Document Search

Authors: Hanseok Oh, Haebin Shin, Miyoung Ko, Hyunji Lee, Minjoon Seo

Abstract: We introduce a new problem KTRL+F, a knowledge-augmented in-document search task that necessitates real-time identification of all semantic targets within a document with the awareness of external sources through a single natural query. KTRL+F addresses following unique challenges for in-document search: 1)utilizing knowledge outside the document for extended use of additional information about ta… ▽ More We introduce a new problem KTRL+F, a knowledge-augmented in-document search task that necessitates real-time identification of all semantic targets within a document with the awareness of external sources through a single natural query. KTRL+F addresses following unique challenges for in-document search: 1)utilizing knowledge outside the document for extended use of additional information about targets, and 2) balancing between real-time applicability with the performance. We analyze various baselines in KTRL+F and find limitations of existing models, such as hallucinations, high latency, or difficulties in leveraging external knowledge. Therefore, we propose a Knowledge-Augmented Phrase Retrieval model that shows a promising balance between speed and performance by simply augmenting external knowledge in phrase embedding. We also conduct a user study to verify whether solving KTRL+F can enhance search experience for users. It demonstrates that even with our simple model, users can reduce the time for searching with less queries and reduced extra visits to other sources for collecting evidence. We encourage the research community to work on KTRL+F to enhance more efficient in-document information access. △ Less

Submitted 18 April, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

arXiv:2310.11434 [pdf, other]

doi 10.1103/PhysRevC.109.044911

Is $K_{1}/K^{*}$ enhancement in heavy ion collisions a signature of chiral symmetry restoration?

Authors: Haesom Sung, Sungtae Cho, Che Ming Ko, Su Houng Lee, Sanghoon Lim

Abstract: We extend the recent study of $K_{1}/K^{*}$ enhancement as a signature of chiral symmetry restoration in heavy ion collisions at the Large Hadron Collider (LHC) via the kinetic approach to include the effects due to non-unity hadron fugacities during the evolution of produced hadronic matter and the temperature-dependent $K_1$ mass. Although the effect of non-unity fugacity only slightly reduces t… ▽ More We extend the recent study of $K_{1}/K^{*}$ enhancement as a signature of chiral symmetry restoration in heavy ion collisions at the Large Hadron Collider (LHC) via the kinetic approach to include the effects due to non-unity hadron fugacities during the evolution of produced hadronic matter and the temperature-dependent $K_1$ mass. Although the effect of non-unity fugacity only slightly reduces the $K_1/K^*$ enhancement due to chiral symmetry restoration, the inclusion of the temperature-dependent $K_1$ mass leads to a substantial reduction in the $K_1/K^*$ enhancement. However, the final $K_1/K^*$ ratio in peripheral collisions still shows a more than factor of two enhancement compared to the case without chiral symmetry restoration and thus remains a good signature for chiral symmetry restoration in the hot dense matter produced in relativistic heavy ion collisions. △ Less

Submitted 8 November, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

Comments: 5 pages with 5 figures

Journal ref: Phys.Rev.C 109 (2024) 4, 044911

arXiv:2310.06369 [pdf, other]

Geometrically Aligned Transfer Encoder for Inductive Transfer in Regression Tasks

Authors: Sung Moon Ko, Sumin Lee, Dae-Woong Jeong, Woohyung Lim, Sehui Han

Abstract: Transfer learning is a crucial technique for handling a small amount of data that is potentially related to other abundant data. However, most of the existing methods are focused on classification tasks using images and language datasets. Therefore, in order to expand the transfer learning scheme to regression tasks, we propose a novel transfer technique based on differential geometry, namely the… ▽ More Transfer learning is a crucial technique for handling a small amount of data that is potentially related to other abundant data. However, most of the existing methods are focused on classification tasks using images and language datasets. Therefore, in order to expand the transfer learning scheme to regression tasks, we propose a novel transfer technique based on differential geometry, namely the Geometrically Aligned Transfer Encoder (GATE). In this method, we interpret the latent vectors from the model to exist on a Riemannian curved manifold. We find a proper diffeomorphism between pairs of tasks to ensure that every arbitrary point maps to a locally flat coordinate in the overlap** region, allowing the transfer of knowledge from the source to the target data. This also serves as an effective regularizer for the model to behave in extrapolation regions. In this article, we demonstrate that GATE outperforms conventional methods and exhibits stable behavior in both the latent space and extrapolation regions for various molecular graph datasets. △ Less

Submitted 10 October, 2023; originally announced October 2023.

Comments: 12+11 pages, 6+1 figures, 0+7 tables

arXiv:2310.00108 [pdf, other]

Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study

Authors: Myeongseob Ko, Ming **, Chenguang Wang, Ruoxi Jia

Abstract: Membership inference attacks (MIAs) aim to infer whether a data point has been used to train a machine learning model. These attacks can be employed to identify potential privacy vulnerabilities and detect unauthorized use of personal data. While MIAs have been traditionally studied for simple classification models, recent advancements in multi-modal pre-training, such as CLIP, have demonstrated r… ▽ More Membership inference attacks (MIAs) aim to infer whether a data point has been used to train a machine learning model. These attacks can be employed to identify potential privacy vulnerabilities and detect unauthorized use of personal data. While MIAs have been traditionally studied for simple classification models, recent advancements in multi-modal pre-training, such as CLIP, have demonstrated remarkable zero-shot performance across a range of computer vision tasks. However, the sheer scale of data and models presents significant computational challenges for performing the attacks. This paper takes a first step towards develo** practical MIAs against large-scale multi-modal models. We introduce a simple baseline strategy by thresholding the cosine similarity between text and image features of a target point and propose further enhancing the baseline by aggregating cosine similarity across transformations of the target. We also present a new weakly supervised attack method that leverages ground-truth non-members (e.g., obtained by using the publication date of a target model and the timestamps of the open data) to further enhance the attack. Our evaluation shows that CLIP models are susceptible to our attack strategies, with our simple baseline achieving over $75\%$ membership identification accuracy. Furthermore, our enhanced attacks outperform the baseline across multiple models and datasets, with the weakly supervised attack demonstrating an average-case performance improvement of $17\%$ and being at least $7$X more effective at low false-positive rates. These findings highlight the importance of protecting the privacy of multi-modal foundational models, which were previously assumed to be less susceptible to MIAs due to less overfitting. Our code is available at https://github.com/ruoxi-jia-group/CLIP-MIA. △ Less

Submitted 29 September, 2023; originally announced October 2023.

Comments: International Conference on Computer Vision (ICCV) 2023

arXiv:2309.04062 [pdf, other]

3D Denoisers are Good 2D Teachers: Molecular Pretraining via Denoising and Cross-Modal Distillation

Authors: Sungjun Cho, Dae-Woong Jeong, Sung Moon Ko, **woo Kim, Sehui Han, Seunghoon Hong, Honglak Lee, Moontae Lee

Abstract: Pretraining molecular representations from large unlabeled data is essential for molecular property prediction due to the high cost of obtaining ground-truth labels. While there exist various 2D graph-based molecular pretraining approaches, these methods struggle to show statistically significant gains in predictive performance. Recent work have thus instead proposed 3D conformer-based pretraining… ▽ More Pretraining molecular representations from large unlabeled data is essential for molecular property prediction due to the high cost of obtaining ground-truth labels. While there exist various 2D graph-based molecular pretraining approaches, these methods struggle to show statistically significant gains in predictive performance. Recent work have thus instead proposed 3D conformer-based pretraining under the task of denoising, which led to promising results. During downstream finetuning, however, models trained with 3D conformers require accurate atom-coordinates of previously unseen molecules, which are computationally expensive to acquire at scale. In light of this limitation, we propose D&D, a self-supervised molecular representation learning framework that pretrains a 2D graph encoder by distilling representations from a 3D denoiser. With denoising followed by cross-modal knowledge distillation, our approach enjoys use of knowledge obtained from denoising as well as painless application to downstream tasks with no access to accurate conformers. Experiments on real-world molecular property prediction datasets show that the graph encoder trained via D&D can infer 3D information based on the 2D graph and shows superior performance and label-efficiency against other baselines. △ Less

Submitted 7 September, 2023; originally announced September 2023.

Comments: 16 pages, 5 figures

arXiv:2308.05347 [pdf, ps, other]

doi 10.1103/PhysRevC.109.044609

Comparing pion production in transport simulations of heavy-ion collisions at $270A$ MeV under controlled conditions

Authors: Jun Xu, Hermann Wolter, Maria Colonna, Mircea Dan Cozma, Pawel Danielewicz, Che Ming Ko, Akira Ono, ManYee Betty Tsang, Ying-Xun Zhang, Hui-Gan Cheng, Natsumi Ikeno, Rohit Kumar, Jun Su, Hua Zheng, Zhen Zhang, Lie-Wen Chen, Zhao-Qing Feng, Christoph Hartnack, Arnaud Le Fèvre, Bao-An Li, Yasushi Nara, Akira Ohnishi, Feng-Shou Zhang

Abstract: Within the TMEP, we present a detailed study of the performance of different transport models in Sn+Sn collisions at $270A$ MeV, and put particular emphasis on the production of pions and $Δ$ resonances, which have been used as probes of the nuclear symmetry energy. We prescribe a common and rather simple physics model, and follow in detail the results of 4 BUU models and 6 QMD models. The nucleon… ▽ More Within the TMEP, we present a detailed study of the performance of different transport models in Sn+Sn collisions at $270A$ MeV, and put particular emphasis on the production of pions and $Δ$ resonances, which have been used as probes of the nuclear symmetry energy. We prescribe a common and rather simple physics model, and follow in detail the results of 4 BUU models and 6 QMD models. The nucleonic evolution of the collision and the nucleonic observables in these codes do not completely converge, but the differences among the codes can be understood as being due to several reasons: the basic differences between BUU and QMD models in the representation of the phase-space distributions, computational differences in the mean-field evaluation, and differences in the adopted strategies for the Pauli blocking in the collision integrals. For pionic observables, we find that a higher maximum density leads to an enhanced pion yield and a reduced $π^-/π^+$ yield ratio, while a more effective Pauli blocking generally leads to a slightly suppressed pion yield and an enhanced $π^-/π^+$ yield ratio. We specifically investigate the effect of the Coulomb force, and find that it increases the total $π^-/π^+$ yield ratio but reduces the ratio at high pion energies, although differences in its implementations do not have a dominating role in the differences among the codes. Taking into account only the results of codes that strictly follow the homework specifications, we find a convergence of the codes in the final charged pion yield ratio to a $1σ$ deviation of about $5\%$. However, the uncertainty is expected to be reduced to about $1.6\%$ if the same or similar strategies and ingredients, i.e., an improved Pauli blocking and calculation of the non-linear term in the mean-field potential, are similarly used in all codes. △ Less

Submitted 14 March, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

Comments: 39 pages, 17 figures

Journal ref: Physical Review C 109, 044609 (2024)

arXiv:2308.04709 [pdf, other]

A Comparative Study of Open-Source Large Language Models, GPT-4 and Claude 2: Multiple-Choice Test Taking in Nephrology

Authors: Sean Wu, Michael Koo, Lesley Blum, Andy Black, Liyo Kao, Fabien Scalzo, Ira Kurtz

Abstract: In recent years, there have been significant breakthroughs in the field of natural language processing, particularly with the development of large language models (LLMs). These LLMs have showcased remarkable capabilities on various benchmarks. In the healthcare field, the exact role LLMs and other future AI models will play remains unclear. There is a potential for these models in the future to be… ▽ More In recent years, there have been significant breakthroughs in the field of natural language processing, particularly with the development of large language models (LLMs). These LLMs have showcased remarkable capabilities on various benchmarks. In the healthcare field, the exact role LLMs and other future AI models will play remains unclear. There is a potential for these models in the future to be used as part of adaptive physician training, medical co-pilot applications, and digital patient interaction scenarios. The ability of AI models to participate in medical training and patient care will depend in part on their mastery of the knowledge content of specific medical fields. This study investigated the medical knowledge capability of LLMs, specifically in the context of internal medicine subspecialty multiple-choice test-taking ability. We compared the performance of several open-source LLMs (Koala 7B, Falcon 7B, Stable-Vicuna 13B, and Orca Mini 13B), to GPT-4 and Claude 2 on multiple-choice questions in the field of Nephrology. Nephrology was chosen as an example of a particularly conceptually complex subspecialty field within internal medicine. The study was conducted to evaluate the ability of LLM models to provide correct answers to nephSAP (Nephrology Self-Assessment Program) multiple-choice questions. The overall success of open-sourced LLMs in answering the 858 nephSAP multiple-choice questions correctly was 17.1% - 25.5%. In contrast, Claude 2 answered 54.4% of the questions correctly, whereas GPT-4 achieved a score of 73.3%. We show that current widely used open-sourced LLMs do poorly in their ability for zero-shot reasoning when compared to GPT-4 and Claude 2. The findings of this study potentially have significant implications for the future of subspecialty medical training and patient care. △ Less

Submitted 9 August, 2023; originally announced August 2023.

Comments: 7 pages, 3 figures, 1 table

arXiv:2308.04684 [pdf, other]

The Number of Overlap** Customers in Erlang-A Queues: An Asymptotic Approach

Authors: Young Myoung Ko, Jamol Pender, ** Xu

Abstract: In this paper, we investigate the number of customers that overlap or coincide with a virtual customer in an Erlang-A queue. Our study provides a novel approach that exploits fluid and diffusion limits for the queue to approximate the mean and variance of the number of overlap** customers. We conduct a detailed analysis of the fluid and diffusion limit differential equations to derive these appr… ▽ More In this paper, we investigate the number of customers that overlap or coincide with a virtual customer in an Erlang-A queue. Our study provides a novel approach that exploits fluid and diffusion limits for the queue to approximate the mean and variance of the number of overlap** customers. We conduct a detailed analysis of the fluid and diffusion limit differential equations to derive these approximations. We also construct new accurate approximations for the mean and variance of the waiting time in the Erlang-A queue by combining fluid limits with the polygamma function. Our findings have important implications for queueing theory and evaluating the overlap risk of more complicated service systems. △ Less

Submitted 8 August, 2023; originally announced August 2023.

arXiv:2308.01573 [pdf]

doi 10.1109/OJSP.2024.3386495

Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS

Authors: Myeong** Ko, Yong-Hoon Choi

Abstract: The diffusion model is capable of generating high-quality data through a probabilistic approach. However, it suffers from the drawback of slow generation speed due to the requirement of a large number of time steps. To address this limitation, recent models such as denoising diffusion implicit models (DDIM) focus on generating samples without directly modeling the probability distribution, while m… ▽ More The diffusion model is capable of generating high-quality data through a probabilistic approach. However, it suffers from the drawback of slow generation speed due to the requirement of a large number of time steps. To address this limitation, recent models such as denoising diffusion implicit models (DDIM) focus on generating samples without directly modeling the probability distribution, while models like denoising diffusion generative adversarial networks (GAN) combine diffusion processes with GANs. In the field of speech synthesis, a recent diffusion speech synthesis model called DiffGAN-TTS, utilizing the structure of GANs, has been introduced and demonstrates superior performance in both speech quality and generation speed. In this paper, to further enhance the performance of DiffGAN-TTS, we propose a speech synthesis model with two discriminators: a diffusion discriminator for learning the distribution of the reverse process and a spectrogram discriminator for learning the distribution of the generated data. Objective metrics such as structural similarity index measure (SSIM), mel-cepstral distortion (MCD), F0 root mean squared error (F0 RMSE), short-time objective intelligibility (STOI), perceptual evaluation of speech quality (PESQ), as well as subjective metrics like mean opinion score (MOS), are used to evaluate the performance of the proposed model. The evaluation results show that the proposed model outperforms recent state-of-the-art models such as FastSpeech2 and DiffGAN-TTS in various metrics. Our implementation and audio samples are located on GitHub. △ Less

Submitted 3 August, 2023; originally announced August 2023.

Journal ref: IEEE Open Journal of Signal Processing, vol. 5, pp. 577-587, 2024

arXiv:2307.01480 [pdf, ps, other]

doi 10.1109/EDM58354.2023.10225136

Expanding Scanning Frequency Range of Josephson Parametric Amplifier Axion Haloscope Readout with Schottky Diode Bias Circuit

Authors: Minsu Ko, Sergey V. Uchaikin, Boris I. Ivanov, **Myeong Kim, Seonjeong Oh, Violeta Gkika, Yannis K. Semertzidis

Abstract: The axion search experiments in the microwave frequency range require high sensitive detectors with intrinsic noise close to quantum noise limit. Josephson parametric amplifiers (JPAs) are the most valuable candidates for the role of the first stage amplifier in the measurement circuit of the microwave frequency range, as they are well-known in superconducting quantum circuits readout. To increase… ▽ More The axion search experiments in the microwave frequency range require high sensitive detectors with intrinsic noise close to quantum noise limit. Josephson parametric amplifiers (JPAs) are the most valuable candidates for the role of the first stage amplifier in the measurement circuit of the microwave frequency range, as they are well-known in superconducting quantum circuits readout. To increase the frequency range, a challenging scientific task involves implementing an assembly with parallel connection of several single JPAs, which requires matching the complex RF circuit at microwaves and ensuring proper DC flux bias. In this publication, we present a new DC flux bias setup based on a Schottky diode circuit for a JPA assembly consisting of two JPAs. We provide a detailed characterization of the diodes at cryogenic temperatures lower than 4 K. Specifically, we selected two RF Schottky diodes with desirable characteristics for the DC flux bias setup, and our results demonstrate that the Schottky diode circuit is a promising method for achieving proper DC flux bias in JPA assemblies. △ Less

Submitted 4 July, 2023; originally announced July 2023.

Comments: 7 pages, 6 images

arXiv:2306.09020 [pdf, other]

Distributionally Robust Stratified Sampling for Stochastic Simulations with Multiple Uncertain Input Models

Authors: Seung Min Baik, Eunshin Byon, Young Myoung Ko

Abstract: This paper presents a robust version of the stratified sampling method when multiple uncertain input models are considered for stochastic simulation. Various variance reduction techniques have demonstrated their superior performance in accelerating simulation processes. Nevertheless, they often use a single input model and further assume that the input model is exactly known and fixed. We consider… ▽ More This paper presents a robust version of the stratified sampling method when multiple uncertain input models are considered for stochastic simulation. Various variance reduction techniques have demonstrated their superior performance in accelerating simulation processes. Nevertheless, they often use a single input model and further assume that the input model is exactly known and fixed. We consider more general cases in which it is necessary to assess a simulation's response to a variety of input models, such as when evaluating the reliability of wind turbines under nonstationary wind conditions or the operation of a service system when the distribution of customer inter-arrival time is heterogeneous at different times. Moreover, the estimation variance may be considerably impacted by uncertainty in input models. To address such nonstationary and uncertain input models, we offer a distributionally robust (DR) stratified sampling approach with the goal of minimizing the maximum of worst-case estimator variances among plausible but uncertain input models. Specifically, we devise a bi-level optimization framework for formulating DR stochastic problems with different ambiguity set designs, based on the $L_2$-norm, 1-Wasserstein distance, parametric family of distributions, and distribution moments. In order to cope with the non-convexity of objective function, we present a solution approach that uses Bayesian optimization. Numerical experiments and the wind turbine case study demonstrate the robustness of the proposed approach. △ Less

Submitted 15 June, 2023; originally announced June 2023.

arXiv:2305.19567 [pdf, other]

DC CoMix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer

Authors: Yerin Choi, Myoung-Wan Koo

Abstract: Despite the huge successes made in neutral TTS, content-leakage remains a challenge. In this paper, we propose a new input representation and simple architecture to achieve improved prosody modeling. Inspired by the recent success in the use of discrete code in TTS, we introduce discrete code to the input of the reference encoder. Specifically, we leverage the vector quantizer from the audio compr… ▽ More Despite the huge successes made in neutral TTS, content-leakage remains a challenge. In this paper, we propose a new input representation and simple architecture to achieve improved prosody modeling. Inspired by the recent success in the use of discrete code in TTS, we introduce discrete code to the input of the reference encoder. Specifically, we leverage the vector quantizer from the audio compression model to exploit the diverse acoustic information it has already been trained on. In addition, we apply the modified MLP-Mixer to the reference encoder, making the architecture lighter. As a result, we train the prosody transfer TTS in an end-to-end manner. We prove the effectiveness of our method through both subjective and objective evaluations. We demonstrate that the reference encoder learns better speaker-independent prosody when discrete code is utilized as input in the experiments. In addition, we obtain comparable results even when fewer parameters are inputted. △ Less

Submitted 28 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

Comments: Accepted in Interspeech 2023

arXiv:2305.06919 [pdf, other]

Reliability Improvement of Circular k-out-of-n: G Balanced Systems through Center of Gravity

Authors: Yongkyu Cho, Seung Min Baik, Young Myoung Ko

Abstract: This paper considers a circular k-out-of-n: G balance system equipped with homogeneous and stationary units. Building on previous research by Endharta et al. (Reliability Engineering & System Safety, 2018), we propose a new balance definition in circular k-out-of-n: G balance systems based on the concept of center of gravity. According to this condition, a circular k-out-of-n: G balance system is… ▽ More This paper considers a circular k-out-of-n: G balance system equipped with homogeneous and stationary units. Building on previous research by Endharta et al. (Reliability Engineering & System Safety, 2018), we propose a new balance definition in circular k-out-of-n: G balance systems based on the concept of center of gravity. According to this condition, a circular k-out-of-n: G balance system is considered balanced if its center of gravity is located at the origin. This new balance condition is not only simple but also advantageous as it covers the previous two balance conditions of symmetry and proportionality. To evaluate the system's reliability, we consider the minimum tie-sets, and extensive numerical studies verify the enhancement of system reliability resulting from the proposed balance definition. △ Less

Submitted 10 May, 2023; originally announced May 2023.

Comments: 27 pages, 10 figures (35 subfigures)

arXiv:2305.02988 [pdf, ps, other]

Kinetic approach of light-nuclei production in intermediate-energy heavy-ion collisions

Authors: Rui Wang, Yu-Gang Ma, Lie-Wen Chen, Che Ming Ko, Kai-Jia Sun, Zhen Zhang

Abstract: We develop a kinetic approach to the production of light nuclei up to mass number $A$ $\leqslant$ $4$ in intermediate-energy heavy-ion collisions by including them as dynamic degrees of freedom. The conversions between nucleons and light nuclei during the collisions are incorporated dynamically via the breakup of light nuclei by a nucleon and their inverse reactions. We also include the Mott effec… ▽ More We develop a kinetic approach to the production of light nuclei up to mass number $A$ $\leqslant$ $4$ in intermediate-energy heavy-ion collisions by including them as dynamic degrees of freedom. The conversions between nucleons and light nuclei during the collisions are incorporated dynamically via the breakup of light nuclei by a nucleon and their inverse reactions. We also include the Mott effect on light nuclei, i.e., a light nucleus would no longer be bound if the phase-space density of its surrounding nucleons is too large. With this kinetic approach, we obtain a reasonable description of the measured yields of light nuclei in central Au+Au collisions at energies of $0.25$ - $1.0A~\rm GeV$ by the FOPI collaboration. Our study also indicates that the observed enhancement of the $α$-particle yield at low incident energies can be attributed to a weaker Mott effect on the $α$-particle, which makes it more difficult to dissolve in nuclear medium, as a result of its much larger binding energy. △ Less

Submitted 4 May, 2023; originally announced May 2023.

Comments: 6 pages, 4 figures

arXiv:2305.02468 [pdf, other]

Task-Optimized Adapters for an End-to-End Task-Oriented Dialogue System

Authors: Namo Bang, Jeehyun Lee, Myoung-Wan Koo

Abstract: Task-Oriented Dialogue (TOD) systems are designed to carry out specific tasks by tracking dialogue states and generating appropriate responses to help users achieve defined goals. Recently, end-to-end dialogue models pre-trained based on large datasets have shown promising performance in the conversational system. However, they share the same parameters to train tasks of the dialogue system (NLU,… ▽ More Task-Oriented Dialogue (TOD) systems are designed to carry out specific tasks by tracking dialogue states and generating appropriate responses to help users achieve defined goals. Recently, end-to-end dialogue models pre-trained based on large datasets have shown promising performance in the conversational system. However, they share the same parameters to train tasks of the dialogue system (NLU, DST, NLG), so debugging each task is challenging. Also, they require a lot of effort to fine-tune large parameters to create a task-oriented chatbot, making it difficult for non-experts to handle. Therefore, we intend to train relatively lightweight and fast models compared to PLM. In this paper, we propose an End-to-end TOD system with Task-Optimized Adapters which learn independently per task, adding only small number of parameters after fixed layers of pre-trained network. We also enhance the performance of the DST and NLG modules through reinforcement learning, overcoming the learning curve that has lacked at the adapter learning and enabling the natural and consistent response generation that is appropriate for the goal. Our method is a model-agnostic approach and does not require prompt-tuning as only input data without a prompt. As results of the experiment, our method shows competitive performance on the MultiWOZ benchmark compared to the existing end-to-end models. In particular, we attain state-of-the-art performance on the DST task of 2.2 dataset. △ Less

Submitted 31 May, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

Comments: Accepted to Findings of ACL2023

arXiv:2305.00054 [pdf, other]

LAVA: Data Valuation without Pre-Specified Learning Algorithms

Authors: Hoang Anh Just, Feiyang Kang, Jiachen T. Wang, Yi Zeng, Myeongseob Ko, Ming **, Ruoxi Jia

Abstract: Traditionally, data valuation (DV) is posed as a problem of equitably splitting the validation performance of a learning algorithm among the training data. As a result, the calculated data values depend on many design choices of the underlying learning algorithm. However, this dependence is undesirable for many DV use cases, such as setting priorities over different data sources in a data acquisit… ▽ More Traditionally, data valuation (DV) is posed as a problem of equitably splitting the validation performance of a learning algorithm among the training data. As a result, the calculated data values depend on many design choices of the underlying learning algorithm. However, this dependence is undesirable for many DV use cases, such as setting priorities over different data sources in a data acquisition process and informing pricing mechanisms in a data marketplace. In these scenarios, data needs to be valued before the actual analysis and the choice of the learning algorithm is still undetermined then. Another side-effect of the dependence is that to assess the value of individual points, one needs to re-run the learning algorithm with and without a point, which incurs a large computation burden. This work leapfrogs over the current limits of data valuation methods by introducing a new framework that can value training data in a way that is oblivious to the downstream learning algorithm. Our main results are as follows. (1) We develop a proxy for the validation performance associated with a training set based on a non-conventional class-wise Wasserstein distance between training and validation sets. We show that the distance characterizes the upper bound of the validation performance for any given model under certain Lipschitz conditions. (2) We develop a novel method to value individual data based on the sensitivity analysis of the class-wise Wasserstein distance. Importantly, these values can be directly obtained for free from the output of off-the-shelf optimization solvers when computing the distance. (3) We evaluate our new data valuation framework over various use cases related to detecting low-quality data and show that, surprisingly, the learning-agnostic feature of our framework enables a significant improvement over SOTA performance while being orders of magnitude faster. △ Less

Submitted 19 December, 2023; v1 submitted 28 April, 2023; originally announced May 2023.

Comments: ICLR 2023 Spotlight Latest Updated Version: 2023/12/19

arXiv:2301.09789 [pdf, other]

A Qualitative Study on the Implementation Design Decisions of Developers

Authors: Jenny T. Liang, Maryam Arab, Minhyuk Ko, Amy J. Ko, Thomas D. LaToza

Abstract: Decision-making is a key software engineering skill. Developers constantly make choices throughout the software development process, from requirements to implementation. While prior work has studied developer decision-making, the choices made while choosing what solution to write in code remain understudied. In this mixed-methods study, we examine the phenomenon where developers select one specifi… ▽ More Decision-making is a key software engineering skill. Developers constantly make choices throughout the software development process, from requirements to implementation. While prior work has studied developer decision-making, the choices made while choosing what solution to write in code remain understudied. In this mixed-methods study, we examine the phenomenon where developers select one specific way to implement a behavior in code, given many potential alternatives. We call these decisions implementation design decisions. Our mixed-methods study includes 46 survey responses and 14 semi-structured interviews with professional developers about their decision types, considerations, processes, and expertise for implementation design decisions. We find that implementation design decisions, rather than being a natural outcome from higher levels of design, require constant monitoring of higher level design choices, such as requirements and architecture. We also show that developers have a consistent general structure to their implementation decision-making process, but no single process is exactly the same. We discuss the implications of our findings on research, education, and practice, including insights on teaching developers how to make implementation design decisions. △ Less

Submitted 23 January, 2023; originally announced January 2023.

arXiv:2211.03962 [pdf, other]

Overlap** time of a virtual customer in time-varying many-server queues

Authors: Young Myoung Ko, ** Xu

Abstract: Motivated by the ongoing COVID-19 pandemic, this paper investigates customers' infection risk by evaluating the overlap** time of a virtual customer with others in queueing systems. Most of the current methodologies focus on characterizing the risk in stationary systems, which may not apply to the more practical time-varying systems. As such, we propose an approximation framework that relies on… ▽ More Motivated by the ongoing COVID-19 pandemic, this paper investigates customers' infection risk by evaluating the overlap** time of a virtual customer with others in queueing systems. Most of the current methodologies focus on characterizing the risk in stationary systems, which may not apply to the more practical time-varying systems. As such, we propose an approximation framework that relies on the fluid limit to compute the expected overlap** time in time-varying queueing systems. Simulation experiments verify the accuracy of our approach. △ Less

Submitted 7 November, 2022; originally announced November 2022.

arXiv:2209.05746 [pdf]

Interwire and Intrawire Magnetostatic Interactions in Fe-Au Barcode Nanowires with Alternating Ferromagnetically Strong and Weak Segments

Authors: Aleksei Yu. Samardak, Yoo Sang Jeon, Vadim Yu. Samardak, Alexey G. Kozlov, Kirill A. Rogachev, Alexey V. Ognev, Eun** Jeong, Gyu Won Kim, Min Jun Ko, Alexander S. Samardak, Young Keun Kim

Abstract: Metallic barcode nanowires (BNWs) composed of repeating heterogeneous segments fabricated by template-assisted electrodeposition can offer extended functionality in magnetic, electrical, mechanical, and biomedical applications. We can consider such nanostructures as a three-dimensional system of magnetically interacting elements with magnetic behavior strongly affected by complex magnetostatic int… ▽ More Metallic barcode nanowires (BNWs) composed of repeating heterogeneous segments fabricated by template-assisted electrodeposition can offer extended functionality in magnetic, electrical, mechanical, and biomedical applications. We can consider such nanostructures as a three-dimensional system of magnetically interacting elements with magnetic behavior strongly affected by complex magnetostatic interactions. This study discusses the influence of geometrical parameters of segments on the character of their interactions and the overall magnetic behavior of the array of BNWs having alternating magnetization. By controlling the applied current densities and the elapsed time in the electrodeposition, we regulate the dimension of the Fe-Au BNWs. We show that the Fe and Au segments are made of Fe-Au alloys with high and low magnetization. With the help of micromagnetic simulations, we discover and analyze the three types of magnetostatic interactions in the BNW arrays. As a result, we demonstrate that the dominating type of interaction depends on the geometric parameters of the Fe and Au segments and the interwire and intrawire distances. △ Less

Submitted 13 September, 2022; originally announced September 2022.

Comments: 29 pages, 7 figures

arXiv:2209.02939 [pdf, other]

doi 10.1609/aaai.v37i7.26005

Grou**-matrix based Graph Pooling with Adaptive Number of Clusters

Authors: Sung Moon Ko, Sungjun Cho, Dae-Woong Jeong, Sehui Han, Moontae Lee, Honglak Lee

Abstract: Graph pooling is a crucial operation for encoding hierarchical structures within graphs. Most existing graph pooling approaches formulate the problem as a node clustering task which effectively captures the graph topology. Conventional methods ask users to specify an appropriate number of clusters as a hyperparameter, then assume that all input graphs share the same number of clusters. In inductiv… ▽ More Graph pooling is a crucial operation for encoding hierarchical structures within graphs. Most existing graph pooling approaches formulate the problem as a node clustering task which effectively captures the graph topology. Conventional methods ask users to specify an appropriate number of clusters as a hyperparameter, then assume that all input graphs share the same number of clusters. In inductive settings where the number of clusters can vary, however, the model should be able to represent this variation in its pooling layers in order to learn suitable clusters. Thus we propose GMPool, a novel differentiable graph pooling architecture that automatically determines the appropriate number of clusters based on the input data. The main intuition involves a grou** matrix defined as a quadratic form of the pooling operator, which induces use of binary classification probabilities of pairwise combinations of nodes. GMPool obtains the pooling operator by first computing the grou** matrix, then decomposing it. Extensive evaluations on molecular property prediction tasks demonstrate that our method outperforms conventional methods. △ Less

Submitted 7 September, 2022; originally announced September 2022.

Comments: 10 pages, 3 figures

arXiv:2208.06882 [pdf, other]

CoShNet: A Hybrid Complex Valued Neural Network using Shearlets

Authors: Manny Ko, Ujjawal K. Panchal, Héctor Andrade-Loarca, Andres Mendez-Vazquez

Abstract: In a hybrid neural network, the expensive convolutional layers are replaced by a non-trainable fixed transform with a great reduction in parameters. In previous works, good results were obtained by replacing the convolutions with wavelets. However, wavelet based hybrid network inherited wavelet's lack of vanishing moments along curves and its axis-bias. We propose to use Shearlets with its robust… ▽ More In a hybrid neural network, the expensive convolutional layers are replaced by a non-trainable fixed transform with a great reduction in parameters. In previous works, good results were obtained by replacing the convolutions with wavelets. However, wavelet based hybrid network inherited wavelet's lack of vanishing moments along curves and its axis-bias. We propose to use Shearlets with its robust support for important image features like edges, ridges and blobs. The resulting network is called Complex Shearlets Network (CoShNet). It was tested on Fashion-MNIST against ResNet-50 and Resnet-18, obtaining 92.2% versus 90.7% and 91.8% respectively. The proposed network has 49.9k parameters versus ResNet-18 with 11.18m and use 52 times fewer FLOPs. Finally, we trained in under 20 epochs versus 200 epochs required by ResNet and do not need any hyperparameter tuning nor regularization. Code: https://github.com/Ujjawal-K-Panchal/coshnet △ Less

Submitted 29 October, 2022; v1 submitted 14 August, 2022; originally announced August 2022.

Comments: 16 pages, 11 figures

arXiv:2207.12532 [pdf, ps, other]

Unveiling the dynamics of nucleosynthesis in relativistic heavy-ion collisions

Authors: Kai-Jia Sun, Rui Wang, Che Ming Ko, Yu-Gang Ma, Chun Shen

Abstract: Like nucleosynthesis during the early universe, light nuclei are also produced in relativistic heavy-ion collisions. Although the deuteron ($d$) yields in these collisions can be well described by the statistical hadronization model (SHM), which assumes that particle yields are fixed at a common chemical freezeout near the phase boundary between the quark-gluon plasma and the hadron gas, the recen… ▽ More Like nucleosynthesis during the early universe, light nuclei are also produced in relativistic heavy-ion collisions. Although the deuteron ($d$) yields in these collisions can be well described by the statistical hadronization model (SHM), which assumes that particle yields are fixed at a common chemical freezeout near the phase boundary between the quark-gluon plasma and the hadron gas, the recently measured triton ($^3\text{H}$) yields in Au+Au collisions at $\sqrt{s_{NN}}=7.7-200$ GeV are overestimated systematically by this model. Here, we develop a comprehensive kinetic approach to study the effects of hadronic re-scatterings, such as $πNN\leftrightarrowπd$ and $πNNN\leftrightarrowπ^3\text{H}~(^3\text{He})$, on $d$, $^3\text{H}$, and $^3\text{He}$ production in these collisions. We find that these reactions have little effects on the deuteron yield but reduce the $^3\text{H}$ and $^3\text{He}$ yields by about a factor of 1.8 from their initial values given by the SHM. This finding helps resolve the overestimation of triton production in the SHM and provides the evidence for hadronic re-scattering effects on nucleosynthesis in relativistic heavy-ion collisions. △ Less

Submitted 31 July, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

Comments: 6 pages, 4 figures. arXiv admin note: text overlap with arXiv:2106.12742

arXiv:2205.12221 [pdf, other]

ClaimDiff: Comparing and Contrasting Claims on Contentious Issues

Authors: Miyoung Ko, Ingyu Seong, Hwaran Lee, Joonsuk Park, Minsuk Chang, Minjoon Seo

Abstract: With the growing importance of detecting misinformation, many studies have focused on verifying factual claims by retrieving evidence. However, canonical fact verification tasks do not apply to catching subtle differences in factually consistent claims, which might still bias the readers, especially on contentious political or economic issues. Our underlying assumption is that among the trusted so… ▽ More With the growing importance of detecting misinformation, many studies have focused on verifying factual claims by retrieving evidence. However, canonical fact verification tasks do not apply to catching subtle differences in factually consistent claims, which might still bias the readers, especially on contentious political or economic issues. Our underlying assumption is that among the trusted sources, one's argument is not necessarily more true than the other, requiring comparison rather than verification. In this study, we propose ClaimDiff, a novel dataset that primarily focuses on comparing the nuance between claim pairs. In ClaimDiff, we provide 2,941 annotated claim pairs from 268 news articles. We observe that while humans are capable of detecting the nuances between claims, strong baselines struggle to detect them, showing over a 19% absolute gap with the humans. We hope this initial study could help readers to gain an unbiased grasp of contentious issues through machine-aided comparison. △ Less

Submitted 11 June, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

Comments: published at Findings of ACL 2023

arXiv:2205.11010 [pdf, ps, other]

Spinodal Enhancement of Light Nuclei Yield Ratio in Relativistic Heavy Ion Collisions

Authors: Kai-Jia Sun, Wen-Hao Zhou, Lie-Wen Chen, Che Ming Ko, Feng Li, Rui Wang, Jun Xu

Abstract: Using a relativistic transport model to describe the evolution of the quantum chromodynamic matter produced in Au+Au collisions at $\sqrt{s_{NN}}=3-200$ GeV, we study the effect of a first-order phase transition in the equation of state of this matter on the yield ratio $N_tN_p/ N_d^2$ ($tp/d^2$) of produced proton ($p$), deuteron ($d$), and triton ($t$). We find that the large density inhomogenei… ▽ More Using a relativistic transport model to describe the evolution of the quantum chromodynamic matter produced in Au+Au collisions at $\sqrt{s_{NN}}=3-200$ GeV, we study the effect of a first-order phase transition in the equation of state of this matter on the yield ratio $N_tN_p/ N_d^2$ ($tp/d^2$) of produced proton ($p$), deuteron ($d$), and triton ($t$). We find that the large density inhomogeneities generated by the spinodal instability during the first-order phase transition can survive the fast expansion of the subsequent hadronic matter and lead to an enhanced $tp/d^2$ in central collisions at $\sqrt{s_{NN}}=3-5$ GeV as seen in the experiments by the STAR Collaboration and the E864 Collaboration. However, this enhancement subsides with increasing collision centrality, and the resulting almost flat centrality dependence of $tp/d^2$ at $\sqrt{s_{NN}}=3$ GeV can also be used as a signal for the first-order phase transition. △ Less

Submitted 22 May, 2022; originally announced May 2022.

Comments: 6 pages, 3 figures

arXiv:2204.10879 [pdf, ps, other]

doi 10.1016/j.physletb.2023.137864

Event-by-event anti-deuteron multiplicity fluctuation in Pb+Pb collisions at $\sqrt{s_{NN}}=5.02$ TeV

Authors: Kai-Jia Sun, Che Ming Ko

Abstract: Using the nucleon coalescence model based on kinetic freeze-out nucleons from the hybrid model of MUSIC hydrodynamics and UrQMD hadronic transport, we study the production of anti-deuteron and its event-by-event fluctuation in Pb+Pb collisions at $\sqrt{s_{NN}}=5.02$ TeV. We find a clear suppression of the anti-deuteron to antiproton yield ratio in peripheral collisions, which is in accordance wit… ▽ More Using the nucleon coalescence model based on kinetic freeze-out nucleons from the hybrid model of MUSIC hydrodynamics and UrQMD hadronic transport, we study the production of anti-deuteron and its event-by-event fluctuation in Pb+Pb collisions at $\sqrt{s_{NN}}=5.02$ TeV. We find a clear suppression of the anti-deuteron to antiproton yield ratio in peripheral collisions, which is in accordance with the measurements from the ALICE Collaboration. Also found is a Poissonian event-by-event fluctuation of the anti-deuteron multiplicity distribution in all collision centralities, which is different from the prediction of a simple coalescence model calculation that assumes all antiproton and antineutron pairs to have the same probability to form anti-deuterons. We further find a small negative correlation between the anti-deuteron and antiproton multiplicity distributions as a result of the baryon conservation. △ Less

Submitted 11 June, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

Comments: 5 pages, 5 figures

arXiv:2203.10827 [pdf, other]

Separating Content from Speaker Identity in Speech for the Assessment of Cognitive Impairments

Authors: Dongseok Heo, Cheul Young Park, Jaemin Cheun, Myung ** Ko

Abstract: Deep speaker embeddings have been shown effective for assessing cognitive impairments aside from their original purpose of speaker verification. However, the research found that speaker embeddings encode speaker identity and an array of information, including speaker demographics, such as sex and age, and speech contents to an extent, which are known confounders in the assessment of cognitive impa… ▽ More Deep speaker embeddings have been shown effective for assessing cognitive impairments aside from their original purpose of speaker verification. However, the research found that speaker embeddings encode speaker identity and an array of information, including speaker demographics, such as sex and age, and speech contents to an extent, which are known confounders in the assessment of cognitive impairments. In this paper, we hypothesize that content information separated from speaker identity using a framework for voice conversion is more effective for assessing cognitive impairments and train simple classifiers for the comparative analysis on the DementiaBank Pitt Corpus. Our results show that while content embeddings have an advantage over speaker embeddings for the defined problem, further experiments show their effectiveness depends on information encoded in speaker embeddings due to the inherent design of the architecture used for extracting contents. △ Less

Submitted 21 March, 2022; originally announced March 2022.

Comments: 5 pages, submitted to INTERSPEECH 2022

arXiv:2202.06672 [pdf, ps, other]

doi 10.1016/j.ppnp.2022.103962

Transport Model Comparison Studies of Intermediate-Energy Heavy-Ion Collisions

Authors: Hermann Wolter, Maria Colonna, Dan Cozma, Pawel Danielewicz, Che Ming Ko, Rohit Kumar, Akira Ono, ManYee Betty Tsang, Jun Xu, Ying-Xun Zhang, Elena Bratkovskaya, Zhao-Qing Feng, Theodoros Gaitanos, Arnaud Le Fèvre, Natsumi Ikeno, Youngman Kim, Swagata Mallik, Paolo Napolitani, Dmytro Oliinychenko, Tatsuhiko Ogawa, Massimo Papa, Jun Su, Rui Wang, Yong-Jia Wang, Janus Weil , et al. (27 additional authors not shown)

Abstract: Transport models are the main method to obtain physics information from low to relativistic-energy heavy-ion collisions. The Transport Model Evaluation Project (TMEP) has been pursued to test the robustness of transport model predictions in reaching consistent conclusions from the same type of physical model. Calculations under controlled conditions of physical input and set-up were performed with… ▽ More Transport models are the main method to obtain physics information from low to relativistic-energy heavy-ion collisions. The Transport Model Evaluation Project (TMEP) has been pursued to test the robustness of transport model predictions in reaching consistent conclusions from the same type of physical model. Calculations under controlled conditions of physical input and set-up were performed with various participating codes. These included both calculations of nuclear matter in a box with periodic boundary conditions, and more realistic calculations of heavy-ion collisions. In this intermediate review, we summarize and discuss the present status of the project. We also provide condensed descriptions of the 26 participating codes, which contributed to some part of the project. These include the major codes in use today. We review the main results of the studies completed so far. They show, that in box calculations the differences between the codes can be well understood and a convergence of the results can be reached. These studies also highlight the systematic differences between the two families of transport codes, known as BUU and QMD type codes. However, when the codes were compared in full heavy-ion collisions using different physical models, as recently for pion production, they still yielded substantially different results. This calls for further comparisons of heavy-ion collisions with controlled models and of box comparisons of important ingredients, like momentum-dependent fields, which are currently underway. We often indicate improved strategies in performing transport simulations and thus provide guidance to code developers. Results of transport simulations of heavy-ion collisions from a given code will have more significance if the code can be validated against benchmark calculations such as the ones summarized in this review. △ Less

Submitted 4 May, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

Comments: 114 pages, 14 figures, 479 references, accepted for publication in Progress of Particle and Nuclear Phsics

Journal ref: Prog. Part. Nucl. Phys. 125 (2022) 103962

arXiv:2112.14410 [pdf, other]

doi 10.1103/PhysRevC.105.034911

Evolution of $Λ$ polarization in the hadronic phase of heavy-ion collisions

Authors: Yifeng Sun, Zhen Zhang, Che Ming Ko, Wenbin Zhao

Abstract: Using the AMPT + MUSIC+UrQMD hybrid model, we study the global and local spin polarizations of $Λ$ hyperons as functions of the freeze-out temperature of the spin degree of freedom in the hadronic phase of Au+Au collisions at $\sqrt{s_{NN}}=19.6$ GeV. Including contributions from both the thermal vorticity and thermal shear of the hadronic matter, we find that, with the spin freeze-out temperature… ▽ More Using the AMPT + MUSIC+UrQMD hybrid model, we study the global and local spin polarizations of $Λ$ hyperons as functions of the freeze-out temperature of the spin degree of freedom in the hadronic phase of Au+Au collisions at $\sqrt{s_{NN}}=19.6$ GeV. Including contributions from both the thermal vorticity and thermal shear of the hadronic matter, we find that, with the spin freeze-out temperature drop** from the hadronization temperature of 160 MeV to 110 MeV at the kinetic freeze-out, both the global and local spin polarizations of $Λ$ hyperons due to the thermal vorticity decrease by a factor of two, while those due to the thermal shear decrease quickly and become negligibly small at 140 MeV. Our results suggest the importance of understanding the dynamical evolution of the spin degree of freedom in the hadronic stage in relativistic heavy-ion collisions. △ Less

Submitted 30 December, 2021; v1 submitted 29 December, 2021; originally announced December 2021.

Comments: 5 pages, 3 figures, references updated

arXiv:2112.12269 [pdf, other]

doi 10.1016/j.aop.2022.168960

Angular Momentum Eigenstates of the Isotropic 3-D Harmonic Oscillator: Phase-Space Distributions and Coalescence Probabilities

Authors: Michael Kordell II, Rainer J. Fries, Che Ming Ko

Abstract: The isotropic 3-dimensional harmonic oscillator potential can serve as an approximate description of many systems in atomic, solid state, nuclear, and particle physics. In particular, the question of 2 particles binding (or coalescing) into angular momentum eigenstates in such a potential has interesting applications. We compute the probabilities for coalescence of two distinguishable, non-relativ… ▽ More The isotropic 3-dimensional harmonic oscillator potential can serve as an approximate description of many systems in atomic, solid state, nuclear, and particle physics. In particular, the question of 2 particles binding (or coalescing) into angular momentum eigenstates in such a potential has interesting applications. We compute the probabilities for coalescence of two distinguishable, non-relativistic particles into such a bound state, where the initial particles are represented by generic wave packets of given average positions and momenta. We use a phase-space formulation and hence need the Wigner distribution functions of angular momentum eigenstates in isotropic 3-dimensional harmonic oscillators. These distribution functions have been discussed in the literature before but we utilize an alternative approach to obtain these functions. Along the way, we derive a general formula that expands angular momentum eigenstates in terms of products of 1-dimensional harmonic oscillator eigenstates. △ Less

Submitted 30 December, 2021; v1 submitted 22 December, 2021; originally announced December 2021.

Comments: 31 pages, 4 figures; v2: typos fixed, reference added

arXiv:2110.06170 [pdf, other]

doi 10.1051/0004-6361/202141047

Outflows in the presence of cosmic rays and waves with cooling

Authors: C. M. Ko, B. Ramzan, D. O. Chernyshov

Abstract: Plasma outflow from a gravitational potential well with cosmic rays and self-excited Alfvén waves with cooling and wave dam** is studied in the hydrodynamics regime. We study outflows in the presence of cosmic ray and Alfvén waves including the effect of cooling and wave dam**. We seek physically allowable steady-state subsonic-supersonic transonic solutions. We adopted a multi-fluid hydrodyna… ▽ More Plasma outflow from a gravitational potential well with cosmic rays and self-excited Alfvén waves with cooling and wave dam** is studied in the hydrodynamics regime. We study outflows in the presence of cosmic ray and Alfvén waves including the effect of cooling and wave dam**. We seek physically allowable steady-state subsonic-supersonic transonic solutions. We adopted a multi-fluid hydrodynamical model for the cosmic ray plasma system. Thermal plasma, cosmic rays, and self-excited Alfvén waves are treated as fluids. Interactions such as cosmic-ray streaming instability, cooling, and wave dam** were fully taken into account. We considered one-dimensional geometry and explored steady-state solutions. The model is reduced to a set of ordinary differential equations, which we solved for subsonic-supersonic transonic solutions with given boundary conditions at the base of the gravitational potential well. We find that physically allowable subsonic-supersonic transonic solutions exist for a wide range of parameters. We studied the three-fluid system (considering only forward-propagating Alfvén waves) in detail. We examined the cases with and without cosmic ray diffusion separately. Comparisons of solutions with and without cooling and with and without wave dam** for the same set of boundary conditions (on density, pressures of thermal gas, cosmic rays and waves) are presented. We also present the interesting case of a four-fluid system (both forward- and backward-propagating Alfvén waves are included), highlighting the intriguing relation between different components. △ Less

Submitted 12 October, 2021; originally announced October 2021.

Comments: Outflows --Hydrodynamics-- Cosmic rays -- Alfvén waves-- Cooling --Wave Dam**

MSC Class: Accepted in A&A

Journal ref: A&A 654, A63 (2021)

arXiv:2107.13384 [pdf, other]

doi 10.1016/j.physletb.2022.137134

Charged pion production from Au + Au collisions at $\sqrt{s_{NN}}=2.4$ GeV in the Relativistic Vlasov-Uehling-Uhlenbeck model

Authors: Kyle Godbey, Zhen Zhang, Jeremy W. Holt, Che Ming Ko

Abstract: Using the isospin-dependent relativistic Vlasov-Uehling-Uhlenbeck (RVUU) model, we study charged pion ($π^\pm$) production in Au+Au collisions at $\sqrt{s_{NN}}=$ 2.4 GeV. By fitting the density dependence of the $Δ$ resonance production cross section in nuclear medium to reproduce the experimental $π^\pm$ multiplicities measured by the HADES Collaboration, we obtain a good description of the rapi… ▽ More Using the isospin-dependent relativistic Vlasov-Uehling-Uhlenbeck (RVUU) model, we study charged pion ($π^\pm$) production in Au+Au collisions at $\sqrt{s_{NN}}=$ 2.4 GeV. By fitting the density dependence of the $Δ$ resonance production cross section in nuclear medium to reproduce the experimental $π^\pm$ multiplicities measured by the HADES Collaboration, we obtain a good description of the rapidity distributions and transverse momentum spectra of $π^\pm$ in collisions at various centralities. Some shortcomings in the description of $π^{\pm}$ production may indicate the need for including the strong potential on $π^\pm$ in RVUU, which is at present absent. We also calculate the proton rapidity distribution in the most central collisions and compare with the coalescence invariant proton rapidity distribution extracted from preliminary HADES data. △ Less

Submitted 29 April, 2022; v1 submitted 28 July, 2021; originally announced July 2021.

Comments: 7 pages, 5 figures, version to appear in PLB

arXiv:2106.12742 [pdf, other]

Relativistic kinetic approach to light nuclei production in high-energy nuclear collisions

Authors: Kai-Jia Sun, Rui Wang, Che Ming Ko, Yu-Gang Ma, Chun Shen

Abstract: Understanding the production mechanism of light (anti-)nuclei in high-energy nuclear collisions and cosmic rays has been a long-standing problem in nuclear physics. In the present study, we develop a stochastic method to solve the relativistic kinetic equations for light nuclei production from many-body reactions with the inclusion of their finite sizes. The present approach gives an excellent des… ▽ More Understanding the production mechanism of light (anti-)nuclei in high-energy nuclear collisions and cosmic rays has been a long-standing problem in nuclear physics. In the present study, we develop a stochastic method to solve the relativistic kinetic equations for light nuclei production from many-body reactions with the inclusion of their finite sizes. The present approach gives an excellent description of the deuteron and helium-3 data from central Au+Au (Pb+Pb) collisions at $\sqrt{s_{\rm NN}}$ $=$ $200~\rm GeV$ ($2.76~\rm TeV$). It can also naturally explain their suppressed production in $pp$ collisions at 7 TeV as a result of their finite sizes. △ Less

Submitted 23 June, 2021; originally announced June 2021.

Comments: 7 pages, 3 figures

arXiv:2106.12287 [pdf, ps, other]

doi 10.1103/PhysRevC.104.024603

Comparison of Heavy-Ion Transport Simulations: Mean-field Dynamics in a Box

Authors: Maria Colonna, Ying-Xun Zhang, Yong-Jia Wang, Dan Cozma, Pawel Danielewicz, Che Ming Ko, Akira Ono, Manyee Betty Tsang, Rui Wang, Hermann Wolter, Jun Xu, Zhen Zhang, Lie-Wen Chen, Hui-Gan Cheng, Hannah Elfner, Zhao-Qing Feng, Myungkuk Kim, Youngman Kim, Sangyong Jeon, Chang-Hwan Lee, Bao-An Li, Qing-Feng Li, Zhu-Xia Li, Swagata Mallik, Dmytro Oliinychenko , et al. (4 additional authors not shown)

Abstract: Within the transport model evaluation project (TMEP) of simulations for heavy-ion collisions, the mean-field response is examined here. Specifically, zero-sound propagation is considered for neutron-proton symmetric matter enclosed in a periodic box, at zero temperature and around normal density. The results of several transport codes belonging to two families (BUU-like and QMD-like) are compared… ▽ More Within the transport model evaluation project (TMEP) of simulations for heavy-ion collisions, the mean-field response is examined here. Specifically, zero-sound propagation is considered for neutron-proton symmetric matter enclosed in a periodic box, at zero temperature and around normal density. The results of several transport codes belonging to two families (BUU-like and QMD-like) are compared among each other and to exact calculations. For BUU-like codes, employing the test particle method, the results depend on the combination of the number of test particles and the spread of the profile functions that weight integration over space. These parameters can be properly adapted to give a good reproduction of the analytical zero-sound features. QMD-like codes, using molecular dynamics methods, are characterized by large dam** effects, attributable to the fluctuations inherent in their phase-space representation. Moreover, for a given nuclear effective interaction, they generally lead to slower density oscillations, as compared to BUU-like codes. The latter problem is mitigated in the more recent lattice formulation of some of the QMD codes. The significance of these results for the description of real heavy-ion collisions is discussed. △ Less

Submitted 23 June, 2021; originally announced June 2021.

Journal ref: Phys. Rev. C 104, 024603 (2021)

arXiv:2105.14204 [pdf, other]

doi 10.1016/j.physletb.2021.136571

Multiplicity Scaling of Light Nuclei Production in Relativistic Heavy-Ion Collisions

Authors: Wenbin Zhao, Kai-jia Sun, Che Ming Ko, Xiaofeng Luo

Abstract: Using the nucleon coalescence model based on kinetic freeze-out nucleons from the 3D MUSIC+UrQMD and the 2D VISHNU hybrid model with a crossover equation of state, we study the multiplicity dependence of deuteron ($d$) and triton ($t$) production from central to peripheral Au+Au collisions at $\sqrt{s_\mathrm{NN}}=$ 7.7, 14.5, 19.6, 27, 39, 62.4 and 200 GeV and Pb+Pb at… ▽ More Using the nucleon coalescence model based on kinetic freeze-out nucleons from the 3D MUSIC+UrQMD and the 2D VISHNU hybrid model with a crossover equation of state, we study the multiplicity dependence of deuteron ($d$) and triton ($t$) production from central to peripheral Au+Au collisions at $\sqrt{s_\mathrm{NN}}=$ 7.7, 14.5, 19.6, 27, 39, 62.4 and 200 GeV and Pb+Pb at $\sqrt{s_\mathrm{NN}}=2.76$ TeV, respectively. It is found that the ratio $N_t N_p/N_d^2$ of the proton yield $N_p$, deuteron yield $N_d$ and triton yield $N_t$ exhibits a scaling behavior in its multiplicity dependence, i.e., decreasing monotonically with increasing charged-particle multiplicity. A similar multiplicity scaling of this ratio is also found in the nucleon coalescence calculation based on kinetic freeze-out nucleons from a multiphase transport (AMPT) model. The scaling behavior of $N_t N_p/N_d^2$ can be naturally explained by the interplay between the sizes of light nuclei and the nucleon emission source. We further argue that the multiplicity scaling of $N_t N_p/N_d^2$ can be used to validate the production mechanism of light nuclei, and the collision energy dependence of this yield ratio can further serve as a baseline in the search for the QCD critical point in relativistic heavy-ion collisions. △ Less

Submitted 14 August, 2021; v1 submitted 29 May, 2021; originally announced May 2021.

Comments: 9 pages, 5 figures

Journal ref: Phys. Lett. B 820, 136571 (2021)

arXiv:2105.10477 [pdf]

Towards Realization of Augmented Intelligence in Dermatology: Advances and Future Directions

Authors: Roxana Daneshjou, Carrie Kovarik, Justin M Ko

Abstract: Artificial intelligence (AI) algorithms using deep learning have advanced the classification of skin disease images; however these algorithms have been mostly applied "in silico" and not validated clinically. Most dermatology AI algorithms perform binary classification tasks (e.g. malignancy versus benign lesions), but this task is not representative of dermatologists' diagnostic range. The Americ… ▽ More Artificial intelligence (AI) algorithms using deep learning have advanced the classification of skin disease images; however these algorithms have been mostly applied "in silico" and not validated clinically. Most dermatology AI algorithms perform binary classification tasks (e.g. malignancy versus benign lesions), but this task is not representative of dermatologists' diagnostic range. The American Academy of Dermatology Task Force on Augmented Intelligence published a position statement emphasizing the importance of clinical validation to create human-computer synergy, termed augmented intelligence (AuI). Liu et al's recent paper, "A deep learning system for differential diagnosis of skin diseases" represents a significant advancement of AI in dermatology, bringing it closer to clinical impact. However, significant issues must be addressed before this algorithm can be integrated into clinical workflow. These issues include accurate and equitable model development, defining and assessing appropriate clinical outcomes, and real-world integration. △ Less

Submitted 21 May, 2021; originally announced May 2021.

Comments: 5 pages, no figures

arXiv:2105.09518 [pdf, ps, other]

doi 10.1103/PhysRevC.104.044901

Elliptic flow splittings in the Polyakov-looped Nambu-Jona Lasinio transport model

Authors: Wen-Hao Zhou, He Liu, Feng Li, Yi-Feng Sun, Jun Xu, Che Ming Ko

Abstract: To incorporate the effect of gluons on the evolution dynamics of the quark matter produced in relativistic heavy-ion collisions, we extend the 3-flavor Nambu-Jona-Lasinio (NJL) transport model to include the contribution from the Polyakov loops. Imbedding the resulting pNJL partonic transport model in an extended multiphase transport (extended AMPT) model, we then study the elliptic flow splitting… ▽ More To incorporate the effect of gluons on the evolution dynamics of the quark matter produced in relativistic heavy-ion collisions, we extend the 3-flavor Nambu-Jona-Lasinio (NJL) transport model to include the contribution from the Polyakov loops. Imbedding the resulting pNJL partonic transport model in an extended multiphase transport (extended AMPT) model, we then study the elliptic flow splittings between particles and their antiparticles in relativistic heavy-ion collisions at RHIC-BES energies. We find that a weak quark vector interaction in the partonic phase is able to describe the elliptic flow splitting between protons and antiprotons in heavy-ion collisions at $\sqrt{s_{NN}}=7.7$ to 39 GeV. Knowledge on the quark vector interaction is useful for understanding the equation of state of quark matter at large baryon chemical potentials and thus the location of the critical point in the QCD phase diagram. △ Less

Submitted 15 September, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

Comments: 14 pages, 15 figures

Journal ref: Phys. Rev. C 104, 044901 (2021)

Showing 1–50 of 363 results for author: Koo, M