Search | arXiv e-print repository

DEBATE: Devil's Advocate-Based Assessment and Text Evaluation

Authors: Alex Kim, Keonwoo Kim, Sangwon Yoon

Abstract: As natural language generation (NLG) models have become prevalent, systematically assessing the quality of machine-generated texts has become increasingly important. Recent studies introduce LLM-based evaluators that operate as reference-free metrics, demonstrating their capability to adeptly handle novel tasks. However, these models generally rely on a single-agent approach, which, we argue, intr… ▽ More As natural language generation (NLG) models have become prevalent, systematically assessing the quality of machine-generated texts has become increasingly important. Recent studies introduce LLM-based evaluators that operate as reference-free metrics, demonstrating their capability to adeptly handle novel tasks. However, these models generally rely on a single-agent approach, which, we argue, introduces an inherent limit to their performance. This is because there exist biases in LLM agent's responses, including preferences for certain text structure or content. In this work, we propose DEBATE, an NLG evaluation framework based on multi-agent scoring system augmented with a concept of Devil's Advocate. Within the framework, one agent is instructed to criticize other agents' arguments, potentially resolving the bias in LLM agent's answers. DEBATE substantially outperforms the previous state-of-the-art methods in two meta-evaluation benchmarks in NLG evaluation, SummEval and TopicalChat. We also show that the extensiveness of debates among agents and the persona of an agent can influence the performance of evaluators. △ Less

Submitted 23 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.09765 [pdf, other]

doi 10.1109/ICASSP48485.2024.10446698

Unsupervised Extractive Dialogue Summarization in Hyperdimensional Space

Authors: Seongmin Park, Kyungho Kim, Jae** Seo, Jihwa Lee

Abstract: We present HyperSum, an extractive summarization framework that captures both the efficiency of traditional lexical summarization and the accuracy of contemporary neural approaches. HyperSum exploits the pseudo-orthogonality that emerges when randomly initializing vectors at extremely high dimensions ("blessing of dimensionality") to construct representative and efficient sentence embeddings. Simp… ▽ More We present HyperSum, an extractive summarization framework that captures both the efficiency of traditional lexical summarization and the accuracy of contemporary neural approaches. HyperSum exploits the pseudo-orthogonality that emerges when randomly initializing vectors at extremely high dimensions ("blessing of dimensionality") to construct representative and efficient sentence embeddings. Simply clustering the obtained embeddings and extracting their medoids yields competitive summaries. HyperSum often outperforms state-of-the-art summarizers -- in terms of both summary accuracy and faithfulness -- while being 10 to 100 times faster. We open-source HyperSum as a strong baseline for unsupervised extractive summarization. △ Less

Submitted 15 May, 2024; originally announced May 2024.

Comments: ICASSP 2024

arXiv:2405.08311 [pdf, ps, other]

A Decoupling and Aggregating Framework for Joint Extraction of Entities and Relations

Authors: Yao Wang, Xin Liu, Weikun Kong, Hai-Tao Yu, Teeradaj Racharak, Kyoung-Sook Kim, Minh Le Nguyen

Abstract: Named Entity Recognition and Relation Extraction are two crucial and challenging subtasks in the field of Information Extraction. Despite the successes achieved by the traditional approaches, fundamental research questions remain open. First, most recent studies use parameter sharing for a single subtask or shared features for both two subtasks, ignoring their semantic differences. Second, informa… ▽ More Named Entity Recognition and Relation Extraction are two crucial and challenging subtasks in the field of Information Extraction. Despite the successes achieved by the traditional approaches, fundamental research questions remain open. First, most recent studies use parameter sharing for a single subtask or shared features for both two subtasks, ignoring their semantic differences. Second, information interaction mainly focuses on the two subtasks, leaving the fine-grained informtion interaction among the subtask-specific features of encoding subjects, relations, and objects unexplored. Motivated by the aforementioned limitations, we propose a novel model to jointly extract entities and relations. The main novelties are as follows: (1) We propose to decouple the feature encoding process into three parts, namely encoding subjects, encoding objects, and encoding relations. Thanks to this, we are able to use fine-grained subtask-specific features. (2) We propose novel inter-aggregation and intra-aggregation strategies to enhance the information interaction and construct individual fine-grained subtask-specific features, respectively. The experimental results demonstrate that our model outperforms several previous state-of-the-art models. Extensive additional experiments further confirm the effectiveness of our model. △ Less

Submitted 14 May, 2024; originally announced May 2024.

arXiv:2405.07386 [pdf, other]

Search for lepton-flavor-violating $τ^- \to μ^-μ^+μ^-$ decays at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker , et al. (407 additional authors not shown)

Abstract: We present the result of a search for the charged-lepton-flavor violating decay $τ^- \to μ^-μ^+μ^-$ using a $424fb^{-1}$ sample of data recorded by the Belle II experiment at the SuperKEKB $e^{-}e^{+}$ collider. The selection of $e^{-}e^{+}\toτ^+τ^-$ events is based on an inclusive reconstruction of the non-signal tau decay, and on a boosted decision tree to suppress background. We observe one sig… ▽ More We present the result of a search for the charged-lepton-flavor violating decay $τ^- \to μ^-μ^+μ^-$ using a $424fb^{-1}$ sample of data recorded by the Belle II experiment at the SuperKEKB $e^{-}e^{+}$ collider. The selection of $e^{-}e^{+}\toτ^+τ^-$ events is based on an inclusive reconstruction of the non-signal tau decay, and on a boosted decision tree to suppress background. We observe one signal candidate, which is compatible with the expectation from background processes. We set a $90\%$ confidence level upper limit of $1.9 \times 10^{-8}$ on the branching fraction of the \taumu decay, which is the most stringent bound to date. △ Less

Submitted 12 May, 2024; originally announced May 2024.

Report number: Belle II Preprint 2024-012 KEK Preprint 2024-6

arXiv:2405.06953 [pdf, other]

The Sunburst Arc with JWST: II. Observations of an Eta Carinae Analog at $z=2.37$

Authors: S. Choe, T. Emil Rivera-Thorsen, H. Dahle, K. Sharon, M. Riley Owens, J. R. Rigby, M. B. Bayliss, M. J. Hayes, T. Hutchison, B. Welch, J. Chisholm, M. D. Gladders, G. Khullar, K. Kim

Abstract: "Godzilla" is a peculiar object within the gravitationally lensed Sunburst Arc at $z=2.37$. Despite being very bright, it appears in only one of the twelve lensed images of the source galaxy, and shows exotic spectroscopic properties not found elsewhere in the galaxy. We use JWST's unique combination of spatial resolution and spectroscopic sensitivity to provide a unified, coherent explanation of… ▽ More "Godzilla" is a peculiar object within the gravitationally lensed Sunburst Arc at $z=2.37$. Despite being very bright, it appears in only one of the twelve lensed images of the source galaxy, and shows exotic spectroscopic properties not found elsewhere in the galaxy. We use JWST's unique combination of spatial resolution and spectroscopic sensitivity to provide a unified, coherent explanation of the physical nature of Godzilla. We measure fluxes and kinematic properties of rest-optical emission lines in Godzilla and surrounding regions. Using standard line ratio-based diagnostic methods in combination with NIRCam imaging and ground based rest-UV spectra, we characterize Godzilla and its surroundings. We find that Godzilla is most likely an extremely magnified, non-erupting LBV star with dense gas condensations in close proximity. Among around 60 detected lines, we find a cascade of strong O I lines pumped by intense Ly$β$ emission, as well as Ly$α$-pumped rest-optical Fe II lines, reminiscent of the Weigelt blobs in the local LBV star Eta Carinae. Godzilla is surrounded by dusty, inhomogeneous gas common to massive, evolved stars. Spectra and images of Godzilla and adjacent objects and the detection of a low-surface brightness foreground galaxy in the NIRCam data support the interpretation that Godzilla is a stellar-scale object extremely magnified by alignment with lensing caustics. To explain the dusty surroundings, strong [Ne III] and line kinematics simultaneously, we argue that Godzilla is a post-eruption LBV accompanied by a hotter companion and/or gas condensations exposed to more intense radiation compared to the Weigelt blobs. We expect periodic spectroscopic variations if Godzilla is a binary system. If Godzilla is confirmed to be an LBV star, it expands the distance to the furthest known LBV from a dozen Mpc to several Gpc. △ Less

Submitted 11 May, 2024; originally announced May 2024.

Comments: 18 pages, 16 figures. Submitted to A&A

arXiv:2405.06631 [pdf, other]

The Sunburst Arc with JWST: III. An Abundance of Direct Chemical Abundances

Authors: Brian Welch, T. Emil Rivera-Thorsen, Jane Rigby, Taylor Hutchison, Grace M. Olivier, Danielle A. Berg, Keren Sharon, Hakon Dahle, M. Riley Owens, Matthew B. Bayliss, Gourav Khullar, John Chisholm, Matthew Hayes, Keunho J. Kim

Abstract: We measure the gas-phase abundances of the elements He, N, O, Ne, S, Ar, and Fe in the Lyman-continuum emitting region of the Sunburst Arc, a highly magnified galaxy at redshift $z=2.37$. We detect the temperature-sensitive auroral lines [SII]$λ\lambda4069,4076$, [OII]$λ\lambda7320,7330$, [SIII]$\lambda6312$, [OIII]$\lambda4363$, and [NeIII]$\lambda3343$ in a stacked spectrum of 5 multiple images… ▽ More We measure the gas-phase abundances of the elements He, N, O, Ne, S, Ar, and Fe in the Lyman-continuum emitting region of the Sunburst Arc, a highly magnified galaxy at redshift $z=2.37$. We detect the temperature-sensitive auroral lines [SII]$λ\lambda4069,4076$, [OII]$λ\lambda7320,7330$, [SIII]$\lambda6312$, [OIII]$\lambda4363$, and [NeIII]$\lambda3343$ in a stacked spectrum of 5 multiple images of the Lyman-continuum emitter (LCE), from which we directly measure the electron temperature in the low, intermediate, and high ionization zones. We also detect the density-sensitive doublets of [OII]$λ\lambda3727,3729$, [SII]$λ\lambda6717,6731$, and [ArIV]$λ\lambda4713,4741$, which constrain the density in both the low- and high-ionization gas. With these temperature and density measurements, we measure gas-phase abundances with similar rigor as studies of local galaxies. We measure a gas-phase metallicity for the LCE of $12+\log(\textrm{O}/\textrm{H}) = 7.97 \pm 0.05$, and find an enhanced nitrogen abundance $\log(\textrm{N}/\textrm{O}) = -0.65^{+0.16}_{-0.25}$. This nitrogen abundance is consistent with enrichment from a population of Wolf-Rayet stars, additional signatures of which are reported in a companion paper. Abundances of sulfur, argon, neon, and iron are consistent with local low-metallicity HII regions and low-redshift galaxies. This study represents the most complete chemical abundance analysis of a galaxy at Cosmic Noon to date, which enables direct comparisons between local HII regions and those in the distant universe. △ Less

Submitted 14 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

Comments: 15 pages, 4 figures, 3 tables. Submitted to ApJ

arXiv:2405.05787 [pdf, other]

Autonomous Robotic Ultrasound System for Liver Follow-up Diagnosis: Pilot Phantom Study

Authors: Tianpeng Zhang, Sekeun Kim, Jerome Charton, Haitong Ma, Kyungsang Kim, Na Li, Quanzheng Li

Abstract: The paper introduces a novel autonomous robot ultrasound (US) system targeting liver follow-up scans for outpatients in local communities. Given a computed tomography (CT) image with specific target regions of interest, the proposed system carries out the autonomous follow-up scan in three steps: (i) initial robot contact to surface, (ii) coordinate map** between CT image and robot, and (iii) ta… ▽ More The paper introduces a novel autonomous robot ultrasound (US) system targeting liver follow-up scans for outpatients in local communities. Given a computed tomography (CT) image with specific target regions of interest, the proposed system carries out the autonomous follow-up scan in three steps: (i) initial robot contact to surface, (ii) coordinate map** between CT image and robot, and (iii) target US scan. Utilizing 3D US-CT registration and deep learning-based segmentation networks, we can achieve precise imaging of 3D hepatic veins, facilitating accurate coordinate map** between CT and the robot. This enables the automatic localization of follow-up targets within the CT image, allowing the robot to navigate precisely to the target's surface. Evaluation of the ultrasound phantom confirms the quality of the US-CT registration and shows the robot reliably locates the targets in repeated trials. The proposed framework holds the potential to significantly reduce time and costs for healthcare providers, clinicians, and follow-up patients, thereby addressing the increasing healthcare burden associated with chronic disease in local communities. △ Less

Submitted 9 May, 2024; originally announced May 2024.

arXiv:2405.03905 [pdf, other]

A 65nm 36nJ/Decision Bio-inspired Temporal-Sparsity-Aware Digital Keyword Spotting IC with 0.6V Near-Threshold SRAM

Authors: Qinyu Chen, Kwantae Kim, Chang Gao, Sheng Zhou, Taekwang Jang, Tobi Delbruck, Shih-Chii Liu

Abstract: This paper introduces, to the best of the authors' knowledge, the first fine-grained temporal sparsity-aware keyword spotting (KWS) IC leveraging temporal similarities between neighboring feature vectors extracted from input frames and network hidden states, eliminating unnecessary operations and memory accesses. This KWS IC, featuring a bio-inspired delta-gated recurrent neural network (ΔRNN) cla… ▽ More This paper introduces, to the best of the authors' knowledge, the first fine-grained temporal sparsity-aware keyword spotting (KWS) IC leveraging temporal similarities between neighboring feature vectors extracted from input frames and network hidden states, eliminating unnecessary operations and memory accesses. This KWS IC, featuring a bio-inspired delta-gated recurrent neural network (ΔRNN) classifier, achieves an 11-class Google Speech Command Dataset (GSCD) KWS accuracy of 90.5% and energy consumption of 36nJ/decision. At 87% temporal sparsity, computing latency and energy per inference are reduced by 2.4$\times$/3.4$\times$, respectively. The 65nm design occupies 0.78mm$^2$ and features two additional blocks, a compact 0.084mm$^2$ digital infinite-impulse-response (IIR)-based band-pass filter (BPF) audio feature extractor (FEx) and a 24kB 0.6V near-Vth weight SRAM with 6.6$\times$ lower read power compared to the standard SRAM. △ Less

Submitted 6 May, 2024; originally announced May 2024.

arXiv:2405.03083 [pdf, other]

Causal K-Means Clustering

Authors: Kwangho Kim, Jisu Kim, Edward H. Kennedy

Abstract: Causal effects are often characterized with population summaries. These might provide an incomplete picture when there are heterogeneous treatment effects across subgroups. Since the subgroup structure is typically unknown, it is more challenging to identify and evaluate subgroup effects than population effects. We propose a new solution to this problem: Causal k-Means Clustering, which harnesses… ▽ More Causal effects are often characterized with population summaries. These might provide an incomplete picture when there are heterogeneous treatment effects across subgroups. Since the subgroup structure is typically unknown, it is more challenging to identify and evaluate subgroup effects than population effects. We propose a new solution to this problem: Causal k-Means Clustering, which harnesses the widely-used k-means clustering algorithm to uncover the unknown subgroup structure. Our problem differs significantly from the conventional clustering setup since the variables to be clustered are unknown counterfactual functions. We present a plug-in estimator which is simple and readily implementable using off-the-shelf algorithms, and study its rate of convergence. We also develop a new bias-corrected estimator based on nonparametric efficiency theory and double machine learning, and show that this estimator achieves fast root-n rates and asymptotic normality in large nonparametric models. Our proposed methods are especially useful for modern outcome-wide studies with multiple treatment levels. Further, our framework is extensible to clustering with generic pseudo-outcomes, such as partially observed outcomes or otherwise unknown functions. Finally, we explore finite sample properties via simulation, and illustrate the proposed methods in a study of treatment programs for adolescent substance abuse. △ Less

Submitted 29 June, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

arXiv:2405.02367 [pdf, other]

Enhancing Social Media Post Popularity Prediction with Visual Content

Authors: Dahyun Jeong, Hyelim Son, Yun** Choi, Keunwoo Kim

Abstract: Our study presents a framework for predicting image-based social media content popularity that focuses on addressing complex image information and a hierarchical data structure. We utilize the Google Cloud Vision API to effectively extract key image and color information from users' postings, achieving 6.8% higher accuracy compared to using non-image covariates alone. For prediction, we explore a… ▽ More Our study presents a framework for predicting image-based social media content popularity that focuses on addressing complex image information and a hierarchical data structure. We utilize the Google Cloud Vision API to effectively extract key image and color information from users' postings, achieving 6.8% higher accuracy compared to using non-image covariates alone. For prediction, we explore a wide range of prediction models, including Linear Mixed Model, Support Vector Regression, Multi-layer Perceptron, Random Forest, and XGBoost, with linear regression as the benchmark. Our comparative study demonstrates that models that are capable of capturing the underlying nonlinear interactions between covariates outperform other methods. △ Less

Submitted 8 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

Report number: Report-no: JKSS-D-23-00299R1

arXiv:2405.00873 [pdf, other]

Implementing a synthetic magnetic vector potential in a 2D superconducting qubit array

Authors: Ilan T. Rosen, Sarah Muschinske, Cora N. Barrett, Arkya Chatterjee, Max Hays, Michael DeMarco, Amir Karamlou, David Rower, Rabindra Das, David K. Kim, Bethany M. Niedzielski, Meghan Schuldt, Kyle Serniak, Mollie E. Schwartz, Jonilyn L. Yoder, Jeffrey A. Grover, William D. Oliver

Abstract: Superconducting quantum processors are a compelling platform for analog quantum simulation due to the precision control, fast operation, and site-resolved readout inherent to the hardware. Arrays of coupled superconducting qubits natively emulate the dynamics of interacting particles according to the Bose-Hubbard model. However, many interesting condensed-matter phenomena emerge only in the presen… ▽ More Superconducting quantum processors are a compelling platform for analog quantum simulation due to the precision control, fast operation, and site-resolved readout inherent to the hardware. Arrays of coupled superconducting qubits natively emulate the dynamics of interacting particles according to the Bose-Hubbard model. However, many interesting condensed-matter phenomena emerge only in the presence of electromagnetic fields. Here, we emulate the dynamics of charged particles in an electromagnetic field using a superconducting quantum simulator. We realize a broadly adjustable synthetic magnetic vector potential by applying continuous modulation tones to all qubits. We verify that the synthetic vector potential obeys requisite properties of electromagnetism: a spatially-varying vector potential breaks time-reversal symmetry and generates a gauge-invariant synthetic magnetic field, and a temporally-varying vector potential produces a synthetic electric field. We demonstrate that the Hall effect--the transverse deflection of a charged particle propagating in an electromagnetic field--exists in the presence of the synthetic electromagnetic field. △ Less

Submitted 1 May, 2024; originally announced May 2024.

Comments: 9 pages, 5 figures, and Supplementary Information

arXiv:2405.00493 [pdf, other]

A study of Galactic Plane Planck Galactic Cold Clumps observed by SCOPE and the JCMT Plane Survey

Authors: D. J. Eden, Tie Liu, T. J. T. Moore, J. Di Francesco, G. Fuller, Kee-Tae Kim, Di Li, S. -Y. Liu, R. Plume, Ken'ichi Tatematsu, M. A. Thompson, Y. Wu, L. Bronfman, H. M. Butner, M. J. Currie, G. Garay, P. F. Goldsmith, N. Hirano, D. Johnstone, M. Juvela, S. -P. Lai, C. W. Lee, E. E. Mannfors, F. Olguin, K. Pattle , et al. (10 additional authors not shown)

Abstract: We have investigated the physical properties of Planck Galactic Cold Clumps (PGCCs) located in the Galactic Plane, using the JCMT Plane Survey (JPS) and the SCUBA-2 Continuum Observations of Pre-protostellar Evolution (SCOPE) survey. By utilising a suite of molecular-line surveys, velocities and distances were assigned to the compact sources within the PGCCs, placing them in a Galactic context. Th… ▽ More We have investigated the physical properties of Planck Galactic Cold Clumps (PGCCs) located in the Galactic Plane, using the JCMT Plane Survey (JPS) and the SCUBA-2 Continuum Observations of Pre-protostellar Evolution (SCOPE) survey. By utilising a suite of molecular-line surveys, velocities and distances were assigned to the compact sources within the PGCCs, placing them in a Galactic context. The properties of these compact sources show no large-scale variations with Galactic environment. Investigating the star-forming content of the sample, we find that the luminosity-to-mass ratio (L/M) is an order of magnitude lower than in other Galactic studies, indicating that these objects are hosting lower levels of star formation. Finally, by comparing ATLASGAL sources that are associated or are not associated with PGCCs, we find that those associated with PGCCs are typically colder, denser, and have a lower L/M ratio, hinting that PGCCs are a distinct population of Galactic Plane sources. △ Less

Submitted 1 May, 2024; originally announced May 2024.

Comments: 18 pages, 14 figures, 7 tables. Accepted for publication in MNRAS

arXiv:2404.19535 [pdf, other]

Ferroelectrically-enhanced Schottky barrier transistors for Logic-in-Memory applications

Authors: Daniele Nazzari, Lukas Wind, Masiar Sistani, Dominik Mayr, Kihye Kim, Walter M. Weber

Abstract: Artificial neural networks (ANNs) have had an enormous impact on a multitude of sectors, from research to industry, generating an unprecedented demand for tailor-suited hardware platforms. Their training and execution is highly memory-intensive, clearly evidencing the limitations affecting the currently available hardware based on the von Neumann architecture, which requires frequent data shuttlin… ▽ More Artificial neural networks (ANNs) have had an enormous impact on a multitude of sectors, from research to industry, generating an unprecedented demand for tailor-suited hardware platforms. Their training and execution is highly memory-intensive, clearly evidencing the limitations affecting the currently available hardware based on the von Neumann architecture, which requires frequent data shuttling due to the physical separation of logic and memory units. This does not only limit the achievable performances but also greatly increases the energy consumption, hindering the integration of ANNs into low-power platforms. New Logic in Memory (LiM) architectures, able to unify memory and logic functionalities into a single component, are highly promising for overcoming these limitations, by drastically reducing the need of data transfers. Recently, it has been shown that a very flexible platform for logic applications can be realized recurring to a multi-gated Schottky-Barrier Field Effect Transistor (SBFET). If equipped with memory capabilities, this architecture could represent an ideal building block for versatile LiM hardware. To reach this goal, here we investigate the integration of a ferroelectric Hf$_{0.5}$Zr$_{0.5}$O$_2$ (HZO) layer onto Dual Top Gated SBFETs. We demonstrate that HZO polarization charges can be successfully employed to tune the height of the two Schottky barriers, influencing the injection behavior, thus defining the transistor mode, switching it between n and p-type transport. The modulation strength is strongly dependent on the polarization pulse height, allowing for the selection of multiple current levels. All these achievable states can be well retained over time, thanks to the HZO stability. The presented result show how ferroelectric-enhanced SBFETs are promising for the realization of novel LiM hardware, enabling low-power circuits for ANNs execution. △ Less

Submitted 30 April, 2024; originally announced April 2024.

arXiv:2404.18306 [pdf, other]

Tunable Ultrafast Dynamics of Antiferromagnetic Vortices in Nanoscale Dots

Authors: Ji Zou, Even Thingstad, Se Kwon Kim, Jelena Klinovaja, Daniel Loss

Abstract: Topological vortex textures in magnetic disks have garnered great attention due to their interesting physics and diverse applications. However, up to now, the vortex state has mainly been studied in microsize ferromagnetic disks, which have oscillation frequencies confined to the GHz range. Here, we propose an experimentally feasible ultrasmall and ultrafast vortex state in an antiferromagnetic na… ▽ More Topological vortex textures in magnetic disks have garnered great attention due to their interesting physics and diverse applications. However, up to now, the vortex state has mainly been studied in microsize ferromagnetic disks, which have oscillation frequencies confined to the GHz range. Here, we propose an experimentally feasible ultrasmall and ultrafast vortex state in an antiferromagnetic nanodot surrounded by a heavy metal, which is further harnessed to construct a highly tunable vortex network. We theoretically demonstrate that, interestingly, the interfacial Dzyaloshinskii-Moriya interaction (iDMI) induced by the heavy metal at the boundary of the dot acts as an effective chemical potential for the vortices in the interior. Mimicking the creation of a superfluid vortex by rotation, we show that a magnetic vortex state can be stabilized by this iDMI. Subjecting the system to an electric current can trigger vortex oscillations via spin-transfer torque, which reside in the THz regime and can be further modulated by external magnetic fields. Furthermore, we show that coherent coupling between vortices in different nanodisks can be achieved via an antiferromagnetic link. Remarkably, this interaction depends on the vortex polarity and topological charge and is also exceptionally tunable through the vortex resonance frequency. This opens up the possibility for controllable interconnected networks of antiferromagnetic vortices. Our proposal therefore introduces a new avenue for develo** high-density memory, ultrafast logic devices, and THz signal generators, which are ideal for compact integration into microchips. △ Less

Submitted 28 April, 2024; originally announced April 2024.

Comments: 11 pages including supplemental material; 4 figures

arXiv:2404.12817 [pdf, other]

Determination of the CKM angle $φ_{3}$ from a combination of Belle and Belle II results

Authors: Belle, Belle II Collaborations, :, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, S. Al Said, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien , et al. (377 additional authors not shown)

Abstract: We report a determination of the CKM angle $φ_{3}$, also known as $γ$, from a combination of measurements using samples of up to 711~fb$^{-1}$ from the Belle experiment and up to 362~fb$^{-1}$ from the Belle II experiment. We combine results from analyses of $B^+\to DK^+, B^+\to Dπ^+$, and $B^+ \to D^{*}K^+$ decays, where $D$ is an admixture of $D^0$ and $\overline{D}{}^{0}$ mesons, in a likelihoo… ▽ More We report a determination of the CKM angle $φ_{3}$, also known as $γ$, from a combination of measurements using samples of up to 711~fb$^{-1}$ from the Belle experiment and up to 362~fb$^{-1}$ from the Belle II experiment. We combine results from analyses of $B^+\to DK^+, B^+\to Dπ^+$, and $B^+ \to D^{*}K^+$ decays, where $D$ is an admixture of $D^0$ and $\overline{D}{}^{0}$ mesons, in a likelihood fit to obtain $φ_{3} = (78.6^{+7.2}_{-7.3})^{\circ}$. We also briefly discuss the interpretation of this result. △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: 31 pages, 4 figures

Report number: Belle II Preprint 2023-015, KEK Preprint 2023-31

arXiv:2404.10874 [pdf, other]

doi 10.1103/PhysRevD.109.L111103

Measurement of the branching fraction of the decay $B^- \to D^0 ρ(770)^-$ at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker, J. V. Bennett , et al. (367 additional authors not shown)

Abstract: We measure the branching fraction of the decay $B^- \to D^0 ρ(770)^-$ using data collected with the Belle II detector. The data contain 387 million $B\overline{B}$ pairs produced in $e^+e^-$ collisions at the $Υ(4S)$ resonance. We reconstruct $8360\pm 180$ decays from an analysis of the distributions of the $B^-$ energy and the $ρ(770)^-$ helicity angle. We determine the branching fraction to be… ▽ More We measure the branching fraction of the decay $B^- \to D^0 ρ(770)^-$ using data collected with the Belle II detector. The data contain 387 million $B\overline{B}$ pairs produced in $e^+e^-$ collisions at the $Υ(4S)$ resonance. We reconstruct $8360\pm 180$ decays from an analysis of the distributions of the $B^-$ energy and the $ρ(770)^-$ helicity angle. We determine the branching fraction to be $(0.939 \pm 0.021\mathrm{(stat)} \pm 0.050\mathrm{(syst)})\%$, in agreement with previous results. Our measurement improves the relative precision of the world average by more than a factor of two. △ Less

Submitted 27 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

Report number: Belle II Preprint 2024-011, KEK Preprint 2024-4

Journal ref: PRD 109, 111103 (2024)

arXiv:2404.10294 [pdf, other]

Topological Fukaya category of tagged arcs

Authors: Cheol-Hyun Cho, Kyoungmo Kim

Abstract: A tagged arc on a surface is introduced by Fomin, Shapiro, and Thurston to study cluster theory on marked surfaces. Given a tagged arc system on a graded marked surface, we define its $\mathbb{Z}$-graded $\mathcal{A}_\infty$-category, generalizing the construction of Haiden, Katzarkov, and Kontsevich for arc systems. When a tagged arc system arises from a non-trivial involution on a marked surface… ▽ More A tagged arc on a surface is introduced by Fomin, Shapiro, and Thurston to study cluster theory on marked surfaces. Given a tagged arc system on a graded marked surface, we define its $\mathbb{Z}$-graded $\mathcal{A}_\infty$-category, generalizing the construction of Haiden, Katzarkov, and Kontsevich for arc systems. When a tagged arc system arises from a non-trivial involution on a marked surface, we show that this $\mathcal{A}_\infty$-category is quasi-isomorphic to the invariant part of the topological Fukaya category under the involution. In particular, this identifies tagged arcs with non-geometric idempotents of Fukaya category. △ Less

Submitted 16 April, 2024; originally announced April 2024.

Comments: 57 pages, 18 figures

MSC Class: 53D37; 16E35

arXiv:2404.09603 [pdf, ps, other]

Construction of smooth chiral finite-time blow-up solutions to Calogero--Moser derivative nonlinear Schrödinger equation

Authors: Kihyun Kim, Taegyu Kim, Soonsik Kwon

Abstract: We consider the Calogero--Moser derivative nonlinear Schrödinger equation (CM-DNLS), which is an $L^{2}$-critical nonlinear Schrödinger equation with explicit solitons, self-duality, and pseudo-conformal symmetry. More importantly, this equation is known to be completely integrable in the Hardy space $L_{+}^{2}$ and the solutions in this class are referred to as \emph{chiral} solutions. A rigorous… ▽ More We consider the Calogero--Moser derivative nonlinear Schrödinger equation (CM-DNLS), which is an $L^{2}$-critical nonlinear Schrödinger equation with explicit solitons, self-duality, and pseudo-conformal symmetry. More importantly, this equation is known to be completely integrable in the Hardy space $L_{+}^{2}$ and the solutions in this class are referred to as \emph{chiral} solutions. A rigorous PDE analysis of this equation with complete integrability was recently initiated by Gérard and Lenzmann. Our main result constructs smooth, chiral, and finite energy finite-time blow-up solutions with mass arbitrarily close to that of soliton, answering the global regularity question for chiral solutions raised by Gérard and Lenzmann. The blow-up rate obtained for these solutions is different from the pseudo-conformal rate. Our proof also gives a construction of a codimension one set of smooth finite energy initial data (but without addressing chirality) leading to the same blow-up dynamics. Our blow-up construction in the Hardy space might also be contrasted with the global well-posedness of the derivative nonlinear Schrödinger equation (DNLS), which is another integrable $L^{2}$-critical Schrödinger equation. The overall scheme of our proof is the forward construction of blow-up dynamics with modulation analysis and is not reliant on complete integrability. We begin with develo** a linear theory for the near soliton dynamics. We discover a nontrivial conjugation identity, which unveils a surprising connection from the linearized (CM-DNLS) to the 1D free Schrödinger equation, which is a crucial ingredient for overcoming the difficulties from the non-local nonlinearity. Another principal challenge in this work, the slow decay of soliton, is overcome by introducing a trick of decomposing solutions depending on topologies, which we believe is of independent interest. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: 99 pages

MSC Class: 35B44 (primary); 35Q55; 37K10; 37K40

arXiv:2404.09482 [pdf, other]

Binary microlensing by high eccentric stellar-mass black hole binaries

Authors: Kyungmin Kim, Yeong-Bok Bae, Yoon-Hyun Ryu

Abstract: Microlensing is one of the most promising tools for discovering stellar-mass black holes (BHs) in the Milky Way because it allows us to probe dark or faint celestial compact objects. While the existence of stellar-mass BHs has been confirmed through observation of X-ray binaries within our galaxy and gravitational waves from extragalactic BH binaries, a conclusive observation of microlensing event… ▽ More Microlensing is one of the most promising tools for discovering stellar-mass black holes (BHs) in the Milky Way because it allows us to probe dark or faint celestial compact objects. While the existence of stellar-mass BHs has been confirmed through observation of X-ray binaries within our galaxy and gravitational waves from extragalactic BH binaries, a conclusive observation of microlensing events caused by Galactic BH binaries has yet to be achieved. In this study, we focus on those with high eccentricity, including unbound orbits, which can dynamically form in star clusters and could potentially increase the observation rate. We demonstrate parameter estimation for simulated light curves supposing various orbital configurations of BH binary lenses. We employ a model-based fitting using the Nelder-Mead method and Bayesian inference based on the Markov chain Monte Carlo method for the demonstration. The results show that we can retrieve true values of the parameters of high eccentric BH binary lenses within the 1$σ$ uncertainty of inferred values. We conclude it is feasible to find high eccentric Galactic BH binaries from the observation of binary microlensing events. △ Less

Submitted 15 April, 2024; originally announced April 2024.

Comments: 12 pages, 9 figures, 4 tables

arXiv:2404.09122 [pdf, other]

Monotonicity of renormalization group flow, Perelman's entropy functional, and emergent dual holography in the worldsheet nonlinear $σ$ model

Authors: Ki-Seok Kim, Arpita Mitra, Debangshu Mukherjee, Shinsei Ryu

Abstract: Based on the renormalization group (RG) flow of worldsheet bosonic string theory, we construct an effective holographic dual description, where an extra dimension is identified with an RG scale. As a result, we obtain a dilaton-gravity effective theory for the dynamics of an emergent target spacetime, analogous to the low-energy description of bosonic M theory. We argue that this holographic dual… ▽ More Based on the renormalization group (RG) flow of worldsheet bosonic string theory, we construct an effective holographic dual description, where an extra dimension is identified with an RG scale. As a result, we obtain a dilaton-gravity effective theory for the dynamics of an emergent target spacetime, analogous to the low-energy description of bosonic M theory. We argue that this holographic dual effective field theory is non-perturbative in nature for the $α'$ expansion, where the RG flow of the target spacetime manifests in the level of an effective bulk action. Based on the holographic dual effective field theory, we investigate the monotonicity of the RG flow. Inspired by the monotonicity of the Ricci flow given by Perelman, we propose a holographic construction of the Perelman's entropy functional. Based on the equivalence between the Hamilton-Jacobi equation and the local RG equation, we show that the RG flow of holographic Perelman's entropy functional is nothing but the Weyl anomaly. This leads us to the monotonicity of the RG flow of the emergent target spacetime. Furthermore, considering the entropy production along the RG flow, we construct a microscopic entropy functional based on the probability distribution function of the holographic dual effective field theory, regarded as Gibbs or Shannon entropy. We find that the monotonicity of this microscopically constructed entropy functional shows a strong connection with the monotonicity of the holographic Perelman's entropy functional. △ Less

Submitted 19 April, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

arXiv:2404.08884 [pdf, other]

The Sunburst Arc with JWST: Detection of Wolf-Rayet stars injecting nitrogen into a low-metallicity, $z=2.37$ proto-globular cluster leaking ionizing photons

Authors: T. Emil Rivera-Thorsen, J. Chisholm, B. Welch, J. R. Rigby, T. Hutchison, M. Florian, K. Sharon, S. Choe, H. Dahle, M. B. Bayliss, G. Khullar, M. Gladders, M. Hayes, A. Adamo, M. R. Owens, K. Kim

Abstract: We report the detection of a population of Wolf-Rayet (WR) stars in the Sunburst Arc, a strongly gravitationally lensed galaxy at redshift $z=2.37$. As the brightest known lensed galaxy, the Sunburst Arc has become an important cosmic laboratory for studying star and cluster formation, Lyman $α$ radiative transfer, and Lyman Continuum (LyC) escape. Here, we present the first results of JWST/NIRCam… ▽ More We report the detection of a population of Wolf-Rayet (WR) stars in the Sunburst Arc, a strongly gravitationally lensed galaxy at redshift $z=2.37$. As the brightest known lensed galaxy, the Sunburst Arc has become an important cosmic laboratory for studying star and cluster formation, Lyman $α$ radiative transfer, and Lyman Continuum (LyC) escape. Here, we present the first results of JWST/NIRCam imaging and NIRSpec IFU observations of the Sunburst Arc, focusing on a stacked spectrum of the 12-fold imaged LyC-emitting (Sunburst LCE) cluster. In agreement with previous studies, we find that the cluster is massive and compact, with $M_{\text{dyn}} = (9\pm1) \times 10^{6} M_{\odot}$, Our age estimate of 4.2--4.5 Myr is much larger than the crossing time of $t_{\text{cross}} = 183 \pm 9 $ kyr, indicating that the cluster is dynamically evolved and consistent with being gravitationally bound. We find a significant nitrogen enhancement of the low ionization state ISM, with $\log(N/O) = -0.74 \pm 0.09$, which is $\approx 0.8$ dex above typical values for H II regions of similar metallicity in the local Universe. We find broad stellar emission complexes around He II$λ4686$ and C IV$λ5808$ with associated nitrogen emission -- this is the first time WR signatures have been directly observed at redshifts above $\sim 0.5$. The strength of the WR signatures cannot be reproduced by stellar population models that only include single-star evolution. While models with binary evolution better match the WR features, they still struggle to reproduce the nitrogen-enhanced WR features. JWST reveals the Sunburst LCE to be a highly ionized, proto-globular cluster with low oxygen abundance and extreme nitrogen enhancement that hosts a population of Wolf-Rayet stars, and possibly Very Massive stars (VMSs), which are rapidly enriching the surrounding medium. △ Less

Submitted 12 April, 2024; originally announced April 2024.

Comments: 8 pages, 4 figures, 3 tables. Submitted to A&A

arXiv:2404.08672 [pdf, other]

Taxonomy and Analysis of Sensitive User Queries in Generative AI Search

Authors: Hwiyeol Jo, Taiwoo Park, Nayoung Choi, Changbong Kim, Ohjoon Kwon, Donghyeon Jeon, Hyunwoo Lee, Eui-Hyeon Lee, Kyoungho Shin, Sun Suk Lim, Kyungmi Kim, Jihye Lee, Sun Kim

Abstract: Although there has been a growing interest among industries to integrate generative LLMs into their services, limited experiences and scarcity of resources acts as a barrier in launching and servicing large-scale LLM-based conversational services. In this paper, we share our experiences in develo** and operating generative AI models within a national-scale search engine, with a specific focus on… ▽ More Although there has been a growing interest among industries to integrate generative LLMs into their services, limited experiences and scarcity of resources acts as a barrier in launching and servicing large-scale LLM-based conversational services. In this paper, we share our experiences in develo** and operating generative AI models within a national-scale search engine, with a specific focus on the sensitiveness of user queries. We propose a taxonomy for sensitive search queries, outline our approaches, and present a comprehensive analysis report on sensitive queries from actual users. △ Less

Submitted 5 April, 2024; originally announced April 2024.

arXiv:2404.08133 [pdf, other]

Search for rare $b \to d\ell^+\ell^-$ transitions at Belle

Authors: Belle, Belle II Collaborations, :, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, S. Al Said, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Beaubien, F. Becherer, J. Becker , et al. (371 additional authors not shown)

Abstract: We present the results of a search for the $b \to d\ell^+\ell^-$ flavor-changing neutral-current rare decays $B^{+, 0} \to (η, ω, π^{+,0}, ρ^{+, 0}) e^+e^-$ and $B^{+, 0} \to (η, ω, π^{0}, ρ^{+}) μ^+μ^-$ using a $711$ fb$^{-1}$ data sample that contains $772 \times 10^{6}$ $B\overline{B}$ events. The data were collected at the $Υ(4S)$ resonance with the Belle detector at the KEKB asymmetric-energy… ▽ More We present the results of a search for the $b \to d\ell^+\ell^-$ flavor-changing neutral-current rare decays $B^{+, 0} \to (η, ω, π^{+,0}, ρ^{+, 0}) e^+e^-$ and $B^{+, 0} \to (η, ω, π^{0}, ρ^{+}) μ^+μ^-$ using a $711$ fb$^{-1}$ data sample that contains $772 \times 10^{6}$ $B\overline{B}$ events. The data were collected at the $Υ(4S)$ resonance with the Belle detector at the KEKB asymmetric-energy $e^+e^-$ collider. We find no evidence for signal and set upper limits on branching fractions at the $90\%$ confidence level in the range $(3.8 - 47) \times 10^{-8}$ depending on the decay channel. The obtained limits are the world's best results. This is the first search for the channels $B^{+, 0} \to (ω, ρ^{+,0}) e^+e^-$ and $B^{+, 0} \to (ω, ρ^{+})μ^+μ^-$. △ Less

Submitted 11 April, 2024; originally announced April 2024.

Comments: 7 pages, 12 figures

Report number: Belle II Preprint 2024-005, KEK Preprint 2023-52

arXiv:2404.07947 [pdf, other]

ExeGPT: Constraint-Aware Resource Scheduling for LLM Inference

Authors: Hyungjun Oh, Kihong Kim, Jaemin Kim, Sungkyun Kim, Junyeol Lee, Du-seong Chang, Jiwon Seo

Abstract: This paper presents ExeGPT, a distributed system designed for constraint-aware LLM inference. ExeGPT finds and runs with an optimal execution schedule to maximize inference throughput while satisfying a given latency constraint. By leveraging the distribution of input and output sequences, it effectively allocates resources and determines optimal execution configurations, including batch sizes and… ▽ More This paper presents ExeGPT, a distributed system designed for constraint-aware LLM inference. ExeGPT finds and runs with an optimal execution schedule to maximize inference throughput while satisfying a given latency constraint. By leveraging the distribution of input and output sequences, it effectively allocates resources and determines optimal execution configurations, including batch sizes and partial tensor parallelism. We also introduce two scheduling strategies based on Round-Robin Allocation and Workload-Aware Allocation policies, suitable for different NLP workloads. We evaluate ExeGPT on six LLM instances of T5, OPT, and GPT-3 and five NLP tasks, each with four distinct latency constraints. Compared to FasterTransformer, ExeGPT achieves up to 15.2x improvements in throughput and 6x improvements in latency. Overall, ExeGPT achieves an average throughput gain of 2.9x across twenty evaluation scenarios. Moreover, when adapting to changing sequence distributions, the cost of adjusting the schedule in ExeGPT is reasonably modest. ExeGPT proves to be an effective solution for optimizing and executing LLM inference for diverse NLP workload and serving conditions. △ Less

Submitted 15 March, 2024; originally announced April 2024.

Comments: Accepted to ASPLOS 2024 (summer cycle)

Journal ref: 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems(ASPLOS 24 summer cycle), Volume 2, Nov 15, 2023 (Notification Date)

arXiv:2404.07021 [pdf, other]

A 4x32Gb/s 1.8pJ/bit Collaborative Baud-Rate CDR with Background Eye-Climbing Algorithm and Low-Power Global Clock Distribution

Authors: Jihee Kim, Jia Park, Jiwon Shin, Hanseok Kim, Kahyun Kim, Haengbeom Shin, Ha-Jung Park, Woo-Seok Choi

Abstract: This paper presents design techniques for an energy-efficient multi-lane receiver (RX) with baud-rate clock and data recovery (CDR), which is essential for high-throughput low-latency communication in high-performance computing systems. The proposed low-power global clock distribution not only significantly reduces power consumption across multi-lane RXs but is capable of compensating for the freq… ▽ More This paper presents design techniques for an energy-efficient multi-lane receiver (RX) with baud-rate clock and data recovery (CDR), which is essential for high-throughput low-latency communication in high-performance computing systems. The proposed low-power global clock distribution not only significantly reduces power consumption across multi-lane RXs but is capable of compensating for the frequency offset without any phase interpolators. To this end, a fractional divider controlled by CDR is placed close to the global phase locked loop. Moreover, in order to address the sub-optimal lock point of conventional baud-rate phase detectors, the proposed CDR employs a background eye-climbing algorithm, which optimizes the sampling phase and maximizes the vertical eye margin (VEM). Fabricated in a 28nm CMOS process, the proposed 4x32Gb/s RX shows a low integrated fractional spur of -40.4dBc at a 2500ppm frequency offset. Furthermore, it improves bit-error-rate performance by increasing the VEM by 17%. The entire RX achieves the energy efficiency of 1.8pJ/bit with the aggregate data rate of 128Gb/s. △ Less

Submitted 22 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

arXiv:2404.06731 [pdf]

Accuracy of a Large Language Model in Distinguishing Anti- And Pro-vaccination Messages on Social Media: The Case of Human Papillomavirus Vaccination

Authors: Soojong Kim, Kwanho Kim, Claire Wonjeong Jo

Abstract: Objective. Vaccination has engendered a spectrum of public opinions, with social media acting as a crucial platform for health-related discussions. The emergence of artificial intelligence technologies, such as large language models (LLMs), offers a novel opportunity to efficiently investigate public discourses. This research assesses the accuracy of ChatGPT, a widely used and freely available ser… ▽ More Objective. Vaccination has engendered a spectrum of public opinions, with social media acting as a crucial platform for health-related discussions. The emergence of artificial intelligence technologies, such as large language models (LLMs), offers a novel opportunity to efficiently investigate public discourses. This research assesses the accuracy of ChatGPT, a widely used and freely available service built upon an LLM, for sentiment analysis to discern different stances toward Human Papillomavirus (HPV) vaccination. Methods. Messages related to HPV vaccination were collected from social media supporting different message formats: Facebook (long format) and Twitter (short format). A selection of 1,000 human-evaluated messages was input into the LLM, which generated multiple response instances containing its classification results. Accuracy was measured for each message as the level of concurrence between human and machine decisions, ranging between 0 and 1. Results. Average accuracy was notably high when 20 response instances were used to determine the machine decision of each message: .882 (SE = .021) and .750 (SE = .029) for anti- and pro-vaccination long-form; .773 (SE = .027) and .723 (SE = .029) for anti- and pro-vaccination short-form, respectively. Using only three or even one instance did not lead to a severe decrease in accuracy. However, for long-form messages, the language model exhibited significantly lower accuracy in categorizing pro-vaccination messages than anti-vaccination ones. Conclusions. ChatGPT shows potential in analyzing public opinions on HPV vaccination using social media content. However, understanding the characteristics and limitations of a language model within specific public health contexts remains imperative. △ Less

Submitted 10 April, 2024; originally announced April 2024.

Comments: Forthcoming in Preventive Medicine Reports

arXiv:2404.06324 [pdf, other]

Dynamic D2D-Assisted Federated Learning over O-RAN: Performance Analysis, MAC Scheduler, and Asymmetric User Selection

Authors: Payam Abdisarabshali, Kwang Taik Kim, Michael Langberg, Weifeng Su, Seyyedali Hosseinalipour

Abstract: Existing studies on federated learning (FL) are mostly focused on system orchestration for static snapshots of the network and making static control decisions (e.g., spectrum allocation). However, real-world wireless networks are susceptible to temporal variations of wireless channel capacity and users' datasets. In this paper, we incorporate multi-granular system dynamics (MSDs) into FL, includin… ▽ More Existing studies on federated learning (FL) are mostly focused on system orchestration for static snapshots of the network and making static control decisions (e.g., spectrum allocation). However, real-world wireless networks are susceptible to temporal variations of wireless channel capacity and users' datasets. In this paper, we incorporate multi-granular system dynamics (MSDs) into FL, including (M1) dynamic wireless channel capacity, captured by a set of discrete-time events, called $\mathscr{D}$-Events, and (M2) dynamic datasets of users. The latter is characterized by (M2-a) modeling the dynamics of user's dataset size via an ordinary differential equation and (M2-b) introducing dynamic model drift}, formulated via a partial differential inequality} drawing concrete analytical connections between the dynamics of users' datasets and FL accuracy. We then conduct FL orchestration under MSDs by introducing dynamic cooperative FL with dedicated MAC schedulers (DCLM), exploiting the unique features of open radio access network (O-RAN). DCLM proposes (i) a hierarchical device-to-device (D2D)-assisted model training, (ii) dynamic control decisions through dedicated O-RAN MAC schedulers, and (iii) asymmetric user selection. We provide extensive theoretical analysis to study the convergence of DCLM. We then optimize the degrees of freedom (e.g., user selection and spectrum allocation) in DCLM through a highly non-convex optimization problem. We develop a systematic approach to obtain the solution for this problem, opening the door to solving a broad variety of network-aware FL optimization problems. We show the efficiency of DCLM via numerical simulations and provide a series of future directions. △ Less

Submitted 9 April, 2024; originally announced April 2024.

Comments: 120 pages, 13 figures

arXiv:2404.05916 [pdf, other]

Prompt-driven Universal Model for View-Agnostic Echocardiography Analysis

Authors: Sekeun Kim, Hui Ren, Peng Guo, Abder-Rahman Ali, Patrick Zhang, Kyungsang Kim, Xiang Li, Quanzheng Li

Abstract: Echocardiography segmentation for cardiac analysis is time-consuming and resource-intensive due to the variability in image quality and the necessity to process scans from various standard views. While current automated segmentation methods in echocardiography show promising performance, they are trained on specific scan views to analyze corresponding data. However, this solution has a limitation… ▽ More Echocardiography segmentation for cardiac analysis is time-consuming and resource-intensive due to the variability in image quality and the necessity to process scans from various standard views. While current automated segmentation methods in echocardiography show promising performance, they are trained on specific scan views to analyze corresponding data. However, this solution has a limitation as the number of required models increases with the number of standard views. To address this, in this paper, we present a prompt-driven universal method for view-agnostic echocardiography analysis. Considering the domain shift between standard views, we first introduce a method called prompt matching, aimed at learning prompts specific to different views by matching prompts and querying input embeddings using a pre-trained vision model. Then, we utilized a pre-trained medical language model to align textual information with pixel data for accurate segmentation. Extensive experiments on three standard views showed that our approach significantly outperforms the state-of-the-art universal methods and achieves comparable or even better performances over the segmentation model trained and tested on same views. △ Less

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.04915 [pdf, other]

Measurement of the $e^+e^- \to π^+π^-π^0$ cross section in the energy range 0.62-3.50 GeV at Belle II

Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker, J. V. Bennett , et al. (338 additional authors not shown)

Abstract: We report a measurement of the $e^+e^- \to π^+π^-π^0$ cross section in the energy range from 0.62 to 3.50 GeV using an initial-state radiation technique. We use an $e^+e^-$ data sample corresponding to 191 $\text{fb}^{-1}$ of integrated luminosity, collected at a center-of-mass energy at or near the $Υ{(4S)}$ resonance with the Belle II detector at the SuperKEKB collider. Signal yields are extract… ▽ More We report a measurement of the $e^+e^- \to π^+π^-π^0$ cross section in the energy range from 0.62 to 3.50 GeV using an initial-state radiation technique. We use an $e^+e^-$ data sample corresponding to 191 $\text{fb}^{-1}$ of integrated luminosity, collected at a center-of-mass energy at or near the $Υ{(4S)}$ resonance with the Belle II detector at the SuperKEKB collider. Signal yields are extracted by fitting the two-photon mass distribution in $e^+e^- \to π^+π^-π^0γ$ events, which involve a $π^0 \to γγ$ decay and an energetic photon radiated from the initial state. Signal efficiency corrections with an accuracy of 1.6% are obtained from several control data samples. The uncertainty on the cross section at the $ω$ and $φ$ resonances is dominated by the systematic uncertainty of 2.2%. The resulting cross sections in the 0.62-1.80 GeV energy range yield $ a_μ^{3π} = [48.91 \pm 0.23~(\mathrm{stat}) \pm 1.07~(\mathrm{syst})] \times 10^{-10} $ for the leading-order hadronic vacuum polarization contribution to the muon anomalous magnetic moment. This result differs by $2.5$ standard deviations from the most precise current determination. △ Less

Submitted 7 April, 2024; originally announced April 2024.

Comments: 23 pages, 24 figures, submitted to PRD

Report number: KEK Preprint 2023-51, Belle II Preprint 2024-004

arXiv:2404.04247 [pdf, ps, other]

On classification of global dynamics for energy-critical equivariant harmonic map heat flows and radial nonlinear heat equation

Authors: Kihyun Kim, Frank Merle

Abstract: We consider the global dynamics of finite energy solutions to energy-critical equivariant harmonic map heat flow (HMHF) and radial nonlinear heat equation (NLH). It is known that any finite energy equivariant solutions to (HMHF) decompose into finitely many harmonic maps (bubbles) separated by scales and a body map, as approaching to the maximal time of existence. Our main result for (HMHF) gives… ▽ More We consider the global dynamics of finite energy solutions to energy-critical equivariant harmonic map heat flow (HMHF) and radial nonlinear heat equation (NLH). It is known that any finite energy equivariant solutions to (HMHF) decompose into finitely many harmonic maps (bubbles) separated by scales and a body map, as approaching to the maximal time of existence. Our main result for (HMHF) gives a complete classification of their dynamics for equivariance indices $D\geq3$; (i) they exist globally in time, (ii) the number of bubbles and signs are determined by the energy class of the initial data, and (iii) the scales of bubbles are asymptotically given by a universal sequence of rates up to scaling symmetry. In parallel, we also obtain a complete classification of $\dot{H}^{1}$-bounded radial solutions to (NLH) in dimensions $N\geq7$, building upon soliton resolution for such solutions. To our knowledge, this provides the first rigorous classification of bubble tree dynamics within symmetry. We introduce a new approach based on the energy method that does not rely on maximum principle. The key ingredient of the proof is a monotonicity estimate near any bubble tree configurations, which in turn requires a delicate construction of modified multi-bubble profiles also. △ Less

Submitted 5 April, 2024; originally announced April 2024.

Comments: 44 pages

MSC Class: 35K58 (primary); 35B40; 37K40; 58E20

arXiv:2404.04096 [pdf, other]

Machine Learning-Aided Cooperative Localization under Dense Urban Environment

Authors: Hoon Lee, Hong Ki Kim, Seung Hyun Oh, Sang Hyun Lee

Abstract: Future wireless network technology provides automobiles with the connectivity feature to consolidate the concept of vehicular networks that collaborate on conducting cooperative driving tasks. The full potential of connected vehicles, which promises road safety and quality driving experience, can be leveraged if machine learning models guarantee the robustness in performing core functions includin… ▽ More Future wireless network technology provides automobiles with the connectivity feature to consolidate the concept of vehicular networks that collaborate on conducting cooperative driving tasks. The full potential of connected vehicles, which promises road safety and quality driving experience, can be leveraged if machine learning models guarantee the robustness in performing core functions including localization and controls. Location awareness, in particular, lends itself to the deployment of location-specific services and the improvement of the operation performance. The localization entails direct communication to the network infrastructure, and the resulting centralized positioning solutions readily become intractable as the network scales up. As an alternative to the centralized solutions, this article addresses decentralized principle of vehicular localization reinforced by machine learning techniques in dense urban environments with frequent inaccessibility to reliable measurement. As such, the collaboration of multiple vehicles enhances the positioning performance of machine learning approaches. A virtual testbed is developed to validate this machine learning model for real-map vehicular networks. Numerical results demonstrate universal feasibility of cooperative localization, in particular, for dense urban area configurations. △ Less

Submitted 5 April, 2024; originally announced April 2024.

arXiv:2404.03691 [pdf, other]

Upgrade of NaI(Tl) crystal encapsulation for the NEON experiment

Authors: J. J. Choi, E. J. Jeon, J. Y. Kim, K. W. Kim, S. H. Kim, S. K. Kim, Y. D. Kim, Y. J. Ko, B. C. Koh, C. Ha, B. J. Park, S. H. Lee, I. S. Lee, H. Lee, H. S. Lee, J. Lee, Y. M. Oh

Abstract: The Neutrino Elastic-scattering Observation with NaI(Tl) experiment (NEON) aims to detect coherent elastic neutrino-nucleus scattering~(\cenns) in a NaI(Tl) crystal using reactor anti-electron neutrinos at the Hanbit nuclear power plant complex. A total of 13.3 kg of NaI(Tl) crystals were initially installed in December 2020 at the tendon gallery, 23.7$\pm$0.3\,m away from the reactor core, which… ▽ More The Neutrino Elastic-scattering Observation with NaI(Tl) experiment (NEON) aims to detect coherent elastic neutrino-nucleus scattering~(\cenns) in a NaI(Tl) crystal using reactor anti-electron neutrinos at the Hanbit nuclear power plant complex. A total of 13.3 kg of NaI(Tl) crystals were initially installed in December 2020 at the tendon gallery, 23.7$\pm$0.3\,m away from the reactor core, which operates at a thermal power of 2.8\,GW. Initial engineering operation was performed from May 2021 to March 2022 and observed unexpected photomultiplier-induced noise and a decreased light yield that were caused by leakage of liquid scintillator into the detector due to weakness of detector encapsulation. We upgraded the detector encapsulation design to prevent the leakage of the liquid scintillator. Meanwhile two small-sized detectors were replaced with larger ones resulting in a total mass of 16.7\,kg. With this new design implementation, the detector system has been operating stably since April 2022 for over a year without detector gain drop. In this paper, we present an improved crystal encapsulation design and stability of the NEON experiment. △ Less

Submitted 28 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

arXiv:2404.01954 [pdf, other]

HyperCLOVA X Technical Report

Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in develo** their sovereign LLMs. △ Less

Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

Comments: 44 pages; updated authors list and fixed author names

arXiv:2404.01863 [pdf, other]

Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models

Authors: Kyuyoung Kim, Jongheon Jeong, Minyong An, Mohammad Ghavamzadeh, Krishnamurthy Dvijotham, **woo Shin, Kimin Lee

Abstract: Fine-tuning text-to-image models with reward functions trained on human feedback data has proven effective for aligning model behavior with human intent. However, excessive optimization with such reward models, which serve as mere proxy objectives, can compromise the performance of fine-tuned models, a phenomenon known as reward overoptimization. To investigate this issue in depth, we introduce th… ▽ More Fine-tuning text-to-image models with reward functions trained on human feedback data has proven effective for aligning model behavior with human intent. However, excessive optimization with such reward models, which serve as mere proxy objectives, can compromise the performance of fine-tuned models, a phenomenon known as reward overoptimization. To investigate this issue in depth, we introduce the Text-Image Alignment Assessment (TIA2) benchmark, which comprises a diverse collection of text prompts, images, and human annotations. Our evaluation of several state-of-the-art reward models on this benchmark reveals their frequent misalignment with human assessment. We empirically demonstrate that overoptimization occurs notably when a poorly aligned reward model is used as the fine-tuning objective. To address this, we propose TextNorm, a simple method that enhances alignment based on a measure of reward model confidence estimated across a set of semantically contrastive text prompts. We demonstrate that incorporating the confidence-calibrated rewards in fine-tuning effectively reduces overoptimization, resulting in twice as many wins in human evaluation for text-image alignment compared against the baseline reward models. △ Less

Submitted 2 April, 2024; originally announced April 2024.

Comments: ICLR 2024

arXiv:2404.01562 [pdf]

Efficient, indistinguishable telecom C-band photons using a tapered nanobeam

Authors: Mohammad Habibur Rahaman, Samuel Harper, Chang-Min Lee, Kyu-Young Kim, Mustafa Atabey Buyukkaya, Victor J. Patel, Samuel D. Hawkins, Je-Hyung Kim, Sadhvikas Addamane, Edo Waks

Abstract: Telecom C-band single photons exhibit the lowest attenuation in optical fibers, enabling long-haul quantum-secured communication. However, efficient coupling with optical fibers is crucial for these single photons to be effective carriers in long-distance transmission. In this work, we demonstrate an efficient fiber-coupled single photon source at the telecom C-band using InAs/InP quantum dots cou… ▽ More Telecom C-band single photons exhibit the lowest attenuation in optical fibers, enabling long-haul quantum-secured communication. However, efficient coupling with optical fibers is crucial for these single photons to be effective carriers in long-distance transmission. In this work, we demonstrate an efficient fiber-coupled single photon source at the telecom C-band using InAs/InP quantum dots coupled to a tapered nanobeam. The tapered nanobeam structure facilitates directional emission that is mode-matched to a lensed fiber, resulting in a collection efficiency of up to 65% from the nanobeam to a single-mode fiber. Using this approach, we demonstrate single photon count rates of 575 $\pm$ 5 Kcps and a single photon purity of $g^2$ (0) = 0.015 $\pm$ 0.003. Additionally, we demonstrate Hong-Ou Mandel interference from the emitted photons with a visibility of 0.84 $\pm$ 0.06. From these measurements, we determine a photon coherence time of 450 $\pm$ 20 ps, a factor of just 8.3 away from the lifetime limit. This work represents an important step towards the development of telecom C-band single-photon sources emitting bright, pure, and indistinguishable photons, which are necessary to realize fiber-based long-distance quantum networks △ Less

Submitted 5 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

arXiv:2404.01517 [pdf, other]

Addressing Heterogeneity in Federated Load Forecasting with Personalization Layers

Authors: Shourya Bose, Yu Zhang, Kibaek Kim

Abstract: The advent of smart meters has enabled pervasive collection of energy consumption data for training short-term load forecasting models. In response to privacy concerns, federated learning (FL) has been proposed as a privacy-preserving approach for training, but the quality of trained models degrades as client data becomes heterogeneous. In this paper we propose the use of personalization layers fo… ▽ More The advent of smart meters has enabled pervasive collection of energy consumption data for training short-term load forecasting models. In response to privacy concerns, federated learning (FL) has been proposed as a privacy-preserving approach for training, but the quality of trained models degrades as client data becomes heterogeneous. In this paper we propose the use of personalization layers for load forecasting in a general framework called PL-FL. We show that PL-FL outperforms FL and purely local training, while requiring lower communication bandwidth than FL. This is done through extensive simulations on three different datasets from the NREL ComStock repository. △ Less

Submitted 1 April, 2024; originally announced April 2024.

arXiv:2404.01464 [pdf, other]

Data-Efficient Unsupervised Interpolation Without Any Intermediate Frame for 4D Medical Images

Authors: JungEun Kim, Hangyul Yoon, Geondo Park, Kyungsu Kim, Eunho Yang

Abstract: 4D medical images, which represent 3D images with temporal information, are crucial in clinical practice for capturing dynamic changes and monitoring long-term disease progression. However, acquiring 4D medical images poses challenges due to factors such as radiation exposure and imaging duration, necessitating a balance between achieving high temporal resolution and minimizing adverse effects. Gi… ▽ More 4D medical images, which represent 3D images with temporal information, are crucial in clinical practice for capturing dynamic changes and monitoring long-term disease progression. However, acquiring 4D medical images poses challenges due to factors such as radiation exposure and imaging duration, necessitating a balance between achieving high temporal resolution and minimizing adverse effects. Given these circumstances, not only is data acquisition challenging, but increasing the frame rate for each dataset also proves difficult. To address this challenge, this paper proposes a simple yet effective Unsupervised Volumetric Interpolation framework, UVI-Net. This framework facilitates temporal interpolation without the need for any intermediate frames, distinguishing it from the majority of other existing unsupervised methods. Experiments on benchmark datasets demonstrate significant improvements across diverse evaluation metrics compared to unsupervised and supervised baselines. Remarkably, our approach achieves this superior performance even when trained with a dataset as small as one, highlighting its exceptional robustness and efficiency in scenarios with sparse supervision. This positions UVI-Net as a compelling alternative for 4D medical imaging, particularly in settings where data availability is limited. The source code is available at https://github.com/jungeun122333/UVI-Net. △ Less

Submitted 1 April, 2024; originally announced April 2024.

Comments: CVPR 2024

arXiv:2404.01140 [pdf, other]

KoCoNovel: Annotated Dataset of Character Coreference in Korean Novels

Authors: Kyuhee Kim, Surin Lee, Sangah Lee

Abstract: In this paper, we present KoCoNovel, a novel character coreference dataset derived from Korean literary texts, complete with detailed annotation guidelines. Comprising 178K tokens from 50 modern and contemporary novels, KoCoNovel stands as one of the largest public coreference resolution corpora in Korean, and the first to be based on literary texts. KoCoNovel offers four distinct versions to acco… ▽ More In this paper, we present KoCoNovel, a novel character coreference dataset derived from Korean literary texts, complete with detailed annotation guidelines. Comprising 178K tokens from 50 modern and contemporary novels, KoCoNovel stands as one of the largest public coreference resolution corpora in Korean, and the first to be based on literary texts. KoCoNovel offers four distinct versions to accommodate a wide range of literary coreference analysis needs. These versions are designed to support perspectives of the omniscient author or readers, and to manage multiple entities as either separate or overlap**, thereby broadening its applicability. One of KoCoNovel's distinctive features is that 24% of all character mentions are single common nouns, lacking possessive markers or articles. This feature is particularly influenced by the nuances of Korean address term culture, which favors the use of terms denoting social relationships and kinship over personal names. In experiments with a BERT-based coreference model, we observe notable performance enhancements with KoCoNovel in character coreference tasks within literary texts, compared to a larger non-literary coreference dataset. Such findings underscore KoCoNovel's potential to significantly enhance coreference resolution models through the integration of Korean cultural and linguistic dynamics. △ Less

Submitted 11 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

Comments: 12 pages

arXiv:2404.01104 [pdf, other]

SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity

Authors: Jaemin Kim, Yohan Na, Kangmin Kim, Sang Rak Lee, Dong-Kyu Chae

Abstract: Recently, sentiment-aware pre-trained language models (PLMs) demonstrate impressive results in downstream sentiment analysis tasks. However, they neglect to evaluate the quality of their constructed sentiment representations; they just focus on improving the fine-tuning performance, which overshadows the representation quality. We argue that without guaranteeing the representation quality, their d… ▽ More Recently, sentiment-aware pre-trained language models (PLMs) demonstrate impressive results in downstream sentiment analysis tasks. However, they neglect to evaluate the quality of their constructed sentiment representations; they just focus on improving the fine-tuning performance, which overshadows the representation quality. We argue that without guaranteeing the representation quality, their downstream performance can be highly dependent on the supervision of the fine-tuning data rather than representation quality. This problem would make them difficult to foray into other sentiment-related domains, especially where labeled data is scarce. We first propose Sentiment-guided Textual Similarity (SgTS), a novel metric for evaluating the quality of sentiment representations, which is designed based on the degree of equivalence in sentiment polarity between two sentences. We then propose SentiCSE, a novel Sentiment-aware Contrastive Sentence Embedding framework for constructing sentiment representations via combined word-level and sentence-level objectives, whose quality is guaranteed by SgTS. Qualitative and quantitative comparison with the previous sentiment-aware PLMs shows the superiority of our work. Our code is available at: https://github.com/nayohan/SentiCSE △ Less

Submitted 1 April, 2024; originally announced April 2024.

Comments: 14 pages, 8 figures

MSC Class: 68T50 ACM Class: I.2.7

Journal ref: LREC-COLING2024

arXiv:2404.01076 [pdf, other]

Debiased calibration estimation using generalized entropy in survey sampling

Authors: Yonghyun Kwon, Jae Kwang Kim, Yumou Qiu

Abstract: Incorporating the auxiliary information into the survey estimation is a fundamental problem in survey sampling. Calibration weighting is a popular tool for incorporating the auxiliary information. The calibration weighting method of Deville and Sarndal (1992) uses a distance measure between the design weights and the final weights to solve the optimization problem with calibration constraints. Thi… ▽ More Incorporating the auxiliary information into the survey estimation is a fundamental problem in survey sampling. Calibration weighting is a popular tool for incorporating the auxiliary information. The calibration weighting method of Deville and Sarndal (1992) uses a distance measure between the design weights and the final weights to solve the optimization problem with calibration constraints. This paper introduces a novel framework that leverages generalized entropy as the objective function for optimization, where design weights play a role in the constraints to ensure design consistency, rather than being part of the objective function. This innovative calibration framework is particularly attractive due to its generality and its ability to generate more efficient calibration weights compared to traditional methods based on Deville and Sarndal (1992). Furthermore, we identify the optimal choice of the generalized entropy function that achieves the minimum variance across various choices of the generalized entropy function under the same constraints. Asymptotic properties, such as design consistency and asymptotic normality, are presented rigorously. The results from a limited simulation study are also presented. We demonstrate a real-life application using agricultural survey data collected from Kynetec, Inc. △ Less

Submitted 2 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

arXiv:2404.00974 [pdf, other]

Improving Visual Recognition with Hyperbolical Visual Hierarchy Map**

Authors: Hyeongjun Kwon, **hyun Jang, ** Kim, Kwonyoung Kim, Kwanghoon Sohn

Abstract: Visual scenes are naturally organized in a hierarchy, where a coarse semantic is recursively comprised of several fine details. Exploring such a visual hierarchy is crucial to recognize the complex relations of visual elements, leading to a comprehensive scene understanding. In this paper, we propose a Visual Hierarchy Mapper (Hi-Mapper), a novel approach for enhancing the structured understanding… ▽ More Visual scenes are naturally organized in a hierarchy, where a coarse semantic is recursively comprised of several fine details. Exploring such a visual hierarchy is crucial to recognize the complex relations of visual elements, leading to a comprehensive scene understanding. In this paper, we propose a Visual Hierarchy Mapper (Hi-Mapper), a novel approach for enhancing the structured understanding of the pre-trained Deep Neural Networks (DNNs). Hi-Mapper investigates the hierarchical organization of the visual scene by 1) pre-defining a hierarchy tree through the encapsulation of probability densities; and 2) learning the hierarchical relations in hyperbolic space with a novel hierarchical contrastive loss. The pre-defined hierarchy tree recursively interacts with the visual features of the pre-trained DNNs through hierarchy decomposition and encoding procedures, thereby effectively identifying the visual hierarchy and enhancing the recognition of an entire scene. Extensive experiments demonstrate that Hi-Mapper significantly enhances the representation capability of DNNs, leading to an improved performance on various tasks, including image classification and dense prediction tasks. △ Less

Submitted 1 April, 2024; originally announced April 2024.

Comments: This paper is accepted to CVPR 2024. The supplementary material is included. The code is available at \url{https://github.com/kwonjunn01/Hi-Mapper}

arXiv:2404.00384 [pdf, other]

TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias

Authors: Sanghyun Jo, Soohyun Ryu, Sungyub Kim, Eunho Yang, Kyungsu Kim

Abstract: We identify a critical bias in contemporary CLIP-based models, which we denote as single tag bias. This bias manifests as a disproportionate focus on a singular tag (word) while neglecting other pertinent tags, stemming from CLIP's text embeddings that prioritize one specific tag in image-text relationships. When deconstructing text into individual tags, only one tag tends to have high relevancy w… ▽ More We identify a critical bias in contemporary CLIP-based models, which we denote as single tag bias. This bias manifests as a disproportionate focus on a singular tag (word) while neglecting other pertinent tags, stemming from CLIP's text embeddings that prioritize one specific tag in image-text relationships. When deconstructing text into individual tags, only one tag tends to have high relevancy with CLIP's image embedding, leading to biased tag relevancy. In this paper, we introduce a novel two-step fine-tuning approach, Text-Tag Self-Distillation (TTD), to address this challenge. TTD first extracts image-relevant tags from text based on their similarity to the nearest pixels then employs a self-distillation strategy to align combined masks with the text-derived mask. This approach ensures the unbiased image-text alignment of the CLIP-based models using only image-text pairs without necessitating additional supervision. Our technique demonstrates model-agnostic improvements in multi-tag classification and segmentation tasks, surpassing competing methods that rely on external resources. The code is available at https://github.com/shjo-april/TTD. △ Less

Submitted 20 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

arXiv:2404.00380 [pdf, other]

DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation

Authors: Sanghyun Jo, Fei Pan, In-Jae Yu, Kyungsu Kim

Abstract: Weakly-supervised semantic segmentation (WSS) ensures high-quality segmentation with limited data and excels when employed as input seed masks for large-scale vision models such as Segment Anything. However, WSS faces challenges related to minor classes since those are overlooked in images with adjacent multiple classes, a limitation originating from the overfitting of traditional expansion method… ▽ More Weakly-supervised semantic segmentation (WSS) ensures high-quality segmentation with limited data and excels when employed as input seed masks for large-scale vision models such as Segment Anything. However, WSS faces challenges related to minor classes since those are overlooked in images with adjacent multiple classes, a limitation originating from the overfitting of traditional expansion methods like Random Walk. We first address this by employing unsupervised and weakly-supervised feature maps instead of conventional methodologies, allowing for hierarchical mask enhancement. This method distinctly categorizes higher-level classes and subsequently separates their associated lower-level classes, ensuring all classes are correctly restored in the mask without losing minor ones. Our approach, validated through extensive experimentation, significantly improves WSS across five benchmarks (VOC: 79.8\%, COCO: 53.9\%, Context: 49.0\%, ADE: 32.9\%, Stuff: 37.4\%), reducing the gap with fully supervised methods by over 84\% on the VOC validation set. Code is available at https://github.com/shjo-april/DHR. △ Less

Submitted 19 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

arXiv:2404.00201 [pdf, other]

Angular analysis of $B \to K^* e^+ e^-$ in the low-$q^2$ region with new electron identification at Belle

Authors: Belle Collaboration, D. Ferlewicz, P. Urquijo, I. Adachi, K. Adamczyk, H. Aihara, D. M. Asner, H. Atmacan, R. Ayad, V. Babu, Sw. Banerjee, P. Behera, K. Belous, J. Bennett, M. Bessner, V. Bhardwaj, B. Bhuyan, T. Bilka, D. Biswas, D. Bodrov, M. Bračko, P. Branchini, T. E. Browder, A. Budano, M. Campajola , et al. (145 additional authors not shown)

Abstract: We perform an angular analysis of the $B\to K^* e^+ e^-$ decay for the dielectron mass squared, $q^2$, range of $0.0008$ to $1.1200 ~\text{GeV}^2 /c^4$ using the full Belle data set in the $K^{*0} \to K^+ π^-$ and $K^{*+} \to K_S^0 π^+$ channels, incorporating new methods of electron identification to improve the statistical power of the data set. This analysis is sensitive to contributions from r… ▽ More We perform an angular analysis of the $B\to K^* e^+ e^-$ decay for the dielectron mass squared, $q^2$, range of $0.0008$ to $1.1200 ~\text{GeV}^2 /c^4$ using the full Belle data set in the $K^{*0} \to K^+ π^-$ and $K^{*+} \to K_S^0 π^+$ channels, incorporating new methods of electron identification to improve the statistical power of the data set. This analysis is sensitive to contributions from right-handed currents from physics beyond the Standard Model by constraining the Wilson coefficients $\mathcal{C}_7^{(\prime)}$. We perform a fit to the $B\to K^* e^+ e^-$ differential decay rate and measure the imaginary component of the transversality amplitude to be $A_T^{\rm Im} = -1.27 \pm 0.52 \pm 0.12$, and the $K^*$ transverse asymmetry to be $A_T^{(2)} = 0.52 \pm 0.53 \pm 0.11$. The resulting constraints on the value of $\mathcal{C}_7^{\prime}$ are consistent with the Standard Model within a $2σ$ confidence interval. △ Less

Submitted 2 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

Comments: Submitted to PRD

Report number: Belle preprint 2023-20, KEK preprint 2023-38

arXiv:2404.00033 [pdf, other]

The Hall of Singularity: VR Experience of Prophecy by AI

Authors: Jisu Kim, Kirak Kim

Abstract: "The Hall of Singularity" is an immersive art that creates personalized experiences of receiving prophecies from an AI deity through an integration of Artificial Intelligence (AI) and Virtual Reality (VR). As a metaphor for the mythologizing of AI in our society, "The Hall of Singularity" offers an immersive quasi-religious experience where individuals can encounter an AI that has the power to mak… ▽ More "The Hall of Singularity" is an immersive art that creates personalized experiences of receiving prophecies from an AI deity through an integration of Artificial Intelligence (AI) and Virtual Reality (VR). As a metaphor for the mythologizing of AI in our society, "The Hall of Singularity" offers an immersive quasi-religious experience where individuals can encounter an AI that has the power to make prophecies. This journey enables users to experience and imagine a world with an omnipotent AI deity. △ Less

Submitted 22 March, 2024; originally announced April 2024.

Comments: 3 pages, 4 figures

arXiv:2403.19144 [pdf, other]

MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation

Authors: Seyeon Kim, Siyoon **, Jihye Park, Kihong Kim, Jiyoung Kim, Jisu Nam, Seungryong Kim

Abstract: Conventional GAN-based models for talking head generation often suffer from limited quality and unstable training. Recent approaches based on diffusion models aimed to address these limitations and improve fidelity. However, they still face challenges, including extensive sampling times and difficulties in maintaining temporal consistency due to the high stochasticity of diffusion models. To overc… ▽ More Conventional GAN-based models for talking head generation often suffer from limited quality and unstable training. Recent approaches based on diffusion models aimed to address these limitations and improve fidelity. However, they still face challenges, including extensive sampling times and difficulties in maintaining temporal consistency due to the high stochasticity of diffusion models. To overcome these challenges, we propose a novel motion-disentangled diffusion model for high-quality talking head generation, dubbed MoDiTalker. We introduce the two modules: audio-to-motion (AToM), designed to generate a synchronized lip motion from audio, and motion-to-video (MToV), designed to produce high-quality head video following the generated motion. AToM excels in capturing subtle lip movements by leveraging an audio attention mechanism. In addition, MToV enhances temporal consistency by leveraging an efficient tri-plane representation. Our experiments conducted on standard benchmarks demonstrate that our model achieves superior performance compared to existing models. We also provide comprehensive ablation studies and user study results. △ Less

Submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.18411 [pdf, ps, other]

Role of hidden-color components in the tetraquark mixing model

Authors: Hungchong Kim, K. S. Kim

Abstract: Multiquarks can have two-hadron components and hidden-color components in their wave functions. The presence of two-hadron components in multiquarks introduces a potential source of confusion, particularly with respect to their resemblance to hadronic molecules. On the other hand, hidden-color components are essential for distinguishing between multiquarks and hadronic molecules. In this work, we… ▽ More Multiquarks can have two-hadron components and hidden-color components in their wave functions. The presence of two-hadron components in multiquarks introduces a potential source of confusion, particularly with respect to their resemblance to hadronic molecules. On the other hand, hidden-color components are essential for distinguishing between multiquarks and hadronic molecules. In this work, we study the hidden-color components in the wave functions of the tetraquark mixing model, a model that has been proposed as a suitable framework for describing the properties of two nonets in the $J^P=0^+$ channel: the light nonet [$a_0 (980)$, $K_0^* (700)$, $f_0 (500)$, $f_0 (980)$] and the heavy nonet [$a_0 (1450)$, $K_0^* (1430)$, $f_0 (1370)$, $f_0 (1500)$]. Our analysis reveals a substantial presence of hidden-color components within the tetraquark wave functions. To elucidate the impact of hidden-color components on physical quantities, we conduct computations of the hyperfine masses, $\langle V_{CS}\rangle$, for the two nonets, considering scenarios involving only the two-meson components and those incorporating the hidden-color components. We demonstrate that the hidden-color components constitute an important part of the hyperfine masses, such that the mass difference formula, $ΔM\approx Δ\langle V_{CS}\rangle$, which has been successful for the two nonets, cannot be achieved without the hidden-color contributions. This can provide another evidence supporting the tetraquark nature of the two nonets. △ Less

Submitted 29 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

Comments: 10 pages, no figure. The version accepted for publication in EPJC

arXiv:2403.18277 [pdf, other]

BlendX: Complex Multi-Intent Detection with Blended Patterns

Authors: Ye** Yoon, Jungyeon Lee, Kangsan Kim, Chanhee Park, Taeuk Kim

Abstract: Task-oriented dialogue (TOD) systems are commonly designed with the presumption that each utterance represents a single intent. However, this assumption may not accurately reflect real-world situations, where users frequently express multiple intents within a single utterance. While there is an emerging interest in multi-intent detection (MID), existing in-domain datasets such as MixATIS and MixSN… ▽ More Task-oriented dialogue (TOD) systems are commonly designed with the presumption that each utterance represents a single intent. However, this assumption may not accurately reflect real-world situations, where users frequently express multiple intents within a single utterance. While there is an emerging interest in multi-intent detection (MID), existing in-domain datasets such as MixATIS and MixSNIPS have limitations in their formulation. To address these issues, we present BlendX, a suite of refined datasets featuring more diverse patterns than their predecessors, elevating both its complexity and diversity. For dataset construction, we utilize both rule-based heuristics as well as a generative tool -- OpenAI's ChatGPT -- which is augmented with a similarity-driven strategy for utterance selection. To ensure the quality of the proposed datasets, we also introduce three novel metrics that assess the statistical properties of an utterance related to word count, conjunction use, and pronoun usage. Extensive experiments on BlendX reveal that state-of-the-art MID models struggle with the challenges posed by the new datasets, highlighting the need to reexamine the current state of the MID field. The dataset is available at https://github.com/HYU-NLP/BlendX. △ Less

Submitted 27 March, 2024; originally announced March 2024.

Comments: Accepted to LREC-COLING2024

arXiv:2403.16784 [pdf, other]

Enhanced extracellular matrix remodeling due to embedded spheroid fluidization

Authors: Tao Zhang, Shabeeb Ameen, Sounok Ghosh, Kyungeun Kim, Minh Thanh, Alison E. Patteson, Mingming Wu, J. M. Schwarz

Abstract: Tumor spheroids are in vitro three-dimensional, cellular collectives consisting of cancerous cells. Embedding these spheroids in an in vitro fibrous environment, such as a collagen network, to mimic the extracellular matrix (ECM) provides an essential platform to quantitatively investigate the biophysical mechanisms leading to tumor invasion of the ECM. To understand the mechanical interplay betwe… ▽ More Tumor spheroids are in vitro three-dimensional, cellular collectives consisting of cancerous cells. Embedding these spheroids in an in vitro fibrous environment, such as a collagen network, to mimic the extracellular matrix (ECM) provides an essential platform to quantitatively investigate the biophysical mechanisms leading to tumor invasion of the ECM. To understand the mechanical interplay between tumor spheroids and the ECM, we computationally construct and study a three-dimensional vertex model for a tumor spheroid that is mechanically coupled to a cross-linked network of fibers. In such a vertex model, cells are represented as deformable polyhedrons that share faces. Some fraction of the boundary faces of the tumor spheroid contain linker springs connecting the center of the boundary face to the nearest node in the fiber network. As these linker springs actively contract, the fiber network remodels. By toggling between fluid-like and solid-like spheroids via changing the dimensionless cell shape index, we find that the spheroid rheology affects the remodeling of the fiber network. More precisely, fluid-like spheroids displace the fiber network more on average near the vicinity of the spheroid than solid-like spheroids. We also find more densification of the fiber network near the spheroid for the fluid-like spheroids. These spheroid rheology-dependent effects are the result of cellular motility due to active cellular rearrangements that emerge over time in the fluid-like spheroids to generate spheroid shape fluctuations. Our results uncover intricate morphological-mechanical interplay between an embedded spheroid and its surrounding fiber network with both spheroid contractile strength and spheroid shape fluctuations playing important roles in the pre-invasion stages of tumor invasion. △ Less

Submitted 25 March, 2024; originally announced March 2024.

Comments: 19 pages, 12 figures, 7 SI figures

arXiv:2403.15040 [pdf, other]

ESG Classification by Implicit Rule Learning via GPT-4

Authors: Hyo Jeong Yun, Chanyoung Kim, Moonjeong Hahm, Kyuri Kim, Gui** Son

Abstract: Environmental, social, and governance (ESG) factors are widely adopted as higher investment return indicators. Accordingly, ongoing efforts are being made to automate ESG evaluation with language models to extract signals from massive web text easily. However, recent approaches suffer from a lack of training data, as rating agencies keep their evaluation metrics confidential. This paper investigat… ▽ More Environmental, social, and governance (ESG) factors are widely adopted as higher investment return indicators. Accordingly, ongoing efforts are being made to automate ESG evaluation with language models to extract signals from massive web text easily. However, recent approaches suffer from a lack of training data, as rating agencies keep their evaluation metrics confidential. This paper investigates whether state-of-the-art language models like GPT-4 can be guided to align with unknown ESG evaluation criteria through strategies such as prompting, chain-of-thought reasoning, and dynamic in-context learning. We demonstrate the efficacy of these approaches by ranking 2nd in the Shared-Task ML-ESG-3 Impact Type track for Korean without updating the model on the provided training data. We also explore how adjusting prompts impacts the ability of language models to address financial tasks leveraging smaller models with openly available weights. We observe longer general pre-training to correlate with enhanced performance in financial downstream tasks. Our findings showcase the potential of language models to navigate complex, subjective evaluation guidelines despite lacking explicit training examples, revealing opportunities for training-free solutions for financial downstream tasks. △ Less

Submitted 22 March, 2024; originally announced March 2024.

Comments: Accepted as Shared Track Paper at 7th FinNLP Workshop @ LREC-COLING 2024

Showing 51–100 of 4,214 results for author: Kim, K