Showing 1–50 of 4,047 results for author: Lee, K

  1. arXiv:2407.03086  [pdf, other]

    cs.LG cs.AI cs.DC

    Effective Heterogeneous Federated Learning via Efficient Hypernetwork-based Weight Generation

    Authors: Yujin Shin, Kichang Lee, Sungmin Lee, You Rim Choi, Hyung-Sin Kim, JeongGil Ko

    Abstract: While federated learning leverages distributed client resources, it faces challenges due to heterogeneous client capabilities. This necessitates allocating models suited to clients' resources and careful parameter aggregation to accommodate this heterogeneity. We propose HypeMeFed, a novel federated learning framework for supporting client heterogeneity by combining a multi-exit network architectu…

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2407.01225  [pdf, other]

    quant-ph physics.optics

    Hong-Ou-Mandel Interference with a Coexisting Clock using Transceivers for Synchronization over Deployed Fiber

    Authors: Anirudh Ramesh, Daniel R. Reilly, Kim Fook Lee, Paul M. Moraw, Joaquin Chung, Md Shariful Islam, Cristián Peña, Xu Han, Rajkumar Kettimuthu, Prem Kumar, Gregory Kanter

    Abstract: Interference between independently generated photons is a key step towards distributing entanglement over long distances, but it requires synchronization between the distantly-located photon sources. Synchronizing the clocks of such photon sources using coexisting two-way classical optical communications over the same fiber that transport the quantum photonic signals is a promising approach for ac…

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2407.01158  [pdf, other]

    cs.CL

    Learning to Explore and Select for Coverage-Conditioned Retrieval-Augmented Generation

    Authors: Takyoung Kim, Kyungjae Lee, Young Rok Jang, Ji Yong Cho, Gangwoo Kim, Minseok Cho, Moontae Lee

    Abstract: Interactions with billion-scale large language models typically yield long-form responses due to their extensive parametric capacities, along with retrieval-augmented features. While detailed responses provide insightful viewpoint of a specific subject, they frequently generate redundant and less engaging content that does not meet user interests. In this work, we focus on the role of query outlin…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Work in progress. Resources are available at https://github.com/youngerous/qtree

  4. arXiv:2407.00977  [pdf, other]

    astro-ph.GA

    Ongoing and fossil large-scale outflows detected in a high-redshift radio galaxy: [C II] observations of TN J0924$-$2201 at $z=5.174$

    Authors: Kianhong Lee, Masayuki Akiyama, Kotaro Kohno, Daisuke Iono, Masatoshi Imanishi, Bunyo Hatsukade, Hideki Umehata, Tohru Nagao, Yoshiki Toba, Xiaoyang Chen, Fumi Egusa, Kohei Ichikawa, Takuma Izumi, Naoki Matsumoto, Malte Schramm, Kenta Matsuoka

    Abstract: We present Atacama Large Millimeter/submillimeter Array observations of the [C II] 158 $μ$m line and the underlying continuum emission of TN J0924$-$2201, which is one of the most distant known radio galaxies at $z>5$. The [C II] line and 1-mm continuum emission are detected at the host galaxy. The systemic redshift derived from the [C II] line is $z_{\rm [C II]}=5.1736\pm0.0002$, indicating that…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 16 pages, 11 figures, accepted for publication in ApJ

  5. arXiv:2407.00878  [pdf, other]

    cs.DC cs.LG

    A Robust Power Model Training Framework for Cloud Native Runtime Energy Metric Exporter

    Authors: Sunyanan Choochotkaew, Chen Wang, Huamin Chen, Tatsuhiro Chiba, Marcelo Amaral, Eun Kyung Lee, Tamar Eilam

    Abstract: Estimating power consumption in modern Cloud environments is essential for carbon quantification toward green computing. Specifically, it is important to properly account for the power consumed by each of the running applications, which are packaged as containers. This paper examines multiple challenges associated with this goal. The first challenge is that multiple customers are sharing the same…

    Submitted 9 April, 2024; originally announced July 2024.

    Comments: This is a full-version (8-page) paper of our previous publication in IEEE MASCOTS 2023, which has been accepted as a 4-page short paper (https://ieeexplore.ieee.org/document/10387542)

  6. arXiv:2406.19292  [pdf, other]

    cs.LG cs.AI cs.CL

    From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data

    Authors: Zheyang Xiong, Vasilis Papageorgiou, Kangwook Lee, Dimitris Papailiopoulos

    Abstract: Recent studies have shown that Large Language Models (LLMs) struggle to accurately retrieve information and maintain reasoning capabilities when processing long-context inputs. To address these limitations, we propose a finetuning approach utilizing a carefully designed synthetic dataset comprising numerical key-value retrieval tasks. Our experiments on models like GPT-3.5 Turbo and Mistral 7B dem…

    Submitted 27 June, 2024; originally announced June 2024.

  7. arXiv:2406.19287  [pdf, other]

    astro-ph.HE

    Isotropy of cosmic rays beyond $10^{20}$ eV favors their heavy mass composition

    Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He, et al. (118 additional authors not shown)

    Abstract: We report an estimation of the injected mass composition of ultra-high energy cosmic rays (UHECRs) at energies higher than 10 EeV. The composition is inferred from an energy-dependent sky distribution of UHECR events observed by the Telescope Array surface detector by comparing it to the Large Scale Structure of the local Universe. In the case of negligible extra-galactic magnetic fields the resul…

    Submitted 3 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 8 pages, 3 figures, accepted for publication in PRL

  8. arXiv:2406.19286  [pdf, other]

    astro-ph.HE

    Mass composition of ultra-high energy cosmic rays from distribution of their arrival directions with the Telescope Array

    Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He, et al. (118 additional authors not shown)

    Abstract: We use a new method to estimate the injected mass composition of ultrahigh cosmic rays (UHECRs) at energies higher than 10 EeV. The method is based on comparison of the energy-dependent distribution of cosmic ray arrival directions as measured by the Telescope Array experiment (TA) with that calculated in a given putative model of UHECR under the assumption that sources trace the large-scale struc…

    Submitted 3 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 18 pages, 11 figures, accepted for publication in PRD

  9. arXiv:2406.18887  [pdf, ps, other]

    math.AP

    The global dynamics for the Maxwell-Dirac system

    Authors: Yonggeun Cho, Kiyeon Lee

    Abstract: In this paper, we study the (1+3) dimensional massive Maxwell-Dirac system in the context of global existence and asymptotic behavior of solutions under the Lorenz gauge condition, as well as the modified and linear scattering phenomena for the Dirac spinor and the electromagnetic potential, respectively. We employ a vector fields energy method combined with a detailed analysis of the space-time r…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 51 pages

    MSC Class: 35Q41; 35Q55; 35Q40

  10. arXiv:2406.17746  [pdf, other]

    cs.CL cs.AI

    Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

    Authors: USVSN Sai Prashanth, Alvin Deng, Kyle O'Brien, Jyothir S V, Mohammad Aflah Khan, Jaydeep Borkar, Christopher A. Choquette-Choo, Jacob Ray Fuehne, Stella Biderman, Tracy Ke, Katherine Lee, Naomi Saphra

    Abstract: Memorization in language models is typically treated as a homogenous phenomenon, neglecting the specifics of the memorized data. We instead model memorization as the effect of a set of complex factors that describe each sample and relate it to the model and corpus. To build intuition around these factors, we break memorization down into a taxonomy: recitation of highly duplicated sequences, recons…

    Submitted 25 June, 2024; originally announced June 2024.

  11. arXiv:2406.17376  [pdf, other]

    cs.SD cs.AI eess.AS

    Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection

    Authors: Duc-Tuan Truong, Ruijie Tao, Tuan Nguyen, Hieu-Thi Luong, Kong Aik Lee, Eng Siong Chng

    Abstract: Recent synthetic speech detectors leveraging the Transformer model have superior performance compared to the convolutional neural network counterparts. This improvement could be due to the powerful modeling ability of the multi-head self-attention (MHSA) in the Transformer model, which learns the temporal relationship of each input token. However, artifacts of synthetic speech can be located in sp…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH 2024

  12. arXiv:2406.17294  [pdf, other]

    cs.CL

    Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

    Authors: Wenhao Shi, Zhiqiang Hu, Yi Bin, Junhua Liu, Yang Yang, See-Kiong Ng, Lidong Bing, Roy Ka-Wei Lee

    Abstract: Large language models (LLMs) have demonstrated impressive reasoning capabilities, particularly in textual mathematical problem-solving. However, existing open-source image instruction fine-tuning datasets, containing limited question-answer pairs per image, do not fully exploit visual information to enhance the multimodal mathematical reasoning capabilities of Multimodal LLMs (MLLMs). To bridge th…

    Submitted 26 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

    Comments: 8 pages

  13. Improving Rehabilitative Assessment with Statistical and Shape Preserving Surrogate Data and Singular Spectrum Analysis

    Authors: T. K. M. Lee, H. W. Chan, K. H. Leo, E. Chew, Ling Zhao, S. Sanei

    Abstract: Time series data are collected in temporal order and are widely used to train systems for prediction, modeling and classification to name a few. These systems require large amounts of data to improve generalization and prevent over-fitting. However there is a comparative lack of time series data due to operational constraints. This situation is alleviated by synthesizing data which have a suitable…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: This version of the paper under the same title, acknowledges the data source and the funding for current research using this data. arXiv admin note: substantial text overlap with arXiv:2404.14211

    Journal ref: 2022 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA), Poznan, Poland, 2022, pp. 58-63

  14. arXiv:2406.16807  [pdf, other]

    cs.LG cs.CL cs.CV

    Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation

    Authors: Katherine M. Collins, Najoung Kim, Yonatan Bitton, Verena Rieser, Shayegan Omidshafiei, Yushi Hu, Sherol Chen, Senjuti Dutta, Minsuk Chang, Kimin Lee, Youwei Liang, Georgina Evans, Sahil Singla, Gang Li, Adrian Weller, Junfeng He, Deepak Ramachandran, Krishnamurthy Dj Dvijotham

    Abstract: Human feedback plays a critical role in learning and refining reward models for text-to-image generation, but the optimal form the feedback should take for learning an accurate reward function has not been conclusively established. This paper investigates the effectiveness of fine-grained feedback which captures nuanced distinctions in image quality and prompt-alignment, compared to traditional co…

    Submitted 24 June, 2024; originally announced June 2024.

  15. arXiv:2406.16620  [pdf, other]

    cs.CV cs.CL

    OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer

    Authors: Lu Zhang, Tiancheng Zhao, Heting Ying, Yibo Ma, Kyusong Lee

    Abstract: Recent advancements in Large Language Models (LLMs) have expanded their capabilities to multimodal contexts, including comprehensive video understanding. However, processing extensive videos such as 24-hour CCTV footage or full-length films presents significant challenges due to the vast data and processing demands. Traditional methods, like extracting key frames or converting frames to text, ofte…

    Submitted 24 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  16. arXiv:2406.14176  [pdf, other]

    cs.SD cs.AI cs.MM eess.AS

    A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake Detection

    Authors: Kyungbok Lee, You Zhang, Zhiyao Duan

    Abstract: This paper addresses the challenge of developing a robust audio-visual deepfake detection model. In practical use cases, new generation algorithms are continually emerging, and these algorithms are not encountered during the development of detection methods. This calls for the generalization ability of the method. Additionally, to ensure the credibility of detection methods, it is beneficial for t…

    Submitted 20 June, 2024; originally announced June 2024.

  17. arXiv:2406.12223  [pdf, other]

    cs.CL cs.CY

    ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations

    Authors: Yunze Xiao, Yujia Hu, Kenny Tsu Wei Choo, Roy Ka-wei Lee

    Abstract: Detecting hate speech and offensive language is essential for maintaining a safe and respectful digital environment. This study examines the limitations of state-of-the-art large language models (LLMs) in identifying offensive content within systematically perturbed data, with a focus on Chinese, a language particularly susceptible to such perturbations. We introduce \textsf{ToxiCloakCN}, an enhan…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 Tables, 2 Figures

  18. arXiv:2406.11427  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

    Authors: Keon Lee, Dong Won Kim, Jaehyeon Kim, Jaewoong Cho

    Abstract: Large-scale diffusion models have shown outstanding generative abilities across multiple modalities including images, videos, and audio. However, text-to-speech (TTS) systems typically involve domain-specific modeling factors (e.g., phonemes and phoneme-level durations) to ensure precise temporal alignments between text and speech, which hinders the efficiency and scalability of diffusion models f…

    Submitted 17 June, 2024; originally announced June 2024.

  19. arXiv:2406.11354  [pdf, other]

    cs.CL cs.AI cs.CV

    Preserving Knowledge in Large Language Model with Model-Agnostic Self-Decompression

    Authors: Zilun Zhang, Yutao Sun, Tiancheng Zhao, Leigang Sha, Ruochen Xu, Kyusong Lee, Jianwei Yin

    Abstract: Humans can retain old knowledge while learning new information, but Large Language Models (LLMs) often suffer from catastrophic forgetting when post-pretrained or supervised fine-tuned (SFT) on domain-specific data. Moreover, for Multimodal Large Language Models (MLLMs) which are composed of the LLM base and visual projector (e.g. LLaVA), a significant decline in performance on language benchmarks…

    Submitted 19 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  20. Expanding the Design Space of Computer Vision-based Interactive Systems for Group Dance Practice

    Authors: Soohwan Lee, Seoyeong Hwang, Ian Oakley, Kyungho Lee

    Abstract: Group dance, a sub-genre characterized by intricate motions made by a cohort of performers in tight synchronization, has a longstanding and culturally significant history and, in modern forms such as cheerleading, a broad base of current adherents. However, despite its popularity, learning group dance routines remains challenging. Based on the prior success of interactive systems to support indivi…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 20 pages, 10 figures, 1 table, to be published in the proceedings of the ACM Designing Interactive Systems Conference, 2024, (DIS '24)

    Journal ref: ACM Designing Interactive Systems Conference, 2024, (DIS '24)

  21. arXiv:2406.11125  [pdf, other]

    cs.HC

    Conversational Agents as Catalysts for Critical Thinking: Challenging Design Fixation in Group Design

    Authors: Soohwan Lee, Seoyeong Hwang, Kyungho Lee

    Abstract: This paper investigates the potential of LLM-based conversational agents (CAs) to enhance critical reflection and mitigate design fixation in group design work. By challenging AI-generated recommendations and prevailing group opinions, these agents address issues such as groupthink and promote a more dynamic and inclusive design process. Key design considerations include optimizing intervention ti…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 7 pages, 2 figures, DIS2024 Workshop on 'Death of Design Researcher'

  22. arXiv:2406.10836  [pdf, other]

    eess.AS cs.SD

    Revisiting and Improving Scoring Fusion for Spoofing-aware Speaker Verification Using Compositional Data Analysis

    Authors: Xin Wang, Tomi Kinnunen, Kong Aik Lee, Paul-Gauthier Noé, Junichi Yamagishi

    Abstract: Fusing outputs from automatic speaker verification (ASV) and spoofing countermeasure (CM) is expected to make an integrated system robust to zero-effort imposters and synthesized spoofing attacks. Many score-level fusion methods have been proposed, but many remain heuristic. This paper revisits score-level fusion using tools from decision theory and presents three main findings. First, fusion by s…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024 Accepted. https://github.com/nii-yamagishilab/SpeechSPC-mini

  23. arXiv:2406.10815  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    On the Effectiveness of Supervision in Asymmetric Non-Contrastive Learning

    Authors: Jeongheon Oh, Kibok Lee

    Abstract: Supervised contrastive representation learning has been shown to be effective in various transfer learning scenarios. However, while asymmetric non-contrastive learning (ANCL) often outperforms its contrastive learning counterpart in self-supervised representation learning, the extension of ANCL to supervised scenarios is less explored. To bridge the gap, we study ANCL for supervised representatio…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  24. arXiv:2406.10207  [pdf, other]

    astro-ph.EP

    Latitudinal Asymmetry in the Dayside Atmosphere of WASP-43b

    Authors: Ryan C. Challener, Zafar Rustamkulov, Elspeth K. H. Lee, Nikole Lewis, David K. Sing, Stephan M. Birkmann, Nicolas Crouzet, Néstor Espinoza, Elena Manjavacas, Natalia Oliveros-Gomez, Jeff A. Valenti, Jingxuan Yang

    Abstract: We present two-dimensional near-infrared temperature maps of the canonical hot Jupiter WASP-43b using a phase-curve observation with JWST NIRSpec/G395H. From the white-light planetary transit, we improve constraints on the planet's orbital parameters and measure a planet-to-star radius ratio of $0.15883^{+0.00056}_{-0.00053}$. Using the white-light phase curve, we measure a longitude of maximum br…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 15 pages, 8 figures. In review at ApJ Letters

  25. arXiv:2406.09698  [pdf, other]

    physics.ins-det hep-ex

    Projected background and sensitivity of AMoRE-II

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva, et al. (81 additional authors not shown)

    Abstract: AMoRE-II aims to search for neutrinoless double beta decay with an array of 423 Li$_2$$^{100}$MoO$_4$ crystals operating in the cryogenic system as the main phase of the Advanced Molybdenum-based Rare process Experiment (AMoRE). AMoRE has been planned to operate in three phases: AMoRE-pilot, AMoRE-I, and AMoRE-II. AMoRE-II is currently being installed at the Yemi Underground Laboratory, located ap…

    Submitted 13 June, 2024; originally announced June 2024.

  26. arXiv:2406.09641  [pdf, other]

    astro-ph.EP

    Phase-resolving the absorption signatures of water and carbon monoxide in the atmosphere of the ultra-hot Jupiter WASP-121b with GEMINI-S/IGRINS

    Authors: Joost P. Wardenier, Vivien Parmentier, Michael R. Line, Megan Weiner Mansfield, Xianyu Tan, Shang-Min Tsai, Jacob L. Bean, Jayne L. Birkby, Matteo Brogi, Jean-Michel Désert, Siddharth Gandhi, Elspeth K. H. Lee, Colette I. Levens, Lorenzo Pino, Peter C. B. Smith

    Abstract: Ultra-hot Jupiters are among the best targets for atmospheric characterization at high spectral resolution. Resolving their transmission spectra as a function of orbital phase offers a unique window into the 3D nature of these objects. In this work, we present three transits of the ultra-hot Jupiter WASP-121b observed with Gemini-S/IGRINS. For the first time, we measure the phase-dependent absorpt…

    Submitted 19 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 24 pages, 16 figures, resubmitted to PASP (made a few minor changes to the text w.r.t. v1)

  27. arXiv:2406.08702  [pdf, other]

    cs.AI cs.CL cs.CV

    VLind-Bench: Measuring Language Priors in Large Vision-Language Models

    Authors: Kang-il Lee, Minbeom Kim, Minsung Kim, Dongryeol Lee, Hyukhun Koh, Kyomin Jung

    Abstract: Large Vision-Language Models (LVLMs) have demonstrated outstanding performance across various multimodal tasks. However, they suffer from a problem known as language prior, where responses are generated based solely on textual patterns while disregarding image information. Addressing the issue of language prior is crucial, as it can lead to undesirable biases or hallucinations when dealing with im…

    Submitted 17 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  28. arXiv:2406.08645  [pdf, other]

    astro-ph.GA astro-ph.CO

    ODIN: Identifying Protoclusters and Cosmic Filaments Traced by Ly$α$-emitting Galaxies

    Authors: Vandana Ramakrishnan, Kyoung-Soo Lee, Maria Celeste Artale, Eric Gawiser, Yujin Yang, Changbom Park, Robin Ciardullo, Lucia Guaita, Sang Hyeok Im, Seongjae Kim, Ankit Kumar, Jaehyun Lee, Seong-Kook Lee, Byeongha Moon, Nelson Padilla, Alexandra Pope, Roxana Popescu, Hyunmi Song, Paulina Troncoso, Francisco Valdes, Ann Zabludoff

    Abstract: To understand the formation and evolution of massive cosmic structures, studying them at high redshift, in the epoch when they formed the majority of their mass is essential. The One-hundred-deg$^2$ DECam Imaging in Narrowbands (ODIN) survey is undertaking the widest-area narrowband program to date, to use Ly$α$-emitting galaxies (LAEs) to trace the large-scale structure (LSS) of the Universe at t…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 26 pages, 18 figures; submitted to ApJ

  29. arXiv:2406.08612  [pdf, other]

    astro-ph.HE

    Observation of Declination Dependence in the Cosmic Ray Energy Spectrum

    Authors: The Telescope Array Collaboration, R. U. Abbasi, T. Abu-Zayyad, M. Allen, J. W. Belz, D. R. Bergman, I. Buckland, W. Campbell, B. G. Cheon, K. Endo, A. Fedynitch, T. Fujii, K. Fujisue, K. Fujita, M. Fukushima, G. Furlich, Z. Gerber, N. Globus, W. Hanlon, N. Hayashida, H. He, K. Hibino, R. Higuchi, D. Ikeda, T. Ishii, et al. (101 additional authors not shown)

    Abstract: We report on an observation of the difference between northern and southern skies of the ultrahigh energy cosmic ray energy spectrum with a significance of ${\sim}8σ$. We use measurements from the two largest experiments$\unicode{x2014}$the Telescope Array observing the northern hemisphere and the Pierre Auger Observatory viewing the southern hemisphere. Since the comparison of two measurements fr…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 8 pages, 6 figures

  30. arXiv:2406.08301  [pdf, other]

    nucl-ex

    Jet modification via $π^0$-hadron correlations in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, S. Afanasiev, C. Aidala, N. N. Ajitanand, Y. Akiba, H. Al-Bataineh, J. Alexander, M. Alfred, K. Aoki, N. Apadula, L. Aphecetche, J. Asai, H. Asano, E. T. Atomssa, R. Averbeck, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, G. Baksay, L. Baksay, A. Baldisseri, et al. (510 additional authors not shown)

    Abstract: High-momentum two-particle correlations are a useful tool for studying jet-quenching effects in the quark-gluon plasma. Angular correlations between neutral-pion triggers and charged hadrons with transverse momenta in the range 4--12~GeV/$c$ and 0.5--7~GeV/$c$, respectively, have been measured by the PHENIX experiment in 2014 for Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. Suppression is obs…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 534 authors from 83 institutions, 12 pages, 7 figures. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  31. arXiv:2406.08200  [pdf, other]

    cs.SD cs.AI eess.AS

    Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding

    Authors: Rui Wang, Liping Chen, Kong AiK Lee, Zhen-Hua Ling

    Abstract: Voice anonymization has been developed as a technique for preserving privacy by replacing the speaker's voice in a speech signal with that of a pseudo-speaker, thereby obscuring the original voice attributes from machine recognition and human perception. In this paper, we focus on altering the voice attributes against machine recognition while retaining human perception. We referred to this as the…

    Submitted 13 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  32. arXiv:2406.07909  [pdf, other]

    eess.AS cs.CL cs.SD stat.ML

    Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation

    Authors: Eungbeom Kim, Hantae Kim, Kyogu Lee

    Abstract: Transformer encoder with connectionist temporal classification (CTC) framework is widely used for automatic speech recognition (ASR). However, knowledge distillation (KD) for ASR displays a problem of disagreement between teacher-student models in frame-level alignment which ultimately hinders it from improving the student model's performance. In order to resolve this problem, this paper introduce…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  33. Optimal Qubit Mapping Search for Encoding Classical Data into Matrix Product State Representation with Minimal Loss

    Authors: Hyeongjun Jeon, Kyungmin Lee, Dongkyu Lee, Bongsang Kim, Taehyun Kim

    Abstract: Matrix product state (MPS) offers a framework for encoding classical data into quantum states, enabling the efficient utilization of quantum resources for data representation and processing. This research paper investigates techniques to enhance the efficiency and accuracy of MPS representations specifically designed for encoding classical data. Based on the observations that MPS truncation error…

    Submitted 12 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: 11 pages, 15 figures, The current version represents the initial submission and differs significantly from the final published version. Please check the official publication in Physics Letters A for the most up-to-date and comprehensive content

  34. arXiv:2406.06822  [pdf, other]

    cs.CR cs.AI cs.SE

    An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection

    Authors: Shenao Yan, Shen Wang, Yue Duan, Hanbin Hong, Kiho Lee, Doowon Kim, Yuan Hong

    Abstract: Large Language Models (LLMs) have transformed code completion tasks, providing context-based suggestions to boost developer productivity in software engineering. As users often fine-tune these models for specific applications, poisoning and backdoor attacks can covertly alter the model outputs. To address this critical security challenge, we introduce CodeBreaker, a pioneering LLM-assisted backdoo…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: To appear in USENIX Security '24

  35. arXiv:2406.06717  [pdf, ps, other]

    cs.SI cs.HC

    Analyzing user archetypes in Singapore's Telegram groups on COVID-19 and climate change

    Authors: Val Alvern Cueco Ligo, Lan Tianxiang, Ying Zeng, Lam Yin Cheung, Pi Zonooz, Roy Ka-Wei Lee, Koustuv Saha, Edson C. Tandoc Jr., Navin Kumar

    Abstract: Social media platforms, particularly Telegram, play a pivotal role in shaping public perceptions and opinions on global and national issues. Unlike traditional news media, Telegram allows for the proliferation of user-generated content with minimal oversight, making it a significant venue for the spread of controversial and misinformative content. During the COVID-19 pandemic, Telegram's popularit…

    Submitted 10 June, 2024; originally announced June 2024.

  36. arXiv:2406.06037  [pdf, other]

    cs.LG cs.AI cs.CV

    Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning

    Authors: Donghu Kim, Hojoon Lee, Kyungmin Lee, Dongyoon Hwang, Jaegul Choo

    Abstract: Recently, various pre-training methods have been introduced in vision-based Reinforcement Learning (RL). However, their generalization ability remains unclear due to evaluations being limited to in-distribution environments and non-unified experimental setups. To address this, we introduce the Atari Pre-training Benchmark (Atari-PB), which pre-trains a ResNet-50 model on 10 million transitions fro…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: accepted to ICML 2024

  37. arXiv:2406.05761  [pdf, other]

    cs.CL

    The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

    Authors: Seungone Kim, Juyoung Suk, Ji Yong Cho, Shayne Longpre, Chaeeun Kim, Dongkeun Yoon, Guijin Son, Yejin Cho, Sheikh Shafayat, Jinheon Baek, Sue Hyun Park, Hyeonbin Hwang, Jinkyung Jo, Hyowon Cho, Haebin Shin, Seongyun Lee, Hanseok Oh, Noah Lee, Namgyu Ho, Se June Joo, Miyoung Ko, Yoonjoo Lee, Hyungjoo Chae, Jamin Shin, Joel Jang, et al. (7 additional authors not shown)

    Abstract: As language models (LMs) become capable of handling a wide range of tasks, their evaluation is becoming as challenging as their development. Most generation benchmarks currently assess LMs using abstract evaluation criteria like helpfulness and harmlessness, which often lack the flexibility and granularity of human assessment. Additionally, these benchmarks tend to focus disproportionately on spec…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Work in Progress

  38. arXiv:2406.05479  [pdf]

    cond-mat.soft

    Microscopic Dynamics of Particle Rearrangement and its Correlation with Stick-slip Behavior in Granular Shear

    Authors: Kwangmin Lee, Ryan C. Hurley

    Abstract: The link between the microscopic dynamics of particles and the macroscale response of granular materials has not been well established. To address this, we investigated the microscopic dynamics and fluctuations in the force network in a granular material subjected to plane shear. A two-dimensional discrete element model of a plane shear test was established, considering both sliding and rolling fr…

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 46 pages, 25 figures

  39. arXiv:2406.04412  [pdf, other]

    cs.LG cs.AI cs.CL

    Aligning Large Language Models with Self-generated Preference Data

    Authors: Dongyoung Kim, Kimin Lee, Jinwoo Shin, Jaehyung Kim

    Abstract: Aligning large language models (LLMs) with human preferences becomes a key component to obtaining state-of-the-art performance, but it yields a huge cost to construct a large human-annotated preference dataset. To tackle this problem, we propose a new framework that boosts the alignment of LLMs through Self-generated Preference data (Selfie) using only a very small amount of human-annotated prefer…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 18 pages, under review

  40. arXiv:2406.03951  [pdf, ps, other]

    math.AP math.DS

    L-shadowing lemma for the Cauchy equation

    Authors: K. Lee, C. A. Morales

    Abstract: We prove that if the Cauchy problem $\dot{u}=Au$ in a Banach space is hyperbolic, then the problem has the L-shadowing property. Conversely, if the space is finite-dimensional and the L-shadowing property is satisfied, then the problem is hyperbolic. This generalizes a previous result by Ombach \cite{o, o1} for linear homeomorphisms. Some short applications are given.

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 15 pages

    MSC Class: Primary 54G99; Secondary 37B05

  41. arXiv:2406.03705  [pdf, other]

    cond-mat.mes-hall quant-ph

    Coherent control of a triangular exchange-only spin qubit

    Authors: Edwin Acuna, Joseph D. Broz, Kaushal Shyamsundar, Antonio B. Mei, Colin P. Feeney, Valerie Smetanka, Tiffany Davis, Kangmu Lee, Maxwell D. Choi, Brydon Boyd, June Suh, Wonill D. Ha, Cameron Jennings, Andrew S. Pan, Daniel S. Sanchez, Matthew D. Reed, Jason R. Petta

    Abstract: We demonstrate coherent control of a three-electron exchange-only spin qubit with the quantum dots arranged in a close-packed triangular geometry. The device is tuned to confine one electron in each quantum dot, as evidenced by pairwise charge stability diagrams. Time-domain control of the exchange coupling is demonstrated and qubit performance is characterized using blind randomized benchmarking,…

    Submitted 5 June, 2024; originally announced June 2024.

  42. arXiv:2406.03633  [pdf, other]

    astro-ph.GA astro-ph.CO

    The Massive and Distant Clusters of WISE Survey 2: A Stacking Analysis Investigating the Evolution of Star Formation Rates and Stellar Masses in Groups and Clusters

    Authors: A. Trudeau, Anthony H. Gonzalez, K. Thongkham, Kyoung-Soo Lee, Stacey Alberts, M. Brodwin, Thomas Connor, Peter R. M. Eisenhardt, Emily Moravec, Eshwar Puvvada, S. A. Stanford

    Abstract: The evolution of galaxies depends on their masses and local environments; understanding when and how environmental quenching starts to operate remains a challenge. Furthermore, studies of the high-redshift regime have been limited to massive cluster members, owing to sensitivity limits or small fields of views when the sensitivity is sufficient, intrinsically biasing the picture of cluster evoluti…

    Submitted 25 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

    Comments: 22 pages, 14 figures, accepted for publication in ApJ (corrected Popescu et al., 2023 sSFR on Figure 14)

  43. arXiv:2406.02920  [pdf]

    physics.plasm-ph

    Physical Origin of H-Mode

    Authors: Kwan Chul Lee

    Abstract: The high confinement mode (H-mode), the most important operation mode for the nuclear fusion reactor, has been studied for 42 years, but the transition mechanism has not been unanimously agreed so far. Four decades of H-mode experiments revealed many features of heating power threshold (Pth) for the low to high confinement (L-H) transition, where Pth is proportional to the toroidal magnetic field…

    Submitted 5 June, 2024; originally announced June 2024.

  44. arXiv:2406.02847  [pdf, other]

    cs.LG stat.ML

    Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers

    Authors: Brian K Chen, Tianyang Hu, Hui Jin, Hwee Kuan Lee, Kenji Kawaguchi

    Abstract: In-Context Learning (ICL) has been a powerful emergent property of large language models that has attracted increasing attention in recent years. In contrast to regular gradient-based learning, ICL is highly interpretable and does not require parameter updates. In this paper, we show that, for linearized transformer networks, ICL can be made explicit and permanent through the inclusion of bias ter…

    Submitted 6 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  45. arXiv:2406.02331  [pdf, other]

    cs.CL

    Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering

    Authors: ChaeHun Park, Koanho Lee, Hyesu Lim, Jaeseok Kim, Junmo Park, Yu-Jung Heo, Du-Seong Chang, Jaegul Choo

    Abstract: Building a reliable visual question answering (VQA) system across different languages is a challenging problem, primarily due to the lack of abundant samples for training. To address this challenge, recent studies have employed machine translation systems for the cross-lingual VQA task. This involves translating the evaluation samples into a source language (usually English) and using monolingual…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Findings Accepted

  46. arXiv:2406.02028  [pdf]

    stat.ME

    How should parallel cluster randomized trials with a baseline period be analyzed? A survey of estimands and common estimators

    Authors: Kenneth Menglin Lee, Fan Li

    Abstract: The parallel cluster randomized trial with baseline (PB-CRT) is a common variant of the standard parallel cluster randomized trial (P-CRT) that maintains parallel randomization but additionally allows for both within and between-cluster comparisons. We define two estimands of interest in the context of PB-CRTs, the participant-average treatment effect (pATE) and cluster-average treatment effect (c…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 77 pages, 16 figures

  47. arXiv:2406.01803  [pdf, other]

    astro-ph.CO

    The clustering of Lyman Alpha Emitting galaxies at z=2-3

    Authors: M. White, A. Raichoor, Arjun Dey, Lehman H. Garrison, Eric Gawiser, D. Lang, Kyoung-soo Lee, A. D. Myers, D. Schlegel, F. Valdes, J. Aguilar, S. Ahlen, D. Brooks, E. Chaussidon, T. Claybaugh, K. Dawson, A. de la Macorra, Biprateep Dey, P. Doel, K. Fanning, A. Font-Ribera, J. E. Forero-Romero, S. Gontcho A Gontcho, G. Gutierrez, J. Guy, et al. (30 additional authors not shown)

    Abstract: We measure the clustering of Lyman Alpha Emitting galaxies (LAEs) selected from the One-hundred-square-degree DECam Imaging in Narrowbands (ODIN) survey, with spectroscopic follow-up from Dark Energy Spectroscopic Instrument (DESI). We use DESI spectroscopy to optimize our selection and to constrain the interloper fraction and redshift distribution of our narrow-band selected sources. We select sa…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 26 pages, 12 figures

  48. arXiv:2406.01049  [pdf, other]

    cs.SD

    Searching For Music Mixing Graphs: A Pruning Approach

    Authors: Sungho Lee, Marco A. Martínez-Ramírez, Wei-Hsiang Liao, Stefan Uhlich, Giorgio Fabbro, Kyogu Lee, Yuki Mitsufuji

    Abstract: Music mixing is compositional -- experts combine multiple audio processors to achieve a cohesive mix from dry source tracks. We propose a method to reverse engineer this process from the input and output audio. First, we create a mixing console that applies all available processors to every chain. Then, after the initial console parameter optimization, we alternate between removing redundant proce…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to DAFx 2024

  49. arXiv:2406.00925  [pdf, other]

    hep-th

    Dimers for Type D Relativistic Toda Model

    Authors: Kimyeong Lee, Norton Lee

    Abstract: We construct dimer graphs for type D relativistic Toda models by introducing impurities to the $Y^{2N,0}$ square dimer graphs. By properly placing the impurities and change of canonical variables assigned to the 1-loops on the dimer graph, we introduce the "folding" of the graphs and get the type D relativistic Toda lattice Hamiltonian and monodromy matrix.

    Submitted 23 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: 25+6 pages, 14 figures, add citation

    Report number: KIAS-P24038, CGP24008

  50. arXiv:2406.00636  [pdf, other]

    cs.CV

    T2LM: Long-Term 3D Human Motion Generation from Multiple Sentences

    Authors: Taeryung Lee, Fabien Baradel, Thomas Lucas, Kyoung Mu Lee, Gregory Rogez

    Abstract: In this paper, we address the challenging problem of long-term 3D human motion generation. Specifically, we aim to generate a long sequence of smoothly connected actions from a stream of multiple sentences (i.e., paragraph). Previous long-term motion generating approaches were mostly based on recurrent methods, using previously generated motion chunks as input for the next step. However, this appr…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 HuMoGen Workshop