Skip to main content

Showing 1–50 of 2,932 results for author: Kim, M

.
  1. arXiv:2406.19596  [pdf, other

    cs.CR cs.AI cs.LG

    Optimizing Cyber Defense in Dynamic Active Directories through Reinforcement Learning

    Authors: Diksha Goel, Kristen Moore, Mingyu Guo, Derui Wang, Minjune Kim, Seyit Camtepe

    Abstract: This paper addresses a significant gap in Autonomous Cyber Operations (ACO) literature: the absence of effective edge-blocking ACO strategies in dynamic, real-world networks. It specifically targets the cybersecurity vulnerabilities of organizational Active Directory (AD) systems. Unlike the existing literature on edge-blocking defenses which considers AD systems as static entities, our study coun… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: The manuscript has been accepted as full paper at European Symposium on Research in Computer Security (ESORICS) 2024

  2. arXiv:2406.18925  [pdf, other

    cs.CL cs.CV

    Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding

    Authors: Jiwan Chung, Sungjae Lee, Minseo Kim, Seungju Han, Ashkan Yousefpour, Jack Hessel, Youngjae Yu

    Abstract: Visual arguments, often used in advertising or social causes, rely on images to persuade viewers to do or believe something. Understanding these arguments requires selective vision: only specific visual stimuli within an image are relevant to the argument, and relevance can only be understood within the context of a broader argumentative structure. While visual arguments are readily appreciated by… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 12 pages, 5 figures

  3. arXiv:2406.18675  [pdf, other

    cs.HC cs.AI cs.CL

    Human-AI Collaborative Taxonomy Construction: A Case Study in Profession-Specific Writing Assistants

    Authors: Minhwa Lee, Zae Myung Kim, Vivek A. Khetan, Dongyeop Kang

    Abstract: Large Language Models (LLMs) have assisted humans in several writing tasks, including text revision and story generation. However, their effectiveness in supporting domain-specific writing, particularly in business contexts, is relatively less explored. Our formative study with industry professionals revealed the limitations in current LLMs' understanding of the nuances in such domain-specific wri… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted to CHI 2024 In2Writing Workshop

  4. arXiv:2406.17310  [pdf, other

    eess.AS

    High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model

    Authors: Joun Yeop Lee, Myeonghun Jeong, Minchan Kim, Ji-Hyun Lee, Hoon-Young Cho, Nam Soo Kim

    Abstract: We propose a novel two-stage text-to-speech (TTS) framework with two types of discrete tokens, i.e., semantic and acoustic tokens, for high-fidelity speech synthesis. It features two core components: the Interpreting module, which processes text and a speech prompt into semantic tokens focusing on linguistic contents and alignment, and the Speaking module, which captures the timbre of the target v… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech2024

  5. arXiv:2406.16716  [pdf, other

    eess.AS cs.CR cs.SD

    One-Class Learning with Adaptive Centroid Shift for Audio Deepfake Detection

    Authors: Hyun Myung Kim, Kangwook Jang, Hoirin Kim

    Abstract: As speech synthesis systems continue to make remarkable advances in recent years, the importance of robust deepfake detection systems that perform well in unseen systems has grown. In this paper, we propose a novel adaptive centroid shift (ACS) method that updates the centroid representation by continually shifting as the weighted average of bonafide representations. Our approach uses only bonafid… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  6. arXiv:2406.15225  [pdf, other

    cs.AI cs.RO eess.SP

    Deep UAV Path Planning with Assured Connectivity in Dense Urban Setting

    Authors: Jiyong Oh, Syed M. Raza, Lusungu J. Mwasinga, Moonseong Kim, Hyunseung Choo

    Abstract: Unmanned Ariel Vehicle (UAV) services with 5G connectivity is an emerging field with numerous applications. Operator-controlled UAV flights and manual static flight configurations are major limitations for the wide adoption of scalability of UAV services. Several services depend on excellent UAV connectivity with a cellular network and maintaining it is challenging in predetermined flight paths. T… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures, Published in the 2024 IEEE Network Operations and Management Symposium (NOMS 2024)

  7. arXiv:2406.14703  [pdf, other

    cs.CL cs.AI

    Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics

    Authors: Seungbeen Lee, Seungwon Lim, Seungju Han, Giyeong Oh, Hyungjoo Chae, Jiwan Chung, Minju Kim, Beong-woo Kwak, Yeonsoo Lee, Dongha Lee, **young Yeo, Youngjae Yu

    Abstract: The idea of personality in descriptive psychology, traditionally defined through observable behavior, has now been extended to Large Language Models (LLMs) to better understand their behavior. This raises a question: do LLMs exhibit distinct and consistent personality traits, similar to humans? Existing self-assessment personality tests, while applicable, lack the necessary validity and reliabilit… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Preprint; Under review

  8. arXiv:2406.14277  [pdf, other

    cs.CL cs.AI

    Augmenting Query and Passage for Retrieval-Augmented Generation using LLMs for Open-Domain Question Answering

    Authors: Minsang Kim, Cheoneum Park, Seungjun Baek

    Abstract: Retrieval-augmented generation (RAG) has received much attention for Open-domain question-answering (ODQA) tasks as a means to compensate for the parametric knowledge of large language models (LLMs). While previous approaches focused on processing retrieved passages to remove irrelevant context, they still rely heavily on the quality of retrieved passages which can degrade if the question is ambig… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  9. arXiv:2406.14124  [pdf, other

    cs.AI cs.LG

    Measuring Sample Importance in Data Pruning for Training LLMs from a Data Compression Perspective

    Authors: Minsang Kim, Seungjun Baek

    Abstract: Compute-efficient training of large language models (LLMs) has become an important research problem. In this work, we consider data pruning as a method of data-efficient training of LLMs, where we take a data compression view on data pruning. We argue that the amount of information of a sample, or the achievable compression on its description length, represents its sample importance. The key idea… ▽ More

    Submitted 20 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  10. arXiv:2406.13160  [pdf, ps, other

    math.RT

    Global bases for Bosonic extensions of quantum unipotent coordinate rings

    Authors: Masaki Kashiwara, Myungho Kim, Se-** Oh, Euiyong Park

    Abstract: In the paper, we establish the global basis theory for the bosonic extension $\widehat{\mathcal{A}}$ associated with an arbitrary generalized Cartan matrix. When $\widehat{\mathcal{A}}$ is of simply-laced finite type, it is isomorphic to the quantum Grothendieck ring of the Hernandez-Leclerc category over a quantum affine algebra. In this case, we show that the $(t,q)$-characters of simple modules… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 37pages

    MSC Class: 05E10; 05E18; 17B37}

  11. arXiv:2406.12688  [pdf, other

    eess.AS eess.SP

    Speak in the Scene: Diffusion-based Acoustic Scene Transfer toward Immersive Speech Generation

    Authors: Miseul Kim, Soo-Whan Chung, Youna Ji, Hong-Goo Kang, Min-Seok Choi

    Abstract: This paper introduces a novel task in generative speech processing, Acoustic Scene Transfer (AST), which aims to transfer acoustic scenes of speech signals to diverse environments. AST promises an immersive experience in speech perception by adapting the acoustic scene behind speech signals to desired environments. We propose AST-LDM for the AST task, which generates speech signals accompanied by… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  12. arXiv:2406.12430  [pdf, other

    cs.CL cs.AI cs.LG

    PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers

    Authors: Myeonghwa Lee, Seonho An, Min-Soo Kim

    Abstract: In this paper, we conduct a study to utilize LLMs as a solution for decision making that requires complex data analysis. We define Decision QA as the task of answering the best decision, $d_{best}$, for a decision-making question $Q$, business rules $R$ and a database $D$. Since there is no benchmark that can examine Decision QA, we propose Decision QA benchmark, DQA. It has two scenarios, Locatin… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: NAACL 2024

    ACM Class: I.2.7

  13. arXiv:2406.12254  [pdf, other

    eess.IV cs.CV

    Enhancing Single-Slice Segmentation with 3D-to-2D Unpaired Scan Distillation

    Authors: Xin Yu, Qi Yang, Han Liu, Ho Hin Lee, Yucheng Tang, Lucas W. Remedios, Michael Kim, Shunxing Bao, Ann Xenobia Moore, Luigi Ferrucci, Bennett A. Landman

    Abstract: 2D single-slice abdominal computed tomography (CT) enables the assessment of body habitus and organ health with low radiation exposure. However, single-slice data necessitates the use of 2D networks for segmentation, but these networks often struggle to capture contextual information effectively. Consequently, even when trained on identical datasets, 3D networks typically achieve superior segmenta… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  14. arXiv:2406.10363  [pdf, other

    astro-ph.IM astro-ph.EP

    The Focal-plane Actualized Shifted Technique Realized for a Shack Hartmann Wavefront Sensor (fastrSHWFS)

    Authors: Benjamin L. Gerard, Aaron Lemmer, Bautista R. Fernandez, Xiaoxing Xia, Cesar Laguna, Mike Kim, Stephen Mark Ammons, Brian Bauman, Lisa Poyneer

    Abstract: Astronomical adaptive optics (AO) is a critical approach to enable ground-based diffraction-limited imaging and high contrast science, with the potential to enable habitable exoplanet imaging on future extremely large telescopes. However, AO systems must improve significantly to enable habitable exoplanet imaging. Time lag between the end of an exposure and end of deformable mirror commands being… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: submitted to SPIE Astronomical Telescopes and Instrumentation, paper number 13097-283

  15. arXiv:2406.09698  [pdf, other

    physics.ins-det hep-ex

    Projected background and sensitivity of AMoRE-II

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (81 additional authors not shown)

    Abstract: AMoRE-II aims to search for neutrinoless double beta decay with an array of 423 Li$_2$$^{100}$MoO$_4$ crystals operating in the cryogenic system as the main phase of the Advanced Molybdenum-based Rare process Experiment (AMoRE). AMoRE has been planned to operate in three phases: AMoRE-pilot, AMoRE-I, and AMoRE-II. AMoRE-II is currently being installed at the Yemi Underground Laboratory, located ap… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  16. arXiv:2406.09246  [pdf, other

    cs.RO cs.LG

    OpenVLA: An Open-Source Vision-Language-Action Model

    Authors: Moo ** Kim, Karl Pertsch, Siddharth Karamcheti, Ted Xiao, Ashwin Balakrishna, Suraj Nair, Rafael Rafailov, Ethan Foster, Grace Lam, Pannag Sanketi, Quan Vuong, Thomas Kollar, Benjamin Burchfiel, Russ Tedrake, Dorsa Sadigh, Sergey Levine, Percy Liang, Chelsea Finn

    Abstract: Large policies pretrained on a combination of Internet-scale vision-language data and diverse robot demonstrations have the potential to change how we teach robots new skills: rather than training new behaviors from scratch, we can fine-tune such vision-language-action (VLA) models to obtain robust, generalizable policies for visuomotor control. Yet, widespread adoption of VLAs for robotics has be… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Website: https://openvla.github.io/

  17. arXiv:2406.09025  [pdf, other

    eess.SP

    Site-Specific Radio Channel Representation -- Current State and Future Applications

    Authors: Thomas Zemen, Jorge Gomez-Ponce, Aniruddha Chandra, Michael Walter, Enes Aksoy, Ruisi He, David Matolak, Minseok Kim, Jun-ichi Takada, Sana Salous, Reinaldo Valenzuela, Andreas F. Molisch

    Abstract: A site-specific radio channel representation considers the surroundings of the communication system through the environment geometry, such as buildings, vegetation, and mobile objects including their material and surface properties. In this article, we focus on communication technologies for 5G and beyond that are increasingly able to exploit the specific environment geometry for both communicatio… ▽ More

    Submitted 18 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 7 pages, 5 figures, submitted to the IEEE Communication Magazine

  18. arXiv:2406.08702  [pdf, other

    cs.AI cs.CL cs.CV

    VLind-Bench: Measuring Language Priors in Large Vision-Language Models

    Authors: Kang-il Lee, Minbeom Kim, Minsung Kim, Dongryeol Lee, Hyukhun Koh, Kyomin Jung

    Abstract: Large Vision-Language Models (LVLMs) have demonstrated outstanding performance across various multimodal tasks. However, they suffer from a problem known as language prior, where responses are generated based solely on textual patterns while disregarding image information. Addressing the issue of language prior is crucial, as it can lead to undesirable biases or hallucinations when dealing with im… ▽ More

    Submitted 17 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  19. arXiv:2406.08328  [pdf, other

    eess.AS

    Multimodal Representation Loss Between Timed Text and Audio for Regularized Speech Separation

    Authors: Tsun-An Hsieh, Heeyoul Choi, Minje Kim

    Abstract: Recent studies highlight the potential of textual modalities in conditioning the speech separation model's inference process. However, regularization-based methods remain underexplored despite their advantages of not requiring auxiliary text data during the test time. To address this gap, we introduce a timed text-based regularization (TTR) method that uses language model-derived semantics to impr… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  20. arXiv:2406.08301  [pdf, other

    nucl-ex

    Jet modification via $π^0$-hadron correlations in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, S. Afanasiev, C. Aidala, N. N. Ajitanand, Y. Akiba, H. Al-Bataineh, J. Alexander, M. Alfred, K. Aoki, N. Apadula, L. Aphecetche, J. Asai, H. Asano, E. T. Atomssa, R. Averbeck, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, G. Baksay, L. Baksay, A. Baldisseri , et al. (510 additional authors not shown)

    Abstract: High-momentum two-particle correlations are a useful tool for studying jet-quenching effects in the quark-gluon plasma. Angular correlations between neutral-pion triggers and charged hadrons with transverse momenta in the range 4--12~GeV/$c$ and 0.5--7~GeV/$c$, respectively, have been measured by the PHENIX experiment in 2014 for Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. Suppression is obs… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 534 authors from 83 institutions, 12 pages, 7 figures. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  21. arXiv:2406.08292  [pdf, other

    cs.CV

    Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata

    Authors: Dongsu Zhang, Francis Williams, Zan Gojcic, Karsten Kreis, Sanja Fidler, Young Min Kim, Amlan Kar

    Abstract: We aim to generate fine-grained 3D geometry from large-scale sparse LiDAR scans, abundantly captured by autonomous vehicles (AV). Contrary to prior work on AV scene completion, we aim to extrapolate fine geometry from unlabeled and beyond spatial limits of LiDAR scans, taking a step towards generating realistic, high-resolution simulation-ready 3D street environments. We propose hierarchical Gener… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to CVPR 2024 as highlight

  22. arXiv:2406.07867  [pdf, other

    cs.CV cs.AI cs.HC

    Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation

    Authors: Se ** Park, Chae Won Kim, Hyeongseop Rha, Minsu Kim, Joanna Hong, Jeong Hun Yeo, Yong Man Ro

    Abstract: In this paper, we introduce a novel Face-to-Face spoken dialogue model. It processes audio-visual speech from user input and generates audio-visual speech as the response, marking the initial step towards creating an avatar chatbot system without relying on intermediate text. To this end, we newly introduce MultiDialog, the first large-scale multimodal (i.e., audio and visual) spoken dialogue corp… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024

  23. arXiv:2406.07863  [pdf, other

    astro-ph.GA

    The Size-luminosity Relation of the AGN Torus Determined from the Comparison between Optical and Mid-infrared Variability

    Authors: Min** Kim, Suyeon Son, Luis C. Ho

    Abstract: We investigate the optical variability of low-redshift ($0.15< z\leq0.4$) active galactic nuclei using the multi-epoch data from the Zwicky Transient Facility. We find that a damped random walk model well describes the ensemble structure function in the $g$ band. Consistent with previous studies, more luminous active galactic nuclei tend to have a steeper structure function at a timescale less tha… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 7 pages, 5 figures. Accepted for publication in A&A

  24. arXiv:2406.05994  [pdf, ps, other

    math.AP

    Perron solutions and boundary regularity for nonlocal nonlinear Dirichlet problems

    Authors: Anders Björn, Jana Björn, Minhyun Kim

    Abstract: For nonlinear operators of fractional $p$-Laplace type, we consider two types of solutions to the nonlocal Dirichlet problem: Sobolev solutions based on fractional Sobolev spaces and Perron solutions based on superharmonic functions. These solutions give rise to two different concepts of regularity for boundary points, namely Sobolev and Perron regularity. We show that these two notions are equiva… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    MSC Class: Primary: 35R11. Secondary: 31C15; 31C45; 35J66

  25. arXiv:2406.05965  [pdf, other

    eess.AS cs.AI

    MakeSinger: A Semi-Supervised Training Method for Data-Efficient Singing Voice Synthesis via Classifier-free Diffusion Guidance

    Authors: Semin Kim, Myeonghun Jeong, Hyeonseung Lee, Minchan Kim, Byoung ** Choi, Nam Soo Kim

    Abstract: In this paper, we propose MakeSinger, a semi-supervised training method for singing voice synthesis (SVS) via classifier-free diffusion guidance. The challenge in SVS lies in the costly process of gathering aligned sets of text, pitch, and audio data. MakeSinger enables the training of the diffusion-based SVS model from any speech and singing voice data regardless of its labeling, thereby enhancin… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  26. arXiv:2406.05963  [pdf, other

    cs.CV cs.AI

    Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024

    Authors: **woo Ahn, Junhyeok Park, Min-Jun Kim, Kang-Hyeon Kim, So-Yeong Sohn, Yun-Ji Lee, Du-Seong Chang, Yu-Jung Heo, Eun-Sol Kim

    Abstract: In this paper, the solution of HYU MLLAB KT Team to the Multimodal Algorithmic Reasoning Task: SMART-101 CVPR 2024 Challenge is presented. Beyond conventional visual question-answering problems, the SMART-101 challenge aims to achieve human-level multimodal understanding by tackling complex visio-linguistic puzzles designed for children in the 6-8 age group. To solve this problem, we suggest two m… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  27. arXiv:2406.05752  [pdf, ps, other

    astro-ph.HE gr-qc

    The quantum-refractive evolution of polarization states in pulsar emission

    Authors: Dong-Hoon Kim, Chul Min Kim, Sang Pyo Kim

    Abstract: Highly magnetized neutron stars have quantum refraction effects on pulsar emission due to the non-linearity of the quantum electrodynamics (QED) action. In this paper, we investigate the evolution of the polarization states under the quantum refraction effects combined with the frequency dependence of pulsar emission; we solve a system of evolution equations of the Stokes vector, where the birefri… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 14 pages, 9 figures

  28. arXiv:2406.04307  [pdf, other

    quant-ph cond-mat.str-el physics.comp-ph

    High-precision and low-depth eigenstate property estimation: theory and resource estimation

    Authors: **zhao Sun, Pei Zeng, Tom Gur, M. S. Kim

    Abstract: Estimating the eigenstate properties of quantum many-body systems is a long-standing, challenging problem for both classical and quantum computing. For the task of eigenstate preparation, quantum signal processing (QSP) has established near-optimal query complexity $O( Δ^{-1} \log(ε^{-1}) )$ by querying the block encoding of the Hamiltonian $H$ where $Δ$ is the energy gap and $ε$ is the target pre… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 48 pages, 7 figures, and 4 tables

  29. arXiv:2406.04190  [pdf, other

    quant-ph cond-mat.stat-mech

    Probing quantum complexity via universal saturation of stabilizer entropies

    Authors: Tobias Haug, Leandro Aolita, M. S. Kim

    Abstract: Nonstabilizerness or `magic' is a key resource for quantum computing and a necessary condition for quantum advantage. Non-Clifford operations turn stabilizer states into resourceful states, where the amount of nonstabilizerness is quantified by resource measures such as stabilizer Rényi entropies (SREs). Here, we show that SREs saturate their maximum value at a critical number of non-Clifford oper… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 13 pages, 8 figures

  30. arXiv:2406.03328  [pdf, other

    quant-ph physics.app-ph

    Leveraging Off-the-Shelf Silicon Chips for Quantum Computing

    Authors: John Michniewicz, M. S. Kim

    Abstract: There is a growing demand for quantum computing across various sectors, including finance, materials and studying chemical reactions. A promising implementation involves semiconductor qubits utilizing quantum dots within transistors. While academic research labs currently produce their own devices, scaling this process is challenging, requires expertise, and results in devices of varying quality.… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  31. arXiv:2406.02803  [pdf, other

    cs.DC

    DRust: Language-Guided Distributed Shared Memory with Fine Granularity, Full Transparency, and Ultra Efficiency

    Authors: Haoran Ma, Yifan Qiao, Shi Liu, Shan Yu, Yuanjiang Ni, Qingda Lu, Jiesheng Wu, Yiying Zhang, Miryung Kim, Harry Xu

    Abstract: Despite being a powerful concept, distributed shared memory (DSM) has not been made practical due to the extensive synchronization needed between servers to implement memory coherence. This paper shows a practical DSM implementation based on the insight that the ownership model embedded in programming languages such as Rust automatically constrains the order of read and write, providing opportunit… ▽ More

    Submitted 27 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  32. arXiv:2406.02756  [pdf, other

    cs.CL cs.AI cs.LG

    Aligning Large Language Models via Fine-grained Supervision

    Authors: Dehong Xu, Liang Qiu, Minseok Kim, Faisal Ladhak, Jaeyoung Do

    Abstract: Pre-trained large-scale language models (LLMs) excel at producing coherent articles, yet their outputs may be untruthful, toxic, or fail to align with user expectations. Current approaches focus on using reinforcement learning with human feedback (RLHF) to improve model alignment, which works by transforming coarse human preferences of LLM outputs into a feedback signal that guides the model learn… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  33. arXiv:2406.02701  [pdf, other

    stat.CO

    MPCR: Multi- and Mixed-Precision Computations Package in R

    Authors: Mary Lai O. Salvana, Sameh Abdulah, Minwoo Kim, David Helmy, Ying Sun, Marc G. Genton

    Abstract: Computational statistics has traditionally utilized double-precision (64-bit) data structures and full-precision operations, resulting in higher-than-necessary accuracy for certain applications. Recently, there has been a growing interest in exploring low-precision options that could reduce computational complexity while still achieving the required level of accuracy. This trend has been amplified… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  34. arXiv:2406.01483  [pdf

    physics.atom-ph

    Influence of spectra sewing on XCT measurement

    Authors: A. J. Arikkat, K. A. Janulewicz, C. M. Kim, P. Wachulak

    Abstract: The paper presents an analysis of the possible spectra manipulation and its consequence for the specific application of XCT. The focus was on the modification of the registered spectra dominantly by the sewing/stitching method. A model spectrum was created to analyse the possible behaviour of the spectral components when specifically arranged. The model and processing of real experimental data rev… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages, 5 figures, 19 references

  35. Focus on the Core: Efficient Attention via Pruned Token Compression for Document Classification

    Authors: Jungmin Yun, Mihyeon Kim, Youngbin Kim

    Abstract: Transformer-based models have achieved dominant performance in numerous NLP tasks. Despite their remarkable successes, pre-trained transformers such as BERT suffer from a computationally expensive self-attention mechanism that interacts with all tokens, including the ones unfavorable to classification performance. To overcome these challenges, we propose integrating two strategies: token pruning a… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to EMNLP 2023 Findings

  36. arXiv:2406.00857  [pdf, other

    astro-ph.IM

    Modeling the refractive index profile n(z) of polar ice for ultra-high energy neutrino experiments

    Authors: S. Ali, P. Allison, S. Archambault, J. J. Beatty, D. Z. Besson, A. Bishop, P. Chen, Y. C. Chen, B. A. Clark, W. Clay, A. Connolly, K. Couberly, L. Cremonesi, A. Cummings, P. Dasgupta, R. Debolt, S. de Kockere, K. D. de Vries, C. Deaconu, M. A. DuVernois, J. Flaherty, E. Friedman, R. Gaior, P. Giri, J. Hanson , et al. (45 additional authors not shown)

    Abstract: We develop an in-situ index of refraction profile using the transit time of radio signals broadcast from an englacial transmitter to 2-5 km distant radio-frequency receivers, deployed at depths up to 200 m. Maxwell's equations generally admit two ray propagation solutions from a given transmitter, corresponding to a direct path (D) and a refracted path (R); the measured D vs. R (dt(D,R)) timing di… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  37. arXiv:2406.00431  [pdf, ps, other

    cs.LG cs.AI cs.DC

    SpaFL: Communication-Efficient Federated Learning with Sparse Models and Low computational Overhead

    Authors: Minsu Kim, Walid Saad, Merouane Debbah, Choong Seon Hong

    Abstract: The large communication and computation overhead of federated learning (FL) is one of the main challenges facing its practical deployment over resource-constrained clients and systems. In this work, SpaFL: a communication-efficient FL framework is proposed to optimize sparse model structures with low computational overhead. In SpaFL, a trainable threshold is defined for each filter/neuron to prune… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  38. arXiv:2405.20971  [pdf, other

    cs.LG cs.CV

    Amortizing intractable inference in diffusion models for vision, language, and control

    Authors: Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning, but their use as priors in downstream tasks poses an intractable posterior inference problem. This paper studies amortized sampling of the posterior over data, $\mathbf{x}\sim p^{\rm post}(\mathbf{x})\propto p(\mathbf{x})r(\mathbf{x})$, in a model that consists of a diffusion generat… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Code: https://github.com/GFNOrg/diffusion-finetuning

  39. arXiv:2405.20597  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Double-sided van der Waals epitaxy of topological insulators across an atomically thin membrane

    Authors: Joon Young Park, Young Jae Shin, Jeacheol Shin, Jehyun Kim, Janghyun Jo, Hyobin Yoo, Danial Haei, Chohee Hyun, Jiyoung Yun, Robert M. Huber, Arijit Gupta, Kenji Watanabe, Takashi Taniguchi, Wan Kyu Park, Hyeon Suk Shin, Miyoung Kim, Dohun Kim, Gyu-Chul Yi, Philip Kim

    Abstract: Atomically thin van der Waals (vdW) films provide a novel material platform for epitaxial growth of quantum heterostructures. However, unlike the remote epitaxial growth of three-dimensional bulk crystals, the growth of two-dimensional (2D) material heterostructures across atomic layers has been limited due to the weak vdW interaction. Here, we report the double-sided epitaxy of vdW layered materi… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 24 pages, 4 main figures, 7 extended data figures

  40. arXiv:2405.20517  [pdf, other

    cond-mat.supr-con cond-mat.mtrl-sci

    Solid-State Reactions at Niobium-Germanium Interfaces in Hybrid Superconductor-Semiconductor Devices

    Authors: Bernardo Langa Jr., Deepak Sapkota, Ivan Lainez, Richard Haight, Bernadeta Srijanto, Leonard Feldman, Hussein Hijazi, Xiangyu Zhu, Lifang Hu, Moon Kim, Kasra Sardashti

    Abstract: Hybrid Superconductor-Semiconductor (S-Sm) materials systems are promising candidates for quantum computing applications. Their integration into superconducting electronics has enabled on-demand voltage tunability at millikelvin temperatures. Ge quantum wells (Ge QWs) have been among the semiconducting platforms interfaced with superconducting Al to realize voltage tunable Josephson junctions. Her… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 8 pages including 3 for supplementary material

  41. arXiv:2405.19891  [pdf, other

    quant-ph cs.IT

    Improving the Fidelity of CNOT Circuits on NISQ Hardware

    Authors: Dohun Kim, Minyoung Kim, Sarah Meng Li, Michele Mosca

    Abstract: We introduce an improved CNOT synthesis algorithm that considers nearest-neighbour interactions and CNOT gate error rates in noisy intermediate-scale quantum (NISQ) hardware. Compared to IBM's Qiskit compiler, it improves the fidelity of a synthesized CNOT circuit by about 2 times on average (up to 9 times). It lowers the synthesized CNOT count by a factor of 13 on average (up to a factor of 162).… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 67 pages, 33 figures, and 9 tables

  42. arXiv:2405.19806  [pdf, other

    cs.LG

    Preference Alignment with Flow Matching

    Authors: Minu Kim, Yongsik Lee, Sehyeok Kang, Jihwan Oh, Song Chong, Seyoung Yun

    Abstract: We present Preference Flow Matching (PFM), a new framework for preference-based reinforcement learning (PbRL) that streamlines the integration of preferences into an arbitrary class of pre-trained models. Existing PbRL methods require fine-tuning pre-trained models, which presents challenges such as scalability, inefficiency, and the need for model modifications, especially with black-box APIs lik… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  43. arXiv:2405.19691  [pdf, other

    cs.HC

    Designing Prompt Analytics Dashboards to Analyze Student-ChatGPT Interactions in EFL Writing

    Authors: Minsun Kim, SeonGyeom Kim, Suyoun Lee, Yoosang Yoon, Junho Myung, Haneul Yoo, Hyungseung Lim, Jieun Han, Yoonsu Kim, So-Yeon Ahn, Juho Kim, Alice Oh, Hwajung Hong, Tak Yeon Lee

    Abstract: While ChatGPT has significantly impacted education by offering personalized resources for students, its integration into educational settings poses unprecedented risks, such as inaccuracies and biases in AI-generated content, plagiarism and over-reliance on AI, and privacy and security issues. To help teachers address such risks, we conducted a two-phase iterative design process that comprises sur… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  44. arXiv:2405.18986  [pdf, other

    cs.LG q-bio.BM q-bio.QM

    Robust Optimization in Protein Fitness Landscapes Using Reinforcement Learning in Latent Space

    Authors: Minji Lee, Luiz Felipe Vecchietti, Hyunkyu Jung, Hyun Joo Ro, Meeyoung Cha, Ho Min Kim

    Abstract: Proteins are complex molecules responsible for different functions in nature. Enhancing the functionality of proteins and cellular fitness can significantly impact various industries. However, protein optimization using computational methods remains challenging, especially when starting from low-fitness sequences. We propose LatProtRL, an optimization method to efficiently traverse a latent space… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  45. arXiv:2405.18540  [pdf, other

    cs.CL cs.CR cs.LG

    Learning diverse attacks on large language models for robust red-teaming and safety tuning

    Authors: Seanie Lee, Minsu Kim, Lynn Cherif, David Dobre, Juho Lee, Sung Ju Hwang, Kenji Kawaguchi, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Moksh Jain

    Abstract: Red-teaming, or identifying prompts that elicit harmful responses, is a critical step in ensuring the safe and responsible deployment of large language models (LLMs). Develo** effective protection against many modes of attack prompts requires discovering diverse attacks. Automated red-teaming typically uses reinforcement learning to fine-tune an attacker language model to generate prompts that e… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  46. arXiv:2405.18206  [pdf, other

    cs.LG stat.ME stat.ML

    Multi-CATE: Multi-Accurate Conditional Average Treatment Effect Estimation Robust to Unknown Covariate Shifts

    Authors: Christoph Kern, Michael Kim, Angela Zhou

    Abstract: Estimating heterogeneous treatment effects is important to tailor treatments to those individuals who would most likely benefit. However, conditional average treatment effect predictors may often be trained on one population but possibly deployed on different, possibly unknown populations. We use methodology for learning multi-accurate predictors to post-process CATE T-learners (differenced regres… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  47. arXiv:2405.17938  [pdf, other

    cs.LG

    RC-Mixup: A Data Augmentation Strategy against Noisy Data for Regression Tasks

    Authors: Seong-Hyeon Hwang, Minsu Kim, Steven Euijong Whang

    Abstract: We study the problem of robust data augmentation for regression tasks in the presence of noisy data. Data augmentation is essential for generalizing deep learning models, but most of the techniques like the popular Mixup are primarily designed for classification tasks on image data. Recently, there are also Mixup techniques that are specialized to regression tasks like C-Mixup. In comparison to Mi… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to KDD 2024

  48. arXiv:2405.17318  [pdf, other

    math.ST stat.ME

    Extremal correlation coefficient for functional data

    Authors: Mihyun Kim, Piotr Kokoszka

    Abstract: We propose a coefficient that measures dependence in paired samples of functions. It has properties similar to the Pearson correlation, but differs in significant ways: 1) it is designed to measure dependence between curves, 2) it focuses only on extreme curves. The new coefficient is derived within the framework of regular variation in Banach spaces. A consistent estimator is proposed and justifi… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    MSC Class: 62R10; 60G70

  49. arXiv:2405.17000  [pdf, other

    astro-ph.GA

    MHONGOOSE discovery of a gas-rich low-surface brightness galaxy in the Dorado Group

    Authors: F. M. Maccagni, W. J. G. de Blok, P. E. Mancera Piña, R. Ragusa, E. Iodice, M. Spavone, S. McGaugh, K. A. Oman, T. A. Oosterloo, B. S. Koribalski, M. Kim, E. A. K. Adams, P. Amram, A. Bosma, F. Bigiel, E. Brinks, L. Chemin, F. Combes, B. Gibson, J. Healy, B. W. Holwerda, G. I. G. Józsa, P. Kamphuis, D. Kleiner, S. Kurapati , et al. (6 additional authors not shown)

    Abstract: We present the discovery of a low-mass gas-rich low-surface brightness galaxy in the Dorado Group, at a distance of 17.7 Mpc. Combining deep MeerKAT 21-cm observations from the MeerKAT HI Observations of Nearby Galactic Objects: Observing Southern Emitters (MHONGOOSE) survey with deep photometric images from the VST Early-type Galaxy Survey (VEGAS) we find a stellar and neutral atomic hydrogen (HI… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 18 pages, 14 figures, 5 tables; accepted for publication in A&A

  50. arXiv:2405.16658  [pdf, other

    cs.LG cs.AI

    Acceleration of Grokking in Learning Arithmetic Operations via Kolmogorov-Arnold Representation

    Authors: Yeachan Park, Minseok Kim, Yeoneung Kim

    Abstract: We propose novel methodologies aimed at accelerating the grokking phenomenon, which refers to the rapid increment of test accuracy after a long period of overfitting as reported in~\cite{power2022grokking}. Focusing on the grokking phenomenon that arises in learning arithmetic binary operations via the transformer model, we begin with a discussion on data augmentation in the case of commutative bi… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.