Skip to main content

Showing 1–50 of 740 results for author: Sim

.
  1. arXiv:2407.01513  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Pair Density Waves and Supercurrent Diode Effect in Altermagnets

    Authors: GiBaik Sim, Johannes Knolle

    Abstract: Metallic altermagnets are unusual collinear magnets that feature zero net magnetization with momentum-dependent spin splitting. Here, we show that this spin splitting can induce pair density wave states even in the absence of external magnetic fields. Focusing on BCS-type attractive interactions, we find the stabilization of symmetrically distinct pair density wave states depending on the chemical… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 9 pages, 5 figures

  2. arXiv:2407.01398  [pdf, other

    astro-ph.HE astro-ph.SR physics.atom-ph physics.chem-ph

    New Radiative and Collisional Atomic Data for Sr {\sc ii} and Y {\sc ii} with application to Kilonova modelling

    Authors: Leo Mulholland, Niall McElroy, Fiona McNeill, Stuart Sim, Connor Ballance, Catherine Ramsbottom

    Abstract: The spectra of singly ionised Strontium and Yttrium (Sr {\sc ii} and Y {\sc ii}) have been proposed as identifications of certain spectral features in the AT2017gfo spectrum. With the growing demand for NLTE simulations of Kilonovae, there is a increasing need for atomic data for these and other $r$-process elements. Our goal is to expand upon the current set of atomic data for $r$-process element… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.19707  [pdf, other

    cs.LG cs.DC

    InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management

    Authors: Wonbeom Lee, Jungi Lee, Junghwan Seo, Jaewoong Sim

    Abstract: Transformer-based large language models (LLMs) demonstrate impressive performance across various natural language processing tasks. Serving LLM inference for generating long contents, however, poses a challenge due to the enormous memory footprint of the transient state, known as the key-value (KV) cache, which scales with the sequence length and batch size. In this paper, we present InfiniGen, a… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: OSDI 2024

  4. arXiv:2406.19151  [pdf, other

    quant-ph

    Trivariate Bicycle Codes

    Authors: Lukas Voss, Sim Jian Xian, Tobias Haug, Kishor Bharti

    Abstract: Quantum error correction suppresses noise in quantum systems to allow for high-precision computations. In this work, we introduce Trivariate Bicycle Quantum Low-Density Parity-Check (TB-QLDPC) codes, via an extension of the framework developed by Bravyi et al. [Nature, 627, 778-782 (2024)]. Unlike the weight-6 codes proposed in their study, our approach also offers weight-4 and weight-5 codes, whi… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 18 pages, 18 figures

  5. arXiv:2406.16609  [pdf, other

    cs.NE cs.AI cs.LG

    Evaluating the Robustness of Deep-Learning Algorithm-Selection Models by Evolving Adversarial Instances

    Authors: Emma Hart, Quentin Renau, Kevin Sim, Mohamad Alissa

    Abstract: Deep neural networks (DNN) are increasingly being used to perform algorithm-selection in combinatorial optimisation domains, particularly as they accommodate input representations which avoid designing and calculating features. Mounting evidence from domains that use images as input shows that deep convolutional networks are vulnerable to adversarial samples, in which a small perturbation of an in… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: To appear in the proceedings of the 18th International Conference on Parallel Problem Solving from Nature (PPSN 2024)

  6. arXiv:2406.14473  [pdf, other

    cs.LG cs.CL

    Data-Centric AI in the Age of Large Language Models

    Authors: Xinyi Xu, Zhaoxuan Wu, Rui Qiao, Arun Verma, Yao Shu, **gtan Wang, Xinyuan Niu, Zhenfeng He, Jiangwei Chen, Zijian Zhou, Gregory Kang Ruey Lau, Hieu Dao, Lucas Agussurja, Rachael Hwee Ling Sim, Xiaoqiang Lin, Wenyang Hu, Zhongxiang Dai, Pang Wei Koh, Bryan Kian Hsiang Low

    Abstract: This position paper proposes a data-centric viewpoint of AI research, focusing on large language models (LLMs). We start by making the key observation that data is instrumental in the developmental (e.g., pretraining and fine-tuning) and inferential stages (e.g., in-context learning) of LLMs, and yet it receives disproportionally low attention from the research community. We identify four specific… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Preprint

  7. arXiv:2406.12930  [pdf, other

    cs.LG cs.AR

    Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization

    Authors: Jungi Lee, Wonbeom Lee, Jaewoong Sim

    Abstract: Large language models (LLMs) demonstrate outstanding performance in various tasks in machine learning and have thus become one of the most important workloads in today's computing landscape. However, deploying LLM inference poses challenges due to the high compute and memory requirements stemming from the enormous model size and the difficulty of running it in the integer pipelines. In this paper,… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: To appear at the 51st International Symposium on Computer Architecture (ISCA 2024)

  8. arXiv:2406.12208  [pdf, other

    cs.CL cs.AI cs.CV cs.NE

    Knowledge Fusion By Evolving Weights of Language Models

    Authors: Guodong Du, **g Li, Hanting Liu, Runhua Jiang, Shuyang Yu, Yifei Guo, Sim Kuan Goh, Ho-Kin Tang

    Abstract: Fine-tuning pre-trained language models, particularly large language models, demands extensive computing resources and can result in varying performance outcomes across different domains and datasets. This paper examines the approach of integrating multiple models from diverse training scenarios into a unified model. This unified model excels across various data domains and exhibits the ability to… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL2024 Findings

  9. arXiv:2406.11379  [pdf, other

    math.AP math-ph

    On existence of Sadovskii vortex patch: A touching pair of symmetric counter-rotating uniform vortex

    Authors: Kyudong Choi, In-Jee Jeong, Young-** Sim

    Abstract: The Sadovskii vortex patch is a traveling wave for the two-dimensional incompressible Euler equations consisting of an odd symmetric pair of vortex patches touching the symmetry axis. Its existence was first suggested by numerical computations of Sadovskii in [J. Appl. Math. Mech., 1971], and has gained significant interest due to its relevance in inviscid limit of planar flows via Prandtl--Batche… ▽ More

    Submitted 29 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 42 pages, 1 figure

  10. Towards Understanding Emotions for Engaged Mental Health Conversations

    Authors: Kellie Yu Hui Sim, Kohleen Ti**g Fortuno, Kenny Tsu Wei Choo

    Abstract: Providing timely support and intervention is crucial in mental health settings. As the need to engage youth comfortable with texting increases, mental health providers are exploring and adopting text-based media such as chatbots, community-based forums, online therapies with licensed professionals, and helplines operated by trained responders. To support these text-based media for mental health--p… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 5 pages, 1 figure, to be published in DIS Companion '24

    ACM Class: H.5.2; I.2.7

  11. arXiv:2406.08682  [pdf, other

    cond-mat.mtrl-sci

    FIP-GNN: Graph neural networks for scalable prediction of grain-level fatigue indicator parameters

    Authors: Gyu-Jang Sim, Myoung-Gyu Lee, Marat I. Latypov

    Abstract: High-cycle fatigue is a critical performance metric of structural alloys for many applications. The high cost, time, and labor involved in experimental fatigue testing call for efficient and accurate computer models of fatigue life. We present graph neural networks for polycrystals that, for the first time, can (i) predict fatigue indicator parameters -- grain-level responses to cyclic loading wel… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  12. arXiv:2406.08301  [pdf, other

    nucl-ex

    Jet modification via $π^0$-hadron correlations in Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, S. Afanasiev, C. Aidala, N. N. Ajitanand, Y. Akiba, H. Al-Bataineh, J. Alexander, M. Alfred, K. Aoki, N. Apadula, L. Aphecetche, J. Asai, H. Asano, E. T. Atomssa, R. Averbeck, T. C. Awes, B. Azmoun, V. Babintsev, M. Bai, G. Baksay, L. Baksay, A. Baldisseri , et al. (510 additional authors not shown)

    Abstract: High-momentum two-particle correlations are a useful tool for studying jet-quenching effects in the quark-gluon plasma. Angular correlations between neutral-pion triggers and charged hadrons with transverse momenta in the range 4--12~GeV/$c$ and 0.5--7~GeV/$c$, respectively, have been measured by the PHENIX experiment in 2014 for Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. Suppression is obs… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 534 authors from 83 institutions, 12 pages, 7 figures. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  13. arXiv:2406.05411  [pdf, other

    quant-ph cond-mat.other

    Generalized symmetry in non-Hermitian systems

    Authors: Karin Sim, Nicolò Defenu, Paolo Molignini, R. Chitra

    Abstract: Despite acute interest in the dynamics of non-Hermitian systems, there is a lack of consensus in the mathematical formulation of non-Hermitian quantum mechanics in the community. Different methodologies are used in the literature to study non-Hermitian dynamics. This ranges from consistent frameworks like biorthogonal quantum mechanics and metric approach characterized by modified inner products,… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 11 pages, 1 figure

  14. arXiv:2406.04576  [pdf, other

    physics.optics physics.app-ph physics.bio-ph physics.med-ph

    Metasurfaces for infrared multi-modal microscopy: phase contrast and bright field

    Authors: Shaban B. Sulejman, Lukas Wesemann, Mikkaela McCormack, Jiajun Meng, James A. Hutchison, Niken Priscilla, Gawain McColl, Katrina Read, Wilson Sim, Andrey A. Sukhorukov, Kenneth B. Crozier, Ann Roberts

    Abstract: Different imaging modalities are used to extract the diverse information carried in an optical field. Two prominent modalities include bright field and phase contrast microscopy that can visualize the amplitude and phase features of a sample, respectively. However, capturing both of these images on the same camera typically requires interchanging optical components. Metasurfaces are ultra-thin nan… ▽ More

    Submitted 9 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Main text 18 pages, 5 main figures, Supplementary information 19 pages, 19 supplementary figures, 2 supplementary videos

  15. arXiv:2406.02349  [pdf, other

    cs.NE cs.AI cs.CV

    CADE: Cosine Annealing Differential Evolution for Spiking Neural Network

    Authors: Runhua Jiang, Guodong Du, Shuyang Yu, Yifei Guo, Sim Kuan Goh, Ho-Kin Tang

    Abstract: Spiking neural networks (SNNs) have gained prominence for their potential in neuromorphic computing and energy-efficient artificial intelligence, yet optimizing them remains a formidable challenge for gradient-based methods due to their discrete, spike-based computation. This paper attempts to tackle the challenges by introducing Cosine Annealing Differential Evolution (CADE), designed to modulate… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  16. arXiv:2405.18832  [pdf, other

    cs.LG cs.AI cs.AR

    MoNDE: Mixture of Near-Data Experts for Large-Scale Sparse Models

    Authors: Taehyun Kim, Kwanseok Choi, Youngmock Cho, Jaehoon Cho, Hyuk-Jae Lee, Jaewoong Sim

    Abstract: Mixture-of-Experts (MoE) large language models (LLM) have memory requirements that often exceed the GPU memory capacity, requiring costly parameter movement from secondary memories to the GPU for expert computation. In this work, we present Mixture of Near-Data Experts (MoNDE), a near-data computing solution that efficiently enables MoE LLM inference. MoNDE reduces the volume of MoE parameter move… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted to DAC 2024

  17. arXiv:2405.13596  [pdf, other

    astro-ph.HE astro-ph.SR

    SN 2023zaw: the low-energy explosion of an ultra-stripped star, with non-radioactive heating

    Authors: Thomas Moore, James Gillanders, Matt Nicholl, Mark Huber, Stephen Smartt, Shubham Srivastav, Heloise Stevance, Ting-Wan Chen, Kenneth Chambers, Joseph Anderson, Michael Fulton, Samantha Oates, Charlotte Angus, Giuliano Pignata, Nicolas Erasmus, Hua Gao, Joanna Bulger, Chien-Cheng Lin, Thomas Lowe, Eugene Magnier, Paloma Minguez, Chow-Choong Ngeow, Xinyue Sheng, Stuart A. Sim, Ken Smith , et al. (4 additional authors not shown)

    Abstract: Most stripped envelope supernova progenitors are formed through binary interaction, losing hydrogen and/or helium from their outer layers. An emerging class of supernovae with the highest degree of envelope-strip** are thought to be the product of strip** by a NS companion. However, relatively few examples are known and the outcomes of such systems can be diverse and are poorly understood at p… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  18. arXiv:2405.11436  [pdf, other

    quant-ph

    Quantum sampling algorithms for quantum state preparation and matrix block-encoding

    Authors: Jessica Lemieux, Matteo Lostaglio, Sam Pallister, William Pol, Karthik Seetharam, Sukin Sim, Burak Şahinoğlu

    Abstract: The problems of quantum state preparation and matrix block-encoding are ubiquitous in quantum computing: they are crucial parts of various quantum algorithms for the purpose for initial state preparation as well as loading problem relevant data. We first present an algorithm based on QRS that prepares a quantum state $|ψ_f\rangle \propto \sum^N_{x=1} f(x)|x\rangle$. When combined with efficient re… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: 58 pages, 28 figures, 5 tables

  19. arXiv:2405.07457  [pdf, other

    cond-mat.mtrl-sci

    Magnetoelectric domain engineering from micrometer to Ångstrøm scales

    Authors: Marcela Giraldo, Arkadiy Simonov, Hasung Sim, Ahmed Samir Lotfy, Martin Lilienblum, Lea Forster, Elzbieta Gradauskaite, Morgan Trassin, Je-Geun Park, Thomas Lottermoser, Manfred Fiebig

    Abstract: The functionality of magnetoelectric multiferroics depends on the formation, size, and coupling of their magnetic and electric domains. Knowing the parameters guiding these criteria is a key effort in the emerging field of magnetoelectric domain engineering. Here we show, using a combination of piezoresponse-force microscopy, non-linear optics, and x-ray scattering, that the correlation length set… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 10 pages, 8 figures

  20. arXiv:2405.07414  [pdf, other

    cs.LG cs.AI

    Binning as a Pretext Task: Improving Self-Supervised Learning in Tabular Domains

    Authors: Kyungeun Lee, Ye Seul Sim, Hye-Seung Cho, Moonjung Eo, Suhee Yoon, Sanghyu Yoon, Woohyung Lim

    Abstract: The ability of deep networks to learn superior representations hinges on leveraging the proper inductive biases, considering the inherent properties of datasets. In tabular domains, it is critical to effectively handle heterogeneous features (both categorical and numerical) in a unified manner and to grasp irregular functions like piecewise constant functions. To address the challenges in the self… ▽ More

    Submitted 13 May, 2024; v1 submitted 12 May, 2024; originally announced May 2024.

    Comments: ICML 2024, 18 pages (including supplementary materials)

  21. arXiv:2404.19611  [pdf, other

    eess.SP cs.ET cs.IT cs.NI

    Radio Resource Management Design for RSMA: Optimization of Beamforming, User Admission, and Discrete/Continuous Rates with Imperfect SIC

    Authors: L. F. Abanto-Leon, A. Krishnamoorthy, A. Garcia-Saavedra, G. H. Sim, R. Schober, M. Hollick

    Abstract: This paper investigates the radio resource management (RRM) design for multiuser rate-splitting multiple access (RSMA), accounting for various characteristics of practical wireless systems, such as the use of discrete rates, the inability to serve all users, and the imperfect successive interference cancellation (SIC). Specifically, failure to consider these characteristics in RRM design may lead… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  22. arXiv:2404.14618  [pdf, other

    cs.LG cs.AI cs.CL

    Hybrid LLM: Cost-Efficient and Quality-Aware Query Routing

    Authors: Dujian Ding, Ankur Mallick, Chi Wang, Robert Sim, Subhabrata Mukherjee, Victor Ruhle, Laks V. S. Lakshmanan, Ahmed Hassan Awadallah

    Abstract: Large language models (LLMs) excel in most NLP tasks but also require expensive cloud servers for deployment due to their size, while smaller models that can be deployed on lower cost (e.g., edge) devices, tend to lag behind in terms of response quality. Therefore in this work we propose a hybrid inference approach which combines their respective strengths to save cost and maintain quality. Our ap… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted to ICLR 2024 (main conference)

  23. arXiv:2404.11955  [pdf

    cond-mat.mes-hall quant-ph

    Electrical control of a Kondo spin screening cloud

    Authors: Ngoc Han Tu, Donghoon Kim, Minsoo Kim, Jeongmin Shim, Ryo Ito, David Pomaranski, Ivan V. Borzenets, Arne Ludwig, Andreas D. Wieck, Heung-Sun Sim, Michihisa Yamamoto

    Abstract: In metals and semiconductors, an impurity spin is quantum entangled with and thereby screened by surrounding conduction electrons at low temperatures, called the Kondo screening cloud. Quantum confinement of the Kondo screening cloud in a region, called a Kondo box, with a length smaller than the original cloud extension length strongly deforms the screening cloud and provides a way of controlling… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  24. arXiv:2404.11099  [pdf

    cond-mat.mtrl-sci

    Interplay between magnetic and lattice excitations and emergent multiple phase transitions in MnPSe3-xSx

    Authors: Deepu Kumar, Nguyen The Hoang, Yumin Sim, Youngsu Choi, Kalaivanan Raju, Rajesh Kumar Ulaganathan, Raman Sankar, Maeng-Je Seong, Kwang-Yong Choi

    Abstract: The intricate interplay between spin and lattice degrees of freedom in two-dimensional magnetic materials plays a pivotal role in modifying their magnetic characteristics, engendering hybrid quasiparticles, and implementing functional devices. Herein, we present our comprehensive and in-depth investigations on magnetic and lattice excitations of MnPSe3-xSx (x = 0, 0.5, and 1.5) alloys, utilizing t… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  25. arXiv:2404.09173  [pdf, other

    cs.LG cs.AI cs.CL

    TransformerFAM: Feedback attention is working memory

    Authors: Dongseong Hwang, Weiran Wang, Zhuoyuan Huo, Khe Chai Sim, Pedro Moreno Mengibar

    Abstract: While Transformers have revolutionized deep learning, their quadratic attention complexity hinders their ability to process infinitely long inputs. We propose Feedback Attention Memory (FAM), a novel Transformer architecture that leverages a feedback loop to enable the network to attend to its own latent representations. This design fosters the emergence of working memory within the Transformer, a… ▽ More

    Submitted 7 May, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: 26 pages, 12 figures, 14 tables

  26. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  27. arXiv:2404.01752  [pdf, other

    cs.RO cs.AI cs.MA

    Safe Interval RRT* for Scalable Multi-Robot Path Planning in Continuous Space

    Authors: Joonyeol Sim, Joonkyung Kim, Changjoo Nam

    Abstract: In this paper, we consider the problem of Multi-Robot Path Planning (MRPP) in continuous space to find conflict-free paths. The difficulty of the problem arises from two primary factors. First, the involvement of multiple robots leads to combinatorial decision-making, which escalates the search space exponentially. Second, the continuous space presents potentially infinite states and actions. For… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  28. arXiv:2404.01676  [pdf, other

    cs.LG

    Incentives in Private Collaborative Machine Learning

    Authors: Rachael Hwee Ling Sim, Yehong Zhang, Trong Nghia Hoang, Xinyi Xu, Bryan Kian Hsiang Low, Patrick Jaillet

    Abstract: Collaborative machine learning involves training models on data from multiple parties but must incentivize their participation. Existing data valuation methods fairly value and reward each party based on shared data or model parameters but neglect the privacy risks involved. To address this, we introduce differential privacy (DP) as an incentive. Each party can select its required DP guarantee and… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted to NeurIPS 2023

  29. arXiv:2404.01396  [pdf, other

    quant-ph

    A case study against QSVT: assessment of quantum phase estimation improved by signal processing techniques

    Authors: Sean Greenaway, William Pol, Sukin Sim

    Abstract: In recent years, quantum algorithms have been proposed which use quantum phase estimation (QPE) coherently as a subroutine without measurement. In order to do this effectively, the routine must be able to distinguish eigenstates with success probability close to unity. In this paper, we provide the first systematic comparison between two approaches towards maximizing this success probability, one… ▽ More

    Submitted 17 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: References fixed and added

  30. arXiv:2404.00963  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Inversion and Tunability of Van Hove Singularities in $A$V$_{3}$Sb$_{5}$ ($A$ = K, Rb, and Cs) kagome metals

    Authors: Sangjun Sim, Min Yong Jeong, Hyunggeun Lee, Dong Hyun David Lee, Myung Joon Han

    Abstract: To understand the alkali-metal-dependent material properties of recently discovered $A$V$_{3}$Sb$_{5}$ ($A$ = K, Rb, and Cs), we conducted a detailed electronic structure analysis based on first-principles density functional theory calculations. Contrary to the case of $A$ = K and Rb, the energetic positions of the low-lying Van Hove singularities are reversed in CsV$_{3}$Sb$_{5}$, and the charact… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Physical Chemistry Chemical Physics (PCCP) in press

  31. arXiv:2404.00626  [pdf, other

    cs.CV

    Domain Generalizable Person Search Using Unreal Dataset

    Authors: Minyoung Oh, Duhyun Kim, Jae-Young Sim

    Abstract: Collecting and labeling real datasets to train the person search networks not only requires a lot of time and effort, but also accompanies privacy issues. The weakly-supervised and unsupervised domain adaptation methods have been proposed to alleviate the labeling burden for target datasets, however, their generalization capability is limited. We introduce a novel person search method based on the… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: AAAI2024 accepted

  32. arXiv:2403.19709  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.NE

    Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models

    Authors: Tsendsuren Munkhdalai, Youzheng Chen, Khe Chai Sim, Fadi Biadsy, Tara Sainath, Pedro Moreno Mengibar

    Abstract: Parameter efficient adaptation methods have become a key mechanism to train large pre-trained models for downstream tasks. However, their per-task parameter overhead is considered still high when the number of downstream tasks to adapt for is large. We introduce an adapter module that has a better efficiency in large scale multi-task adaptation scenario. Our adapter is hierarchical in terms of how… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 5 pages, 3 figures, 5 tables

  33. Including a Luminous Central Remnant in Radiative Transfer Simulations for Type Iax Supernovae

    Authors: F. P. Callan, S. A. Sim, C. E. Collins, L. J. Shingles, F. Lach, F. K. Roepke, R. Pakmor, M. Kromer, S. Srivastav

    Abstract: Type Iax supernovae (SNe Iax) are proposed to arise from deflagrations of Chandrasekhar mass white dwarfs (WDs). Previous deflagration simulations have achieved good agreement with the light curves and spectra of intermediate-luminosity and bright SNe Iax. However, the model light curves decline too quickly after peak, particularly in red optical and near-infrared (NIR) bands. Deflagration models… ▽ More

    Submitted 19 April, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    Comments: 17 pages, 6 figures. Lightcurves and spectra available at https://hesma.h-its.org

    Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 530, Issue 2, May 2024, Pages 1457 to 1473

  34. arXiv:2403.11793  [pdf, other

    cs.CL cs.AI cs.ET cs.SC

    Reasoning Abilities of Large Language Models: In-Depth Analysis on the Abstraction and Reasoning Corpus

    Authors: Seungpil Lee, Woochang Sim, Donghyeon Shin, Sanha Hwang, Wongyu Seo, Jiwon Park, Seokki Lee, Se** Kim, Sundong Kim

    Abstract: The existing methods for evaluating the inference abilities of Large Language Models (LLMs) have been results-centric, making it difficult to assess the inference process. We introduce a new approach using the Abstract and Reasoning Corpus (ARC) dataset to evaluate the inference and contextual understanding abilities of large language models in a process-centric manner. ARC demands rigorous logica… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 25 pages

  35. arXiv:2403.10948  [pdf, other

    cs.RO

    Real-to-Sim Adaptation via High-Fidelity Simulation to Control a Wheeled-Humanoid Robot with Unknown Dynamics

    Authors: Donghoon Baek, Youngwoo Sim, Amartya Purushottam, Saurabh Gupta, Joao Ramos

    Abstract: Model-based controllers using a linearized model around the system's equilibrium point is a common approach in the control of a wheeled humanoid due to their less computational load and ease of stability analysis. However, controlling a wheeled humanoid robot while it lifts an unknown object presents significant challenges, primarily due to the lack of knowledge in object dynamics. This paper pres… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  36. arXiv:2403.10022  [pdf, other

    cs.CV

    Lifelong Person Re-Identification with Backward-Compatibility

    Authors: Minyoung Oh, Jae-Young Sim

    Abstract: Lifelong person re-identification (LReID) assumes a practical scenario where the model is sequentially trained on continuously incoming datasets while alleviating the catastrophic forgetting in the old datasets. However, not only the training datasets but also the gallery images are incrementally accumulated, that requires a huge amount of computational complexity and storage space to extract the… ▽ More

    Submitted 17 March, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: 17 pages, 5 figures, 7 tables

  37. arXiv:2403.06381  [pdf, other

    cs.CV

    Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

    Authors: Yang Zhang, Teoh Tze Tzun, Lim Wei Hern, Tiviatis Sim, Kenji Kawaguchi

    Abstract: Recent advancements in diffusion models have notably improved the perceptual quality of generated images in text-to-image synthesis tasks. However, diffusion models often struggle to produce images that accurately reflect the intended semantics of the associated text prompts. We examine cross-attention layers in diffusion models and observe a propensity for these layers to disproportionately focus… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  38. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  39. Type Ia supernova explosion models are inherently multidimensional

    Authors: R. Pakmor, I. R. Seitenzahl, A. J. Ruiter, S. A. Sim, F. K. Roepke, S. Taubenberger, R. Bieri, S. Blondin

    Abstract: Theoretical and observational approaches to settling the important questions surrounding the progenitor systems and the explosion mechanism of normal Type Ia supernovae have thus far failed. With its unique capability to obtain continuous spectra through the near- and mid-infrared, JWST now offers completely new insights into Type Ia supernovae. In particular, observing them in the nebular phase a… ▽ More

    Submitted 26 April, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 7 pages, 2 figures, accepted by A&A, comments welcome

    Journal ref: A&A 686, A227 (2024)

  40. arXiv:2402.10517  [pdf, other

    cs.LG

    Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

    Authors: Yeonhong Park, Jake Hyun, SangLyul Cho, Bonggeun Sim, Jae W. Lee

    Abstract: Recently, considerable efforts have been directed towards compressing Large Language Models (LLMs), which showcase groundbreaking capabilities across diverse applications but entail significant deployment costs due to their large sizes. Meanwhile, much less attention has been given to mitigating the costs associated with deploying multiple LLMs of varying sizes despite its practical significance.… ▽ More

    Submitted 21 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: To appear at ICML 2024. Code is available at https://github.com/SNU-ARC/any-precision-llm

  41. arXiv:2402.07334  [pdf, other

    cs.CR cs.LG

    Differentially Private Training of Mixture of Experts Models

    Authors: Pierre Tholoniat, Huseyin A. Inan, Janardhan Kulkarni, Robert Sim

    Abstract: This position paper investigates the integration of Differential Privacy (DP) in the training of Mixture of Experts (MoE) models within the field of natural language processing. As Large Language Models (LLMs) scale to billions of parameters, leveraging expansive datasets, they exhibit enhanced linguistic capabilities and emergent abilities. However, this growth raises significant computational an… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: Preliminary work presented as a poster at the 5th AAAI Workshop on Privacy-Preserving Artificial Intelligence (PPAI 24)

  42. arXiv:2402.04850  [pdf, other

    hep-ph hep-th

    Muon $g-2$ and Proton Lifetime in SUSY SU(5) GUTs with Split Superpartners

    Authors: Seong-Sik Kim, Hyun Min Lee, Sung-Bo Sim

    Abstract: We consider the interplay of the muon $g-2$ anomaly and the proton decay in the SUSY SU(5) GUTs with generation-independent scalar soft masses. In these scenarios, we introduce a number of $\bf 5+{\bar 5}$ messenger fields with doublet-triplet splitting in general gauge mediation to transmit SUSY breaking to the visible sector by gauge loops. As a result, squarks and sleptons receive generation-in… ▽ More

    Submitted 29 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 35 pages, 8 figures, v2: typos fixed and reference updated, v3: version to appear in Phys. Rev. D

  43. arXiv:2402.02674  [pdf, ps, other

    astro-ph.HE

    Modeling X-ray and gamma-ray emission from redback pulsar binaries

    Authors: Minju Sim, Hongjun An, Zorawar Wadiasingh

    Abstract: We investigated the multiband emission from the pulsar binaries XSS J12270-4859, PSR J2039-5617, and PSR J2339-0533, which exhibit orbital modulation in the X-ray and gamma-ray bands. We constructed the sources' broadband spectral energy distributions and multiband orbital light curves by supplementing our X-ray measurements with published gamma-ray results, and we modeled the data using intra-bin… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: accepted for the publication of ApJ (20 pages, 11 figures)

  44. arXiv:2401.16559  [pdf, other

    cs.CV

    IEEE BigData 2023 Keystroke Verification Challenge (KVC)

    Authors: Giuseppe Stragapede, Ruben Vera-Rodriguez, Ruben Tolosana, Aythami Morales, Ivan DeAndres-Tame, Naser Damer, Julian Fierrez, Javier-Ortega Garcia, Nahuel Gonzalez, Andrei Shadrikov, Dmitrii Gordin, Leon Schmitt, Daniel Wimmer, Christoph Grossmann, Joerdis Krieger, Florian Heinz, Ron Krestel, Christoffer Mayer, Simon Haberl, Helena Gschrey, Yosuke Yamagishi, Sanjay Saha, Sanka Rasnayaka, Sandareka Wickramanayake, Terence Sim , et al. (4 additional authors not shown)

    Abstract: This paper describes the results of the IEEE BigData 2023 Keystroke Verification Challenge (KVC), that considers the biometric verification performance of Keystroke Dynamics (KD), captured as tweet-long sequences of variable transcript text from over 185,000 subjects. The data are obtained from two of the largest public databases of KD up to date, the Aalto Desktop and Mobile Keystroke Databases,… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 9 pages, 10 pages, 2 figures. arXiv admin note: text overlap with arXiv:2311.06000

  45. BugsInPy: A Database of Existing Bugs in Python Programs to Enable Controlled Testing and Debugging Studies

    Authors: Ratnadira Widyasari, Sheng Qin Sim, Camellia Lok, Haodi Qi, Jack Phan, Qi** Tay, Constance Tan, Fiona Wee, Jodie Ethelda Tan, Yuheng Yieh, Brian Goh, Ferdian Thung, Hong ** Kang, Thong Hoang, David Lo, Eng Lieh Ouh

    Abstract: The 2019 edition of Stack Overflow developer survey highlights that, for the first time, Python outperformed Java in terms of popularity. The gap between Python and Java further widened in the 2020 edition of the survey. Unfortunately, despite the rapid increase in Python's popularity, there are not many testing and debugging tools that are designed for Python. This is in stark contrast with the a… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Journal ref: Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (2020) 1556-1560

  46. arXiv:2401.15313  [pdf, other

    cs.RO cs.CV eess.SY math.OC

    Multi-Robot Relative Pose Estimation in SE(2) with Observability Analysis: A Comparison of Extended Kalman Filtering and Robust Pose Graph Optimization

    Authors: Kihoon Shin, Hyunjae Sim, Seungwon Nam, Yonghee Kim, Jae Hu, Kwang-Ki K. Kim

    Abstract: In this study, we address multi-robot localization issues, with a specific focus on cooperative localization and observability analysis of relative pose estimation. Cooperative localization involves enhancing each robot's information through a communication network and message passing. If odometry data from a target robot can be transmitted to the ego robot, observability of their relative pose es… ▽ More

    Submitted 4 February, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    Comments: 20 pages, 21 figures

    MSC Class: 93C85; 93E11; 93E24; 90C26; 93E10; 62M20;

  47. arXiv:2401.11840  [pdf, other

    cs.LG cs.AI

    Learning to Approximate Adaptive Kernel Convolution on Graphs

    Authors: Jaeyoon Sim, Sooyeon Jeon, InJun Choi, Guorong Wu, Won Hwa Kim

    Abstract: Various Graph Neural Networks (GNNs) have been successful in analyzing data in non-Euclidean spaces, however, they have limitations such as oversmoothing, i.e., information becomes excessively averaged as the number of hidden layers increases. The issue stems from the intrinsic formulation of conventional graph convolution where the nodal features are aggregated from a direct neighborhood per laye… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 15 pages, Accepted to AAAI 2024

  48. arXiv:2312.11413  [pdf, other

    cs.LG cs.AI

    DeRDaVa: Deletion-Robust Data Valuation for Machine Learning

    Authors: Xiao Tian, Rachael Hwee Ling Sim, Jue Fan, Bryan Kian Hsiang Low

    Abstract: Data valuation is concerned with determining a fair valuation of data from data sources to compensate them or to identify training examples that are the most or least useful for predictions. With the rising interest in personal data ownership and data protection regulations, model owners will likely have to fulfil more data deletion requests. This raises issues that have not been addressed by exis… ▽ More

    Submitted 21 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  49. arXiv:2312.07399  [pdf, other

    cs.CL cs.AI

    Large Language Models are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales

    Authors: Taeyoon Kwon, Kai Tzu-iunn Ong, Dong** Kang, Seungjun Moon, Jeong Ryong Lee, Dosik Hwang, Yongsik Sim, Beomseok Sohn, Dongha Lee, **young Yeo

    Abstract: Machine reasoning has made great progress in recent years owing to large language models (LLMs). In the clinical domain, however, most NLP-driven projects mainly focus on clinical classification or reading comprehension, and under-explore clinical reasoning for disease diagnosis due to the expensive rationale annotation with clinicians. In this work, we present a "reasoning-aware" diagnosis framew… ▽ More

    Submitted 10 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  50. arXiv:2312.06042  [pdf, other

    astro-ph.HE astro-ph.GA astro-ph.SR

    State-of-the-art simulations of line-driven accretion disc winds: realistic radiation-hydrodynamics leads to weaker outflows

    Authors: Nick Higginbottom, Nicolas Scepi, Christian Knigge, Knox S. Long, James H. Matthews, Stuart A. Sim

    Abstract: Disc winds are a common feature in accreting astrophysical systems on all scales. In active galactic nuclei (AGN) and accreting white dwarfs (AWDs), specifically, radiation pressure mediated by spectral lines is a promising mechanism for driving these outflows. Previous hydrodynamical simulations have largely supported this idea, but relied on highly approximate treatments of ionization and radiat… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in MNRAS. 14 pages, 10 figures + 3 figures in Appendix