Skip to main content

Showing 1–50 of 6,930 results for author: Lee, H

.
  1. arXiv:2407.01470  [pdf, other

    cs.CL

    DogeRM: Equip** Reward Models with Domain Knowledge through Model Merging

    Authors: Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Yun-Nung Chen

    Abstract: Reinforcement learning from human feedback (RLHF) is a popular strategy for aligning large language models (LLMs) with desired behaviors. Reward modeling is a crucial step in RLHF. However, collecting paired preference data for training reward models is often costly and time-consuming, especially for domain-specific preferences requiring expert annotation. To address this challenge, we propose the… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Preprint. Code will be released after the review results

  2. arXiv:2407.01197  [pdf, ps, other

    math.AP

    Uniform approximation by harmonic polynomials for solving the Dirichlet problem of Laplace's equation on a disk

    Authors: Taewan Kim, Haesung Lee

    Abstract: In this paper, we study the Dirichlet problem for Laplace's equation in an open disk. The uniqueness of solutions is ensured by the well-known weak maximum principle. We introduce a novel approach to demonstrate the existence of a solution using harmonic polynomials that converge uniformly to a solution. Specifically, we rigorously derive the convergence rate of the harmonic polynomials and show t… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 21 pages

    MSC Class: 31A05; 31A25 (Primary); 35C10; 35A23 (Secondary)

  3. arXiv:2407.00868  [pdf, other

    math.PR cs.DS

    Sampling from the Continuous Random Energy Model in Total Variation Distance

    Authors: Holden Lee, Qiang Wu

    Abstract: The continuous random energy model (CREM) is a toy model of spin glasses on $\{0,1\}^N$ that, in the limit, exhibits an infinitely hierarchical correlation structure. We give two polynomial-time algorithms to approximately sample from the Gibbs distribution of the CREM in the high-temperature regime, based on a Markov chain and a sequential sampler. The running time depends algebraically on the de… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  4. arXiv:2407.00842  [pdf, other

    cond-mat.soft physics.bio-ph

    Active Healing of Microtubule-Motor Networks

    Authors: Fan Yang, Shichen Liu, Heun ** Lee, Rob Phillips, Matt Thomson

    Abstract: Cytoskeletal networks have a self-healing property where networks can repair defects to maintain structural integrity. However, both the mechanisms and dynamics of healing remain largely unknown. Here we report an unexplored healing mechanism in microtubule-motor networks by active crosslinking. We directly generate network cracks using a light-controlled microtubule-motor system, and observe that… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  5. arXiv:2407.00517  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Local and Global Reciprocity in Orbital-Charge-Coupled Transport

    Authors: Dongwook Go, Tom S. Seifert, Tobias Kampfrath, Kazuya Ando, Hyun-Woo Lee, Yuriy Mokrousov

    Abstract: The coupled transport of the charge and orbital angular momentum of electrons is at the heart of orbitronics. Here, we discuss the reciprocal relation between the direct and inverse orbital Hall effects (OHEs) in thin films. We argue that the conventional orbital current is ill-defined as it does not satisfy the reciprocal relation owing to non-conservation of the orbital angular momentum. We reso… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  6. arXiv:2407.00355  [pdf, other

    physics.soc-ph cond-mat.stat-mech cs.SI

    Global decomposition of networks into multiple cores formed by local hubs

    Authors: Wonhee Jeong, Unjong Yu, Sang Hoon Lee

    Abstract: Networks are ubiquitous in various fields, representing systems where nodes and their interconnections constitute their intricate structures. We introduce a network decomposition scheme to reveal multiscale core-periphery structures lurking inside, using the concept of locally defined nodal hub centrality and edge-pruning techniques built upon it. We demonstrate that the hub-centrality-based edge… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 10 pages, 8 figures, 1 table

  7. arXiv:2407.00281  [pdf

    cond-mat.str-el cond-mat.mes-hall

    Distinguishing Surface and Bulk Electromagnetism via Their Dynamics in an Intrinsic Magnetic Topological Insulator

    Authors: Khanh Duy Nguyen, Woojoo Lee, Jianchen Dang, Tongyao Wu, Gabriele Berruto, Chenhui Yan, Chi Ian Jess Ip, Haoran Lin, Qiang Gao, Seng Huat Lee, Binghai Yan, Chaoxing Liu, Zhiqiang Mao, Xiao-Xiao Zhang, Shuolong Yang

    Abstract: The indirect exchange interaction between local magnetic moments via surface electrons has been long predicted to bolster the surface ferromagnetism in magnetic topological insulators (MTIs), which facilitates the quantum anomalous Hall effect. This unconventional effect is critical to determining the operating temperatures of future topotronic devices. However, the experimental confirmation of th… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: 19 pages, 4 figures

  8. arXiv:2406.19694  [pdf, other

    hep-ph astro-ph.CO

    Refined approaches in second leptogenesis for the baryon-lepton asymmetry discrepancy

    Authors: YeolLin ChoeJo, Kazuki Enomoto, Yechan Kim, Hye-Sung Lee

    Abstract: The temperature-dependent mass of the heavy neutrino can lead to the second leptogenesis occurring below the electroweak scale, potentially explaining the large discrepancy between baryon and lepton asymmetries. We investigate this scenario further, exploring the intricate interplay of the weak interaction processes within this framework. It includes notable shifts in the dominant decay channels o… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 29 pages, 4 figures

  9. arXiv:2406.19679  [pdf, other

    physics.ed-ph physics.soc-ph

    Statistical Analysis on Scale and Regional Distribution of Undergraduate Physics Programs in Korean Universities

    Authors: Gahyoun Gim, Sang Hoon Lee

    Abstract: We report on the temporal changes in undergraduate-level physics programs at Korean universities from 1915 to 2023 by analyzing data on physics-related departments and their students using basic statistics and the scaling theory of statistical physics. Our analysis reveals that the number of departments peaked around the turn of the 21st century, and it has been steadily decreasing ever since, wit… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 11 pages, in Korean language, 8 figures, 1 table, in Korean

    Journal ref: New Phys.: Sae Mulli 74, 610 (2024)

  10. arXiv:2406.19287  [pdf, other

    astro-ph.HE

    Isotropy of cosmic rays beyond $10^{20}$ eV favors their heavy mass composition

    Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He , et al. (118 additional authors not shown)

    Abstract: We report an estimation of the injected mass composition of ultra-high energy cosmic rays (UHECRs) at energies higher than 10 EeV. The composition is inferred from an energy-dependent sky distribution of UHECR events observed by the Telescope Array surface detector by comparing it to the Large Scale Structure of the local Universe. In the case of negligible extra-galactic magnetic fields the resul… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 8 pages, 3 figures, accepted for publication in PRL

  11. arXiv:2406.19286  [pdf, other

    astro-ph.HE

    Mass composition of ultra-high energy cosmic rays from distribution of their arrival directions with the Telescope Array

    Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He , et al. (118 additional authors not shown)

    Abstract: We use a new method to estimate the injected mass composition of ultrahigh cosmic rays (UHECRs) at energies higher than 10 EeV. The method is based on comparison of the energy-dependent distribution of cosmic ray arrival directions as measured by the Telescope Array experiment (TA) with that calculated in a given putative model of UHECR under the assumption that sources trace the large-scale struc… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 18 pages, 11 figures, accepted for publication in PRD

  12. arXiv:2406.18871  [pdf, other

    eess.AS cs.CL

    DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment

    Authors: Ke-Han Lu, Zhehuai Chen, Szu-Wei Fu, He Huang, Boris Ginsburg, Yu-Chiang Frank Wang, Hung-yi Lee

    Abstract: Recent speech language models (SLMs) typically incorporate pre-trained speech models to extend the capabilities from large language models (LLMs). In this paper, we propose a Descriptive Speech-Text Alignment approach that leverages speech captioning to bridge the gap between speech and text modalities, enabling SLMs to interpret and generate comprehensive natural language descriptions, thereby fa… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  13. arXiv:2406.18699  [pdf, ps, other

    cs.RO

    From Pixels to Torques with Linear Feedback

    Authors: Jeong Hun Lee, Sam Schoedel, Aditya Bhardwaj, Zachary Manchester

    Abstract: We demonstrate the effectiveness of simple observer-based linear feedback policies for "pixels-to-torques" control of robotic systems using only a robot-facing camera. Specifically, we show that the matrices of an image-based Luenberger observer (linear state estimator) for a "student" output-feedback policy can be learned from demonstration data provided by a "teacher" state-feedback policy via s… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Submitted to Workshop on Algorithmic Foundations of Robotics (WAFR) 2024

  14. arXiv:2406.18505  [pdf, other

    cs.LG cs.AI cs.CL cs.RO

    Mental Modeling of Reinforcement Learning Agents by Language Models

    Authors: Wenhao Lu, Xufeng Zhao, Josua Spisak, Jae Hee Lee, Stefan Wermter

    Abstract: Can emergent language models faithfully model the intelligence of decision-making agents? Though modern language models exhibit already some reasoning ability, and theoretically can potentially express any probable distribution over tokens, it remains underexplored how the world knowledge these pretrained models have memorized can be utilized to comprehend an agent's behaviour in the physical worl… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: https://lukaswill.github.io/

  15. arXiv:2406.17808  [pdf, other

    cs.CL cs.AI cs.LG

    Training-Free Exponential Extension of Sliding Window Context with Cascading KV Cache

    Authors: Jeffrey Willette, Heejun Lee, Youngwan Lee, Myeongjae Jeon, Sung Ju Hwang

    Abstract: The context window within a transformer provides a form of active memory for the current task, which can be useful for few-shot learning and conditional generation, both which depend heavily on previous context tokens. However, as the context length grows, the computational cost increases quadratically. Recent works have shown that saving a few initial tokens along with a fixed-sized sliding windo… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  16. arXiv:2406.17221  [pdf, other

    hep-th

    IR physics from the holographic RG flow

    Authors: Chanyong Park, Jung Hun Lee

    Abstract: Applying the holographic method, we investigate an RG flow and IR physics holographically when a two-dimensional conformal field theory is deformed by a relevant scalar operator. To do so, we first assume an RG flow from a UV to new IR CFT. On the dual gravity side, such an RG flow can be described by rolling down of a bulk scalar field from an unstable to stable equilibrium point. After consideri… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  17. arXiv:2406.17006  [pdf, other

    hep-ex

    Probing the nature of the $χ_{c1}(3872)$ state using radiative decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1094 additional authors not shown)

    Abstract: The radiative decays $χ_{c1}(3872)\rightarrowψ(2S)γ$ and $χ_{c1}(3872)\rightarrow J/ψγ$ are used to probe the~nature of the~$χ_{c1}(3872)$ state using proton-proton collision data collected with the LHCb detector, corresponding to an~integrated luminosity of~9fb$^{-1}$. Using the~$B^+\rightarrow χ_{c1}(3872)K^+$decay, the $χ_{c1}(3872)\rightarrow ψ(2S)γ$ process is observed for the first time and… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 31 pages, 2 figures. All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-015.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-015, CERN-EP-2025-157

  18. arXiv:2406.16727  [pdf, other

    math.AP

    Higher differentiability for the fractional $p$-Laplacian

    Authors: Lars Diening, Kyeongbae Kim, Ho-Sik Lee, Simon Nowak

    Abstract: In this work, we study the higher differentiability of solutions to the inhomogeneous fractional $p$-Laplace equation under different regularity assumptions on the data. In the superquadratic case, we extend and sharpen several previous results, while in the subquadratic regime our results constitute completely novel developments even in the homogeneous case. In particular, in the local limit our… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 48 pages

  19. arXiv:2406.16470  [pdf, other

    astro-ph.IM

    Project Management for Ground-based Telescope Array Development

    Authors: Ji Hoon Kim, Myungshin Im, Hyung Mok Lee, Seo-Won Chang

    Abstract: Center for the Gravitational-Wave Universe at Seoul National University has been operating its main observational facility, the 7-Dimensional Telescope (7DT) since October 2023. Located at El Sauce Observatory in Chilean Rio Hurtado Valley, 7DT consists of 20 50-cm telescopes equipped with 40 medium-band filters of 25 nm full width at half maximum along with a CMOS camera of 61 megapixels. 7DT pro… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 7 pages, 1 figures, Proceedings of the SPIE conference "Modeling, Systems Engineering, and Project Management for Astronomy XI" SPIE Astronomical Telescopes + Instrumentation 2024 (Paper No. 13099-77)

  20. arXiv:2406.16462  [pdf, other

    astro-ph.IM astro-ph.HE

    Introduction to the 7-Dimensional Telescope: Commissioning Procedures and Data Characteristics

    Authors: Ji Hoon Kim, Myungshin Im, Hyung Mok Lee, Seo-Won Chang, Hyeonho Choi, Gregory S. H. Paek

    Abstract: The 7-Dimensional Telescope (7DT) is a multi-telescope system designed to identify electromagnetic (EM) counterparts of gravitational-wave (GW) sources. Consisting of 20 50-cm telescopes along with 40 medium-band filters of 25 nm width, 7DT can obtain spectral map** images for a large field of view (~1.25 square degrees). Along with flexible operation, real-time data reduction, and analysis, the… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 11 pages, 3 figures, Proceedings of the SPIE conference 13094 Ground-based and Airborne Telescope X, SPIE Astronomical Telescopes + Instrumentations 2024 (Paper No. 13094-034)

  21. arXiv:2406.16003  [pdf

    physics.optics

    Unidirectional Chiral Emission via Twisted Bi-layer Metasurfaces

    Authors: Dmitrii Gromyko, Shu An, Sergey Gorelik, Jiahui Xu, Li Jun Lim, Henry Yit Loong Lee, Febiana Tjiptoharsono, Zhi-Kuang Tan, Cheng-Wei Qiu, Zhaogang Dong, Lin Wu

    Abstract: Controlling and channelling light emissions from unpolarized quantum dots into specific directions with chiral polarization remains a key challenge in modern photonics. Stacked metasurface designs offer a potential compact solution for chirality and directionality engineering. However, experimental observations of directional chiral radiation from resonant metasurfaces with quantum emitters remain… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 16 pages, 4 figures

  22. arXiv:2406.15635  [pdf, other

    cs.LG cs.CR cs.CV

    DataFreeShield: Defending Adversarial Attacks without Training Data

    Authors: Hyeyoon Lee, Kanghyun Choi, Dain Kwon, Sunjong Park, Mayoore Selvarasa Jaiswal, Noseong Park, Jonghyun Choi, **ho Lee

    Abstract: Recent advances in adversarial robustness rely on an abundant set of training data, where using external or additional datasets has become a common setting. However, in real life, the training data is often kept private for security and privacy issues, while only the pretrained weight is available to the public. In such scenarios, existing methods that assume accessibility to the original data bec… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  23. arXiv:2406.15481  [pdf, other

    cs.AI cs.CL

    CSRT: Evaluation and Analysis of LLMs using Code-Switching Red-Teaming Dataset

    Authors: Haneul Yoo, Yong** Yang, Hwaran Lee

    Abstract: Recent studies in large language models (LLMs) shed light on their multilingual ability and safety, beyond conventional tasks in language modeling. Still, current benchmarks reveal their inability to comprehensively evaluate them and are excessively dependent on manual annotations. In this paper, we introduce code-switching red-teaming (CSRT), a simple yet effective red-teaming technique that simu… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  24. arXiv:2406.15361  [pdf, other

    math.GT

    Minimal grid diagrams of the prime alternating knots with 13 crossings

    Authors: Hwa Jeong Lee, Alexander Stoimenow, Gyo Taek **

    Abstract: A knot is a closed loop in space without self-intersection. Two knots are equivalent if there is a self homeomorphism of space bringing one onto the other. An arc presentation is an embedding of a knot in the union of finitely many half planes with a common boundary line such that each half plane contains a simple arc of the knot. The minimal number of such half planes among all arc presentations… ▽ More

    Submitted 31 March, 2024; originally announced June 2024.

    Comments: 76 pages, 4 figures, 4878 grid diagrams

    MSC Class: 57K10

  25. arXiv:2406.15325  [pdf, other

    cs.AI cs.SE

    Bug In the Code Stack: Can LLMs Find Bugs in Large Python Code Stacks

    Authors: Hokyung Lee, Sumanyu Sharma, Bing Hu

    Abstract: Recent research in Needle-in-a-Haystack (NIAH) benchmarks has explored the capabilities of Large Language Models (LLMs) in retrieving contextual information from large text documents. However, as LLMs become increasingly integrated into software development processes, it is crucial to evaluate their performance in code-based environments. As LLMs are further developed for program synthesis, we nee… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 8 pages

    MSC Class: 68T50 ACM Class: I.2.7; D.2.5

  26. arXiv:2406.13924  [pdf, other

    astro-ph.GA

    Impact of Internal Dust Correction on the Stellar Populations of Galaxies Estimated Using the Full Spectrum Fitting

    Authors: Joon Hyeop Lee, Hyun** Jeong, Jiwon Chung, Mina Pak, Sree Oh

    Abstract: Full spectrum fitting is a powerful tool for estimating the stellar populations of galaxies, but the fitting results are often significantly influenced by internal dust attenuation. For understanding how the choice of the internal dust correction method affects the detailed stellar populations estimated from the full spectrum fitting, we analyze the Sydney-Australian Astronomical Observatory Multi… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 10 pages, 8 figures, accepted for publication in Journal of the Korean Astronomical Society

  27. arXiv:2406.13342  [pdf, other

    cs.CL cs.AI

    ZeroDL: Zero-shot Distribution Learning for Text Clustering via Large Language Models

    Authors: Hwiyeol Jo, Hyunwoo Lee, Taiwoo Park

    Abstract: The recent advancements in large language models (LLMs) have brought significant progress in solving NLP tasks. Notably, in-context learning (ICL) is the key enabling mechanism for LLMs to understand specific tasks and gras** nuances. In this paper, we propose a simple yet effective method to contextualize a task toward a specific LLM, by (1) observing how a given LLM describes (all or a part of… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: ARR Submitted

  28. arXiv:2406.13309  [pdf, other

    math.GT

    The Powell Conjecture in genus four

    Authors: Sangbum Cho, Yuya Koda, Jung Hoon Lee

    Abstract: The Powell Conjecture states that four specific elements suffice to generate the Goeritz group of the Heegaard splitting of the $3$-sphere. We show that this conjecture is true when the genus of the splitting is four.

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures

    MSC Class: 57K30

  29. arXiv:2406.12254  [pdf, other

    eess.IV cs.CV

    Enhancing Single-Slice Segmentation with 3D-to-2D Unpaired Scan Distillation

    Authors: Xin Yu, Qi Yang, Han Liu, Ho Hin Lee, Yucheng Tang, Lucas W. Remedios, Michael Kim, Shunxing Bao, Ann Xenobia Moore, Luigi Ferrucci, Bennett A. Landman

    Abstract: 2D single-slice abdominal computed tomography (CT) enables the assessment of body habitus and organ health with low radiation exposure. However, single-slice data necessitates the use of 2D networks for segmentation, but these networks often struggle to capture contextual information effectively. Consequently, even when trained on identical datasets, 3D networks typically achieve superior segmenta… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  30. arXiv:2406.11579  [pdf, other

    cs.CV

    Duoduo CLIP: Efficient 3D Understanding with Multi-View Images

    Authors: Han-Hung Lee, Yiming Zhang, Angel X. Chang

    Abstract: We introduce Duoduo CLIP, a model for 3D representation learning that learns shape encodings from multi-view images instead of point-clouds. The choice of multi-view images allows us to leverage 2D priors from off-the-shelf CLIP models to facilitate fine-tuning with 3D data. Our approach not only shows better generalization compared to existing point cloud methods, but also reduces GPU requirement… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  31. arXiv:2406.11311  [pdf, other

    cs.CV

    Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection

    Authors: Yunsong Wang, Na Zhao, Gim Hee Lee

    Abstract: The use of synthetic data in indoor 3D object detection offers the potential of greatly reducing the manual labor involved in 3D annotations and training effective zero-shot detectors. However, the complicated domain shifts across syn-to-real indoor datasets remains underexplored. In this paper, we propose a novel Object-wise Hierarchical Domain Alignment (OHDA) framework for syn-to-real unsupervi… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  32. arXiv:2406.11283  [pdf, other

    cs.CV

    Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding

    Authors: Yunsong Wang, Na Zhao, Gim Hee Lee

    Abstract: The field of self-supervised 3D representation learning has emerged as a promising solution to alleviate the challenge presented by the scarcity of extensive, well-annotated datasets. However, it continues to be hindered by the lack of diverse, large-scale, real-world 3D scene datasets for source data. To address this shortfall, we propose Generalizable Representation Learning (GRL), where we devi… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  33. arXiv:2406.11271  [pdf, other

    cs.CV cs.LG

    MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

    Authors: Anas Awadalla, Le Xue, Oscar Lo, Manli Shu, Hannah Lee, Etash Kumar Guha, Matt Jordan, Sheng Shen, Mohamed Awadalla, Silvio Savarese, Caiming Xiong, Ran Xu, Ye** Choi, Ludwig Schmidt

    Abstract: Multimodal interleaved datasets featuring free-form interleaved sequences of images and text are crucial for training frontier large multimodal models (LMMs). Despite the rapid progression of open-source LMMs, there remains a pronounced scarcity of large-scale, diverse open-source multimodal interleaved datasets. In response, we introduce MINT-1T, the most extensive and diverse open-source Multimo… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  34. arXiv:2406.11248  [pdf

    eess.AS cs.AI cs.SD

    Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 9

    Authors: Do Hyun Lee, Yoonah Song, Hong Kook Kim

    Abstract: We present a prompt-engineering-based text-augmentation approach applied to a language-queried audio source separation (LASS) task. To enhance the performance of LASS, the proposed approach utilizes large language models (LLMs) to generate multiple captions corresponding to each sentence of the training dataset. To this end, we first perform experiments to identify the most effective prompts for c… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: DCASE 2024 Challenge Task 9, 4 pages

  35. arXiv:2406.11215  [pdf, ps, other

    math.AP

    Long-time behavior toward composite wave of shocks for 3D barotropic navier-stokes system

    Authors: Moon-** Kang, Hobin Lee

    Abstract: We consider the barotropic Navier-Stokes system in three space dimensions with periodic boundary condition in the transversal direction. We show the long-time behavior of the 3D barotropic Navier-Stokes flow perturbed from a composition of two shock waves with suitably small amplitudes. We prove that the perturbed Navier-Stokes flow converges, uniformly in space, towards a composition of two plana… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  36. arXiv:2406.11130  [pdf, other

    cs.CL

    Dynamic Order Template Prediction for Generative Aspect-Based Sentiment Analysis

    Authors: Yonghyun Jun, Hwanhee Lee

    Abstract: Aspect-based sentiment analysis (ABSA) assesses sentiments towards specific aspects within texts, resulting in detailed sentiment tuples. Previous ABSA models often use static templates to predict all of the elements in the tuples, and these models often fail to accurately capture dependencies between elements. Multi-view prompting method improves the performance of ABSA by predicting tuples with… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 10 pages, 7 figures

  37. arXiv:2406.11065  [pdf, other

    cs.CL

    Can LLMs Understand the Implication of Emphasized Sentences in Dialogue?

    Authors: Guan-Ting Lin, Hung-yi Lee

    Abstract: Emphasis is a crucial component in human communication, which indicates the speaker's intention and implication beyond pure text in dialogue. While Large Language Models (LLMs) have revolutionized natural language processing, their ability to understand emphasis in dialogue remains unclear. This paper introduces Emphasized-Talk, a benchmark with emphasis-annotated dialogue samples capturing the im… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 10 pages

  38. arXiv:2406.11064  [pdf, other

    eess.AS cs.SD

    Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech

    Authors: Guan-Ting Lin, Wei-** Huang, Hung-yi Lee

    Abstract: Deep learning-based end-to-end automatic speech recognition (ASR) has made significant strides but still struggles with performance on out-of-domain (OOD) samples due to domain shifts in real-world scenarios. Test-Time Adaptation (TTA) methods address this issue by adapting models using test samples at inference time. However, current ASR TTA methods have largely focused on non-continual TTA, whic… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 13 pages

  39. arXiv:2406.10923  [pdf, other

    cs.CV cs.CL cs.LG

    Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies

    Authors: Hung-Ting Su, Chun-Tong Chao, Ya-Ching Hsu, Xudong Lin, Yulei Niu, Hung-Yi Lee, Winston H. Hsu

    Abstract: Large Language Models (LLMs) have demonstrated effectiveness not only in language tasks but also in video reasoning. This paper introduces a novel dataset, Tropes in Movies (TiM), designed as a testbed for exploring two critical yet previously overlooked video reasoning skills: (1) Abstract Perception: understanding and tokenizing abstract concepts in videos, and (2) Long-range Compositional Reaso… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Project page: https://ander1119.github.io/TiM

  40. arXiv:2406.10207  [pdf, other

    astro-ph.EP

    Latitudinal Asymmetry in the Dayside Atmosphere of WASP-43b

    Authors: Ryan C. Challener, Zafar Rustamkulov, Elspeth K. H. Lee, Nikole Lewis, David K. Sing, Stephan M. Birkmann, Nicolas Crouzet, Néstor Espinoza, Elena Manjavacas, Natalia Oliveros-Gomez, Jeff A. Valenti, **gxuan Yang

    Abstract: We present two-dimensional near-infrared temperature maps of the canonical hot Jupiter WASP-43b using a phase-curve observation with JWST NIRSpec/G395H. From the white-light planetary transit, we improve constraints on the planet's orbital parameters and measure a planet-to-star radius ratio of $0.15883^{+0.00056}_{-0.00053}$. Using the white-light phase curve, we measure a longitude of maximum br… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 15 pages, 8 figures. In review at ApJ Letters

  41. arXiv:2406.10083  [pdf, other

    cs.CL cs.SD eess.AS

    On the Evaluation of Speech Foundation Models for Spoken Language Understanding

    Authors: Siddhant Arora, Ankita Pasad, Chung-Ming Chien, Jionghao Han, Roshan Sharma, Jee-weon Jung, Hira Dhamyal, William Chen, Suwon Shon, Hung-yi Lee, Karen Livescu, Shinji Watanabe

    Abstract: The Spoken Language Understanding Evaluation (SLUE) suite of benchmark tasks was recently introduced to address the need for open resources and benchmarking of complex spoken language understanding (SLU) tasks, including both classification and sequence generation tasks, on natural speech. The benchmark has demonstrated preliminary success in using pre-trained speech foundation models (SFM) for th… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL Findings 2024

  42. arXiv:2406.09988  [pdf, other

    cs.AI cs.CL cs.RO

    Details Make a Difference: Object State-Sensitive Neurorobotic Task Planning

    Authors: Xiaowen Sun, Xufeng Zhao, Jae Hee Lee, Wenhao Lu, Matthias Kerzel, Stefan Wermter

    Abstract: The state of an object reflects its current status or condition and is important for a robot's task planning and manipulation. However, detecting an object's state and generating a state-sensitive plan for robots is challenging. Recently, pre-trained Large Language Models (LLMs) and Vision-Language Models (VLMs) have shown impressive capabilities in generating plans. However, to the best of our kn… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  43. arXiv:2406.09948  [pdf, other

    cs.CL

    BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

    Authors: Junho Myung, Nayeon Lee, Yi Zhou, Jiho **, Rifki Afina Putri, Dimosthenis Antypas, Hsuvas Borkakoty, Eunsu Kim, Carla Perez-Almendros, Abinew Ali Ayele, Víctor Gutiérrez-Basulto, Yazmín Ibáñez-García, Hwaran Lee, Shamsuddeen Hassan Muhammad, Kiwoong Park, Anar Sabuhi Rzayev, Nina White, Seid Muhie Yimam, Mohammad Taher Pilehvar, Nedjma Ousidhoum, Jose Camacho-Collados, Alice Oh

    Abstract: Large language models (LLMs) often lack culture-specific knowledge of daily life, especially across diverse regions and non-English languages. Existing benchmarks for evaluating LLMs' cultural sensitivities are limited to a single language or collected from online sources such as Wikipedia, which do not reflect the mundane everyday lifestyles of diverse regions. That is, information about the food… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  44. arXiv:2406.09894  [pdf, other

    eess.AS cs.SD

    Period Singer: Integrating Periodic and Aperiodic Variational Autoencoders for Natural-Sounding End-to-End Singing Voice Synthesis

    Authors: Taewoo Kim, Choongsang Cho, Young Han Lee

    Abstract: In this paper, we present Period Singer, a novel end-to-end singing voice synthesis (SVS) model that utilizes variational inference for periodic and aperiodic components, aimed at producing natural-sounding waveforms. Recent end-to-end SVS models have demonstrated the capability of synthesizing high-fidelity singing voices. However, owing to deterministic pitch conditioning, they do not fully addr… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  45. arXiv:2406.09827  [pdf, other

    cs.CL cs.CV cs.DC cs.LG

    HiP Attention: Sparse Sub-Quadratic Attention with Hierarchical Attention Pruning

    Authors: Heejun Lee, Geon Park, Youngwan Lee, **a Kim, Wonyoung Jeong, Myeongjae Jeon, Sung Ju Hwang

    Abstract: In modern large language models (LLMs), increasing sequence lengths is a crucial challenge for enhancing their comprehension and coherence in handling complex tasks such as multi-modal question answering. However, handling long context sequences with LLMs is prohibitively costly due to the conventional attention mechanism's quadratic time and space complexity, and the context window size is limite… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 26 pages, 15 figures

  46. arXiv:2406.09698  [pdf, other

    physics.ins-det hep-ex

    Projected background and sensitivity of AMoRE-II

    Authors: A. Agrawal, V. V. Alenkov, P. Aryal, J. Beyer, B. Bhandari, R. S. Boiko, K. Boonin, O. Buzanov, C. R. Byeon, N. Chanthima, M. K. Cheoun, J. S. Choe, Seonho Choi, S. Choudhury, J. S. Chung, F. A. Danevich, M. Djamal, D. Drung, C. Enss, A. Fleischmann, A. M. Gangapshev, L. Gastaldo, Y. M. Gavrilyuk, A. M. Gezhaev, O. Gileva , et al. (81 additional authors not shown)

    Abstract: AMoRE-II aims to search for neutrinoless double beta decay with an array of 423 Li$_2$$^{100}$MoO$_4$ crystals operating in the cryogenic system as the main phase of the Advanced Molybdenum-based Rare process Experiment (AMoRE). AMoRE has been planned to operate in three phases: AMoRE-pilot, AMoRE-I, and AMoRE-II. AMoRE-II is currently being installed at the Yemi Underground Laboratory, located ap… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  47. arXiv:2406.09641  [pdf, other

    astro-ph.EP

    Phase-resolving the absorption signatures of water and carbon monoxide in the atmosphere of the ultra-hot Jupiter WASP-121b with GEMINI-S/IGRINS

    Authors: Joost P. Wardenier, Vivien Parmentier, Michael R. Line, Megan Weiner Mansfield, Xianyu Tan, Shang-Min Tsai, Jacob L. Bean, Jayne L. Birkby, Matteo Brogi, Jean-Michel Désert, Siddharth Gandhi, Elspeth K. H. Lee, Colette I. Levens, Lorenzo Pino, Peter C. B. Smith

    Abstract: Ultra-hot Jupiters are among the best targets for atmospheric characterization at high spectral resolution. Resolving their transmission spectra as a function of orbital phase offers a unique window into the 3D nature of these objects. In this work, we present three transits of the ultra-hot Jupiter WASP-121b observed with Gemini-S/IGRINS. For the first time, we measure the phase-dependent absorpt… ▽ More

    Submitted 19 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 24 pages, 16 figures, resubmitted to PASP (made a few minor changes to the text w.r.t. v1)

  48. arXiv:2406.08796  [pdf, other

    cs.CL

    Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning

    Authors: Janghoon Han, Changho Lee, Joongbo Shin, Stanley Jungkyu Choi, Honglak Lee, Kynghoon Bae

    Abstract: Instruction tuning has emerged as a powerful technique, significantly boosting zero-shot performance on unseen tasks. While recent work has explored cross-lingual generalization by applying instruction tuning to multilingual models, previous studies have primarily focused on English, with a limited exploration of non-English tasks. For an in-depth exploration of cross-lingual generalization in ins… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024 (Camera-ready), by Janghoon Han and Changho Lee, with equal contribution

  49. arXiv:2406.08747  [pdf, other

    cs.CL

    StreamBench: Towards Benchmarking Continuous Improvement of Language Agents

    Authors: Cheng-Kuang Wu, Zhi Rui Tam, Chieh-Yen Lin, Yun-Nung Chen, Hung-yi Lee

    Abstract: Recent works have shown that large language model (LLM) agents are able to improve themselves from experience, which is an important ability for continuous enhancement post-deployment. However, existing benchmarks primarily evaluate their innate capabilities and do not assess their ability to improve over time. To address this gap, we introduce StreamBench, a pioneering benchmark designed to evalu… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  50. arXiv:2406.08641  [pdf, ps, other

    cs.SD cs.CL eess.AS

    ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets

    Authors: Jiatong Shi, Shih-Heng Wang, William Chen, Martijn Bartelds, Vanya Bannihatti Kumar, **chuan Tian, Xuankai Chang, Dan Jurafsky, Karen Livescu, Hung-yi Lee, Shinji Watanabe

    Abstract: ML-SUPERB evaluates self-supervised learning (SSL) models on the tasks of language identification and automatic speech recognition (ASR). This benchmark treats the models as feature extractors and uses a single shallow downstream model, which can be fine-tuned for a downstream task. However, real-world use cases may require different configurations. This paper presents ML-SUPERB~2.0, which is a ne… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024