
Showing 101–150 of 8,263 results for author: Lee, J

  1. arXiv:2406.02398

    cs.SE

    MOTIF: A tool for Mutation Testing with Fuzzing

    Authors: Jaekwon Lee, Enrico Viganò, Fabrizio Pastore, Lionel Briand

    Abstract: Mutation testing consists of generating test cases that detect faults injected into software (generating mutants) which its original test suite could not. By running such an augmented set of test cases, it may discover actual faults that may have gone unnoticed with the original test suite. It is thus a desired practice for embedded software running in safety-critical cyber-physical systems (CPS).…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: ICST2024. arXiv admin note: text overlap with arXiv:2308.07949

  2. arXiv:2406.02030

    cs.CL cs.AI

    Multimodal Reasoning with Multimodal Knowledge Graph

    Authors: Junlin Lee, Yequan Wang, Jing Li, Min Zhang

    Abstract: Multimodal reasoning with large language models (LLMs) often suffers from hallucinations and the presence of deficient or outdated knowledge within LLMs. Some approaches have sought to mitigate these issues by employing textual knowledge graphs, but their singular modality of knowledge limits comprehensive cross-modal understanding. In this paper, we propose the Multimodal Reasoning with Multimoda…

    Submitted 4 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024 (Main Conference)

  3. arXiv:2406.01947

    cs.RO cs.LG

    Data-Driven Approaches for Thrust Prediction in Underwater Flapping Fin Propulsion Systems

    Authors: Julian Lee, Kamal Viswanath, Alisha Sharma, Jason Geder, Ravi Ramamurti, Marius D. Pruessner

    Abstract: Flapping-fin underwater vehicle propulsion systems provide an alternative to propeller-driven systems in situations that involve a constrained environment or require high maneuverability. Testing new configurations through experiments or high-fidelity simulations is an expensive process, slowing development of new systems. This is especially true when introducing new fin geometries. In thi…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages, 11 figures, AAAI 2021 Fall Series Symposium on Science-Guided AI

  4. CAFO: Feature-Centric Explanation on Time Series Classification

    Authors: Jaeho Kim, Seok-Ju Hahn, Yoontae Hwang, Junghye Lee, Seulki Lee

    Abstract: In multivariate time series (MTS) classification, finding the important features (e.g., sensors) for model performance is crucial yet challenging due to the complex, high-dimensional nature of MTS data, intricate temporal dynamics, and the necessity for domain-specific interpretations. Current explanation methods for MTS mostly focus on time-centric explanations, apt for pinpointing important time…

    Submitted 11 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to KDD 2024 Research Track

  5. arXiv:2406.01647

    cs.LG cs.AI

    An Analysis under a Unified Fomulation of Learning Algorithms with Output Constraints

    Authors: Mooho Song, Jay-Yoon Lee

    Abstract: Neural networks (NN) perform well in diverse tasks, but sometimes produce nonsensical results to humans. Most NN models "solely" learn from (input, output) pairs, occasionally conflicting with human knowledge. Many studies indicate injecting human knowledge by reducing output constraints during training can improve model performance and reduce constraint violations. While there have been several a…

    Submitted 3 June, 2024; originally announced June 2024.

  6. arXiv:2406.01581

    cs.LG stat.ML

    Neural network learns low-dimensional polynomials with SGD near the information-theoretic limit

    Authors: Jason D. Lee, Kazusato Oko, Taiji Suzuki, Denny Wu

    Abstract: We study the problem of gradient descent learning of a single-index target function $f_*(\boldsymbol{x}) = \textstyle\sigma_*\left(\langle\boldsymbol{x},\boldsymbol{\theta}\rangle\right)$ under isotropic Gaussian data in $\mathbb{R}^d$, where the link function $\sigma_*:\mathbb{R}\to\mathbb{R}$ is an unknown degree $q$ polynomial with information exponent $p$ (defined as the lowest degree in the Hermite expansion)…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 34 pages

  7. arXiv:2406.01152

    cs.RO

    Learning-based legged locomotion; state of the art and future perspectives

    Authors: Sehoon Ha, Joonho Lee, Michiel van de Panne, Zhaoming Xie, Wenhao Yu, Majid Khadiv

    Abstract: Legged locomotion holds the premise of universal mobility, a critical capability for many real-world robotic applications. Both model-based and learning-based approaches have advanced the field of legged locomotion in the past three decades. In recent years, however, a number of factors have dramatically accelerated progress in learning-based methods, including the rise of deep learning, rapid pro…

    Submitted 3 June, 2024; originally announced June 2024.

  8. arXiv:2406.01007

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive…

    Submitted 3 June, 2024; originally announced June 2024.

  9. arXiv:2406.00905

    hep-ex

    Exploration of mass splitting and muon/tau mixing parameters for an eV-scale sterile neutrino with IceCube

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi, et al. (400 additional authors not shown)

    Abstract: We present the first three-parameter fit to a 3+1 sterile neutrino model using 7.634 years of data from the IceCube Neutrino Observatory on $\nu_\mu+\overline{\nu}_\mu$ charged-current interactions in the energy range 500-9976 GeV. Our analysis is sensitive to the mass-squared splitting between the heaviest and lightest mass state ($\Delta m_{41}^2$), the mixing matrix element connecting muon flavor to the fourth…

    Submitted 2 June, 2024; originally announced June 2024.

  10. arXiv:2406.00669

    eess.SY econ.GN

    Multi-technology co-optimization approach for sustainable hydrogen and electricity supply chains considering variability and demand scale

    Authors: Sunwoo Kim, Joungho Park, Jay H. Lee

    Abstract: In the pursuit of a carbon-neutral future, hydrogen emerges as a pivotal element, serving as a carbon-free energy carrier and feedstock. As efforts to decarbonize sectors such as heating and transportation intensify, understanding and navigating through the dynamics of hydrogen demand expansion becomes critical. Transitioning to hydrogen economy is complicated by varying regional scales and types…

    Submitted 2 June, 2024; originally announced June 2024.

  11. arXiv:2406.00665   

    econ.GN eess.SY

    Integrating solid direct air capture systems with green hydrogen production: Economic synergy of sector coupling

    Authors: Sunwoo Kim, Joungho Park, Jay H. Lee

    Abstract: In the global pursuit of sustainable energy solutions, mitigating carbon dioxide (CO2) emissions stands as a pivotal challenge. With escalating atmospheric CO2 levels, the imperative of direct air capture (DAC) systems becomes evident. Simultaneously, green hydrogen (GH) emerges as a pivotal medium for renewable energy. Nevertheless, the substantial expenses associated with these technologies impe…

    Submitted 28 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: Some of the results of our previous preprint paper are flawed, and we are withdrawing them to prevent the spread of incorrect knowledge

  12. arXiv:2406.00549

    stat.ME cs.AI

    Zero Inflation as a Missing Data Problem: a Proxy-based Approach

    Authors: Trung Phung, Jaron J. R. Lee, Opeyemi Oladapo-Shittu, Eili Y. Klein, Ayse Pinar Gurses, Susan M. Hannum, Kimberly Weems, Jill A. Marsteller, Sara E. Cosgrove, Sara C. Keller, Ilya Shpitser

    Abstract: A common type of zero-inflated data has certain true values incorrectly replaced by zeros due to data recording conventions (rare outcomes assumed to be absent) or details of data recording equipment (e.g. artificial zeros in gene expression data). Existing methods for zero-inflated data either fit the observed data likelihood via parametric mixture models that explicitly represent excess zeros,…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 28 pages, 8 figures, accepted for the 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)

  13. arXiv:2406.00360

    cs.AR

    L2R-CIPU: Efficient CNN Computation with Left-to-Right Composite Inner Product Units

    Authors: Malik Zohaib Nisar, Mohammad Sohail Ibrahim, Muhammad Usman, Jeong-A Lee

    Abstract: This paper proposes a composite inner-product computation unit based on left-to-right (LR) arithmetic for the acceleration of convolution neural networks (CNN) on hardware. The efficacy of the proposed L2R-CIPU method has been shown on the VGG-16 network, and assessment is done on various performance metrics. The L2R-CIPU design achieves 1.06x to 6.22x greater performance, 4.8x to 15x more TOPS/W,…

    Submitted 10 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  14. arXiv:2406.00014

    cs.DB cs.AI cs.CL cs.IR

    KU-DMIS at EHRSQL 2024:Generating SQL query via question templatization in EHR

    Authors: Hajung Kim, Chanhwi Kim, Hoonick Lee, Kyochul Jang, Jiwoo Lee, Kyungjae Lee, Gangwoo Kim, Jaewoo Kang

    Abstract: Transforming natural language questions into SQL queries is crucial for precise data retrieval from electronic health record (EHR) databases. A significant challenge in this process is detecting and rejecting unanswerable questions that request information beyond the database's scope or exceed the system's capabilities. In this paper, we introduce a novel text-to-SQL framework that robustly handle…

    Submitted 19 June, 2024; v1 submitted 21 May, 2024; originally announced June 2024.

    Comments: Published at ClinicalNLP workshop @ NAACL 2024

  15. arXiv:2405.20821

    cs.LG cs.DC stat.ML

    Pursuing Overall Welfare in Federated Learning through Sequential Decision Making

    Authors: Seok-Ju Hahn, Gi-Soo Kim, Junghye Lee

    Abstract: In traditional federated learning, a single global model cannot perform equally well for all clients. Therefore, the need to achieve the client-level fairness in federated system has been emphasized, which can be realized by modifying the static aggregation scheme for updating the global model to an adaptive one, in response to the local signals of the participating clients. Our work reveals that…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted at ICML 2024

  16. arXiv:2405.20720

    cs.CV

    Power of Cooperative Supervision: Multiple Teachers Framework for Enhanced 3D Semi-Supervised Object Detection

    Authors: Jin-Hee Lee, Jae-Keun Lee, Je-Seok Kim, Soon Kwon

    Abstract: To ensure safe urban driving for autonomous platforms, it is crucial not only to develop high-performance object detection techniques but also to establish a diverse and representative dataset that captures various urban environments and object characteristics. To address these two issues, we have constructed a multi-class 3D LiDAR dataset reflecting diverse urban environments and object character…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: under review

  17. arXiv:2405.20630

    cs.LG

    Stochastic Optimal Control for Diffusion Bridges in Function Spaces

    Authors: Byoungwoo Park, Jungwon Choi, Sungbin Lim, Juho Lee

    Abstract: Recent advancements in diffusion models and diffusion bridges primarily focus on finite-dimensional spaces, yet many real-world problems necessitate operations in infinite-dimensional function spaces for more natural and interpretable formulations. In this paper, we present a theory of stochastic optimal control (SOC) tailored to infinite-dimensional spaces, aiming to extend diffusion-based algori…

    Submitted 2 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  18. arXiv:2405.20605

    cs.LG cs.AI cs.CV

    Searching for internal symbols underlying deep learning

    Authors: Jung H. Lee, Sujith Vijayan

    Abstract: Deep learning (DL) enables deep neural networks (DNNs) to automatically learn complex tasks or rules from given examples without instructions or guiding principles. As we do not engineer DNNs' functions, it is extremely difficult to diagnose their decisions, and multiple lines of studies proposed to explain principles of DNNs/DL operations. Notably, one line of studies suggests that DNNs may learn…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 10 pages, 7 figures, 3 tables and Appendix

  19. arXiv:2405.20233

    cs.LG cs.AI

    Grokfast: Accelerated Grokking by Amplifying Slow Gradients

    Authors: Jaerin Lee, Bong Gyun Kang, Kihoon Kim, Kyoung Mu Lee

    Abstract: One puzzling artifact in machine learning dubbed grokking is where delayed generalization is achieved tenfolds of iterations after near perfect overfitting to the training data. Focusing on the long delay itself on behalf of machine learning practitioners, our goal is to accelerate generalization of a model under grokking phenomenon. By regarding a series of gradients of a parameter over training…

    Submitted 5 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 17 pages, 13 figures. Typo fixed. Project page: https://jaerinlee.com/research/grokfast

  20. arXiv:2405.20165

    stat.ML cs.LG

    Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation

    Authors: Wooseong Cho, Taehyun Hwang, Joongkyu Lee, Min-hwan Oh

    Abstract: We study reinforcement learning with multinomial logistic (MNL) function approximation where the underlying transition probability kernel of the Markov decision processes (MDPs) is parametrized by an unknown transition core with features of state and action. For the finite horizon episodic setting with inhomogeneous state transitions, we propose provably efficient algorithms with randomized explor…

    Submitted 30 May, 2024; originally announced May 2024.

  21. arXiv:2405.19794

    cs.CV

    Video Question Answering for People with Visual Impairments Using an Egocentric 360-Degree Camera

    Authors: Inpyo Song, Minjun Joo, Joonhyung Kwon, Jangwon Lee

    Abstract: This paper addresses the daily challenges encountered by visually impaired individuals, such as limited access to information, navigation difficulties, and barriers to social interaction. To alleviate these challenges, we introduce a novel visual question answering dataset. Our dataset offers two significant advancements over previous datasets: Firstly, it features videos captured using a 360-degr…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: CVPR2024 EgoVis Workshop

  22. arXiv:2405.19734

    hep-ex

    Search for the decay $B^{0}\to\gamma\gamma$ using Belle and Belle II data

    Authors: Belle, Belle II Collaborations, I. Adachi, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, S. Al Said, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, et al. (385 additional authors not shown)

    Abstract: We report the result of a search for the rare decay $B^{0} \to \gamma\gamma$ using a combined dataset of $753\times10^{6}$ $B\bar{B}$ pairs collected by the Belle experiment and $387\times10^{6}$ $B\bar{B}$ pairs collected by the Belle II experiment from decays of the $\Upsilon(4S)$ resonance produced in $e^{+}e^{-}$ collisions. A simultaneous fit to the Belle and Belle II data sets yields…

    Submitted 30 May, 2024; originally announced May 2024.

    Report number: Belle II Preprint: 2024-017, KEK Preprint: 2024-13

  23. arXiv:2405.19713

    math.NA math.CA

    Summing divergent matrix series

    Authors: Rongbiao Wang, JungHo Lee, Lek-Heng Lim

    Abstract: We extend several celebrated methods in classical analysis for summing series of complex numbers to series of complex matrices. These include the summation methods of Abel, Borel, Cesáro, Euler, Lambert, Nörlund, and Mittag-Leffler, which are frequently used to sum scalar series that are divergent in the conventional sense. One feature of our matrix extensions is that they are fully noncommutative…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 39 pages, 11 figures

    MSC Class: 15A16; 40D05; 40G10; 47A56; 65B10; 65F60

  24. arXiv:2405.19685

    eess.IV

    Identifying Functional Brain Networks of Spatiotemporal Wide-Field Calcium Imaging Data via a Long Short-Term Memory Autoencoder

    Authors: Xiaohui Zhang, Eric C Landsness, Lindsey M Brier, Wei Chen, Michelle J. Tang, Hanyang Miao, Jin-Moo Lee, Mark A. Anastasio, Joseph P. Culver

    Abstract: Wide-field calcium imaging (WFCI) that records neural calcium dynamics allows for identification of functional brain networks (FBNs) in mice that express genetically encoded calcium indicators. Estimating FBNs from WFCI data is commonly achieved by use of seed-based correlation (SBC) analysis and independent component analysis (ICA). These two methods are conceptually distinct and each possesses l…

    Submitted 30 May, 2024; originally announced May 2024.

  25. arXiv:2405.19504

    cs.DS cs.DB cs.IR

    MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings

    Authors: Laxman Dhulipala, Majid Hadian, Rajesh Jayaram, Jason Lee, Vahab Mirrokni

    Abstract: Neural embedding models have become a fundamental component of modern information retrieval (IR) pipelines. These models produce a single embedding $x \in \mathbb{R}^d$ per data-point, allowing for fast retrieval via highly optimized maximum inner product search (MIPS) algorithms. Recently, beginning with the landmark ColBERT paper, multi-vector models, which produce a set of embedding per data po…

    Submitted 29 May, 2024; originally announced May 2024.

  26. arXiv:2405.18928

    hep-ex

    Measurement of the energy dependence of the $e^+e^- \to B\bar{B}$, $B\bar{B}{}^*$, and $B^*\bar{B}{}^*$ cross sections at Belle II

    Authors: Belle II Collaboration, I. Adachi, L. Aggarwal, H. Ahmed, H. Aihara, N. Akopov, A. Aloisio, N. Althubiti, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, M. Bauer, A. Baur, et al. (444 additional authors not shown)

    Abstract: We report measurements of the $e^+e^- \to B\bar{B}$, $B\bar{B}{}^*$, and $B^*\bar{B}{}^*$ cross sections at four energies, 10653, 10701, 10746 and 10805 MeV, using data collected by the Belle II experiment. We reconstruct one $B$ meson in a large number of hadronic final states and use its momentum to identify the production process. In the first $2-5$ MeV above $B^*\bar{B}{}^*$ threshold, the…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 30 pages, 15 figures, submitted to JHEP

    Report number: Belle II Preprint 2024-016, KEK Preprint 2024-12

  27. arXiv:2405.18792

    cs.LG cs.AI

    Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies

    Authors: Haanvid Lee, Tri Wahyu Guntara, Jongmin Lee, Yung-Kyun Noh, Kee-Eung Kim

    Abstract: We consider off-policy evaluation (OPE) of deterministic target policies for reinforcement learning (RL) in environments with continuous action spaces. While it is common to use importance sampling for OPE, it suffers from high variance when the behavior policy deviates significantly from the target policy. In order to address this issue, some recent works on OPE proposed in-sample learning with i…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 23 pages, 2 figures, Accepted at ICLR 2024 (spotlight)

  28. arXiv:2405.18710

    cs.LG cs.AI

    To FP8 and Back Again: Quantifying the Effects of Reducing Precision on LLM Training Stability

    Authors: Joonhyung Lee, Jeongin Bae, Byeongwook Kim, Se Jung Kwon, Dongsoo Lee

    Abstract: The massive computational costs associated with large language model (LLM) pretraining have spurred great interest in reduced-precision floating-point representations to accelerate the process. As a result, the BrainFloat16 (BF16) precision has become the de facto standard for LLM training, with hardware support included in recent accelerators. This trend has gone even further in the latest proces…

    Submitted 28 May, 2024; originally announced May 2024.

  29. arXiv:2405.18540

    cs.CL cs.CR cs.LG

    Learning diverse attacks on large language models for robust red-teaming and safety tuning

    Authors: Seanie Lee, Minsu Kim, Lynn Cherif, David Dobre, Juho Lee, Sung Ju Hwang, Kenji Kawaguchi, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Moksh Jain

    Abstract: Red-teaming, or identifying prompts that elicit harmful responses, is a critical step in ensuring the safe and responsible deployment of large language models (LLMs). Developing effective protection against many modes of attack prompts requires discovering diverse attacks. Automated red-teaming typically uses reinforcement learning to fine-tune an attacker language model to generate prompts that e…

    Submitted 28 May, 2024; originally announced May 2024.

  30. arXiv:2405.18093

    cs.DC cs.LG

    Pipette: Automatic Fine-grained Large Language Model Training Configurator for Real-World Clusters

    Authors: Jinkyu Yim, Jaeyong Song, Yerim Choi, Jaebeen Lee, Jaewon Jung, Hongsun Jang, Jinho Lee

    Abstract: Training large language models (LLMs) is known to be challenging because of the huge computational and memory capacity requirements. To address these issues, it is common to use a cluster of GPUs with 3D parallelism, which splits a model along the data batch, pipeline stage, and intra-layer tensor dimensions. However, the use of 3D parallelism produces the additional challenge of finding the optim…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: published at DATE 2024

  31. arXiv:2405.18089

    econ.EM

    Semi-nonparametric models of multidimensional matching: an optimal transport approach

    Authors: Dongwoo Kim, Young Jun Lee

    Abstract: This paper proposes empirically tractable multidimensional matching models, focusing on worker-job matching. We generalize the parametric model proposed by Lindenlaub (2017), which relies on the assumption of joint normality of observed characteristics of workers and jobs. In our paper, we allow unrestricted distributions of characteristics and show identification of the production technology, and…

    Submitted 28 May, 2024; originally announced May 2024.

  32. arXiv:2405.18047

    cs.LG cs.AI cs.DC

    2BP: 2-Stage Backpropagation

    Authors: Christopher Rae, Joseph K. L. Lee, James Richings

    Abstract: As Deep Neural Networks (DNNs) grow in size and complexity, they often exceed the memory capacity of a single accelerator, necessitating the sharding of model parameters across multiple accelerators. Pipeline parallelism is a commonly used sharding strategy for training large DNNs. However, current implementations of pipeline parallelism are being unintentionally bottlenecked by the automatic diff…

    Submitted 28 May, 2024; originally announced May 2024.

  33. arXiv:2405.17918

    cs.LG cs.AI

    Cost-Sensitive Multi-Fidelity Bayesian Optimization with Transfer of Learning Curve Extrapolation

    Authors: Dong Bok Lee, Aoxuan Silvia Zhang, Byungjoo Kim, Junhyeon Park, Juho Lee, Sung Ju Hwang, Hae Beom Lee

    Abstract: In this paper, we address the problem of cost-sensitive multi-fidelity Bayesian Optimization (BO) for efficient hyperparameter optimization (HPO). Specifically, we assume a scenario where users want to early-stop the BO when the performance improvement is not satisfactory with respect to the required computational cost. Motivated by this scenario, we introduce utility, which is a function predefin…

    Submitted 28 May, 2024; originally announced May 2024.

  34. arXiv:2405.17720

    cs.CV cs.AI cs.LG

    MindFormer: A Transformer Architecture for Multi-Subject Brain Decoding via fMRI

    Authors: Inhwa Han, Jaayeon Lee, Jong Chul Ye

    Abstract: Research efforts to understand neural signals have been ongoing for many years, with visual decoding from fMRI signals attracting considerable attention. Particularly, the advent of image diffusion models has advanced the reconstruction of images from fMRI data significantly. However, existing approaches often introduce inter- and intra- subject variations in the reconstructed images, which can co…

    Submitted 27 May, 2024; originally announced May 2024.

  35. arXiv:2405.17562

    hep-th

    Angular fractals in thermal QFT

    Authors: Nathan Benjamin, Jaeha Lee, Sridip Pal, David Simmons-Duffin, Yixin Xu

    Abstract: We show that thermal effective field theory controls the long-distance expansion of the partition function of a $d$-dimensional QFT, with an insertion of any finite-order spatial isometry. Consequently, the thermal partition function on a sphere displays a fractal-like structure as a function of angular twist, reminiscent of the behavior of a modular form near the real line. As an example applicat…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 45 pages+ appendices, 7 figures

    Report number: CALT-TH 2024-021

  36. arXiv:2405.17430

    cs.CV cs.AI cs.CL cs.LG

    Matryoshka Multimodal Models

    Authors: Mu Cai, Jianwei Yang, Jianfeng Gao, Yong Jae Lee

    Abstract: Large Multimodal Models (LMMs) such as LLaVA have shown strong performance in visual-linguistic reasoning. These models first embed images into a fixed large number of visual tokens and then feed them into a Large Language Model (LLM). However, this design causes an excessive number of tokens for dense visual scenarios such as high-resolution images and videos, leading to great inefficiency. While…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Project Page: https://matryoshka-mm.github.io/

  37. arXiv:2405.17385

    quant-ph cond-mat.mes-hall cond-mat.str-el

    Thermalization and Criticality on an Analog-Digital Quantum Simulator

    Authors: Trond I. Andersen, Nikita Astrakhantsev, Amir Karamlou, Julia Berndtsson, Johannes Motruk, Aaron Szasz, Jonathan A. Gross, Tom Westerhout, Yaxing Zhang, Ebrahim Forati, Dario Rossi, Bryce Kobrin, Agustin Di Paolo, Andrey R. Klots, Ilya Drozdov, Vladislav D. Kurilovich, Andre Petukhov, Lev B. Ioffe, Andreas Elben, Aniket Rath, Vittorio Vitale, Benoit Vermersch, Rajeev Acharya, Laleh Aghababaie Beni, Kyle Anderson, et al. (202 additional authors not shown)

    Abstract: Understanding how interacting particles approach thermal equilibrium is a major challenge of quantum simulators. Unlocking the full potential of such systems toward this goal requires flexible initial state preparation, precise time evolution, and extensive probes for final state characterization. We present a quantum simulator comprising 69 superconducting qubits which supports both universal qua…

    Submitted 27 May, 2024; originally announced May 2024.

  38. arXiv:2405.17083

    cs.CV

    F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting

    Authors: Xiangyu Sun, Joo Chan Lee, Daniel Rho, Jong Hwan Ko, Usman Ali, Eunbyung Park

    Abstract: The neural radiance field (NeRF) has made significant strides in representing 3D scenes and synthesizing novel views. Despite its advancements, the high computational costs of NeRF have posed challenges for its deployment in resource-constrained environments and real-time applications. As an alternative to NeRF-like neural rendering methods, 3D Gaussian Splatting (3DGS) offers rapid rendering spee…

    Submitted 28 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Our project page including code is available at https://xiangyu1sun.github.io/Factorize-3DGS/

  39. arXiv:2405.16907

    cs.AI cs.LG

    GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning

    Authors: Jaewoo Lee, Sujin Yun, Taeyoung Yun, Jinkyoo Park

    Abstract: Offline Reinforcement Learning (Offline RL) presents challenges of learning effective decision-making policies from static datasets without any online interactions. Data augmentation techniques, such as noise injection and data synthesizing, aim to improve Q-function approximation by smoothing the learned state-action region. However, these methods often fall short of directly improving the qualit…

    Submitted 12 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted (Spotlight) to ICLR 2024 Workshop on Generative Models for Decision Making. Jaewoo Lee and Sujin Yun are equal contribution authors

  40. arXiv:2405.16894

    math.NA

    An Unconstrained Formulation of Some Constrained Partial Differential Equations and its Application to Finite Neuron Methods

    Authors: Jiwei Jia, Young Ju Lee, Ruitong Shan

    Abstract: In this paper, we present a new framework how a PDE with constraints can be formulated into a sequence of PDEs with no constraints, whose solutions are convergent to the solution of the PDE with constraints. This framework is then used to build a novel finite neuron method to solve the 2nd order elliptic equations with the Dirichlet boundary condition. Our algorithm is the first algorithm, proven…

    Submitted 27 May, 2024; originally announced May 2024.

  41. arXiv:2405.16877

    cs.LG cs.AI

    Are Self-Attentions Effective for Time Series Forecasting?

    Authors: Dongbin Kim, Jinseong Park, Jaewook Lee, Hoki Kim

    Abstract: Time series forecasting is crucial for applications across multiple domains and various scenarios. Although Transformer models have dramatically shifted the landscape of forecasting, their effectiveness remains debated. Recent findings have indicated that simpler linear models might outperform complex Transformer-based approaches, highlighting the potential for more streamlined architectures. In t…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 20 pages, 14 figures, 13 tables. Submitted to NeurIPS 2024 (under review)

  42. arXiv:2405.16861  [pdf, other]

    q-bio.BM cs.LG physics.bio-ph

    NCIDiff: Non-covalent Interaction-generative Diffusion Model for Improving Reliability of 3D Molecule Generation Inside Protein Pocket

    Authors: Joongwon Lee, Wonho Zhung, Woo Youn Kim

    Abstract: Advancements in deep generative modeling have changed the paradigm of drug discovery. Among such approaches, target-aware methods that exploit 3D structures of protein pockets were spotlighted for generating ligand molecules with their plausible binding modes. While docking scores superficially assess the quality of generated ligands, closer inspection of the binding structures reveals the inconsi…

    Submitted 27 May, 2024; originally announced May 2024.

  43. arXiv:2405.16855  [pdf, ps, other]

    math.CA

    Maximal operators given by Fourier multipliers with dilation of fractional dimensions

    Authors: Jin Bong Lee, Jinsol Seo

    Abstract: In this paper, we investigate $L^p$ bounds of maximal Fourier multiplier operators with dilation of fractional dimensions. For the Fourier multipliers, we suggest a criterion related to dimensions of dilation sets which guarantees $L^p$ bounds of the maximal operators for each $p$. Our criterion covers Mikhlin-type multipliers, multipliers with limited decay, and multipliers with slow decay.

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 15 pages

    MSC Class: 42B25; 42B15; 42B35; 42B37

  44. arXiv:2405.16828  [pdf, other]

    cs.LG math.ST stat.ML

    Kernel-based optimally weighted conformal prediction intervals

    Authors: Jonghyeok Lee, Chen Xu, Yao Xie

    Abstract: Conformal prediction has been a popular distribution-free framework for uncertainty quantification. In this paper, we present a novel conformal prediction method for time-series, which we call Kernel-based Optimally Weighted Conformal Prediction Intervals (KOWCPI). Specifically, KOWCPI adapts the classic Reweighted Nadaraya-Watson (RNW) estimator for quantile regression on dependent data and learn…

    Submitted 27 May, 2024; originally announced May 2024.

  45. arXiv:2405.16751  [pdf, other]

    cs.AI cs.CL cs.CV cs.MA

    LLM-Based Cooperative Agents using Information Relevance and Plan Validation

    Authors: SeungWon Seo, Junhyeok Lee, SeongRae Noh, HyeongYeop Kang

    Abstract: We address the challenge of multi-agent cooperation, where agents achieve a common goal by interacting with a 3D scene and cooperating with decentralized agents under complex partial observations. This involves managing communication costs and optimizing interaction trajectories in dynamic environments. Our research focuses on three primary limitations of existing cooperative agent systems. Firstl…

    Submitted 26 May, 2024; originally announced May 2024.

  46. arXiv:2405.15898  [pdf, other]

    cond-mat.mtrl-sci

    Thermodynamics of Sodium-Lead Alloys for Negative Electrodes from First-Principles

    Authors: Damien K. J. Lee, Zeyu Deng, Gopalakrishnan Sai Gautam, Pieremanuele Canepa

    Abstract: Metals such as tin, antimony, and lead (Pb) have garnered renewed attention for their potential use as alloyant-negative electrode materials in sodium (Na)-ion batteries (NIBs). Despite Pb's toxicity and its high molecular weight, lead is one of the most commonly recycled metals, positioning Pb as a promising candidate for a cost-effective, high-capacity anode material. Understanding the miscibil…

    Submitted 24 May, 2024; originally announced May 2024.

  47. arXiv:2405.15654  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci

    Interfacially enhanced superconductivity in Fe(Te,Se)/Bi4Te3 heterostructures

    Authors: An-Hsi Chen, Qiangsheng Lu, Eitan Hershkovitz, Miguel L. Crespillo, Alessandro R. Mazza, Tyler Smith, T. Zac Ward, Gyula Eres, Shornam Gandhi, Meer Muhtasim Mahfuz, Vitalii Starchenko, Khalid Hattar, Joon Sue Lee, Honggyu Kim, Robert G. Moore, Matthew Brahlek

    Abstract: Realizing topological superconductivity by integrating high-transition-temperature ($T_C$) superconductors with topological insulators can open new paths for quantum computing applications. Here, we report a new approach for increasing the superconducting transition temperature ($T_{C}^{onset}$) by interfacing the unconventional superconductor Fe(Te,Se) with the topological insulator Bi-Te system…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 18 pages, 5 figures, accepted by Advanced Materials

  48. arXiv:2405.15229  [pdf, other]

    cond-mat.mes-hall

    Multi-Orbital Interactions and Spin Polarization in Single Rare-Earth Adatoms

    Authors: Massine Kelai, Stefano Reale, Roberto Robles, Jaehyun Lee, Divya Jyoti, Philippe Ohresser, Edwige Otero, Fadi Choueikani, Fabrice Scheurer, Nicolás Lorente, Deung-Jang Choi, Aparajita Singha, Fabio Donati

    Abstract: Surface-adsorbed rare-earth nanostructures are ideal platforms to investigate the interplay between intra-atomic interactions and multi-orbital spin configurations. However, addressing these properties has posed severe experimental and theoretical challenges. Here, we use the orbital selectivity offered by X-ray absorption spectroscopy to quantify the Coulomb integrals of Nd atoms on conductive su…

    Submitted 24 May, 2024; originally announced May 2024.

  49. arXiv:2405.14625  [pdf, other]

    hep-ex

    Test of light-lepton universality in $\tau$ decays with the Belle II experiment

    Authors: Belle II Collaboration, I. Adachi, K. Adamczyk, L. Aggarwal, H. Aihara, N. Akopov, A. Aloisio, N. Anh Ky, D. M. Asner, H. Atmacan, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, S. Bahinipati, P. Bambade, Sw. Banerjee, S. Bansal, M. Barrett, J. Baudot, A. Baur, A. Beaubien, F. Becherer, J. Becker, et al. (406 additional authors not shown)

    Abstract: We present a measurement of the ratio $R_\mu = \mathcal{B}(\tau^- \to \mu^- \bar{\nu}_\mu \nu_\tau) / \mathcal{B}(\tau^- \to e^- \bar{\nu}_e \nu_\tau)$ of branching fractions $\mathcal{B}$ of the $\tau$ lepton decaying to muons or electrons using data collected with the Belle II detector at the SuperKEKB $e^+e^-$ collider. The sample has an integrated luminosity of 362 fb$^{-1}$ at a centre-of-mass energy of 10.58 GeV. Using an optimise…

    Submitted 23 May, 2024; originally announced May 2024.

    Report number: Belle II Preprint 2024-002, KEK Preprint 2023-49

  50. arXiv:2405.14100  [pdf]

    physics.soc-ph astro-ph.EP astro-ph.IM physics.pop-ph

    Water Management Considerations for a Self-Sustaining Moonbase

    Authors: Jeffrey S. Lee, Joe Yelderman, Gerald B. Cleaver

    Abstract: The most pragmatic first step in the all-but-inevitable 3rd-millennium Völkerwanderung of humanity throughout the Solar System is the establishment of a permanent human presence on the Moon. This research examines: 1. the human, agricultural, and technical water needs of a 100-person, 500 m x 100 m x 6 m self-sustaining lunar colony; 2. choosing a strategic location for the moonbase; 3. a heat dri…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 36 pages, 12 Figures, 3 Tables