Skip to main content

Showing 1–50 of 393 results for author: Ahn, S

.
  1. arXiv:2407.02490  [pdf, other

    cs.CL cs.LG

    MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

    Authors: Huiqiang Jiang, Yucheng Li, Chengruidong Zhang, Qianhui Wu, Xufang Luo, Surin Ahn, Zhenhua Han, Amir H. Abdi, Dongsheng Li, Chin-Yew Lin, Yuqing Yang, Lili Qiu

    Abstract: The computational challenges of Large Language Model (LLM) inference remain a significant barrier to their widespread deployment, especially as prompt lengths continue to increase. Due to the quadratic complexity of the attention computation, it takes 30 minutes for an 8B LLM to process a prompt of 1M tokens (i.e., the pre-filling stage) on a single A100 GPU. Existing methods for speeding up prefi… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  2. Predicting Visual Attention in Graphic Design Documents

    Authors: Souradeep Chakraborty, Zijun Wei, Conor Kelton, Seoyoung Ahn, Aruna Balasubramanian, Gregory J. Zelinsky, Dimitris Samaras

    Abstract: We present a model for predicting visual attention during the free viewing of graphic design documents. While existing works on this topic have aimed at predicting static saliency of graphic designs, our work is the first attempt to predict both spatial attention and dynamic temporal order in which the document regions are fixated by gaze using a deep learning based model. We propose a two-stage m… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Journal ref: IEEE Transactions on Multimedia 25 (2022): 4478-4493

  3. arXiv:2406.18508  [pdf

    eess.IV

    Assessment of Clonal Hematopoiesis of Indeterminate Potential from Cardiac Magnetic Resonance Imaging using Deep Learning in a Cardio-oncology Population

    Authors: Sangeon Ryu, Shawn Ahn, Jeacy Espinoza, Alokkumar Jha, Stephanie Halene, James S. Duncan, Jennifer M Kwan, Nicha C. Dvornek

    Abstract: Background: We propose a novel method to identify who may likely have clonal hematopoiesis of indeterminate potential (CHIP), a condition characterized by the presence of somatic mutations in hematopoietic stem cells without detectable hematologic malignancy, using deep learning techniques. Methods: We developed a convolutional neural network (CNN) to predict CHIP status using 4 different views fr… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  4. arXiv:2406.14188  [pdf, other

    cond-mat.mes-hall

    Anisptropic plasmons in threefold Hopf semimetals

    Authors: Seong** Ahn

    Abstract: Threefold Hopf semimetals are a novel type of topological semimetals that possess an internal anisotropy characterized by a dipolar structure of the Berry curvature and an isotropic energy band structure consisting of a Dirac cone and a flat band. In this study, we theoretically investigate the impact of internal anisotropy on plasmons in threefold Hopf semimetals using random-phase approximation.… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 7 pages, 5 figures

  5. arXiv:2406.13458  [pdf, other

    astro-ph.SR

    12C+12C Reaction Rates and the Evolution of a Massive Star

    Authors: Gwangeon Seong, Yubin Kim, Kyu** Kwak, Sunghoon Ahn, Chaeyeon Park, Kevin Insik Hahn, Chunglee Kim

    Abstract: Carbon fusion is important to understand the late stages in the evolution of a massive star. Astronomically interesting energy ranges for the 12C+12C reactions have been, however, poorly constrained by experiments. Theoretical studies on stellar evolution have relied on reaction rates that are extrapolated from those measured in higher energies. In this work, we update the carbon fusion reaction r… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 8 pages, 6 figures

  6. arXiv:2406.12272  [pdf, other

    cs.AI

    Slot State Space Models

    Authors: **dong Jiang, Fei Deng, Gautam Singh, Minseung Lee, Sung** Ahn

    Abstract: Recent State Space Models (SSMs) such as S4, S5, and Mamba have shown remarkable computational benefits in long-range temporal dependency modeling. However, in many sequence modeling problems, the underlying process is inherently modular and it is of interest to have inductive biases that mimic this modular structure. In this paper, we introduce SlotSSMs, a novel framework for incorporating indepe… ▽ More

    Submitted 30 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.07899  [pdf, other

    hep-ex physics.ins-det

    Josephson Parametric Amplifier based Quantum Noise Limited Amplifier Development for Axion Search Experiments in CAPP

    Authors: Sergey V. Uchaikin, **myeong Kim, Caglar Kutlu, Boris I. Ivanov, **su Kim, Arjan F. van Loo, Yasunobu Nakamura, Saebyeok Ahn, Seonjeong Oh, Minsu Ko, Yannis K. Semertzidis

    Abstract: This paper provides a comprehensive overview of the development of flux-driven Josephson Parametric Amplifiers (JPAs) as Quantum Noise Limited Amplifier for axion search experiments conducted at the Center for Axion and Precision Physics Research (CAPP) of the Institute for Basic Science. It focuses on the characterization, and optimization of JPAs, which are crucial for achieving the highest sens… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 29 pages, 15 figures

  8. arXiv:2406.07782  [pdf, other

    hep-ph hep-ex physics.ins-det

    Wideband tunable cavity development for axion dark matter searches using a piezoelectric motor in combination with gears

    Authors: A. K. Yi, T. Seong, S. Lee, S. Ahn, B. I. Ivanov, S. V. Uchaikin, B. R. Ko, Y. K. Semertzidis

    Abstract: Most search experiments sensitive to quantum chromodynamics axion dark matter benefit from microwave cavities, as electromagnetic resonators, that enhance the detected axion signal power and thus the experimental sensitivity drastically. As the possible axion mass spans multiple orders of magnitude, microwave cavities must be tunable and it is desirable for the cavity to have a tunable frequency r… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 10 pages, 7 figures

  9. arXiv:2406.06793  [pdf, other

    cs.LG cs.AI

    PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer

    Authors: Chang Chen, Junyeob Baek, Fei Deng, Kenji Kawaguchi, Caglar Gulcehre, Sung** Ahn

    Abstract: Despite the recent advancements in offline RL, no unified algorithm could achieve superior performance across a broad range of tasks. Offline \textit{value function learning}, in particular, struggles with sparse-reward, long-horizon tasks due to the difficulty of solving credit assignment and extrapolation errors that accumulates as the horizon of the task grows.~On the other hand, models that ca… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  10. arXiv:2406.02355  [pdf, other

    cs.CV cs.AI cs.DC cs.LG

    FedDr+: Stabilizing Dot-regression with Global Feature Distillation for Federated Learning

    Authors: Seongyoon Kim, Minchan Jeong, Sungnyun Kim, Sungwoo Cho, Sumyeong Ahn, Se-Young Yun

    Abstract: Federated Learning (FL) has emerged as a pivotal framework for the development of effective global models (global FL) or personalized models (personalized FL) across clients with heterogeneous, non-iid data distribution. A key challenge in FL is client drift, where data heterogeneity impedes the aggregation of scattered knowledge. Recent studies have tackled the client drift issue by identifying s… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  11. arXiv:2406.01302  [pdf

    cs.CV

    Pulmonary Embolism Mortality Prediction Using Multimodal Learning Based on Computed Tomography Angiography and Clinical Data

    Authors: Zhusi Zhong, Helen Zhang, Fayez H. Fayad, Andrew C. Lancaster, John Sollee, Shreyas Kulkarni, Cheng Ting Lin, Jie Li, Xinbo Gao, Scott Collins, Colin Greineder, Sun H. Ahn, Harrison X. Bai, Zhicheng Jiao, Michael K. Atalay

    Abstract: Purpose: Pulmonary embolism (PE) is a significant cause of mortality in the United States. The objective of this study is to implement deep learning (DL) models using Computed Tomography Pulmonary Angiography (CTPA), clinical data, and PE Severity Index (PESI) scores to predict PE mortality. Materials and Methods: 918 patients (median age 64 years, range 13-99 years, 52% female) with 3,978 CTPAs w… ▽ More

    Submitted 5 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  12. arXiv:2405.19961  [pdf, other

    cs.LG

    Collective Variable Free Transition Path Sampling with Generative Flow Network

    Authors: Kiyoung Seong, Seonghyun Park, Seonghwan Kim, Woo Youn Kim, Sungsoo Ahn

    Abstract: Understanding transition paths between meta-stable states in molecular systems is fundamental for material design and drug discovery. However, sampling these paths via molecular dynamics simulations is computationally prohibitive due to the high-energy barriers between the meta-stable states. Recent machine learning approaches are often restricted to simple systems or rely on collective variables… ▽ More

    Submitted 31 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 9 pages, 5 figures, 2 tables

  13. arXiv:2405.19691  [pdf, other

    cs.HC

    Designing Prompt Analytics Dashboards to Analyze Student-ChatGPT Interactions in EFL Writing

    Authors: Minsun Kim, SeonGyeom Kim, Suyoun Lee, Yoosang Yoon, Junho Myung, Haneul Yoo, Hyungseung Lim, Jieun Han, Yoonsu Kim, So-Yeon Ahn, Juho Kim, Alice Oh, Hwajung Hong, Tak Yeon Lee

    Abstract: While ChatGPT has significantly impacted education by offering personalized resources for students, its integration into educational settings poses unprecedented risks, such as inaccuracies and biases in AI-generated content, plagiarism and over-reliance on AI, and privacy and security issues. To help teachers address such risks, we conducted a two-phase iterative design process that comprises sur… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  14. arXiv:2405.16413  [pdf, other

    cs.AI cs.CL cs.LG stat.AP

    Augmented Risk Prediction for the Onset of Alzheimer's Disease from Electronic Health Records with Large Language Models

    Authors: Jiankun Wang, Sumyeong Ahn, Taykhoom Dalal, Xiaodan Zhang, Weishen Pan, Qiannan Zhang, Bin Chen, Hiroko H. Dodge, Fei Wang, Jiayu Zhou

    Abstract: Alzheimer's disease (AD) is the fifth-leading cause of death among Americans aged 65 and older. Screening and early detection of AD and related dementias (ADRD) are critical for timely intervention and for identifying clinical trial participants. The widespread adoption of electronic health records (EHRs) offers an important resource for develo** ADRD screening tools such as machine learning bas… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  15. arXiv:2405.16012  [pdf, other

    cs.LG

    Pessimistic Backward Policy for GFlowNets

    Authors: Hyosoon Jang, Yunhui Jang, Minsu Kim, **kyoo Park, Sungsoo Ahn

    Abstract: This paper studies Generative Flow Networks (GFlowNets), which learn to sample objects proportionally to a given reward function through the trajectory of state transitions. In this work, we observe that GFlowNets tend to under-exploit the high-reward objects due to training on insufficient number of trajectories, which may lead to a large gap between the estimated flow and the (known) reward valu… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  16. arXiv:2405.08424  [pdf, other

    cs.LG math.OC

    Tackling Prevalent Conditions in Unsupervised Combinatorial Optimization: Cardinality, Minimum, Covering, and More

    Authors: Fanchen Bu, Hyeonsoo Jo, Soo Yong Lee, Sungsoo Ahn, Kijung Shin

    Abstract: Combinatorial optimization (CO) is naturally discrete, making machine learning based on differentiable optimization inapplicable. Karalias & Loukas (2020) adapted the probabilistic method to incorporate CO into differentiable optimization. Their work ignited the research on unsupervised learning for CO, composed of two main components: probabilistic objectives and derandomization. However, each co… ▽ More

    Submitted 23 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  17. arXiv:2405.04752  [pdf, other

    eess.AS cs.SD

    HILCodec: High Fidelity and Lightweight Neural Audio Codec

    Authors: Sunghwan Ahn, Beom Jun Woo, Min Hyun Han, Chanyeong Moon, Nam Soo Kim

    Abstract: The recent advancement of end-to-end neural audio codecs enables compressing audio at very low bitrates while reconstructing the output audio with high fidelity. Nonetheless, such improvements often come at the cost of increased model complexity. In this paper, we identify and address the problems of existing neural audio codecs. We show that the performance of Wave-U-Net does not increase consist… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  18. arXiv:2405.00646  [pdf, other

    cs.CV cs.LG

    Learning to Compose: Improving Object Centric Learning by Injecting Compositionality

    Authors: Whie Jung, Jaehoon Yoo, Sung** Ahn, Seunghoon Hong

    Abstract: Learning compositional representation is a key aspect of object-centric learning as it enables flexible systematic generalization and supports complex visual reasoning. However, most of the existing approaches rely on auto-encoding objective, while the compositionality is implicitly imposed by the architectural or algorithmic bias in the encoder. This misalignment between auto-encoding objective a… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  19. arXiv:2404.16012  [pdf, other

    cs.CV cs.MM

    GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting

    Authors: Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn, Seungryong Kim

    Abstract: We propose GaussianTalker, a novel framework for real-time generation of pose-controllable talking heads. It leverages the fast rendering capabilities of 3D Gaussian Splatting (3DGS) while addressing the challenges of directly controlling 3DGS with speech audio. GaussianTalker constructs a canonical 3DGS representation of the head and deforms it in sync with the audio. A key insight is to encode t… ▽ More

    Submitted 25 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Project Page: https://ku-cvlab.github.io/GaussianTalker

  20. arXiv:2404.05832  [pdf, other

    cs.HC eess.SY

    Human-Machine Interaction in Automated Vehicles: Reducing Voluntary Driver Intervention

    Authors: Xinzhi Zhong, Yang Zhou, Varshini Kamaraj, Zhenhao Zhou, Wissam Kontar, Dan Negrut, John D. Lee, Soyoung Ahn

    Abstract: This paper develops a novel car-following control method to reduce voluntary driver interventions and improve traffic stability in Automated Vehicles (AVs). Through a combination of experimental and empirical analysis, we show how voluntary driver interventions can instigate substantial traffic disturbances that are amplified along the traffic upstream. Motivated by these findings, we present a fr… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  21. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seong** Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  22. arXiv:2403.20153  [pdf, other

    cs.CV

    Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior

    Authors: Jaehoon Ko, Kyusun Cho, Joungbin Lee, Heeji Yoon, Sangmin Lee, Sangjun Ahn, Seungryong Kim

    Abstract: Recent methods for audio-driven talking head synthesis often optimize neural radiance fields (NeRF) on a monocular talking portrait video, leveraging its capability to render high-fidelity and 3D-consistent novel-view frames. However, they often struggle to reconstruct complete face geometry due to the absence of comprehensive 3D information in the input monocular videos. In this paper, we introdu… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Project page: https://ku-cvlab.github.io/Talk3D/

  23. arXiv:2403.08272  [pdf, other

    cs.CL

    RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education

    Authors: Jieun Han, Haneul Yoo, Junho Myung, Minsun Kim, Tak Yeon Lee, So-Yeon Ahn, Alice Oh

    Abstract: The integration of generative AI in education is expanding, yet empirical analyses of large-scale and real-world interactions between students and AI systems still remain limited. Addressing this gap, we present RECIPE4U (RECIPE for University), a dataset sourced from a semester-long experiment with 212 college students in English as Foreign Language (EFL) writing courses. During the study, studen… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2309.13243

  24. arXiv:2403.02642  [pdf, other

    cs.RO cs.CV

    UFO: Uncertainty-aware LiDAR-image Fusion for Off-road Semantic Terrain Map Estimation

    Authors: Ohn Kim, Junwon Seo, Seongyong Ahn, Chong Hui Kim

    Abstract: Autonomous off-road navigation requires an accurate semantic understanding of the environment, often converted into a bird's-eye view (BEV) representation for various downstream tasks. While learning-based methods have shown success in generating local semantic terrain maps directly from sensor data, their efficacy in off-road environments is hindered by challenges in accurately representing uncer… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  25. arXiv:2402.18866  [pdf, other

    cs.LG

    Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming

    Authors: Hany Hamed, Subin Kim, Dongyeong Kim, Jaesik Yoon, Sung** Ahn

    Abstract: Model-based reinforcement learning (MBRL) has been a primary approach to ameliorating the sample efficiency issue as well as to make a generalist agent. However, there has not been much effort toward enhancing the strategy of dreaming itself. Therefore, it is a question whether and how an agent can "dream better" in a more structured and strategic way. In this paper, inspired by the observation fr… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: First two authors contributed equally

  26. arXiv:2402.17077  [pdf, other

    cs.LG cs.CV

    Parallelized Spatiotemporal Binding

    Authors: Gautam Singh, Yue Wang, Jiawei Yang, Boris Ivanovic, Sung** Ahn, Marco Pavone, Tong Che

    Abstract: While modern best practices advocate for scalable architectures that support long-range interactions, object-centric models are yet to fully embrace these architectures. In particular, existing object-centric models for handling sequential inputs, due to their reliance on RNN-based implementation, show poor stability and capacity and are slow to train on long sequences. We introduce Parallelizable… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: See project page at https://parallel-st-binder.github.io

  27. arXiv:2402.16733  [pdf, other

    cs.CL cs.AI

    DREsS: Dataset for Rubric-based Essay Scoring on EFL Writing

    Authors: Haneul Yoo, Jieun Han, So-Yeon Ahn, Alice Oh

    Abstract: Automated essay scoring (AES) is a useful tool in English as a Foreign Language (EFL) writing education, offering real-time essay scores for students and instructors. However, previous AES models were trained on essays and scores irrelevant to the practical scenarios of EFL writing education and usually provided a single holistic score due to the lack of appropriate datasets. In this paper, we rel… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2310.05191

  28. arXiv:2402.16677  [pdf, other

    nucl-ex

    Cluster structure of 3$α$+p states in $^{13}$N

    Authors: J. Bishop, G. V. Rogachev, S. Ahn, M. Barbui, S. M. Cha, E. Harris, C. Hunt, C. H. Kim, D. Kim, S. H. Kim, E. Koshchiy, Z. Luo, C. Park, C. E. Parker, E. C. Pollacco, B. T. Roeder, M. Roosa, A. Saastamoinen, D. P. Scriven

    Abstract: Background: Cluster states in $^{13}$N are extremely difficult to measure due to the unavailability of $^{9}$B+$α$ elastic scattering data. Purpose: Using $β$-delayed charged-particle spectroscopy of $^{13}$O, clustered states in $^{13}$N can be populated and measured in the 3$α$+p decay channel. Method: One-at-a-time implantation/decay of $^{13}$O was performed with the Texas Active Target Time P… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.14111

  29. arXiv:2402.15160  [pdf, other

    cs.LG cs.AI

    Spatially-Aware Transformer for Embodied Agents

    Authors: Junmo Cho, Jaesik Yoon, Sung** Ahn

    Abstract: Episodic memory plays a crucial role in various cognitive processes, such as the ability to mentally recall past events. While cognitive science emphasizes the significance of spatial context in the formation and retrieval of episodic memory, the current primary approach to implementing episodic memory in AI systems is through transformers that store temporally ordered experiences, which overlooks… ▽ More

    Submitted 29 February, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: ICLR 2024 Spotlight. First two authors contributed equally

  30. arXiv:2402.12892  [pdf, other

    hep-ex hep-ph

    Extensive search for axion dark matter over 1\,GHz with CAPP's Main Axion eXperiment

    Authors: Saebyeok Ahn, **Myeong Kim, Boris I. Ivanov, Ohjoon Kwon, HeeSu Byun, Arjan F. van Loo, SeongTae Par, Junu Jeong, Soohyung Lee, **su Kim, Çağlar Kutlu, Andrew K. Yi, Yasunobu Nakamura, Seonjeong Oh, Danho Ahn, SungJae Bae, Hyoungsoon Choi, Jihoon Choi, Yonuk Chong, Woohyun Chung, Violeta Gkika, Jihn E. Kim, Younggeun Kim, Byeong Rok Ko, Lino Miceli , et al. (11 additional authors not shown)

    Abstract: We report an extensive high-sensitivity search for axion dark matter above 1\,GHz at the Center for Axion and Precision Physics Research (CAPP). The cavity resonant search, exploiting the coupling between axions and photons, explored the frequency (mass) range of 1.025\,GHz (4.24\,$μ$eV) to 1.185\,GHz (4.91\,$μ$eV). We have introduced a number of innovations in this field, demonstrating the practi… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: A detailed axion dark matter article with 27 pages, 22 figures

  31. arXiv:2402.12412  [pdf, other

    cs.HC cs.AI cs.MM eess.SP

    Dynamic and Super-Personalized Media Ecosystem Driven by Generative AI: Unpredictable Plays Never Repeating The Same

    Authors: Sungjun Ahn, Hyun-Jeong Yim, Youngwan Lee, Sung-Ik Park

    Abstract: This paper introduces a media service model that exploits artificial intelligence (AI) video generators at the receive end. This proposal deviates from the traditional multimedia ecosystem, completely relying on in-house production, by shifting part of the content creation onto the receiver. We bring a semantic process into the framework, allowing the distribution network to provide service elemen… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 13 pages, 7 figures

  32. arXiv:2402.05982  [pdf, other

    q-bio.QM cs.LG

    Decoupled Sequence and Structure Generation for Realistic Antibody Design

    Authors: Nayoung Kim, Minsu Kim, Sungsoo Ahn, **kyoo Park

    Abstract: Antibody design plays a pivotal role in advancing therapeutics. Although deep learning has made rapid progress in this field, existing methods jointly generate antibody sequences and structures, limiting task-specific optimization. In response, we propose an antibody sequence-structure decoupling (ASSD) framework, which separates sequence generation and structure prediction. Although our approach… ▽ More

    Submitted 27 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 18 pages, 6 figures

  33. arXiv:2402.05965  [pdf, other

    cs.LG eess.SP

    Hybrid Neural Representations for Spherical Data

    Authors: Hyomin Kim, Yunhui Jang, Jaeho Lee, Sungsoo Ahn

    Abstract: In this paper, we study hybrid neural representations for spherical data, a domain of increasing relevance in scientific research. In particular, our work focuses on weather and climate data as well as comic microwave background (CMB) data. Although previous studies have delved into coordinate-based neural representations for spherical signals, they often fail to capture the intricate details of h… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 13 pages, 8 figures

  34. arXiv:2402.04278  [pdf, other

    physics.chem-ph cs.LG

    Gaussian Plane-Wave Neural Operator for Electron Density Estimation

    Authors: Seongsu Kim, Sungsoo Ahn

    Abstract: This work studies machine learning for electron density prediction, which is fundamental for understanding chemical systems and density functional theory (DFT) simulations. To this end, we introduce the Gaussian plane-wave neural operator (GPWNO), which operates in the infinite-dimensional functional space using the plane-wave and Gaussian-type orbital bases, widely recognized in the context of DF… ▽ More

    Submitted 13 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024 main poster

    Journal ref: International Conference on Machine Learning (ICML), 2024

  35. arXiv:2402.01203  [pdf, other

    cs.LG cs.CV

    Neural Language of Thought Models

    Authors: Yi-Fu Wu, Minseung Lee, Sung** Ahn

    Abstract: The Language of Thought Hypothesis suggests that human cognition operates on a structured, language-like system of mental representations. While neural language models can naturally benefit from the compositional structure inherently and explicitly expressed in language data, learning such representations from non-linguistic general observations, like images, remains a challenge. In this work, we… ▽ More

    Submitted 16 April, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted in ICLR 2024

  36. arXiv:2401.12920  [pdf, other

    cs.AI

    Truck Parking Usage Prediction with Decomposed Graph Neural Networks

    Authors: Rei Tamaru, Yang Cheng, Steven Parker, Ernie Perry, Bin Ran, Soyoung Ahn

    Abstract: Truck parking on freight corridors faces various challenges, such as insufficient parking spaces and compliance with Hour-of-Service (HOS) regulations. These constraints often result in unauthorized parking practices, causing safety concerns. To enhance the safety of freight operations, providing accurate parking usage prediction proves to be a cost-effective solution. Despite the existing researc… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 10 pages, 5 figures, 3 tables, Manuscript for IEEE Transactions on Intelligent Transportation Systems

  37. arXiv:2401.02644  [pdf, other

    cs.LG cs.AI

    Simple Hierarchical Planning with Diffusion

    Authors: Chang Chen, Fei Deng, Kenji Kawaguchi, Caglar Gulcehre, Sung** Ahn

    Abstract: Diffusion-based generative methods have proven effective in modeling trajectories with offline datasets. However, they often face computational challenges and can falter in generalization, especially in capturing temporal abstractions for long-horizon tasks. To overcome this, we introduce the Hierarchical Diffuser, a simple, fast, yet surprisingly effective planning method combining the advantages… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  38. arXiv:2401.00355  [pdf, other

    stat.AP

    Understanding Heterogeneity of Automated Vehicles and Its Traffic-level Impact: A Stochastic Behavioral Perspective

    Authors: Xinzhi Zhong, Yang Zhou, Soyoung Ahn, Danjue Chen

    Abstract: This paper develops a stochastic and unifying framework to examine variability in car-following (CF) dynamics of commercial automated vehicles (AVs) and its direct relation to traffic-level dynamics. The asymmetric behavior (AB) model by Chen at al. (2012a) is extended to accommodate a range of CF behaviors by AVs and compare with the baseline of human-driven vehicles (HDVs). The parameters of the… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

  39. arXiv:2312.16839  [pdf, other

    cs.RO

    Similar but Different: A Survey of Ground Segmentation and Traversability Estimation for Terrestrial Robots

    Authors: Hyungtae Lim, Minho Oh, Seungjae Lee, Seunguk Ahn, Hyun Myung

    Abstract: With the increasing demand for mobile robots and autonomous vehicles, several approaches for long-term robot navigation have been proposed. Among these techniques, ground segmentation and traversability estimation play important roles in perception and path planning, respectively. Even though these two techniques appear similar, their objectives are different. Ground segmentation divides data into… ▽ More

    Submitted 2 January, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 10 pages, 8 figures

  40. arXiv:2312.14184  [pdf

    cs.CL cs.AI cs.LG

    Large Language Models in Medical Term Classification and Unexpected Misalignment Between Response and Reasoning

    Authors: Xiaodan Zhang, Sandeep Vemulapalli, Nabasmita Talukdar, Sumyeong Ahn, Jiankun Wang, Han Meng, Sardar Mehtab Bin Murtaza, Aakash Ajay Dave, Dmitry Leshchiner, Dimitri F. Joseph, Martin Witteveen-Lane, Dave Chesla, Jiayu Zhou, Bin Chen

    Abstract: This study assesses the ability of state-of-the-art large language models (LLMs) including GPT-3.5, GPT-4, Falcon, and LLaMA 2 to identify patients with mild cognitive impairment (MCI) from discharge summaries and examines instances where the models' responses were misaligned with their reasoning. Utilizing the MIMIC-IV v2.2 database, we focused on a cohort aged 65 and older, verifying MCI diagnos… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  41. arXiv:2312.10042  [pdf

    cs.LG cs.RO

    A Generic Stochastic Hybrid Car-following Model Based on Approximate Bayesian Computation

    Authors: Jiwan Jiang, Yang Zhou, Xin Wang, Soyoung Ahn

    Abstract: Car following (CF) models are fundamental to describing traffic dynamics. However, the CF behavior of human drivers is highly stochastic and nonlinear. As a result, identifying the best CF model has been challenging and controversial despite decades of research. Introduction of automated vehicles has further complicated this matter as their CF controllers remain proprietary, though their behavior… ▽ More

    Submitted 26 November, 2023; originally announced December 2023.

    Comments: 25 pages, 6 figures

  42. arXiv:2312.06678  [pdf, other

    nucl-ex nucl-th

    Constraining nucleon effective masses with flow and stop** observables from the S$π$RIT experiment

    Authors: C. Y. Tsang, M. Kurata-Nishimura, M. B. Tsang, W. G. Lynch, Y. X. Zhang, J. Barney, J. Estee, G. Jhang, R. Wang, M. Kaneko, J. W. Lee, T. Isobe, T. Murakami, D. S. Ahn, L. Atar, T. Aumann, H. Baba, K. Boretzky, J. Brzychczyk, G. Cerizza, N. Chiga, N. Fukuda, I. Gasparic, B. Hong, A. Horvat , et al. (30 additional authors not shown)

    Abstract: Properties of the nuclear equation of state (EoS) can be probed by measuring the dynamical properties of nucleus-nucleus collisions. In this study, we present the directed flow ($v_1$), elliptic flow ($v_2$) and stop** (VarXZ) measured in fixed target Sn + Sn collisions at 270 AMeV with the S$π$RIT Time Projection Chamber. We perform Bayesian analyses in which EoS parameters are varied simultane… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  43. arXiv:2312.05611  [pdf, other

    cs.LG cs.AI

    Triplet Edge Attention for Algorithmic Reasoning

    Authors: Yeonjoon Jung, Sungsoo Ahn

    Abstract: This work investigates neural algorithmic reasoning to develop neural networks capable of learning from classical algorithms. The main challenge is to develop graph neural networks that are expressive enough to predict the given algorithm outputs while generalizing well to out-of-distribution data. In this work, we introduce a new graph neural network layer called Triplet Edge Attention (TEA), an… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  44. arXiv:2312.02230  [pdf, other

    cs.LG cs.AI

    A Simple and Scalable Representation for Graph Generation

    Authors: Yunhui Jang, Seul Lee, Sungsoo Ahn

    Abstract: Recently, there has been a surge of interest in employing neural networks for graph generation, a fundamental statistical learning problem with critical applications like molecule design and community analysis. However, most approaches encounter significant limitations when generating large-scale graphs. This is due to their requirement to output the full adjacency matrices whose size grows quadra… ▽ More

    Submitted 26 March, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

    Comments: International Conference on Learning Representations (ICLR) 2024

  45. arXiv:2312.00836  [pdf, other

    eess.IV cs.CV

    Heteroscedastic Uncertainty Estimation for Probabilistic Unsupervised Registration of Noisy Medical Images

    Authors: Xiaoran Zhang, Daniel H. Pak, Shawn S. Ahn, Xiaoxiao Li, Chenyu You, Lawrence Staib, Albert J. Sinusas, Alex Wong, James S. Duncan

    Abstract: This paper proposes a heteroscedastic uncertainty estimation framework for unsupervised medical image registration. Existing methods rely on objectives (e.g. mean-squared error) that assume a uniform noise level across the image, disregarding the heteroscedastic and input-dependent characteristics of noise distribution in real-world medical images. This further introduces noisy gradients due to un… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  46. arXiv:2311.17639  [pdf, other

    gr-qc astro-ph.IM hep-ex hep-th

    A superconducting tensor detector for mid-frequency gravitational waves: its multi-channel nature and main astrophysical targets

    Authors: Yeong-Bok Bae, Chan Park, Edwin J. Son, Sang-Hyeon Ahn, Minjoong Jeong, Gungwon Kang, Chunglee Kim, Dong Lak Kim, Jaewan Kim, Whansun Kim, Hyung Mok Lee, Yong-Ho Lee, Ronald S. Norton, John J. Oh, Sang Hoon Oh, Ho Jung Paik

    Abstract: Mid-frequency band gravitational-wave detectors will be complementary for the existing Earth-based detectors (sensitive above 10 Hz or so) and the future space-based detectors such as LISA, which will be sensitive below around 10 mHz. A ground-based superconducting omnidirectional gravitational radiation observatory (SOGRO) has recently been proposed along with several design variations for the fr… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 35 pages, 8 figures, 4 tables

  47. arXiv:2311.11178  [pdf, other

    cs.CV

    Active Prompt Learning in Vision Language Models

    Authors: Jihwan Bang, Sumyeong Ahn, Jae-Gil Lee

    Abstract: Pre-trained Vision Language Models (VLMs) have demonstrated notable progress in various zero-shot tasks, such as classification and retrieval. Despite their performance, because improving performance on new tasks requires task-specific knowledge, their adaptation is essential. While labels are needed for the adaptation, acquiring them is typically expensive. To overcome this challenge, active lear… ▽ More

    Submitted 21 March, 2024; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: accepted at CVPR 2024

  48. arXiv:2311.09064  [pdf, other

    cs.CV cs.LG

    Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models

    Authors: Yeongbin Kim, Gautam Singh, Junyeong Park, Caglar Gulcehre, Sung** Ahn

    Abstract: Systematic compositionality, or the ability to adapt to novel situations by creating a mental model of the world using reusable pieces of knowledge, remains a significant challenge in machine learning. While there has been considerable progress in the language domain, efforts towards systematic visual imagination, or envisioning the dynamical implications of a visual observation, are in their infa… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Published as a conference paper at NeurIPS 2023. The first two authors contributed equally. To download the benchmark, visit https://systematic-visual-imagination.github.io

  49. arXiv:2310.17668  [pdf, other

    cs.LG

    Fine tuning Pre trained Models for Robustness Under Noisy Labels

    Authors: Sumyeong Ahn, Sihyeon Kim, Jongwoo Ko, Se-Young Yun

    Abstract: The presence of noisy labels in a training dataset can significantly impact the performance of machine learning models. To tackle this issue, researchers have explored methods for Learning with Noisy Labels to identify clean samples and reduce the influence of noisy labels. However, constraining the influence of a certain portion of the training dataset can result in a reduction in overall general… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 10 pages (17 pages including supplementary)

    MSC Class: Computer Science; Artificial Intelligence

  50. arXiv:2310.13312  [pdf, other

    cs.CL

    Exploring the Impact of Corpus Diversity on Financial Pretrained Language Models

    Authors: Jaeyoung Choe, Keonwoong Noh, Nayeon Kim, Seyun Ahn, Woohwan Jung

    Abstract: Over the past few years, various domain-specific pretrained language models (PLMs) have been proposed and have outperformed general-domain PLMs in specialized areas such as biomedical, scientific, and clinical domains. In addition, financial PLMs have been studied because of the high economic impact of financial data analysis. However, we found that financial PLMs were not pretrained on sufficient… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (Findings)