
Showing 1–50 of 306 results for author: Park, Y

  1. arXiv:2406.15951  [pdf, other]

    cs.CL

    Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration

    Authors: Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov

    Abstract: While existing alignment paradigms have been integral in developing large language models (LLMs), LLMs often learn an averaged human preference and struggle to model diverse preferences across cultures, demographics, and communities. We propose Modular Pluralism, a modular framework based on multi-LLM collaboration for pluralistic alignment: it "plugs into" a base LLM a pool of smaller but special…

    Submitted 22 June, 2024; originally announced June 2024.

  2. arXiv:2406.15725  [pdf, other]

    eess.AS cs.SD

    Self Training and Ensembling Frequency Dependent Networks with Coarse Prediction Pooling and Sound Event Bounding Boxes

    Authors: Hyeonuk Nam, Deokki Min, Seungdeok Choi, Inhan Choi, Yong-Hwa Park

    Abstract: To tackle sound event detection (SED) task, we propose frequency dependent networks (FreDNets), which heavily leverage frequency-dependent methods. We apply frequency warping and FilterAugment, which are frequency-dependent data augmentation methods. The model architecture consists of 3 branches: audio teacher-student transformer (ATST) branch, BEATs branch and CNN branch including either partial…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: DCASE 2024 Challenge Task 4 technical report

  3. arXiv:2406.13856  [pdf, other]

    cs.DB

    Kishu: Time-Traveling for Computational Notebooks

    Authors: Zhaoheng Li, Supawit Chockchowwat, Ribhav Sahu, Areet Sheth, Yongjoo Park

    Abstract: Computational notebooks (e.g., Jupyter, Google Colab) are widely used by data scientists. A key feature of notebooks is the interactive computing model of iteratively executing cells (i.e., a set of statements) and observing the result (e.g., model or plot). Unfortunately, existing notebook systems do not offer time-traveling to past states: when the user executes a cell, the notebook session stat…

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.13312  [pdf, other]

    eess.AS cs.SD

    Pushing the Limit of Sound Event Detection with Multi-Dilated Frequency Dynamic Convolution

    Authors: Hyeonuk Nam, Yong-Hwa Park

    Abstract: Frequency dynamic convolution (FDY conv) has been a milestone in the sound event detection (SED) field, but it involves a substantial increase in model size due to multiple basis kernels. In this work, we propose partial frequency dynamic convolution (PFD conv), which concatenates static conventional 2D convolution branch output and dynamic FDY conv branch output in order to minimize model size in…

    Submitted 19 June, 2024; originally announced June 2024.

  5. arXiv:2406.13280  [pdf, other]

    cs.NI cs.AI

    Design Optimization of NOMA Aided Multi-STAR-RIS for Indoor Environments: A Convex Approximation Imitated Reinforcement Learning Approach

    Authors: Yu Min Park, Sheikh Salman Hassan, Yan Kyaw Tun, Eui-Nam Huh, Walid Saad, Choong Seon Hong

    Abstract: Sixth-generation (6G) networks leverage simultaneously transmitting and reflecting reconfigurable intelligent surfaces (STAR-RISs) to overcome the limitations of traditional RISs. STAR-RISs offer 360-degree full-space coverage and optimized transmission and reflection for enhanced network performance and dynamic control of the indoor propagation environment. However, deploying STAR-RISs indoors pr…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 37 pages, 11 figures, IEEE Transactions on Communications submitted. arXiv admin note: text overlap with arXiv:2311.08708

  6. arXiv:2406.13251  [pdf, other]

    cs.CV cs.GR eess.IV

    Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields

    Authors: Youngin Park, Seungtae Nam, Cheul-hee Hahm, Eunbyung Park

    Abstract: Neural Radiance Fields (NeRF) have shown remarkable success in representing 3D scenes and generating novel views. However, they often struggle with aliasing artifacts, especially when rendering images from different camera distances from the training views. To address the issue, Mip-NeRF proposed using volumetric frustums to render a pixel and suggested integrated positional encoding (IPE). While…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted to ICIP 2024, 7 pages, 3 figures

  7. arXiv:2406.12904  [pdf, other]

    cs.LG physics.comp-ph physics.optics

    Meent: Differentiable Electromagnetic Simulator for Machine Learning

    Authors: Yongha Kim, Anthony W. Jung, Sanmun Kim, Kevin Octavian, Doyoung Heo, Chaejin Park, Jeongmin Shin, Sunghyun Nam, Chanhyung Park, Juho Park, Sangjun Han, Jinmyoung Lee, Seolho Kim, Min Seok Jang, Chan Y. Park

    Abstract: Electromagnetic (EM) simulation plays a crucial role in analyzing and designing devices with sub-wavelength scale structures such as solar cells, semiconductor devices, image sensors, future displays and integrated photonic devices. Specifically, optics problems such as estimating semiconductor device structures and designing nanophotonic devices provide intriguing research topics with far-reachin…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: under review

  8. arXiv:2406.11938  [pdf, other]

    cs.AI cs.MA

    Tracking the perspectives of interacting language models

    Authors: Hayden Helm, Brandon Duderstadt, Youngser Park, Carey E. Priebe

    Abstract: Large language models (LLMs) are capable of producing high quality information at unprecedented rates. As these models continue to entrench themselves in society, the content they produce will become increasingly pervasive in databases that are, in turn, incorporated into the pre-training data, fine-tuning data, retrieval data, etc. of other language models. In this paper we formalize the idea of…

    Submitted 17 June, 2024; originally announced June 2024.

  9. arXiv:2406.08070  [pdf, ps, other]

    cs.CV cs.AI cs.LG

    CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models

    Authors: Hyungjin Chung, Jeongsol Kim, Geon Yeong Park, Hyelin Nam, Jong Chul Ye

    Abstract: Classifier-free guidance (CFG) is a fundamental tool in modern diffusion models for text-guided generation. Although effective, CFG has notable drawbacks. For instance, DDIM with CFG lacks invertibility, complicating image editing; furthermore, high guidance scales, essential for high-quality outputs, frequently result in issues like mode collapse. Contrary to the widespread belief that these are…

    Submitted 12 June, 2024; originally announced June 2024.

  10. arXiv:2406.05341  [pdf, other]

    eess.AS cs.SD

    Diversifying and Expanding Frequency-Adaptive Convolution Kernels for Sound Event Detection

    Authors: Hyeonuk Nam, Seong-Hu Kim, Deokki Min, Junhyeok Lee, Yong-Hwa Park

    Abstract: Frequency dynamic convolution (FDY conv) has shown the state-of-the-art performance in sound event detection (SED) using frequency-adaptive kernels obtained by frequency-varying combination of basis kernels. However, FDY conv lacks an explicit mean to diversify frequency-adaptive kernels, potentially limiting the performance. In addition, size of basis kernels is limited while time-frequency patte…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 2024

  11. arXiv:2406.01079  [pdf, other]

    cs.CV cs.AI

    Object Aware Egocentric Online Action Detection

    Authors: Joungbin An, Yunsu Park, Hyolim Kang, Seon Joo Kim

    Abstract: Advancements in egocentric video datasets like Ego4D, EPIC-Kitchens, and Ego-Exo4D have enriched the study of first-person human interactions, which is crucial for applications in augmented reality and assisted living. Despite these advancements, current Online Action Detection methods, which efficiently detect actions in streaming videos, are predominantly designed for exocentric views and thus f…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: CVPR First Joint Egocentric Vision Workshop 2024

  12. arXiv:2405.20738  [pdf, other]

    cs.LG

    Federated Random Forest for Partially Overlapping Clinical Data

    Authors: Youngjun Park, Cord Eric Schmidt, Benedikt Marcel Batton, Anne-Christin Hauschild

    Abstract: In the healthcare sector, a consciousness surrounding data privacy and corresponding data protection regulations, as well as heterogeneous and non-harmonized data, pose huge challenges to large-scale data analysis. Moreover, clinical data often involves partially overlapping features, as some observations may be missing due to various reasons, such as differences in procedures, diagnostic tests, o…

    Submitted 31 May, 2024; originally announced May 2024.

  13. arXiv:2405.16658  [pdf, other]

    cs.LG cs.AI

    Acceleration of Grokking in Learning Arithmetic Operations via Kolmogorov-Arnold Representation

    Authors: Yeachan Park, Minseok Kim, Yeoneung Kim

    Abstract: We propose novel methodologies aimed at accelerating the grokking phenomenon, which refers to the rapid increment of test accuracy after a long period of overfitting as reported in~\cite{power2022grokking}. Focusing on the grokking phenomenon that arises in learning arithmetic binary operations via the transformer model, we begin with a discussion on data augmentation in the case of commutative bi…

    Submitted 26 May, 2024; originally announced May 2024.

  14. arXiv:2405.14150  [pdf, other]

    cs.CL

    jp-evalb: Robust Alignment-based PARSEVAL Measures

    Authors: Jungyeul Park, Junrui Wang, Eunkyul Leah Jo, Angela Yoonseo Park

    Abstract: We introduce an evaluation system designed to compute PARSEVAL measures, offering a viable alternative to \texttt{evalb} commonly used for constituency parsing evaluation. The widely used \texttt{evalb} script has traditionally been employed for evaluating the accuracy of constituency parsing results, albeit with the requirement for consistent tokenization and sentence boundaries. In contrast, our…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: To appear in The system demonstration track at NAACL-HLT 2024

  15. arXiv:2405.03637  [pdf, other]

    cs.LG

    Collage: Light-Weight Low-Precision Strategy for LLM Training

    Authors: Tao Yu, Gaurav Gupta, Karthick Gopalswamy, Amith Mamidala, Hao Zhou, Jeffrey Huynh, Youngsuk Park, Ron Diamant, Anoop Deoras, Luke Huan

    Abstract: Large models training is plagued by the intense compute cost and limited hardware memory. A practical solution is low-precision representation but is troubled by loss in numerical accuracy and unstable training rendering the model less useful. We argue that low-precision floating points can perform well provided the error is properly compensated at the critical locations in the training process. W…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  16. Towards Building Autonomous Data Services on Azure

    Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli , et al. (13 additional authors not shown)

    Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

  17. arXiv:2405.00828  [pdf, other]

    cs.CL

    WIBA: What Is Being Argued? A Comprehensive Approach to Argument Mining

    Authors: Arman Irani, Ju Yeon Park, Kevin Esterling, Michalis Faloutsos

    Abstract: We propose WIBA, a novel framework and suite of methods that enable the comprehensive understanding of "What Is Being Argued" across contexts. Our approach develops a comprehensive framework that detects: (a) the existence, (b) the topic, and (c) the stance of an argument, correctly accounting for the logical dependence among the three tasks. Our algorithm leverages the fine-tuning and prompt-engi…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 8 pages, 2 figures, submitted to The 16th International Conference on Advances in Social Networks Analysis and Mining (ASONAM) '24

  18. arXiv:2404.14687  [pdf, other]

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon, et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi…

    Submitted 22 April, 2024; originally announced April 2024.

  19. arXiv:2404.08080  [pdf, other]

    cs.LG cs.AI cs.CL math.OC

    Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models

    Authors: Tanmay Gautam, Youngsuk Park, Hao Zhou, Parameswaran Raman, Wooseok Ha

    Abstract: Fine-tuning language models (LMs) has demonstrated success in a wide array of downstream tasks. However, as LMs are scaled up, the memory requirements for backpropagation become prohibitively high. Zeroth-order (ZO) optimization methods can leverage memory-efficient forward passes to estimate gradients. More recently, MeZO, an adaptation of ZO-SGD, has been shown to consistently outperform zero-sh…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: 29 pages, 25 tables, 9 figures

  20. arXiv:2404.07308  [pdf, other]

    cs.LG

    Spatial Transfer Learning for Estimating PM2.5 in Data-poor Regions

    Authors: Shrey Gupta, Yongbee Park, Jianzhao Bi, Suyash Gupta, Andreas Züfle, Avani Wildani, Yang Liu

    Abstract: Air pollution, especially particulate matter 2.5 (PM2.5), is a pressing concern for public health and is difficult to estimate in developing countries (data-poor regions) due to a lack of ground sensors. Transfer learning models can be leveraged to solve this problem, as they use alternate data sources to gain knowledge (i.e., data from data-rich regions). However, current transfer learning method…

    Submitted 22 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted for publication at ECML-PKDD 2024

  21. arXiv:2404.06664  [pdf, other]

    cs.CL cs.AI cs.HC

    CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

    Authors: Yu Ying Chiu, Liwei Jiang, Maria Antoniak, Chan Young Park, Shuyue Stella Li, Mehar Bhatia, Sahithya Ravi, Yulia Tsvetkov, Vered Shwartz, Yejin Choi

    Abstract: Frontier large language models (LLMs) are developed by researchers and practitioners with skewed cultural backgrounds and on datasets with skewed sources. However, LLMs' (lack of) multicultural knowledge cannot be effectively assessed with current methods for developing benchmarks. Existing multicultural evaluations primarily rely on expensive and restricted human annotations or potentially outdat…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Preprint (under review)

  22. arXiv:2404.06641  [pdf]

    cs.LG cs.AI cs.CY

    Federated learning model for predicting major postoperative complications

    Authors: Yonggi Park, Yuanfang Ren, Benjamin Shickel, Ziyuan Guan, Ayush Patela, Yingbo Ma, Zhenhong Hu, Tyler J. Loftus, Parisa Rashidi, Tezcan Ozrazgat-Baslanti, Azra Bihorac

    Abstract: Background: The accurate prediction of postoperative complication risk using Electronic Health Records (EHR) and artificial intelligence shows great potential. Training a robust artificial intelligence model typically requires large-scale and diverse datasets. In reality, collecting medical data often encounters challenges surrounding privacy protection. Methods: This retrospective cohort study in…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 57 pages. 2 figures, 3 tables, 2 supplemental figures, 8 supplemental tables

  23. arXiv:2404.03991  [pdf, other]

    eess.IV cs.CV cs.LG

    Towards Efficient and Accurate CT Segmentation via Edge-Preserving Probabilistic Downsampling

    Authors: Shahzad Ali, Yu Rim Lee, Soo Young Park, Won Young Tak, Soon Ki Jung

    Abstract: Downsampling images and labels, often necessitated by limited resources or to expedite network training, leads to the loss of small objects and thin boundaries. This undermines the segmentation network's capacity to interpret images accurately and predict detailed labels, resulting in diminished performance compared to processing at original resolutions. This situation exemplifies the trade-off be…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 5 pages (4 figures, 1 table); This work has been submitted to the IEEE Signal Processing Letters. Copyright may be transferred without notice, after which this version may no longer be accessible

  24. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  25. arXiv:2404.01709  [pdf, other]

    cs.CV cs.AI

    Upsample Guidance: Scale Up Diffusion Models without Training

    Authors: Juno Hwang, Yong-Hyun Park, Junghyo Jo

    Abstract: Diffusion models have demonstrated superior performance across various generative tasks including images, videos, and audio. However, they encounter difficulties in directly generating high-resolution samples. Previously proposed solutions to this issue involve modifying the architecture, further training, or partitioning the sampling process into multiple stages. These methods have the limitation…

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 15 pages, 15 Figures

  26. arXiv:2403.18452  [pdf, other]

    cs.CV cs.LG cs.RO

    SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model

    Authors: Inhwan Bae, Young-Jae Park, Hae-Gon Jeon

    Abstract: There are five types of trajectory prediction tasks: deterministic, stochastic, domain adaptation, momentary observation, and few-shot. These associated tasks are defined by various factors, such as the length of input paths, data split and pre-processing methods. Interestingly, even though they commonly take sequential coordinates of observations as input and infer future paths in the same coordi…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024

  27. arXiv:2403.15249  [pdf, other]

    cs.CV cs.AI cs.LG

    Spectral Motion Alignment for Video Motion Transfer using Diffusion Models

    Authors: Geon Yeong Park, Hyeonho Jeong, Sang Wan Lee, Jong Chul Ye

    Abstract: The evolution of diffusion models has greatly impacted video generation and understanding. Particularly, text-to-video diffusion models (VDMs) have significantly facilitated the customization of input video with target appearance, motion, etc. Despite these advances, challenges persist in accurately distilling motion information from video frames. While existing works leverage the consecutive fram…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Project page: https://geonyeong-park.github.io/spectral-motion-alignment/

  28. arXiv:2403.12002  [pdf, other]

    cs.CV cs.AI

    DreamMotion: Space-Time Self-Similarity Score Distillation for Zero-Shot Video Editing

    Authors: Hyeonho Jeong, Jinho Chang, Geon Yeong Park, Jong Chul Ye

    Abstract: Text-driven diffusion-based video editing presents a unique challenge not encountered in image editing literature: establishing real-world motion. Unlike existing video editing approaches, here we focus on score distillation sampling to circumvent the standard reverse diffusion process and initiate optimization from videos that already exhibit natural motion. Our analysis reveals that while video…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Project page: https://hyeonho99.github.io/dreammotion/

  29. arXiv:2403.11415  [pdf, other]

    cs.CV cs.AI cs.LG

    DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation

    Authors: Jeongsol Kim, Geon Yeong Park, Jong Chul Ye

    Abstract: Reverse sampling and score-distillation have emerged as main workhorses in recent years for image manipulation using latent diffusion models (LDMs). While reverse diffusion sampling often requires adjustments of LDM architecture or feature engineering, score distillation offers a simple yet powerful model-agnostic approach, but it is often prone to mode-collapsing. To address these limitations and…

    Submitted 17 March, 2024; originally announced March 2024.

  30. arXiv:2403.03408  [pdf, other]

    cs.CV

    Scene Depth Estimation from Traditional Oriental Landscape Paintings

    Authors: Sungho Kang, YeongHyeon Park, Hyunkyu Park, Juneho Yi

    Abstract: Scene depth estimation from paintings can streamline the process of 3D sculpture creation so that visually impaired people appreciate the paintings with tactile sense. However, measuring depth of oriental landscape painting images is extremely challenging due to its unique method of depicting depth and poor preservation. To address the problem of scene depth estimation from oriental landscape pain…

    Submitted 6 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  31. A Scalable and Transferable Time Series Prediction Framework for Demand Forecasting

    Authors: Young-Jin Park, Donghyun Kim, Frédéric Odermatt, Juho Lee, Kyung-Min Kim

    Abstract: Time series forecasting is one of the most essential and ubiquitous tasks in many business problems, including demand forecasting and logistics optimization. Traditional time series forecasting methods, however, have resulted in small models with limited expressive power because they have difficulty in scaling their model size up while maintaining high accuracy. In this paper, we propose Forecasti…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Published as a full paper at ICDM 2022

  32. arXiv:2402.17862  [pdf, other]

    cs.CV cs.AI

    REPrune: Channel Pruning via Kernel Representative Selection

    Authors: Mincheol Park, Dongjin Kim, Cheonjun Park, Yuna Park, Gyeong Eun Gong, Won Woo Ro, Suhyun Kim

    Abstract: Channel pruning is widely accepted to accelerate modern convolutional neural networks (CNNs). The resulting pruned model benefits from its immediate deployment on general-purpose software and hardware resources. However, its large pruning granularity, specifically at the unit of a convolution filter, often leads to undesirable accuracy drops due to the inflexibility of deciding how and where to in…

    Submitted 8 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Published at AAAI2024

  33. arXiv:2402.15019  [pdf, other]

    cs.LG cs.AI stat.ML

    Consistency-Guided Temperature Scaling Using Style and Content Information for Out-of-Domain Calibration

    Authors: Wonjeong Choi, Jungwuk Park, Dong-Jun Han, Younghyun Park, Jaekyun Moon

    Abstract: Research interests in the robustness of deep neural networks against domain shifts have been rapidly increasing in recent years. Most existing works, however, focus on improving the accuracy of the model, not the calibration performance which is another important requirement for trustworthy AI systems. Temperature scaling (TS), an accuracy-preserving post-hoc calibration method, has been proven to…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted at AAAI-24 (The 38th AAAI Conference on Artificial Intelligence, February 2024)

  34. arXiv:2402.10645  [pdf, other]

    cs.CL cs.AI

    Can Separators Improve Chain-of-Thought Prompting?

    Authors: Yoonjeong Park, Hyunjin Kim, Chanyeol Choi, Junseong Kim, Jy-yong Sohn

    Abstract: Chain-of-thought (CoT) prompting is a simple and effective method for improving the reasoning capabilities of Large language models (LLMs). The basic idea of CoT is to let LLMs break down their thought processes step-by-step by putting exemplars in the input prompt. However, the densely structured prompt exemplars of CoT may cause the cognitive overload of LLMs. Inspired by human cognition, we int…

    Submitted 16 February, 2024; originally announced February 2024.

  35. arXiv:2402.10517  [pdf, other]

    cs.LG

    Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

    Authors: Yeonhong Park, Jake Hyun, SangLyul Cho, Bonggeun Sim, Jae W. Lee

    Abstract: Recently, considerable efforts have been directed towards compressing Large Language Models (LLMs), which showcase groundbreaking capabilities across diverse applications but entail significant deployment costs due to their large sizes. Meanwhile, much less attention has been given to mitigating the costs associated with deploying multiple LLMs of varying sizes despite its practical significance.…

    Submitted 21 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: To appear at ICML 2024. Code is available at https://github.com/SNU-ARC/any-precision-llm

  36. arXiv:2402.04287  [pdf]

    q-bio.NC cs.ET quant-ph

    Association between Prefrontal fNIRS signals during Cognitive tasks and College scholastic ability test (CSAT) scores: Analysis using a quantum annealing approach

    Authors: Yeaju Kim, Junggu Choi, Bora Kim, Yongwan Park, Jihyun Cha, Jongkwan Choi, Sanghoon Han

    Abstract: Academic achievement is a critical measure of intellectual ability, prompting extensive research into cognitive tasks as potential predictors. Neuroimaging technologies, such as functional near-infrared spectroscopy (fNIRS), offer insights into brain hemodynamics, allowing understanding of the link between cognitive performance and academic achievement. Herein, we explored the association between…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 42 pages, 11 tables

  37. arXiv:2401.16731  [pdf, other]

    cs.CL cs.AI

    Towards Generating Informative Textual Description for Neurons in Language Models

    Authors: Shrayani Mondal, Rishabh Garodia, Arbaaz Qureshi, Taesung Lee, Youngja Park

    Abstract: Recent developments in transformer-based language models have allowed them to capture a wide variety of world knowledge that can be adapted to downstream tasks with limited resources. However, what pieces of information are understood in these models is unclear, and neuron-level contributions in identifying them are largely unknown. Conventional approaches in neuron explainability either depend on…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: AAAI 2024

  38. arXiv:2401.15726  [pdf, other]

    cs.CV

    Long-Term Typhoon Trajectory Prediction: A Physics-Conditioned Approach Without Reanalysis Data

    Authors: Young-Jae Park, Minseok Seo, Doyi Kim, Hyeri Kim, Sanghoon Choi, Beomkyu Choi, Jeongwon Ryu, Sohee Son, Hae-Gon Jeon, Yeji Choi

    Abstract: In the face of escalating climate changes, typhoon intensities and their ensuing damage have surged. Accurate trajectory prediction is crucial for effective damage control. Traditional physics-based models, while comprehensive, are computationally intensive and rely heavily on the expertise of forecasters. Contemporary data-driven methods often rely on reanalysis data, which can be considered to b…

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: This paper was accepted for a Spotlight presentation at ICLR 2024

  39. arXiv:2401.15121  [pdf, ps, other]

    cs.LG cs.AI

    Expressive Power of ReLU and Step Networks under Floating-Point Operations

    Authors: Yeachan Park, Geonho Hwang, Wonyeol Lee, Sejun Park

    Abstract: The study of the expressive power of neural networks has investigated the fundamental limits of neural networks. Most existing results assume real-valued inputs and parameters as well as exact operations during the evaluation of neural networks. However, neural networks are typically executed on computers that can only represent a tiny subset of the reals and apply inexact operations. In this work…

    Submitted 26 January, 2024; originally announced January 2024.

  40. arXiv:2401.11419  [pdf, other]

    cs.NI eess.SP

    Joint UAV Deployment and Resource Allocation in THz-Assisted MEC-Enabled Integrated Space-Air-Ground Networks

    Authors: Yan Kyaw Tun, György Dán, Yu Min Park, Choong Seon Hong

    Abstract: Multi-access edge computing (MEC)-enabled integrated space-air-ground (SAG) networks have drawn much attention recently, as they can provide communication and computing services to wireless devices in areas that lack terrestrial base stations (TBSs). Leveraging the ample bandwidth in the terahertz (THz) spectrum, in this paper, we propose MEC-enabled integrated SAG networks with collaboration amon…

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 36 pages, 8 figures

  41. arXiv:2401.10247  [pdf, other]

    cs.CV cs.LG

    Resolution Chromatography of Diffusion Models

    Authors: Juno Hwang, Yong-Hyun Park, Junghyo Jo

    Abstract: Diffusion models generate high-resolution images through iterative stochastic processes. In particular, the denoising method is one of the most popular approaches that predicts the noise in samples and denoises it at each time step. It has been commonly observed that the resolution of generated samples changes over time, starting off blurry and coarse, and becoming sharper and finer. In this paper…

    Submitted 6 December, 2023; originally announced January 2024.

    Comments: 24 pages, 9 figures

  42. arXiv:2401.10124  [pdf, other]

    stat.ME cs.SI physics.soc-ph stat.AP

    Lower Ricci Curvature for Efficient Community Detection

    Authors: Yun Jin Park, Didong Li

    Abstract: This study introduces the Lower Ricci Curvature (LRC), a novel, scalable, and scale-free discrete curvature designed to enhance community detection in networks. Addressing the computational challenges posed by existing curvature-based methods, LRC offers a streamlined approach with linear computational complexity, making it well-suited for large-scale network analysis. We further develop an LRC-ba…

    Submitted 27 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  43. arXiv:2401.04437  [pdf, other]

    cs.CV cs.AI

    Empirical Analysis of Anomaly Detection on Hyperspectral Imaging Using Dimension Reduction Methods

    Authors: Dongeon Kim, YeongHyeon Park

    Abstract: Recent studies try to use hyperspectral imaging (HSI) to detect foreign matters in products because it enables to visualize the invisible wavelengths including ultraviolet and infrared. Considering the enormous image channels of the HSI, several dimension reduction methods-e.g., PCA or UMAP-can be considered to reduce but those cannot ease the fundamental limitations, as follows: (1) latency of HS…

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 4 pages, 4 figures, 3 tables

  44. arXiv:2312.06176  [pdf, other]

    quant-ph cs.AI

    Improvement in Variational Quantum Algorithms by Measurement Simplification

    Authors: Jaehoon Hahm, Hayeon Kim, Young June Park

    Abstract: Variational Quantum Algorithms (VQAs) are expected to be promising algorithms with quantum advantages that can be run at quantum computers in the close future. In this work, we review simple rules in basic quantum circuits, and propose a simplification method, Measurement Simplification, that simplifies the expression for the measurement of quantum circuit. By the Measurement Simplification, we si…

    Submitted 11 December, 2023; originally announced December 2023.

  45. arXiv:2312.00845  [pdf, other]

    cs.CV cs.AI cs.LG

    VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models

    Authors: Hyeonho Jeong, Geon Yeong Park, Jong Chul Ye

    Abstract: Text-to-video diffusion models have advanced video generation significantly. However, customizing these models to generate videos with tailored motions presents a substantial challenge. In specific, they encounter hurdles in (a) accurately reproducing motion from a target video, and (b) creating diverse visual variations. For example, straightforward extensions of static image customization method…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Project page: https://video-motion-customization.github.io

  46. arXiv:2311.18608  [pdf, other]

    cs.CV cs.AI cs.LG

    Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing

    Authors: Hyelin Nam, Gihyun Kwon, Geon Yeong Park, Jong Chul Ye

    Abstract: With the remarkable advent of text-to-image diffusion models, image editing methods have become more diverse and continue to evolve. A promising recent approach in this realm is Delta Denoising Score (DDS) - an image editing technique based on Score Distillation Sampling (SDS) framework that leverages the rich generative prior of text-to-image diffusion models. However, relying solely on the diffe…

    Submitted 1 April, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 (poster); Project page: https://hyelinnam.github.io/CDS/

  47. arXiv:2311.15658  [pdf, other]

    cs.CV cs.AI cs.LG

    Regularization by Texts for Latent Diffusion Inverse Solvers

    Authors: Jeongsol Kim, Geon Yeong Park, Hyungjin Chung, Jong Chul Ye

    Abstract: The recent advent of diffusion models has led to significant progress in solving inverse problems, leveraging these models as effective generative priors. Nonetheless, there remain challenges related to the ill-posed nature of such problems, often due to inherent ambiguities in measurements or intrinsic system symmetries. To address this, drawing inspiration from the human ability to resolve visua…

    Submitted 16 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  48. arXiv:2311.11812  [pdf, other]

    cs.AI

    Improving Real Estate Appraisal with POI Integration and Areal Embedding

    Authors: Sumin Han, Youngjun Park, Sonia Sabir, Jisun An, Dongman Lee

    Abstract: Despite advancements in real estate appraisal methods, this study primarily focuses on two pivotal challenges. Firstly, we explore the often-underestimated impact of Points of Interest (POI) on property values, emphasizing the necessity for a comprehensive, data-driven approach to feature selection. Secondly, we integrate road-network-based Areal Embedding to enhance spatial understanding for real…

    Submitted 20 November, 2023; originally announced November 2023.

  49. arXiv:2311.09741  [pdf, other]

    cs.CL cs.LG

    P^3SUM: Preserving Author's Perspective in News Summarization with Diffusion Language Models

    Authors: Yuhan Liu, Shangbin Feng, Xiaochuang Han, Vidhisha Balachandran, Chan Young Park, Sachin Kumar, Yulia Tsvetkov

    Abstract: In this work, we take a first step towards designing summarization systems that are faithful to the author's intent, not only the semantic content of the article. Focusing on a case study of preserving political perspectives in news summarization, we find that existing approaches alter the political opinions and stances of news articles in more than 50% of summaries, misrepresenting the intent and…

    Submitted 4 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  50. arXiv:2311.08708  [pdf, other]

    cs.IT cs.AI cs.NI

    Joint User Pairing and Beamforming Design of Multi-STAR-RISs-Aided NOMA in the Indoor Environment via Multi-Agent Reinforcement Learning

    Authors: Yu Min Park, Yan Kyaw Tun, Choong Seon Hong

    Abstract: The development of 6G/B5G wireless networks, which have requirements that go beyond current 5G networks, is gaining interest from academia and industry. However, to increase 6G/B5G network quality, conventional cellular networks that rely on terrestrial base stations are constrained geographically and economically. Meanwhile, NOMA allows multiple users to share the same resources, which improves t…

    Submitted 16 November, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages, 9 figures, IEEE/IFIP Network Operations and Management Symposium (NOMS) 2024 submitted