
Showing 1–11 of 11 results for author: Sim, B

  1. arXiv:2402.10517  [pdf, other]

    cs.LG

    Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

    Authors: Yeonhong Park, Jake Hyun, SangLyul Cho, Bonggeun Sim, Jae W. Lee

    Abstract: Recently, considerable efforts have been directed towards compressing Large Language Models (LLMs), which showcase groundbreaking capabilities across diverse applications but entail significant deployment costs due to their large sizes. Meanwhile, much less attention has been given to mitigating the costs associated with deploying multiple LLMs of varying sizes despite its practical significance.…

    Submitted 21 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: To appear at ICML 2024. Code is available at https://github.com/SNU-ARC/any-precision-llm

  2. arXiv:2209.13394  [pdf, other]

    cs.LG

    Magnitude and Angle Dynamics in Training Single ReLU Neurons

    Authors: Sangmin Lee, Byeongsu Sim, Jong Chul Ye

    Abstract: To understand the learning dynamics of deep ReLU networks, we investigate the dynamic system of gradient flow $w(t)$ by decomposing it into magnitude $\|w(t)\|$ and angle $\phi(t) := \pi - \theta(t)$ components. In particular, for multi-layer single ReLU neurons with spherically symmetric data distribution and the square loss function, we provide upper and lower bounds for magnitude and angle components to describ…

    Submitted 11 October, 2022; v1 submitted 27 September, 2022; originally announced September 2022.

  3. arXiv:2208.06775  [pdf, other]

    physics.bio-ph

    The high-resolution in vivo measurement of replication fork velocity and pausing by lag-time analysis

    Authors: Dean Huang, Anna E. Johnson, Brandon S. Sim, Teresa Lo, Houra Merrikh, Paul A. Wiggins

    Abstract: An important step towards understanding the mechanistic basis of the central dogma is the quantitative characterization of the dynamics of nucleic-acid-bound molecular motors in the context of the living cell, where a crowded cytoplasm as well as competing and potentially antagonistic processes may significantly affect their rapidity and reliability. To capture these dynamics, we develop a novel m…

    Submitted 6 September, 2022; v1 submitted 14 August, 2022; originally announced August 2022.

    Comments: 37 pages, 24 figures

  4. arXiv:2206.00941  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Improving Diffusion Models for Inverse Problems using Manifold Constraints

    Authors: Hyungjin Chung, Byeongsu Sim, Dohoon Ryu, Jong Chul Ye

    Abstract: Recently, diffusion models have been used to solve various inverse problems in an unsupervised manner with appropriate modifications to the sampling process. However, the current solvers, which recursively apply a reverse diffusion step followed by a projection-based measurement consistency step, often produce suboptimal results. By studying the generative sampling path, here we show that current…

    Submitted 20 May, 2024; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022 camera-ready; 29 pages, 16 figures

  5. Transformer Network-based Reinforcement Learning Method for Power Distribution Network (PDN) Optimization of High Bandwidth Memory (HBM)

    Authors: Hyunwook Park, Minsu Kim, Seongguk Kim, Keunwoo Kim, Haeyeon Kim, Taein Shin, Keeyoung Son, Boogyo Sim, Subin Kim, Seungtaek Jeong, Chulsoon Hwang, Joungho Kim

    Abstract: In this article, for the first time, we propose a transformer network-based reinforcement learning (RL) method for power distribution network (PDN) optimization of high bandwidth memory (HBM). The proposed method can provide an optimal decoupling capacitor (decap) design to maximize the reduction of PDN self- and transfer impedance seen at multiple ports. An attention-based transformer network is…

    Submitted 23 August, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: 15 pages, 14 figures. Under review as a journal paper at IEEE Transactions on Microwave Theory and Techniques (TMTT); Fig. 10 revised; Fig. 14 added

  6. arXiv:2202.05510  [pdf, other]

    cs.LG cs.AI stat.ML

    Support Vectors and Gradient Dynamics of Single-Neuron ReLU Networks

    Authors: Sangmin Lee, Byeongsu Sim, Jong Chul Ye

    Abstract: Understanding implicit bias of gradient descent for generalization capability of ReLU networks has been an important research topic in machine learning research. Unfortunately, even for a single ReLU neuron trained with the square loss, it was recently shown impossible to characterize the implicit regularization in terms of a norm of model parameters (Vardi & Shamir, 2021). In order to close the g…

    Submitted 13 June, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

  7. arXiv:2112.05146  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction

    Authors: Hyungjin Chung, Byeongsu Sim, Jong Chul Ye

    Abstract: Diffusion models have recently attained significant interest within the community owing to their strong performance as generative models. Furthermore, their application to inverse problems has demonstrated state-of-the-art performance. Unfortunately, diffusion models have a critical downside - they are inherently slow to sample from, needing a few thousand steps of iteration to generate images from p…

    Submitted 19 March, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted to CVPR 2022

  8. arXiv:2008.12967  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Unpaired Deep Learning for Accelerated MRI using Optimal Transport Driven CycleGAN

    Authors: Gyutaek Oh, Byeongsu Sim, Hyungjin Chung, Leonard Sunwoo, Jong Chul Ye

    Abstract: Recently, deep learning approaches for accelerated MRI have been extensively studied thanks to their high performance reconstruction in spite of significantly reduced runtime complexity. These neural networks are usually trained in a supervised manner, so matched pairs of subsampled and fully sampled k-space data are required. Unfortunately, it is often difficult to acquire matched fully sampled k…

    Submitted 29 August, 2020; originally announced August 2020.

    Comments: Accepted for IEEE Transactions on Computational Imaging

  9. arXiv:1912.05073  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    A Comprehensive Model of the Degradation of Organic Light-Emitting Diodes and Application for Efficient Stable Blue Phosphorescent Devices with Reduced Influence of Polarons

    Authors: Bomi Sim, Jong Soo Kim, Hyejin Bae, Sungho Nam, Eunsuk Kwon, Ji Whan Kim, Hwa-Young Cho, Sunghan Kim, Jang-Joo Kim

    Abstract: We present a comprehensive model to analyze, quantitatively, and predict the process of degradation of organic light-emitting diodes (OLEDs) considering all possible degradation mechanisms, i.e., polaron, exciton, exciton-polaron interactions, exciton-exciton interactions, and a newly proposed impurity effect. The loss of efficiency during degradation is presented as a function of quencher density…

    Submitted 10 December, 2019; originally announced December 2019.

    Journal ref: Phys. Rev. Applied 14, 024002 (2020)

  10. arXiv:1909.12116  [pdf, other]

    cs.CV cs.LG eess.IV stat.ML

    Optimal Transport driven CycleGAN for Unsupervised Learning in Inverse Problems

    Authors: Byeongsu Sim, Gyutaek Oh, Jeongsol Kim, Chanyong Jung, Jong Chul Ye

    Abstract: To improve the performance of the classical generative adversarial network (GAN), the Wasserstein generative adversarial network (W-GAN) was developed as a Kantorovich dual formulation of the optimal transport (OT) problem using Wasserstein-1 distance. However, it was not clear how cycleGAN-type generative models can be derived from the optimal transport theory. Here we show that a novel cycleGAN archite…

    Submitted 30 August, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: Accepted for publication in the SIAM Journal on Imaging Sciences

  11. arXiv:1506.00010  [pdf, other]

    astro-ph.IM

    FATS: Feature Analysis for Time Series

    Authors: Isadora Nun, Pavlos Protopapas, Brandon Sim, Ming Zhu, Rahul Dave, Nicolas Castro, Karim Pichara

    Abstract: In this paper, we present the FATS (Feature Analysis for Time Series) library. FATS is a Python library which facilitates and standardizes feature extraction for time series data. In particular, we focus on one application: feature extraction for astronomical light curve data, although the library is generalizable for other uses. We detail the methods and features implemented for light curve analy…

    Submitted 31 August, 2015; v1 submitted 29 May, 2015; originally announced June 2015.