Showing 1–28 of 28 results for author: Rim, D

  1. arXiv:2406.14091  [pdf, other]

    cs.CL

    Protecting Privacy Through Approximating Optimal Parameters for Sequence Unlearning in Language Models

    Authors: Dohyun Lee, Daniel Rim, Minseok Choi, Jaegul Choo

    Abstract: Although language models (LMs) demonstrate exceptional capabilities on various tasks, they are potentially vulnerable to extraction attacks, which represent a significant privacy risk. To mitigate the privacy concerns of LMs, machine unlearning has emerged as an important research area, which is utilized to induce the LM to selectively forget about some of its training data. While completely retra…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL2024 findings

  2. arXiv:2406.12329  [pdf, other]

    cs.CL

    SNAP: Unlearning Selective Knowledge in Large Language Models with Negative Instructions

    Authors: Minseok Choi, Daniel Rim, Dohyun Lee, Jaegul Choo

    Abstract: Instruction-following large language models (LLMs), such as ChatGPT, have become increasingly popular with the general audience, many of whom are incorporating them into their daily routines. However, these LLMs inadvertently disclose personal or copyrighted information, which calls for a machine unlearning method to remove selective knowledge. Previous attempts sought to forget the link between t…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 16 pages, 5 figures

  3. arXiv:2406.05694  [pdf, other]

    math.NA cs.LG

    A Low Rank Neural Representation of Entropy Solutions

    Authors: Donsub Rim, Gerrit Welper

    Abstract: We construct a new representation of entropy solutions to nonlinear scalar conservation laws with a smooth convex flux function in a single spatial dimension. The representation is a generalization of the method of characteristics and possesses a compositional form. While it is a nonlinear representation, the embedded dynamics of the solution in the time variable is linear. This representation is t…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 42 pages, 9 figures

    MSC Class: 68T07; 41A46; 41A25; 65N15; 35L65

  4. Enhanced Labeling Technique for Reddit Text and Fine-Tuned Longformer Models for Classifying Depression Severity in English and Luganda

    Authors: Richard Kimera, Daniela N. Rim, Joseph Kirabira, Ubong Godwin Udomah, Heeyoul Choi

    Abstract: Depression is a global burden and one of the most challenging mental health conditions to control. Experts can detect its severity early using the Beck Depression Inventory (BDI) questionnaire, administer appropriate medication to patients, and impede its progression. Due to the fear of potential stigmatization, many patients turn to social media platforms like Reddit for advice and assistance at…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: In IEEE Proceedings of the 14th International Conference on ICT Convergence (ICTC), Jeju, Korea, October 2023

  5. arXiv:2312.08553  [pdf, other]

    eess.AS cs.SD

    USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

    Authors: Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal

    Abstract: End-to-end automatic speech recognition (ASR) models have seen revolutionary quality gains with the recent development of large-scale universal speech models (USM). However, deploying these massive USMs is extremely expensive due to the enormous memory usage and computational cost. Therefore, model compression is an important research topic to fit USM-based ASR under budget in real-world scenarios…

    Submitted 16 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024. Preprint

  6. arXiv:2310.14391  [pdf, ps, other]

    math.NA

    Performance bounds for Reduced Order Models with Application to Parametric Transport

    Authors: D. Rim, G. Welper

    Abstract: The Kolmogorov $n$-width is an established benchmark to judge the performance of reduced basis and similar methods that produce linear reduced spaces. Although immensely successful in the elliptic regime, this width shows unsatisfactorily slow convergence rates for transport-dominated problems. While this has triggered a large amount of work on nonlinear model reduction techniques, we are lacking a…

    Submitted 22 October, 2023; originally announced October 2023.

    MSC Class: 41A46; 41A25; 65N15

  7. arXiv:2310.09528  [pdf, other]

    cs.LG math.NA physics.comp-ph

    Hypernetwork-based Meta-Learning for Low-Rank Physics-Informed Neural Networks

    Authors: Woojin Cho, Kookjin Lee, Donsub Rim, Noseong Park

    Abstract: In various engineering and applied science applications, repetitive numerical simulations of partial differential equations (PDEs) for varying input parameters are often required (e.g., aircraft shape optimization over many design parameters) and solvers are required to perform rapid execution. In this study, we suggest a path that potentially opens up a possibility for physics-informed neural net…

    Submitted 14 October, 2023; originally announced October 2023.

  8. arXiv:2308.08153  [pdf, other]

    cs.CL

    Fast Training of NMT Model with Data Sorting

    Authors: Daniela N. Rim, Kimera Richard, Heeyoul Choi

    Abstract: The Transformer model has revolutionized Natural Language Processing tasks such as Neural Machine Translation, and many efforts have been made to study the Transformer architecture, which increased its efficiency and accuracy. One potential area for improvement is to address the computation of empty tokens that the Transformer computes only to discard them later, leading to an unnecessary computat…

    Submitted 16 August, 2023; originally announced August 2023.

  9. arXiv:2305.16619  [pdf, other]

    eess.AS

    2-bit Conformer quantization for automatic speech recognition

    Authors: Oleg Rybakov, Phoenix Meadowlark, Shaojin Ding, David Qiu, Jian Li, David Rim, Yanzhang He

    Abstract: Large speech models are rapidly gaining traction in the research community. As a result, model compression has become an important topic, so that these models can fit in memory and be served at reduced cost. Practical approaches for compressing automatic speech recognition (ASR) models use int8 or int4 weight quantization. In this study, we propose to develop 2-bit ASR models. We explore the impact o…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: submitted to Interspeech

  10. arXiv:2305.15536  [pdf, other]

    eess.AS cs.LG

    RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models

    Authors: David Qiu, David Rim, Shaojin Ding, Oleg Rybakov, Yanzhang He

    Abstract: With the rapid increase in the size of neural networks, model compression has become an important area of research. Quantization is an effective technique for decreasing the model size, memory access, and compute load of large models. Despite recent advances in quantization aware training (QAT) techniques, most papers present evaluations that are focused on computer vision tasks, which have differen…

    Submitted 24 May, 2023; originally announced May 2023.

  11. arXiv:2305.04720  [pdf, other]

    cs.CL

    DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation

    Authors: ChaeHun Park, Seungil Chad Lee, Daniel Rim, Jaegul Choo

    Abstract: Despite the recent advances in open-domain dialogue systems, building a reliable evaluation metric is still a challenging problem. Recent studies proposed learnable metrics based on classification models trained to distinguish the correct response. However, neural classifiers are known to make overly confident predictions for examples from unseen distributions. We propose DEnsity, which evaluates…

    Submitted 25 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Findings of ACL2023. 13 pages

  12. Building a Parallel Corpus and Training Translation Models Between Luganda and English

    Authors: Richard Kimera, Daniela N. Rim, Heeyoul Choi

    Abstract: Neural machine translation (NMT) has achieved great success with large datasets, so NMT is more premised on high-resource languages. This continually disadvantages low-resource languages such as Luganda due to the lack of high-quality parallel corpora, so even 'Google Translate' does not serve Luganda at the time of this writing. In this paper, we build a parallel corpus with 41,070 pairwise se…

    Submitted 6 January, 2023; originally announced January 2023.

    Journal ref: Journal of KIISE, Vol. 49, No. 11, pp. 1009-1016, 2022. 11

  13. arXiv:2210.04958  [pdf, other]

    cs.LG stat.ME

    Mining Causality from Continuous-time Dynamics Models: An Application to Tsunami Forecasting

    Authors: Fan Wu, Sanghyun Hong, Donsub Rim, Noseong Park, Kookjin Lee

    Abstract: Continuous-time dynamics models, such as neural ordinary differential equations, have enabled the modeling of underlying dynamics in time-series data and accurate forecasting. However, parameterization of dynamics using a neural network makes it difficult for humans to identify causal structures in the data. In consequence, this opaqueness hinders the use of these models in the domains where captu…

    Submitted 13 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

  14. arXiv:2109.09075  [pdf, other]

    cs.CL

    Adversarial Training with Contrastive Learning in NLP

    Authors: Daniela N. Rim, DongNyeong Heo, Heeyoul Choi

    Abstract: For years, adversarial training has been extensively studied in natural language processing (NLP) settings. The main goal is to make models robust so that similar inputs result in semantically similar outcomes, which is not a trivial problem since there is no objective measure of semantic similarity in language. Previous works use an external pre-trained NLP model to tackle this challenge, introdu…

    Submitted 19 September, 2021; originally announced September 2021.

  15. arXiv:2105.11681  [pdf, other]

    cs.LG cs.SD eess.AS

    Deep Neural Networks and End-to-End Learning for Audio Compression

    Authors: Daniela N. Rim, Inseon Jang, Heeyoul Choi

    Abstract: Recent achievements in end-to-end deep learning have encouraged the exploration of tasks dealing with highly structured data with unified deep network models. Having such models for compressing audio signals has been challenging since it requires discrete representations that are not easy to train with end-to-end backpropagation. In this paper, we present an end-to-end deep learning approach that…

    Submitted 13 July, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

  16. arXiv:2010.05360  [pdf, other]

    math.NA cs.CV

    A range characterization of the single-quadrant ADRT

    Authors: Weilin Li, Kui Ren, Donsub Rim

    Abstract: This work characterizes the range of the single-quadrant approximate discrete Radon transform (ADRT) of square images. The characterization follows from a set of linear constraints on the codomain. We show that for data satisfying these constraints, the exact and fast inversion formula [Rim, Appl. Math. Lett. 102 106159, 2020] yields a square image in a stable manner. The range characterization is…

    Submitted 22 March, 2022; v1 submitted 11 October, 2020; originally announced October 2020.

    MSC Class: 44A12; 65R10; 92C55; 68U05; 15A04

  17. arXiv:2007.13977  [pdf, other]

    math.NA cs.LG

    Depth separation for reduced deep networks in nonlinear model reduction: Distilling shock waves in nonlinear hyperbolic problems

    Authors: Donsub Rim, Luca Venturi, Joan Bruna, Benjamin Peherstorfer

    Abstract: Classical reduced models are low-rank approximations using a fixed basis designed to achieve dimensionality reduction of large-scale systems. In this work, we introduce reduced deep networks, a generalization of classical reduced models formulated as deep neural networks. We prove depth separation results showing that reduced deep networks approximate solutions of parametrized hyperbolic partial d…

    Submitted 27 July, 2020; originally announced July 2020.

    MSC Class: 68T07; 65M22; 41A46

  18. arXiv:1912.13024  [pdf, other]

    math.NA

    Manifold Approximations via Transported Subspaces: Model reduction for transport-dominated problems

    Authors: Donsub Rim, Benjamin Peherstorfer, Kyle T. Mandli

    Abstract: This work presents a method for constructing online-efficient reduced models of large-scale systems governed by parametrized nonlinear scalar conservation laws. The solution manifolds induced by transport-dominated problems such as hyperbolic conservation laws typically exhibit nonlinear structures, which means that traditional model reduction methods based on linear approximations are inefficient…

    Submitted 30 December, 2020; v1 submitted 30 December, 2019; originally announced December 2019.

    MSC Class: 78M34; 41A46; 35F20; 78M12

  19. arXiv:1908.00887  [pdf, ps, other]

    math.NA cs.CC cs.CV

    Exact and fast inversion of the approximate discrete Radon transform from partial data

    Authors: Donsub Rim

    Abstract: We give an exact inversion formula for the approximate discrete Radon transform introduced in [Brady, SIAM J. Comput., 27(1), 107--119] that is of cost $O(N \log N)$ for a square 2D image with $N$ pixels and requires only partial data.

    Submitted 18 May, 2020; v1 submitted 2 August, 2019; originally announced August 2019.

    Comments: 4 pages, 1 figure

    MSC Class: 44A12; 65R10; 65F05; 65Q30

    Journal ref: Appl. Math. Lett. 102 106159 (2020)

  20. arXiv:1901.09893  [pdf, ps, other]

    nlin.CD

    A simple electronic device to experiment with the Hopf bifurcation

    Authors: Daniela N. Rim, Pablo Cremades, Pablo Kaluza

    Abstract: We present a simple low-cost electronic circuit that is able to show two different dynamical regimes: one with oscillating voltages and one with constant voltages. This device is designed as a negative feedback three-node network inspired by the genetic repressilator. The circuit's behavior is modeled by a system of differential equations which is studied in several different ways by applying th…

    Submitted 26 January, 2019; originally announced January 2019.

    Journal ref: Rev. Mex. de Física E 65 (2019) 58-63

  21. arXiv:1805.05938  [pdf, other]

    math.NA

    Model reduction of a parametrized scalar hyperbolic conservation law using displacement interpolation

    Authors: Donsub Rim, Kyle T. Mandli

    Abstract: We propose a model reduction technique for parametrized partial differential equations arising from scalar hyperbolic conservation laws. The key idea of the technique is to construct basis functions that are local in parameter and time space via displacement interpolation. The construction is motivated by the observation that the derivative of solutions to hyperbolic conservation laws satisfies a co…

    Submitted 15 May, 2018; originally announced May 2018.

  22. Displacement interpolation using monotone rearrangement

    Authors: Donsub Rim, Kyle T. Mandli

    Abstract: When approximating a function that depends on a parameter, one encounters many practical examples where linear interpolation or linear approximation with respect to the parameters proves ineffective. This is particularly true for responses from hyperbolic partial differential equations (PDEs) where linear, low-dimensional bases are difficult to construct. We propose the use of displacement interpol…

    Submitted 3 September, 2018; v1 submitted 11 December, 2017; originally announced December 2017.

    MSC Class: 65D05; 65D15; 65K10

    Journal ref: SIAM/ASA J. Uncertainty Quantification, 6(4) (2018) 1503-1531

  23. Imaging of isotropic and anisotropic conductivities from power densities in three dimensions

    Authors: François Monard, Donsub Rim

    Abstract: We present numerical reconstructions of anisotropic conductivity tensors in three dimensions, from knowledge of a finite family of power density functionals. Such a problem arises in the coupled-physics imaging modality Ultrasound Modulated Electrical Impedance Tomography for instance. We improve on the algorithms previously derived in [Bal et al, Inverse Probl Imaging (2013), pp.353-375, Monard a…

    Submitted 19 March, 2018; v1 submitted 8 November, 2017; originally announced November 2017.

    MSC Class: 65M32; 35R30; 35J15

  24. Dimensional splitting of hyperbolic partial differential equations using the Radon transform

    Authors: Donsub Rim

    Abstract: We introduce a dimensional splitting method based on the intertwining property of the Radon transform, with a particular focus on its applications related to hyperbolic partial differential equations (PDEs). This dimensional splitting has remarkable properties that make it useful in a variety of contexts, including multi-dimensional extension of large time-step (LTS) methods, absorbing boundary c…

    Submitted 6 December, 2018; v1 submitted 10 May, 2017; originally announced May 2017.

    Comments: 25 pages

    MSC Class: 65N08; 35L60; 35L65; 65R32

    Journal ref: SIAM J. Sci. Comput., 40(6) (2018), A4184-A4207

  25. arXiv:1701.07529  [pdf, other]

    math.NA

    Transport reversal for model reduction of hyperbolic partial differential equations

    Authors: Donsub Rim, Scott Moe, Randall J. LeVeque

    Abstract: Snapshot matrices built from solutions to hyperbolic partial differential equations exhibit slow decay in singular values, whereas fast decay is crucial for the success of projection-based model reduction methods. To overcome this problem, we build on previous work in symmetry reduction [Rowley and Marsden, Physica D (2000), pp. 1-19] and propose an iterative algorithm that decomposes the snapsho…

    Submitted 25 January, 2017; originally announced January 2017.

  26. Generating Random Earthquake Events for PTHA

    Authors: Randall J. LeVeque, Knut Waagan, Frank I. González, Donsub Rim, Guang Lin

    Abstract: In order to perform probabilistic tsunami hazard assessment (PTHA) based on subduction zone earthquakes, it is necessary to start with a catalog of possible future events along with the annual probability of occurrence, or a probability distribution of such events that can be easily sampled. For nearfield events, the distribution of slip on the fault can have a significant effect on the resulting t…

    Submitted 10 May, 2016; originally announced May 2016.

    Comments: 24 pages, 12 figures, code provided at https://github.com/rjleveque/KLslip-paper

    MSC Class: 86-08; 65

    Journal ref: SIAM/ASA J. Uncertainty Quantification, 6(1), (2018) 118-150

  27. arXiv:1512.08212  [pdf, other]

    cs.CV

    Improving Facial Analysis and Performance Driven Animation through Disentangling Identity and Expression

    Authors: David Rim, Sina Honari, Md Kamrul Hasan, Chris Pal

    Abstract: We present techniques for improving performance driven facial animation, emotion recognition, and facial key-point or landmark prediction using learned identity invariant representations. Established approaches to these problems can work well if sufficient examples and labels for a particular identity are available and factors of variation are highly controlled. However, labeled examples of facial…

    Submitted 22 May, 2016; v1 submitted 27 December, 2015; originally announced December 2015.

    Comments: to appear in Image and Vision Computing Journal (IMAVIS)

  28. arXiv:1505.04240  [pdf, ps, other]

    math.HO math.DS

    An Elementary Proof That Symplectic Matrices Have Determinant One

    Authors: Donsub Rim

    Abstract: We give one more proof of the fact that symplectic matrices over real and complex fields have determinant one. While this has already been proved many times, there has been lasting interest in finding an elementary proof. Our result is restricted to the real and complex case due to its reliance on field-dependent spectral theory; however, in this setting we obtain a proof which is more elementary i…

    Submitted 23 March, 2018; v1 submitted 15 May, 2015; originally announced May 2015.

    MSC Class: 15A15; 15A42; 37J10

    Journal ref: Adv. Dyn. Syst. Appl. (2017) 12 (1) 15-20