Skip to main content

Showing 1–28 of 28 results for author: Chung, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19622  [pdf, other

    cs.LG cs.AI

    Data-Driven Lipschitz Continuity: A Cost-Effective Approach to Improve Adversarial Robustness

    Authors: Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung, Che-Rung Lee

    Abstract: The security and robustness of deep neural networks (DNNs) have become increasingly concerning. This paper aims to provide both a theoretical foundation and a practical solution to ensure the reliability of DNNs. We explore the concept of Lipschitz continuity to certify the robustness of DNNs against adversarial attacks, which aim to mislead the network with adding imperceptible perturbations into… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2404.15881  [pdf, other

    cs.CV cs.AI

    Steal Now and Attack Later: Evaluating Robustness of Object Detection against Black-box Adversarial Attacks

    Authors: Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung, Che-Rung Lee

    Abstract: Latency attacks against object detection represent a variant of adversarial attacks that aim to inflate the inference time by generating additional ghost objects in a target image. However, generating ghost objects in the black-box scenario remains a challenge since information about these unqualified objects remains opaque. In this study, we demonstrate the feasibility of generating ghost objects… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  3. arXiv:2403.01344  [pdf, other

    cs.LG cs.CV

    Mitigating the Bias in the Model for Continual Test-Time Adaptation

    Authors: Inseop Chung, Kyomin Hwang, Jayeon Yoo, Nojun Kwak

    Abstract: Continual Test-Time Adaptation (CTA) is a challenging task that aims to adapt a source pre-trained model to continually changing target domains. In the CTA setting, a model does not know when the target domain changes, thus facing a drastic change in the distribution of streaming inputs during the test-time. The key challenge is to keep adapting the model to the continually changing target domains… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  4. arXiv:2312.08875  [pdf, other

    cs.CV

    What, How, and When Should Object Detectors Update in Continually Changing Test Domains?

    Authors: Jayeon Yoo, Dongkwan Lee, Inseop Chung, Donghyun Kim, Nojun Kwak

    Abstract: It is a well-known fact that the performance of deep learning models deteriorates when they encounter a distribution shift at test time. Test-time adaptation (TTA) algorithms have been proposed to adapt the model online while inferring test data. However, existing research predominantly focuses on classification tasks through the optimization of batch normalization layers or classification heads,… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  5. arXiv:2312.05141  [pdf, other

    cs.CV

    Open Domain Generalization with a Single Network by Regularization Exploiting Pre-trained Features

    Authors: Inseop Chung, KiYoon Yoo, Nojun Kwak

    Abstract: Open Domain Generalization (ODG) is a challenging task as it not only deals with distribution shifts but also category shifts between the source and target datasets. To handle this task, the model has to learn a generalizable representation that can be applied to unseen domains while also identify unknown classes that were not present during training. Previous work has used multiple source-specifi… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  6. arXiv:2307.03760  [pdf, other

    cs.DC

    CODAG: Characterizing and Optimizing Decompression Algorithms for GPUs

    Authors: Jeongmin Park, Zaid Qureshi, Vikram Mailthody, Andrew Gacek, Shunfan Shao, Mohammad AlMasri, Isaac Gelado, **jun Xiong, Chris Newburn, I-hsin Chung, Michael Garland, Nikolay Sakharnykh, Wen-mei Hwu

    Abstract: Data compression and decompression have become vital components of big-data applications to manage the exponential growth in the amount of data collected and stored. Furthermore, big-data applications have increasingly adopted GPUs due to their high compute throughput and memory bandwidth. Prior works presume that decompression is memory-bound and have dedicated most of the GPU's threads to data m… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  7. arXiv:2304.05370  [pdf, other

    cs.CV

    Overload: Latency Attacks on Object Detection for Edge Devices

    Authors: Erh-Chung Chen, Pin-Yu Chen, I-Hsin Chung, Che-rung Lee

    Abstract: Nowadays, the deployment of deep learning-based applications is an essential task owing to the increasing demands on intelligent services. In this paper, we investigate latency attacks on deep learning applications. Unlike common adversarial attacks for misclassification, the goal of latency attacks is to increase the inference time, which may stop applications from responding to the requests with… ▽ More

    Submitted 26 April, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

  8. arXiv:2303.15110  [pdf, other

    cs.CL cs.AI

    Beyond Toxic: Toxicity Detection Datasets are Not Enough for Brand Safety

    Authors: Elizaveta Korotkova, Isaac Kwan Yin Chung

    Abstract: The rapid growth in user generated content on social media has resulted in a significant rise in demand for automated content moderation. Various methods and frameworks have been proposed for the tasks of hate speech detection and toxic comment classification. In this work, we combine common datasets to extend these tasks to brand safety. Brand safety aims to protect commercial branding by identif… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  9. arXiv:2212.14574  [pdf, other

    cs.RO

    X-MAS: Extremely Large-Scale Multi-Modal Sensor Dataset for Outdoor Surveillance in Real Environments

    Authors: DongKi Noh, Changki Sung, Teayoung Uhm, WooJu Lee, Hyungtae Lim, Jaeseok Choi, Kyuewang Lee, Dasol Hong, Daeho Um, Inseop Chung, Hochul Shin, MinJung Kim, Hyoung-Rock Kim, SeungMin Baek, Hyun Myung

    Abstract: In robotics and computer vision communities, extensive studies have been widely conducted regarding surveillance tasks, including human detection, tracking, and motion recognition with a camera. Additionally, deep learning algorithms are widely utilized in the aforementioned tasks as in other computer vision tasks. Existing public datasets are insufficient to develop learning-based methods that ha… ▽ More

    Submitted 30 December, 2022; originally announced December 2022.

    Comments: 8 pages, 13 figures, IEEE Robotics and Automation Letters

  10. arXiv:2207.09656  [pdf, other

    cs.CV

    Unsupervised Domain Adaptation for One-stage Object Detector using Offsets to Bounding Box

    Authors: Jayeon Yoo, Inseop Chung, Nojun Kwak

    Abstract: Most existing domain adaptive object detection methods exploit adversarial feature alignment to adapt the model to a new domain. Recent advances in adversarial feature alignment strives to reduce the negative effect of alignment, or negative transfer, that occurs because the distribution of features varies depending on the category of objects. However, by analyzing the features of the anchor-free… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: ECCV 2022, 24 pages

  11. arXiv:2206.13708  [pdf, other

    cs.SD cs.LG eess.AS

    Personalized Keyword Spotting through Multi-task Learning

    Authors: Seunghan Yang, Byeonggeun Kim, Inseop Chung, Simyung Chang

    Abstract: Keyword spotting (KWS) plays an essential role in enabling speech-based user interaction on smart devices, and conventional KWS (C-KWS) approaches have concentrated on detecting user-agnostic pre-defined keywords. However, in practice, most user interactions come from target users enrolled in the device which motivates to construct personalized keyword spotting. We design two personalized KWS task… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Proceedings of INTERSPEECH 2022

  12. arXiv:2206.13691  [pdf, other

    cs.SD cs.LG eess.AS

    Dummy Prototypical Networks for Few-Shot Open-Set Keyword Spotting

    Authors: Byeonggeun Kim, Seunghan Yang, Inseop Chung, Simyung Chang

    Abstract: Keyword spotting is the task of detecting a keyword in streaming audio. Conventional keyword spotting targets predefined keywords classification, but there is growing attention in few-shot (query-by-example) keyword spotting, e.g., N-way classification given M-shot support samples. Moreover, in real-world scenarios, there can be utterances from unexpected categories (open-set) which need to be rej… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Proceedings of INTERSPEECH 2022

  13. arXiv:2205.08714  [pdf, other

    cs.CV

    End-to-End Multi-Object Detection with a Regularized Mixture Model

    Authors: Jaeyoung Yoo, Hojun Lee, Seunghyeon Seo, Inseop Chung, Nojun Kwak

    Abstract: Recent end-to-end multi-object detectors simplify the inference pipeline by removing hand-crafted processes such as non-maximum suppression (NMS). However, during training, they still heavily rely on heuristics and hand-crafted processes which deteriorate the reliability of the predicted confidence score. In this paper, we propose a novel framework to train an end-to-end multi-object detector cons… ▽ More

    Submitted 28 April, 2023; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: Accepted at ICML 2023

  14. arXiv:2204.11593  [pdf, other

    cs.IR cs.CV

    Scaling Cross-Domain Content-Based Image Retrieval for E-commerce Snap and Search Application

    Authors: Isaac Kwan Yin Chung, Minh Tran, Eran Nussinovitch

    Abstract: In this industry talk at ECIR 2022, we illustrate how we approach the main challenges from large scale cross-domain content-based image retrieval using a cascade method and a combination of our visual search and classification capabilities. Specifically, we present a system that is able to handle the scale of the data for e-commerce usage and the cross-domain nature of the query and gallery image… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: ECIR 2022 Industry Day

  15. arXiv:2203.04910  [pdf, other

    cs.DC cs.AR cs.OS cs.PF

    GPU-Initiated On-Demand High-Throughput Storage Access in the BaM System Architecture

    Authors: Zaid Qureshi, Vikram Sharma Mailthody, Isaac Gelado, Seung Won Min, Amna Masood, Jeongmin Park, **jun Xiong, CJ Newburn, Dmitri Vainbrand, I-Hsin Chung, Michael Garland, William Dally, Wen-mei Hwu

    Abstract: Graphics Processing Units (GPUs) have traditionally relied on the host CPU to initiate access to the data storage. This approach is well-suited for GPU applications with known data access patterns that enable partitioning of their dataset to be processed in a pipelined fashion in the GPU. However, emerging applications such as graph and data analytics, recommender systems, or graph neural networks… ▽ More

    Submitted 6 February, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: This is an extension to the published conference paper at ASPLOS'23: https://dl.acm.org/doi/abs/10.1145/3575693.3575748

    Journal ref: ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2

  16. arXiv:2112.06536  [pdf, other

    cs.CV

    SphereSR: 360° Image Super-Resolution with Arbitrary Projection via Continuous Spherical Image Representation

    Authors: Youngho Yoon, Inchul Chung, Lin Wang, Kuk-** Yoon

    Abstract: The 360°imaging has recently gained great attention; however, its angular resolution is relatively lower than that of a narrow field-of-view (FOV) perspective image as it is captured by using fisheye lenses with the same sensor size. Therefore, it is beneficial to super-resolve a 360°image. Some attempts have been made but mostly considered the equirectangular projection (ERP) as one of the way fo… ▽ More

    Submitted 13 December, 2021; v1 submitted 13 December, 2021; originally announced December 2021.

  17. arXiv:2110.10916  [pdf, other

    cs.CV

    Exploiting Inter-pixel Correlations in Unsupervised Domain Adaptation for Semantic Segmentation

    Authors: Inseop Chung, Jayeon Yoo, Nojun Kwak

    Abstract: "Self-training" has become a dominant method for semantic segmentation via unsupervised domain adaptation (UDA). It creates a set of pseudo labels for the target domain to give explicit supervision. However, the pseudo labels are noisy, sparse and do not provide any information about inter-pixel correlations. We regard inter-pixel correlation quite important because semantic segmentation is a task… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

  18. arXiv:2110.09646  [pdf, other

    cs.CL cs.AI cs.LG

    Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement

    Authors: HyoJung Han, Seokchan Ahn, Yoonjung Choi, Insoo Chung, Sangha Kim, Kyunghyun Cho

    Abstract: Recent work in simultaneous machine translation is often trained with conventional full sentence translation corpora, leading to either excessive latency or necessity to anticipate as-yet-unarrived words, when dealing with a language pair whose word orders significantly differ. This is unlike human simultaneous interpreters who produce largely monotonic translations at the expense of the grammatic… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: To be published in WMT2021

  19. arXiv:2102.13002  [pdf, other

    cs.CV

    Maximizing Cosine Similarity Between Spatial Features for Unsupervised Domain Adaptation in Semantic Segmentation

    Authors: Inseop Chung, Daesik Kim, Nojun Kwak

    Abstract: We propose a novel method that tackles the problem of unsupervised domain adaptation for semantic segmentation by maximizing the cosine similarity between the source and the target domain at the feature level. A segmentation network mainly consists of two parts, a feature extractor and a classification head. We expect that if we can make the two domains have small domain gap at the feature level,… ▽ More

    Submitted 17 March, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

  20. arXiv:2012.14363  [pdf, other

    cs.DC

    TEMPI: An Interposed MPI Library with a Canonical Representation of CUDA-aware Datatypes

    Authors: Carl Pearson, Kun Wu, I-Hsin Chung, **jun Xiong, Wen-Mei Hwu

    Abstract: MPI derived datatypes are an abstraction that simplifies handling of non-contiguous data in MPI applications. These datatypes are recursively constructed at runtime from primitive Named Types defined in the MPI standard. More recently, the development and deployment of CUDA-aware MPI implementations has encouraged the transition of distributed high-performance MPI codes to use GPUs. Such implement… ▽ More

    Submitted 20 April, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

    Comments: 12 pages

  21. arXiv:2009.07453  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Extremely Low Bit Transformer Quantization for On-Device Neural Machine Translation

    Authors: Insoo Chung, Byeongwook Kim, Yoonjung Choi, Se Jung Kwon, Yongkweon Jeon, Baeseong Park, Sangha Kim, Dongsoo Lee

    Abstract: The deployment of widely used Transformer architecture is challenging because of heavy computation load and memory overhead during inference, especially when the target device is limited in computational resources such as mobile or edge devices. Quantization is an effective technique to address such challenges. Our analysis shows that for a given number of quantization bits, each block of Transfor… ▽ More

    Submitted 13 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: Findings of EMNLP 2020

  22. arXiv:2008.10169  [pdf, other

    cs.AR cs.DC cs.PF

    Tearing Down the Memory Wall

    Authors: Zaid Qureshi, Vikram Sharma Mailthody, Seung Won Min, I-Hsin Chung, **jun Xiong, Wen-mei Hwu

    Abstract: We present a vision for the Erudite architecture that redefines the compute and memory abstractions such that memory bandwidth and capacity become first-class citizens along with compute throughput. In this architecture, we envision coupling a high-density, massively parallel memory technology like Flash with programmable near-data accelerators, like the streaming multiprocessors in modern GPUs. E… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

    Comments: SRC Techcon 2020 paper. Discusses vision of GPU-Centric architecture, Erudite

  23. arXiv:2002.11275  [pdf, other

    stat.ML cs.LG stat.CO

    Adversarial Monte Carlo Meta-Learning of Optimal Prediction Procedures

    Authors: Alex Luedtke, Incheoul Chung, Oleg Sofrygin

    Abstract: We frame the meta-learning of prediction procedures as a search for an optimal strategy in a two-player game. In this game, Nature selects a prior over distributions that generate labeled data consisting of features and an associated outcome, and the Predictor observes data sampled from a distribution drawn from this prior. The Predictor's objective is to learn a function that maps from a new feat… ▽ More

    Submitted 25 September, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    MSC Class: 62C20 ACM Class: G.3

  24. arXiv:2002.01775  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Feature-map-level Online Adversarial Knowledge Distillation

    Authors: Inseop Chung, SeongUk Park, Jangho Kim, Nojun Kwak

    Abstract: Feature maps contain rich information about image intensity and spatial correlation. However, previous online knowledge distillation methods only utilize the class probabilities. Thus in this paper, we propose an online knowledge distillation method that transfers not only the knowledge of the class probabilities but also that of the feature map using the adversarial training framework. We train m… ▽ More

    Submitted 5 June, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

  25. arXiv:1911.12721  [pdf, other

    cs.CV

    Training Multi-Object Detector by Estimating Bounding Box Distribution for Input Image

    Authors: Jaeyoung Yoo, Hojun Lee, Inseop Chung, Geonseok Seo, Nojun Kwak

    Abstract: In multi-object detection using neural networks, the fundamental problem is, "How should the network learn a variable number of bounding boxes in different input images?". Previous methods train a multi-object detection network through a procedure that directly assigns the ground truth bounding boxes to the specific locations of the network's output. However, this procedure makes the training of a… ▽ More

    Submitted 5 September, 2021; v1 submitted 28 November, 2019; originally announced November 2019.

    Comments: 10 pages, 7 figures

  26. arXiv:1911.04283  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning

    Authors: Sathish Indurthi, Houjeung Han, Nikhil Kumar Lakumarapu, Beomseok Lee, Insoo Chung, Sangha Kim, Chanwoo Kim

    Abstract: End-to-end Speech Translation (ST) models have several advantages such as lower latency, smaller model size, and less error compounding over conventional pipelines that combine Automatic Speech Recognition (ASR) and text Machine Translation (MT) models. However, collecting large amounts of parallel data for ST task is more difficult compared to the ASR and MT tasks. Previous studies have proposed… ▽ More

    Submitted 27 April, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: ICASSP 2020

  27. arXiv:1904.09058  [pdf, other

    cs.CV

    Feature Fusion for Online Mutual Knowledge Distillation

    Authors: Jangho Kim, Minsung Hyun, Inseop Chung, Nojun Kwak

    Abstract: We propose a learning framework named Feature Fusion Learning (FFL) that efficiently trains a powerful classifier through a fusion module which combines the feature maps generated from parallel neural networks. Specifically, we train a number of parallel neural networks as sub-networks, then we combine the feature maps from each sub-network using a fusion module to create a more meaningful feature… ▽ More

    Submitted 21 July, 2020; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: International Conference on Pattern Recognition

  28. arXiv:1806.01551  [pdf, other

    stat.ML cs.LG

    Deep Mixed Effect Model using Gaussian Processes: A Personalized and Reliable Prediction for Healthcare

    Authors: Ingyo Chung, Saehoon Kim, Juho Lee, Kwang Joon Kim, Sung Ju Hwang, Eunho Yang

    Abstract: We present a personalized and reliable prediction model for healthcare, which can provide individually tailored medical services such as diagnosis, disease treatment, and prevention. Our proposed framework targets at making personalized and reliable predictions from time-series data, such as Electronic Health Records (EHR), by modeling two complementary components: i) a shared component that captu… ▽ More

    Submitted 24 November, 2019; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: AAAI 2020