Skip to main content

Showing 1–18 of 18 results for author: Yun, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.17995  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    DMT-JEPA: Discriminative Masked Targets for Joint-Embedding Predictive Architecture

    Authors: Shentong Mo, Sukmin Yun

    Abstract: The joint-embedding predictive architecture (JEPA) recently has shown impressive results in extracting visual representations from unlabeled imagery under a masking strategy. However, we reveal its disadvantages, notably its insufficient understanding of local semantics. This deficiency originates from masked modeling in the embedding space, resulting in a reduction of discriminative power and can… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2401.13936  [pdf, ps, other

    eess.SY

    Learning-based sensing and computing decision for data freshness in edge computing-enabled networks

    Authors: Sinwoong Yun, Dongsun Kim, Chanwon Park, Jemin Lee

    Abstract: As the demand on artificial intelligence (AI)-based applications increases, the freshness of sensed data becomes crucial in the wireless sensor networks. Since those applications require a large amount of computation for processing the sensed data, it is essential to offload the computation load to the edge computing (EC) server. In this paper, we propose the sensing and computing decision (SCD) a… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 15 pages

  3. arXiv:2311.00822  [pdf, other

    eess.SY cs.RO

    Synthesis and verification of robust-adaptive safe controllers

    Authors: Simin Liu, Kai S. Yun, John M. Dolan, Changliu Liu

    Abstract: Safe control with guarantees generally requires the system model to be known. It is far more challenging to handle systems with uncertain parameters. In this paper, we propose a generic algorithm that can synthesize and verify safe controllers for systems with constant, unknown parameters. In particular, we use robust-adaptive control barrier functions (raCBFs) to achieve safety. We develop new th… ▽ More

    Submitted 2 April, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: First 2 authors contributed equally

  4. Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

    Authors: Sangmin Bae, June-Woo Kim, Won-Yang Cho, Hyerim Baek, Soyoun Son, Byungjo Lee, Changwan Ha, Kyongpil Tae, Sungnyun Kim, Se-Young Yun

    Abstract: Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study,… ▽ More

    Submitted 22 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: INTERSPEECH 2023, Code URL: https://github.com/raymin0223/patch-mix_contrastive_learning

  5. Recycle-and-Distill: Universal Compression Strategy for Transformer-based Speech SSL Models with Attention Map Reusing and Masking Distillation

    Authors: Kangwook Jang, Sungnyun Kim, Se-Young Yun, Hoirin Kim

    Abstract: Transformer-based speech self-supervised learning (SSL) models, such as HuBERT, show surprising performance in various speech processing tasks. However, huge number of parameters in speech SSL models necessitate the compression to a more compact model for wider usage in academia or small companies. In this study, we suggest to reuse attention maps across the Transformer layers, so as to remove key… ▽ More

    Submitted 26 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Proceedings of Interspeech 2023. Code URL: https://github.com/sungnyun/ARMHuBERT

  6. arXiv:2302.08779  [pdf, other

    math.OC cs.DC eess.SP

    On the convergence result of the gradient-push algorithm on directed graphs with constant stepsize

    Authors: Woocheol Choi, Doheon Kim, Seok-Bae Yun

    Abstract: Gradient-push algorithm has been widely used for decentralized optimization problems when the connectivity network is a direct graph. This paper shows that the gradient-push algorithm with stepsize $α>0$ converges exponentially fast to an $O(α)$-neighborhood of the optimizer under the assumption that each cost is smooth and the total cost is strongly convex. Numerical experiments are provided to s… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    MSC Class: 90C25; 68Q25

  7. arXiv:2211.13920  [pdf, other

    eess.SP

    Secure Power Control for Downlink Cell-Free Massive MIMO With Passive Eavesdroppers

    Authors: Junguk Park, Sangseok Yun, Jeongseok Ha

    Abstract: This work studies secure communications for a cell-free massive multiple-input multiple-output (CF-mMIMO) network which is attacked by multiple passive eavesdroppers overhearing communications between access points (APs) and users in the network. It will be revealed that the distributed APs in CF-mMIMO allows not only legitimate users but also eavesdroppers to reap the diversity gain, which seriou… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 5 pages, 3 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  8. arXiv:2206.13700  [pdf, other

    cs.SD cs.LG eess.AS

    Domain Agnostic Few-shot Learning for Speaker Verification

    Authors: Seunghan Yang, Debasmit Das, Janghoon Cho, Hyoungwoo Park, Sungrack Yun

    Abstract: Deep learning models for verification systems often fail to generalize to new users and new environments, even though they learn highly discriminative features. To address this problem, we propose a few-shot domain generalization framework that learns to tackle distribution shift for new users and new domains. Our framework consists of domain-specific and domain-aggregation networks, which are the… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Proceedings of INTERSPEECH 2022

  9. arXiv:2206.07651  [pdf

    eess.SP

    Fault Diagnosis of Inter-turn Short Circuit in Permanent Magnet Synchronous Motors with Current Signal Imaging and Unsupervised Learning

    Authors: W. Jung, S. H. Yun, Y. S. Lim, S. Cheong, J. Bae, Y. H. Park

    Abstract: This paper proposes machine-independent feature engineering for winding inter-turn short circuit fault that uses electrical current signals. Electrical current signal collected from permanent magnet synchronous motor (PMSM) is subjected to different environmental and operational conditions. To solve these problems, robust current signal imaging method and deep learning-based feature extraction met… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: submitted to IECON 2022

  10. arXiv:2202.03571  [pdf, other

    eess.IV cs.CV

    Metal Artifact Reduction with Intra-Oral Scan Data for 3D Low Dose Maxillofacial CBCT Modeling

    Authors: Chang Min Hyun, Taigyntuya Bayaraa, Hye Sun Yun, Tae Jun Jang, Hyoung Suk Park, ** Keun Seo

    Abstract: Low-dose dental cone beam computed tomography (CBCT) has been increasingly used for maxillofacial modeling. However, the presence of metallic inserts, such as implants, crowns, and dental filling, causes severe streaking and shading artifacts in a CBCT image and loss of the morphological structures of the teeth, which consequently prevents accurate segmentation of bones. A two-stage metal artifact… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

  11. Fully automatic integration of dental CBCT images and full-arch intraoral impressions with stitching error correction via individual tooth segmentation and identification

    Authors: Tae Jun Jang, Hye Sun Yun, Chang Min Hyun, Jong-Eun Kim, Sang-Hwy Lee, ** Keun Seo

    Abstract: We present a fully automated method of integrating intraoral scan (IOS) and dental cone-beam computerized tomography (CBCT) images into one image by complementing each image's weaknesses. Dental CBCT alone may not be able to delineate precise details of the tooth surface due to limited image resolution and various CBCT artifacts, including metal-induced artifacts. IOS is very accurate for the scan… ▽ More

    Submitted 2 March, 2023; v1 submitted 3 December, 2021; originally announced December 2021.

  12. arXiv:2104.11849  [pdf, other

    cs.CV cs.LG eess.IV

    Do All MobileNets Quantize Poorly? Gaining Insights into the Effect of Quantization on Depthwise Separable Convolutional Networks Through the Eyes of Multi-scale Distributional Dynamics

    Authors: Stone Yun, Alexander Wong

    Abstract: As the "Mobile AI" revolution continues to grow, so does the need to understand the behaviour of edge-deployed deep neural networks. In particular, MobileNets are the go-to family of deep convolutional neural networks (CNN) for mobile. However, they often have significant accuracy degradation under post-training quantization. While studies have introduced quantization-aware training and other meth… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in Mobile AI (MAI) Workshop 2021 at CVPR

  13. arXiv:2101.05205  [pdf, other

    cs.CV eess.IV

    Automated 3D cephalometric landmark identification using computerized tomography

    Authors: Hye Sun Yun, Chang Min Hyun, Seong Hyeon Baek, Sang-Hwy Lee, ** Keun Seo

    Abstract: Identification of 3D cephalometric landmarks that serve as proxy to the shape of human skull is the fundamental step in cephalometric analysis. Since manual landmarking from 3D computed tomography (CT) images is a cumbersome task even for the trained experts, automatic 3D landmark detection system is in a great need. Recently, automatic landmarking of 2D cephalograms using deep learning (DL) has a… ▽ More

    Submitted 16 December, 2020; originally announced January 2021.

  14. arXiv:1910.06790  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Weakly Labeled Sound Event Detection Using Tri-training and Adversarial Learning

    Authors: Hyoungwoo Park, Sungrack Yun, Jungyun Eum, Janghoon Cho, Kyuwoong Hwang

    Abstract: This paper considers a semi-supervised learning framework for weakly labeled polyphonic sound event detection problems for the DCASE 2019 challenge's task4 by combining both the tri-training and adversarial learning. The goal of the task4 is to detect onsets and offsets of multiple sound events in a single audio clip. The entire dataset consists of the synthetic data with a strong label (sound eve… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: 5 pages, DCASE 2019 Workshop

  15. arXiv:1910.06784  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Acoustic Scene Classification Based on a Large-margin Factorized CNN

    Authors: Janghoon Cho, Sungrack Yun, Hyoungwoo Park, Jungyun Eum, Kyuwoong Hwang

    Abstract: In this paper, we present an acoustic scene classification framework based on a large-margin factorized convolutional neural network (CNN). We adopt the factorized CNN to learn the patterns in the time-frequency domain by factorizing the 2D kernel into two separate 1D kernels. The factorized kernel leads to learn the main component of two patterns: the long-term ambient and short-term event sounds… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: 5 pages, DCASE 2019 Workshop

  16. arXiv:1908.02612  [pdf, ps, other

    eess.AS cs.LG cs.SD stat.ML

    An End-to-End Text-independent Speaker Verification Framework with a Keyword Adversarial Network

    Authors: Sungrack Yun, Janghoon Cho, Jungyun Eum, Wonil Chang, Kyuwoong Hwang

    Abstract: This paper presents an end-to-end text-independent speaker verification framework by jointly considering the speaker embedding (SE) network and automatic speech recognition (ASR) network. The SE network learns to output an embedding vector which distinguishes the speaker characteristics of the input utterance, while the ASR network learns to recognize the phonetic context of the input. In training… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

    Comments: Will be appeared in INTERSPEECH 2019

  17. arXiv:1906.06579  [pdf, other

    cs.CV eess.IV

    EXTD: Extremely Tiny Face Detector via Iterative Filter Reuse

    Authors: YoungJoon Yoo, Dongyoon Han, Sangdoo Yun

    Abstract: In this paper, we propose a new multi-scale face detector having an extremely tiny number of parameters (EXTD),less than 0.1 million, as well as achieving comparable performance to deep heavy detectors. While existing multi-scale face detectors extract feature maps with different scales from a single backbone network, our method generates the feature maps by iteratively reusing a shared lightweigh… ▽ More

    Submitted 23 June, 2019; v1 submitted 15 June, 2019; originally announced June 2019.

  18. arXiv:1810.11520  [pdf, other

    cs.SD cs.LG eess.AS eess.SP stat.ML

    Spectrogram-channels u-net: a source separation model viewing each channel as the spectrogram of each source

    Authors: Jaehoon Oh, Duyeon Kim, Se-Young Yun

    Abstract: Sound source separation has attracted attention from Music Information Retrieval(MIR) researchers, since it is related to many MIR tasks such as automatic lyric transcription, singer identification, and voice conversion. In this paper, we propose an intuitive spectrogram-based model for source separation by adapting U-Net. We call it Spectrogram-Channels U-Net, which means each channel of the outp… ▽ More

    Submitted 30 October, 2018; v1 submitted 26 October, 2018; originally announced October 2018.

    Comments: 3 figures