Showing 1–16 of 16 results for author: Son, S

Searching in archive eess.
  1. arXiv:2406.12721  [pdf]

    eess.AS cs.SD

    Sound event detection based on auxiliary decoder and maximum probability aggregation for DCASE Challenge 2024 Task 4

    Authors: Sang Won Son, Jongyeon Park, Hong Kook Kim, Sulaiman Vesal, Jeong Eun Lim

    Abstract: In this report, we propose three novel methods for developing a sound event detection (SED) model for the DCASE 2024 Challenge Task 4. First, we propose an auxiliary decoder attached to the final convolutional block to improve feature extraction capabilities while reducing dependency on embeddings from pre-trained large models. The proposed auxiliary decoder operates independently from the main de…

    Submitted 24 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: DCASE 2024 challenge Task4, 4 pages

  2. arXiv:2403.01119  [pdf, other]

    physics.optics eess.IV

    Quasi-calibration method for structured light system with auxiliary camera

    Authors: Seung-Jae Son, Yatong An, Jae-Sang Hyun

    Abstract: The structured light projection technique is a representative active method for 3-D reconstruction, but many researchers face challenges with the intricate projector calibration process. To address this complexity, we employ an additional camera, temporarily referred to as the auxiliary camera, to eliminate the need for projector calibration. The auxiliary camera aids in constructing rational mod…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 22 pages, 13 figures

  3. arXiv:2307.12751  [pdf, other]

    eess.IV cs.CV

    ICF-SRSR: Invertible scale-Conditional Function for Self-Supervised Real-world Single Image Super-Resolution

    Authors: Reyhaneh Neshatavar, Mohsen Yavartanoo, Sanghyun Son, Kyoung Mu Lee

    Abstract: Single image super-resolution (SISR) is a challenging ill-posed problem that aims to up-sample a given low-resolution (LR) image to a high-resolution (HR) counterpart. Due to the difficulty in obtaining real LR-HR training pairs, recent approaches are trained on simulated LR images degraded by simplified down-sampling operators, e.g., bicubic. Such an approach can be problematic in practice becaus…

    Submitted 31 August, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  4. arXiv:2306.06461  [pdf]

    eess.AS cs.SD

    Semi-supervised Learning-based Sound Event Detection using Frequency Dynamic Convolution with Large Kernel Attention for DCASE Challenge 2023 Task 4

    Authors: Ji Won Kim, Sang Won Son, Yoonah Song, Hong Kook Kim, Il Hoon Song, Jeong Eun Lim

    Abstract: This report proposes a frequency dynamic convolution (FDY) with a large kernel attention (LKA)-convolutional recurrent neural network (CRNN) with a pre-trained bidirectional encoder representation from audio transformers (BEATs) embedding-based sound event detection (SED) model that employs a mean-teacher and pseudo-label approach to address the challenge of limited labeled data for DCASE 2023 Tas…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: DCASE 2023 Challenge Task 4A, 5 pages

  5. arXiv:2305.15417  [pdf, other]

    eess.IV cs.CV cs.LG

    Entropy-Aware Similarity for Balanced Clustering: A Case Study with Melanoma Detection

    Authors: Seok Bin Son, Soohyun Park, Joongheon Kim

    Abstract: Clustering data is an unsupervised learning approach that aims to divide a set of data points into multiple groups. It is a crucial yet demanding subject in machine learning and data mining. Its successful applications span various fields. However, conventional clustering techniques necessitate the consideration of balance significance in specific applications. Therefore, this paper addresses the…

    Submitted 11 May, 2023; originally announced May 2023.

  6. Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

    Authors: Sangmin Bae, June-Woo Kim, Won-Yang Cho, Hyerim Baek, Soyoun Son, Byungjo Lee, Changwan Ha, Kyongpil Tae, Sungnyun Kim, Se-Young Yun

    Abstract: Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study,…

    Submitted 22 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: INTERSPEECH 2023, Code URL: https://github.com/raymin0223/patch-mix_contrastive_learning

  7. arXiv:2203.11799  [pdf, other]

    cs.CV eess.IV

    AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network

    Authors: Wooseok Lee, Sanghyun Son, Kyoung Mu Lee

    Abstract: Blind-spot network (BSN) and its variants have made significant advances in self-supervised denoising. Nevertheless, they are still bound to synthetic noisy inputs due to less practical assumptions like pixel-wise independent noise. Hence, it is challenging to deal with spatially correlated real-world noise using self-supervised BSN. Recently, pixel-shuffle downsampling (PD) has been proposed to r…

    Submitted 24 March, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR2022

  8. arXiv:2202.09533  [pdf, other]

    cs.CV eess.IV

    C2N: Practical Generative Noise Modeling for Real-World Denoising

    Authors: Geonwoon Jang, Wooseok Lee, Sanghyun Son, Kyoung Mu Lee

    Abstract: Learning-based image denoising methods have been bounded to situations where well-aligned noisy and clean images are given, or samples are synthesized from predetermined noise models, e.g., Gaussian. While recent generative noise modeling methods aim to simulate the unknown distribution of real-world noise, several limitations still exist. In a practical scenario, a noise generator should learn to…

    Submitted 19 February, 2022; originally announced February 2022.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 2350-2359

  9. Toward Real-World Super-Resolution via Adaptive Downsampling Models

    Authors: Sanghyun Son, Jaeha Kim, Wei-Sheng Lai, Ming-Hsuan Yang, Kyoung Mu Lee

    Abstract: Most image super-resolution (SR) methods are developed on synthetic low-resolution (LR) and high-resolution (HR) image pairs that are constructed by a predetermined operation, e.g., bicubic downsampling. As existing methods typically learn an inverse mapping of the specific function, they produce blurry results when applied to real-world images whose exact formulation is different and unknown. The…

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted at TPAMI

  10. arXiv:2105.07562  [pdf, other]

    physics.soc-ph cs.LG eess.SY

    Power-grid stability predictions using transferable machine learning

    Authors: Seong-Gyu Yang, Beom Jun Kim, Seung-Woo Son, Heetae Kim

    Abstract: Complex network analyses have provided clues to improve power-grid stability with the help of numerical models. The high computational cost of numerical simulations, however, has inhibited the approach, especially when it deals with the dynamic properties of power grids such as frequency synchronization. In this study, we investigate machine learning techniques to estimate the stability of power-g…

    Submitted 7 December, 2021; v1 submitted 16 May, 2021; originally announced May 2021.

    Comments: 10 pages, 6 figures, 4 tables

  11. arXiv:2104.12665  [pdf, other]

    eess.IV cs.CV

    Clean Images are Hard to Reblur: Exploiting the Ill-Posed Inverse Task for Dynamic Scene Deblurring

    Authors: Seungjun Nah, Sanghyun Son, Jaerin Lee, Kyoung Mu Lee

    Abstract: The goal of dynamic scene deblurring is to remove the motion blur in a given image. Typical learning-based approaches implement their solutions by minimizing the L1 or L2 distance between the output and the reference sharp image. Recent attempts adopt visual recognition features in training to improve the perceptual quality. However, those features are primarily designed to capture high-level cont…

    Submitted 2 April, 2022; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: ICLR 2022

  12. arXiv:2012.02859  [pdf, other]

    eess.SY

    Idle speed control with low-complexity offset-free explicit model predictive control in presence of system delay

    Authors: Sang Hwan Son, Se-Kyu Oh, Byung Jun Park, Min Jun Song, Jong Min Lee

    Abstract: The requirement for continual improvement of idle speed control (ISC) performance is increasing due to the stringent regulation on emission and fuel economy these days. In this regard, a low-complexity offset-free explicit model predictive control (EMPC) with constraint horizon is designed to regulate the idle speed under unmeasured disturbance in presence of system delay with rigorous formulation…

    Submitted 13 December, 2020; v1 submitted 4 December, 2020; originally announced December 2020.

  13. arXiv:2012.02753  [pdf, other]

    eess.SY

    Model-plant mismatch learning offset-free model predictive control

    Authors: Sang Hwan Son, Jong Woo Kim, Tae Hoon Oh, Jong Min Lee

    Abstract: We propose model-plant mismatch learning offset-free model predictive control (MPC), which learns and applies the intrinsic model-plant mismatch, to effectively exploit the advantages of model-based and data-driven control strategies and overcome the limitations of each approach. In this study, the model-plant mismatch map on steady-state manifold in the controlled variable space is approximated v…

    Submitted 13 December, 2020; v1 submitted 4 December, 2020; originally announced December 2020.

  14. arXiv:2010.07239  [pdf, other]

    eess.SY

    Handling plant-model mismatch in Koopman Lyapunov-based model predictive control via offset-free control framework

    Authors: Sang Hwan Son, Abhinav Narasingam, Joseph Sang-Il Kwon

    Abstract: Koopman operator theory enables a global linear representation of a given nonlinear dynamical system by transforming the nonlinear dynamics into a higher dimensional observable function space where the evolution of observable functions is governed by an infinite-dimensional linear operator. For practical application of Koopman operator theory, various data-driven methods have been developed to der…

    Submitted 14 October, 2020; originally announced October 2020.

  15. arXiv:1906.08629  [pdf, other]

    physics.soc-ph eess.SP

    On structural and dynamical factors determining the integrated basin instability of power-grid nodes

    Authors: Heetae Kim, Mi Jin Lee, Sang Hoon Lee, Seung-Woo Son

    Abstract: In electric power systems delivering alternating current, it is essential to maintain its synchrony of the phase with the rated frequency. The synchronization stability that quantifies how well the power-grid system recovers its synchrony against perturbation depends on various factors. As an intrinsic factor that we can design and control, the transmission capacity of the power grid affects the s…

    Submitted 22 October, 2019; v1 submitted 19 June, 2019; originally announced June 2019.

    Comments: 11 pages, 5 figures, 3 tables

    Journal ref: Chaos 29, 103132 (2019)

  16. arXiv:1510.04712  [pdf, other]

    eess.SY

    A Graph Theoretic Characterization of Perfect Attackability and Detection in Distributed Control Systems

    Authors: Sean Weerakkody, Xiaofei Liu, Sang H. Son, Bruno Sinopoli

    Abstract: This paper is concerned with the analysis and design of secure Distributed Control Systems in the face of integrity attacks on sensors and controllers by external attackers or insiders. In general a DCS consists of many heterogenous components and agents including sensors, actuators, controllers. Due to its distributed nature, some agents may start misbehaving to disrupt the system. This paper fir…

    Submitted 15 October, 2015; originally announced October 2015.

    Comments: 8 pages, 1 figure