Showing 1–15 of 15 results for author: Oh, H

  1. arXiv:2407.00888  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Papez: Resource-Efficient Speech Separation with Auditory Working Memory

    Authors: Hyunseok Oh, Juheon Yi, Youngki Lee

    Abstract: Transformer-based models recently reached state-of-the-art single-channel speech separation accuracy; however, their extreme computational load makes it difficult to deploy them in resource-constrained mobile or IoT devices. We thus present Papez, a lightweight and computation-efficient single-channel speech separation model. Papez is based on three key techniques. We first replace the inter-chunk…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 5 pages. Accepted by ICASSP 2023

  2. arXiv:2406.07803  [pdf, other]

    cs.SD cs.AI eess.AS

    EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech

    Authors: Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Sang-Hoon Lee, Seong-Whan Lee

    Abstract: Despite rapid advances in the field of emotional text-to-speech (TTS), recent studies primarily focus on mimicking the average style of a particular emotion. As a result, the ability to manipulate speech emotion remains constrained to several predefined labels, compromising the ability to reflect the nuanced variations of emotion. In this paper, we propose EmoSphere-TTS, which synthesizes expressi…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted at INTERSPEECH 2024

  3. arXiv:2404.04096  [pdf, other]

    cs.IT eess.SP

    Machine Learning-Aided Cooperative Localization under Dense Urban Environment

    Authors: Hoon Lee, Hong Ki Kim, Seung Hyun Oh, Sang Hyun Lee

    Abstract: Future wireless network technology provides automobiles with the connectivity feature to consolidate the concept of vehicular networks that collaborate on conducting cooperative driving tasks. The full potential of connected vehicles, which promises road safety and quality driving experience, can be leveraged if machine learning models guarantee the robustness in performing core functions includin…

    Submitted 5 April, 2024; originally announced April 2024.

  4. arXiv:2401.08095  [pdf, other]

    cs.SD cs.AI eess.AS

    DurFlex-EVC: Duration-Flexible Emotional Voice Conversion with Parallel Generation

    Authors: Hyung-Seok Oh, Sang-Hoon Lee, Deok-Hyeon Cho, Seong-Whan Lee

    Abstract: Emotional voice conversion (EVC) seeks to modify the emotional tone of a speaker's voice while preserving the original linguistic content and the speaker's unique vocal characteristics. Recent advancements in EVC have involved the simultaneous modeling of pitch and duration, utilizing the potential of sequence-to-sequence (seq2seq) models. To enhance reliability and efficiency in conversion, this…

    Submitted 7 March, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 13 pages, 9 figures, 8 tables

  5. arXiv:2401.06913  [pdf, other]

    cs.SD cs.LG cs.MM eess.AS

    Microphone Conversion: Mitigating Device Variability in Sound Event Classification

    Authors: Myeonghoon Ryu, Hongseok Oh, Suji Lee, Han Park

    Abstract: In this study, we introduce a new augmentation technique to enhance the resilience of sound event classification (SEC) systems against device variability through the use of CycleGAN. We also present a unique dataset to evaluate this method. As SEC systems become increasingly common, it is crucial that they work well with audio from diverse recording devices. Our method addresses limited device div…

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Accepted to ICASSP 2024

  6. arXiv:2312.04382  [pdf, other]

    eess.IV cs.AI

    Adversarial Denoising Diffusion Model for Unsupervised Anomaly Detection

    Authors: Jongmin Yu, Hyeontaek Oh, Jinhong Yang

    Abstract: In this paper, we propose the Adversarial Denoising Diffusion Model (ADDM). The ADDM is based on the Denoising Diffusion Probabilistic Model (DDPM) but complementarily trained by adversarial learning. The proposed adversarial learning is achieved by classifying model-based denoised samples and samples to which random Gaussian noise is added to a specific sampling step. With the addition of explici…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted for the poster session of the DGM4H workshop at NeurIPS 2023

  7. arXiv:2308.05992  [pdf, other]

    cs.RO eess.SY

    Reachable Set-based Path Planning for Automated Vertical Parking System

    Authors: In Hyuk Oh, Ju Won Seo, Jin Sung Kim, Chung Choo Chung

    Abstract: This paper proposes a local path planning method with a reachable set for Automated vertical Parking Systems (APS). First, given a parking lot layout with a goal position, we define an intermediate pose for the APS to accomplish reverse parking with a single maneuver, i.e., without changing the gear shift. Then, we introduce a reachable set which is a set of points consisting of the grid points of…

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 8 pages, 10 figures, conference. This is the Accepted Manuscript version of an article accepted for publication in [IEEE International Conference on Intelligent Transportation Systems ITSC 2023]. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. No information about DOI has been posted yet

  8. arXiv:2307.16549  [pdf, other]

    cs.SD cs.CL eess.AS

    DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training

    Authors: Hyung-Seok Oh, Sang-Hoon Lee, Seong-Whan Lee

    Abstract: Expressive text-to-speech systems have undergone significant advancements owing to prosody modeling, but conventional methods can still be improved. Traditional approaches have relied on the autoregressive method to predict the quantized prosody vector; however, it suffers from the issues of long-term dependency and slow inference. This study proposes a novel approach called DiffProsody in which e…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 10 pages, 8 figures, 5 tables, under review

  9. arXiv:2307.16171  [pdf, other]

    cs.SD cs.AI cs.MM eess.AS

    HierVST: Hierarchical Adaptive Zero-shot Voice Style Transfer

    Authors: Sang-Hoon Lee, Ha-Yeong Choi, Hyung-Seok Oh, Seong-Whan Lee

    Abstract: Despite rapid progress in the voice style transfer (VST) field, recent zero-shot VST systems still lack the ability to transfer the voice style of a novel speaker. In this paper, we present HierVST, a hierarchical adaptive end-to-end zero-shot VST model. Without any text transcripts, we only use the speech dataset to train the model by utilizing hierarchical variational inference and self-supervis…

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: INTERSPEECH 2023 (Oral)

  10. arXiv:2208.07422  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV eess.SP

    Deep Unsupervised Domain Adaptation: A Review of Recent Advances and Perspectives

    Authors: Xiaofeng Liu, Chaehwa Yoo, Fangxu Xing, Hyejin Oh, Georges El Fakhri, Je-Won Kang, Jonghye Woo

    Abstract: Deep learning has become the method of choice to tackle real-world problems in different domains, partly because of its ability to learn from data and achieve impressive performance on a wide range of applications. However, its success usually relies on two assumptions: (i) vast troves of labeled datasets are required for accurate model fitting, and (ii) training and testing data are independent a…

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: APSIPA Transactions on Signal and Information Processing

  11. arXiv:2012.02753  [pdf, other]

    eess.SY

    Model-plant mismatch learning offset-free model predictive control

    Authors: Sang Hwan Son, Jong Woo Kim, Tae Hoon Oh, Jong Min Lee

    Abstract: We propose model-plant mismatch learning offset-free model predictive control (MPC), which learns and applies the intrinsic model-plant mismatch, to effectively exploit the advantages of model-based and data-driven control strategies and overcome the limitations of each approach. In this study, the model-plant mismatch map on steady-state manifold in the controlled variable space is approximated v…

    Submitted 13 December, 2020; v1 submitted 4 December, 2020; originally announced December 2020.

  12. arXiv:2006.00284  [pdf]

    eess.SY

    Unit Commitment Considering the Impact of Deep Cycling

    Authors: HyungSeon Oh

    Abstract: Wind energy has been integrated into the power system with the hope that it improves the energy efficiency and decreases greenhouse gas emission. However, several studies over the world imply that the result was in the opposite way that was hoped mainly because of the negative correlation between wind availability and load. Under the situation, coal power plants are forced to cycle while they are…

    Submitted 30 May, 2020; originally announced June 2020.

    Comments: 25 pages, 10 figures

  13. Analytical solution to swing equations in power grids

    Authors: HyungSeon Oh

    Abstract: Objective: To derive a closed-form analytical solution to the swing equation describing the power system dynamics, which is a nonlinear second order differential equation. Existing challenges: No analytical solution to the swing equation has been identified, due to the complex nature of power systems. Two major approaches are pursued for stability assessments on systems: (1) computationally simple…

    Submitted 25 November, 2019; originally announced November 2019.

    Comments: Corrected version of the published paper at PLoS ONE

    Journal ref: published, 2019

  14. arXiv:1904.03643  [pdf, other]

    eess.SP stat.ME

    Ensemble Patch Transformation: A New Tool for Signal Decomposition

    Authors: Donghoh Kim, Guebin Choi, Hee-Seok Oh

    Abstract: This paper considers the problem of signal decomposition and data visualization. For this purpose, we introduce a new multiscale transform, termed `ensemble patch transformation' that enhances identification of local characteristics embedded in a signal and provides multiscale visualization according to different levels; hence, it is useful for data analysis and signal decomposition. In literature…

    Submitted 7 April, 2019; originally announced April 2019.

    Comments: 32 pages with 24 figures

  15. arXiv:1706.00795  [pdf]

    eess.SY

    Situational Awareness with PMUs and SCADA

    Authors: HyungSeon Oh

    Abstract: Phasor measurement units (PMUs) are integrated to the transmission networks under the smart grid umbrella. The observability of PMUs is geographically limited due to their high cost in integration. The measurements of PMUs can be complemented by those from widely installed supervisory control and data acquisition (SCADA) to enhance the situational awareness. This paper proposes a new state estimat…

    Submitted 9 June, 2017; v1 submitted 2 June, 2017; originally announced June 2017.

    Comments: 8 pages