Skip to main content

Showing 1–16 of 16 results for author: Kim, J S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19135  [pdf, other

    eess.AS cs.AI

    DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability

    Authors: Hyun Joon Park, ** Sob Kim, Wooseok Shin, Sung Won Han

    Abstract: Expressive Text-to-Speech (TTS) using reference speech has been studied extensively to synthesize natural speech, but there are limitations to obtaining well-represented styles and improving model generalization ability. In this study, we present Diffusion-based EXpressive TTS (DEX-TTS), an acoustic model designed for reference-based speech synthesis with enhanced style representations. Based on a… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Preprint

  2. arXiv:2311.01908  [pdf, other

    eess.IV cs.CV

    LLM-driven Multimodal Target Volume Contouring in Radiation Oncology

    Authors: Yu** Oh, Sangjoon Park, Hwa Kyung Byun, Yeona Cho, Ik Jae Lee, ** Sung Kim, Jong Chul Ye

    Abstract: Target volume contouring for radiation therapy is considered significantly more challenging than the normal organ segmentation tasks as it necessitates the utilization of both image and text-based clinical information. Inspired by the recent advancement of large language models (LLMs) that can facilitate the integration of the textural information and images, here we present a novel LLM-driven mul… ▽ More

    Submitted 15 April, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  3. arXiv:2310.10214  [pdf, other

    eess.SY

    K-SMPC: Koopman Operator-Based Stochastic Model Predictive Control for Enhanced Lateral Control of Autonomous Vehicles

    Authors: ** Sung Kim, Ying Shuai Quan, Chung Choo Chung

    Abstract: This paper proposes Koopman operator-based Stochastic Model Predictive Control (K-SMPC) for enhanced lateral control of autonomous vehicles. The Koopman operator is a linear map representing the nonlinear dynamics in an infinite-dimensional space. Thus, we use the Koopman operator to represent the nonlinear dynamics of a vehicle in dynamic lane-kee** situations. The Extended Dynamic Mode Decompo… ▽ More

    Submitted 9 December, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: 13 pages, 12 figures

  4. arXiv:2309.09419  [pdf, other

    eess.SY

    Uncertainty Quantification of Autoencoder-based Koopman Operator

    Authors: ** Sung Kim, Ying Shuai Quan, Chung Choo Chung

    Abstract: This paper proposes a method for uncertainty quantification of an autoencoder-based Koopman operator. The main challenge of using the Koopman operator is to design the basis functions for lifting the state. To this end, this paper builds an autoencoder to automatically search the optimal lifting basis functions with a given loss function. We approximate the Koopman operator in a finite-dimensional… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: 6 pages, 3 figures

    Journal ref: 2024 American Control Conference

  5. arXiv:2309.08852  [pdf, other

    eess.SY

    RNN Controller for Lane-Kee** Systems with Robustness and Safety Verification

    Authors: Ying Shuai Quan, ** Sung Kim, Chung Choo Chung

    Abstract: This paper proposes a Recurrent Neural Network (RNN) controller for lane-kee** systems, effectively handling model uncertainties and disturbances. First, quadratic constraints cover the nonlinearities brought by the RNN controller, and the linear fractional transformation method models the dynamics of system uncertainties. Second, we prove the robust stability of the lane-kee** system in the p… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 7 pages, 6 figures

  6. arXiv:2309.06770  [pdf, other

    eess.IV eess.SY

    Deep Learning-based Synthetic High-Resolution In-Depth Imaging Using an Attachable Dual-element Endoscopic Ultrasound Probe

    Authors: Hah Min Lew, Jae Seong Kim, Moon Hwan Lee, Jaegeun Park, Sangyeon Youn, Hee Man Kim, Jihun Kim, Jae Youn Hwang

    Abstract: Endoscopic ultrasound (EUS) imaging has a trade-off between resolution and penetration depth. By considering the in-vivo characteristics of human organs, it is necessary to provide clinicians with appropriate hardware specifications for precise diagnosis. Recently, super-resolution (SR) ultrasound imaging studies, including the SR task in deep learning fields, have been reported for enhancing ultr… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 10 pages, 9 figures

  7. arXiv:2308.05992  [pdf, other

    cs.RO eess.SY

    Reachable Set-based Path Planning for Automated Vertical Parking System

    Authors: In Hyuk Oh, Ju Won Seo, ** Sung Kim, Chung Choo Chung

    Abstract: This paper proposes a local path planning method with a reachable set for Automated vertical Parking Systems (APS). First, given a parking lot layout with a goal position, we define an intermediate pose for the APS to accomplish reverse parking with a single maneuver, i.e., without changing the gear shift. Then, we introduce a reachable set which is a set of points consisting of the grid points of… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 8 pages, 10 figures, conference. This is the Accepted Manuscript version of an article accepted for publication in [IEEE International Conference on Intelligent Transportation Systems ITSC 2023]. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. No information about DOI has been posted yet

  8. arXiv:2308.05965  [pdf, other

    eess.IV

    Classification Method of Road Surface Condition and Type with LiDAR Using Spatiotemporal Information

    Authors: Ju Won Seo, ** Sung Kim, Chung Choo Chung

    Abstract: This paper proposes a spatiotemporal architecture with a deep neural network (DNN) for road surface conditions and types classification using LiDAR. It is known that LiDAR provides information on the reflectivity and number of point clouds depending on a road surface. Thus, this paper utilizes the information to classify the road surface. We divided the front road area into four subregions. First,… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 10 pages

    MSC Class: 68T40 Artificial intelligence for robotics

  9. arXiv:2303.15703  [pdf, other

    eess.AS

    AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection

    Authors: ** Sob Kim, Hyun Joon Park, Wooseok Shin, Sung Won Han

    Abstract: Sound event localization and detection (SELD) combines the identification of sound events with the corresponding directions of arrival (DOA). Recently, event-oriented track output formats have been adopted to solve this problem; however, they still have limited generalization toward real-world problems in an unknown polyphony environment. To address the issue, we proposed an angular-distance-based… ▽ More

    Submitted 10 May, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 5 pages, 3 figures, accepted for publication in IEEE ICASSP 2023

  10. arXiv:2303.09057  [pdf, other

    eess.AS cs.SD

    TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion

    Authors: Hyun Joon Park, Seok Woo Yang, ** Sob Kim, Wooseok Shin, Sung Won Han

    Abstract: Voice Conversion (VC) must be achieved while maintaining the content of the source speech and representing the characteristics of the target speaker. The existing methods do not simultaneously satisfy the above two aspects of VC, and their conversion outputs suffer from a trade-off problem between maintaining source contents and target characteristics. In this study, we propose Triple Adaptive Att… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: To appear in ICASSP 2023

  11. Multi-View Attention Transfer for Efficient Speech Enhancement

    Authors: Wooseok Shin, Hyun Joon Park, ** Sob Kim, Byung Hoon Lee, Sung Won Han

    Abstract: Recent deep learning models have achieved high performance in speech enhancement; however, it is still challenging to obtain a fast and low-complexity model without significant performance degradation. Previous knowledge distillation studies on speech enhancement could not solve this problem because their output distillation methods do not fit the speech enhancement task in some aspects. In this s… ▽ More

    Submitted 30 October, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: Proceedings of Interspeech 2022

  12. arXiv:2203.02181  [pdf, other

    eess.AS cs.SD eess.SP

    MANNER: Multi-view Attention Network for Noise Erasure

    Authors: Hyun Joon Park, Byung Ha Kang, Wooseok Shin, ** Sob Kim, Sung Won Han

    Abstract: In the field of speech enhancement, time domain methods have difficulties in achieving both high performance and efficiency. Recently, dual-path models have been adopted to represent long sequential features, but they still have limited representations and poor memory efficiency. In this study, we propose Multi-view Attention Network for Noise ERasure (MANNER) consisting of a convolutional encoder… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: To appear in ICASSP 2022

  13. arXiv:2105.00712  [pdf, ps, other

    eess.SY

    Robust Control for Lane Kee** System Using Linear Parameter Varying Approach with Scheduling Variables Reduction

    Authors: Ying Shuai Quan, ** Sung Kim, Chung Choo Chung

    Abstract: This paper presents a robust controller using a Linear Parameter Varying (LPV) model of the lane-kee** system with parameter reduction. Both varying vehicle speed and roll motion on a curved road influence the lateral vehicle model parameters, such as tire cornering stiffness. Thus, we use the LPV technique to take the parameter variations into account in vehicle dynamics. However, multiple vary… ▽ More

    Submitted 4 May, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: 7 pages, 7 figures

  14. arXiv:2104.11401  [pdf

    cs.LG cs.CV eess.IV

    Intentional Deep Overfit Learning (IDOL): A Novel Deep Learning Strategy for Adaptive Radiation Therapy

    Authors: Jaehee Chun, Justin C. Park, Sven Olberg, You Zhang, Dan Nguyen, **g Wang, ** Sung Kim, Steve Jiang

    Abstract: In this study, we propose a tailored DL framework for patient-specific performance that leverages the behavior of a model intentionally overfitted to a patient-specific training dataset augmented from the prior information available in an ART workflow - an approach we term Intentional Deep Overfit Learning (IDOL). Implementing the IDOL framework in any task in radiotherapy consists of two training… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

  15. arXiv:2103.05198  [pdf, ps, other

    physics.bio-ph eess.SY q-bio.QM

    Continuous body 3-D reconstruction of limbless animals

    Authors: Qiyuan Fu, Thomas W. Mitchel, ** Seob Kim, Gregory S. Chirikjian, Chen Li

    Abstract: Limbless animals such as snakes, limbless lizards, worms, eels, and lampreys move their slender, long bodies in three dimensions to traverse diverse environments. Accurately quantifying their continuous body's 3-D shape and motion is important for understanding body-environment interactions in complex terrain, but this is difficult to achieve (especially for local orientation and rotation). Here,… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Journal ref: Journal of Experimental Biology (2021), 224 (6), jeb220731

  16. arXiv:2003.13733  [pdf

    physics.bio-ph eess.SY q-bio.QM

    Lateral oscillation and body compliance help snakes and snake robots stably traverse large, smooth obstacles

    Authors: Qiyuan Fu, Sean W. Gart, Thomas W. Mitchel, ** Seob Kim, Gregory S. Chirikjian, Chen Li

    Abstract: Snakes can move through almost any terrain. Similarly, snake robots hold the promise as a versatile platform to traverse complex environments like earthquake rubble. Unlike snake locomotion on flat surfaces which is inherently stable, when snakes traverse complex terrain by deforming their body out of plane, it becomes challenging to maintain stability. Here, we review our recent progress in under… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Journal ref: Integrative and Comparative Biology (2020), 60 (1), 171