Skip to main content

Showing 1–26 of 26 results for author: Jeon, Y

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.02592  [pdf

    cs.CL cs.SD eess.AS

    Leveraging the Interplay Between Syntactic and Acoustic Cues for Optimizing Korean TTS Pause Formation

    Authors: Ye** Jeon, Yunsu Kim, Gary Geunbae Lee

    Abstract: Contemporary neural speech synthesis models have indeed demonstrated remarkable proficiency in synthetic speech generation as they have attained a level of quality comparable to that of human-produced speech. Nevertheless, it is important to note that these achievements have predominantly been verified within the context of high-resource languages such as English. Furthermore, the Tacotron and Fas… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024

  2. arXiv:2403.18878  [pdf, other

    cs.CV cs.LG eess.IV

    AIC-UNet: Anatomy-informed Cascaded UNet for Robust Multi-Organ Segmentation

    Authors: Young Seok Jeon, Hongfei Yang, Huazhu Fu, Mengling Feng

    Abstract: Imposing key anatomical features, such as the number of organs, their shapes, sizes, and relative positions, is crucial for building a robust multi-organ segmentation model. Current attempts to incorporate anatomical features include broadening effective receptive fields (ERF) size with resource- and data-intensive modules such as self-attention or introducing organ-specific topology regularizers,… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  3. arXiv:2403.07355  [pdf, ps, other

    eess.SP cs.AI cs.CV

    Vector Quantization for Deep-Learning-Based CSI Feedback in Massive MIMO Systems

    Authors: Junyong Shin, Yu** Kang, Yo-Seb Jeon

    Abstract: This paper presents a finite-rate deep-learning (DL)-based channel state information (CSI) feedback method for massive multiple-input multiple-output (MIMO) systems. The presented method provides a finite-bit representation of the latent vector based on a vector-quantized variational autoencoder (VQ-VAE) framework while reducing its computational complexity based on shape-gain vector quantization.… ▽ More

    Submitted 12 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  4. arXiv:2403.07255  [pdf, other

    eess.SP cs.AI cs.LG

    Deep Learning-Assisted Parallel Interference Cancellation for Grant-Free NOMA in Machine-Type Communication

    Authors: Yongjeong Oh, Jaehong Jo, Byonghyo Shim, Yo-Seb Jeon

    Abstract: In this paper, we present a novel approach for joint activity detection (AD), channel estimation (CE), and data detection (DD) in uplink grant-free non-orthogonal multiple access (NOMA) systems. Our approach employs an iterative and parallel interference removal strategy inspired by parallel interference cancellation (PIC), enhanced with deep learning to jointly tackle the AD, CE, and DD problems.… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  5. arXiv:2403.04111  [pdf

    cs.SD eess.AS

    Multi-Level Attention Aggregation for Language-Agnostic Speaker Replication

    Authors: Ye** Jeon, Gary Geunbae Lee

    Abstract: This paper explores the task of language-agnostic speaker replication, a novel endeavor that seeks to replicate a speaker's voice irrespective of the language they are speaking. Towards this end, we introduce a multi-level attention aggregation approach that systematically probes and amplifies various speaker-specific attributes in a hierarchical manner. Through rigorous evaluations across a wide… ▽ More

    Submitted 3 April, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted to EACL Main 2024

  6. arXiv:2401.02014  [pdf, other

    cs.SD eess.AS

    Enhancing Zero-Shot Multi-Speaker TTS with Negated Speaker Representations

    Authors: Ye** Jeon, Yunsu Kim, Gary Geunbae Lee

    Abstract: Zero-shot multi-speaker TTS aims to synthesize speech with the voice of a chosen target speaker without any fine-tuning. Prevailing methods, however, encounter limitations at adapting to new speakers of out-of-domain settings, primarily due to inadequate speaker disentanglement and content leakage. To overcome these constraints, we propose an innovative negation feature learning paradigm that mode… ▽ More

    Submitted 5 March, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI 2024

  7. arXiv:2312.01842  [pdf, other

    cs.SD cs.AI eess.AS

    Exploring the Viability of Synthetic Audio Data for Audio-Based Dialogue State Tracking

    Authors: Jihyun Lee, Ye** Jeon, Wonjun Lee, Yunsu Kim, Gary Geunbae Lee

    Abstract: Dialogue state tracking plays a crucial role in extracting information in task-oriented dialogue systems. However, preceding research are limited to textual modalities, primarily due to the shortage of authentic human audio datasets. We address this by investigating synthetic audio data for audio-based DST. To this end, we develop cascading and end-to-end models, train them with our synthetic audi… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted in ASRU 2023

  8. arXiv:2312.01100  [pdf, ps, other

    cs.IT eess.SP

    Prior-Aware Robust Beam Alignment for Low-SNR Millimeter-Wave Communications

    Authors: Jihun Park, Yongjeong Oh, Jaewon Yun, Seonjung Kim, Yo-Seb Jeon

    Abstract: This paper presents a robust beam alignment technique for millimeter-wave communications in low signal-to-noise ratio (SNR) environments. The core strategy of our technique is to repeatedly transmit the most probable beam candidates to reduce beam misalignment probability induced by noise. Specifically, for a given beam training overhead, both the selection of candidates and the number of repetiti… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  9. arXiv:2311.17396  [pdf, other

    cs.CV eess.IV

    Spectral and Polarization Vision: Spectro-polarimetric Real-world Dataset

    Authors: Yu** Jeon, Eunsue Choi, Youngchan Kim, Yunseong Moon, Khalid Omer, Felix Heide, Seung-Hwan Baek

    Abstract: Image datasets are essential not only in validating existing methods in computer vision but also in develo** new methods. Most existing image datasets focus on trichromatic intensity images to mimic human vision. However, polarization and spectrum, the wave properties of light that animals in harsh environments and with limited brain capacity often rely on, remain underrepresented in existing da… ▽ More

    Submitted 30 November, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

  10. arXiv:2311.08146  [pdf, ps, other

    eess.SP cs.IT

    Joint Source-Channel Coding for Channel-Adaptive Digital Semantic Communications

    Authors: Joohyuk Park, Yongjeong Oh, Seonjung Kim, Yo-Seb Jeon

    Abstract: In this paper, we propose a novel joint source-channel coding (JSCC) approach for channel-adaptive digital semantic communications. In semantic communication systems with digital modulation and demodulation, robust design of JSCC encoder and decoder becomes challenging not only due to the unpredictable dynamics of channel conditions but also due to diverse modulation orders. To address this challe… ▽ More

    Submitted 18 March, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  11. arXiv:2311.02405  [pdf, ps, other

    cs.IT eess.SP

    SplitMAC: Wireless Split Learning over Multiple Access Channels

    Authors: Seonjung Kim, Yongjeong Oh, Yo-Seb Jeon

    Abstract: This paper presents a novel split learning (SL) framework, referred to as SplitMAC, which reduces the latency of SL by leveraging simultaneous uplink transmission over multiple access channels. The key strategy is to divide devices into multiple groups and allow the devices within the same group to simultaneously transmit their smashed data and device-side models over the multiple access channels.… ▽ More

    Submitted 19 March, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

  12. arXiv:2307.10815  [pdf, ps, other

    eess.SP cs.DC

    Communication-Efficient Federated Learning over Capacity-Limited Wireless Networks

    Authors: Jaewon Yun, Yongjeong Oh, Yo-Seb Jeon, H. Vincent Poor

    Abstract: In this paper, a communication-efficient federated learning (FL) framework is proposed for improving the convergence rate of FL under a limited uplink capacity. The central idea of the proposed framework is to transmit the values and positions of the top-$S$ entries of a local model update for uplink transmission. A lossless encoding technique is considered for transmitting the positions of these… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

  13. arXiv:2306.13361  [pdf, other

    physics.optics cs.CV eess.IV

    Neural 360$^\circ$ Structured Light with Learned Metasurfaces

    Authors: Eunsue Choi, Gyeongtae Kim, Jooyeong Yun, Yu** Jeon, Junsuk Rho, Seung-Hwan Baek

    Abstract: Structured light has proven instrumental in 3D imaging, LiDAR, and holographic light projection. Metasurfaces, comprised of sub-wavelength-sized nanostructures, facilitate 180$^\circ$ field-of-view (FoV) structured light, circumventing the restricted FoV inherent in traditional optics like diffractive optical elements. However, extant metasurface-facilitated structured light exhibits sub-optimal p… ▽ More

    Submitted 27 June, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

  14. arXiv:2306.05146  [pdf, ps, other

    eess.SP cs.IT

    MIMO Detection under Hardware Impairments: Learning with Noisy Labels

    Authors: **man Kwon, Seunghyeon Jeon, Yo-Seb Jeon, H. Vincent Poor

    Abstract: This paper considers a data detection problem in multiple-input multiple-output (MIMO) communication systems with hardware impairments. To address challenges posed by nonlinear and unknown distortion in received signals, two learning-based detection methods, referred to as model-driven and data-driven, are presented. The model-driven method employs a generalized Gaussian distortion model to approx… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  15. arXiv:2207.14477  [pdf, other

    eess.IV cs.CV

    FCSN: Global Context Aware Segmentation by Learning the Fourier Coefficients of Objects in Medical Images

    Authors: Young Seok Jeon, Hongfei Yang, Mengling Feng

    Abstract: The encoder-decoder model is a commonly used Deep Neural Network (DNN) model for medical image segmentation. Conventional encoder-decoder models make pixel-wise predictions focusing heavily on local patterns around the pixel. This makes it challenging to give segmentation that preserves the object's shape and topology, which often requires an understanding of the global context of the object. In t… ▽ More

    Submitted 29 July, 2022; originally announced July 2022.

  16. arXiv:2207.01581  [pdf, other

    cs.LG cs.AI eess.SP q-bio.NC

    Interpretable Fusion Analytics Framework for fMRI Connectivity: Self-Attention Mechanism and Latent Space Item-Response Model

    Authors: Jeong-Jae Kim, Yeseul Jeon, SuMin Yu, Junggu Choi, Sanghoon Han

    Abstract: There have been several attempts to use deep learning based on brain fMRI signals to classify cognitive impairment diseases. However, deep learning is a hidden black box model that makes it difficult to interpret the process of classification. To address this issue, we propose a novel analytical framework that interprets the classification result from deep learning processes. We first derive the r… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 38 pages,12 figures,3 tables

  17. arXiv:2205.15271  [pdf, other

    cs.LG eess.SP

    MetaSSD: Meta-Learned Self-Supervised Detection

    Authors: Moon Jeong Park, Jungseul Ok, Yo-Seb Jeon, Dongwoo Kim

    Abstract: Deep learning-based symbol detector gains increasing attention due to the simple algorithm design than the traditional model-based algorithms such as Viterbi and BCJR. The supervised learning framework is often employed to predict the input symbols, where training symbols are used to train the model. There are two major limitations in the supervised approaches: a) a model needs to be retrained fro… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: Accepted by ISIT 2022

  18. arXiv:2204.07692  [pdf, ps, other

    cs.IT cs.DC eess.SP

    FedVQCS: Federated Learning via Vector Quantized Compressed Sensing

    Authors: Yongjeong Oh, Yo-Seb Jeon, Mingzhe Chen, Walid Saad

    Abstract: In this paper, a new communication-efficient federated learning (FL) framework is proposed, inspired by vector quantized compressed sensing. The basic strategy of the proposed framework is to compress the local model update at each device by applying dimensionality reduction followed by vector quantization. Subsequently, the global model update is reconstructed at a parameter server by applying a… ▽ More

    Submitted 30 June, 2023; v1 submitted 15 April, 2022; originally announced April 2022.

  19. arXiv:2204.01052  [pdf, ps, other

    eess.SP

    Semi-Data-Aided Channel Estimation for MIMO Systems via Reinforcement Learning

    Authors: Tae-Kyoung Kim, Yo-Seb Jeon, Jun Li, Nima Tavangaran, H. Vincent Poor

    Abstract: Data-aided channel estimation is a promising solution to improve channel estimation accuracy by exploiting data symbols as pilot signals for updating an initial channel estimate. In this paper, we propose a semi-data-aided channel estimator for multiple-input multiple-output communication systems. Our strategy is to leverage reinforcement learning (RL) for selecting reliable detected symbols among… ▽ More

    Submitted 3 April, 2022; originally announced April 2022.

  20. arXiv:2111.15071  [pdf, ps, other

    cs.DC cs.AI eess.SP

    Communication-Efficient Federated Learning via Quantized Compressed Sensing

    Authors: Yongjeong Oh, Namyoon Lee, Yo-Seb Jeon, H. Vincent Poor

    Abstract: In this paper, we present a communication-efficient federated learning framework inspired by quantized compressed sensing. The presented framework consists of gradient compression for wireless devices and gradient reconstruction for a parameter server (PS). Our strategy for gradient compression is to sequentially perform block sparsification, dimensional reduction, and quantization. Thanks to grad… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

  21. arXiv:2006.16515  [pdf, other

    eess.SP

    Design and Analysis of LoS MIMO Systems with Uniform Circular Arrays

    Authors: Yuri Jeon, Gye-Tae Gil, Yong H. Lee

    Abstract: We consider the design of a uniform circular array (UCA) based multiple-input multiple-output (MIMO) system over line-of-sight (LoS) environments in which array misalignment exists. In particular, optimal antenna placement in UCAs and transceiver architectures to achieve the maximum channel capacity without the knowledge of misalignment components are presented. To this end, we first derive a gene… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: 13 pages, 10 figures, This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  22. arXiv:2003.10084  [pdf, ps, other

    eess.SP cs.IT

    Data-Aided Channel Estimator for MIMO Systems via Reinforcement Learning

    Authors: Yo-Seb Jeon, Jun Li, Nima Tavangaran, H. Vincent Poor

    Abstract: This paper presents a data-aided channel estimator that reduces the channel estimation error of the conventional linear minimum-mean-squared-error (LMMSE) method for multiple-input multiple-output communication systems. The basic idea is to selectively exploit detected symbol vectors obtained from data detection as additional pilot signals. To optimize the selection of the detected symbol vectors,… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

  23. arXiv:2003.08059  [pdf, ps, other

    eess.SP cs.DC cs.LG

    A Compressive Sensing Approach for Federated Learning over Massive MIMO Communication Systems

    Authors: Yo-Seb Jeon, Mohammad Mohammadi Amiri, Jun Li, H. Vincent Poor

    Abstract: Federated learning is a privacy-preserving approach to train a global model at a central server by collaborating with wireless devices, each with its own local training data set. In this paper, we present a compressive sensing approach for federated learning over massive multiple-input multiple-output communication systems in which the central server equipped with a massive antenna array communica… ▽ More

    Submitted 5 August, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: The title of the paper has been changed from "Gradient Estimation for Federated Learning over Massive MIMO Communication Systems" to "A Compressive Sensing Approach for Federated Learning over Massive MIMO Communication Systems"

  24. Multi-Channel Volumetric Neural Network for Knee Cartilage Segmentation in Cone-beam CT

    Authors: Jennifer Maier, Luis Carlos Rivera Monroy, Christopher Syben, Ye** Jeon, Jang-Hwan Choi, Mary Elizabeth Hall, Marc Levenston, Garry Gold, Rebecca Fahrig, Andreas Maier

    Abstract: Analyzing knee cartilage thickness and strain under load can help to further the understanding of the effects of diseases like Osteoarthritis. A precise segmentation of the cartilage is a necessary prerequisite for this analysis. This segmentation task has mainly been addressed in Magnetic Resonance Imaging, and was rarely investigated on contrast-enhanced Computed Tomography, where contrast agent… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

    Comments: 6 pages, accepted at BVM 2020

  25. arXiv:1905.00273  [pdf

    eess.SP

    A fully-digital semi-rotational frequency detection algorithm for bang-bang CDRs

    Authors: Soon-Won Kwon, Hanho Choi, Younho Jeon, Bong** Kim, WooHyun Kwon, Homin Park, Kyeongha Kwon, Gain Kim, Hyeon-Min Bae

    Abstract: This work presents a new frequency acquisition method using semi-rotational frequency detection (SRFD) algorithm for a reference-less clock and data recovery (CDR) in a serial-link receiver. The proposed SRFD algorithm classifies the bang-bang phase detector(BBPD) outputs to estimate the current phase state, and detects the frequency mismatch between the input data and the sampling clock. The VCO-… ▽ More

    Submitted 1 May, 2019; originally announced May 2019.

  26. arXiv:1903.12546  [pdf, ps, other

    eess.SP cs.IT cs.LG

    Robust Data Detection for MIMO Systems with One-Bit ADCs: A Reinforcement Learning Approach

    Authors: Yo-Seb Jeon, Namyoon Lee, H. Vincent Poor

    Abstract: The use of one-bit analog-to-digital converters (ADCs) at a receiver is a power-efficient solution for future wireless systems operating with a large signal bandwidth and/or a massive number of receive radio frequency chains. This solution, however, induces a high channel estimation error and therefore makes it difficult to perform the optimal data detection that requires perfect knowledge of like… ▽ More

    Submitted 29 March, 2019; originally announced March 2019.