Skip to main content

Showing 1–19 of 19 results for author: Shin, W

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19135  [pdf, other

    eess.AS cs.AI

    DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability

    Authors: Hyun Joon Park, ** Sob Kim, Wooseok Shin, Sung Won Han

    Abstract: Expressive Text-to-Speech (TTS) using reference speech has been studied extensively to synthesize natural speech, but there are limitations to obtaining well-represented styles and improving model generalization ability. In this study, we present Diffusion-based EXpressive TTS (DEX-TTS), an acoustic model designed for reference-based speech synthesis with enhanced style representations. Based on a… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Preprint

  2. arXiv:2402.09253  [pdf, other

    eess.SP

    Max-Min Fair Energy-Efficient Beam Design for Quantized ISAC LEO Satellite Systems: A Rate-Splitting Approach

    Authors: Ziang Liu, Longfei Yin, Wonjae Shin, Bruno Clerckx

    Abstract: Low earth orbit (LEO) satellite systems with sensing functionality is envisioned to facilitate global-coverage service and emerging applications in 6G. Currently, two fundamental challenges, namely, inter-beam interference among users and power limitation at the LEO satellites, limit the full potential of the joint design of sensing and communication. To effectively control the interference, rate-… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Submitted to IEEE journal

  3. arXiv:2308.05966  [pdf, other

    eess.SP

    On the Learning of Digital Self-Interference Cancellation in Full-Duplex Radios

    Authors: Jungyeon Kim, Hyowon Lee, Heedong Do, **seok Choi, Jeonghun Park, Wonjae Shin, Yonina C. Eldar, Namyoon Lee

    Abstract: Full-duplex communication systems have the potential to achieve significantly higher data rates and lower latency compared to their half-duplex counterparts. This advantage stems from their ability to transmit and receive data simultaneously. However, to enable successful full-duplex operation, the primary challenge lies in accurately eliminating strong self-interference (SI). Overcoming this chal… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 figures and 1 table

  4. arXiv:2307.07382  [pdf, other

    cs.IT eess.SP

    Distributed Rate-Splitting Multiple Access for Multilayer Satellite Communications

    Authors: Yunnuo Xu, Longfei Yin, Yijie Mao, Wonjae Shin, Bruno Clerckx

    Abstract: Future wireless networks, in particular, 5G and beyond, are anticipated to deploy dense Low Earth Orbit (LEO) satellites to provide global coverage and broadband connectivity. However, the limited frequency band and the coexistence of multiple constellations bring new challenges for interference management. In this paper, we propose a robust multilayer interference management scheme for spectrum s… ▽ More

    Submitted 2 May, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

  5. arXiv:2306.12978  [pdf, other

    cs.IT eess.SP

    Rate-Splitting Multiple Access for 6G Networks: Ten Promising Scenarios and Applications

    Authors: Jeonghun Park, Byungju Lee, **seok Choi, Hoon Lee, Namyoon Lee, Seok-Hwan Park, Kyoung-Jae Lee, Junil Choi, Sung Ho Chae, Sang-Woon Jeon, Kyung Sup Kwak, Bruno Clerckx, Wonjae Shin

    Abstract: In the upcoming 6G era, multiple access (MA) will play an essential role in achieving high throughput performances required in a wide range of wireless applications. Since MA and interference management are closely related issues, the conventional MA techniques are limited in that they cannot provide near-optimal performance in universal interference regimes. Recently, rate-splitting multiple acce… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: 17 pages, 6 figures, submitted to IEEE Network Magazine

  6. arXiv:2303.15703  [pdf, other

    eess.AS

    AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection

    Authors: ** Sob Kim, Hyun Joon Park, Wooseok Shin, Sung Won Han

    Abstract: Sound event localization and detection (SELD) combines the identification of sound events with the corresponding directions of arrival (DOA). Recently, event-oriented track output formats have been adopted to solve this problem; however, they still have limited generalization toward real-world problems in an unknown polyphony environment. To address the issue, we proposed an angular-distance-based… ▽ More

    Submitted 10 May, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 5 pages, 3 figures, accepted for publication in IEEE ICASSP 2023

  7. arXiv:2303.09057  [pdf, other

    eess.AS cs.SD

    TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion

    Authors: Hyun Joon Park, Seok Woo Yang, ** Sob Kim, Wooseok Shin, Sung Won Han

    Abstract: Voice Conversion (VC) must be achieved while maintaining the content of the source speech and representing the characteristics of the target speaker. The existing methods do not simultaneously satisfy the above two aspects of VC, and their conversion outputs suffer from a trade-off problem between maintaining source contents and target characteristics. In this study, we propose Triple Adaptive Att… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: To appear in ICASSP 2023

  8. arXiv:2302.07476  [pdf, other

    cs.IT eess.SP

    Indexed Multiple Access with Reconfigurable Intelligent Surfaces: The Reflection Tuning Potential

    Authors: Rohit Singh, Aryan Kaushik, Wonjae Shin, George C. Alexandropoulos, Mesut Toka, Marco Di Renzo

    Abstract: Indexed modulation (IM) is an evolving technique that has become popular due to its ability of parallel data communication over distinct combinations of transmission entities. In this article, we first provide a comprehensive survey of IM-enabled multiple access (MA) techniques, emphasizing the shortcomings of existing non-indexed MA schemes. Theoretical comparisons are presented to show how the n… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 7 pages, 5 figures, 1 table

  9. arXiv:2211.09988  [pdf, ps, other

    eess.AS cs.SD

    Exploring WavLM on Speech Enhancement

    Authors: Hyungchan Song, Sanyuan Chen, Zhuo Chen, Yu Wu, Takuya Yoshioka, Min Tang, Jong Won Shin, Shujie Liu

    Abstract: There is a surge in interest in self-supervised learning approaches for end-to-end speech encoding in recent years as they have achieved great success. Especially, WavLM showed state-of-the-art performance on various speech processing tasks. To better understand the efficacy of self-supervised learning models for speech enhancement, in this work, we design and conduct a series of experiments with… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted by IEEE SLT 2022

  10. arXiv:2211.08454  [pdf, other

    eess.SP

    Flexible Hybrid Beamforming for Spectrally Efficient 6G Joint Radar-Communications

    Authors: Aryan Kaushik, Evangelos Vlachos, Muhammad Z. Shakir, Wonjae Shin, Rongke Liu

    Abstract: Joint radar-communications (JRC) benefits from multi-functionality of radar and communication operations using same hardware and radio frequency (RF) spectrum resources. Thus JRC systems possess very high potential to be employed into the sixth generation (6G) standards. This paper designs a flexible beamformer for multiple-input multiple output (MIMO) JRC with maximized spectral efficiency (SE).… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 6 pages, conference

  11. Multi-View Attention Transfer for Efficient Speech Enhancement

    Authors: Wooseok Shin, Hyun Joon Park, ** Sob Kim, Byung Hoon Lee, Sung Won Han

    Abstract: Recent deep learning models have achieved high performance in speech enhancement; however, it is still challenging to obtain a fast and low-complexity model without significant performance degradation. Previous knowledge distillation studies on speech enhancement could not solve this problem because their output distillation methods do not fit the speech enhancement task in some aspects. In this s… ▽ More

    Submitted 30 October, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: Proceedings of Interspeech 2022

  12. arXiv:2208.00643  [pdf, other

    cs.IT eess.SP

    Rate-Splitting Multiple Access for Quantized Multiuser MIMO Communications

    Authors: Seokjun Park, **seok Choi, Jeonghun Park, Wonjae Shin, Bruno Clerckx

    Abstract: This paper investigates the sum spectral efficiency maximization problem in downlink multiuser multiple-input multiple-output (MIMO) systems with low-resolution quantizers at an access point (AP) and users. In particular, we consider rate-splitting multiple access (RSMA) to enhance spectral efficiency by offering opportunities to boost achievable degrees of freedom. Optimizing RSMA precoders, howe… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 30 pages, 8 figures

  13. arXiv:2207.11728  [pdf, other

    eess.SP cs.AR

    A Custom IC Layout Generation Engine Based on Dynamic Templates and Grids

    Authors: Taeho Shin, Dongjun Lee, Dongwhee Kim, Gaeryun Sung, Wook** Shin, Yunseong Jo, Hyungjoo Park, Jaeduk Han

    Abstract: This paper presents an automatic layout generation framework in advanced CMOS technologies. The framework extends the template-and-grid-based layout generation methodology with the following additional techniques applied to produce optimal layouts more effectively. First, layout templates and grids are dynamically created and adjusted during runtime to serve various structural, functional, and des… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

    Comments: 10 pages, 6 figures

  14. arXiv:2203.02181  [pdf, other

    eess.AS cs.SD eess.SP

    MANNER: Multi-view Attention Network for Noise Erasure

    Authors: Hyun Joon Park, Byung Ha Kang, Wooseok Shin, ** Sob Kim, Sung Won Han

    Abstract: In the field of speech enhancement, time domain methods have difficulties in achieving both high performance and efficiency. Recently, dual-path models have been adopted to represent long sequential features, but they still have limited representations and poor memory efficiency. In this study, we propose Multi-view Attention Network for Noise ERasure (MANNER) consisting of a convolutional encoder… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: To appear in ICASSP 2022

  15. arXiv:2202.05093  [pdf, other

    cs.AI cs.DC cs.LG cs.NE eess.SY

    Two-Stage Deep Anomaly Detection with Heterogeneous Time Series Data

    Authors: Kyeong-Joong Jeong, **-Duk Park, Kyusoon Hwang, Seong-Lyun Kim, Won-Yong Shin

    Abstract: We introduce a data-driven anomaly detection framework using a manufacturing dataset collected from a factory assembly line. Given heterogeneous time series data consisting of operation cycle signals and sensor signals, we aim at discovering abnormal events. Motivated by our empirical findings that conventional single-stage benchmark approaches may not exhibit satisfactory performance under our ch… ▽ More

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 10 pages, 4 figures, 4 tables; published in the IEEE Access (Please cite our journal version.)

  16. arXiv:2108.06844  [pdf, ps, other

    eess.SP cs.IT

    Rate-Splitting Multiple Access for Downlink MIMO: A Generalized Power Iteration Approach

    Authors: Jeonghun Park, **seok Choi, Namyoon Lee, Wonjae Shin, H. Vincent Poor

    Abstract: Rate-splitting multiple access (RSMA) is a general multiple access scheme for downlink multi-antenna systems embracing both classical spatial division multiple access and more recent non-orthogonal multiple access. Finding a linear precoding strategy that maximizes the sum spectral efficiency of RSMA is a challenging yet significant problem. In this paper, we put forth a novel precoder design fram… ▽ More

    Submitted 2 June, 2022; v1 submitted 15 August, 2021; originally announced August 2021.

    Comments: submitted to possible IEEE publication

  17. arXiv:2106.14203  [pdf, other

    eess.SY

    Joint Mobile Charging and Coverage-Time Extension for Unmanned Aerial Vehicles

    Authors: Soohyun Park, Won-Yong Shin, Minseok Choi, Joongheon Kim

    Abstract: In modern networks, the use of drones as mobile base stations (MBSs) has been discussed for coverage flexibility. However, the realization of drone-based networks raises several issues. One of the critical issues is drones are extremely power-hungry. To overcome this, we need to characterize a new type of drones, so-called charging drones, which can deliver energy to MBS drones. Motivated by the f… ▽ More

    Submitted 27 June, 2021; originally announced June 2021.

  18. arXiv:1808.07864  [pdf, ps, other

    cs.IT cs.NI eess.SP

    Secure Relaying in Non-Orthogonal Multiple Access: Trusted and Untrusted Scenarios

    Authors: Ahmed Arafa, Wonjae Shin, Mojtaba Vaezi, H. Vincent Poor

    Abstract: A downlink single-input single-output non-orthogonal multiple access setting is considered, in which a base station (BS) is communicating with two legitimate users in two possible scenarios of unsecure environments: existence of an external eavesdropper and communicating through an untrusted relay. For the first scenario, a number of trusted cooperative half-duplex relays is employed to assist wit… ▽ More

    Submitted 31 January, 2019; v1 submitted 23 August, 2018; originally announced August 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1805.01449

  19. arXiv:1805.01449  [pdf, ps, other

    cs.IT cs.NI eess.SP

    Securing Downlink Non-Orthogonal Multiple Access Systems by Trusted Relays

    Authors: Ahmed Arafa, Wonjae Shin, Mojtaba Vaezi, H. Vincent Poor

    Abstract: A downlink single-input single-output non-orthogonal multiple access system is considered in which a base station (BS) is communicating with two legitimate users in the presence of an external eavesdropper. A group of trusted cooperative half-duplex relay nodes, powered by the BS, is employed to assist the BS's transmission. The goal is to design relaying schemes such that the legitimate users' se… ▽ More

    Submitted 23 August, 2018; v1 submitted 3 May, 2018; originally announced May 2018.

    Comments: To appear in IEEE Globecom 2018