
Showing 1–7 of 7 results for author: Tsai, S

  1. arXiv:2402.12888  [pdf, other]

    eess.IV

    Transformer-based Learned Image Compression for Joint Decoding and Denoising

    Authors: Yi-Hsin Chen, Kuan-Wei Ho, Shiau-Rung Tsai, Guan-Hsun Lin, Alessandro Gnutti, Wen-Hsiao Peng, Riccardo Leonardi

    Abstract: This work introduces a Transformer-based image compression system. It has the flexibility to switch between the standard image reconstruction and the denoising reconstruction from a single compressed bitstream. Instead of training separate decoders for these tasks, we incorporate two add-on modules to adapt a pre-trained image decoder from performing the standard image reconstruction to joint deco…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted to PCS 2024

  2. arXiv:2401.09648  [pdf]

    eess.SP cs.NI

    Staggered Comb Reference Signal Design for Integrated Communication and Sensing

    Authors: Rui Zhang, Shawn Tsai, Tzu-Han Chou, Jiaying Ren

    Abstract: Ambiguity performance is a critical criterion in radar sensor design, which indicates the ambiguities arising from multiple target estimation and detection. We considered a requirement-driven selection of OFDM reference signal (RS) patterns based on ambiguity performances for bi-static sensing in integrated communication and sensing with minimal modifications of current RSs. An RS pattern with a s…

    Submitted 25 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted by IEEE International Symposium on Personal, Indoor and Mobile Radio Communications. arXiv admin note: substantial text overlap with arXiv:2401.09643

  3. arXiv:2401.09643  [pdf]

    eess.SP cs.NI

    OFDM Reference Signal Pattern Design Criteria for Integrated Communication and Sensing

    Authors: Rui Zhang, Shawn Tsai, Tzu-Han Chou, Jiaying Ren, Wenze Qu, Oliver Sun

    Abstract: Ambiguity performance, which indicates the maximum detectable region for target parameter estimation, is critical to radar sensor design. Driven by ambiguity performance requirements of bi-static sensing, we propose design criteria for orthogonal frequency division multiplexing (OFDM) reference signal (RS) patterns. The design not only reduces ambiguities in both time delay and Doppler shift domai…

    Submitted 25 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  4. arXiv:2312.14495  [pdf, other]

    cs.SI cs.IT eess.SP

    Beam Foreseeing in Millimeter-Wave Systems with Situational Awareness: Fundamental Limits via Cramér-Rao Lower Bound

    Authors: Wan-Ting Shih, Chao-Kai Wen, Shang-Ho Tsai, Shi Jin, Chau Yuen

    Abstract: Millimeter-wave (mmWave) networks offer the potential for high-speed data transfer and precise localization, leveraging large antenna arrays and extensive bandwidths. However, these networks are challenged by significant path loss and susceptibility to blockages. In this study, we delve into the use of situational awareness for beam prediction within the 5G NR beam management framework. We introdu…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 16 pages, 10 figures; IEEE Transactions on Wireless Communications

  5. arXiv:2306.06653  [pdf, other]

    cs.SD eess.AS

    Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features

    Authors: Hsin-Hao Chen, Yung-Lun Chien, Ming-Chi Yen, Shu-Wei Tsai, Yu Tsao, Tai-shih Chi, Hsin-Min Wang

    Abstract: Patients who have had their entire larynx removed, including the vocal folds, owing to throat cancer may experience difficulties in speaking. In such cases, electrolarynx devices are often prescribed to produce speech, which is commonly referred to as electrolaryngeal speech (EL speech). However, the quality and intelligibility of EL speech are poor. To address this problem, EL voice conversion (E…

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted to INTERSPEECH 2023

  6. arXiv:2306.06652  [pdf, other]

    cs.SD eess.AS

    Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion

    Authors: Yung-Lun Chien, Hsin-Hao Chen, Ming-Chi Yen, Shu-Wei Tsai, Hsin-Min Wang, Yu Tsao, Tai-Shih Chi

    Abstract: Electrolarynx is a commonly used assistive device to help patients with removed vocal cords regain their ability to speak. Although the electrolarynx can generate excitation signals like the vocal cords, the naturalness and intelligibility of electrolaryngeal (EL) speech are very different from those of natural (NL) speech. Many deep-learning-based models have been applied to electrolaryngeal spee…

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted to INTERSPEECH 2023

  7. arXiv:2109.03551  [pdf, other]

    cs.SD cs.CL cs.CV eess.AS

    Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion

    Authors: Yi-Syuan Liou, Wen-Chin Huang, Ming-Chi Yen, Shu-Wei Tsai, Yu-Huai Peng, Tomoki Toda, Yu Tsao, Hsin-Min Wang

    Abstract: Voice conversion (VC) is an effective approach to electrolaryngeal (EL) speech enhancement, a task that aims to improve the quality of the artificial voice from an electrolarynx device. In frame-based VC methods, time alignment needs to be performed prior to model training, and the dynamic time warping (DTW) algorithm is widely adopted to compute the best time alignment between each utterance pair…

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to APSIPA ASC 2021