Showing 1–3 of 3 results for author: Tuan, C

  1. arXiv:2005.09966  [pdf, other]

    cs.SD eess.AS

    SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning

    Authors: Yuan-Kuei Wu, Chao-I Tuan, Hung-yi Lee, Yu Tsao

    Abstract: Speech data collected in real-world scenarios often encounters two issues. First, multiple sources may exist simultaneously, and the number of sources may vary with time. Second, the existence of background noise in recording is inevitable. To handle the first issue, we refer to speech separation approaches that separate speech from an unknown number of speakers. To address the second issue, we r…

    Submitted 20 May, 2020; originally announced May 2020.

    Comments: The two first authors made equal contributions

  2. arXiv:1912.03884  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    MITAS: A Compressed Time-Domain Audio Separation Network with Parameter Sharing

    Authors: Chao-I Tuan, Yuan-Kuei Wu, Hung-yi Lee, Yu Tsao

    Abstract: Deep learning methods have brought substantial advancements in speech separation (SS). Nevertheless, it remains challenging to deploy deep-learning-based models on edge devices. Thus, identifying an effective way to compress these large models without hurting SS performance has become an important research topic. Recently, TasNet and Conv-TasNet have been proposed. They achieved state-of-the-art r…

    Submitted 9 December, 2019; originally announced December 2019.

  3. arXiv:1904.07845  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering

    Authors: Gene-Ping Yang, Chao-I Tuan, Hung-Yi Lee, Lin-shan Lee

    Abstract: Speech separation has been very successful with deep learning techniques. Substantial effort has been reported based on approaches over the spectrogram, which is well known as the standard time-and-frequency cross-domain representation for speech signals. It is highly correlated to the phonetic structure of speech, or "how the speech sounds" when perceived by humans, but primarily frequency domain feat…

    Submitted 16 April, 2019; originally announced April 2019.

    Comments: Submitted to Interspeech 2019