Showing 1–7 of 7 results for author: Weng, S

Searching in archive eess.

  1. arXiv:2404.00780  [pdf, ps, other]

    eess.SP

    Cooperative Gradient Coding for Collaborative Federated Learning

    Authors: Shudi Weng, Chengxi Li, Ming Xiao, Mikael Skoglund

    Abstract: We investigate federated learning (FL) in the presence of stragglers, with emphasis on wireless scenarios where the power-constrained edge devices collaboratively train a global model on their local datasets and transmit local model updates through fading channels. To tackle stragglers resulting from link disruptions without requiring accurate prior information on connectivity or dataset sharing,…

    Submitted 22 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  2. arXiv:2402.18147  [pdf, other]

    eess.IV cs.CV

    A Lightweight Low-Light Image Enhancement Network via Channel Prior and Gamma Correction

    Authors: Shyang-En Weng, Shaou-Gang Miaou, Ricky Christanto

    Abstract: Human vision relies heavily on available ambient light to perceive objects. Low-light scenes pose two distinct challenges: information loss due to insufficient illumination and undesirable brightness shifts. Low-light image enhancement (LLIE) refers to image enhancement technology tailored to handle this scenario. We introduce CPGA-Net, an innovative LLIE network that combines dark/bright channel…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Preprint of an article submitted for consideration in [International Journal of Pattern Recognition and Artificial Intelligence] © [2024] [copyright World Scientific Publishing Company] [https://www.worldscientific.com/worldscinet/ijprai]

  3. arXiv:2209.08606  [pdf, other]

    eess.SP

    Wideband mmWave Massive MIMO Channel Estimation and Localization

    Authors: Shudi Weng, Fan Jiang, Henk Wymeersch

    Abstract: Spatial wideband effects are known to affect channel estimation and localization performance in millimeter wave (mmWave) massive multiple-input multiple-output (MIMO) systems. Based on perturbation analysis, we show that the spatial wideband effect is in fact more pronounced than previously thought and significantly degrades performance, even at moderate bandwidths, if it is not properly considere…

    Submitted 18 September, 2022; originally announced September 2022.

  4. arXiv:2104.04221  [pdf]

    eess.AS eess.SP

    The NTNU Taiwanese ASR System for Formosa Speech Recognition Challenge 2020

    Authors: Fu-An Chao, Tien-Hong Lo, Shi-Yan Weng, Shih-Hsuan Chiu, Yao-Ting Sung, Berlin Chen

    Abstract: This paper describes the NTNU ASR system participating in the Formosa Speech Recognition Challenge 2020 (FSR-2020) supported by the Formosa Speech in the Wild project (FSW). FSR-2020 aims at fostering the development of Taiwanese speech recognition. Apart from the issues on tonal and dialectical variations of the Taiwanese language, speech artificially contaminated with different types of real-wor…

    Submitted 9 July, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: 17 pages, 3 figures, Accepted for publication in IJCLCLP

  5. arXiv:2010.14764   

    eess.AS

    Effective Decoder Masking for Transformer Based End-to-End Speech Recognition

    Authors: Shi-Yan Weng, Berlin Chen

    Abstract: The attention-based encoder-decoder modeling paradigm has achieved promising results on a variety of speech processing tasks like automatic speech recognition (ASR) and text-to-speech (TTS), among others. This paradigm takes advantage of the generalization ability of neural networks to learn a direct mapping from an input sequence to an output sequence, without recourse to prior knowledge such as…

    Submitted 21 July, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: More extensions and experiments are under exploration

  6. arXiv:2005.08440  [pdf]

    eess.AS cs.CL cs.SD

    An Effective End-to-End Modeling Approach for Mispronunciation Detection

    Authors: Tien-Hong Lo, Shi-Yan Weng, Hsiu-Jui Chang, Berlin Chen

    Abstract: Recently, end-to-end (E2E) automatic speech recognition (ASR) systems have garnered tremendous attention because of their great success and unified modeling paradigms in comparison to conventional hybrid DNN-HMM ASR systems. Despite the widespread adoption of E2E modeling frameworks on ASR, there still is a dearth of work on investigating the E2E frameworks for use in computer-assisted pronunciati…

    Submitted 17 May, 2020; originally announced May 2020.

    Comments: Submitted to Interspeech 2020

  7. arXiv:2005.08433  [pdf, other]

    eess.AS cs.CL cs.SD

    The NTNU System at the Interspeech 2020 Non-Native Children's Speech ASR Challenge

    Authors: Tien-Hong Lo, Fu-An Chao, Shi-Yan Weng, Berlin Chen

    Abstract: This paper describes the NTNU ASR system participating in the Interspeech 2020 Non-Native Children's Speech ASR Challenge supported by the SIG-CHILD group of ISCA. This ASR shared task is made much more challenging due to the coexisting diversity of non-native and children speaking characteristics. In the setting of closed-track evaluation, all participants were restricted to develop their systems…

    Submitted 2 June, 2020; v1 submitted 17 May, 2020; originally announced May 2020.

    Comments: Submitted to Interspeech 2020 Special Session: Shared Task on Automatic Speech Recognition for Non-Native Children's Speech