Skip to main content

Showing 1–11 of 11 results for author: Yeh, S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.00463  [pdf, other

    cs.LG cs.AI cs.CL cs.HC eess.AS

    Open-Source Conversational AI with SpeechBrain 1.0

    Authors: Mirco Ravanelli, Titouan Parcollet, Adel Moumen, Sylvain de Langen, Cem Subakan, Peter Plantinga, Yingzhi Wang, Pooneh Mousavi, Luca Della Libera, Artem Ploujnikov, Francesco Paissan, Davide Borra, Salah Zaiem, Zeyu Zhao, Shucong Zhang, Georgios Karakasidis, Sung-Lin Yeh, Aku Rouhe, Rudolf Braun, Florian Mai, Juan Zuluaga-Gomez, Seyed Mahed Mousavi, Andreas Nautsch, Xuechen Liu, Sangeet Sagar , et al. (5 additional authors not shown)

    Abstract: SpeechBrain is an open-source Conversational AI toolkit based on PyTorch, focused particularly on speech processing tasks such as speech recognition, speech enhancement, speaker recognition, text-to-speech, and much more.It promotes transparency and replicability by releasing both the pre-trained models and the complete "recipes" of code and algorithms required for training them. This paper presen… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: Submitted to JMLR (Machine Learning Open Source Software)

  2. arXiv:2405.06937  [pdf, other

    math.NA eess.SP

    High-Order Synchrosqueezed Chirplet Transforms for Multicomponent Signal Analysis

    Authors: Yi-Ju Yen, De-Yan Lu, Sing-Yuan Yeh, Jian-Jiun Ding, Chun-Yen Shen

    Abstract: This study focuses on the analysis of signals containing multiple components with crossover instantaneous frequencies (IF). This problem was initially solved with the chirplet transform (CT). Also, it can be sharpened by adding the synchrosqueezing step, which is called the synchrosqueezed chirplet transform (SCT). However, we found that the SCT goes wrong with the high chirp modulation signal due… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    MSC Class: 65T99; 42C99; 42a38

  3. arXiv:2401.08833  [pdf, other

    eess.AS cs.CL cs.SD

    Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective

    Authors: Alexander H. Liu, Sung-Lin Yeh, James Glass

    Abstract: Existing studies on self-supervised speech representation learning have focused on develo** new training methods and applying pre-trained models for different applications. However, the quality of these models is often measured by the performance of different downstream tasks. How well the representations access the information of interest is less studied. In this work, we take a closer look int… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: ICASSP 2024

  4. arXiv:2312.10547  [pdf, other

    cs.IT cs.LG cs.NI eess.SP

    Advancing RAN Slicing with Offline Reinforcement Learning

    Authors: Kun Yang, Shu-** Yeh, Menglei Zhang, Jerry Sydir, **g Yang, Cong Shen

    Abstract: Dynamic radio resource management (RRM) in wireless networks presents significant challenges, particularly in the context of Radio Access Network (RAN) slicing. This technology, crucial for catering to varying user requirements, often grapples with complex optimization scenarios. Existing Reinforcement Learning (RL) approaches, while achieving good performance in RAN slicing, typically rely on onl… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: 9 pages. 6 figures

  5. arXiv:2311.11423  [pdf, other

    cs.IT cs.LG cs.NI eess.SP

    Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets

    Authors: Kun Yang, Cong Shen, **g Yang, Shu-** Yeh, Jerry Sydir

    Abstract: The recent development of reinforcement learning (RL) has boosted the adoption of online RL for wireless radio resource management (RRM). However, online RL algorithms require direct interactions with the environment, which may be undesirable given the potential performance loss due to the unavoidable exploration in RL. In this work, we first investigate the use of \emph{offline} RL algorithms in… ▽ More

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: This paper is the camera ready version for Asilomar 2023

  6. arXiv:2309.01007  [pdf

    eess.IV cs.CV cs.LG

    Comparative Analysis of Deep Learning Architectures for Breast Cancer Diagnosis Using the BreaKHis Dataset

    Authors: İrem Sayın, Muhammed Ali Soydaş, Yunus Emre Mert, Arda Yarkataş, Berk Ergun, Selma Sözen Yeh, Hüseyin Üvet

    Abstract: Cancer is an extremely difficult and dangerous health problem because it manifests in so many different ways and affects so many different organs and tissues. The primary goal of this research was to evaluate deep learning models' ability to correctly identify breast cancer cases using the BreakHis dataset. The BreakHis dataset covers a wide range of breast cancer subtypes through its huge collect… ▽ More

    Submitted 10 September, 2023; v1 submitted 2 September, 2023; originally announced September 2023.

    Comments: 7 pages, 1 figure, 2 tables

    MSC Class: 68T01

  7. arXiv:2210.15793  [pdf, ps, other

    eess.AS cs.SD eess.SP

    Conditioning and Sampling in Variational Diffusion Models for Speech Super-Resolution

    Authors: Chin-Yun Yu, Sung-Lin Yeh, György Fazekas, Hao Tang

    Abstract: Recently, diffusion models (DMs) have been increasingly used in audio processing tasks, including speech super-resolution (SR), which aims to restore high-frequency content given low-resolution speech utterances. This is commonly achieved by conditioning the network of noise predictor with low-resolution audio. In this paper, we propose a novel sampling algorithm that communicates the information… ▽ More

    Submitted 24 November, 2022; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: Submitted to ICASSP 2023

  8. arXiv:2106.04624  [pdf, other

    eess.AS cs.AI cs.LG cs.SD

    SpeechBrain: A General-Purpose Speech Toolkit

    Authors: Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-Chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato De Mori, Yoshua Bengio

    Abstract: SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the research and development of neural speech processing technologies by being simple, flexible, user-friendly, and well-documented. This paper describes the core architecture designed to support several tasks of common interest, allowing users to naturally conceive, compare and share novel speech processing… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: Preprint

  9. arXiv:2006.13372  [pdf, other

    cs.NI eess.SP

    Handling Spontaneous Traffic Variations in 5G+ via Offloading onto mmWave-Capable UAV `Bridges'

    Authors: Nikita Tafintsev, Dmitri Moltchanov, Sergey Andreev, Shu-** Yeh, Nageen Himayat, Yevgeni Koucheryavy, Mikko Valkama

    Abstract: Unmanned aerial vehicles (UAVs) are increasingly employed for numerous public and civil applications, such as goods delivery, medicine, surveillance, and telecommunications. For the latter, UAVs with onboard communication equipment may help temporarily offload traffic onto the neighboring cells in fifth-generation networks and beyond (5G+). In this paper, we propose and evaluate the use of UAVs tr… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

    Comments: This work has been accepted for publication in the IEEE Transactions on Vehicular Technology

  10. arXiv:2005.12076  [pdf

    eess.SP

    An Effective Entropy-assisted Mind-wandering Detection System with EEG Signals based on MM-SART Database

    Authors: Yi-Ta Chen, Hsing-Hao Lee, Ching-Yen Shih, Zih-Ling Chen, Win-Ken Beh, Su-Ling Yeh, An-Yeu Wu

    Abstract: Mind-wandering (MW), which usually defined as a lapse of attention, occurs between 20%-40% of the time, has negative effects on our daily life. Therefore, detecting when MW occurs can prevent us from those negative outcomes resulting from MW, such as failing to keep track of course during learning. In this work, we first collect a multi-modal Sustained Attention to Response Task (MM-SART) database… ▽ More

    Submitted 27 November, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

    Comments: 15 pages, Journal version

  11. arXiv:1903.09893  [pdf, other

    eess.SP cs.NI

    Full-duplex in 5G Small Cell Access: SystemDesign and Performance Aspects

    Authors: **gwen Bai, Shu-** Yeh, Feng Xue, Yang-seok Choi, ** Wang, Shilpa Talwar

    Abstract: Recent achievement in self-interference cancellation algorithms enables potential application of full-duplex (FD) in 5G radio access systems. The exponential growth of data traffic in 5G can be supported by having more spectrum and higher spectral efficiency. FD communication promises to double the spectral efficiency by enabling simultaneous uplink and downlink transmissions in the same frequency… ▽ More

    Submitted 23 March, 2019; originally announced March 2019.

    Comments: Submitted to IEEE Communications Magazine