Showing 1–8 of 8 results for author: Mu, Z

Searching in archive eess.
  1. arXiv:2405.12464  [pdf, other]

    eess.SY

    Evaluation of Connected Vehicle Identification-Aware Mixed Traffic Freeway Cooperative Merging

    Authors: Haoji Liu, Fatemeh Jahedinia, Zeyu Mu, B. Brian Park

    Abstract: Cooperative on-ramp merging control for connected automated vehicles (CAVs) has been extensively investigated. However, they did neglect the connected vehicle identification process, which is a must for CAV cooperations. In this paper, we introduced a connected vehicle identification system (VIS) into the on-ramp merging control process for the first time and proposed an evaluation framework to as…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: @2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  2. arXiv:2404.12725  [pdf, other]

    cs.SD cs.CV cs.LG cs.MM eess.AS

    Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction

    Authors: Zhaoxi Mu, Xinyu Yang

    Abstract: The integration of visual cues has revitalized the performance of the target speech extraction task, elevating it to the forefront of the field. Nevertheless, this multi-modal learning paradigm often encounters the challenge of modality imbalance. In audio-visual target speech extraction tasks, the audio modality tends to dominate, potentially overshadowing the importance of visual guidance. To ta…

    Submitted 5 May, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: Accepted by IJCAI 2024

  3. arXiv:2312.10305  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction

    Authors: Zhaoxi Mu, Xinyu Yang, Sining Sun, Qing Yang

    Abstract: Speech signals are inherently complex as they encompass both global acoustic characteristics and local semantic information. However, in the task of target speech extraction, certain elements of global and local semantic information in the reference speech, which are irrelevant to speaker identity, can lead to speaker confusion within the speech extraction network. To overcome this challenge, we p…

    Submitted 19 January, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI2024

  4. arXiv:2303.03737  [pdf, other]

    cs.SD cs.LG eess.AS

    Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning

    Authors: Zhaoxi Mu, Xinyu Yang, Wenjing Zhu

    Abstract: Transformer has shown advanced performance in speech separation, benefiting from its ability to capture global features. However, capturing local features and channel information of audio sequences in speech separation is equally important. In this paper, we present a novel approach named Intra-SE-Conformer and Inter-Transformer (ISCIT) for speech separation. Specifically, we design a new network…

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  5. arXiv:2303.03732  [pdf, other]

    cs.SD cs.LG eess.AS

    A Multi-Stage Triple-Path Method for Speech Separation in Noisy and Reverberant Environments

    Authors: Zhaoxi Mu, Xinyu Yang, Xiangyuan Yang, Wenjing Zhu

    Abstract: In noisy and reverberant environments, the performance of deep learning-based speech separation methods drops dramatically because previous methods are not designed and optimized for such situations. To address this issue, we propose a multi-stage end-to-end learning method that decouples the difficult speech separation problem in noisy and reverberant environments into three sub-problems: speech…

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  6. arXiv:2202.05430  [pdf]

    eess.SY eess.SP

    Wind power ramp prediction algorithm based on wavelet deep belief network

    Authors: Zhenhao Tang, Qingyu Meng, Shengxian Cao, Yang Li, Zhongha Mu, Xiaoya Pang

    Abstract: The wind power ramp events threaten the power grid safety significantly. To improve the ramp prediction accuracy, a hybrid wavelet deep belief network algorithm with adaptive feature selection (WDBNAFS) is proposed. First, the wind power characteristic is analyzed. Then, wavelet decomposition is addressed to the time series, and an adaptive feature selection algorithm is proposed to select the inp…

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: in Chinese language

    Journal ref: ACTA Energiae Solaris Sinica 40 (2019) 3213-3220

  7. arXiv:2104.09995  [pdf, other]

    cs.SD cs.CL eess.AS

    Review of end-to-end speech synthesis technology based on deep learning

    Authors: Zhaoxi Mu, Xinyu Yang, Yizhuo Dong

    Abstract: As an indispensable part of modern human-computer interaction system, speech synthesis technology helps users get the output of intelligent machine more easily and intuitively, thus has attracted more and more attention. Due to the limitations of high complexity and low efficiency of traditional speech synthesis technology, the current research focus is the deep learning-based end-to-end speech sy…

    Submitted 20 April, 2021; originally announced April 2021.

  8. arXiv:2102.12726  [pdf, other]

    cs.RO cs.CY eess.SY

    Design and Control of a Highly Redundant Rigid-Flexible Coupling Robot to Assist the COVID-19 Oropharyngeal-Swab Sampling

    Authors: Yingbai Hu, Jian Li, Yongquan Chen, Qiwen Wang, Chuliang Chi, Heng Zhang, Qing Gao, Yuanmin Lan, Zheng Li, Zonggao Mu, Zhenglong Sun, Alois Knoll

    Abstract: The outbreak of novel coronavirus pneumonia (COVID-19) has caused mortality and morbidity worldwide. Oropharyngeal-swab (OP-swab) sampling is widely used for the diagnosis of COVID-19 in the world. To avoid the clinical staff from being affected by the virus, we developed a 9-degree-of-freedom (DOF) rigid-flexible coupling (RFC) robot to assist the COVID-19 OP-swab sampling. This robot is composed…

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: 8 pages, 11 figures