Skip to main content

Showing 1–10 of 10 results for author: Qian, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.19373  [pdf, other

    eess.SP cs.LG

    Multi-modal Mood Reader: Pre-trained Model Empowers Cross-Subject Emotion Recognition

    Authors: Yihang Dong, Xuhang Chen, Yanyan Shen, Michael Kwok-Po Ng, Tao Qian, Shuqiang Wang

    Abstract: Emotion recognition based on Electroencephalography (EEG) has gained significant attention and diversified development in fields such as neural signal processing and affective computing. However, the unique brain anatomy of individuals leads to non-negligible natural differences in EEG signals across subjects, posing challenges for cross-subject emotion recognition. While recent studies have attem… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by International Conference on Neural Computing for Advanced Applications, 2024

  2. arXiv:2312.11896  [pdf, other

    eess.SY

    Stable Relay Learning Optimization Approach for Fast Power System Production Cost Minimization Simulation

    Authors: Zishan Guo, Qinran Hu, Tao Qian, Xin Fang, Renjie Hu, Zaijun Wu

    Abstract: Production cost minimization (PCM) simulation is commonly employed for assessing the operational efficiency, economic viability, and reliability, providing valuable insights for power system planning and operations. However, solving a PCM problem is time-consuming, consisting of numerous binary variables for simulation horizon extending over months and years. This hinders rapid assessment of moder… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: Submitted to IEEE Transactions on Power Systems on December 15, 2023

  3. arXiv:2308.02867  [pdf, other

    cs.SD eess.AS

    A Systematic Exploration of Joint-training for Singing Voice Synthesis

    Authors: Yuning Wu, Yifeng Yu, Jiatong Shi, Tao Qian, Qin **

    Abstract: There has been a growing interest in using end-to-end acoustic models for singing voice synthesis (SVS). Typically, these models require an additional vocoder to transform the generated acoustic features into the final waveform. However, since the acoustic model and the vocoder are not jointly optimized, a gap can exist between the two models, leading to suboptimal performance. Although a similar… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

  4. arXiv:2303.08607  [pdf, other

    cs.SD eess.AS

    PHONEix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation with Phoneme Distribution Predictor

    Authors: Yuning Wu, Jiatong Shi, Tao Qian, Dongji Gao, Qin **

    Abstract: Singing voice synthesis (SVS), as a specific task for generating the vocal singing voice from a music score, has drawn much attention in recent years. SVS faces the challenge that the singing has various pronunciation flexibility conditioned on the same music score. Most of the previous works of SVS can not well handle the misalignment between the music score and actual singing. In this paper, we… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  5. arXiv:2208.10059  [pdf, ps, other

    stat.ME eess.SY

    Sampling Gaussian Stationary Random Fields: A Stochastic Realization Approach

    Authors: Bin Zhu, Jiahao Liu, Zhengshou Lai, Tao Qian

    Abstract: Generating large-scale samples of stationary random fields is of great importance in the fields such as geomaterial modeling and uncertainty quantification. Traditional methodologies based on covariance matrix decomposition have the diffculty of being computationally expensive, which is even more serious when the dimension of the random field is large. This paper proposes an effcient stochastic re… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: 17 pages, 9 figures

  6. arXiv:2205.07319  [pdf

    cs.SD cs.AI cs.LG eess.AS

    cMelGAN: An Efficient Conditional Generative Model Based on Mel Spectrograms

    Authors: Tracy Qian, Jackson Kaunismaa, Tony Chung

    Abstract: Analysing music in the field of machine learning is a very difficult problem with numerous constraints to consider. The nature of audio data, with its very high dimensionality and widely varying scales of structure, is one of the primary reasons why it is so difficult to model. There are many applications of machine learning in music, like the classifying the mood of a piece of music, conditional… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

  7. arXiv:2205.04029  [pdf, other

    cs.SD cs.MM eess.AS

    Muskits: an End-to-End Music Processing Toolkit for Singing Voice Synthesis

    Authors: Jiatong Shi, Shuai Guo, Tao Qian, Nan Huo, Tomoki Hayashi, Yuning Wu, Frank Xu, Xuankai Chang, Huazhe Li, Peter Wu, Shinji Watanabe, Qin **

    Abstract: This paper introduces a new open-source platform named Muskits for end-to-end music processing, which mainly focuses on end-to-end singing voice synthesis (E2E-SVS). Muskits supports state-of-the-art SVS models, including RNN SVS, transformer SVS, and XiaoiceSing. The design of Muskits follows the style of widely-used speech processing toolkits, ESPnet and Kaldi, for data prepossessing, training,… ▽ More

    Submitted 2 July, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: Accepted by Interspeech

  8. arXiv:2203.17001  [pdf, other

    eess.AS cs.LG cs.SD

    SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy

    Authors: Shuai Guo, Jiatong Shi, Tao Qian, Shinji Watanabe, Qin **

    Abstract: Deep learning based singing voice synthesis (SVS) systems have been demonstrated to flexibly generate singing with better qualities, compared to conventional statistical parametric based methods. However, neural systems are generally data-hungry and have difficulty to reach reasonable singing quality with limited public available training data. In this work, we explore different data augmentation… ▽ More

    Submitted 6 July, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: Accepted by INTERSPEECH 2022

  9. arXiv:2203.05571  [pdf, other

    eess.IV cs.CV

    Deep Convolutional Neural Networks for Molecular Subty** of Gliomas Using Magnetic Resonance Imaging

    Authors: Dong Wei, Yiming Li, Yinyan Wang, Tianyi Qian, Yefeng Zheng

    Abstract: Knowledge of molecular subtypes of gliomas can provide valuable information for tailored therapies. This study aimed to investigate the use of deep convolutional neural networks (DCNNs) for noninvasive glioma subty** with radiological imaging data according to the new taxonomy announced by the World Health Organization in 2016. Methods: A DCNN model was developed for the prediction of the five g… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Proc. SPIE 11314, Medical Imaging 2020: Computer-Aided Diagnosis

  10. arXiv:1906.07361  [pdf, other

    eess.SP

    A Novel Feature Representation for Single-Channel Heartbeat Classification based on Adaptive Fourier Decomposition

    Authors: Chunyu Tan, Liming Zhang, Hau-tieng Wu, Tao Qian

    Abstract: This paper proposes a novel approach for heartbeat classification from single-lead electrocardiogram (ECG) signals based on the novel adaptive Fourier decomposition (AFD). AFD is a recently developed signal processing tool that provides useful morphological features, referred to as AFD-derived instantaneous frequency (IF) features, that are different from those provided by traditional tools. A sup… ▽ More

    Submitted 17 June, 2019; originally announced June 2019.