Skip to main content

Showing 1–17 of 17 results for author: Xie, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.08196  [pdf, other

    cs.SD eess.AS

    FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

    Authors: Yuanjun Lv, Hai Li, Ying Yan, Junhui Liu, Danming Xie, Lei Xie

    Abstract: Vocoders reconstruct speech waveforms from acoustic features and play a pivotal role in modern TTS systems. Frequent-domain GAN vocoders like Vocos and APNet2 have recently seen rapid advancements, outperforming time-domain models in inference speed while achieving comparable audio quality. However, these frequency-domain vocoders suffer from large parameter sizes, thus introducing extra memory bu… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by InterSpeech 2024; 5 pages, 5 figures

  2. arXiv:2310.03963  [pdf, other

    cs.SD eess.AS

    Zero-Shot Emotion Transfer For Cross-Lingual Speech Synthesis

    Authors: Yuke Li, Xinfa Zhu, Yi Lei, Hai Li, Junhui Liu, Danming Xie, Lei Xie

    Abstract: Zero-shot emotion transfer in cross-lingual speech synthesis aims to transfer emotion from an arbitrary speech reference in the source language to the synthetic speech in the target language. Building such a system faces challenges of unnatural foreign accents and difficulty in modeling the shared emotional expressions of different languages. Building on the DelightfulTTS neural architecture, this… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: Accepted by ASRU2023

  3. arXiv:2307.05324  [pdf, other

    cs.SD eess.AS

    ShredGP: Guitarist Style-Conditioned Tablature Generation

    Authors: Pedro Sarmento, Adarsh Kumar, Dekun Xie, CJ Carr, Zack Zukowski, Mathieu Barthet

    Abstract: GuitarPro format tablatures are a type of digital music notation that encapsulates information about guitar playing techniques and fingerings. We introduce ShredGP, a GuitarPro tablature generative Transformer-based model conditioned to imitate the style of four distinct iconic electric guitarists. In order to assess the idiosyncrasies of each guitar player, we adopt a computational musicology met… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted for publication at CMMR 2023

  4. arXiv:2211.03036  [pdf, other

    eess.AS cs.SD

    Preserving background sound in noise-robust voice conversion via multi-task learning

    Authors: Jixun Yao, Yi Lei, Qing Wang, Pengcheng Guo, Ziqian Ning, Lei Xie, Hai Li, Junhui Liu, Danming Xie

    Abstract: Background sound is an informative form of art that is helpful in providing a more immersive experience in real-application voice conversion (VC) scenarios. However, prior research about VC, mainly focusing on clean voices, pay rare attention to VC with background sound. The critical problem for preserving background sound in VC is inevitable speech distortion by the neural separation model and th… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

    Comments: Submitted to ICASSP 2023

  5. arXiv:2107.09244  [pdf, other

    cs.CV cs.AI eess.IV

    S2Looking: A Satellite Side-Looking Dataset for Building Change Detection

    Authors: Li Shen, Yao Lu, Hao Chen, Hao Wei, Donghai Xie, Jiabao Yue, Rui Chen, Shouye Lv, Bitao Jiang

    Abstract: Building-change detection underpins many important applications, especially in the military and crisis-management domains. Recent methods used for change detection have shifted towards deep learning, which depends on the quality of its training data. The assembly of large-scale annotated satellite imagery datasets is therefore essential for global building-change surveillance. Existing datasets al… ▽ More

    Submitted 11 January, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Journal ref: Remote Sens. 2021, 13, 5094

  6. arXiv:2103.12954  [pdf, ps, other

    math.OC cs.LG eess.SY

    Convergence Analysis of Nonconvex Distributed Stochastic Zeroth-order Coordinate Method

    Authors: Shengjun Zhang, Yunlong Dong, Dong Xie, Lisha Yao, Colleen P. Bailey, Shengli Fu

    Abstract: This paper investigates the stochastic distributed nonconvex optimization problem of minimizing a global cost function formed by the summation of $n$ local cost functions. We solve such a problem by involving zeroth-order (ZO) information exchange. In this paper, we propose a ZO distributed primal-dual coordinate method (ZODIAC) to solve the stochastic optimization problem. Agents approximate thei… ▽ More

    Submitted 13 October, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  7. Hierarchical Classification of Pulmonary Lesions: A Large-Scale Radio-Pathomics Study

    Authors: Jiancheng Yang, Mingze Gao, Kaiming Kuang, Bingbing Ni, Yunlang She, Dong Xie, Chang Chen

    Abstract: Diagnosis of pulmonary lesions from computed tomography (CT) is important but challenging for clinical decision making in lung cancer related diseases. Deep learning has achieved great success in computer aided diagnosis (CADx) area for lung cancer, whereas it suffers from label ambiguity due to the difficulty in the radiological diagnosis. Considering that invasive pathological analysis serves as… ▽ More

    Submitted 8 October, 2020; originally announced October 2020.

    Comments: MICCAI 2020 (Early Accepted)

  8. arXiv:2008.11882  [pdf, other

    cs.CV eess.IV

    Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation

    Authors: Xuewen Yang, Dongliang Xie, Xin Wang

    Abstract: State-of-the-art techniques in Generative Adversarial Networks (GANs) have shown remarkable success in image-to-image translation from peer domain X to domain Y using paired image data. However, obtaining abundant paired data is a non-trivial and expensive process in the majority of applications. When there is a need to translate images across n domains, if the training is performed between every… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

    Comments: accepted in proceedings of ACM Multimedia 2018

  9. A Novel Posture Positioning Method for Multi-Joint Manipulators

    Authors: Zhi-Qiang Yao, Yi-Jue Dai, Qing-Na Li, Dang Xie, Ze-Hui Liu

    Abstract: Safety and automatic control are extremely important when operating manipulators. For large engineering manipulators, the main challenge is to accurately recognize the posture of all arm segments. In classical sensing methods, the accuracy of an inclinometer is easily affected by the elastic deformation in the manipulator's arms. This results in big error accumulations when sensing the angle of jo… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 7 pages, 8 figures

    Journal ref: IEEE Sensors Journal, 2020

  10. arXiv:2005.07784  [pdf, other

    eess.IV cs.CV cs.LG

    A Learning-from-noise Dilated Wide Activation Network for denoising Arterial Spin Labeling (ASL) Perfusion Images

    Authors: Danfeng Xie, Yiran Li, Hanlu Yang, Li Bai, Lei Zhang, Ze Wang

    Abstract: Arterial spin labeling (ASL) perfusion MRI provides a non-invasive way to quantify cerebral blood flow (CBF) but it still suffers from a low signal-to-noise-ratio (SNR). Using deep machine learning (DL), several groups have shown encouraging denoising results. Interestingly, the improvement was obtained when the deep neural network was trained using noise-contaminated surrogate reference because o… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

  11. arXiv:2004.14774  [pdf, other

    cs.CV cs.LG cs.RO eess.IV stat.ML

    IROS 2019 Lifelong Robotic Vision Challenge -- Lifelong Object Recognition Report

    Authors: Qi She, Fan Feng, Qi Liu, Rosa H. M. Chan, Xinyue Hao, Chuanlin Lan, Qihan Yang, Vincenzo Lomonaco, German I. Parisi, Heechul Bae, Eoin Brophy, Baoquan Chen, Gabriele Graffieti, Vidit Goel, Hyonyoung Han, Sathursan Kanagarajah, Somesh Kumar, Siew-Kei Lam, Tin Lun Lam, Liang Ma, Davide Maltoni, Lorenzo Pellegrini, Duvindu Piyasena, Shiliang Pu, Debdoot Sheet , et al. (11 additional authors not shown)

    Abstract: This report summarizes IROS 2019-Lifelong Robotic Vision Competition (Lifelong Object Recognition Challenge) with methods and results from the top $8$ finalists (out of over~$150$ teams). The competition dataset (L)ifel(O)ng (R)obotic V(IS)ion (OpenLORIS) - Object Recognition (OpenLORIS-object) is designed for driving lifelong/continual learning research and application in robotic vision domain, w… ▽ More

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: 9 pages, 11 figures, 3 tables, accepted into IEEE Robotics and Automation Magazine. arXiv admin note: text overlap with arXiv:1911.06487

  12. arXiv:1911.09349  [pdf, other

    eess.AS cs.CV cs.LG cs.MM

    An End-to-End Audio Classification System based on Raw Waveforms and Mix-Training Strategy

    Authors: Jiaxu Chen, **g Hao, Kai Chen, Di Xie, Shicai Yang, Shiliang Pu

    Abstract: Audio classification can distinguish different kinds of sounds, which is helpful for intelligent applications in daily life. However, it remains a challenging task since the sound events in an audio clip is probably multiple, even overlap**. This paper introduces an end-to-end audio classification system based on raw waveforms and mix-training strategy. Compared to human-designed features which… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: InterSpeech 2019

  13. Deep Reinforcement Learning for Smart Home Energy Management

    Authors: Liang Yu, Weiwei Xie, Di Xie, Yulong Zou, Dengyin Zhang, Zhixin Sun, Linghua Zhang, Yue Zhang, Tao Jiang

    Abstract: In this paper, we investigate an energy cost minimization problem for a smart home in the absence of a building thermal dynamics model with the consideration of a comfortable temperature range. Due to the existence of model uncertainty, parameter uncertainty (e.g., renewable generation output, non-shiftable power demand, outdoor temperature, and electricity price) and temporally-coupled operationa… ▽ More

    Submitted 18 December, 2019; v1 submitted 23 September, 2019; originally announced September 2019.

    Comments: 15 pages, 16 figures

    Journal ref: IEEE Internet of Things Journal, 2019

  14. arXiv:1909.04261  [pdf, other

    stat.ML cs.LG eess.SY

    Interpretable Biomanufacturing Process Risk and Sensitivity Analyses for Quality-by-Design and Stability Control

    Authors: Wei Xie, Bo Wang, Cheng Li, Dongming Xie, Jared Auclair

    Abstract: While biomanufacturing plays a significant role in supporting the economy and ensuring public health, it faces critical challenges, including complexity, high variability, lengthy lead time, and very limited process data, especially for personalized new cell and gene biotherapeutics. Driven by these challenges, we propose an interpretable semantic bioprocess probabilistic knowledge graph and devel… ▽ More

    Submitted 2 June, 2021; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: 41 pages, 8 figures

    Journal ref: Naval Research Logistics, 2021

  15. arXiv:1709.00715  [pdf, ps, other

    eess.SY

    Distributed Real-Time HVAC Control for Cost-Efficient Commercial Buildings under Smart Grid Environment

    Authors: Liang Yu, Di Xie, Tao Jiang, Yulong Zou, Kun Wang

    Abstract: In this paper, we investigate the problem of minimizing the long-term total cost (i.e., the sum of energy cost and thermal discomfort cost) associated with a Heating, Ventilation, and Air Conditioning (HVAC) system of a multizone commercial building under smart grid environment. To be specific, we first formulate a stochastic program to minimize the time average expected total cost with the consid… ▽ More

    Submitted 19 October, 2017; v1 submitted 3 September, 2017; originally announced September 2017.

    Comments: 11 pages, 16 figures, accepted to appear in IEEE Internet of Things Journal

  16. Quantum estimation of detection efficiency with no-knowledge quantum feedback

    Authors: Dong Xie, Chunling Xu, Jianyong Chen, Anmin Wang

    Abstract: We investigate that no-knowledge measurement-based feedback control is utilized to obtain the estimation precision of the detection efficiency. For the feedback operators that concern us, no-knowledge measurement is the optimal way to estimate the detection efficiency. We show that the higher precision can be achieved for the lower or larger detection efficiency. It is found that no-knowledge feed… ▽ More

    Submitted 13 August, 2017; originally announced August 2017.

    Comments: 7pages, 3figures

    Journal ref: Chin. Phys. B Vol. 27, No. 6 (2018) 060303

  17. arXiv:1304.3996  [pdf, other

    cs.GT cs.CR cs.CY eess.SY

    Cyber-Physical Security: A Game Theory Model of Humans Interacting over Control Systems

    Authors: Scott Backhaus, Russell Bent, James Bono, Ritchie Lee, Brendan Tracey, David Wolpert, Yildiray Yildiz

    Abstract: Recent years have seen increased interest in the design and deployment of smart grid devices and control algorithms. Each of these smart communicating devices represents a potential access point for an intruder spurring research into intruder prevention and detection. However, no security measures are complete, and intruding attackers will compromise smart grid devices leading to the attacker and… ▽ More

    Submitted 15 April, 2013; originally announced April 2013.

    Comments: 8 pages, 7 figures, IEEE Transactions on Smart Grids pending