Skip to main content

Showing 1–50 of 61 results for author: Qu, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19043  [pdf

    eess.IV cs.AI cs.CV cs.DB

    CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

    Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Ya**g Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, **g Qin, Xiahai Zhuang, Claudia Prieto , et al. (7 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover h… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 2 tables

  2. arXiv:2405.10570  [pdf

    eess.IV cs.AI

    Simultaneous Deep Learning of Myocardium Segmentation and T2 Quantification for Acute Myocardial Infarction MRI

    Authors: Yirong Zhou, Chengyan Wang, Mengtian Lu, Kunyuan Guo, Zi Wang, Dan Ruan, Rui Guo, Peijun Zhao, Jianhua Wang, Naiming Wu, Jianzhong Lin, Yinyin Chen, Hang **, Lianxin Xie, Lilan Wu, Liuhong Zhu, Jianjun Zhou, Congbo Cai, He Wang, Xiaobo Qu

    Abstract: In cardiac Magnetic Resonance Imaging (MRI) analysis, simultaneous myocardial segmentation and T2 quantification are crucial for assessing myocardial pathologies. Existing methods often address these tasks separately, limiting their synergistic potential. To address this, we propose SQNet, a dual-task network integrating Transformer and Convolutional Neural Network (CNN) components. SQNet features… ▽ More

    Submitted 29 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: 10 pages, 8 figures, 6 tables

  3. arXiv:2404.13892  [pdf, other

    cs.SD cs.AI eess.AS

    Retrieval-Augmented Audio Deepfake Detection

    Authors: Zuheng Kang, Yayun He, Botao Zhao, Xiaoyang Qu, Junqing Peng, **g Xiao, Jianzong Wang

    Abstract: With recent advances in speech synthesis including text-to-speech (TTS) and voice conversion (VC) systems enabling the generation of ultra-realistic audio deepfakes, there is growing concern about their potential misuse. However, most deepfake (DF) detection methods rely solely on the fuzzy knowledge learned by a single model, resulting in performance bottlenecks and transparency issues. Inspired… ▽ More

    Submitted 23 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted by the 2024 International Conference on Multimedia Retrieval (ICMR 2024)

  4. arXiv:2404.06393  [pdf, other

    cs.SD cs.AI eess.AS

    MuPT: A Generative Symbolic Music Pretrained Transformer

    Authors: Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xinrun Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan , et al. (4 additional authors not shown)

    Abstract: In this paper, we explore the application of Large Language Models (LLMs) to the pre-training of music. While the prevalent use of MIDI in music modeling is well-established, our findings suggest that LLMs are inherently more compatible with ABC Notation, which aligns more closely with their design and strengths, thereby enhancing the model's performance in musical composition. To address the chal… ▽ More

    Submitted 10 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  5. arXiv:2404.01082  [pdf, other

    eess.IV

    The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023

    Authors: Jun Lyu, Chen Qin, Shuo Wang, Fanwen Wang, Yan Li, Zi Wang, Kunyuan Guo, Cheng Ouyang, Michael Tänzer, Meng Liu, Longyu Sun, Mengting Sun, Qin Li, Zhang Shi, Sha Hua, Hao Li, Zhensen Chen, Zhenlin Zhang, Bingyu Xin, Dimitris N. Metaxas, George Yiasemis, Jonas Teuwen, Li** Zhang, Weitian Chen, Yidong Zhao , et al. (25 additional authors not shown)

    Abstract: Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation p… ▽ More

    Submitted 16 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 25 pages, 17 figures

  6. arXiv:2402.15939  [pdf

    eess.IV cs.LG

    Deep Separable Spatiotemporal Learning for Fast Dynamic Cardiac MRI

    Authors: Zi Wang, Min Xiao, Yirong Zhou, Chengyan Wang, Naiming Wu, Yi Li, Yiwen Gong, Shufu Chang, Yinyin Chen, Liuhong Zhu, Jianjun Zhou, Congbo Cai, He Wang, Di Guo, Guang Yang, Xiaobo Qu

    Abstract: Dynamic magnetic resonance imaging (MRI) plays an indispensable role in cardiac diagnosis. To enable fast imaging, the k-space data can be undersampled but the image reconstruction poses a great challenge of high-dimensional processing. This challenge leads to necessitate extensive training data in many deep learning reconstruction methods. This work proposes a novel and efficient approach, levera… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 10 pages, 11 figures, 3 tables

  7. arXiv:2401.11449  [pdf, other

    eess.SP cs.NI

    Energy Consumption Analysis for Continuous Phase Modulation in Smart-Grid Internet of Things of beyond 5G

    Authors: Hongjian Gao, Yang Lu, Shaoshi Yang, **gsheng Tan, Longlong Nie, Xinyi Qu

    Abstract: Wireless sensor network (WSN) underpinning the smart-grid Internet of Things (SG-IoT) has been a popular research topic in recent years due to its great potential for enabling a wide range of important applications. However, the energy consumption (EC) characteristic of sensor nodes is a key factor that affects the operational performance (e.g., lifetime of sensors) and the total cost of ownership… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 7 figures, 2 tables

    Journal ref: Sensors, vol. 24, no. 2, pp. 1-14, article number 533, Jan. 2024

  8. arXiv:2310.13882  [pdf

    eess.SP

    NMR Spectra Denoising with Vandermonde Constraints

    Authors: Di Guo, Runmin Xu, **yu Wu, Mei** Lin, Xiaofeng Du, Xiaobo Qu

    Abstract: Nuclear magnetic resonance (NMR) spectroscopy serves as an important tool to analyze chemicals and proteins in bioengineering. However, NMR signals are easily contaminated by noise during the data acquisition, which can affect subsequent quantitative analysis. Therefore, denoising NMR signals has been a long-time concern. In this work, we propose an optimization model-based iterative denoising met… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 10 pages, 9 figures

  9. arXiv:2310.11641  [pdf

    eess.IV cs.AI physics.med-ph

    Cloud-Magnetic Resonance Imaging System: In the Era of 6G and Artificial Intelligence

    Authors: Yirong Zhou, Yanhuang Wu, Yuhan Su, **g Li, Jianyun Cai, Yongfu You, Di Guo, Xiaobo Qu

    Abstract: Magnetic Resonance Imaging (MRI) plays an important role in medical diagnosis, generating petabytes of image data annually in large hospitals. This voluminous data stream requires a significant amount of network bandwidth and extensive storage infrastructure. Additionally, local data processing demands substantial manpower and hardware investments. Data isolation across different healthcare instit… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 4pages, 5figures, letters

  10. arXiv:2310.04992  [pdf, other

    eess.IV cs.CV

    VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

    Authors: Jianing Qiu, Jian Wu, Hao Wei, Peilun Shi, Minqing Zhang, Yunyun Sun, Lin Li, Hanruo Liu, Hongyi Liu, Simeng Hou, Yuyang Zhao, Xuehui Shi, Junfang Xian, Xiaoxia Qu, Sirui Zhu, Lijie Pan, Xiaoniao Chen, Xiaojia Zhang, Shuai Jiang, Kebing Wang, Chenlong Yang, Mingqiang Chen, Sujie Fan, Jianhua Hu, Aiguo Lv , et al. (17 additional authors not shown)

    Abstract: We present VisionFM, a foundation model pre-trained with 3.4 million ophthalmic images from 560,457 individuals, covering a broad range of ophthalmic diseases, modalities, imaging devices, and demography. After pre-training, VisionFM provides a foundation to foster multiple ophthalmic artificial intelligence (AI) applications, such as disease screening and diagnosis, disease prognosis, subclassifi… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  11. arXiv:2309.11763  [pdf

    eess.IV

    Bloch Equation Enables Physics-informed Neural Network in Parametric Magnetic Resonance Imaging

    Authors: Qingrui Cai, Liuhong Zhu, Jianjun Zhou, Chen Qian, Di Guo, Xiaobo Qu

    Abstract: Magnetic resonance imaging (MRI) is an important non-invasive imaging method in clinical diagnosis. Beyond the common image structures, parametric imaging can provide the intrinsic tissue property thus could be used in quantitative evaluation. The emerging deep learning approach provides fast and accurate parameter estimation but still encounters the lack of network interpretation and enough train… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  12. arXiv:2309.07178  [pdf

    q-bio.QM cs.AI cs.LG eess.SP

    CloudBrain-NMR: An Intelligent Cloud Computing Platform for NMR Spectroscopy Processing, Reconstruction and Analysis

    Authors: Di Guo, Si** Li, Jun Liu, Zhangren Tu, Tianyu Qiu, **g**g Xu, Liubin Feng, Donghai Lin, Qing Hong, Mei** Lin, Yanqin Lin, Xiaobo Qu

    Abstract: Nuclear Magnetic Resonance (NMR) spectroscopy has served as a powerful analytical tool for studying molecular structure and dynamics in chemistry and biology. However, the processing of raw data acquired from NMR spectrometers and subsequent quantitative analysis involves various specialized tools, which necessitates comprehensive knowledge in programming and NMR. Particularly, the emerging deep l… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 11 pages, 13 figures

  13. arXiv:2309.06681  [pdf

    eess.IV cs.AI

    A plug-and-play synthetic data deep learning for undersampled magnetic resonance image reconstruction

    Authors: Min Xiao, Zi Wang, Jiefeng Guo, Xiaobo Qu

    Abstract: Magnetic resonance imaging (MRI) plays an important role in modern medical diagnostic but suffers from prolonged scan time. Current deep learning methods for undersampled MRI reconstruction exhibit good performance in image de-aliasing which can be tailored to the specific k-space undersampling scenario. But it is very troublesome to configure different deep networks when the sampling setting chan… ▽ More

    Submitted 8 October, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 5 pages, 3 figures

  14. arXiv:2307.13220  [pdf

    eess.IV cs.AI physics.med-ph

    One for Multiple: Physics-informed Synthetic Data Boosts Generalizable Deep Learning for Fast MRI Reconstruction

    Authors: Zi Wang, Xiaotong Yu, Chengyan Wang, Weibo Chen, Jiazheng Wang, Ying-Hua Chu, Hongwei Sun, Rushuai Li, Peiyong Li, Fan Yang, Haiwei Han, Taishan Kang, Jianzhong Lin, Chen Yang, Shufu Chang, Zhang Shi, Sha Hua, Yan Li, Juan Hu, Liuhong Zhu, Jianjun Zhou, Mei**g Lin, Jiefeng Guo, Congbo Cai, Zhong Chen , et al. (3 additional authors not shown)

    Abstract: Magnetic resonance imaging (MRI) is a widely used radiological modality renowned for its radiation-free, comprehensive insights into the human body, facilitating medical diagnoses. However, the drawback of prolonged scan times hinders its accessibility. The k-space undersampling offers a solution, yet the resultant artifacts necessitate meticulous removal during image reconstruction. Although Deep… ▽ More

    Submitted 28 February, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 38 pages, 19 figures, 5 tables

  15. arXiv:2306.17367  [pdf, other

    eess.IV cs.CV

    Spatially Varying Exposure with 2-by-2 Multiplexing: Optimality and Universality

    Authors: Xiangyu Qu, Yiheng Chi, Stanley H. Chan

    Abstract: The advancement of new digital image sensors has enabled the design of exposure multiplexing schemes where a single image capture can have multiple exposures and conversion gains in an interlaced format, similar to that of a Bayer color filter array. In this paper, we ask the question of how to design such multiplexing schemes for adaptive high-dynamic range (HDR) imaging where the multiplexing sc… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  16. arXiv:2306.11021  [pdf, other

    eess.SP

    CloudBrain-MRS: An Intelligent Cloud Computing Platform for in vivo Magnetic Resonance Spectroscopy Preprocessing, Quantification, and Analysis

    Authors: Xiaodie Chen, Jiayu Li, Dicheng Chen, Yirong Zhou, Zhangren Tu, Mei** Lin, Taishan Kang, Jianzhong Lin, Tao Gong, Liuhong Zhu, Jianjun Zhou, Lin Ou-yang, Jiefeng Guo, Jiyang Dong, Di Guo, Xiaobo Qu

    Abstract: Magnetic resonance spectroscopy (MRS) is an important clinical imaging method for diagnosis of diseases. MRS spectrum is used to observe the signal intensity of metabolites or further infer their concentrations. Although the magnetic resonance vendors commonly provide basic functions of spectra plots and metabolite quantification, the widespread clinical research of MRS is still limited due to the… ▽ More

    Submitted 6 September, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 11 pages, 12 figures

  17. arXiv:2306.08219  [pdf, other

    cs.IR cs.SD eess.AS

    Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions, and Prospects

    Authors: Xinghua Qu, Hongyang Liu, Zhu Sun, Xiang Yin, Yew Soon Ong, Lu Lu, Zejun Ma

    Abstract: Conversational recommender systems (CRSs) have become crucial emerging research topics in the field of RSs, thanks to their natural advantages of explicitly acquiring user preferences via interactive conversations and revealing the reasons behind recommendations. However, the majority of current CRSs are text-based, which is less user-friendly and may pose challenges for certain users, such as tho… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: Accepted by SIGIR 2023 Resource Track

  18. arXiv:2305.04414  [pdf, ps, other

    eess.SP

    Untrained Neural Network based Bayesian Detector for OTFS Modulation Systems

    Authors: Hao Chang, Alva Kosasih, Wibowo Hardjawana, Xinwei Qu, Branka Vucetic

    Abstract: The orthogonal time frequency space (OTFS) symbol detector design for high mobility communication scenarios has received numerous attention lately. Current state-of-the-art OTFS detectors mainly can be divided into two categories; iterative and training-based deep neural network (DNN) detectors. Many practical iterative detectors rely on minimum-mean-square-error (MMSE) denoiser to get the initial… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

  19. arXiv:2303.07643  [pdf, other

    cs.SD cs.AI eess.AS

    Feature-Rich Audio Model Inversion for Data-Free Knowledge Distillation Towards General Sound Classification

    Authors: Zuheng Kang, Yayun He, Jianzong Wang, Junqing Peng, Xiaoyang Qu, **g Xiao

    Abstract: Data-Free Knowledge Distillation (DFKD) has recently attracted growing attention in the academic community, especially with major breakthroughs in computer vision. Despite promising results, the technique has not been well applied to audio and signal processing. Due to the variable duration of audio signals, it has its own unique way of modeling. In this work, we propose feature-rich audio model i… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023. International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)

  20. arXiv:2301.02488  [pdf

    eess.SP cs.AI cs.CV cs.LG eess.IV

    TWR-MCAE: A Data Augmentation Method for Through-the-Wall Radar Human Motion Recognition

    Authors: Weicheng Gao, Xiaopeng Yang, Xiaodong Qu, Tian Lan

    Abstract: To solve the problems of reduced accuracy and prolonging convergence time of through-the-wall radar (TWR) human motion due to wall attenuation, multipath effect, and system interference, we propose a multilink auto-encoding neural network (TWR-MCAE) data augmentation method. Specifically, the TWR-MCAE algorithm is jointly constructed by a singular value decomposition (SVD)-based data preprocessing… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

    Comments: Publisher: IEEE Transactions on Geoscience and Remote Sensing (Volume: 60). Total Pages: 17. Total Figures: 17

    Journal ref: in IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-17, 2022, Art no. 5118617

  21. arXiv:2212.01878  [pdf

    eess.IV

    CloudBrain-ReconAI: An Online Platform for MRI Reconstruction and Image Quality Evaluation

    Authors: Yirong Zhou, Chen Qian, Jiayu Li, Zi Wang, Yu Hu, Biao Qu, Liuhong Zhu, Jianjun Zhou, Taishan Kang, Jianzhong Lin, Qing Hong, Jiyang Dong, Di Guo, Xiaobo Qu

    Abstract: Efficient collaboration between engineers and radiologists is important for image reconstruction algorithm development and image quality evaluation in magnetic resonance imaging (MRI). Here, we develop CloudBrain-ReconAI, an online cloud computing platform, for algorithm deployment, fast and blind reader study. This platform supports online image reconstruction using state-of-the-art artificial in… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

    Comments: 8 pages, 11 figures

  22. arXiv:2212.01144  [pdf

    q-bio.BM eess.SP

    Resolution enhancement of NMR by decoupling with low-rank Hankel model

    Authors: Tianyu Qiu, Amir Jahangiri, Xiao Han, Dmitry Lesovoy, Tatiana Agback, Peter Agback, Adnane Achour, Xiaobo Qu, Vladislav Orekhov

    Abstract: Nuclear magnetic resonance (NMR) spectroscopy has become a formidable tool for biochemistry and medicine. Although J-coupling carries essential structural information it may also limit the spectral resolution. Homonuclear decoupling remains a challenging problem. In this work, we introduce a new approach that uses a specific coupling value as prior knowledge, and Hankel property of exponential NMR… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: 8 pages, 4 figures

  23. arXiv:2211.14312  [pdf

    q-bio.QM cs.CV cs.LG eess.IV

    Karyotype AI for Precision Oncology

    Authors: Zahra Shamsi, Drew Bryant, Jacob Wilson, Xiaoyu Qu, Avinava Dubey, Konik Kothari, Mostafa Dehghani, Mariya Chavarha, Valerii Likhosherstov, Brian Williams, Michael Frumkin, Fred Appelbaum, Krzysztof Choromanski, Ali Bashir, Min Fang

    Abstract: Chromosome analysis is essential for diagnosing genetic disorders. For hematologic malignancies, identification of somatic clonal aberrations by karyotype analysis remains the standard of care. However, karyoty** is costly and time-consuming because of the largely manual process and the expertise required in identifying and annotating aberrations. Efforts to automate karyotype analysis to date f… ▽ More

    Submitted 19 October, 2023; v1 submitted 19 November, 2022; originally announced November 2022.

  24. arXiv:2211.13479  [pdf

    eess.SP

    Alternating Deep Low-Rank Approach for Exponential Function Reconstruction and Its Biomedical Magnetic Resonance Applications

    Authors: Yihui Huang, Zi Wang, Xinlin Zhang, Jian Cao, Zhangren Tu, Mei** Lin, Di Guo, Xiaobo Qu

    Abstract: Undersampling can accelerate the signal acquisition but at the cost of bringing in artifacts. Removing these artifacts is a fundamental problem in signal processing and this task is also called signal reconstruction. Through modeling signals as the superimposed exponential functions, deep learning has achieved fast and high-fidelity signal reconstruction by training a map** from the undersampled… ▽ More

    Submitted 8 August, 2023; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: 13 pages

  25. arXiv:2210.12723  [pdf

    eess.IV cs.AI cs.LG

    A Faithful Deep Sensitivity Estimation for Accelerated Magnetic Resonance Imaging

    Authors: Zi Wang, Haoming Fang, Chen Qian, Boxuan Shi, Lijun Bao, Liuhong Zhu, Jianjun Zhou, Wen** Wei, Jianzhong Lin, Di Guo, Xiaobo Qu

    Abstract: Magnetic resonance imaging (MRI) is an essential diagnostic tool that suffers from prolonged scan time. To alleviate this limitation, advanced fast MRI technology attracts extensive research interests. Recent deep learning has shown its great potential in improving image quality and reconstruction speed. Faithful coil sensitivity estimation is vital for MRI reconstruction. However, most deep learn… ▽ More

    Submitted 24 December, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: 12 pages, 13 figures, 7 tables

  26. arXiv:2210.11388  [pdf

    eess.IV cs.CV

    Physics-informed Deep Diffusion MRI Reconstruction with Synthetic Data: Break Training Data Bottleneck in Artificial Intelligence

    Authors: Chen Qian, Yuncheng Gao, Mingyang Han, Zi Wang, Dan Ruan, Yu Shen, Ya** Wu, Yirong Zhou, Chengyan Wang, Boyu Jiang, Ran Tao, Zhigang Wu, Jiazheng Wang, Liuhong Zhu, Yi Guo, Taishan Kang, Jianzhong Lin, Tao Gong, Chen Yang, Guoqiang Fei, Mei** Lin, Di Guo, Jianjun Zhou, Meiyun Wang, Xiaobo Qu

    Abstract: Diffusion magnetic resonance imaging (MRI) is the only imaging modality for non-invasive movement detection of in vivo water molecules, with significant clinical and research applications. Diffusion MRI (DWI) acquired by multi-shot techniques can achieve higher resolution, better signal-to-noise ratio, and lower geometric distortion than single-shot, but suffers from inter-shot motion-induced arti… ▽ More

    Submitted 5 February, 2024; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: 23 pages, 16 figures

  27. arXiv:2210.08182  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation

    Authors: Chendong Zhao, Jianzong Wang, Xiaoyang Qu, Haoqian Wang, **g Xiao

    Abstract: Unsupervised representation learning for speech audios attained impressive performances for speech recognition tasks, particularly when annotated speech is limited. However, the unsupervised paradigm needs to be carefully designed and little is known about what properties these representations acquire. There is no guarantee that the model learns meaningful representations for valuable information… ▽ More

    Submitted 29 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted to 2022 IEEE Spoken Language Technology Workshop (SLT 2022)

  28. arXiv:2209.10088  [pdf, other

    eess.AS cs.AI cs.LG cs.SD

    Boosting Star-GANs for Voice Conversion with Contrastive Discriminator

    Authors: Shi**g Si, Jianzong Wang, Xulong Zhang, Xiaoyang Qu, Ning Cheng, **g Xiao

    Abstract: Nonparallel multi-domain voice conversion methods such as the StarGAN-VCs have been widely applied in many scenarios. However, the training of these models usually poses a challenge due to their complicated adversarial network architectures. To address this, in this work we leverage the state-of-the-art contrastive learning techniques and incorporate an efficient Siamese network structure into the… ▽ More

    Submitted 27 September, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: 12 pages, 3 figures, Accepted by ICONIP 2022

  29. arXiv:2207.12662  [pdf, other

    cs.LG cs.HC eess.SP

    Time Majority Voting, a PC-based EEG Classifier for Non-expert Users

    Authors: Guangyao Dou, Zheng Zhou, Xiaodong Qu

    Abstract: Using Machine Learning and Deep Learning to predict cognitive tasks from electroencephalography (EEG) signals is a rapidly advancing field in Brain-Computer Interfaces (BCI). In contrast to the fields of computer vision and natural language processing, the data amount of these trials is still rather tiny. Develo** a PC-based machine learning technique to increase the participation of non-expert… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

  30. arXiv:2206.13235  [pdf, other

    eess.SP

    Bayesian Neural Network Detector for an Orthogonal Time Frequency Space Modulation

    Authors: Alva Kosasih, Xinwei Qu, Wibowo Hardjawana, Chentao Yue, Branka Vucetic

    Abstract: The orthogonal time-frequency space (OTFS) modulation is proposed for beyond 5G wireless systems to deal with high mobility communications. The existing low complexity OTFS detectors exhibit poor performance in rich scattering environments where there are a large number of moving reflectors that reflect the transmitted signal towards the receiver. In this paper, we propose an OTFS detector, referr… ▽ More

    Submitted 21 September, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted for a publication in IEEE Wireless Communication Letter

  31. arXiv:2205.13249  [pdf, other

    cs.SD cs.LG eess.AS

    DT-SV: A Transformer-based Time-domain Approach for Speaker Verification

    Authors: Nan Zhang, Jianzong Wang, Zhenhou Hong, Chendong Zhao, Xiaoyang Qu, **g Xiao

    Abstract: Speaker verification (SV) aims to determine whether the speaker's identity of a test utterance is the same as the reference speech. In the past few years, extracting speaker embeddings using deep neural networks for SV systems has gone mainstream. Recently, different attention mechanisms and Transformer networks have been explored widely in SV fields. However, utilizing the original Transformer in… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: Accepted by IJCNN2022 (The 2022 International Joint Conference on Neural Networks)

  32. arXiv:2205.11738  [pdf, other

    cs.SD cs.AI eess.AS

    Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection

    Authors: Chendong Zhao, Jianzong Wang, Leilai Li, Xiaoyang Qu, **g Xiao

    Abstract: Sound event detection is to infer the event by understanding the surrounding environmental sounds. Due to the scarcity of rare sound events, it becomes challenging for the well-trained detectors which have learned too much prior knowledge. Meanwhile, few-shot learning methods promise a good generalization ability when facing a new limited-data task. Recent approaches have achieved promising result… ▽ More

    Submitted 26 May, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted to IJCNN 2022

  33. arXiv:2203.14559  [pdf

    eess.SP

    A Paired Phase and Magnitude Reconstruction for Advanced Diffusion-Weighted Imaging

    Authors: Chen Qian, Zi Wang, Xinlin Zhang, Boxuan Shi, Boyu Jiang, Ran Tao, **g Li, Yuwei Ge, Taishan Kang, Jianzhong Lin, Di Guo, Xiaobo Qu

    Abstract: Objective: Multi-shot interleaved echo planer imaging can obtain diffusion-weighted images (DWI) with high spatial resolution and low distortion, but suffers from ghost artifacts introduced by phase variations between shots. In this work, we aim at solving the challenging reconstructions under inter-shot motions between shots and a low signal-to-noise ratio. Methods: An explicit phase model with p… ▽ More

    Submitted 8 December, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: 12 pages, 14 figures

  34. arXiv:2203.11178  [pdf

    cs.LG eess.SP physics.med-ph

    Physics-driven Synthetic Data Learning for Biomedical Magnetic Resonance

    Authors: Qinqin Yang, Zi Wang, Kunyuan Guo, Congbo Cai, Xiaobo Qu

    Abstract: Deep learning has innovated the field of computational imaging. One of its bottlenecks is unavailable or insufficient training data. This article reviews an emerging paradigm, imaging physics-based data synthesis (IPADS), that can provide huge training data in biomedical magnetic resonance without or with few real data. Following the physical law of magnetic resonance, IPADS generates signals from… ▽ More

    Submitted 21 May, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

  35. arXiv:2203.04583  [pdf, other

    eess.AS cs.SD

    Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks

    Authors: Yizhou Lu, Mingkun Huang, Xinghua Qu, Pengfei Wei, Zejun Ma

    Abstract: Unsupervised cross-lingual speech representation learning (XLSR) has recently shown promising results in speech recognition by leveraging vast amounts of unlabeled data across multiple languages. However, standard XLSR model suffers from language interference problem due to the lack of language specific modeling ability. In this work, we investigate language adaptive training on XLSR models. More… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: To appear in ICASSP 2022

  36. arXiv:2202.11194  [pdf, other

    eess.AS cs.LG cs.SD

    r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation

    Authors: Chendong Zhao, Jianzong Wang, Xiaoyang Qu, Haoqian Wang, **g Xiao

    Abstract: Grapheme-to-phoneme (G2P) conversion is the process of converting the written form of words to their pronunciations. It has an important role for text-to-speech (TTS) synthesis and automatic speech recognition (ASR) systems. In this paper, we aim to evaluate and enhance the robustness of G2P models. We show that neural G2P models are extremely sensitive to orthographical variations in graphemes li… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: 5 pages, 5 figures, accepted to ICASSP 2022

  37. arXiv:2202.09454  [pdf

    eess.SY

    Flow-level Coordination of Connected and Autonomous Vehicles in Multilane Freeway Ramp Merging Areas

    Authors: Jie Zhu, Ivana Tasic, Xiaobo Qu

    Abstract: On-ramp merging areas are deemed to be typical bottlenecks for freeway networks due to the intensive disturbances induced by the frequent merging, weaving, and lane-changing behaviors. The Connected and Autonomous Vehicles (CAVs), benefited from their capabilities of real-time communication and precise motion control, hold an opportunity to promote ramp merging operation through enhanced cooperati… ▽ More

    Submitted 18 February, 2022; originally announced February 2022.

  38. arXiv:2112.04721  [pdf

    eess.IV cs.AI cs.CV physics.med-ph

    One-dimensional Deep Low-rank and Sparse Network for Accelerated MRI

    Authors: Zi Wang, Chen Qian, Di Guo, Hongwei Sun, Rushuai Li, Bo Zhao, Xiaobo Qu

    Abstract: Deep learning has shown astonishing performance in accelerated magnetic resonance imaging (MRI). Most state-of-the-art deep learning reconstructions adopt the powerful convolutional neural network and perform 2D convolution since many magnetic resonance images or their corresponding k-space are in 2D. In this work, we present a new approach that explores the 1D convolution, making the deep network… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: 16 pages

  39. arXiv:2110.14345  [pdf, other

    eess.SP

    Bayesian-based Symbol Detector for Orthogonal Time Frequency Space Modulation Systems

    Authors: Xinwei Qu, Alva Kosasih, Wibowo Hardjawana, Vincent Onasis, Branka Vucetic

    Abstract: Recently, the orthogonal time frequency space (OTFS) modulation is proposed for 6G wireless system to deal with high Doppler spread. The high Doppler spread happens when the transmitted signal is reflected towards the receiver by fast moving objects (e.g. high speed cars), which causes inter-carrier interference (ICI). Recent state-of-the-art OTFS detectors fail to achieve an acceptable bit-error-… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

  40. arXiv:2108.01875  [pdf

    eess.SY

    Improving Freeway Merging Efficiency via Flow-Level Coordination of Connected and Autonomous Vehicles

    Authors: Jie Zhu, Ivana Tasic, Xiaobo Qu

    Abstract: Freeway on-ramps are typical bottlenecks in the freeway network due to the frequent disturbances caused by their associated merging, weaving, and lane-changing behaviors. With real-time communication and precise motion control, Connected and Autonomous Vehicles (CAVs) provide an opportunity to substantially enhance the traffic operational performance of on-ramp bottlenecks. In this paper, we propo… ▽ More

    Submitted 4 August, 2021; originally announced August 2021.

  41. arXiv:2107.11650  [pdf, other

    eess.IV eess.SP

    Accelerated MRI Reconstruction with Separable and Enhanced Low-Rank Hankel Regularization

    Authors: Xinlin Zhang, Hengfa Lu, Di Guo, Zongying Lai, Huihui Ye, Xi Peng, Bo Zhao, Xiaobo Qu

    Abstract: The combination of the sparse sampling and the low-rank structured matrix reconstruction has shown promising performance, enabling a significant reduction of the magnetic resonance imaging data acquisition time. However, the low-rank structured approaches demand considerable memory consumption and are time-consuming due to a noticeable number of matrix operations performed on the huge-size block H… ▽ More

    Submitted 24 July, 2021; originally announced July 2021.

    Comments: 17 pages, 17 figures

  42. arXiv:2107.04806  [pdf, other

    cs.SD cs.CV eess.AS eess.IV

    Speech2Video: Cross-Modal Distillation for Speech to Video Generation

    Authors: Shi**g Si, Jianzong Wang, Xiaoyang Qu, Ning Cheng, Wenqi Wei, Xinghua Zhu, **g Xiao

    Abstract: This paper investigates a novel task of talking face video generation solely from speeches. The speech-to-video generation technique can spark interesting applications in entertainment, customer service, and human-computer-interaction industries. Indeed, the timbre, accent and speed in speeches could contain rich information relevant to speakers' appearance. The challenge mainly lies in disentangl… ▽ More

    Submitted 10 July, 2021; originally announced July 2021.

    Comments: Accepted by InterSpeech2021

  43. arXiv:2107.04803  [pdf, other

    cs.SD eess.AS

    Variational Information Bottleneck for Effective Low-resource Audio Classification

    Authors: Shi**g Si, Jianzong Wang, Huiming Sun, Jianhan Wu, Chuanyao Zhang, Xiaoyang Qu, Ning Cheng, Lei Chen, **g Xiao

    Abstract: Large-scale deep neural networks (DNNs) such as convolutional neural networks (CNNs) have achieved impressive performance in audio classification for their powerful capacity and strong generalization ability. However, when training a DNN model on low-resource tasks, it is usually prone to overfitting the small data and learning too much redundant information. To address this issue, we propose to u… ▽ More

    Submitted 10 July, 2021; originally announced July 2021.

    Comments: Accepted by InterSpeech 2021

  44. arXiv:2104.10781  [pdf, other

    eess.IV cs.CV

    NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results

    Authors: Ren Yang, Radu Timofte, **g Liu, Yi Xu, Xinjian Zhang, Minyi Zhao, Shuigeng Zhou, Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Xin Li, Fanglong Liu, He Zheng, Lielin Jiang, Qi Zhang, Dongliang He, Fu Li, Qingqing Dang, Yibin Huang, Matteo Maggioni, Zhongqian Fu, Shuai Xiao, Cheng li, Thomas Tanay , et al. (47 additional authors not shown)

    Abstract: This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results. In this challenge, the new Large-scale Diverse Video (LDV) dataset is employed. The challenge has three tracks. Tracks 1 and 2 aim at enhancing the videos compressed by HEVC at a fixed QP, while Track 3 is designed for enhancing the videos compressed by x265 at… ▽ More

    Submitted 31 August, 2022; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Corrected the MOS values in Table 2, and corrected some minor typos

  45. arXiv:2104.08824  [pdf

    eess.IV

    XCloud-pFISTA: A Medical Intelligence Cloud for Accelerated MRI

    Authors: Yirong Zhou, Chen Qian, Yi Guo, Zi Wang, Jian Wang, Biao Qu, Di Guo, Yongfu You, Xiaobo Qu

    Abstract: Machine learning and artificial intelligence have shown remarkable performance in accelerated magnetic resonance imaging (MRI). Cloud computing technologies have great advantages in building an easily accessible platform to deploy advanced algorithms. In this work, we develop an open-access, easy-to-use and high-performance medical intelligence cloud computing platform (XCloud-pFISTA) to reconstru… ▽ More

    Submitted 10 June, 2021; v1 submitted 18 April, 2021; originally announced April 2021.

  46. Change Detection in Synthetic Aperture Radar Images Using a Dual-Domain Network

    Authors: Xiaofan Qu, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

    Abstract: Change detection from synthetic aperture radar (SAR) imagery is a critical yet challenging task. Existing methods mainly focus on feature extraction in spatial domain, and little attention has been paid to frequency domain. Furthermore, in patch-wise feature analysis, some noisy features in the marginal region may be introduced. To tackle the above two challenges, we propose a Dual-Domain Network.… ▽ More

    Submitted 14 April, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: Accepted by IEEE Geoscience and Remote Sensing Letters, Code: https://github.com/summitgao/SAR_CD_DDNet

  47. arXiv:2101.11442  [pdf

    physics.med-ph cs.LG eess.IV

    Magnetic Resonance Spectroscopy Deep Learning Denoising Using Few In Vivo Data

    Authors: Dicheng Chen, Wanqi Hu, Huiting Liu, Yirong Zhou, Tianyu Qiu, Yihui Huang, Zi Wang, Jiazheng Wang, Liangjie Lin, Zhigang Wu, Hao Chen, Xi Chen, Gen Yan, Di Guo, Jianzhong Lin, Xiaobo Qu

    Abstract: Magnetic Resonance Spectroscopy (MRS) is a noninvasive tool to reveal metabolic information. One challenge of 1H-MRS is the low Signal-Noise Ratio (SNR). To improve the SNR, a typical approach is to perform Signal Averaging (SA) with M repeated samples. The data acquisition time, however, is increased by M times accordingly, and a complete clinical MRS scan takes approximately 10 minutes at a comm… ▽ More

    Submitted 25 October, 2022; v1 submitted 26 January, 2021; originally announced January 2021.

  48. arXiv:2012.14830  [pdf

    cs.LG eess.IV physics.bio-ph physics.med-ph

    A Sparse Model-inspired Deep Thresholding Network for Exponential Signal Reconstruction -- Application in Fast Biological Spectroscopy

    Authors: Zi Wang, Di Guo, Zhangren Tu, Yihui Huang, Yirong Zhou, Jian Wang, Liubin Feng, Donghai Lin, Yongfu You, Tatiana Agback, Vladislav Orekhov, Xiaobo Qu

    Abstract: The non-uniform sampling is a powerful approach to enable fast acquisition but requires sophisticated reconstruction algorithms. Faithful reconstruction from partial sampled exponentials is highly expected in general signal processing and many applications. Deep learning has shown astonishing potential in this field but many existing problems, such as lack of robustness and explainability, greatly… ▽ More

    Submitted 17 January, 2022; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: 30 pages

  49. arXiv:2009.10543  [pdf

    eess.SY

    An unnoticed side effect of electric vehicles

    Authors: Tao Wang, Ying Yang, Tieqiao Tang, Xiaobo Qu

    Abstract: We illustrate that the electrification of our transport system might impose unnecessary extra congestion and delay for daily commuting passengers. By modelling travel behaviors of these passengers, it is found that more of them tend to depart at a narrower peak-hour time window. The occurrence of this shift is mainly caused by (1) the energy consumption of electric vehicles (EVs) is much lower tha… ▽ More

    Submitted 22 September, 2020; originally announced September 2020.

    Comments: 11 pages, 3 figures

  50. Optimal Eco-driving Control of Autonomous and Electric Trucks in Adaptation to Highway Topography: Energy Minimization and Battery Life Extension

    Authors: Yongzhi Zhang, Xiaobo Qu, Lang Tong

    Abstract: In this paper, we develop a model to plan energy-efficient speed trajectories of electric trucks in real-time by taking into account the information of topography and traffic ahead of the vehicle. In this real time control model, a novel state-space model is first developed to capture vehicle speed, acceleration, and state of charge. We then formulate an energy minimization problem and solve it by… ▽ More

    Submitted 22 December, 2021; v1 submitted 10 September, 2020; originally announced September 2020.

    Journal ref: IEEE Transactions on Transportation Electrification, 2022