Showing 1–50 of 156 results for author: Zhao, X

Searching in archive eess.
  1. arXiv:2406.18555  [pdf]

    eess.IV cs.CV

    Using a Convolutional Neural Network and Explainable AI to Diagnose Dementia Based on MRI Scans

    Authors: Tyler Morris, Ziming Liu, Longjian Liu, Xiaopeng Zhao

    Abstract: As the number of dementia patients rises, the need for accurate diagnostic procedures rises as well. Current methods, like using an MRI scan, rely on human input, which can be inaccurate. However, the decision logic behind machine learning algorithms and their outputs cannot be explained, as most operate in black-box models. Therefore, to increase the accuracy of diagnosing dementia through MRIs,…

    Submitted 25 May, 2024; originally announced June 2024.

    Comments: 4 pages, 4 figures

  2. arXiv:2406.01795  [pdf, other]

    eess.IV

    Video Coding with Cross-Component Sample Offset

    Authors: Han Gao, Xin Zhao, Tianqi Liu, Shan Liu

    Abstract: Beyond the exploration of traditional spatial, temporal and subjective visual signal redundancy in image and video compression, recent research has focused on leveraging cross-color component redundancy to enhance coding efficiency. Cross-component coding approaches are motivated by the statistical correlations among different color components, such as those in the Y'CbCr color space, where luma (…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages

  3. arXiv:2405.17818  [pdf, other]

    cs.CV eess.IV

    Hyperspectral and multispectral image fusion with arbitrary resolution through self-supervised representations

    Authors: Ting Wang, Zipei Yan, Jizhou Li, Xile Zhao, Chao Wang, Michael Ng

    Abstract: The fusion of a low-resolution hyperspectral image (LR-HSI) with a high-resolution multispectral image (HR-MSI) has emerged as an effective technique for achieving HSI super-resolution (SR). Previous studies have mainly concentrated on estimating the posterior distribution of the latent high-resolution hyperspectral image (HR-HSI), leveraging an appropriate image prior and likelihood computed from…

    Submitted 28 May, 2024; originally announced May 2024.

  4. arXiv:2405.17241  [pdf, other]

    cs.CV eess.IV

    NeurTV: Total Variation on the Neural Domain

    Authors: Yisi Luo, Xile Zhao, Kai Ye, Deyu Meng

    Abstract: Recently, we have witnessed the success of total variation (TV) for many imaging applications. However, traditional TV is defined on the original pixel domain, which limits its potential. In this work, we suggest a new TV regularization defined on the neural domain. Concretely, the discrete data is continuously and implicitly represented by a deep neural network (DNN), and we use the derivatives o…

    Submitted 27 May, 2024; originally announced May 2024.

    MSC Class: 94A08; 68U10; 68T45

  5. arXiv:2405.15205  [pdf, other]

    eess.IV cs.CV

    Enhancing Generalized Fetal Brain MRI Segmentation using A Cascade Network with Depth-wise Separable Convolution and Attention Mechanism

    Authors: Zhigao Cai, Xing-Ming Zhao

    Abstract: Automatic segmentation of the fetal brain is still challenging due to the health state of fetal development, motion artifacts, and variability across gestational ages, since existing methods rely on high-quality datasets of healthy fetuses. In this work, we propose a novel cascade network called CasUNext to enhance the accuracy and generalization of fetal brain MRI segmentation. CasUNext incorpora…

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2404.18418  [pdf, other]

    cs.NI eess.SY

    Decomposition Model Assisted Energy-Saving Design in Radio Access Network

    Authors: Xiaoxue Zhao, Yijun Yu, Yexing Li, Dong Li, Yao Wang, Chungang Yang

    Abstract: The continuous emergence of novel services and massive connections involves huge energy consumption in ultra-dense radio access networks. Moreover, there are many more controllable parameters that can be adjusted to reduce the energy consumption from a network-wide perspective. However, a network-level energy-saving intent usually contains multiple network objectives and constraint…

    Submitted 29 April, 2024; originally announced April 2024.

  7. arXiv:2404.13786  [pdf, other]

    eess.SY cs.AI cs.DC cs.LG

    Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving

    Authors: Shuyao Shi, Neiwen Ling, Zhehao Jiang, Xuan Huang, Yuze He, Xiaoguang Zhao, Bufang Yang, Chen Bian, **gfei Xia, Zhenyu Yan, Raymond Yeung, Guoliang Xing

    Abstract: Recently, smart roadside infrastructure (SRI) has demonstrated the potential of achieving fully autonomous driving systems. To explore the potential of infrastructure-assisted autonomous driving, this paper presents the design and deployment of Soar, the first end-to-end SRI system specifically designed to support autonomous driving systems. Soar consists of both software and hardware components ca…

    Submitted 21 April, 2024; originally announced April 2024.

  8. arXiv:2404.06991  [pdf, other]

    eess.IV cs.CV

    Ray-driven Spectral CT Reconstruction Based on Neural Base-Material Fields

    Authors: Ligen Shi, Chang Liu, ** Yang, Jun Qiu, Xing Zhao

    Abstract: In spectral CT reconstruction, the basis materials decomposition involves solving a large-scale nonlinear system of integral equations, which is highly ill-posed mathematically. This paper proposes a model that parameterizes the attenuation coefficients of the object using a neural field representation, thereby avoiding the complex calculations of pixel-driven projection coefficient matrices durin…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 14 pages, 16 figures

    MSC Class: 68U05; 65D18 ACM Class: I.4.5; I.4.10

  9. arXiv:2403.16136  [pdf, ps, other]

    eess.SY

    Data-Driven Sliding Mode Control for Partially Unknown Nonlinear Systems

    Authors: Jianglin Lan, Xianxian Zhao, Congcong Sun

    Abstract: This paper introduces a new design method for data-driven control of nonlinear systems with partially unknown dynamics and unknown bounded disturbance. Since it is not possible to achieve exact nonlinearity cancellation in the presence of unknown disturbance, this paper adapts the idea of sliding mode control (SMC) to ensure system stability and robustness without assuming that the nonlinearity go…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE CDC 2024

  10. arXiv:2403.16132  [pdf, ps, other]

    eess.SY cs.LG

    Runtime Monitoring and Fault Detection for Neural Network-Controlled Systems

    Authors: Jianglin Lan, Siyuan Zhan, Ron Patton, Xianxian Zhao

    Abstract: There is an emerging trend in applying deep learning methods to control complex nonlinear systems. This paper considers enhancing the runtime safety of nonlinear systems controlled by neural networks in the presence of disturbance and measurement noise. A robustly stable interval observer is designed to generate sound and precise lower and upper bounds for the neural network, nonlinear function, a…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted to SAFEPROCESS 2024

  11. arXiv:2403.09407  [pdf, other]

    cs.SD cs.AI cs.LG cs.MM eess.AS

    LM2D: Lyrics- and Music-Driven Dance Synthesis

    Authors: Wenjie Yin, Xuejiao Zhao, Yi Yu, Hang Yin, Danica Kragic, Mårten Björkman

    Abstract: Dance typically involves professional choreography with complex movements that follow a musical rhythm and can also be influenced by lyrical content. The integration of lyrics, in addition to the auditory dimension, enriches the foundational tone and makes motion generation more amenable to its semantic meanings. However, existing dance synthesis methods tend to model motions only conditioned on au…

    Submitted 14 March, 2024; originally announced March 2024.

  12. arXiv:2403.06538  [pdf, other]

    cs.RO cs.CV eess.IV

    3DRef: 3D Dataset and Benchmark for Reflection Detection in RGB and Lidar Data

    Authors: Xiting Zhao, Sören Schwertfeger

    Abstract: Reflective surfaces present a persistent challenge for reliable 3D mapping and perception in robotics and autonomous systems. However, existing reflection datasets and benchmarks remain limited to sparse 2D data. This paper introduces the first large-scale 3D reflection detection dataset containing more than 50,000 aligned samples of multi-return Lidar, RGB images, and 2D/3D semantic labels across…

    Submitted 11 March, 2024; originally announced March 2024.

  13. arXiv:2402.18231  [pdf, other]

    eess.SP

    Joint Beamforming Design and Stream Allocation for Non-Coherent Joint Transmission in Cell-Free MIMO Networks

    Authors: Xi Wang, Xiaotong Zhao, Juncheng Wang, You Li, Qingjiang Shi

    Abstract: We consider joint beamforming and stream allocation to maximize the weighted sum rate (WSR) for non-coherent joint transmission (NCJT) in user-centric cell-free MIMO networks, where distributed access points (APs) are organized in clusters to transmit different signals to serve each user equipment (UE). We for the first time consider the common limits of maximum number of receive streams at UEs in…

    Submitted 28 February, 2024; originally announced February 2024.

  14. arXiv:2402.16371  [pdf, other]

    eess.IV

    Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction

    Authors: Wen-Yang Lu, Eduardo Pavez, Antonio Ortega, Xin Zhao, Shan Liu

    Abstract: Current video coding standards, including H.264/AVC, HEVC, and VVC, employ discrete cosine transform (DCT), discrete sine transform (DST), and secondary Karhunen-Loeve transforms (KLTs) to decorrelate the intra-prediction residuals. However, the efficiency of these transforms in decorrelation can be limited when the signal has a non-smooth and non-periodic structure, such as those occurring in tex…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 5 pages, 4 figures

  15. arXiv:2401.00708  [pdf, other]

    cs.CV eess.IV

    Revisiting Nonlocal Self-Similarity from Continuous Representation

    Authors: Yisi Luo, Xile Zhao, Deyu Meng

    Abstract: Nonlocal self-similarity (NSS) is an important prior that has been successfully applied in multi-dimensional data processing tasks, e.g., image and video recovery. However, existing NSS-based methods are solely suitable for meshgrid data such as images and videos, but are not suitable for emerging off-meshgrid data, e.g., point cloud and climate data. In this work, we revisit the NSS from the cont…

    Submitted 1 January, 2024; originally announced January 2024.

  16. arXiv:2312.01573  [pdf]

    eess.IV cs.CV

    Survey on deep learning in multimodal medical imaging for cancer detection

    Authors: Yan Tian, Zhaocheng Xu, Yujun Ma, Wei** Ding, Ruili Wang, Zhihong Gao, Guohua Cheng, Linyang He, Xuran Zhao

    Abstract: The task of multimodal cancer detection is to determine the locations and categories of lesions by using different imaging techniques, which is one of the key research methods for cancer diagnosis. Recently, deep learning-based object detection has made significant developments due to its strength in semantic feature extraction and nonlinear function fitting. However, multimodal cancer detection r…

    Submitted 3 December, 2023; originally announced December 2023.

    Journal ref: Neural Computing and Applications. 2023 Nov 29:1-6

  17. arXiv:2311.08225  [pdf, other]

    eess.IV cs.CV

    Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images

    Authors: Zhiyun Song, Zengxin Qi, Xin Wang, Xiangyu Zhao, Zhenrong Shen, Sheng Wang, Manman Fei, Zhe Wang, Di Zang, Dongdong Chen, Linlin Yao, Qian Wang, Xuehai Wu, Lichi Zhang

    Abstract: Cross-modality synthesis (CMS), super-resolution (SR), and their combination (CMSR) have been extensively studied for magnetic resonance imaging (MRI). Their primary goals are to enhance the imaging quality by synthesizing the desired modality and reducing the slice thickness. Despite the promising synthetic results, these techniques are often tailored to specific tasks, thereby limiting their ada…

    Submitted 14 November, 2023; originally announced November 2023.

  18. arXiv:2310.01565  [pdf, other]

    cs.LG cs.IR eess.IV

    Causality-informed Rapid Post-hurricane Building Damage Detection in Large Scale from InSAR Imagery

    Authors: Chenguang Wang, Yepeng Liu, Xiaojian Zhang, Xuechun Li, Vladimir Paramygin, Arthriya Subgranon, Peter Sheng, Xilei Zhao, Susu Xu

    Abstract: Timely and accurate assessment of hurricane-induced building damage is crucial for effective post-hurricane response and recovery efforts. Recently, remote sensing technologies provide large-scale optical or Interferometric Synthetic Aperture Radar (InSAR) imagery data immediately after a disastrous event, which can be readily used to conduct rapid building damage assessment. Compared to optical s…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 6 pages, 3 figures

  19. arXiv:2310.00153  [pdf]

    physics.med-ph eess.SP physics.app-ph

    Conformal Metamaterials with Active Tunability and Self-adaptivity for Magnetic Resonance Imaging

    Authors: Ke Wu, Xia Zhu, Xiaoguang Zhao, Stephan W. Anderson, Xin Zhang

    Abstract: Ongoing effort has been devoted to applying metamaterials to boost the imaging performance of magnetic resonance imaging owing to their unique capacity for electromagnetic field confinement and enhancement. However, there are still major obstacles to widespread clinical adoption of conventional metamaterials due to several notable restrictions, namely: their typically bulky and rigid structures, d…

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: 21 pages, 7 figures

  20. arXiv:2309.07422  [pdf, other]

    eess.SY math.OC

    Grid-Aware On-Route Fast-Charging Infrastructure Planning for Battery Electric Bus with Equity Considerations: A Case Study in South King County

    Authors: Xinyi Zhao, Chaoyue Zhao, Grace Jia

    Abstract: The transition from traditional bus fleets to zero-emission ones necessitates the development of effective planning models for battery electric bus (BEB) charging infrastructure. On-route fast charging stations, distinct from on-base charging stations, present unique challenges related to safe operation and power supply capacity, making it difficult to control grid operational costs. This paper es…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 18 pages, 16 figures

  21. arXiv:2309.00971  [pdf, other]

    eess.IV cs.CV

    AdLER: Adversarial Training with Label Error Rectification for One-Shot Medical Image Segmentation

    Authors: Xiangyu Zhao, Sheng Wang, Zhiyun Song, Zhenrong Shen, Linlin Yao, Haolei Yuan, Qian Wang, Lichi Zhang

    Abstract: Accurate automatic segmentation of medical images typically requires large datasets with high-quality annotations, making it less applicable in clinical settings due to limited training data. One-shot segmentation based on learned transformations (OSSLT) has shown promise when labeled data is extremely limited, typically including unsupervised deformable registration, data augmentation with learne…

    Submitted 2 September, 2023; originally announced September 2023.

  22. arXiv:2308.08866  [pdf, other]

    math.NA eess.IV

    An inexact proximal majorization-minimization Algorithm for remote sensing image stripe noise removal

    Authors: Cheng**g Wang, Xile Zhao, Qingsong Wang, Zepei Ma, Peipei Tang

    Abstract: The stripe noise existing in remote sensing images badly degrades the visual quality and restricts the precision of data analysis. Therefore, many destriping models have been proposed in recent years. In contrast to these existing models, in this paper, we propose a nonconvex model with a DC function (i.e., the difference of convex functions) structure to remove the stripe noise. To solve this mode…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 19 pages, 3 figures

    MSC Class: 65K05; 90C26; 90C90

  23. arXiv:2308.03777  [pdf]

    physics.bio-ph eess.IV eess.SP

    Lab-in-a-Tube: A portable imaging spectrophotometer for cost-effective, high-throughput, and label-free analysis of centrifugation processes

    Authors: Yuanyuan Wei, Dehua Hu, Bijie Bai, Chenqi Meng, Tsz Kin Chan, Xing Zhao, Yuye Wang, Yi-** Ho, Wu Yuan, Ho-Pui Ho

    Abstract: Centrifuges serve as essential instruments in modern experimental sciences, facilitating a wide range of routine sample processing tasks that necessitate material sedimentation. However, the study for real time observation of the dynamical process during centrifugation has remained elusive. In this study, we developed an innovative Lab_in_a_Tube imaging spectrophotometer that incorporates capabili…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 21 Pages, 6 Figures

  24. arXiv:2308.02844  [pdf, other]

    cs.IR cs.SD eess.AS

    Bootstrapping Contrastive Learning Enhanced Music Cold-Start Matching

    Authors: ** Zhao, Ying Zhang, Qiang Xiao, Yuming Ren, Yingchun Yang

    Abstract: We study a particular matching task we call Music Cold-Start Matching. In short, given a cold-start song request, we expect to retrieve songs with similar audiences and then quickly push the cold-start song to the audiences of the retrieved songs to warm it up. However, there are hardly any studies done on this task. Therefore, in this paper, we will formalize the problem of Music Cold-Start Matchi…

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Accepted by WWW'2023

    ACM Class: F.2.2; I.2.8

    Journal ref: Companion Proceedings of the ACM Web Conference 2023, April 2023, Pages 351-355

  25. arXiv:2307.11784  [pdf, other]

    cs.LG cs.AI cs.SE eess.SY

    What, Indeed, is an Achievable Provable Guarantee for Learning-Enabled Safety Critical Systems

    Authors: Saddek Bensalem, Chih-Hong Cheng, Wei Huang, Xiaowei Huang, Changshun Wu, Xingyu Zhao

    Abstract: Machine learning has made remarkable advancements, but confidently utilising learning-enabled components in safety-critical domains still poses challenges. Among the challenges, it is known that a rigorous, yet practical, way of achieving safety guarantees is one of the most prominent. In this paper, we first discuss the engineering and research challenges associated with the design and verificati…

    Submitted 20 July, 2023; originally announced July 2023.

  26. arXiv:2307.08631  [pdf, ps, other]

    cs.IT eess.SP

    Dual-Functional MIMO Beamforming Optimization for RIS-Aided Integrated Sensing and Communication

    Authors: Xin Zhao, Heng Liu, Shiqi Gong, Xin Ju, Chengwen Xing, Nan Zhao

    Abstract: Aiming at providing wireless communication systems with environment-perceptive capacity, emerging integrated sensing and communication (ISAC) technologies face multiple difficulties, especially in balancing the performance trade-off between the communication and radar functions. In this paper, we introduce a reconfigurable intelligent surface (RIS) to assist both data transmission and target detec…

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 30 pages, 8 figures, manuscript submitted to IEEE TCOM

  27. arXiv:2307.04390  [pdf]

    eess.IV cs.CV cs.LG

    CT-based Subchondral Bone Microstructural Analysis in Knee Osteoarthritis via MR-Guided Distillation Learning

    Authors: Yuqi Hu, Xiangyu Zhao, Gaowei Qing, Kai Xie, Chenglei Liu, Lichi Zhang

    Abstract: Background: MR-based subchondral bone effectively predicts knee osteoarthritis. However, its clinical application is limited by the cost and time of MR. Purpose: We aim to develop a novel distillation-learning-based method named SRRD for subchondral bone microstructural analysis using easily-acquired CT images, which leverages paired MR images to enhance the CT-based analysis model during training…

    Submitted 11 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

    Comments: 5 figures, 4 tables

  28. arXiv:2306.15753  [pdf]

    physics.soc-ph eess.SY

    Integrated Simulation Platform for Quantifying the Traffic-Induced Environmental and Health Impacts

    Authors: Xuanpeng Zhao, Guoyuan Wu, Akula Venkatram, Ji Luo, Peng Hao, Kanok Boriboonsomsin, Shaohua Hu

    Abstract: Air quality and human exposure to mobile source pollutants have become major concerns in urban transportation. Existing studies mainly focus on mitigating traffic congestion and reducing carbon footprints, with limited understanding of traffic-related health impacts from the environmental justice perspective. To address this gap, we present an innovative integrated simulation platform that models…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 35 pages, 11 figures

  29. arXiv:2306.11977  [pdf]

    eess.IV cs.CV

    Encoding Enhanced Complex CNN for Accurate and Highly Accelerated MRI

    Authors: Zimeng Li, Sa Xiao, Cheng Wang, Haidong Li, Xiuchao Zhao, Caohui Duan, Qian Zhou, Qiuchen Rao, Yuan Fang, Junshuai Xie, Lei Shi, Fumin Guo, Chaohui Ye, Xin Zhou

    Abstract: Magnetic resonance imaging (MRI) using hyperpolarized noble gases provides a way to visualize the structure and function of human lung, but the long imaging time limits its broad research and clinical applications. Deep learning has demonstrated great potential for accelerating MRI by reconstructing images from undersampled data. However, most existing deep convolutional neural networks (CNN) direc…

    Submitted 13 November, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  30. arXiv:2306.01986  [pdf]

    cs.LG cs.AI eess.SY math.OC

    A Novel Correlation-optimized Deep Learning Method for Wind Speed Forecast

    Authors: Yang Yang, ** Lang, Jian Wu, Yanyan Zhang, Xiang Zhao

    Abstract: The increasing installation rate of wind power poses great challenges to the global power system. In order to ensure the reliable operation of the power system, it is necessary to accurately forecast the wind speed and power of the wind turbines. At present, deep learning is progressively applied to the wind speed prediction. Nevertheless, the recent deep learning methods still reflect the embarra…

    Submitted 9 June, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  31. arXiv:2305.13957  [pdf, other]

    eess.AS

    Eeg2vec: Self-Supervised Electroencephalographic Representation Learning

    Authors: Qiushi Zhu, Xiaoying Zhao, Jie Zhang, Yu Gu, Chao Weng, Yuchen Hu

    Abstract: Recently, many efforts have been made to explore how the brain processes speech using electroencephalographic (EEG) signals, where deep learning-based approaches were shown to be applicable in this field. In order to decode speech signals from EEG signals, linear networks, convolutional neural networks (CNN) and long short-term memory networks are often used in a supervised manner. Recording EEG-s…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 5 pages

  32. arXiv:2305.13869  [pdf, other]

    physics.acc-ph cs.AI cs.LG eess.SY

    Trend-Based SAC Beam Control Method with Zero-Shot in Superconducting Linear Accelerator

    Authors: Xiaolong Chen, Xin Qi, Chunguang Su, Yuan He, Zhijun Wang, Kunxiang Sun, Chao **, Weilong Chen, Shuhui Liu, Xiaoying Zhao, Duanyang Jia, Man Yi

    Abstract: The superconducting linear accelerator is a highly flexible facility for modern scientific discoveries, necessitating weekly reconfiguration and tuning. Accordingly, minimizing setup time proves essential in affording users with ample experimental time. We propose a trend-based soft actor-critic (TBSAC) beam control method with strong robustness, allowing the agents to be trained in a simulated en…

    Submitted 25 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  33. arXiv:2305.12805  [pdf, ps, other]

    eess.SP

    Decentralized Equalization for Massive MIMO Systems With Colored Noise Samples

    Authors: Xiaotong Zhao, Mian Li, Bo Wang, Enbin Song, Tsung-Hui Chang, Qingjiang Shi

    Abstract: Recently, the decentralized baseband processing (DBP) paradigm and relevant detection methods have been proposed to enable extremely large-scale massive multiple-input multiple-output technology. Under the DBP architecture, base station antennas are divided into several independent clusters, each connected to a local computing fabric. However, current detection methods tailored to DBP only conside…

    Submitted 22 May, 2023; originally announced May 2023.

  34. arXiv:2305.09167  [pdf, other]

    cs.SD cs.CL eess.AS

    Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion

    Authors: Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng

    Abstract: Nowadays, recognition-synthesis-based methods have been quite popular with voice conversion (VC). By introducing linguistics features with good disentangling characters extracted from an automatic speech recognition (ASR) model, the VC performance achieved considerable breakthroughs. Recently, self-supervised learning (SSL) methods trained with a large-scale unannotated speech corpus have been app…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted by ICME 2023

  35. arXiv:2305.08078  [pdf, other]

    eess.IV cs.CV

    Supervised Domain Adaptation for Recognizing Retinal Diseases from Wide-Field Fundus Images

    Authors: Qijie Wei, **gyuan Yang, Bo Wang, **rui Wang, Jianchun Zhao, Xinyu Zhao, Sheng Yang, Niranchana Manivannan, Youxin Chen, Dayong Ding, **g Zhou, Xirong Li

    Abstract: This paper addresses the emerging task of recognizing multiple retinal diseases from wide-field (WF) and ultra-wide-field (UWF) fundus images. For an effective use of existing large amount of labeled color fundus photo (CFP) data and the relatively small amount of WF and UWF data, we propose a supervised domain adaptation method named Cross-domain Collaborative Learning (CdCL). Inspired by the suc…

    Submitted 23 October, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

    Comments: Accepted by BIBM2023

  36. arXiv:2305.01138  [pdf, other]

    eess.IV cs.CV

    High-Fidelity Image Synthesis from Pulmonary Nodule Lesion Maps using Semantic Diffusion Model

    Authors: Xuan Zhao, Benjamin Hou

    Abstract: Lung cancer has been one of the leading causes of cancer-related deaths worldwide for years. With the emergence of deep learning, computer-assisted diagnosis (CAD) models based on learning algorithms can accelerate the nodule screening process, providing valuable assistance to radiologists in their daily clinical workflows. However, developing such robust and accurate models often requires large-s…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 4 pages, 1 figure, submitted to MIDL 2023

    ACM Class: I.2.1; J.3

  37. arXiv:2303.10232  [pdf, other]

    eess.IV cs.CV cs.LG

    LSwinSR: UAV Imagery Super-Resolution based on Linear Swin Transformer

    Authors: Rui Li, Xiaowei Zhao

    Abstract: Super-resolution, which aims to reconstruct high-resolution images from low-resolution images, has drawn considerable attention and has been intensively studied in computer vision and remote sensing communities. The super-resolution technology is especially beneficial for Unmanned Aerial Vehicles (UAV), as the amount and resolution of images captured by UAV are highly limited by physical constrain…

    Submitted 17 March, 2023; originally announced March 2023.

  38. arXiv:2303.08671  [pdf, other]

    cs.NI eess.SY

    A Dual-Cluster-Head Based Medium Access Control for Large-Scale UAV Ad-Hoc Networks

    Authors: Xinru Zhao, Zhiqing Wei, Yingying Zou, Hao Ma, Yanpeng Cui, Zhiyong Feng

    Abstract: Unmanned Aerial Vehicle (UAV) ad hoc network has achieved significant growth for its flexibility, extensibility, and high deployability in recent years. The application of clustering scheme for UAV ad hoc network is imperative to enhance the performance of throughput and energy efficiency. In conventional clustering scheme, a single cluster head (CH) is always assigned in each cluster. However, th…

    Submitted 26 February, 2023; originally announced March 2023.

    Comments: 10 pages, 12 figures, journal

  39. arXiv:2303.08268  [pdf, other]

    cs.RO cs.AI cs.CL cs.LG cs.SD eess.AS

    Chat with the Environment: Interactive Multimodal Perception Using Large Language Models

    Authors: Xufeng Zhao, Mengdi Li, Cornelius Weber, Muhammad Burhan Hafez, Stefan Wermter

    Abstract: Programming robot behavior in a complex world faces challenges on multiple levels, from dextrous low-level skills to high-level planning and reasoning. Recent pre-trained Large Language Models (LLMs) have shown remarkable reasoning ability in few-shot robotic planning. However, it remains challenging to ground LLMs in multimodal sensory input and continuous action output, while enabling a robot to…

    Submitted 11 October, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: IROS2023, Detroit. See the project website at https://matcha-agent.github.io

  40. arXiv:2303.06877  [pdf, other]

    cs.CV eess.IV

    Progressive Open Space Expansion for Open-Set Model Attribution

    Authors: Tianyun Yang, Danding Wang, Fan Tang, Xinying Zhao, Juan Cao, Sheng Tang

    Abstract: Despite the remarkable progress in generative technology, the Janus-faced issues of intellectual property protection and malicious content supervision have arisen. Efforts have been paid to manage synthetic images by attributing them to a set of potential source models. However, the closed-set classification setting limits the application in real-world scenarios for handling contents generated by…

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: accepted to CVPR2023

  41. arXiv:2303.00272  [pdf]

    eess.SY

    Influence of Spattering on In-process Layer Surface Roughness during Laser Powder Bed Fusion

    Authors: Haolin Zhang, Chaitanya Krishna Prasad Vallabh, Xiayun Zhao

    Abstract: Laser powder bed fusion (LPBF) holds promise to efficiently produce metal parts. However, LPBF incurs stochastic melt pool (MP) spattering, which would roughen workpiece in-process surface, thus weakening inter-layer bonding and causing issues like porosity, powder contamination, and recoater intervention. Understanding the consequential effect of MP spattering on layer surface remains difficult d…

    Submitted 1 March, 2023; originally announced March 2023.

  42. arXiv:2302.08342  [pdf, other]

    eess.AS cs.SD

    Speech Enhancement with Multi-granularity Vector Quantization

    Authors: Xiao-Ying Zhao, Qiu-Shi Zhu, Jie Zhang

    Abstract: With advances in deep learning, neural network based speech enhancement (SE) has developed rapidly in the last decade. Meanwhile, the self-supervised pre-trained model and vector quantization (VQ) have achieved excellent performance on many speech-related tasks, while they are less explored on SE. As it was shown in our previous work that utilizing a VQ module to discretize noisy speech representa…

    Submitted 16 February, 2023; originally announced February 2023.

  43. arXiv:2302.01728  [pdf, other]

    eess.SY cs.DC

    Decentralised and Cooperative Control of Multi-Robot Systems through Distributed Optimisation

    Authors: Yi Dong, Zhongguo Li, Xingyu Zhao, Zhengtao Ding, Xiaowei Huang

    Abstract: Multi-robot cooperative control has gained extensive research interest due to its wide applications in civil, security, and military domains. This paper proposes a cooperative control algorithm for multi-robot systems with general linear dynamics. The algorithm is based on distributed cooperative optimisation and output regulation, and it achieves global optimum by utilising only information share…

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted by AAMAS'23

  44. arXiv:2301.03281  [pdf, other]

    eess.IV cs.CV

    The state-of-the-art 3D anisotropic intracranial hemorrhage segmentation on non-contrast head CT: The INSTANCE challenge

    Authors: Xiangyu Li, Gongning Luo, Kuanquan Wang, Hongyu Wang, Jun Liu, Xinjie Liang, Jie Jiang, Zhenghao Song, Chunyue Zheng, Haokai Chi, Mingwang Xu, Yingte He, Xinghua Ma, Jingwen Guo, Yifan Liu, Chuanpu Li, Zeli Chen, Md Mahfuzur Rahman Siddiquee, Andriy Myronenko, Antoine P. Sanner, Anirban Mukhopadhyay, Ahmed E. Othman, Xingyu Zhao, Weiping Liu, Jinhuang Zhang, et al. (9 additional authors not shown)

    Abstract: Automatic intracranial hemorrhage segmentation in 3D non-contrast head CT (NCCT) scans is significant in clinical practice. Existing hemorrhage segmentation methods usually ignore the anisotropic nature of the NCCT, and are evaluated on different in-house datasets with distinct metrics, making it highly challenging to improve segmentation performance and perform objective comparisons among differ…

    Submitted 12 January, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: Summarized paper for the MICCAI INSTANCE 2022 Challenge

  45. arXiv:2212.06596  [pdf, other]

    cs.IT eess.SP

    Broadband Digital Over-the-Air Computation for Wireless Federated Edge Learning

    Authors: Lizhao You, Xinbo Zhao, Rui Cao, Yulin Shao, Liqun Fu

    Abstract: This paper presents the first orthogonal frequency-division multiplexing (OFDM)-based digital over-the-air computation (AirComp) system for wireless federated edge learning, where multiple edge devices transmit model data simultaneously using non-orthogonal OFDM subcarriers, and the edge server aggregates data directly from the superimposed signal. Existing analog AirComp systems often assume perfe…

    Submitted 5 July, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: 20 pages. arXiv admin note: text overlap with arXiv:2111.10508

  46. arXiv:2212.00262  [pdf, other]

    cs.CV cs.LG eess.IV

    Low-Rank Tensor Function Representation for Multi-Dimensional Data Recovery

    Authors: Yisi Luo, Xile Zhao, Zhemin Li, Michael K. Ng, Deyu Meng

    Abstract: Since higher-order tensors are naturally suitable for representing multi-dimensional data in the real world, e.g., color images and videos, low-rank tensor representation has become one of the emerging areas in machine learning and computer vision. However, classical low-rank tensor representations can only represent data on a finite meshgrid due to their intrinsically discrete nature, which hinders their…

    Submitted 30 November, 2022; originally announced December 2022.

  47. arXiv:2211.01294  [pdf]

    eess.SY

    Driver Digital Twin for Online Prediction of Personalized Lane Change Behavior

    Authors: Xishun Liao, Xuanpeng Zhao, Ziran Wang, Zhouqiao Zhao, Kyungtae Han, Rohit Gupta, Matthew J. Barth, Guoyuan Wu

    Abstract: Connected and automated vehicles (CAVs) are expected to share the road with human-driven vehicles (HDVs) in the foreseeable future. Therefore, considering the mixed traffic environment is more pragmatic, as the well-planned operation of CAVs may be interrupted by HDVs. In circumstances where human behaviors have significant impacts, CAVs need to understand HDV behaviors to take safe actions. In th…

    Submitted 2 November, 2022; originally announced November 2022.

  48. arXiv:2210.08521  [pdf, other]

    cs.CV eess.SP

    Demystifying CNNs for Images by Matched Filters

    Authors: Shengxi Li, Xinyi Zhao, Ljubisa Stankovic, Danilo Mandic

    Abstract: The success of convolutional neural networks (CNNs) has been revolutionising the way we approach and use intelligent machines in the Big Data era. Despite this success, CNNs have been consistently put under scrutiny owing to their black-box nature, the ad hoc manner of their construction, together with the lack of theoretical support and physical meanings of their operation. This has been…

    Submitted 16 October, 2022; originally announced October 2022.

  49. arXiv:2210.06973  [pdf, other]

    eess.SP

    Contrastive Psudo-supervised Classification for Intra-Pulse Modulation of Radar Emitter Signals Using data augmentation

    Authors: HanCong Feng, XinHai Yan, KaiLi Jiang, XinYu Zhao, Bin Tang

    Abstract: The automatic classification of radar waveforms is a fundamental technique in electronic countermeasures (ECM). Recent supervised deep learning-based methods have achieved great success in such a classification task. However, those methods require enough labeled samples to work properly, and in many circumstances these are not available. To tackle this problem, in this paper, we propose a three-stage dee…

    Submitted 13 October, 2022; originally announced October 2022.

  50. 3D Matting: A Benchmark Study on Soft Segmentation Method for Pulmonary Nodules Applied in Computed Tomography

    Authors: Lin Wang, Xiufen Ye, Donghao Zhang, Wanji He, Lie Ju, Yi Luo, Huan Luo, Xin Wang, Wei Feng, Kaimin Song, Xin Zhao, Zongyuan Ge

    Abstract: Usually, lesions are not isolated but are associated with the surrounding tissues. For example, the growth of a tumour can depend on or infiltrate into the surrounding tissues. Due to the pathological nature of the lesions, it is challenging to distinguish their boundaries in medical imaging. However, these uncertain regions may contain diagnostic information. Therefore, the simple binarization of…

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted by Computers in Biology and Medicine. arXiv admin note: substantial text overlap with arXiv:2209.07843