Skip to main content

Showing 1–50 of 80 results for author: Liang, X

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.08283  [pdf, other

    cs.RO eess.SY

    A Hybrid Task-Constrained Motion Planning for Collaborative Robots in Intelligent Remanufacturing

    Authors: Wansong Liu, Chang Liu, Xiao Liang, Minghui Zheng

    Abstract: Industrial manipulators have extensively collaborated with human operators to execute tasks, e.g., disassembly of end-of-use products, in intelligent remanufacturing. A safety task execution requires real-time path planning for the manipulator's end-effector to autonomously avoid human operators. This is even more challenging when the end-effector needs to follow a planned path while avoiding the… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2405.14029  [pdf, ps, other

    cs.IT eess.SP

    Analog Beamforming Enabled Multicasting: Finite-Alphabet Inputs and Statistical CSI

    Authors: Yanjun Wu, Zhong Xie, Zhuochen Xie, Chongjun Ouyang, Xuwen Liang

    Abstract: The average multicast rate (AMR) is analyzed in a multicast channel utilizing analog beamforming with finite-alphabet inputs, considering statistical channel state information (CSI). New expressions for the AMR are derived for non-cooperative and cooperative multicasting scenarios. Asymptotic analyses are conducted in the high signal-to-noise ratio regime to derive the array gain and diversity ord… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 5 pages

  3. arXiv:2405.10116  [pdf, other

    eess.SY eess.SP

    Enhancing Energy Efficiency in O-RAN Through Intelligent xApps Deployment

    Authors: Xuanyu Liang, Ahmed Al-Tahmeesschi, Qiao Wang, Swarna Chetty, Chenrui Sun, Hamed Ahmadi

    Abstract: The proliferation of 5G technology presents an unprecedented challenge in managing the energy consumption of densely deployed network infrastructures, particularly Base Stations (BSs), which account for the majority of power usage in mobile networks. The O-RAN architecture, with its emphasis on open and intelligent design, offers a promising framework to address the Energy Efficiency (EE) demands… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures

  4. arXiv:2405.10087  [pdf, other

    eess.SP

    Continuous Transfer Learning for UAV Communication-aware Trajectory Design

    Authors: Chenrui Sun, Gianluca Fontanesi, Swarna Bindu Chetty, Xuanyu Liang, Berk Canberk, Hamed Ahmadi

    Abstract: Deep Reinforcement Learning (DRL) emerges as a prime solution for Unmanned Aerial Vehicle (UAV) trajectory planning, offering proficiency in navigating high-dimensional spaces, adaptability to dynamic environments, and making sequential decisions based on real-time feedback. Despite these advantages, the use of DRL for UAV trajectory planning requires significant retraining when the UAV is confron… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 6 pages

  5. arXiv:2405.07962  [pdf, other

    cs.RO eess.SY

    KG-Planner: Knowledge-Informed Graph Neural Planning for Collaborative Manipulators

    Authors: Wansong Liu, Kareem Eltouny, Sibo Tian, Xiao Liang, Minghui Zheng

    Abstract: This paper presents a novel knowledge-informed graph neural planner (KG-Planner) to address the challenge of efficiently planning collision-free motions for robots in high-dimensional spaces, considering both static and dynamic environments involving humans. Unlike traditional motion planners that struggle with finding a balance between efficiency and optimality, the KG-Planner takes a different a… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  6. arXiv:2404.15366  [pdf, other

    eess.SP cs.LG

    A Weight-aware-based Multi-source Unsupervised Domain Adaptation Method for Human Motion Intention Recognition

    Authors: Xiao-Yin Liu, Guotao Li, Xiao-Hu Zhou, Xu Liang, Zeng-Guang Hou

    Abstract: Accurate recognition of human motion intention (HMI) is beneficial for exoskeleton robots to improve the wearing comfort level and achieve natural human-robot interaction. A classifier trained on labeled source subjects (domains) performs poorly on unlabeled target subject since the difference in individual motor characteristics. The unsupervised domain adaptation (UDA) method has become an effect… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 13 pages, 5 figures

  7. Improving Disturbance Estimation and Suppression via Learning among Systems with Mismatched Dynamics

    Authors: Harsh Modi, Zhu Chen, Xiao Liang, Minghui Zheng

    Abstract: Iterative learning control (ILC) is a method for reducing system tracking or estimation errors over multiple iterations by using information from past iterations. The disturbance observer (DOB) is used to estimate and mitigate disturbances within the system, while the system is being affected by them. ILC enhances system performance by introducing a feedforward signal in each iteration. However, i… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  8. arXiv:2403.14135  [pdf, other

    eess.IV cs.CV

    Powerful Lossy Compression for Noisy Images

    Authors: Shilv Cai, Xiaoguo Liang, Shuning Cao, Luxin Yan, Sheng Zhong, Liqun Chen, Xu Zou

    Abstract: Image compression and denoising represent fundamental challenges in image processing with many real-world applications. To address practical demands, current solutions can be categorized into two main strategies: 1) sequential method; and 2) joint method. However, sequential methods have the disadvantage of error accumulation as there is information loss between multiple individual models. Recentl… ▽ More

    Submitted 26 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted by ICME 2024

  9. arXiv:2403.13562  [pdf, other

    eess.SY

    Augmented Labeled Random Finite Sets and Its Application to Group Target Tracking

    Authors: Chaoqun Yang, Mengdie Xu, Xiaowei Liang, Zhiguo Shi, Heng Zhang, Xianghui Cao

    Abstract: This paper addresses the problem of group target tracking (GTT), wherein multiple closely spaced targets within a group pose a coordinated motion. To improve the tracking performance, the labeled random finite sets (LRFSs) theory is adopted, and this paper develops a new kind of LRFSs, i.e., augmented LRFSs, which introduces group information into the definition of LRFSs. Specifically, for each el… ▽ More

    Submitted 16 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  10. arXiv:2402.17785  [pdf, other

    cs.SD cs.AI eess.AS

    ByteComposer: a Human-like Melody Composition Method based on Language Model Agent

    Authors: Xia Liang, Xingjian Du, Jiaju Lin, Pei Zou, Yuan Wan, Bilei Zhu

    Abstract: Large Language Models (LLM) have shown encouraging progress in multimodal understanding and generation tasks. However, how to design a human-aligned and interpretable melody composition system is still under-explored. To solve this problem, we propose ByteComposer, an agent framework emulating a human's creative pipeline in four separate steps : "Conception Analysis - Draft Composition - Self-Eval… ▽ More

    Submitted 6 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  11. arXiv:2402.17281  [pdf, other

    eess.SP

    GAN Based Near-Field Channel Estimation for Extremely Large-Scale MIMO Systems

    Authors: Ming Ye, Xiao Liang, Cunhua Pan, Yinfei Xu, Ming Jiang, Chunguo Li

    Abstract: Extremely large-scale multiple-input-multiple-output (XL-MIMO) is a promising technique to achieve ultra-high spectral efficiency for future 6G communications. The mixed line-of-sight (LoS) and non-line-of-sight (NLoS) XL-MIMO near-field channel model is adopted to describe the XL-MIMO near-field channel accurately. In this paper, a generative adversarial network (GAN) variant based channel estima… ▽ More

    Submitted 17 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 13 pages, 9 figures, 3 tables, accepted by IEEE TGCN

  12. arXiv:2311.16149  [pdf, other

    physics.ins-det eess.IV physics.optics

    Development towards high-resolution kHz-speed rotation-free volumetric imaging

    Authors: Eleni Myrto Asimakopoulou, Valerio Bellucci, Sarlota Birnsteinova, Zisheng Yao, Yuhe Zhang, Ilia Petrov, Carsten Deiter, Andrea Mazzolari, Marco Romagnoni, Dusan Korytar, Zdenko Zaprazny, Zuzana Kuglerova, Libor Juha, Bratislav Lukic, Alexander Rack, Liubov Samoylova, Francisco Garcia Moreno, Stephen A Hall, Tillmann Neu, Xiaoyu Liang, Patrik Vagovic, Pablo Villanueva-Perez

    Abstract: X-ray multi-projection imaging (XMPI) provides rotation-free 3D movies of optically opaque samples. The absence of rotation enables superior imaging speed and preserves fragile sample dynamics by avoiding the shear forces introduced by conventional rotary tomography. Here, we present our XMPI observations at the ID19 beamline (ESRF, France) of 3D dynamics in melted aluminum with 1000 frames per se… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 12 pages, 7 figures

    Journal ref: Opt. Express 32, (2024), 4413-4426

  13. arXiv:2311.09775  [pdf, other

    cs.AR eess.SP

    MEGA: A Memory-Efficient GNN Accelerator Exploiting Degree-Aware Mixed-Precision Quantization

    Authors: Zeyu Zhu, Fanrong Li, Gang Li, Zejian Liu, Zitao Mo, Qinghao Hu, Xiaoyao Liang, Jian Cheng

    Abstract: Graph Neural Networks (GNNs) are becoming a promising technique in various domains due to their excellent capabilities in modeling non-Euclidean data. Although a spectrum of accelerators has been proposed to accelerate the inference of GNNs, our analysis demonstrates that the latency and energy consumption induced by DRAM access still significantly impedes the improvement of performance and energy… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 15pages, 22 figures. Accepted at HPCA 2024

  14. arXiv:2310.06291  [pdf, other

    eess.IV cs.CV physics.med-ph

    Three-Dimensional Medical Image Fusion with Deformable Cross-Attention

    Authors: Lin Liu, Xinxin Fan, Chulong Zhang, **g**g Dai, Yaoqin Xie, Xiaokun Liang

    Abstract: Multimodal medical image fusion plays an instrumental role in several areas of medical image processing, particularly in disease recognition and tumor detection. Traditional fusion methods tend to process each modality independently before combining the features and reconstructing the fusion image. However, this approach often neglects the fundamental commonalities and disparities between multimod… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  15. arXiv:2309.11656  [pdf, other

    cs.RO eess.SY

    Real-to-Sim Deformable Object Manipulation: Optimizing Physics Models with Residual Map**s for Robotic Surgery

    Authors: Xiao Liang, Fei Liu, Yutong Zhang, Yuelei Li, Shan Lin, Michael Yip

    Abstract: Accurate deformable object manipulation (DOM) is essential for achieving autonomy in robotic surgery, where soft tissues are being displaced, stretched, and dissected. Many DOM methods can be powered by simulation, which ensures realistic deformation by adhering to the governing physical constraints and allowing for model prediction and control. However, real soft objects in robotic surgery, such… ▽ More

    Submitted 29 May, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

  16. arXiv:2309.11655  [pdf, other

    cs.RO eess.SY

    Achieving Autonomous Cloth Manipulation with Optimal Control via Differentiable Physics-Aware Regularization and Safety Constraints

    Authors: Yutong Zhang, Fei Liu, Xiao Liang, Michael Yip

    Abstract: Cloth manipulation is a category of deformable object manipulation of great interest to the robotics community, from applications of automated laundry-folding and home organizing and cleaning to textiles and flexible manufacturing. Despite the desire for automated cloth manipulation, the thin-shell dynamics and under-actuation nature of cloth present significant challenges for robots to effectivel… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  17. arXiv:2308.16742  [pdf, other

    eess.IV cs.CV

    Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains

    Authors: Xuan Liu, Yaoqin Xie, Songhui Diao, Shan Tan, Xiaokun Liang

    Abstract: During the process of computed tomography (CT), metallic implants often cause disruptive artifacts in the reconstructed images, impeding accurate diagnosis. Several supervised deep learning-based approaches have been proposed for reducing metal artifacts (MAR). However, these methods heavily rely on training with simulated data, as obtaining paired metal artifact CT and clean CT data in clinical s… ▽ More

    Submitted 5 January, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

  18. arXiv:2308.09473  [pdf, other

    eess.IV

    INR-LDDMM: Fluid-based Medical Image Registration Integrating Implicit Neural Representation and Large Deformation Diffeomorphic Metric Map**

    Authors: Chulong Zhang, Xiaokun Liang

    Abstract: We propose a fluid-based registration framework of medical images based on implicit neural representation. By integrating implicit neural representation and Large Deformable Diffeomorphic Metric Map** (LDDMM), we employ a Multilayer Perceptron (MLP) as a velocity generator while optimizing velocity and image similarity. Moreover, we adopt a coarse-to-fine approach to address the challenge of def… ▽ More

    Submitted 27 November, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

  19. arXiv:2308.03006  [pdf

    cs.CV eess.IV

    High-Resolution Vision Transformers for Pixel-Level Identification of Structural Components and Damage

    Authors: Kareem Eltouny, Seyedomid Sajedi, Xiao Liang

    Abstract: Visual inspection is predominantly used to evaluate the state of civil structures, but recent developments in unmanned aerial vehicles (UAVs) and artificial intelligence have increased the speed, safety, and reliability of the inspection process. In this study, we develop a semantic segmentation network based on vision transformers and Laplacian pyramids scaling networks for efficiently parsing hi… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

  20. arXiv:2305.19621  [pdf, other

    eess.IV cs.CV physics.med-ph

    XTransCT: Ultra-Fast Volumetric CT Reconstruction using Two Orthogonal X-Ray Projections for Image-guided Radiation Therapy via a Transformer Network

    Authors: Chulong Zhang, Lin Liu, **g**g Dai, Xuan Liu, Wenfeng He, Yin** Chan, Yaoqin Xie, Feng Chi, Xiaokun Liang

    Abstract: Computed tomography (CT) scans offer a detailed, three-dimensional representation of patients' internal organs. However, conventional CT reconstruction techniques necessitate acquiring hundreds or thousands of x-ray projections through a complete rotational scan of the body, making navigation or positioning during surgery infeasible. In image-guided radiation therapy, a method that reconstructs ul… ▽ More

    Submitted 23 November, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  21. arXiv:2305.15887  [pdf, other

    eess.IV cs.CV

    Diffusion Probabilistic Priors for Zero-Shot Low-Dose CT Image Denoising

    Authors: Xuan Liu, Yaoqin Xie, Jun Cheng, Songhui Diao, Shan Tan, Xiaokun Liang

    Abstract: Denoising low-dose computed tomography (CT) images is a critical task in medical image computing. Supervised deep learning-based approaches have made significant advancements in this area in recent years. However, these methods typically require pairs of low-dose and normal-dose CT images for training, which are challenging to obtain in clinical settings. Existing unsupervised deep learning-based… ▽ More

    Submitted 13 July, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  22. arXiv:2305.14673  [pdf, other

    eess.IV cs.CV cs.LG

    ORRN: An ODE-based Recursive Registration Network for Deformable Respiratory Motion Estimation with Lung 4DCT Images

    Authors: Xiao Liang, Shan Lin, Fei Liu, Dimitri Schreiber, Michael Yip

    Abstract: Deformable Image Registration (DIR) plays a significant role in quantifying deformation in medical data. Recent Deep Learning methods have shown promising accuracy and speedup for registering a pair of medical images. However, in 4D (3D + time) medical data, organ motion, such as respiratory motion and heart beating, can not be effectively modeled by pair-wise methods as they were optimized for im… ▽ More

    Submitted 25 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted by IEEE Transactions on Biomedical Engineering

  23. arXiv:2304.03708  [pdf, other

    eess.IV cs.CV

    Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge

    Authors: Gongning Luo, Kuanquan Wang, Jun Liu, Shuo Li, Xinjie Liang, Xiangyu Li, Shaowei Gan, Wei Wang, Suyu Dong, Wenyi Wang, Pengxin Yu, Enyou Liu, Hongrong Wei, Na Wang, Jia Guo, Huiqi Li, Zhao Zhang, Ziwei Zhao, Na Gao, Nan An, Ashkan Pakzad, Bojidar Rangelov, Jiaqi Dou, Song Tian, Zeyu Liu , et al. (5 additional authors not shown)

    Abstract: Efficient automatic segmentation of multi-level (i.e. main and branch) pulmonary arteries (PA) in CTPA images plays a significant role in clinical applications. However, most existing methods concentrate only on main PA or branch PA segmentation separately and ignore segmentation efficiency. Besides, there is no public large-scale dataset focused on PA segmentation, which makes it highly challengi… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  24. Zero-shot Medical Image Translation via Frequency-Guided Diffusion Models

    Authors: Yunxiang Li, Hua-Chieh Shao, Xiao Liang, Liyuan Chen, Ruiqi Li, Steve Jiang, **g Wang, You Zhang

    Abstract: Recently, the diffusion model has emerged as a superior generative model that can produce high quality and realistic images. However, for medical image translation, the existing diffusion models are deficient in accurately retaining structural information since the structure details of source domain images are lost during the forward diffusion process and cannot be fully recovered through learned… ▽ More

    Submitted 27 October, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Journal ref: IEEE Transactions on Medical Imaging, 2023

  25. arXiv:2303.11692  [pdf, other

    cs.SD cs.IR eess.AS

    ByteCover3: Accurate Cover Song Identification on Short Queries

    Authors: Xingjian Du, Zijie Wang, Xia Liang, Huidong Liang, Bilei Zhu, Zejun Ma

    Abstract: Deep learning based methods have become a paradigm for cover song identification (CSI) in recent years, where the ByteCover systems have achieved state-of-the-art results on all the mainstream datasets of CSI. However, with the burgeon of short videos, many real-world applications require matching short music excerpts to full-length music tracks in the database, which is still under-explored and w… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepeted by ICASSP 2023

  26. arXiv:2302.08650  [pdf, other

    cs.SD cs.CV cs.LG eess.AS

    Gaussian-smoothed Imbalance Data Improves Speech Emotion Recognition

    Authors: Xuefeng Liang, Hexin Jiang, Wenxin Xu, Ying Zhou

    Abstract: In speech emotion recognition tasks, models learn emotional representations from datasets. We find the data distribution in the IEMOCAP dataset is very imbalanced, which may harm models to learn a better representation. To address this issue, we propose a novel Pairwise-emotion Data Distribution Smoothing (PDDS) method. PDDS considers that the distribution of emotional data should be smooth in rea… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 5 pages

  27. arXiv:2302.04330  [pdf, other

    eess.IV

    New starting point registration method for tagged MRI tongue motion estimation

    Authors: **glun Yu, Muhan Shao, Zhangxing Bian, Xiao Liang, Jiachen Zhuo, Maureen Stone, Jerry L. Prince

    Abstract: Accurate tongue motion estimation is essential for tongue function evaluation. The harmonic phase processing (HARP) method and the phase vector incompressible registration algorithm (PVIRA) based on HARP can generate motion estimates from tagged MRI images, but they suffer from tag jum** due to large motions. This paper proposes a new registration method by combining the stationary velocity fiel… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

  28. arXiv:2302.01493  [pdf

    eess.IV cs.CV physics.med-ph

    Deep Learning (DL)-based Automatic Segmentation of the Internal Pudendal Artery (IPA) for Reduction of Erectile Dysfunction in Definitive Radiotherapy of Localized Prostate Cancer

    Authors: Anjali Balagopal, Michael Dohopolski, Young Suk Kwon, Steven Montalvo, Howard Morgan, Ti Bai, Dan Nguyen, Xiao Liang, Xinran Zhong, Mu-Han Lin, Neil Desai, Steve Jiang

    Abstract: Background and purpose: Radiation-induced erectile dysfunction (RiED) is commonly seen in prostate cancer patients. Clinical trials have been developed in multiple institutions to investigate whether dose-sparing to the internal-pudendal-arteries (IPA) will improve retention of sexual potency. The IPA is usually not considered a conventional organ-at-risk (OAR) due to segmentation difficulty. In t… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

  29. arXiv:2301.03281  [pdf, other

    eess.IV cs.CV

    The state-of-the-art 3D anisotropic intracranial hemorrhage segmentation on non-contrast head CT: The INSTANCE challenge

    Authors: Xiangyu Li, Gongning Luo, Kuanquan Wang, Hongyu Wang, Jun Liu, Xinjie Liang, Jie Jiang, Zhenghao Song, Chunyue Zheng, Haokai Chi, Mingwang Xu, Yingte He, Xinghua Ma, **gwen Guo, Yifan Liu, Chuanpu Li, Zeli Chen, Md Mahfuzur Rahman Siddiquee, Andriy Myronenko, Antoine P. Sanner, Anirban Mukhopadhyay, Ahmed E. Othman, Xingyu Zhao, Wei** Liu, **huang Zhang , et al. (9 additional authors not shown)

    Abstract: Automatic intracranial hemorrhage segmentation in 3D non-contrast head CT (NCCT) scans is significant in clinical practice. Existing hemorrhage segmentation methods usually ignores the anisotropic nature of the NCCT, and are evaluated on different in-house datasets with distinct metrics, making it highly challenging to improve segmentation performance and perform objective comparisons among differ… ▽ More

    Submitted 12 January, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: Summarized paper for the MICCAI INSTANCE 2022 Challenge

  30. arXiv:2301.00833  [pdf, other

    eess.AS cs.SD physics.app-ph

    Hyperuniform disordered parametric loudspeaker array

    Authors: Kun Tang, Yuqi Wang, Shaobo Wang, Da Gao, Haojie Li, Xindong Liang, Patrick Sebbah, Yibin Li, ** Zhang, Junhui Shi

    Abstract: A steerable parametric loudspeaker array is known for its directivity and narrow beam width. However, it often suffers from the grating lobes due to periodic array distributions. Here we propose the array configuration of hyperuniform disorder, which is short-range random while correlated at large scales, as a promising alternative distribution of acoustic antennas in phased arrays. Angle-resolved… ▽ More

    Submitted 13 April, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

  31. arXiv:2211.02150  [pdf, ps, other

    cs.CV cs.AI eess.SP

    3D Reconstruction of Multiple Objects by mmWave Radar on UAV

    Authors: Yue Sun, Zhuoming Huang, Honggang Zhang, Xiaohui Liang

    Abstract: In this paper, we explore the feasibility of utilizing a mmWave radar sensor installed on a UAV to reconstruct the 3D shapes of multiple objects in a space. The UAV hovers at various locations in the space, and its onboard radar senor collects raw radar data via scanning the space with Synthetic Aperture Radar (SAR) operation. The radar data is sent to a deep neural network model, which outputs th… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

  32. arXiv:2210.12175  [pdf

    eess.IV cs.CV

    High-Fidelity Visual Structural Inspections through Transformers and Learnable Resizers

    Authors: Kareem Eltouny, Seyedomid Sajedi, Xiao Liang

    Abstract: Visual inspection is the predominant technique for evaluating the condition of civil infrastructure. The recent advances in unmanned aerial vehicles (UAVs) and artificial intelligence have made the visual inspections faster, safer, and more reliable. Camera-equipped UAVs are becoming the new standard in the industry by collecting massive amounts of visual data for human inspectors. Meanwhile, ther… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

  33. arXiv:2210.12173  [pdf

    eess.SP

    Cepstral Coefficients for Earthquake Damage Assessment of Bridges Leveraging Deep Learning

    Authors: Seyedomid Sajedi, Xiao Liang

    Abstract: Bridges are indispensable elements in resilient communities as essential parts of the lifeline transportation systems. Knowledge about the functionality of bridge structures is crucial, especially after a major earthquake event. In this study, we propose signal processing approaches for automated AI-equipped damage detection of bridges. Mel-scaled filter banks and cepstral coefficients are utilize… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

  34. arXiv:2208.10708  [pdf, other

    eess.SP cs.HC cs.LG

    Convolutional Neural Networks with A Topographic Representation Module for EEG-Based Brain-Computer Interfaces

    Authors: Xinbin Liang, Yaru Liu, Yang Yu, Kaixuan Liu, Yadong Liu, Zongtan Zhou

    Abstract: Objective: Convolutional Neural Networks (CNNs) have shown great potential in the field of Brain-Computer Interfaces (BCIs). The raw Electroencephalogram (EEG) signal is usually represented as 2-Dimensional (2-D) matrix composed of channels and time points, which ignores the spatial topological information. Our goal is to make the CNN with the raw EEG signal as input have the ability to learn EEG… ▽ More

    Submitted 30 August, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

  35. arXiv:2208.07655  [pdf, other

    eess.IV cs.CV cs.LG physics.med-ph

    A Hybrid Deep Feature-Based Deformable Image Registration Method for Pathology Images

    Authors: Chulong Zhang, Yuming Jiang, Na Li, Zhicheng Zhang, Md Tauhidul Islam, **g**g Dai, Lin Liu, Wenfeng He, Wenjian Qin, **g Xiong, Yaoqin Xie, Xiaokun Liang

    Abstract: Pathologists need to combine information from differently stained pathology slices for accurate diagnosis. Deformable image registration is a necessary technique for fusing multi-modal pathology slices. This paper proposes a hybrid deep feature-based deformable image registration framework for stained pathology samples. We first extract dense feature points via the detector-based and detector-free… ▽ More

    Submitted 10 April, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: 22 pages, 12 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  36. arXiv:2206.06623  [pdf, other

    eess.IV cs.CV

    ULTRA: Uncertainty-aware Label Distribution Learning for Breast Tumor Cellularity Assessment

    Authors: Xiangyu Li, Xinjie Liang, Gongning Luo, Wei Wang, Kuanquan Wang, Shuo Li

    Abstract: Neoadjuvant therapy (NAT) for breast cancer is a common treatment option in clinical practice. Tumor cellularity (TC), which represents the percentage of invasive tumors in the tumor bed, has been widely used to quantify the response of breast cancer to NAT. Therefore, automatic TC estimation is significant in clinical practice. However, existing state-of-the-art methods usually take it as a TC sc… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: Paper accepted by MICCAI 2022

  37. arXiv:2205.05250  [pdf, ps, other

    cs.LG cs.AI eess.SY

    Spatial-temporal associations representation and application for process monitoring using graph convolution neural network

    Authors: Hao Ren, Xiaojun Liang, Chunhua Yang, Zhiwen Chen, Weihua Gui

    Abstract: Thank you very much for the attention and concern of colleagues and scholars in this work. With the comments and guidance of experts, editors, and reviewers, this work has been accepted for publishing in the journal "Process Safety and Environmental Protection". The theme of this paper relies on the Spatial-temporal associations of numerous variables in the same industrial processes, which refers… ▽ More

    Submitted 5 October, 2023; v1 submitted 10 May, 2022; originally announced May 2022.

  38. arXiv:2204.03485   

    eess.SP eess.SY

    Nonlinear Kalman Filter Using Cramer Rao Bound

    Authors: Xin Liang, Yi Jiang

    Abstract: This paper studies the optimal state estimation for a dynamic system, whose transfer function can be nonlinear and the input noise can be of arbitrary distribution. Our algorithm differs from the conventional extended Kalman filter (EKF) and the particle filter (PF) in that it estimates not only the state vector but also the Cramer-Rao bound (CRB), which serves as an accuracy indicator. Combining… ▽ More

    Submitted 21 April, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: The algorithm description in section III is incomplete

  39. arXiv:2203.04295  [pdf, other

    eess.IV cs.CV

    Region Specific Optimization (RSO)-based Deep Interactive Registration

    Authors: Ti Bai, Muhan Lin, Xiao Liang, Biling Wang, Michael Dohopolski, Bin Cai, Dan Nguyen, Steve Jiang

    Abstract: Medical image registration is a fundamental and vital task which will affect the efficacy of many downstream clinical tasks. Deep learning (DL)-based deformable image registration (DIR) methods have been investigated, showing state-of-the-art performance. A test time optimization (TTO) technique was proposed to further improve the DL models' performance. Despite the substantial accuracy improvemen… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  40. arXiv:2202.13627  [pdf, ps, other

    cs.IT eess.SP

    Changeable Rate and Novel Quantization for CSI Feedback Based on Deep Learning

    Authors: Xin Liang, Haoran Chang, Haozhen Li, Xinyu Gu, Lin Zhang

    Abstract: Deep learning (DL)-based channel state information (CSI) feedback improves the capacity and energy efficiency of massive multiple-input multiple-output (MIMO) systems in frequency division duplexing mode. However, multiple neural networks with different lengths of feedback overhead are required by time-varying bandwidth resources. The storage space required at the user equipment (UE) and the base… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

  41. arXiv:2112.14839  [pdf, ps, other

    eess.SY cs.AI physics.data-an

    An overview of the quantitative causality analysis and causal graph reconstruction based on a rigorous formalism of information flow

    Authors: X. San Liang

    Abstract: Inference of causal relations from data now has become an important field in artificial intelligence. During the past 16 years, causality analysis (in a quantitative sense) has been developed independently in physics from first principles. This short note is a brief summary of this line of work, including part of the theory and several representative applications.

    Submitted 31 December, 2021; originally announced December 2021.

    Comments: 7 pages, 1 figure. Presented at the First International AIxIA Workshop on Causality, Causal-ITALY, Italian Conference on Artificial Intelligence, November 30, 2021

  42. arXiv:2111.07454  [pdf, other

    cs.CL cs.SD eess.AS

    Towards Interpretability of Speech Pause in Dementia Detection using Adversarial Learning

    Authors: Youxiang Zhu, Bang Tran, Xiaohui Liang, John A. Batsis, Robert M. Roth

    Abstract: Speech pause is an effective biomarker in dementia detection. Recent deep learning models have exploited speech pauses to achieve highly accurate dementia detection, but have not exploited the interpretability of speech pauses, i.e., what and how positions and lengths of speech pauses affect the result of dementia detection. In this paper, we will study the positions and lengths of dementia-sensit… ▽ More

    Submitted 14 November, 2021; originally announced November 2021.

  43. Whole Brain Segmentation with Full Volume Neural Network

    Authors: Yeshu Li, Jonathan Cui, Yilun Sheng, Xiao Liang, **gdong Wang, Eric I-Chao Chang, Yan Xu

    Abstract: Whole brain segmentation is an important neuroimaging task that segments the whole brain volume into anatomically labeled regions-of-interest. Convolutional neural networks have demonstrated good performance in this task. Existing solutions, usually segment the brain image by classifying the voxels, or labeling the slices or the sub-volumes separately. Their representation learning is based on par… ▽ More

    Submitted 29 October, 2021; originally announced October 2021.

    Comments: Accepted to CMIG

    Journal ref: Computerized Medical Imaging and Graphics, Volume 93, October 2021, 101991

  44. arXiv:2110.10675  [pdf, ps, other

    eess.SP

    The First Airborne Experiment of Sparse Microwave Imaging: Prototype System Design and Result Analysis

    Authors: Zhe Zhang, Bingchen Zhang, Chenglong Jiang, Xingdong Liang, Longyong Chen, Wen Hong, Yirong Wu

    Abstract: In this paper we report the first airborne experiments of sparse microwave imaging, conducted in September 2013 and May 2014, using our prototype sparse microwave imaging radar system. This is the first reported imaging radar system and airborne experiment that specially designed for sparse microwave imaging. Sparse microwave imaging is a novel concept of radar imaging, it is mainly the combinatio… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

  45. arXiv:2110.06142  [pdf, ps, other

    eess.SP eess.IV

    CSI Sensing and Feedback: A Semi-Supervised Learning Approach

    Authors: Haozhen Li, Boyuan Zhang, Xin Liang, Haoran Chang, Xinyu Gu, Lin Zhang

    Abstract: Deep learning-based (DL-based) channel state information (CSI) feedback for a Massive multiple-input multiple-output (MIMO) system has proved to be a creative and efficient application. However, the existing systems ignored the wireless channel environment variation sensing, e.g., indoor and outdoor scenarios. Moreover, systems training requires excess pre-labeled CSI data, which is often unavaila… ▽ More

    Submitted 26 September, 2021; originally announced October 2021.

  46. arXiv:2109.09161  [pdf, other

    cs.CL eess.AS

    Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition

    Authors: Guolin Zheng, Yubei Xiao, Ke Gong, Pan Zhou, Xiaodan Liang, Liang Lin

    Abstract: Unifying acoustic and linguistic representation learning has become increasingly crucial to transfer the knowledge learned on the abundance of high-resource language data for low-resource speech recognition. Existing approaches simply cascade pre-trained acoustic and language models to learn the transfer from speech to text. However, how to solve the representation discrepancy of speech and text i… ▽ More

    Submitted 9 October, 2021; v1 submitted 19 September, 2021; originally announced September 2021.

  47. arXiv:2106.11411  [pdf, other

    cs.SD eess.AS

    Attention-based cross-modal fusion for audio-visual voice activity detection in musical video streams

    Authors: Yuanbo Hou, Zhesong Yu, Xia Liang, Xingjian Du, Bilei Zhu, Zejun Ma, Dick Botteldooren

    Abstract: Many previous audio-visual voice-related works focus on speech, ignoring the singing voice in the growing number of musical video streams on the Internet. For processing diverse musical video data, voice activity detection is a necessary step. This paper attempts to detect the speech and singing voices of target performers in musical video streams using audiovisual information. To integrate inform… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: Accepted by INTERSPEECH 2021

  48. arXiv:2103.00634  [pdf, other

    eess.IV physics.med-ph

    TransCT: Dual-path Transformer for Low Dose Computed Tomography

    Authors: Zhicheng Zhang, Lequan Yu, Xiaokun Liang, Wei Zhao, Lei Xing

    Abstract: Low dose computed tomography (LDCT) has attracted more and more attention in routine clinical diagnosis assessment, therapy planning, etc., which can reduce the dose of X-ray radiation to patients. However, the noise caused by low X-ray exposure degrades the CT image quality and then affects clinical diagnosis accuracy. In this paper, we train a transformer-based neural network to enhance the fina… ▽ More

    Submitted 5 July, 2021; v1 submitted 28 February, 2021; originally announced March 2021.

  49. arXiv:2012.14982  [pdf, other

    cs.LG eess.SP

    Elastic Net based Feature Ranking and Selection

    Authors: Shaode Yu, Haobo Chen, Hang Yu, Zhicheng Zhang, Xiaokun Liang, Wenjian Qin, Yaoqin Xie, ** Shi

    Abstract: Feature selection is important in data representation and intelligent diagnosis. Elastic net is one of the most widely used feature selectors. However, the features selected are dependant on the training data, and their weights dedicated for regularized regression are irrelevant to their importance if used for feature ranking, that degrades the model interpretability and extension. In this study,… ▽ More

    Submitted 29 December, 2020; originally announced December 2020.

  50. arXiv:2012.11896  [pdf, other

    cs.CL cs.SD eess.AS

    Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition

    Authors: Yubei Xiao, Ke Gong, Pan Zhou, Guolin Zheng, Xiaodan Liang, Liang Lin

    Abstract: Low-resource automatic speech recognition (ASR) is challenging, as the low-resource target language data cannot well train an ASR model. To solve this issue, meta-learning formulates ASR for each source language into many small ASR tasks and meta-learns a model initialization on all tasks from different source languages to access fast adaptation on unseen target languages. However, for different s… ▽ More

    Submitted 12 April, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: accepted in AAAI2021