Skip to main content

Showing 1–50 of 223 results for author: Yang, G

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19043  [pdf

    eess.IV cs.AI cs.CV cs.DB

    CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

    Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Ya**g Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, **g Qin, Xiahai Zhuang, Claudia Prieto , et al. (7 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover h… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 2 tables

  2. arXiv:2406.17173  [pdf, other

    eess.IV cs.CV cs.LG

    Diff3Dformer: Leveraging Slice Sequence Diffusion for Enhanced 3D CT Classification with Transformer Networks

    Authors: Zihao **, Yingying Fang, Jiahao Huang, Caiwen Xu, Simon Walsh, Guang Yang

    Abstract: The manifestation of symptoms associated with lung diseases can vary in different depths for individual patients, highlighting the significance of 3D information in CT scans for medical image classification. While Vision Transformer has shown superior performance over convolutional neural networks in image classification tasks, their effectiveness is often demonstrated on sufficiently large 2D dat… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: conference

  3. arXiv:2406.16189  [pdf, other

    eess.IV cs.CV

    Fuzzy Attention-based Border Rendering Network for Lung Organ Segmentation

    Authors: Sheng Zhang, Yang Nan, Yingying Fang, Shiyi Wang, Xiaodan Xing, Zhifan Gao, Guang Yang

    Abstract: Automatic lung organ segmentation on CT images is crucial for lung disease diagnosis. However, the unlimited voxel values and class imbalance of lung organs can lead to false-negative/positive and leakage issues in advanced methods. Additionally, some slender lung organs are easily lost during the recycled down/up-sample procedure, e.g., bronchioles & arterioles, causing severe discontinuity issue… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: MICCAI 2024

  4. arXiv:2406.15752  [pdf, other

    eess.AS cs.AI cs.CL

    TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech Synthesizers

    Authors: Yakun Song, Zhuo Chen, Xiaofei Wang, Ziyang Ma, Guanrou Yang, Xie Chen

    Abstract: Neural codec language model (LM) has demonstrated strong capability in zero-shot text-to-speech (TTS) synthesis. However, the codec LM often suffers from limitations in inference speed and stability, due to its auto-regressive nature and implicit alignment between text and audio. In this work, to handle these challenges, we introduce a new variant of neural codec LM, namely TacoLM. Specifically, T… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: INTERSPEECH 2024

  5. arXiv:2406.13788  [pdf, other

    eess.SP

    Groupwise Deformable Registration of Diffusion Tensor Cardiovascular Magnetic Resonance: Disentangling Diffusion Contrast, Respiratory and Cardiac Motions

    Authors: Fanwen Wang, Yihao Luo, Ke Wen, Jiahao Huang, Pedro F. Ferreira, Yaqing Luo, Yinzhe Wu, Camila Munoz, Dudley J. Pennell, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Diffusion tensor based cardiovascular magnetic resonance (DT-CMR) offers a non-invasive method to visualize the myocardial microstructure. With the assumption that the heart is stationary, frames are acquired with multiple repetitions for different diffusion encoding directions. However, motion from poor breath-holding and imprecise cardiac triggering complicates DT-CMR analysis, further challenge… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted by MICCAI 2024

  6. arXiv:2406.13708  [pdf

    eess.IV physics.med-ph

    Low-rank based motion correction followed by automatic frame selection in DT-CMR

    Authors: Fanwen Wang, Pedro F. Ferreira, Camila Munoz, Ke Wen, Yaqing Luo, Jiahao Huang, Yinzhe Wu, Dudley J. Pennell, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Motivation: Post-processing of in-vivo diffusion tensor CMR (DT-CMR) is challenging due to the low SNR and variation in contrast between frames which makes image registration difficult, and the need to manually reject frames corrupted by motion. Goals: To develop a semi-automatic post-processing pipeline for robust DT-CMR registration and automatic frame selection. Approach: We used low intrinsic… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted as ISMRM 2024 Digital poster 2141

    Journal ref: ISMRM 2024 Digital poster 2141

  7. arXiv:2406.08887  [pdf, other

    eess.SP

    Low-Overhead Channel Estimation via 3D Extrapolation for TDD mmWave Massive MIMO Systems Under High-Mobility Scenarios

    Authors: Binggui Zhou, Xi Yang, Shaodan Ma, Feifei Gao, Guanghua Yang

    Abstract: In TDD mmWave massive MIMO systems, the downlink CSI can be attained through uplink channel estimation thanks to the uplink-downlink channel reciprocity. However, the channel aging issue is significant under high-mobility scenarios and thus necessitates frequent uplink channel estimation. In addition, large amounts of antennas and subcarriers lead to high-dimensional CSI matrices, aggravating the… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 13 pages, 11 figures, 3 tables. This paper has been submitted to IEEE journal for possible publication

  8. arXiv:2406.05839  [pdf, other

    eess.AS cs.AI

    MaLa-ASR: Multimedia-Assisted LLM-Based ASR

    Authors: Guanrou Yang, Ziyang Ma, Fan Yu, Zhifu Gao, Shiliang Zhang, Xie Chen

    Abstract: As more and more information-rich data like video become available, utilizing multi-modal auxiliary information to enhance audio tasks has sparked widespread research interest. The recent surge in research on LLM-based audio models provides fresh perspectives for tackling audio tasks. Given that LLM can flexibly ingest multiple inputs, we propose MaLa-ASR, an LLM-based ASR model that can integrate… ▽ More

    Submitted 13 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  9. arXiv:2405.17659  [pdf, other

    eess.IV cs.CV

    Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba

    Authors: Jiahao Huang, Liutao Yang, Fanwen Wang, Yang Nan, Weiwen Wu, Chengyan Wang, Kuangyu Shi, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang

    Abstract: Deep learning has been extensively applied in medical image reconstruction, where Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) represent the predominant paradigms, each possessing distinct advantages and inherent limitations: CNNs exhibit linear complexity with local sensitivity, whereas ViTs demonstrate quadratic complexity with global sensitivity. The emerging Mamba has sh… ▽ More

    Submitted 25 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  10. arXiv:2405.15241  [pdf, other

    eess.IV cs.CV

    Blaze3DM: Marry Triplane Representation with Diffusion for 3D Medical Inverse Problem Solving

    Authors: Jia He, Bonan Li, Ge Yang, Ziwen Liu

    Abstract: Solving 3D medical inverse problems such as image restoration and reconstruction is crucial in modern medical field. However, the curse of dimensionality in 3D medical data leads mainstream volume-wise methods to suffer from high resource consumption and challenges models to successfully capture the natural distribution, resulting in inevitable volume inconsistency and artifacts. Some recent works… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  11. arXiv:2405.09443  [pdf, other

    cs.IT eess.SP

    Low-Complexity Joint Azimuth-Range-Velocity Estimation for Integrated Sensing and Communication with OFDM Waveform

    Authors: Jun Zhang, Gang Yang, Qibin Ye, Yixuan Huang, Su Hu

    Abstract: Integrated sensing and communication (ISAC) is a main application scenario of the sixth-generation mobile communication systems. Due to the fast-growing number of antennas and subcarriers in cellular systems, the computational complexity of joint azimuth-range-velocity estimation (JARVE) in ISAC systems is extremely high. This paper studies the JARVE problem for a monostatic ISAC system with ortho… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 16 pages, 12 figures, submitted to IEEE journal

  12. arXiv:2405.05030  [pdf

    eess.SY

    Functional Specifications and Testing Requirements of Grid-Forming Type-IV Offshore Wind Power

    Authors: Sulav Ghimire, Gabriel M. G. Guerreiro, Kanakesh V. K., Emerson D. Guest, Kim H. Jensen, Guangya Yang, Xiongfei Wang

    Abstract: Throughout the past few years, various transmission system operators (TSOs) and research institutes have defined several functional specifications for grid-forming (GFM) converters via grid codes, white papers, and technical documents. These institutes and organisations also proposed testing requirements for general inverter-based resources (IBRs) and specific GFM converters. This paper initially… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  13. arXiv:2404.01082  [pdf, other

    eess.IV

    The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023

    Authors: Jun Lyu, Chen Qin, Shuo Wang, Fanwen Wang, Yan Li, Zi Wang, Kunyuan Guo, Cheng Ouyang, Michael Tänzer, Meng Liu, Longyu Sun, Mengting Sun, Qin Li, Zhang Shi, Sha Hua, Hao Li, Zhensen Chen, Zhenlin Zhang, Bingyu Xin, Dimitris N. Metaxas, George Yiasemis, Jonas Teuwen, Li** Zhang, Weitian Chen, Yidong Zhao , et al. (25 additional authors not shown)

    Abstract: Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation p… ▽ More

    Submitted 16 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 25 pages, 17 figures

  14. arXiv:2404.00598  [pdf, other

    cs.IT eess.SP

    Robust Beamforming Design and Antenna Selection for Dynamic HRIS-aided Massive MIMO Systems

    Authors: **tao Wang, Binggui Zhou, Chengzhi Ma, Shiqi Gong, Guanghua Yang, Shaodan Ma

    Abstract: In this paper, a dynamic hybrid active-passive reconfigurable intelligent surface (HRIS) is proposed to further enhance the massive multiple-input-multiple-output (MIMO) system, since it supports the dynamic placement of active and passive elements. Specifically, considering the impact of the hardware impairments (HWIs), we investigate the channel-aware configuration of the receive antennas at the… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 5 pages, 2 figures

  15. arXiv:2403.05236  [pdf, other

    eess.SY

    Fault Recovery and Transient Stability of Grid-Forming Converters Equipped with Current Saturation

    Authors: Ali Arjomandi-Nezhad, Yifei Guo, Bikash C. Pal, Guangya Yang

    Abstract: When grid-forming (GFM) inverter-based resources (IBRs) experience large grid disturbances (e.g., short-circuit faults), the current limiter may be triggered and GFM IBRs enter the current saturation mode, inducing nonlinear dynamical behaviors and imposing great challenges to the post-disturbance transient angle stability. This paper presents a systematic study to reveal the fault recovery behavi… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 10 pages, 22 figures

  16. arXiv:2403.03809  [pdf, other

    eess.SP

    Variational Bayesian Learning based Joint Localization and Channel Estimation with Distance-dependent Noise

    Authors: Yunfei Li, Yiting Luo, Weiqiang Tan, Chunguo Li, Shaodan Ma, Guanghua Yang

    Abstract: In the Industrial Internet of Things (IIoTs) and Ocean of Things (OoTs), the advent of massive intelligent services has imposed stringent requirements on both communication and localization, particularly emphasizing precise localization and channel information. This paper focuses on the challenge of jointly optimizing localization and communication in IoT networks. Departing from the conventional… ▽ More

    Submitted 6 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  17. arXiv:2403.01093  [pdf, other

    eess.SP

    Variational Bayesian Learning Based Localization and Channel Reconstruction in RIS-aided Systems

    Authors: Yunfei Li, Yiting Luo, Xianda Wu, Zheng Shi, Shaodan Ma, Guanghua Yang

    Abstract: The emerging immersive and autonomous services have posed stringent requirements on both communications and localization. By considering the great potential of reconfigurable intelligent surface (RIS), this paper focuses on the joint channel estimation and localization for RIS-aided wireless systems. As opposed to existing works that treat channel estimation and localization independently, this pa… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  18. arXiv:2402.18451  [pdf, other

    eess.IV cs.CV

    MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation

    Authors: Jiahao Huang, Liutao Yang, Fanwen Wang, Yang Nan, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang

    Abstract: The recent Mamba model has shown remarkable adaptability for visual representation learning, including in medical imaging tasks. This study introduces MambaMIR, a Mamba-based model for medical image reconstruction, as well as its Generative Adversarial Network-based variant, MambaMIR-GAN. Our proposed MambaMIR inherits several advantages, such as linear complexity, global receptive fields, and dyn… ▽ More

    Submitted 25 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  19. arXiv:2402.15939  [pdf

    eess.IV cs.LG

    Deep Separable Spatiotemporal Learning for Fast Dynamic Cardiac MRI

    Authors: Zi Wang, Min Xiao, Yirong Zhou, Chengyan Wang, Naiming Wu, Yi Li, Yiwen Gong, Shufu Chang, Yinyin Chen, Liuhong Zhu, Jianjun Zhou, Congbo Cai, He Wang, Di Guo, Guang Yang, Xiaobo Qu

    Abstract: Dynamic magnetic resonance imaging (MRI) plays an indispensable role in cardiac diagnosis. To enable fast imaging, the k-space data can be undersampled but the image reconstruction poses a great challenge of high-dimensional processing. This challenge leads to necessitate extensive training data in many deep learning reconstruction methods. This work proposes a novel and efficient approach, levera… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

    Comments: 10 pages, 11 figures, 3 tables

  20. arXiv:2402.14317  [pdf, other

    eess.SY

    Oscillations between Grid-Forming Converters in Weakly Connected Offshore WPPs

    Authors: Sulav Ghimire, Kanakesh V. Kkuni, Gabriel M. G. Guerreiro, Emerson D. Guest, Kim H. Jensen, Guangya Yang

    Abstract: This paper studies control interactions between grid-forming (GFM) converters exhibited by power and frequency oscillations in a weakly connected offshore wind power plant (WPP). Two GFM controls are considered, namely virtual synchronous machine (VSM) and virtual admittance (VAdm) based GFM. The GFM control methods are implemented in wind turbine generators (WTGs) of a verified aggregated model o… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  21. In-Vivo Hyperspectral Human Brain Image Database for Brain Cancer Detection

    Authors: H. Fabelo, S. Ortega, A. Szolna, D. Bulters, J. F. Pineiro, S. Kabwama, A. Shanahan, H. Bulstrode, S. Bisshopp, B. R. Kiran, D. Ravi, R. Lazcano, D. Madronal, C. Sosa, C. Espino, M. Marquez, M. De la Luz Plaza, R. Camacho, D. Carrera, M. Hernandez, G. M. Callico, J. Morera, B. Stanciulescu, G. Z. Yang, R. Salvador , et al. (3 additional authors not shown)

    Abstract: The use of hyperspectral imaging for medical applications is becoming more common in recent years. One of the main obstacles that researchers find when develo** hyperspectral algorithms for medical applications is the lack of specific, publicly available, and hyperspectral medical data. The work described in this paper was developed within the framework of the European project HELICoiD (HypErspe… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 19 pages, 12 figures

    Journal ref: IEEE Access, 2019, 7, pp. 39098 39116

  22. arXiv:2402.08846  [pdf, other

    cs.CL cs.AI cs.MM cs.SD eess.AS

    An Embarrassingly Simple Approach for LLM with Strong ASR Capacity

    Authors: Ziyang Ma, Guanrou Yang, Yifan Yang, Zhifu Gao, Jiaming Wang, Zhihao Du, Fan Yu, Qian Chen, Siqi Zheng, Shiliang Zhang, Xie Chen

    Abstract: In this paper, we focus on solving one of the most important tasks in the field of speech processing, i.e., automatic speech recognition (ASR), with speech foundation encoders and large language models (LLM). Recent works have complex designs such as compressing the output temporally for the speech encoder, tackling modal alignment for the projector, and utilizing parameter-efficient fine-tuning f… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Working in progress and will open-source soon

  23. arXiv:2402.07403  [pdf

    cs.CV eess.IV

    Make it more specific: A novel uncertainty based airway segmentation application on 3D U-Net and its variants

    Authors: Shiyi Wang, Yang Nan, Felder Federico N, Sheng Zhang, Walsh Simon L F, Guang Yang

    Abstract: Each medical segmentation task should be considered with a specific AI algorithm based on its scenario so that the most accurate prediction model can be obtained. The most popular algorithms in medical segmentation, 3D U-Net and its variants, can directly implement the task of lung trachea segmentation, but its failure to consider the special tree-like structure of the trachea suggests that there… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  24. Spatio-spectral classification of hyperspectral images for brain cancer detection during surgical operations

    Authors: H. Fabelo, S. Ortega, D. Ravi, B. R. Kiran, C. Sosa, D. Bulters, G. M. Callico, H. Bulstrode, A. Szolna, J. F. Pineiro, S. Kabwama, D. Madronal, R. Lazcano, A. J. OShanahan, S. Bisshopp, M. Hernandez, A. Baez-Quevedo, G. Z. Yang, B. Stanciulescu, R. Salvador, E. Juarez, R. Sarmiento

    Abstract: Surgery for brain cancer is a major problem in neurosurgery. The diffuse infiltration into the surrounding normal brain by these tumors makes their accurate identification by the naked eye difficult. Since surgery is the common treatment for brain cancer, an accurate radical resection of the tumor leads to improved survival rates for patients. However, the identification of the tumor boundaries du… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  25. arXiv:2402.03473  [pdf, other

    eess.IV cs.CV

    Assessing the Efficacy of Invisible Watermarks in AI-Generated Medical Images

    Authors: Xiaodan Xing, Huiyu Zhou, Yingying Fang, Guang Yang

    Abstract: AI-generated medical images are gaining growing popularity due to their potential to address the data scarcity challenge in the real world. However, the issue of accurate identification of these synthetic images, particularly when they exhibit remarkable realism with their real copies, remains a concern. To mitigate this challenge, image generators such as DALLE and Imagen, have integrated digital… ▽ More

    Submitted 21 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 5 pages

    Journal ref: ISBI 2024

  26. arXiv:2401.16564  [pdf

    eess.SP

    Data and Physics driven Deep Learning Models for Fast MRI Reconstruction: Fundamentals and Methodologies

    Authors: Jiahao Huang, Yinzhe Wu, Fanwen Wang, Yingying Fang, Yang Nan, Cagan Alkan, Lei Xu, Zhifan Gao, Weiwen Wu, Lei Zhu, Zhaolin Chen, Peter Lally, Neal Bangerter, Kawin Setsompop, Yike Guo, Daniel Rueckert, Ge Wang, Guang Yang

    Abstract: Magnetic Resonance Imaging (MRI) is a pivotal clinical diagnostic tool, yet its extended scanning times often compromise patient comfort and image quality, especially in volumetric, temporal and quantitative scans. This review elucidates recent advances in MRI acceleration via data and physics-driven models, leveraging techniques from algorithm unrolling models, enhancement-based models, and plug-… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  27. arXiv:2401.13564  [pdf, ps, other

    cs.IT eess.SP

    RIS Empowered Near-Field Covert Communications

    Authors: Jun Liu, Gang Yang, Yuanwei Liu, Xiangyun Zhou

    Abstract: This paper studies an extremely large-scale reconfigurable intelligent surface (XL-RIS) empowered covert communication system in the near-field region. Alice covertly transmits messages to Bob with the assistance of the XL-RIS, while evading detection by Willie. To enhance the covert communication performance, we maximize the achievable covert rate by jointly optimizing the hybrid analog and digit… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 15 pages, 8 figures, submitted to IEEE journal

  28. arXiv:2312.13752  [pdf

    eess.IV cs.AI cs.CV

    Hunting imaging biomarkers in pulmonary fibrosis: Benchmarks of the AIIB23 challenge

    Authors: Yang Nan, Xiaodan Xing, Shiyi Wang, Zeyu Tang, Federico N Felder, Sheng Zhang, Roberta Eufrasia Ledda, Xiaoliu Ding, Ruiqi Yu, Wei** Liu, Feng Shi, Tianyang Sun, Zehong Cao, Minghui Zhang, Yun Gu, Hanxiao Zhang, Jian Gao, **yu Wang, Wen Tang, Pengxin Yu, Han Kang, Junqiang Chen, Xing Lu, Boyu Zhang, Michail Mamalakis , et al. (16 additional authors not shown)

    Abstract: Airway-related quantitative imaging biomarkers are crucial for examination, diagnosis, and prognosis in pulmonary diseases. However, the manual delineation of airway trees remains prohibitively time-consuming. While significant efforts have been made towards enhancing airway modelling, current public-available datasets concentrate on lung diseases with moderate morphological variations. The intric… ▽ More

    Submitted 16 April, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: 19 pages

  29. arXiv:2312.13154  [pdf, other

    eess.SP

    Joint Range-Velocity-Azimuth Estimation for OFDM-Based Integrated Sensing and Communication

    Authors: Zelin Hu, Qibin Ye, Yixuan Huang, Su Hu, Gang Yang

    Abstract: Orthogonal frequency division multiplexing (OFDM)-based integrated sensing and communication (ISAC) is promising for future sixth-generation mobile communication systems. Existing works focus on the joint estimation of the targets' range and velocity for OFDM-based ISAC systems. In contrast, this paper studies the three-dimensional joint estimation (3DJE) of range, velocity, and azimuth for OFDM-b… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: This manuscript has been submitted to the IEEE journal in 09-Aug-2023

  30. arXiv:2312.12824  [pdf, other

    eess.IV cs.CV

    FedSODA: Federated Cross-assessment and Dynamic Aggregation for Histopathology Segmentation

    Authors: Yuan Zhang, Yaolei Qi, Xiaoming Qi, Lotfi Senhadji, Yongyue Wei, Feng Chen, Guanyu Yang

    Abstract: Federated learning (FL) for histopathology image segmentation involving multiple medical sites plays a crucial role in advancing the field of accurate disease diagnosis and treatment. However, it is still a task of great challenges due to the sample imbalance across clients and large data heterogeneity from disparate organs, variable segmentation tasks, and diverse distribution. Thus, we propose a… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP2024

  31. arXiv:2312.04377  [pdf, other

    cs.IT eess.SP

    HARQ-IR Aided Short Packet Communications: BLER Analysis and Throughput Maximization

    Authors: Fuchao He, Zheng Shi, Guanghua Yang, Xiaofan Li, Xinrong Ye, Shaodan Ma

    Abstract: This paper introduces hybrid automatic repeat request with incremental redundancy (HARQ-IR) to boost the reliability of short packet communications. The finite blocklength information theory and correlated decoding events tremendously preclude the analysis of average block error rate (BLER). Fortunately, the recursive form of average BLER motivates us to calculate its value through the trapezoidal… ▽ More

    Submitted 9 January, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 13 pages, 10 figures

  32. arXiv:2312.04062  [pdf, other

    cs.IT cs.AI eess.SP

    A Low-Overhead Incorporation-Extrapolation based Few-Shot CSI Feedback Framework for Massive MIMO Systems

    Authors: Binggui Zhou, Xi Yang, **tao Wang, Shaodan Ma, Feifei Gao, Guanghua Yang

    Abstract: Accurate channel state information (CSI) is essential for downlink precoding in frequency division duplexing (FDD) massive multiple-input multiple-output (MIMO) systems with orthogonal frequency-division multiplexing (OFDM). However, obtaining CSI through feedback from the user equipment (UE) becomes challenging with the increasing scale of antennas and subcarriers and leads to extremely high CSI… ▽ More

    Submitted 21 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 16 pages, 12 figures, 5 tables. Accepted by IEEE Transactions on Wireless Communications

  33. arXiv:2311.15785  [pdf, other

    eess.SY

    Nonlinear Stability Boundary Assessment of Multi-Converter Systems Based On Reverse Time Trajectory

    Authors: Sujay Ghosh, Mohammad Kazem Bakhshizadeh, Guangya Yang, Łukasz Kocewiak

    Abstract: As the integration of wind power accelerates, wind power plants (WPPs) are expected to play a crucial role in ensuring stability in future power grids. This paper examines the nonlinear stability boundary of a multi-converter system in a wind power plant (WPP) connected to an AC power grid via a long HVAC cable. Traditionally, for nonlinear analysis of WPPs, a simplification is adopted wherein the… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  34. arXiv:2311.09462  [pdf, other

    eess.SY

    Software-Defined Virtual Synchronous Condenser

    Authors: Zimin Jiang, Peng Zhang, Yifan Zhou, Łukasz Kocewiak, Divya Kurthakoti Chandrashekhara, Marie-Lou Picherit, Zefan Tang, Kenneth B. Bowes, Guangya Yang

    Abstract: Synchronous condensers (SCs) play important roles in integrating wind energy into relatively weak power grids. However, the design of SCs usually depends on specific application requirements and may not be adaptive enough to the frequently-changing grid conditions caused by the transition from conventional to renewable power generation. This paper devises a software-defined virtual synchronous con… ▽ More

    Submitted 17 November, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

  35. arXiv:2311.07991  [pdf, other

    eess.SY

    Nonlinear Stability Boundary Assessment Of Wind Power Plants Based on Reverse-Time Trajectory

    Authors: Sujay Ghosh, Mohammad Kazem Bakhshizadeh, Guangya Yang, Lukasz Kocewiak

    Abstract: This letter determines the nonlinear stability boundary of a wind power plant (WPP) connected to an AC power grid via a long HVAC cable. The analysis focuses on the slow Phase-Locked Loop (PLL) dynamics, with an assumption that the fast current control dynamics can be neglected. To begin, we propose an aggregated reduced-order wind turbine model. This aggregation can be applied up to a limited fre… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  36. arXiv:2311.06552  [pdf, other

    eess.IV cs.CV cs.LG

    Stain Consistency Learning: Handling Stain Variation for Automatic Digital Pathology Segmentation

    Authors: Michael Yeung, Todd Watts, Sean YW Tan, Pedro F. Ferreira, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Stain variation is a unique challenge associated with automated analysis of digital pathology. Numerous methods have been developed to improve the robustness of machine learning methods to stain variation, but comparative studies have demonstrated limited benefits to performance. Moreover, methods to handle stain variation were largely developed for H&E stained data, with evaluation generally limi… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

  37. arXiv:2311.01066  [pdf, other

    eess.IV cs.CV

    Dynamic Multimodal Information Bottleneck for Multimodality Classification

    Authors: Yingying Fang, Shuang Wu, Sheng Zhang, Chaoyan Huang, Tieyong Zeng, Xiaodan Xing, Simon Walsh, Guang Yang

    Abstract: Effectively leveraging multimodal data such as various images, laboratory tests and clinical information is gaining traction in a variety of AI-based medical diagnosis and prognosis tasks. Most existing multi-modal techniques only focus on enhancing their performance by leveraging the differences or shared features from various modalities and fusing feature across different modalities. These appro… ▽ More

    Submitted 25 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: WACV 2024

  38. arXiv:2310.20389  [pdf

    eess.IV cs.CV

    High-Resolution Reference Image Assisted Volumetric Super-Resolution of Cardiac Diffusion Weighted Imaging

    Authors: Yinzhe Wu, Jiahao Huang, Fanwen Wang, Pedro Ferreira, Andrew Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Diffusion Tensor Cardiac Magnetic Resonance (DT-CMR) is the only in vivo method to non-invasively examine the microstructure of the human heart. Current research in DT-CMR aims to improve the understanding of how the cardiac microstructure relates to the macroscopic function of the healthy heart as well as how microstructural dysfunction contributes to disease. To get the final DT-CMR metrics, we… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted by SPIE Medical Imaging 2024

  39. arXiv:2310.18346  [pdf, other

    eess.IV cs.CV cs.LG

    Data-Free Distillation Improves Efficiency and Privacy in Federated Thorax Disease Analysis

    Authors: Ming Li, Guang Yang

    Abstract: Thorax disease analysis in large-scale, multi-centre, and multi-scanner settings is often limited by strict privacy policies. Federated learning (FL) offers a potential solution, while traditional parameter-based FL can be limited by issues such as high communication costs, data leakage, and heterogeneity. Distillation-based FL can improve efficiency, but it relies on a proxy dataset, which is oft… ▽ More

    Submitted 31 October, 2023; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted by the IEEE EMBS International Conference on Data Science and Engineering in Healthcare, Medicine & Biology

  40. arXiv:2310.17558  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Towards Matching Phones and Speech Representations

    Authors: Gene-** Yang, Hao Tang

    Abstract: Learning phone types from phone instances has been a long-standing problem, while still being open. In this work, we revisit this problem in the context of self-supervised learning, and pose it as the problem of matching cluster centroids to phone embeddings. We study two key properties that enable matching, namely, whether cluster centroids of self-supervised representations reduce the variabilit… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted to ASRU 2023

  41. arXiv:2310.10964  [pdf, other

    cs.IT eess.SP

    Spectral-Efficiency and Energy-Efficiency of Variable-Length XP-HARQ

    Authors: Jiahui Feng, Zheng Shi, Yaru Fu, Hong Wang, Guanghua Yang, Shaodan Ma

    Abstract: A variable-length cross-packet hybrid automatic repeat request (VL-XP-HARQ) is proposed to boost the spectral efficiency (SE) and the energy efficiency (EE) of communications. The SE is firstly derived in terms of the outage probabilities, with which the SE is proved to be upper bounded by the ergodic capacity (EC). Moreover, to facilitate the maximization of the SE, the asymptotic outage probabil… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  42. arXiv:2310.06457  [pdf, other

    eess.SY

    Small-Signal Stability and SCR Enhancement of Offshore WPPs with Synchronous Condensers

    Authors: Sulav Ghimire, Kanakesh V. Kkuni, Emerson D. Guest, Kim H. Jensen, Guangya Yang

    Abstract: Synchronous condensers (SCs) have been reported to improve the overall stability and short-circuit power of a power system. SCs are also being integrated into offshore wind power plants (WPPs) for the same reason. This paper, investigates the effect of synchronous condensers on an offshore wind power plant with grid-following (GFL) and grid-forming (GFM) converter controls. Primarily, the effect o… ▽ More

    Submitted 30 January, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  43. arXiv:2310.05638  [pdf

    eess.IV cs.CV

    High Accuracy and Cost-Saving Active Learning 3D WD-UNet for Airway Segmentation

    Authors: Shiyi Wang, Yang Nan, Simon Walsh, Guang Yang

    Abstract: We propose a novel Deep Active Learning (DeepAL) model-3D Wasserstein Discriminative UNet (WD-UNet) for reducing the annotation effort of medical 3D Computed Tomography (CT) segmentation. The proposed WD-UNet learns in a semi-supervised way and accelerates learning convergence to meet or exceed the prediction metrics of supervised learning models. Our method can be embedded with different Active L… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

  44. arXiv:2310.01826  [pdf, other

    eess.SY

    Grid-Forming Control Methods for Weakly Connected Offshore WPPs

    Authors: Sulav Ghimire, Kanakesh V Kkuni, Simon C Jakobsen, Thyge Knueppel, Kim H Jensen, Emerson Guest, Tonny W Rasmussen, Guangya Yang

    Abstract: Grid-forming control (GFC) has seen numerous technological advances in their control types, applications, and the multitude of services they provide. Some examples of the services they provide include black start, inertial frequency response, and islanded operation capabilities with the possibility of re-synchronization without the need of additional support from other devices such as storage. Sta… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Journal ref: Wind and Solar Integration Workshop 2023, Copenhagen

  45. arXiv:2309.16853  [pdf, other

    eess.SP

    T1/T2 relaxation temporal modelling from accelerated acquisitions using a Latent Transformer

    Authors: Fanwen Wang, Michael Tanzer, Mengyun Qiao, Wenjia Bai, Daniel Rueckert, Guang Yang, Sonia Nielles-Vallespin

    Abstract: Quantitative cardiac magnetic resonance T1 and T2 map** enable myocardial tissue characterisation but the lengthy scan times restrict their widespread clinical application. We propose a deep learning method that incorporates a time dependency Latent Transformer module to model relationships between parameterised time frames for improved reconstruction from undersampled data. The module, implemen… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  46. arXiv:2309.15485  [pdf, other

    eess.IV cs.CV

    Style Transfer and Self-Supervised Learning Powered Myocardium Infarction Super-Resolution Segmentation

    Authors: Lichao Wang, Jiahao Huang, Xiaodan Xing, Yinzhe Wu, Ramyah Rajakulasingam, Andrew D. Scott, Pedro F Ferreira, Ranil De Silva, Sonia Nielles-Vallespin, Guang Yang

    Abstract: This study proposes a pipeline that incorporates a novel style transfer model and a simultaneous super-resolution and segmentation model. The proposed pipeline aims to enhance diffusion tensor imaging (DTI) images by translating them into the late gadolinium enhancement (LGE) domain, which offers a larger amount of data with high-resolution and distinct highlighting of myocardium infarction (MI) a… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 6 pages, 8 figures, conference, accepted by SIPAIM2023

  47. arXiv:2309.13860  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning

    Authors: Guanrou Yang, Ziyang Ma, Zhisheng Zheng, Yakun Song, Zhikang Niu, Xie Chen

    Abstract: Recent years have witnessed significant advancements in self-supervised learning (SSL) methods for speech-processing tasks. Various speech-based SSL models have been developed and present promising performance on a range of downstream tasks including speech recognition. However, existing speech-based SSL models face a common dilemma in terms of computational cost, which might hinder their potentia… ▽ More

    Submitted 29 September, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

  48. arXiv:2309.09180  [pdf, other

    eess.AS cs.AI cs.SD

    Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding with Sequence-to-Sequence Architecture

    Authors: Gaobin Yang, Maokui He, Shutong Niu, Ruoyu Wang, Yanyan Yue, Shuangqing Qian, Shilong Wu, Jun Du, Chin-Hui Lee

    Abstract: We propose a novel neural speaker diarization system using memory-aware multi-speaker embedding with sequence-to-sequence architecture (NSD-MS2S), which integrates the strengths of memory-aware multi-speaker embedding (MA-MSE) and sequence-to-sequence (Seq2Seq) architecture, leading to improvement in both efficiency and performance. Next, we further decrease the memory occupation of decoding by in… ▽ More

    Submitted 26 December, 2023; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: Accepted by ICASSP 2024

  49. arXiv:2309.06598  [pdf

    eess.IV

    Efficient Post-processing of Diffusion Tensor Cardiac Magnetic Imaging Using Texture-conserving Deformable Registration

    Authors: Fanwen Wang, Pedro F. Ferreira, Yinzhe Wu, Camila Munoz, Ke Wen, Yaqing Luo, Jiahao Huang, Dudley J. Pennell, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Diffusion tensor cardiac magnetic resonance (DT-CMR) is a method capable of providing non-invasive measurements of myocardial microstructure. Image registration is essential to correct image shifts due to intra and inter breath-hold motion and imperfect cardiac triggering. Registration is challenging in DT-CMR due to the low signal-to-noise and various contrasts induced by the diffusion encoding i… ▽ More

    Submitted 16 May, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 7 pages, 4 figures, conference

  50. arXiv:2309.04710  [pdf, other

    cs.RO cs.AI cs.CV cs.GR eess.SY

    Jade: A Differentiable Physics Engine for Articulated Rigid Bodies with Intersection-Free Frictional Contact

    Authors: Gang Yang, Siyuan Luo, Lin Shao

    Abstract: We present Jade, a differentiable physics engine for articulated rigid bodies. Jade models contacts as the Linear Complementarity Problem (LCP). Compared to existing differentiable simulations, Jade offers features including intersection-free collision simulation and stable LCP solutions for multiple frictional contacts. We use continuous collision detection to detect the time of impact and adopt… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.