Skip to main content

Showing 1–50 of 199 results for author: Wang, F

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19043  [pdf

    eess.IV cs.AI cs.CV cs.DB

    CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

    Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Ya**g Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, **g Qin, Xiahai Zhuang, Claudia Prieto , et al. (7 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover h… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 2 tables

  2. arXiv:2406.18871  [pdf, other

    eess.AS cs.CL

    DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment

    Authors: Ke-Han Lu, Zhehuai Chen, Szu-Wei Fu, He Huang, Boris Ginsburg, Yu-Chiang Frank Wang, Hung-yi Lee

    Abstract: Recent speech language models (SLMs) typically incorporate pre-trained speech models to extend the capabilities from large language models (LLMs). In this paper, we propose a Descriptive Speech-Text Alignment approach that leverages speech captioning to bridge the gap between speech and text modalities, enabling SLMs to interpret and generate comprehensive natural language descriptions, thereby fa… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  3. arXiv:2406.13788  [pdf, other

    eess.SP

    Groupwise Deformable Registration of Diffusion Tensor Cardiovascular Magnetic Resonance: Disentangling Diffusion Contrast, Respiratory and Cardiac Motions

    Authors: Fanwen Wang, Yihao Luo, Ke Wen, Jiahao Huang, Pedro F. Ferreira, Yaqing Luo, Yinzhe Wu, Camila Munoz, Dudley J. Pennell, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Diffusion tensor based cardiovascular magnetic resonance (DT-CMR) offers a non-invasive method to visualize the myocardial microstructure. With the assumption that the heart is stationary, frames are acquired with multiple repetitions for different diffusion encoding directions. However, motion from poor breath-holding and imprecise cardiac triggering complicates DT-CMR analysis, further challenge… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted by MICCAI 2024

  4. arXiv:2406.13708  [pdf

    eess.IV physics.med-ph

    Low-rank based motion correction followed by automatic frame selection in DT-CMR

    Authors: Fanwen Wang, Pedro F. Ferreira, Camila Munoz, Ke Wen, Yaqing Luo, Jiahao Huang, Yinzhe Wu, Dudley J. Pennell, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Motivation: Post-processing of in-vivo diffusion tensor CMR (DT-CMR) is challenging due to the low SNR and variation in contrast between frames which makes image registration difficult, and the need to manually reject frames corrupted by motion. Goals: To develop a semi-automatic post-processing pipeline for robust DT-CMR registration and automatic frame selection. Approach: We used low intrinsic… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted as ISMRM 2024 Digital poster 2141

    Journal ref: ISMRM 2024 Digital poster 2141

  5. arXiv:2406.07061  [pdf, other

    eess.IV cs.CV

    Triage of 3D pathology data via 2.5D multiple-instance learning to guide pathologist assessments

    Authors: Gan Gao, Andrew H. Song, Fiona Wang, David Brenes, Rui Wang, Sarah S. L. Chow, Kevin W. Bishop, Lawrence D. True, Faisal Mahmood, Jonathan T. C. Liu

    Abstract: Accurate patient diagnoses based on human tissue biopsies are hindered by current clinical practice, where pathologists assess only a limited number of thin 2D tissue slices sectioned from 3D volumetric tissue. Recent advances in non-destructive 3D pathology, such as open-top light-sheet microscopy, enable comprehensive imaging of spatially heterogeneous tissue morphologies, offering the feasibili… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPR CVMI 2024

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2024, pp. 6955-6965

  6. arXiv:2406.05692  [pdf, other

    cs.SD cs.AI eess.AS

    SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion

    Authors: Bingsong Bai, Feng** Wang, Yingming Gao, Ya Li

    Abstract: Diffusion-based singing voice conversion (SVC) models have shown better synthesis quality compared to traditional methods. However, in cross-domain SVC scenarios, where there is a significant disparity in pitch between the source and target voice domains, the models tend to generate audios with hoarseness, posing challenges in achieving high-quality vocal outputs. Therefore, in this paper, we prop… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  7. arXiv:2406.05515  [pdf, other

    cs.SD cs.CL eess.AS

    Mmm whatcha say? Uncovering distal and proximal context effects in first and second-language word perception using psychophysical reverse correlation

    Authors: Paige Tuttösí, H. Henny Yeung, Yue Wang, Fenqi Wang, Guillaume Denis, Jean-Julien Aucouturier, Angelica Lim

    Abstract: Acoustic context effects, where surrounding changes in pitch, rate or timbre influence the perception of a sound, are well documented in speech perception, but how they interact with language background remains unclear. Using a reverse-correlation approach, we systematically varied the pitch and speech rate in phrases around different pairs of vowels for second language (L2) speakers of English (/… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 2024

  8. arXiv:2405.17659  [pdf, other

    eess.IV cs.CV

    Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba

    Authors: Jiahao Huang, Liutao Yang, Fanwen Wang, Yang Nan, Weiwen Wu, Chengyan Wang, Kuangyu Shi, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang

    Abstract: Deep learning has been extensively applied in medical image reconstruction, where Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) represent the predominant paradigms, each possessing distinct advantages and inherent limitations: CNNs exhibit linear complexity with local sensitivity, whereas ViTs demonstrate quadratic complexity with global sensitivity. The emerging Mamba has sh… ▽ More

    Submitted 25 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  9. arXiv:2405.06442  [pdf, other

    cs.IT eess.SP

    Optimal Beamforming of RIS-Aided Wireless Communications: An Alternating Inner Product Maximization Approach

    Authors: Ru**g Xiong, Tiebin Mi, Jialong Lu, Ke Yin, Kai Wan, Fuhai Wang, Robert Caiming Qiu

    Abstract: This paper investigates a general discrete $\ell_p$-norm maximization problem, with the power enhancement at steering directions through reconfigurable intelligent surfaces (RISs) as an instance. We propose a mathematically concise iterative framework composed of alternating inner product maximizations, well-suited for addressing $\ell_1$- and $\ell_2$-norm maximizations with either discrete or co… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  10. arXiv:2404.16223  [pdf, other

    cs.CV eess.IV

    Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey

    Authors: Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte, Jianxing Zhang, Jia Li, Fan Wang, Xiaopeng Li, Zikun Liu, Hyunhee Park, Sejun Song, Changho Kim, Zhijuan Huang, Hongyuan Yu, Cheng Wan, Wending Xiang, Jiamin Lin, Hang Zhong, Qiaosong Zhang, Yue Sun, Xuanwu Yin, Kunlong Zuo, Senyan Xu, Siyuan Jiang, Zhi**g Sun, Jiaying Zhu , et al. (10 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 RAW Image Super-Resolution Challenge, highlighting the proposed solutions and results. New methods for RAW Super-Resolution could be essential in modern Image Signal Processing (ISP) pipelines, however, this problem is not as explored as in the RGB domain. Th goal of this challenge is to upscale RAW Bayer images by 2x, considering unknown degradations such as nois… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 - NTIRE Workshop

  11. arXiv:2404.15294  [pdf

    eess.SP cs.LG

    Multimodal Physical Fitness Monitoring (PFM) Framework Based on TimeMAE-PFM in Wearable Scenarios

    Authors: Junjie Zhang, Zheming Zhang, Huachen Xiang, Yangquan Tan, Linnan Huo, Fengyi Wang

    Abstract: Physical function monitoring (PFM) plays a crucial role in healthcare especially for the elderly. Traditional assessment methods such as the Short Physical Performance Battery (SPPB) have failed to capture the full dynamic characteristics of physical function. Wearable sensors such as smart wristbands offer a promising solution to this issue. However, challenges exist, such as the computational co… ▽ More

    Submitted 25 March, 2024; originally announced April 2024.

    Comments: 5 pages, 6 figures

  12. arXiv:2404.12769  [pdf

    eess.SY

    Towards Accurate and Efficient Sorting of Retired Lithium-ion Batteries: A Data Driven Based Electrode Aging Assessment Approach

    Authors: Ruohan Guo, Feng Wang, Cungang Hu, Weixiang Shen

    Abstract: Retired batteries (RBs) for second-life applications offer promising economic and environmental benefits. However, accurate and efficient sorting of RBs with discrepant characteristics persists as a pressing challenge. In this study, we introduce a data driven based electrode aging assessment approach to address this concern. To this end, a number of 15 feature points are extracted from battery op… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 40 pages, 25 figures

  13. arXiv:2404.08490  [pdf, other

    eess.SP

    SemHARQ: Semantic-Aware HARQ for Multi-task Semantic Communications

    Authors: Jiang**g Hu, Fengyu Wang, Wenjun Xu, Hui Gao, ** Zhang

    Abstract: Intelligent task-oriented semantic communications (SemComs) have witnessed great progress with the development of deep learning (DL). In this paper, we propose a semantic-aware hybrid automatic repeat request (SemHARQ) framework for the robust and efficient transmissions of semantic features. First, to improve the robustness and effectiveness of semantic coding, a multi-task semantic encoder is pr… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

  14. arXiv:2404.01082  [pdf, other

    eess.IV

    The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023

    Authors: Jun Lyu, Chen Qin, Shuo Wang, Fanwen Wang, Yan Li, Zi Wang, Kunyuan Guo, Cheng Ouyang, Michael Tänzer, Meng Liu, Longyu Sun, Mengting Sun, Qin Li, Zhang Shi, Sha Hua, Hao Li, Zhensen Chen, Zhenlin Zhang, Bingyu Xin, Dimitris N. Metaxas, George Yiasemis, Jonas Teuwen, Li** Zhang, Weitian Chen, Yidong Zhao , et al. (25 additional authors not shown)

    Abstract: Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation p… ▽ More

    Submitted 16 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 25 pages, 17 figures

  15. arXiv:2403.18134  [pdf, other

    eess.IV cs.CV

    Integrative Graph-Transformer Framework for Histopathology Whole Slide Image Representation and Classification

    Authors: Zhan Shi, **gwei Zhang, Jun Kong, Fusheng Wang

    Abstract: In digital pathology, the multiple instance learning (MIL) strategy is widely used in the weakly supervised histopathology whole slide image (WSI) classification task where giga-pixel WSIs are only labeled at the slide level. However, existing attention-based MIL approaches often overlook contextual information and intrinsic spatial relationships between neighboring tissue tiles, while graph-based… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  16. arXiv:2403.01137  [pdf, other

    cs.CV cs.GR eess.IV

    Neural radiance fields-based holography [Invited]

    Authors: Minsung Kang, Fan Wang, Kai Kumano, Tomoyoshi Ito, Tomoyoshi Shimobaba

    Abstract: This study presents a novel approach for generating holograms based on the neural radiance fields (NeRF) technique. Generating three-dimensional (3D) data is difficult in hologram computation. NeRF is a state-of-the-art technique for 3D light-field reconstruction from 2D images based on volume rendering. The NeRF can rapidly predict new-view images that do not include a training dataset. In this s… ▽ More

    Submitted 9 May, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  17. arXiv:2403.00897  [pdf, other

    eess.IV astro-ph.GA cs.AI cs.CV cs.LG

    VisRec: A Semi-Supervised Approach to Radio Interferometric Data Reconstruction

    Authors: Ruoqi Wang, Haitao Wang, Qiong Luo, Feng Wang, Hejun Wu

    Abstract: Radio telescopes produce visibility data about celestial objects, but these data are sparse and noisy. As a result, images created on raw visibility data are of low quality. Recent studies have used deep learning models to reconstruct visibility data to get cleaner images. However, these methods rely on a substantial amount of labeled training data, which requires significant labeling effort from… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  18. arXiv:2402.18451  [pdf, other

    eess.IV cs.CV

    MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation

    Authors: Jiahao Huang, Liutao Yang, Fanwen Wang, Yang Nan, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang

    Abstract: The recent Mamba model has shown remarkable adaptability for visual representation learning, including in medical imaging tasks. This study introduces MambaMIR, a Mamba-based model for medical image reconstruction, as well as its Generative Adversarial Network-based variant, MambaMIR-GAN. Our proposed MambaMIR inherits several advantages, such as linear complexity, global receptive fields, and dyn… ▽ More

    Submitted 25 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  19. arXiv:2402.16321  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech

    Authors: Szu-Wei Fu, Kuo-Hsuan Hung, Yu Tsao, Yu-Chiang Frank Wang

    Abstract: Speech quality estimation has recently undergone a paradigm shift from human-hearing expert designs to machine-learning models. However, current models rely mainly on supervised learning, which is time-consuming and expensive for label collection. To solve this problem, we propose VQScore, a self-supervised metric for evaluating speech based on the quantization error of a vector-quantized-variatio… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Published as a conference paper at ICLR 2024

  20. arXiv:2402.11186  [pdf, other

    eess.IV physics.med-ph

    Low-Dose CT Reconstruction Using Dataset-free Learning

    Authors: Feng Wang, Renfang Wang, Hong Qiu

    Abstract: Low-Dose computer tomography (LDCT) is an ideal alternative to reduce radiation risk in clinical applications. Although supervised-deep-learning-based reconstruction methods have demonstrated superior performance compared to conventional model-driven reconstruction algorithms, they require collecting massive pairs of low-dose and norm-dose CT images for neural network training, which limits their… ▽ More

    Submitted 22 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  21. Deep-Learning Channel Estimation for IRS-Assisted Integrated Sensing and Communication System

    Authors: Yu Liu, Ibrahim Al-Nahhal, Octavia A. Dobre, Fanggang Wang

    Abstract: Integrated sensing and communication (ISAC), and intelligent reflecting surface (IRS) are envisioned as revolutionary technologies to enhance spectral and energy efficiencies for next wireless system generations. For the first time, this paper focuses on the channel estimation problem in an IRS-assisted ISAC system. This problem is challenging due to the lack of signal processing capacity in passi… ▽ More

    Submitted 7 April, 2024; v1 submitted 29 January, 2024; originally announced February 2024.

    Journal ref: Published in IEEE Transactions on Vehicular Technology, vol. 72, no. 5, pp. 6181-6193, May 2023

  22. Extreme Learning Machine-based Channel Estimation in IRS-Assisted Multi-User ISAC System

    Authors: Yu Liu, Ibrahim Al-Nahhal, Octavia A. Dobre, Fanggang Wang, Hyundong Shin

    Abstract: Multi-user integrated sensing and communication (ISAC) assisted by intelligent reflecting surface (IRS) has been recently investigated to provide a high spectral and energy efficiency transmission. This paper proposes a practical channel estimation approach for the first time to an IRS-assisted multiuser ISAC system. The estimation problem in such a system is challenging since the sensing and comm… ▽ More

    Submitted 7 April, 2024; v1 submitted 29 January, 2024; originally announced February 2024.

    Journal ref: Published in IEEE Transactions on Communications, vol. 71, no. 12, pp. 6993-7007, Dec. 2023

  23. Deep-Learning-Based Channel Estimation for IRS-Assisted ISAC System

    Authors: Yu Liu, Ibrahim Al-Nahhal, Octavia A. Dobre, Fanggang Wang

    Abstract: Integrated sensing and communication (ISAC) and intelligent reflecting surface (IRS) are viewed as promising technologies for future generations of wireless networks. This paper investigates the channel estimation problem in an IRS-assisted ISAC system. A deep-learning framework is proposed to estimate the sensing and communication (S&C) channels in such a system. Considering different propagation… ▽ More

    Submitted 7 April, 2024; v1 submitted 29 January, 2024; originally announced February 2024.

    Journal ref: Published in IEEE Global Communications Conference, Rio de Janeiro, Brazil, Dec. 2022, pp. 4220-4225

  24. arXiv:2401.16564  [pdf

    eess.SP

    Data and Physics driven Deep Learning Models for Fast MRI Reconstruction: Fundamentals and Methodologies

    Authors: Jiahao Huang, Yinzhe Wu, Fanwen Wang, Yingying Fang, Yang Nan, Cagan Alkan, Lei Xu, Zhifan Gao, Weiwen Wu, Lei Zhu, Zhaolin Chen, Peter Lally, Neal Bangerter, Kawin Setsompop, Yike Guo, Daniel Rueckert, Ge Wang, Guang Yang

    Abstract: Magnetic Resonance Imaging (MRI) is a pivotal clinical diagnostic tool, yet its extended scanning times often compromise patient comfort and image quality, especially in volumetric, temporal and quantitative scans. This review elucidates recent advances in MRI acceleration via data and physics-driven models, leveraging techniques from algorithm unrolling models, enhancement-based models, and plug-… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  25. arXiv:2401.15513  [pdf, other

    eess.IV cs.CV

    MiTU-Net: A fine-tuned U-Net with SegFormer backbone for segmenting pubic symphysis-fetal head

    Authors: Fangyijie Wang, Guenole Silvestre, Kathleen Curran

    Abstract: Ultrasound measurements have been examined as potential tools for predicting the likelihood of successful vaginal delivery. The angle of progression (AoP) is a measurable parameter that can be obtained during the initial stage of labor. The AoP is defined as the angle between a straight line along the longitudinal axis of the pubic symphysis (PS) and a line from the inferior edge of the PS to the… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: The 5th place in the Pubic Symphysis-Fetal Head Segmentation Challenge in MICCAI 2023

  26. arXiv:2401.15111  [pdf, other

    eess.IV cs.CV cs.LG

    Improving Fairness of Automated Chest X-ray Diagnosis by Contrastive Learning

    Authors: Mingquan Lin, Tianhao Li, Zhaoyi Sun, Gregory Holste, Ying Ding, Fei Wang, George Shih, Yifan Peng

    Abstract: Purpose: Limited studies exploring concrete methods or approaches to tackle and enhance model fairness in the radiology domain. Our proposed AI model utilizes supervised contrastive learning to minimize bias in CXR diagnosis. Materials and Methods: In this retrospective study, we evaluated our proposed method on two datasets: the Medical Imaging and Data Resource Center (MIDRC) dataset with 77,8… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 23 pages, 5 figures

    MSC Class: arms.org

  27. arXiv:2401.13998  [pdf, other

    eess.IV cs.CV

    WAL-Net: Weakly supervised auxiliary task learning network for carotid plaques classification

    Authors: Haitao Gan, Lingchao Fu, Ran Zhou, Weiyan Gan, Furong Wang, Xiaoyan Wu, Zhi Yang, Zhongwei Huang

    Abstract: The classification of carotid artery ultrasound images is a crucial means for diagnosing carotid plaques, holding significant clinical relevance for predicting the risk of stroke. Recent research suggests that utilizing plaque segmentation as an auxiliary task for classification can enhance performance by leveraging the correlation between segmentation and classification tasks. However, this appro… ▽ More

    Submitted 27 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  28. arXiv:2401.06419  [pdf, other

    math.OC eess.SP

    Energy-Efficient Data Offloading for Earth Observation Satellite Networks

    Authors: Lijun He, Ziye Jia, Juncheng Wang, Feng Wang, Erick Lansard, Chau Yuen

    Abstract: In Earth Observation Satellite Networks (EOSNs) with a large number of battery-carrying satellites, proper power allocation and task scheduling are crucial to improving the data offloading efficiency. As such, we jointly optimize power allocation and task scheduling to achieve energy-efficient data offloading in EOSNs, aiming to balance the objectives of reducing the total energy consumption and i… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  29. arXiv:2312.10687  [pdf, other

    eess.AS cs.SD

    MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis

    Authors: Wenhao Guan, Yishuang Li, Tao Li, Hukai Huang, Feng Wang, Jiayan Lin, Lingyan Huang, Lin Li, Qingyang Hong

    Abstract: The style transfer task in Text-to-Speech refers to the process of transferring style information into text content to generate corresponding speech with a specific style. However, most existing style transfer approaches are either based on fixed emotional labels or reference speech clips, which cannot achieve flexible style transfer. Recently, some methods have adopted text descriptions to guide… ▽ More

    Submitted 31 January, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI2024

  30. arXiv:2312.03299  [pdf, other

    cs.IT eess.SP

    Channel-Transferable Semantic Communications for Multi-User OFDM-NOMA Systems

    Authors: Lan Lin, Wenjun Xu, Fengyu Wang, Yimeng Zhang, Wei Zhang, ** Zhang

    Abstract: Semantic communications are expected to become the core new paradigms of the sixth generation (6G) wireless networks. Most existing works implicitly utilize channel information for codecs training, which leads to poor communications when channel type or statistical characteristics change. To tackle this issue posed by various channels, a novel channel-transferable semantic communications (CT-SemCo… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  31. arXiv:2311.18788  [pdf, other

    eess.IV cs.AI cs.CV cs.MM physics.med-ph

    Automated interpretation of congenital heart disease from multi-view echocardiograms

    Authors: **g Wang, Xiaofeng Liu, Fangyun Wang, Lin Zheng, Fengqiao Gao, Hanwen Zhang, Xin Zhang, Wanqing Xie, Binbin Wang

    Abstract: Congenital heart disease (CHD) is the most common birth defect and the leading cause of neonate death in China. Clinical diagnosis can be based on the selected 2D key-frames from five views. Limited by the availability of multi-view data, most methods have to rely on the insufficient single view analysis. This study proposes to automatically analyze the multi-view echocardiograms with a practical… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: Published in Medical Image Analysis

    Journal ref: Medical Image Analysis (Volume 69, April 2021, 101942)

  32. arXiv:2311.14488  [pdf, other

    eess.IV

    Lightweight Framework for Automated Kidney Stone Detection using coronal CT images

    Authors: Fangyijie Wang, Guenole Silvestre, Kathleen M. Curran

    Abstract: Kidney stone disease results in millions of annual visits to emergency departments in the United States. Computed tomography (CT) scans serve as the standard imaging modality for efficient detection of kidney stones. Various approaches utilizing convolutional neural networks (CNNs) have been proposed to implement automatic diagnosis of kidney stones. However, there is a growing interest in employi… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 5 pages, 2 figures, 3 tables

  33. arXiv:2311.14275  [pdf, other

    cs.CV cs.SD eess.AS

    Cooperative Dual Attention for Audio-Visual Speech Enhancement with Facial Cues

    Authors: Feixiang Wang, Shuang Yang, Shiguang Shan, Xilin Chen

    Abstract: In this work, we focus on leveraging facial cues beyond the lip region for robust Audio-Visual Speech Enhancement (AVSE). The facial region, encompassing the lip region, reflects additional speech-related attributes such as gender, skin color, nationality, etc., which contribute to the effectiveness of AVSE. However, static and dynamic speech-unrelated attributes also exist, causing appearance cha… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: Accepted to BMVC 2023 15 pages, 2 figures

  34. arXiv:2311.11582  [pdf, other

    cs.IT eess.SP

    Asymptotic CRB Analysis of Random RIS-Assisted Large-Scale Localization Systems

    Authors: Zhengyu Wang, Hongzheng Liu, Ru**g Xiong, Fuhai Wang, Robert Caiming Qiu

    Abstract: This paper studies the performance of a randomly RIS-assisted multi-target localization system, in which the configurations of the RIS are randomly set to avoid high-complexity optimization. We first focus on the scenario where the number of RIS elements is significantly large, and then obtain the scaling law of Cramér-Rao bound (CRB) under certain conditions, which shows that CRB decreases in the… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  35. arXiv:2311.11222  [pdf, other

    eess.IV

    Wireless Regional Imaging through Reconfigurable Intelligent Surfaces: Passive Mode

    Authors: Fuhai Wang, Chun Wang, Ru**g Xiong, Zhengyu Wang, Tiebin Mi, Robert Caiming Qiu

    Abstract: In this paper, we propose a multi-RIS-aided wireless imaging framework in 3D facing the distributed placement of multi-sensor networks. The system creates a randomized reflection pattern by adjusting the RIS phase shift, enabling the receiver to capture signals within the designated space of interest (SoI). Firstly, a multi-RIS-aided linear imaging channel modeling is proposed. We introduce a theo… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  36. arXiv:2311.05921  [pdf, other

    eess.SY

    A Wi-Fi Signal-Based Human Activity Recognition Using High-Dimensional Factor Models

    Authors: Junshuo Liu, Fuhai Wang, Zhe Li, Ru**g Xiong, Tiebin Mi, Robert Caiming Qiu

    Abstract: Passive sensing techniques based on Wi-Fi signals have emerged as a promising technology in advanced wireless communication systems due to their widespread application and cost-effectiveness. However, the proliferation of low-cost Internet of Things (IoT) devices has led to dense network deployments, resulting in increased levels of noise and interference in Wi-Fi environments. This, in turn, lead… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  37. arXiv:2310.20389  [pdf

    eess.IV cs.CV

    High-Resolution Reference Image Assisted Volumetric Super-Resolution of Cardiac Diffusion Weighted Imaging

    Authors: Yinzhe Wu, Jiahao Huang, Fanwen Wang, Pedro Ferreira, Andrew Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Diffusion Tensor Cardiac Magnetic Resonance (DT-CMR) is the only in vivo method to non-invasively examine the microstructure of the human heart. Current research in DT-CMR aims to improve the understanding of how the cardiac microstructure relates to the macroscopic function of the healthy heart as well as how microstructural dysfunction contributes to disease. To get the final DT-CMR metrics, we… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Accepted by SPIE Medical Imaging 2024

  38. arXiv:2310.17997  [pdf

    physics.optics cs.AI eess.IV

    Deep Learning Enables Large Depth-of-Field Images for Sub-Diffraction-Limit Scanning Superlens Microscopy

    Authors: Hui Sun, Hao Luo, Feifei Wang, Qingjiu Chen, Meng Chen, Xiaoduo Wang, Haibo Yu, Guanglie Zhang, Lianqing Liu, Jian** Wang, Dapeng Wu, Wen Jung Li

    Abstract: Scanning electron microscopy (SEM) is indispensable in diverse applications ranging from microelectronics to food processing because it provides large depth-of-field images with a resolution beyond the optical diffraction limit. However, the technology requires coating conductive films on insulator samples and a vacuum environment. We use deep learning to obtain the map** relationship between op… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 13 pages,7 figures

  39. arXiv:2310.17760  [pdf, other

    stat.ME eess.SP

    Novel Models for Multiple Dependent Heteroskedastic Time Series

    Authors: Fangyijie Wang, Michael Salter-Townshend

    Abstract: Functional magnetic resonance imaging or functional MRI (fMRI) is a very popular tool used for differing brain regions by measuring brain activity. It is affected by physiological noise, such as head and brain movement in the scanner from breathing, heart beats, or the subject fidgeting. The purpose of this paper is to propose a novel approach to handling fMRI data for infants with high volatility… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 18 pages

  40. arXiv:2309.16853  [pdf, other

    eess.SP

    T1/T2 relaxation temporal modelling from accelerated acquisitions using a Latent Transformer

    Authors: Fanwen Wang, Michael Tanzer, Mengyun Qiao, Wenjia Bai, Daniel Rueckert, Guang Yang, Sonia Nielles-Vallespin

    Abstract: Quantitative cardiac magnetic resonance T1 and T2 map** enable myocardial tissue characterisation but the lengthy scan times restrict their widespread clinical application. We propose a deep learning method that incorporates a time dependency Latent Transformer module to model relationships between parameterised time frames for improved reconstruction from undersampled data. The module, implemen… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  41. arXiv:2309.11965  [pdf, ps, other

    eess.SY

    Coordination Control of Discrete Event Systems under Cyber Attacks

    Authors: Fei Wang, Jan Komenda, Feng Lin

    Abstract: This paper investigates the coordination control of discrete event systems in the presence of combined sensor and actuator attacks. Discrete event systems are modeled as automata, and sensor attacks are defined using specific attack languages. The approach involves employing multiple local supervisors to control the system. The primary objective is to devise these local supervisors to ensure the s… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 9 pages, references added, proof of Theorem 3

  42. arXiv:2309.08385  [pdf, other

    cs.LG eess.SP

    A Unified View Between Tensor Hypergraph Neural Networks And Signal Denoising

    Authors: Fuli Wang, Karelia Pena-Pena, Wei Qian, Gonzalo R. Arce

    Abstract: Hypergraph Neural networks (HyperGNNs) and hypergraph signal denoising (HyperGSD) are two fundamental topics in higher-order network modeling. Understanding the connection between these two domains is particularly useful for designing novel HyperGNNs from a HyperGSD perspective, and vice versa. In particular, the tensor-hypergraph convolutional network (T-HGCN) has emerged as a powerful architectu… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 5 pages, accepted by EUSIPCO 2023

  43. arXiv:2309.07198  [pdf, other

    eess.IV physics.app-ph physics.optics

    Temporal compressive edge imaging enabled by a lensless diffuser camera

    Authors: Ze Zheng, Baolei Liu, Jiaqi Song, Lei Ding, Xiaolan Zhong, David Mcgloin, Fan Wang

    Abstract: Lensless imagers based on diffusers or encoding masks enable high-dimensional imaging from a single shot measurement and have been applied in various applications. However, to further extract image information such as edge detection, conventional post-processing filtering operations are needed after the reconstruction of the original object images in the diffuser imaging systems. Here, we present… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: 5 pages, 4 figures

    Journal ref: Optics Letters, 49(11), 3058-3061 (2024)

  44. arXiv:2309.06598  [pdf

    eess.IV

    Efficient Post-processing of Diffusion Tensor Cardiac Magnetic Imaging Using Texture-conserving Deformable Registration

    Authors: Fanwen Wang, Pedro F. Ferreira, Yinzhe Wu, Camila Munoz, Ke Wen, Yaqing Luo, Jiahao Huang, Dudley J. Pennell, Andrew D. Scott, Sonia Nielles-Vallespin, Guang Yang

    Abstract: Diffusion tensor cardiac magnetic resonance (DT-CMR) is a method capable of providing non-invasive measurements of myocardial microstructure. Image registration is essential to correct image shifts due to intra and inter breath-hold motion and imperfect cardiac triggering. Registration is challenging in DT-CMR due to the low signal-to-noise and various contrasts induced by the diffusion encoding i… ▽ More

    Submitted 16 May, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: 7 pages, 4 figures, conference

  45. arXiv:2309.05658  [pdf, other

    cs.MM cs.NI eess.IV

    From Capture to Display: A Survey on Volumetric Video

    Authors: Yili **, Kaiyuan Hu, Junhua Liu, Fangxin Wang, Xue Liu

    Abstract: Volumetric video, which offers immersive viewing experiences, is gaining increasing prominence. With its six degrees of freedom, it provides viewers with greater immersion and interactivity compared to traditional videos. Despite their potential, volumetric video services poses significant challenges. This survey conducts a comprehensive review of the existing literature on volumetric video. We fi… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: Submitted

  46. arXiv:2309.03440  [pdf, other

    eess.IV cs.CV cs.LG

    Punctate White Matter Lesion Segmentation in Preterm Infants Powered by Counterfactually Generative Learning

    Authors: Zehua Ren, Yongheng Sun, Miaomiao Wang, Yuying Feng, Xianjun Li, Chao **, Jian Yang, Chunfeng Lian, Fan Wang

    Abstract: Accurate segmentation of punctate white matter lesions (PWMLs) are fundamental for the timely diagnosis and treatment of related developmental disorders. Automated PWMLs segmentation from infant brain MR images is challenging, considering that the lesions are typically small and low-contrast, and the number of lesions may dramatically change across subjects. Existing learning-based methods directl… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 10 pages, 3 figures, Medical Image Computing and Computer Assisted Intervention(MICCAI)

  47. arXiv:2309.00514  [pdf

    cs.CV eess.IV

    A Machine Vision Method for Correction of Eccentric Error: Based on Adaptive Enhancement Algorithm

    Authors: Fanyi Wang, Pin Cao, Yihui Zhang, Haotian Hu, Yongying Yang

    Abstract: In the procedure of surface defects detection for large-aperture aspherical optical elements, it is of vital significance to adjust the optical axis of the element to be coaxial with the mechanical spin axis accurately. Therefore, a machine vision method for eccentric error correction is proposed in this paper. Focusing on the severe defocus blur of reference crosshair image caused by the imaging… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  48. arXiv:2308.12478  [pdf

    cs.SD cs.AI eess.AS

    Attention-Based Acoustic Feature Fusion Network for Depression Detection

    Authors: Xiao Xu, Yang Wang, Xinru Wei, Fei Wang, Xizhe Zhang

    Abstract: Depression, a common mental disorder, significantly influences individuals and imposes considerable societal impacts. The complexity and heterogeneity of the disorder necessitate prompt and effective detection, which nonetheless, poses a difficult challenge. This situation highlights an urgent requirement for improved detection methods. Exploiting auditory data through advanced machine learning pa… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

  49. arXiv:2308.06599  [pdf, other

    eess.IV

    Semantic Communications with Explicit Semantic Base for Image Transmission

    Authors: Yuan Zheng, Fengyu Wang, Wenjun Xu, Miao Pan, ** Zhang

    Abstract: Semantic communications, aiming at ensuring the successful delivery of the meaning of information, are expected to be one of the potential techniques for the next generation communications. However, the knowledge forming and synchronizing mechanism that enables semantic communication systems to extract and interpret the semantics of information according to the communication intents is still immat… ▽ More

    Submitted 14 January, 2024; v1 submitted 12 August, 2023; originally announced August 2023.

  50. arXiv:2308.05591  [pdf, other

    eess.SY cs.IT cs.NI eess.SP

    Optimizing Cache Content Placement in Integrated Terrestrial and Non-terrestrial Networks

    Authors: Feng Wang, Giovanni Geraci, Tony Q. S. Quek

    Abstract: Non-terrestrial networks (NTN) offer potential for efficient content broadcast in remote regions, thereby extending the reach of digital services. In this paper, we introduce a novel approach to optimize wireless edge content placement using NTN. Specifically, we dynamically select content for placement via NTN links based on popularity and suitability for delivery through NTN, while considering t… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: accepted by IEEE GLOBECOM 2023