
Showing 1–50 of 245 results for author: Zhang, P

Searching in archive eess.
  1. arXiv:2406.17661  [pdf, other]

    eess.SY

    Physics-Informed AI Inverter

    Authors: Qing Shen, Yifan Zhou, Peng Zhang, Yacov A. Shamash, Xiaochuan Luo, Bin Wang, Huanfeng Zhao, Roshan Sharma, Bo Chen

    Abstract: This letter devises an AI-Inverter that pilots the use of a physics-informed neural network (PINN) to enable AI-based electromagnetic transient simulations (EMT) of grid-forming inverters. The contributions are threefold: (1) A PINN-enabled AI-Inverter is formulated; (2) An enhanced learning strategy, balanced-adaptive PINN, is devised; (3) extensive validations and comparative analysis of the acc…

    Submitted 1 July, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.09317  [pdf, other]

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, **ming Guo, Xiaolin Chen, **gcheng Wang, Yih Chung Tham, et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. For RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources…

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2406.09182  [pdf, ps, other]

    eess.SP cs.LG

    Federated Contrastive Learning for Personalized Semantic Communication

    Authors: Yining Wang, Wanli Ni, Wenqiang Yi, Xiaodong Xu, ** Zhang, Arumugam Nallanathan

    Abstract: In this letter, we design a federated contrastive learning (FedCL) framework aimed at supporting personalized semantic communication. Our FedCL enables collaborative training of local semantic encoders across multiple clients and a global semantic decoder owned by the base station. This framework supports heterogeneous semantic encoders since it does not require client-side model aggregation. Furt…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: IEEE Communications Letters

  4. arXiv:2406.07390  [pdf, other]

    eess.SP cs.IT eess.IV

    DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling

    Authors: Sixian Wang, **cheng Dai, Kailin Tan, Xiaoqi Qin, Kai Niu, ** Zhang

    Abstract: End-to-end visual communication systems typically optimize a trade-off between channel bandwidth costs and signal-level distortion metrics. However, under challenging physical conditions, this traditional discriminative communication paradigm often results in unrealistic reconstructions with perceptible blurring and aliasing artifacts, despite the inclusion of perceptual or adversarial losses for…

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2406.05916  [pdf, other]

    quant-ph eess.SY

    Reforming Quantum Microgrid Formation

    Authors: Chaofan Lin, Peng Zhang, Mikhail A. Bragin, Yacov A. Shamash

    Abstract: This letter introduces a novel compact and lossless quantum microgrid formation (qMGF) approach to achieve efficient operational optimization of the power system and improvement of resilience. This is achieved through lossless reformulation to ensure that the results are equivalent to those produced by the classical MGF by exploiting graph-theory-empowered quadratic unconstrained binary optimizati…

    Submitted 9 June, 2024; originally announced June 2024.

  6. arXiv:2405.17114  [pdf, other]

    cs.IT eess.SP

    Holographic MIMO Systems, Their Channel Estimation and Performance

    Authors: Yuanbin Chen, Ying Wang, Zhaocheng Wang, ** Zhang

    Abstract: Holographic multiple-input multiple-output (MIMO) systems constitute a promising technology in support of next-generation wireless communications, thus paving the way for a smart programmable radio environment. However, despite its significant potential, further fundamental issues remain to be addressed, such as the acquisition of accurate channel information. Indeed, the conventional angular-doma…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: This article has been accepted for publication in IEEE VTM

  7. arXiv:2405.15163  [pdf, other]

    quant-ph eess.SY

    Provably Quantum-Secure Microgrids through Enhanced Quantum Distributed Control

    Authors: Pouya Babahajiani, Peng Zhang, Ji Liu, Tzu-Chieh Wei

    Abstract: Distributed control of multi-inverter microgrids has attracted considerable attention as it can achieve the combined goals of flexible plug-and-play architecture guaranteeing frequency and voltage regulation while preserving power sharing among nonidentical distributed energy resources (DERs). However, it turns out that cybersecurity has emerged as a serious concern in distributed control schemes.…

    Submitted 23 May, 2024; originally announced May 2024.

  8. arXiv:2405.14113  [pdf, other]

    eess.IV cs.CV

    Multi-modality Regional Alignment Network for Covid X-Ray Survival Prediction and Report Generation

    Authors: Zhusi Zhong, Jie Li, John Sollee, Scott Collins, Harrison Bai, Paul Zhang, Terrence Healey, Michael Atalay, Xinbo Gao, Zhicheng Jiao

    Abstract: In response to the worldwide COVID-19 pandemic, advanced automated technologies have emerged as valuable tools to aid healthcare professionals in managing an increased workload by improving radiology report generation and prognostic analysis. This study proposes Multi-modality Regional Alignment Network (MRANet), an explainable model for radiology report generation and survival prediction that foc…

    Submitted 22 May, 2024; originally announced May 2024.

  9. arXiv:2405.09179  [pdf, other]

    eess.SP

    Integrated Sensing and Communication Enabled Cooperative Passive Sensing Using Mobile Communication System

    Authors: Zhiqing Wei, Haotian Liu, Hujun Li, Wangjun Jiang, Zhiyong Feng, Huici Wu, ** Zhang

    Abstract: Integrated sensing and communication (ISAC) is a potential technology of the sixth-generation (6G) mobile communication system, which enables communication base station (BS) with sensing capability. However, the performance of single-BS sensing is limited, which can be overcome by multi-BS cooperative sensing. There are three types of multi-BS cooperative sensing, including cooperative active sens…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 16 pages, 11 figures, Submitted to IEEE Transactions on Mobile Computing

  10. arXiv:2405.07830  [pdf, other]

    eess.SP

    Joint Precoding for RIS-Assisted Wideband THz Cell-Free Massive MIMO Systems

    Authors: Xin Su, Ruisi He, Peng Zhang, Bo Ai

    Abstract: Terahertz (THz) cell-free massive multiple-input-multiple-output (mMIMO) networks have been envisioned as a prospective technology for achieving higher system capacity, improved performance, and ultra-high reliability in 6G networks. However, due to severe attenuation and limited scattering in THz transmission, as well as high power consumption for increased number of access points (APs), further…

    Submitted 13 May, 2024; originally announced May 2024.

  11. arXiv:2405.07442  [pdf]

    cs.SD cs.AI eess.AS q-bio.QM

    Rene: A Pre-trained Multi-modal Architecture for Auscultation of Respiratory Diseases

    Authors: Pengfei Zhang, Zhihang Zheng, Shichen Zhang, Minghao Yang, Shaojun Tang

    Abstract: Compared with invasive examinations that require tissue sampling, respiratory sound testing is a non-invasive examination method that is safer and easier for patients to accept. In this study, we introduce Rene, a pioneering large-scale model tailored for respiratory sound recognition. Rene has been rigorously fine-tuned with an extensive dataset featuring a broad array of respiratory audio sample…

    Submitted 6 June, 2024; v1 submitted 12 May, 2024; originally announced May 2024.

  12. arXiv:2404.12979  [pdf, other]

    cs.SD cs.LG eess.AS

    TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition

    Authors: Chengxin Chen, Pengyuan Zhang

    Abstract: One persistent challenge in Speech Emotion Recognition (SER) is the ubiquitous environmental noise, which frequently results in diminished SER performance in practical use. In this paper, we introduce a Two-level Refinement Network, dubbed TRNet, to address this challenge. Specifically, a pre-trained speech enhancement module is employed for front-end noise reduction and noise level estimation. La…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 13 pages, 3 figures

  13. arXiv:2404.10556  [pdf, other]

    cs.NI eess.SP

    Generative AI for Advanced UAV Networking

    Authors: Geng Sun, Wenwen Xie, Dusit Niyato, Hongyang Du, Jiawen Kang, **g Wu, Sumei Sun, ** Zhang

    Abstract: With the impressive achievements of ChatGPT and Sora, generative artificial intelligence (GAI) has received increasing attention. Not limited to the field of content generation, GAI is also widely used to solve the problems in wireless communication scenarios due to its powerful learning and generalization capabilities. Therefore, we discuss key applications of GAI in improving unmanned aerial veh…

    Submitted 16 April, 2024; originally announced April 2024.

  14. arXiv:2404.08490  [pdf, other]

    eess.SP

    SemHARQ: Semantic-Aware HARQ for Multi-task Semantic Communications

    Authors: Jiang**g Hu, Fengyu Wang, Wenjun Xu, Hui Gao, ** Zhang

    Abstract: Intelligent task-oriented semantic communications (SemComs) have witnessed great progress with the development of deep learning (DL). In this paper, we propose a semantic-aware hybrid automatic repeat request (SemHARQ) framework for the robust and efficient transmissions of semantic features. First, to improve the robustness and effectiveness of semantic coding, a multi-task semantic encoder is pr…

    Submitted 12 April, 2024; originally announced April 2024.

  15. arXiv:2404.06007  [pdf, other]

    cs.IT cs.AI cs.LG eess.SP

    Collaborative Edge AI Inference over Cloud-RAN

    Authors: Pengfei Zhang, Dingzhu Wen, Guangxu Zhu, Qimei Chen, Kaifeng Han, Yuanming Shi

    Abstract: In this paper, a cloud radio access network (Cloud-RAN) based collaborative edge AI inference architecture is proposed. Specifically, geographically distributed devices capture real-time noise-corrupted sensory data samples and extract the noisy local feature vectors, which are then aggregated at each remote radio head (RRH) to suppress sensing noise. To realize efficient uplink feature aggregatio…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: This paper is accepted by IEEE Transactions on Communications on 08-Apr-2024

  16. arXiv:2403.17324  [pdf, ps, other]

    eess.SP

    Unsupervised Learning for Joint Beamforming Design in RIS-aided ISAC Systems

    Authors: Junjie Ye, Lei Huang, Zhen Chen, Peichang Zhang, Mohamed Rihan

    Abstract: It is critical to design efficient beamforming in reconfigurable intelligent surface (RIS)-aided integrated sensing and communication (ISAC) systems for enhancing spectrum utilization. However, conventional methods often have limitations, either incurring high computational complexity due to iterative algorithms or sacrificing performance when using heuristic methods. To achieve both low complexit…

    Submitted 15 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE Wireless Communications Letters

  17. arXiv:2403.13820  [pdf, other]

    cs.LG cs.CR eess.SP

    Identity information based on human magnetocardiography signals

    Authors: Pengju Zhang, Chenxi Sun, Jianwei Zhang, Hong Guo

    Abstract: We have developed an individual identification system based on magnetocardiography (MCG) signals captured using optically pumped magnetometers (OPMs). Our system utilizes pattern recognition to analyze the signals obtained at different positions on the body, by scanning the matrices composed of MCG signals with a 2*2 window. In order to make use of the spatial information of MCG signals, we transf…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 7 pages, 5 figures. Author manuscript accepted for AAAI 2024 Spring Symposium on Clinical Foundation Models

  18. arXiv:2403.12167  [pdf, other]

    eess.IV cs.CV

    Generalizing deep learning models for medical image classification

    Authors: Sarah Matta, Mathieu Lamard, Philippe Zhang, Alexandre Le Guilcher, Laurent Borderie, Béatrice Cochener, Gwenolé Quellec

    Abstract: Numerous Deep Learning (DL) models have been developed for a large spectrum of medical image analysis applications, which promises to reshape various facets of medical practice. Despite early advances in DL model validation and implementation, which encourage healthcare institutions to adopt them, some fundamental questions remain: are the DL models capable of generalizing? What causes a drop in D…

    Submitted 21 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  19. arXiv:2403.11667  [pdf, other]

    cs.CV eess.IV

    Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection

    Authors: Julia Wolleb, Florentin Bieder, Paul Friedrich, Peter Zhang, Alicia Durrer, Philippe C. Cattin

    Abstract: The high performance of denoising diffusion models for image generation has paved the way for their application in unsupervised medical anomaly detection. As diffusion-based methods require a lot of GPU memory and have long sampling times, we present a novel and fast unsupervised anomaly detection approach based on latent Bernoulli diffusion models. We first apply an autoencoder to compress the in…

    Submitted 18 March, 2024; originally announced March 2024.

  20. arXiv:2403.04594  [pdf, other]

    cs.SD eess.AS

    A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds

    Authors: Xuenan Xu, Xiaohang Xu, Zeyu Xie, **yue Zhang, Mengyue Wu, Kai Yu

    Abstract: Recently, there has been an increasing focus on audio-text cross-modal learning. However, most of the existing audio-text datasets contain only simple descriptions of sound events. Compared with classification labels, the advantages of such descriptions are significantly limited. In this paper, we first analyze the detailed information that human descriptions of audio may contain beyond sound even…

    Submitted 7 March, 2024; originally announced March 2024.

  21. arXiv:2403.03015  [pdf, other]

    cs.IT eess.SP

    Low Complexity Channel Estimation for RIS-Assisted THz Systems with Beam Split

    Authors: Xin Su, Ruisi He, Peng Zhang, Bo Ai

    Abstract: To support extremely high data rates, reconfigurable intelligent surface (RIS)-assisted terahertz (THz) communication is considered to be a promising technology for future sixth-generation networks. However, due to the typical employment of hybrid beamforming architecture in THz systems, as well as the passive nature of RIS which lacks the capability to process pilot signals, obtaining channel sta…

    Submitted 5 March, 2024; originally announced March 2024.

  22. arXiv:2402.17645  [pdf, other]

    cs.SD cs.AI cs.CL eess.AS

    SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation

    Authors: Shuangrui Ding, Zihan Liu, Xiaoyi Dong, Pan Zhang, Rui Qian, Conghui He, Dahua Lin, Jiaqi Wang

    Abstract: We present SongComposer, an innovative LLM designed for song composition. It could understand and generate melodies and lyrics in symbolic song representations, by leveraging the capability of LLM. Existing music-related LLM treated the music as quantized audio signals, while such implicit encoding leads to inefficient encoding and poor flexibility. In contrast, we resort to symbolic song represen…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: project page: https://pjlab-songcomposer.github.io/ code: https://github.com/pjlab-songcomposer/songcomposer

  23. arXiv:2402.16581  [pdf, other]

    eess.IV

    Rate Splitting Multiple Access-Enabled Adaptive Panoramic Video Semantic Transmission

    Authors: Haixiao Gao, Mengying Sun, Xiaodong Xu, Shujun Han, Bizhu Wang, **gxuan Zhang, ** Zhang

    Abstract: In this paper, we propose an adaptive panoramic video semantic transmission (APVST) framework enabled by rate splitting multiple access (RSMA). The APVST framework consists of a semantic transmitter and receiver, utilizing a deep joint source-channel coding structure to adaptively extract and encode semantic features from panoramic frames. To achieve higher spectral efficiency and conserve bandwid…

    Submitted 23 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  24. arXiv:2402.09709  [pdf, other]

    eess.IV

    ME-ViT: A Single-Load Memory-Efficient FPGA Accelerator for Vision Transformers

    Authors: Kyle Marino, Pengmiao Zhang, Viktor Prasanna

    Abstract: Vision Transformers (ViTs) have emerged as a state-of-the-art solution for object classification tasks. However, their computational demands and high parameter count make them unsuitable for real-time inference, prompting the need for efficient hardware implementations. Existing hardware accelerators for ViTs suffer from frequent off-chip memory access, restricting the achievable throughput by mem…

    Submitted 15 February, 2024; originally announced February 2024.

    ACM Class: C.3

  25. arXiv:2401.13980  [pdf, other]

    cs.IT eess.IV

    A Nearly Information Theoretically Secure Approach for Semantic Communications over Wiretap Channel

    Authors: Weixuan Chen, Shuo Shao, Qianqian Yang, Zhaoyang Zhang, ** Zhang

    Abstract: This paper addresses the challenge of achieving information-theoretic security in semantic communication (SeCom) over a wiretap channel, where a legitimate receiver coexists with an eavesdropper experiencing a poorer channel condition. Despite previous efforts to secure SeCom against eavesdroppers, achieving information-theoretic security in such schemes remains an open issue. In this work, we pro…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 13 pages, 16 figures

  26. arXiv:2401.10242  [pdf, other]

    cs.OH cs.GR cs.HC cs.SD eess.AS

    DanceMeld: Unraveling Dance Phrases with Hierarchical Latent Codes for Music-to-Dance Synthesis

    Authors: Xin Gao, Li Hu, Peng Zhang, Bang Zhang, Liefeng Bo

    Abstract: In the realm of 3D digital human applications, music-to-dance presents a challenging task. Given the one-to-many relationship between music and dance, previous methods have been limited in their approach, relying solely on matching and generating corresponding dance movements based on music rhythm. In the professional field of choreography, a dance phrase consists of several dance poses and dance…

    Submitted 30 November, 2023; originally announced January 2024.

    Comments: 10 pages, 8 figures

  27. arXiv:2401.05182  [pdf, other]

    cs.IT eess.SP

    Integrated Sensing and Communication with Reconfigurable Distributed Antenna and Reflecting Surface: Joint Beamforming and Mode Selection

    Authors: **** Zhang, **tao Wang, Yulin Shao, Shaodan Ma

    Abstract: This paper presents a new integrated sensing and communication (ISAC) framework, leveraging the recent advancements of reconfigurable distributed antenna and reflecting surface (RDARS). RDARS is a programmable surface structure comprising numerous elements, each of which can be flexibly configured to operate either in a reflection mode, resembling a passive reconfigurable intelligent surface (RIS)…

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 13 pages, 9 figures

  28. arXiv:2401.03615  [pdf, other]

    eess.IV cs.CV cs.LG

    Automated Detection of Myopic Maculopathy in MMAC 2023: Achievements in Classification, Segmentation, and Spherical Equivalent Prediction

    Authors: Yihao Li, Philippe Zhang, Yubo Tan, **g Zhang, Zhihan Wang, Weili Jiang, Pierre-Henri Conze, Mathieu Lamard, Gwenolé Quellec, Mostafa El Habib Daho

    Abstract: Myopic macular degeneration is the most common complication of myopia and the primary cause of vision loss in individuals with pathological myopia. Early detection and prompt treatment are crucial in preventing vision impairment due to myopic maculopathy. This was the focus of the Myopic Maculopathy Analysis Challenge (MMAC), in which we participated. In task 1, classification of myopic maculopath…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 18 pages

  29. arXiv:2401.01176  [pdf, other]

    cs.IT cs.LG eess.SP

    Fundamental Limitation of Semantic Communications: Neural Estimation for Rate-Distortion

    Authors: Dongxu Li, Jianhao Huang, Chuan Huang, Xiaoqi Qin, Han Zhang, ** Zhang

    Abstract: This paper studies the fundamental limit of semantic communications over the discrete memoryless channel. We consider the scenario to send a semantic source consisting of an observation state and its corresponding semantic state, both of which are recovered at the receiver. To derive the performance limitation, we adopt the semantic rate-distortion function (SRDF) to study the relationship among t…

    Submitted 2 January, 2024; originally announced January 2024.

  30. arXiv:2312.15593  [pdf, other]

    cs.SD cs.AI eess.AS

    DSNet: Disentangled Siamese Network with Neutral Calibration for Speech Emotion Recognition

    Authors: Chengxin Chen, Pengyuan Zhang

    Abstract: One persistent challenge in deep learning based speech emotion recognition (SER) is the unconscious encoding of emotion-irrelevant factors (e.g., speaker or phonetic variability), which limits the generalization of SER in practical use. In this paper, we propose DSNet, a Disentangled Siamese Network with neutral calibration, to meet the demand for a more robust and explainable SER model. Specifica…

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: 15 pages, 4 figures

  31. arXiv:2312.10287  [pdf, other]

    eess.SP

    Towards 6G Digital Twin Channel Using Radio Environment Knowledge Pool

    Authors: Jialin Wang, Jianhua Zhang, Yuxiang Zhang, Yutong Sun, Gaofeng Nie, Lianzheng Shi, ** Zhang, Guangyi Liu

    Abstract: The digital twin channel (DTC) is crucial for 6G wireless autonomous networks as it replicates the wireless channel fading states in 6G air interface transmissions. It is well known that the physical environment influences channels. A key task for accurately twinning channels in complex 6G scenarios is establishing precise relationships between the environment and the channels. In this article, th…

    Submitted 26 March, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

  32. arXiv:2312.08862  [pdf, other]

    cs.IT eess.SP

    Semantics-Division Duplexing: A Novel Full-Duplex Paradigm

    Authors: Kai Niu, Zijian Liang, Chao Dong, **cheng Dai, Zhongwei Si, ** Zhang

    Abstract: In-band full-duplex (IBFD) is a theoretically effective solution to increase the overall throughput for the future wireless communications system by enabling transmission and reception over the same time-frequency resources. However, reliable source reconstruction remains a great challenge in the practical IBFD systems due to the non-ideal elimination of the self-interference and the inherent limi…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures, submitted to IEEE Wireless Communications Magazine

  33. arXiv:2312.04014  [pdf, other]

    eess.SY

    Resilience-Assuring Hydrogen-Powered Microgrids

    Authors: Chaofan Lin, Peng Zhang, Xiaonan Lu

    Abstract: Green hydrogen has shown great potential to power microgrids as a primary source, yet the operation methodology under extreme events is still an open area. To fill this gap, this letter establishes an operational optimization strategy towards resilient hydrogen-powered microgrids, where the frequency and voltage regulation characteristics of hydrogen sources under advanced controls are accurately…

    Submitted 6 December, 2023; originally announced December 2023.

  34. arXiv:2312.03299  [pdf, other]

    cs.IT eess.SP

    Channel-Transferable Semantic Communications for Multi-User OFDM-NOMA Systems

    Authors: Lan Lin, Wenjun Xu, Fengyu Wang, Yimeng Zhang, Wei Zhang, ** Zhang

    Abstract: Semantic communications are expected to become the core new paradigms of the sixth generation (6G) wireless networks. Most existing works implicitly utilize channel information for codecs training, which leads to poor communications when channel type or statistical characteristics change. To tackle this issue posed by various channels, a novel channel-transferable semantic communications (CT-SemCo…

    Submitted 6 December, 2023; originally announced December 2023.

  35. arXiv:2312.03196  [pdf, other]

    cs.LG eess.SP

    Domain Invariant Representation Learning and Sleep Dynamics Modeling for Automatic Sleep Staging

    Authors: Seungyeon Lee, Thai-Hoang Pham, Zhao Cheng, ** Zhang

    Abstract: Sleep staging has become a critical task in diagnosing and treating sleep disorders to prevent sleep related diseases. With growing large scale sleep databases, significant progress has been made toward automatic sleep staging. However, previous studies face critical problems in sleep studies; the heterogeneity of subjects' physiological signals, the inability to extract meaningful information fro…

    Submitted 9 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

  36. arXiv:2312.02163  [pdf, other]

    cs.IT cs.PF eess.SP

    Cooperation Based Joint Active and Passive Sensing with Asynchronous Transceivers for Perceptive Mobile Networks

    Authors: Wangjun Jiang, Zhiqing Wei, Shaoshi Yang, Zhiyong Feng, ** Zhang

    Abstract: Perceptive mobile network (PMN) is an emerging concept for next-generation wireless networks capable of conducting integrated sensing and communication (ISAC). A major challenge for realizing high performance sensing in PMNs is how to deal with spatially separated asynchronous transceivers. Asynchronicity results in timing offsets (TOs) and carrier frequency offsets (CFOs), which further cause amb…

    Submitted 12 October, 2023; originally announced December 2023.

    Comments: 31 pages, 8 figures

  37. arXiv:2311.18186  [pdf, other]

    physics.med-ph eess.IV

    Material decomposition for dual-energy propagation-based phase-contrast CT

    Authors: Suyu Liao, Huitao Zhang, Peng Zhang, Yining Zhu

    Abstract: Material decomposition refers to using the energy dependence of material physical properties to differentiate materials in a sample, which is a very important application in computed tomography (CT). In propagation-based X-ray phase-contrast CT, the phase retrieval and reconstruction are always independent. Moreover, like in conventional CT, the material decomposition methods in this technique can…

    Submitted 29 November, 2023; originally announced November 2023.

  38. arXiv:2311.16433  [pdf, ps, other]

    eess.SP

    Energy Efficiency Optimization in Active Reconfigurable Intelligent Surface-Aided Integrated Sensing and Communication Systems

    Authors: Junjie Ye, Mohamed Rihan, Peichang Zhang, Lei Huang, Stefano Buzzi, Zhen Chen

    Abstract: Energy efficiency (EE) is a challenging task in integrated sensing and communication (ISAC) systems, where high spectral efficiency and low energy consumption appear as conflicting requirements. Although passive reconfigurable intelligent surface (RIS) has emerged as a promising technology for enhancing the EE of the ISAC system, the multiplicative fading feature hinders its effectiveness. This pa…

    Submitted 27 November, 2023; originally announced November 2023.

  39. arXiv:2311.13139  [pdf, ps, other]

    cs.IT eess.SP

    Joint Distributed Precoding and Beamforming for RIS-aided Cell-Free Massive MIMO Systems

    Authors: Peng Zhang, Jiayi Zhang, Huahua Xiao, Xiaodan Zhang, Derrick Wing Kwan Ng, Bo Ai

    Abstract: The amalgamation of cell-free networks and reconfigurable intelligent surface (RIS) has become a prospective technique for future sixth-generation wireless communication systems. In this paper, we focus on the precoding and beamforming design for a downlink RIS-aided cell-free network. The design is formulated as a non-convex optimization problem by jointly optimizing the combining vector, active…

    Submitted 21 November, 2023; originally announced November 2023.

  40. arXiv:2311.09462  [pdf, other]

    eess.SY

    Software-Defined Virtual Synchronous Condenser

    Authors: Zimin Jiang, Peng Zhang, Yifan Zhou, Łukasz Kocewiak, Divya Kurthakoti Chandrashekhara, Marie-Lou Picherit, Zefan Tang, Kenneth B. Bowes, Guangya Yang

    Abstract: Synchronous condensers (SCs) play important roles in integrating wind energy into relatively weak power grids. However, the design of SCs usually depends on specific application requirements and may not be adaptive enough to the frequently-changing grid conditions caused by the transition from conventional to renewable power generation. This paper devises a software-defined virtual synchronous con…

    Submitted 17 November, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

  41. arXiv:2311.06523  [pdf, other]

    cs.NI eess.SP

    Generative AI for Space-Air-Ground Integrated Networks (SAGIN)

    Authors: Ruichen Zhang, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, ** Zhang, Dong In Kim

    Abstract: Recently, generative AI technologies have emerged as a significant advancement in the artificial intelligence field, renowned for their language and image generation capabilities. Meanwhile, space-air-ground integrated network (SAGIN) is an integral part of future B5G/6G for achieving ubiquitous connectivity. Inspired by this, this article explores an integration of generative AI in SAGIN, focusing on…

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: 9 pages, 5 figures

  42. arXiv:2310.18630  [pdf, other]

    cs.IT eess.SP

    Joint Localization and Communication Enhancement in Uplink Integrated Sensing and Communications System with Clock Asynchronism

    Authors: Xu Chen, XinXin He, Zhiyong Feng, Zhiqing Wei, Qixun Zhang, Xin Yuan, ** Zhang

    Abstract: In this paper, we propose a joint single-base localization and communication enhancement scheme for the uplink (UL) integrated sensing and communications (ISAC) system with asynchronism, which can achieve accurate single-base localization of user equipment (UE) and significantly improve the communication reliability despite the existence of timing offset (TO) due to the clock asynchronism between…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: 13 pages, 11 figures, submitted to JSAC special issue "Positioning and Sensing Over Wireless Networks"

  43. arXiv:2310.12014  [pdf, other]

    eess.AS

    Enhancing Spoofing Speech Detection Using Rhythm Information

    Authors: **gze Lu, Yuxiang Zhang, Wenchao Wang, Zengqiang Shang, Pengyuan Zhang

    Abstract: Current spoofing speech detection systems need more convincing evidence. In this paper, the flaws of rhythm information inherent in the TTS-generated speech are analyzed to increase the reliability of detection systems. TTS models take text as input and utilize acoustic models to predict rhythm information, which introduces artifacts in the rhythm information. By filtering out vocal tract response…

    Submitted 25 November, 2023; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Five pages, two figures

  44. arXiv:2310.09078  [pdf, other]

    cs.NI eess.SP

    DNFS-VNE: Deep Neuro Fuzzy System Driven Virtual Network Embedding

    Authors: Ailing Xiao, Ning Chen, Sheng Wu, Peiying Zhang, Suzhi Cao, Chunxiao Jiang

    Abstract: By decoupling substrate resources, network virtualization (NV) is a promising solution for meeting diverse demands and ensuring differentiated quality of service (QoS). In particular, virtual network embedding (VNE) is a critical enabling technology that enhances the flexibility and scalability of network deployment by addressing the coupling of Internet processes and services. However, in the exi…

    Submitted 7 December, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

  45. arXiv:2310.08134  [pdf, other]

    eess.SP

    Sensing-assisted Accurate and Fast Beam Management for Cellular-connected mmWave UAV Network

    Authors: Yanpeng Cui, Qixun Zhang, Zhiyong Feng, Qin Wen, Ying Zhou, Zhiqing Wei, ** Zhang

    Abstract: Beam management, including initial access (IA) and beam tracking, is essential to the millimeter-wave Unmanned Aerial Vehicle (UAV) network. However, conventional communication-only and feedback-based schemes suffer a high delay and low accuracy of beam alignment since they only enable the receiver to passively hear the information of the transmitter from the radio domain. This paper presents a no…

    Submitted 12 October, 2023; originally announced October 2023.

  46. arXiv:2310.07180  [pdf, other]

    cs.NI eess.SP

    Integrated Sensing and Communication enabled Multiple Base Stations Cooperative Sensing Towards 6G

    Authors: Zhiqing Wei, Wangjun Jiang, Zhiyong Feng, Huici Wu, Ning Zhang, Kaifeng Han, Ruizhong Xu, ** Zhang

    Abstract: Driven by the intelligent applications of sixth-generation (6G) mobile communication systems such as smart city and autonomous driving, which connect the physical and cyber space, the integrated sensing and communication (ISAC) brings a revolutionary change to the base stations (BSs) of 6G by integrating radar sensing and communication in the same hardware and wireless resource. However, with the…

    Submitted 24 November, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 11 pages, 6 figures

    Journal ref: IEEE Network 2023

  47. arXiv:2310.06425  [pdf, other]

    eess.SY

    6G Wireless Communications in 7-24 GHz Band: Opportunities, Techniques, and Challenges

    Authors: Zhuangzhuang Cui, Peize Zhang, Sofie Pollin

    Abstract: The sixth generation (6G) wireless communication nowadays is seeking a new spectrum to inherit the pros and discard the cons of sub-6 GHz, millimeter-wave (mmWave), and sub-terahertz (THz) bands. To this end, an upper mid-band with a Frequency Range (FR) spanning from 7 GHz to 24 GHz, also known as FR3, has emerged as a focal point in 6G communications. Thus, as an inevitable prerequisite, a compr…

    Submitted 2 June, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 8 pages, 6 figures, 1 table, submitted to IEEE for potential publication

  48. arXiv:2310.06238  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG cs.MM cs.SD eess.AS

    Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering

    Authors: Xiulong Liu, Zhikang Dong, Peng Zhang

    Abstract: In recent years, there has been a growing emphasis on the intersection of audio, vision, and text modalities, driving forward the advancements in multimodal research. However, strong bias that exists in any modality can lead to the model neglecting the others. Consequently, the model's ability to effectively reason across these diverse modalities is compromised, impeding further advancement. In th…

    Submitted 9 October, 2023; originally announced October 2023.

  49. arXiv:2310.05444  [pdf, other]

    cs.IT eess.SP

    Waveform Design for MIMO-OFDM Integrated Sensing and Communication System: An Information Theoretical Approach

    Authors: Zhiqing Wei, **ghui Piao, Xin Yuan, Huici Wu, J. Andrew Zhang, Zhiyong Feng, Lin Wang, ** Zhang

    Abstract: Integrated sensing and communication (ISAC) is regarded as the enabling technology in the future 5th-Generation-Advanced (5G-A) and 6th-Generation (6G) mobile communication system. ISAC waveform design is critical in ISAC system. However, the difference of the performance metrics between sensing and communication brings challenges for the ISAC waveform design. This paper applies the unified perfor…

    Submitted 9 October, 2023; originally announced October 2023.

  50. arXiv:2310.03265  [pdf, other]

    cs.NI eess.SP

    Integrated Communication, Sensing, and Computation Framework for 6G Networks

    Authors: Xu Chen, Zhiyong Feng, J. Andrew Zhang, Zhaohui Yang, Xin Yuan, Xinxin He, ** Zhang

    Abstract: In the sixth generation (6G) era, intelligent machine network (IMN) applications, such as intelligent transportation, require collaborative machines with communication, sensing, and computation (CSC) capabilities. This article proposes an integrated communication, sensing, and computation (ICSAC) framework for 6G to achieve the reciprocity among CSC functions to enhance the reliability and latency…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 8 pages, 5 figures, submitted to IEEE VTM