Skip to main content

Showing 1–50 of 77 results for author: Xia, Z

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.18009  [pdf, other

    eess.AS cs.SD

    E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS

    Authors: Sefik Emre Eskimez, Xiaofei Wang, Manthan Thakker, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, Hemin Yang, Zirun Zhu, Min Tang, Xu Tan, Yanqing Liu, Sheng Zhao, Naoyuki Kanda

    Abstract: This paper introduces Embarrassingly Easy Text-to-Speech (E2 TTS), a fully non-autoregressive zero-shot text-to-speech system that offers human-level naturalness and state-of-the-art speaker similarity and intelligibility. In the E2 TTS framework, the text input is converted into a character sequence with filler tokens. The flow-matching-based mel spectrogram generator is then trained based on the… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2406.16083  [pdf, other

    eess.IV cs.CV

    Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning

    Authors: Ruisheng Gao, Zeyu Xiao, Zhiwei Xiong

    Abstract: Transformer-based methods have demonstrated impressive performance in 4D light field (LF) super-resolution by effectively modeling long-range spatial-angular correlations, but their quadratic complexity hinders the efficient processing of high resolution 4D inputs, resulting in slow inference speed and high memory cost. As a compromise, most prior work adopts a patch-based strategy, which fails to… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 17 pages,7 figures

  3. arXiv:2406.09190  [pdf, other

    eess.SP

    Rethinking Waveform for 6G: Harnessing Delay-Doppler Alignment Modulation

    Authors: Zhiqiang Xiao, Xianda Liu, Yong Zeng, J. Andrew Zhang, Shi **, Rui Zhang

    Abstract: Waveform design has served as a cornerstone for each generation of mobile communication systems. The future sixth-generation (6G) mobile communication networks are expected to employ larger-scale antenna arrays and exploit higher-frequency bands for further boosting data transmission rate and providing ubiquitous wireless sensing. This brings new opportunities and challenges for 6G waveform design… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.04281  [pdf, other

    eess.AS

    Total-Duration-Aware Duration Modeling for Text-to-Speech Systems

    Authors: Sefik Emre Eskimez, Xiaofei Wang, Manthan Thakker, Chung-Hsien Tsai, Canrun Li, Zhen Xiao, Hemin Yang, Zirun Zhu, Min Tang, **yu Li, Sheng Zhao, Naoyuki Kanda

    Abstract: Accurate control of the total duration of generated speech by adjusting the speech rate is crucial for various text-to-speech (TTS) applications. However, the impact of adjusting the speech rate on speech quality, such as intelligibility and speaker characteristics, has been underexplored. In this work, we propose a novel total-duration-aware (TDA) duration model for TTS, where phoneme durations a… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  5. arXiv:2406.00604  [pdf, other

    eess.SP

    Multipath Exploitation for Fluctuating Target Detection in RIS-Assisted ISAC Systems

    Authors: Shoushuo Zhang, Zichao Xiao, Rang Liu, Ming Li, Wei Wang, Qian Liu

    Abstract: Integrated sensing and communication (ISAC) systems are typically deployed in multipath environments, which is usually deemed as a challenging issue for wireless communications. However, the multipath propagation can also provide extra illumination and observation perspectives for radar sensing, which offers spatial diversity gain for detecting targets with spatial radar cross-section (RCS) fluctu… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: submitted to IEEE WCL

  6. arXiv:2405.06971  [pdf, other

    eess.SY

    Controlling network-coupled neural dynamics with nonlinear network control theory

    Authors: Zhongye Xia, Weibin Li, Zhichao Liang, Kexin Lou, Quanying Liu

    Abstract: This paper addresses the problem of controlling the temporal dynamics of complex nonlinear network-coupled dynamical systems, specifically in terms of neurodynamics. Based on the Lyapunov direct method, we derive a control strategy with theoretical guarantees of controllability. To verify the performance of the derived control strategy, we perform numerical experiments on two nonlinear network-cou… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  7. arXiv:2404.15643  [pdf, ps, other

    cs.IT eess.SP

    Dynamic Beam Coverage for Satellite Communications Aided by Movable-Antenna Array

    Authors: Lipeng Zhu, Xiangyu Pi, Wenyan Ma, Zhenyu Xiao, Rui Zhang

    Abstract: Due to the ultra-dense constellation, efficient beam coverage and interference mitigation are crucial to low-earth orbit (LEO) satellite communication systems, while the conventional directional antennas and fixed-position antenna (FPA) arrays both have limited degrees of freedom (DoFs) in beamforming to adapt to the time-varying coverage requirement of terrestrial users. To address this challenge… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  8. arXiv:2403.19951  [pdf, ps, other

    eess.SP

    Fractional Delay Alignment Modulation for Spatially Sparse Wireless Communications

    Authors: Zhiwen Zhou, Zhiqiang Xiao, Yong Zeng

    Abstract: Delay alignment modulation (DAM) is a novel transmission technique for wireless systems with high spatial resolution by leveraging delay compensation and path-based beamforming, to mitigate the inter-symbol interference (ISI) without resorting to complex channel equalization or multi-carrier transmission. However, most existing studies on DAM consider a simplified scenario by assuming that the cha… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE WCNC 2024

  9. arXiv:2402.15185  [pdf, other

    cs.IT eess.SP

    Pre-Chirp-Domain Index Modulation for Affine Frequency Division Multiplexing

    Authors: Guangyao Liu, Tianqi Mao, Ruiqi Liu, Zhenyu Xiao

    Abstract: Affine frequency division multiplexing (AFDM), tailored as a novel multicarrier technique utilizing chirp signals for high-mobility communications, exhibits marked advantages compared to traditional orthogonal frequency division multiplexing (OFDM). AFDM is based on the discrete affine Fourier transform (DAFT) with two modifiable parameters of the chirp signals, termed as the pre-chirp parameter a… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  10. arXiv:2402.11445  [pdf, other

    eess.SY

    Balanced Truncation of Linear Systems with Quadratic Outputs in Limited Time and Frequency Intervals

    Authors: Qiu-Yan Song, Umair Zulfiqar, Zhi-Hua Xiao, Mohammad Monir Uddin, Victor Sreeram

    Abstract: Model order reduction involves constructing a reduced-order approximation of a high-order model while retaining its essential characteristics. This reduced-order model serves as a substitute for the original one in various applications such as simulation, analysis, and design. Often, there's a need to maintain high accuracy within a specific time or frequency interval, while errors beyond this lim… ▽ More

    Submitted 5 March, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

  11. arXiv:2402.07383  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

    Authors: Naoyuki Kanda, Xiaofei Wang, Sefik Emre Eskimez, Manthan Thakker, Hemin Yang, Zirun Zhu, Min Tang, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, Yufei Xia, **zhu Li, Yanqing Liu, Sheng Zhao, Michael Zeng

    Abstract: Laughter is one of the most expressive and natural aspects of human speech, conveying emotions, social cues, and humor. However, most text-to-speech (TTS) systems lack the ability to produce realistic and appropriate laughter sounds, limiting their applications and user experience. While there have been prior works to generate natural laughter, they fell short in terms of controlling the timing an… ▽ More

    Submitted 4 March, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: See https://aka.ms/elate/ for demo samples, v2: subjective evaluation has been added

  12. arXiv:2401.14612  [pdf, ps, other

    math.OC eess.SY

    On Inhomogeneous Infinite Products of Stochastic Matrices and Applications

    Authors: Zhaoyue Xia, Jun Du, Chunxiao Jiang, H. Vincent Poor, Zhu Han, Yong Ren

    Abstract: With the growth of magnitude of multi-agent networks, distributed optimization holds considerable significance within complex systems. Convergence, a pivotal goal in this domain, is contingent upon the analysis of infinite products of stochastic matrices (IPSMs). In this work, convergence properties of inhomogeneous IPSMs are investigated. The convergence rate of inhomogeneous IPSMs towards an abs… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  13. arXiv:2401.08974  [pdf, ps, other

    cs.IT eess.SP

    Performance Analysis and Optimization for Movable Antenna Aided Wideband Communications

    Authors: Lipeng Zhu, Wenyan Ma, Zhenyu Xiao, Rui Zhang

    Abstract: Movable antenna (MA) has emerged as a promising technology to enhance wireless communication performance by enabling the local movement of antennas at the transmitter (Tx) and/or receiver (Rx) for achieving more favorable channel conditions. As the existing studies on MA-aided wireless communications have mainly considered narrow-band transmission in flat fading channels, we investigate in this pa… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  14. arXiv:2312.17454  [pdf, ps, other

    cs.IT eess.SP

    Sparsity Exploitation via Joint Receive Processing and Transmit Beamforming Design for MIMO-OFDM ISAC Systems

    Authors: Zichao Xiao, Rang Liu, Ming Li, Wei Wang, Qian Liu

    Abstract: Integrated sensing and communication (ISAC) is widely recognized as a pivotal enabling technique for the advancement of future wireless networks. This paper aims to efficiently exploit the inherent sparsity of echo signals for the multi-input-multi-output (MIMO) orthogonal frequency division multiplexing (OFDM) based ISAC system. A novel joint receive echo processing and transmit beamforming desig… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 13 pages, 6 Figures, submitted to IEEE Trans

  15. arXiv:2312.15863  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning

    Authors: Hangyu Mao, Rui Zhao, Ziyue Li, Zhiwei Xu, Hao Chen, Yiqun Chen, Bin Zhang, Zhen Xiao, Junge Zhang, Jiang** Yin

    Abstract: Designing better deep networks and better reinforcement learning (RL) algorithms are both important for deep RL. This work studies the former. Specifically, the Perception and Decision-making Interleaving Transformer (PDiT) network is proposed, which cascades two Transformers in a very natural way: the perceiving one focuses on \emph{the environmental perception} by processing the observation at t… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: Proc. of the 23rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2024, full paper with oral presentation). Cover our preliminary study: arXiv:2212.14538

  16. arXiv:2312.06969  [pdf, ps, other

    cs.IT eess.SP

    Channel Estimation for Movable Antenna Communication Systems: A Framework Based on Compressed Sensing

    Authors: Zhenyu Xiao, Songqi Cao, Lipeng Zhu, Yanming Liu, Xiang-Gen Xia, Rui Zhang

    Abstract: Movable antenna (MA) is a new technology with great potential to improve communication performance by enabling local movement of antennas for pursuing better channel conditions. In particular, the acquisition of complete channel state information (CSI) between the transmitter (Tx) and receiver (Rx) regions is an essential problem for MA systems to reap performance gains. In this paper, we propose… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  17. arXiv:2312.05256  [pdf, other

    eess.IV cs.AI

    Holistic Evaluation of GPT-4V for Biomedical Imaging

    Authors: Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, **gyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang , et al. (25 additional authors not shown)

    Abstract: In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain. We assess GPT-4V's performance across 16 medical imaging categories, including radiology, oncology, ophthalmology, pathology, and mor… ▽ More

    Submitted 10 November, 2023; originally announced December 2023.

  18. arXiv:2311.05478  [pdf, other

    cs.CV eess.IV

    Robust Retraining-free GAN Fingerprinting via Personalized Normalization

    Authors: Jianwei Fei, Zhihua Xia, Benedetta Tondi, Mauro Barni

    Abstract: In recent years, there has been significant growth in the commercial applications of generative models, licensed and distributed by model developers to users, who in turn use them to offer services. In this scenario, there is a need to track and identify the responsible user in the presence of a violation of the license agreement or any kind of malicious usage. Although there are methods enabling… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  19. arXiv:2310.11326  [pdf, other

    cs.IT eess.SP

    Integrated Sensing and Channel Estimation by Exploiting Dual Timescales for Delay-Doppler Alignment Modulation

    Authors: Zhiqiang Xiao, Yong Zeng, Fuxi Wen, Zaichen Zhang, Derrick Wing Kwan Ng

    Abstract: For integrated sensing and communication (ISAC) systems, the channel information essential for communication and sensing tasks fluctuates across different timescales. Specifically, wireless sensing primarily focuses on acquiring path state information (PSI) (e.g., delay, angle, and Doppler) of individual multi-path components to sense the environment, which usually evolves much more slowly than th… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  20. arXiv:2308.09512  [pdf, other

    cs.IT eess.SP

    Multiuser Communications with Movable-Antenna Base Station: Joint Antenna Positioning, Receive Combining, and Power Control

    Authors: Zhenyu Xiao, Xiangyu Pi, Lipeng Zhu, Xiang-Gen Xia, Rui Zhang

    Abstract: Movable antenna (MA) is an emerging technology which enables a local movement of the antenna in the transmitter/receiver region for improving the channel condition and communication performance. In this paper, we study the deployment of multiple MAs at the base station (BS) for enhancing the multiuser communication performance. First, we model the multiuser channel in the uplink to characterize th… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2308.05546

  21. arXiv:2308.05546  [pdf, other

    cs.IT eess.SP

    Multiuser Communications with Movable-Antenna Base Station Via Antenna Position Optimization

    Authors: Xiangyu Pi, Lipeng Zhu, Zhenyu Xiao, Rui Zhang

    Abstract: This paper studies the deployment of multiple movable antennas (MAs) at the base station (BS) for enhancing the multiuser communication performance. First, we model the multiuser channel in the uplink to characterize the wireless channel variation caused by MAs' movement at the BS. Then, an optimization problem is formulated to maximize the minimum achievable rate among multiple users for MA-aided… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  22. arXiv:2308.04162  [pdf, other

    cs.CV eess.AS eess.IV

    EPCFormer: Expression Prompt Collaboration Transformer for Universal Referring Video Object Segmentation

    Authors: Jiajun Chen, Jiacheng Lin, Zhiqiang Xiao, Haolong Fu, Ke Nai, Kailun Yang, Zhiyong Li

    Abstract: Audio-guided Video Object Segmentation (A-VOS) and Referring Video Object Segmentation (R-VOS) are two highly-related tasks, which both aim to segment specific objects from video sequences according to user-provided expression prompts. However, due to the challenges in modeling representations for different modalities, contemporary methods struggle to strike a balance between interaction flexibili… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: The source code will be made publicly available at https://github.com/lab206/EPCFormer

  23. arXiv:2308.03387  [pdf, other

    eess.SP

    A Novel Joint Angle-Range-Velocity Estimation Method for MIMO-OFDM ISAC Systems

    Authors: Zichao Xiao, Rang Liu, Ming Li, Qian Liu

    Abstract: Integrated sensing and communications (ISAC) is emerging as a key technique for next-generation wireless systems. In order to expedite the practical implementation of ISAC within pervasive mobile networks, it is essential to equip widely-deployed base stations with radar sensing capabilities. Thus, the utilization of standardized multiple-input multiple-output (MIMO) orthogonal frequency division… ▽ More

    Submitted 28 October, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: 13 pages, 8 figures, submitted to IEEE Trans

  24. arXiv:2306.01598  [pdf, other

    cs.CV cs.RO eess.IV

    Towards Source-free Domain Adaptive Semantic Segmentation via Importance-aware and Prototype-contrast Learning

    Authors: Yihong Cao, Hui Zhang, Xiao Lu, Zheng Xiao, Kailun Yang, Yaonan Wang

    Abstract: Domain adaptive semantic segmentation enables robust pixel-wise understanding in real-world driving scenes. Source-free domain adaptation, as a more practical technique, addresses the concerns of data privacy and storage limitations in typical unsupervised domain adaptation methods, making it especially relevant in the context of intelligent vehicles. It utilizes a well-trained source model and un… ▽ More

    Submitted 26 March, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted to IEEE Transactions on Intelligent Vehicles (T-IV). The source code is publicly available at https://github.com/yihong-97/Source-free-IAPC

  25. arXiv:2305.18994  [pdf, other

    cs.CV eess.IV

    Toward Real-World Light Field Super-Resolution

    Authors: Zeyu Xiao, Ruisheng Gao, Yutong Liu, Yueyi Zhang, Zhiwei Xiong

    Abstract: Deep learning has opened up new possibilities for light field super-resolution (SR), but existing methods trained on synthetic datasets with simple degradations (e.g., bicubic downsampling) suffer from poor performance when applied to complex real-world scenarios. To address this problem, we introduce LytroZoom, the first real-world light field SR dataset capturing paired low- and high-resolution… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: CVPRW 2023

  26. arXiv:2305.18847  [pdf, other

    eess.SP

    Low-Range-Sidelobe Waveform Design for MIMO-OFDM ISAC Systems

    Authors: Peishi Li, Zichao Xiao, Ming Li, Rang Liu, Qian Liu

    Abstract: Integrated sensing and communication (ISAC) is a promising technology in future wireless systems owing to its efficient hardware and spectrum utilization. In this paper, we consider a multi-input multi-output (MIMO) orthogonal frequency division multiplexing (OFDM) ISAC system and propose a novel waveform design to provide better radar ranging performance by taking range sidelobe suppression into… ▽ More

    Submitted 23 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  27. arXiv:2305.11013  [pdf, other

    cs.SD cs.CL eess.AS

    FunASR: A Fundamental End-to-End Speech Recognition Toolkit

    Authors: Zhifu Gao, Zerui Li, Jiaming Wang, Haoneng Luo, Xian Shi, Mengzhe Chen, Yabin Li, Lingyun Zuo, Zhihao Du, Zhangyu Xiao, Shiliang Zhang

    Abstract: This paper introduces FunASR, an open-source speech recognition toolkit designed to bridge the gap between academic research and industrial applications. FunASR offers models trained on large-scale industrial corpora and the ability to deploy them in applications. The toolkit's flagship model, Paraformer, is a non-autoregressive end-to-end speech recognition model that has been trained on a manual… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 5 pages, 3 figures, accepted by INTERSPEECH 2023

  28. arXiv:2304.09620  [pdf

    eess.IV cs.CV

    DCELANM-Net:Medical Image Segmentation based on Dual Channel Efficient Layer Aggregation Network with Learner

    Authors: Chengzhun Lu, Zhangrun Xia, Krzysztof Przystupa, Orest Kochan, Jun Su

    Abstract: The DCELANM-Net structure, which this article offers, is a model that ingeniously combines a Dual Channel Efficient Layer Aggregation Network (DCELAN) and a Micro Masked Autoencoder (Micro-MAE). On the one hand, for the DCELAN, the features are more effectively fitted by deepening the network structure; the deeper network can successfully learn and fuse the features, which can more accurately loca… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  29. arXiv:2304.04267  [pdf, other

    eess.SP

    From Data-driven Learning to Physics-inspired Inferring: A Novel Mobile MIMO Channel Prediction Scheme Based on Neural ODE

    Authors: Zhuoran Xiao, Zhaoyang Zhang, Zirui Chen, Zhaohui Yang, Chongwen Huang, Xiaoming Chen

    Abstract: In this paper, we propose an innovative learning-based channel prediction scheme so as to achieve higher prediction accuracy and reduce the requirements of huge amounts and strict sequential format of channel data. Inspired by the idea of the neural ordinary differential equation (Neural ODE), we first prove that the channel prediction problem can be modeled as an ODE problem with a known initial… ▽ More

    Submitted 29 November, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

  30. arXiv:2303.06324  [pdf, other

    cs.DC eess.SY

    OCCL: a Deadlock-free Library for GPU Collective Communication

    Authors: Lichen Pan, Juncheng Liu, **hui Yuan, Rongkai Zhang, Pengze Li, Zhen Xiao

    Abstract: Various distributed deep neural network (DNN) training technologies lead to increasingly complicated use of collective communications on GPU. The deadlock-prone collectives on GPU force researchers to guarantee that collectives are enqueued in a consistent order on each GPU to prevent deadlocks. In complex distributed DNN training scenarios, manual hardcoding is the only practical way for deadlock… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

  31. arXiv:2303.05736  [pdf, ps, other

    cs.IT eess.SP

    Cramér-Rao Bounds for Near-Field Sensing with Extremely Large-Scale MIMO

    Authors: Huizhi Wang, Zhiqiang Xiao, Yong Zeng

    Abstract: Mobile communication networks were designed to mainly support ubiquitous wireless communications, yet they are also expected to achieve radio sensing capabilities in the near future. However, most prior studies on radio sensing usually rely on far-field assumption with uniform plane wave (UPW) models. With the ever-increasing antenna size, together with the growing demands to sense nearby targets,… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

  32. arXiv:2212.09930  [pdf, other

    eess.SY

    Frequency-limited H$_2$ Model Order Reduction Based on Relative Error

    Authors: Umair Zulfiqar, Xin Du, Qiuyan Song, Zhi-Hua Xiao, Victor Sreeram

    Abstract: Frequency-limited model order reduction aims to approximate a high-order model with a reduced-order model that maintains high fidelity within a specific frequency range. Beyond this range, a decrease in accuracy is acceptable due to the nature of the problem. The quality of the reduced-order model is typically evaluated using absolute or relative measures of approximation error. Relative error, wh… ▽ More

    Submitted 24 June, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: text overlap with arXiv:2212.08247

  33. arXiv:2212.08247  [pdf, other

    eess.SY

    Relative Error-based Time-limited H2 Model Order Reduction via Oblique Projection

    Authors: Umair Zulfiqar, Xin Du, Qiuyan Song, Zhi-Hua Xiao, Victor Sreeram

    Abstract: In time-limited model order reduction, a reduced-order approximation of the original high-order model is obtained that accurately approximates the original model within the desired limited time interval. Accuracy outside that time interval is not that important. The error incurred when a reduced-order model is used as a surrogate for the original model can be quantified in absolute or relative ter… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  34. arXiv:2212.05503  [pdf

    cs.CV eess.IV

    Low-rank Tensor Assisted K-space Generative Model for Parallel Imaging Reconstruction

    Authors: Wei Zhang, Zengwei Xiao, Hui Tao, Minghui Zhang, Xiaoling Xu, Qiegen Liu

    Abstract: Although recent deep learning methods, especially generative models, have shown good performance in fast magnetic resonance imaging, there is still much room for improvement in high-dimensional generation. Considering that internal dimensions in score-based generative models have a critical impact on estimating the gradient of the data distribution, we present a new idea, low-rank tensor assisted… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  35. arXiv:2212.05294  [pdf, ps, other

    cs.SD cs.IT eess.AS

    Variational Speech Waveform Compression to Catalyze Semantic Communications

    Authors: Shengshi Yao, Zixuan Xiao, Sixian Wang, **cheng Dai, Kai Niu, ** Zhang

    Abstract: We propose a novel neural waveform compression method to catalyze emerging speech semantic communications. By introducing nonlinear transform and variational modeling, we effectively capture the dependencies within speech frames and estimate the probabilistic distribution of the speech feature more accurately, giving rise to better compression performance. In particular, the speech signals are ana… ▽ More

    Submitted 13 December, 2022; v1 submitted 10 December, 2022; originally announced December 2022.

  36. arXiv:2212.00555  [pdf, other

    q-bio.NC cs.AI eess.IV

    A Structure-guided Effective and Temporal-lag Connectivity Network for Revealing Brain Disorder Mechanisms

    Authors: Zhengwang Xia, Tao Zhou, Saqib Mamoon, Amani Alfakih, Jianfeng Lu

    Abstract: Brain network provides important insights for the diagnosis of many brain disorders, and how to effectively model the brain structure has become one of the core issues in the domain of brain imaging analysis. Recently, various computational methods have been proposed to estimate the causal relationship (i.e., effective connectivity) between brain regions. Compared with traditional correlation-base… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  37. arXiv:2211.12671  [pdf, other

    eess.SP

    3-D Positioning and Resource Allocation for Multi-UAV Base Stations Under Blockage-Aware Channel Model

    Authors: Pengfei Yi, Lipeng Zhu, Zhenyu Xiao, Rui Zhang, Zhu Han, Xiang-Gen Xia

    Abstract: In this paper, we propose to deploy multiple unmanned aerial vehicle (UAV) mounted base stations to serve ground users in outdoor environments with obstacles. In particular, the geographic information is employed to capture the blockage effects for air-to-ground (A2G) links caused by buildings, and a realistic blockage-aware A2G channel model is proposed to characterize the continuous variation of… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

  38. arXiv:2211.02283  [pdf, ps, other

    cs.SD cs.IT eess.AS

    Wireless Deep Speech Semantic Transmission

    Authors: Zixuan Xiao, Shengshi Yao, **cheng Dai, Sixian Wang, Kai Niu, ** Zhang

    Abstract: In this paper, we propose a new class of high-efficiency semantic coded transmission methods for end-to-end speech transmission over wireless channels. We name the whole system as deep speech semantic transmission (DSST). Specifically, we introduce a nonlinear transform to map the speech source to semantic latent space and feed semantic features into source-channel encoder to generate the channel-… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  39. arXiv:2209.08456  [pdf, other

    eess.SP cs.AI cs.IT cs.LG

    Deep Learning-Based Rate-Splitting Multiple Access for Reconfigurable Intelligent Surface-Aided Tera-Hertz Massive MIMO

    Authors: Minghui Wu, Zhen Gao, Yang Huang, Zhenyu Xiao, Derrick Wing Kwan Ng, Zhaoyang Zhang

    Abstract: Reconfigurable intelligent surface (RIS) can significantly enhance the service coverage of Tera-Hertz massive multiple-input multiple-output (MIMO) communication systems. However, obtaining accurate high-dimensional channel state information (CSI) with limited pilot and feedback signaling overhead is challenging, severely degrading the performance of conventional spatial division multiple access.… ▽ More

    Submitted 5 December, 2022; v1 submitted 17 September, 2022; originally announced September 2022.

    Comments: Accepted by IEEE Journal on Selected Areas in Communications

  40. arXiv:2208.03055  [pdf, ps, other

    cs.IT eess.SP

    Joint Beamforming Design in DFRC Systems for Wideband Sensing and OFDM Communications

    Authors: Zichao Xiao, Rang Liu, Ming Li, Yang Liu, Qian Liu

    Abstract: Dual-function radar-communication (DFRC) systems, which can efficiently utilize the congested spectrum and costly hardware resources by employing one common waveform for both sensing and communication (S&C), have attracted increasing attention. While the orthogonal frequency division multiplexing (OFDM) technique has been widely adopted to support high-quality communications, it also has great pot… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

    Comments: 6pages, 3 figures, accepted by Globecom

  41. arXiv:2208.00837  [pdf

    eess.SP

    The feasibility of Q-band millimeter wave on hand-gesture recognition for indoor FTTR scenario

    Authors: Yuxuan Hu, Zhaoyang Xia, Yanbo Zhao, Feng Xu

    Abstract: The generalization for different scenarios and dif-ferent users is an urgent problem for millimeter wave gesture recognition for indoor fiber-to-the-room (FTTR) scenario. In order to solve this problem and verify the feasibility of FTTR Q-band millimeter wave in gesture recognition, we build a real-time millimeter wave gesture recognition system. The moving hand-gestures are represented as a varie… ▽ More

    Submitted 5 July, 2023; v1 submitted 22 July, 2022; originally announced August 2022.

  42. arXiv:2207.11945  [pdf, other

    eess.SP

    Terahertz-Band Near-Space Communications: From a Physical-Layer Perspective

    Authors: Tianqi Mao, Leyi Zhang, Zhenyu Xiao, Zhu Han, Xiang-Gen Xia

    Abstract: Facilitated by rapid technological development of the near-space platform stations (NSPS), near-space communication (NS-COM) is envisioned to play a pivotal role in the space-air-ground integrated network for sixth-generation (6G) communications and beyond. In NS-COM, ultra-broadband wireless connectivity between NSPSs and various airborne/spaceborne platforms is required for a plethora of bandwid… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

  43. arXiv:2207.11896  [pdf, ps, other

    eess.SP

    LEO Satellite Access Network (LEO-SAN) Towards 6G: Challenges and Approaches

    Authors: Zhenyu Xiao, Junyi Yang, Tianqi Mao, Chong Xu, Rui Zhang, Zhu Han, Xiang-Gen Xia

    Abstract: With the rapid development of satellite communication technologies, the space-based access network has been envisioned as a promising complementary part of the future 6G network. Aside from terrestrial base stations, satellite nodes, especially the low-earth-orbit (LEO) satellites, can also serve as base stations for Internet access, and constitute the LEO-satellite-based access network (LEO-SAN).… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

  44. arXiv:2207.11883  [pdf, other

    eess.SP

    Near Space Communications (NS-COM): A New Regime in Space-Air-Ground Integrated Network (SAGIN)

    Authors: Zhenyu Xiao, Tianqi Mao, Zhu Han, Xiang-Gen Xia

    Abstract: Precipitated by the technological innovations of the near-space platform stations (NSPS), the near space communication (NS-COM) network has emerged as an indispensable part of the next-generation space-air-ground integrated network (SAGIN) that facilitates ubiquitous coverage and broadband data transfer. This paper aims to provide a comprehensive overview of NS-COM. Firstly, we investigate the dif… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

  45. arXiv:2207.10969  [pdf, ps, other

    eess.SP

    Convergence Theory of Generalized Distributed Subgradient Method with Random Quantization

    Authors: Zhaoyue Xia, Jun Du, Yong Ren

    Abstract: The distributed subgradient method (DSG) is a widely discussed algorithm to cope with large-scale distributed optimization problems in the arising machine learning applications. Most exisiting works on DSG focus on ideal communication between the cooperative agents such that the shared information between agents is exact and perfect. This assumption, however, could lead to potential privacy concer… ▽ More

    Submitted 23 August, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

  46. arXiv:2207.03736  [pdf, other

    eess.SP

    Mobile MIMO Channel Prediction with ODE-RNN: a Physics-Inspired Adaptive Approach

    Authors: Zhuoran Xiao, Zhaoyang Zhang, Zirui Chen, Zhaohui Yang, Richeng **

    Abstract: Obtaining accurate channel state information (CSI) is crucial and challenging for multiple-input multiple-output (MIMO) wireless communication systems. Conventional channel estimation method cannot guarantee the accuracy of mobile CSI while requires high signaling overhead. Through exploring the intrinsic correlation among a set of historical CSI instances randomly obtained in a certain communicat… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: 7 pages, conference

  47. arXiv:2207.03647  [pdf, ps, other

    eess.SP cs.IT

    Integrated Sensing and Communication with Delay Alignment Modulation: Performance Analysis and Beamforming Optimization

    Authors: Zhiqiang Xiao, Yong Zeng

    Abstract: Delay alignment modulation (DAM) has been recently proposed to enable manipulable channel delay spread for efficient single- or multi-carrier communications. In particular, with perfect delay alignment, inter-symbol interference (ISI) can be eliminated even with single-carrier (SC) transmission, without relying on sophisticated channel equalization. The key ideas of DAM are delay pre-compensation… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  48. arXiv:2206.03714  [pdf, ps, other

    eess.SP

    Integrated Sensing and Communication with Delay Alignment Modulation

    Authors: Zhiqiang Xiao, Yong Zeng

    Abstract: Delay alignment modulation (DAM) has been recently proposed to enable inter-symbol interference (ISI)-free single-carrier (SC) communication without relying on sophisticated channel equalization. The key idea of DAM is to pre-introduce deliberate symbol delays at the transmitter side, so that all multi-path signal components may arrive at the receiver simultaneously and be superimposed constructiv… ▽ More

    Submitted 8 June, 2022; originally announced June 2022.

  49. arXiv:2205.00891  [pdf, ps, other

    cs.IT eess.SP

    Low-Complexity Designs of Symbol-Level Precoding for MU-MISO Systems

    Authors: Zichao Xiao, Rang Liu, Ming Li, Yang Liu, Qian Liu

    Abstract: Symbol-level precoding (SLP), which converts the harmful multi-user interference (MUI) into beneficial signals, can significantly improve symbol-error-rate (SER) performance in multi-user communication systems. While enjoying symbolic gain, however, the complicated non-linear symbol-by-symbol precoder design suffers high computational complexity exponential with the number of users, which is unaff… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: 15 pages, 10 figures, submitted to IEEE

  50. arXiv:2203.10726  [pdf, other

    eess.IV cs.CV

    TransFusion: Multi-view Divergent Fusion for Medical Image Segmentation with Transformers

    Authors: Di Liu, Yunhe Gao, Qilong Zhangli, Ligong Han, Xiaoxiao He, Zhaoyang Xia, Song Wen, Qi Chang, Zhennan Yan, Mu Zhou, Dimitris Metaxas

    Abstract: Combining information from multi-view images is crucial to improve the performance and robustness of automated methods for disease diagnosis. However, due to the non-alignment characteristics of multi-view images, building correlation and data fusion across views largely remain an open problem. In this study, we present TransFusion, a Transformer-based architecture to merge divergent multi-view im… ▽ More

    Submitted 5 September, 2022; v1 submitted 21 March, 2022; originally announced March 2022.