Skip to main content

Showing 1–50 of 137 results for author: Yu, W

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19483  [pdf, ps, other

    eess.SP

    Localization in Multipath Environments via Active Sensing with Reconfigurable Intelligent Surfaces

    Authors: Yinghan Li, Wei Yu

    Abstract: This paper investigates an uplink pilot-based wireless indoor localization problem in a multipath environment for a single-input single-output (SISO) narrowband communication system aided by reconfigurable intelligent surface (RIS). The indoor localization problem is challenging because the uplink channel consists of multiple overlap** propagation paths with varying amplitudes and phases, which… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.18361  [pdf, other

    cs.CV cs.AI eess.IV

    Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process

    Authors: Tianyu Lin, Zhiguang Chen, Zhonghao Yan, Weijiang Yu, Fudan Zheng

    Abstract: Diffusion models have demonstrated their effectiveness across various generative tasks. However, when applied to medical image segmentation, these models encounter several challenges, including significant resource and time requirements. They also necessitate a multi-step reverse process and multiple samples to produce reliable predictions. To address these challenges, we introduce the first laten… ▽ More

    Submitted 27 June, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted at MICCAI 2024. Code and citation info see https://github.com/lin-tianyu/Stable-Diffusion-Seg

  3. arXiv:2406.07914  [pdf, other

    cs.SD eess.AS

    Can Large Language Models Understand Spatial Audio?

    Authors: Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Jun Zhang, Lu Lu, Zejun Ma, Yuxuan Wang, Chao Zhang

    Abstract: This paper explores enabling large language models (LLMs) to understand spatial information from multichannel audio, a skill currently lacking in auditory LLMs. By leveraging LLMs' advanced cognitive and inferential abilities, the aim is to enhance understanding of 3D environments via audio. We study 3 spatial audio tasks: sound source localization (SSL), far-field speech recognition (FSR), and lo… ▽ More

    Submitted 14 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  4. arXiv:2406.04737  [pdf, other

    eess.SP eess.SY

    Fast-Fading Channel and Power Optimization of the Magnetic Inductive Cellular Network

    Authors: Honglei Ma, Erwu Liu, Zhijun Fang, Rui Wang, Yongbin Gao, Wenjun Yu, Dongming Zhang

    Abstract: The cellular network of magnetic Induction (MI) communication holds promise in long-distance underground environments. In the traditional MI communication, there is no fast-fading channel since the MI channel is treated as a quasi-static channel. However, for the vehicle (mobile) MI (VMI) communication, the unpredictable antenna vibration brings the remarkable fast-fading. As such fast-fading cann… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE TWC for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  5. arXiv:2405.04867  [pdf, other

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhi**g Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Hai** Zeng, Kai Feng , et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  6. arXiv:2405.03129  [pdf, other

    eess.SP cs.IT

    Active Sensing for Multiuser Beam Tracking with Reconfigurable Intelligent Surface

    Authors: Han Han, Tao Jiang, Wei Yu

    Abstract: This paper studies a beam tracking problem in which an access point (AP), in collaboration with a reconfigurable intelligent surface (RIS), dynamically adjusts its downlink beamformers and the reflection pattern at the RIS in order to maintain reliable communications with multiple mobile user equipments (UEs). Specifically, the mobile UEs send uplink pilots to the AP periodically during the channe… ▽ More

    Submitted 31 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

  7. arXiv:2404.13640  [pdf, other

    cs.MM cs.CV eess.IV

    Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer

    Authors: Kepeng Xu, Li Xu, Gang He, Wenxin Yu, Yunsong Li

    Abstract: Multiple complex degradations are coupled in low-quality video faces in the real world. Therefore, blind video face restoration is a highly challenging ill-posed problem, requiring not only hallucinating high-fidelity details but also enhancing temporal coherence across diverse pose variations. Restoring each frame independently in a naive manner inevitably introduces temporal incoherence and arti… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 9 pages

  8. arXiv:2404.13392  [pdf, other

    cs.IT eess.SP math.OC

    Beamforming Design for Integrated Sensing and Communications Using Uplink-Downlink Duality

    Authors: Kareem M. Attiah, Wei Yu

    Abstract: This paper presents a novel optimization framework for beamforming design in integrated sensing and communication systems where a base station seeks to minimize the Bayesian Cramér-Rao bound of a sensing problem while satisfying quality of service constraints for the communication users. Prior approaches formulate the design problem as a semidefinite program for which acquiring a beamforming solut… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 6 pages, 2 figures, accepted at ISIT2024

  9. arXiv:2403.09004  [pdf, ps, other

    cs.IT eess.SP

    Meta-Learning-Based Fronthaul Compression for Cloud Radio Access Networks

    Authors: Ruihua Qiao, Tao Jiang, Wei Yu

    Abstract: This paper investigates the fronthaul compression problem in a user-centric cloud radio access network, in which single-antenna users are served by a central processor (CP) cooperatively via a cluster of remote radio heads (RRHs). To satisfy the fronthaul capacity constraint, this paper proposes a transform-compress-forward scheme, which consists of well-designed transformation matrices and unifor… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 15 Pages, 13 Figures; accepted in IEEE Transactions on Wireless Communications

  10. arXiv:2403.00134  [pdf, other

    cs.IT eess.SP

    Active Sensing for Reciprocal MIMO Channels

    Authors: Tao Jiang, Wei Yu

    Abstract: This paper addresses the design of transmit precoder and receive combiner matrices to support $N_{\rm s}$ independent data streams over a time-division duplex (TDD) point-to-point massive multiple-input multiple-output (MIMO) channel with either a fully digital or a hybrid structure. The optimal precoder and combiner design amounts to finding the top-$N_{\rm s}$ singular vectors of the channel mat… ▽ More

    Submitted 6 June, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: This paper is accepted in IEEE Transactions on Signal Processing

  11. Hybrid Online-Offline Learning for Task Offloading in Mobile Edge Computing Systems

    Authors: Muhammad Sohaib, Sang-Woon Jeon, Wei Yu

    Abstract: We consider a multi-user multi-server mobile edge computing (MEC) system, in which users arrive on a network randomly over time and generate computation tasks, which will be computed either locally on their own computing devices or be offloaded to one of the MEC servers. Under such a dynamic network environment, we propose a novel task offloading policy based on hybrid online-offline learning, whi… ▽ More

    Submitted 27 February, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: accepted by IEEE Transactions on Wireless Communications

    Journal ref: IEEE Transactions on Wireless Communications (2023)

  12. arXiv:2401.12025  [pdf, other

    cs.IT eess.SP math.OC

    A Survey of Recent Advances in Optimization Methods for Wireless Communications

    Authors: Ya-Feng Liu, Tsung-Hui Chang, Mingyi Hong, Zheyu Wu, Anthony Man-Cho So, Eduard A. Jorswieck, Wei Yu

    Abstract: Mathematical optimization is now widely regarded as an indispensable modeling and solution tool for the design of wireless communications systems. While optimization has played a significant role in the revolutionary progress in wireless communication and networking technologies from 1G to 5G and onto the future 6G, the innovations in wireless technologies have also substantially transformed the n… ▽ More

    Submitted 7 June, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: 39 pages, 5 figures, accepted for publication in IEEE Journal on Selected Areas in Communications

  13. Bayes-Optimal Unsupervised Learning for Channel Estimation in Near-Field Holographic MIMO

    Authors: Wentao Yu, Hengtao He, Xianghao Yu, Shenghui Song, Jun Zhang, Ross Murch, Khaled B. Letaief

    Abstract: Holographic MIMO (HMIMO) is being increasingly recognized as a key enabling technology for 6G wireless systems through the deployment of an extremely large number of antennas within a compact space to fully exploit the potentials of the electromagnetic (EM) channel. Nevertheless, the benefits of HMIMO systems cannot be fully unleashed without an efficient means to estimate the high-dimensional cha… ▽ More

    Submitted 15 June, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: 16 pages, 7 figures, 3 tables, accepted by IEEE Journal of Selected Topics in Signal Processing

  14. arXiv:2312.09002  [pdf, other

    cs.IT cs.LG eess.SP

    Localization with Reconfigurable Intelligent Surface: An Active Sensing Approach

    Authors: Zhongze Zhang, Tao Jiang, Wei Yu

    Abstract: This paper addresses an uplink localization problem in which a base station (BS) aims to locate a remote user with the help of reconfigurable intelligent surfaces (RISs). We propose a strategy in which the user transmits pilots sequentially and the BS adaptively adjusts the sensing vectors, including the BS beamforming vector and multiple RIS reflection coefficients based on the observations alrea… ▽ More

    Submitted 15 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted in IEEE Transactions on Wireless Communications. This is an extended version of the previous arXiv paper arXiv:2310.13160

  15. arXiv:2311.15299  [pdf, ps, other

    cs.IT eess.SP math.OC

    Covariance-Based Activity Detection in Cooperative Multi-Cell Massive MIMO: Scaling Law and Efficient Algorithms

    Authors: Ziyue Wang, Ya-Feng Liu, Zhaorui Wang, Wei Yu

    Abstract: This paper focuses on the covariance-based activity detection problem in a multi-cell massive multiple-input multiple-output (MIMO) system. In this system, active devices transmit their signature sequences to multiple base stations (BSs), and the BSs cooperatively detect the active devices based on the received signals. While the scaling law for the covariance-based activity detection in the singl… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: 54 pages, 11 figures, submitted for possible publication

  16. arXiv:2311.13626  [pdf, other

    eess.IV cs.AI cs.IR physics.optics

    Physics-driven generative adversarial networks empower single-pixel infrared hyperspectral imaging

    Authors: Dong-Yin Wang, Shu-Hang Bie, Xi-Hao Chen, Wen-Kai Yu

    Abstract: A physics-driven generative adversarial network (GAN) was established here for single-pixel hyperspectral imaging (HSI) in the infrared spectrum, to eliminate the extensive data training work required by traditional data-driven model. Within the GAN framework, the physical process of single-pixel imaging (SPI) was integrated into the generator, and the actual and estimated one-dimensional (1D) buc… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: 14 pages, 8 figures

  17. arXiv:2311.07908  [pdf, other

    eess.SP cs.IT

    Learning Bayes-Optimal Channel Estimation for Holographic MIMO in Unknown EM Environments

    Authors: Wentao Yu, Hengtao He, Xianghao Yu, Shenghui Song, Jun Zhang, Ross D. Murch, Khaled B. Letaief

    Abstract: Holographic MIMO (HMIMO) has recently been recognized as a promising enabler for future 6G systems through the use of an ultra-massive number of antennas in a compact space to exploit the propagation characteristics of the electromagnetic (EM) channel. Nevertheless, the promised gain of HMIMO could not be fully unleashed without an efficient means to estimate the high-dimensional channel. Bayes-op… ▽ More

    Submitted 4 February, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: 6 pages, 3 figures, 1 table, accepted for presentation at IEEE ICC 2024, Denver, CO, USA

  18. arXiv:2311.04546  [pdf, ps, other

    eess.SP cs.IT

    Discerning and Enhancing the Weighted Sum-Rate Maximization Algorithms in Communications

    Authors: Zepeng Zhang, Zi** Zhao, Kaiming Shen, Daniel P. Palomar, Wei Yu

    Abstract: Weighted sum-rate (WSR) maximization plays a critical role in communication system design. This paper examines three optimization methods for WSR maximization, which ensure convergence to stationary points: two block coordinate ascent (BCA) algorithms, namely, weighted sum-minimum mean-square error (WMMSE) and WSR maximization via fractional programming (WSR-FP), along with a minorization-maximiza… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  19. arXiv:2310.13289  [pdf, other

    cs.SD cs.CL eess.AS

    SALMONN: Towards Generic Hearing Abilities for Large Language Models

    Authors: Changli Tang, Wenyi Yu, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang

    Abstract: Hearing is arguably an essential ability of artificial intelligence (AI) agents in the physical world, which refers to the perception and understanding of general auditory information consisting of at least three types of sounds: speech, audio events, and music. In this paper, we propose SALMONN, a speech audio language music open neural network, built by integrating a pre-trained text-based large… ▽ More

    Submitted 8 April, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  20. arXiv:2310.13160  [pdf, other

    eess.SP cs.IT

    Active Sensing for Localization with Reconfigurable Intelligent Surface

    Authors: Zhongze Zhang, Tao Jiang, Wei Yu

    Abstract: This paper addresses an uplink localization problem in which the base station (BS) aims to locate a remote user with the aid of reconfigurable intelligent surface (RIS). This paper proposes a strategy in which the user transmits pilots over multiple time frames, and the BS adaptively adjusts the RIS reflection coefficients based on the observations already received so far in order to produce an ac… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted in IEEE International Conference on Communications (ICC) 2023

  21. arXiv:2310.05863  [pdf, other

    eess.AS cs.AI cs.CV cs.SD

    Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models

    Authors: Guangzhi Sun, Wenyi Yu, Changli Tang, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang

    Abstract: Audio-visual large language models (LLM) have drawn significant attention, yet the fine-grained combination of both input streams is rather under-explored, which is challenging but necessary for LLMs to understand general video inputs. To this end, a fine-grained audio-visual joint representation (FAVOR) learning framework for multimodal LLMs is proposed in this paper, which extends a text-based L… ▽ More

    Submitted 10 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  22. arXiv:2310.05021  [pdf, other

    eess.SY

    Toward Intelligent Emergency Control for Large-scale Power Systems: Convergence of Learning, Physics, Computing and Control

    Authors: Qiuhua Huang, Renke Huang, Tianzhixi Yin, Sohom Datta, Xueqing Sun, Jason Hou, Jie Tan, Wenhao Yu, Yuan Liu, Xinya Li, Bruce Palmer, Ang Li, Xinda Ke, Marianna Vaiman, Song Wang, Yousu Chen

    Abstract: This paper has delved into the pressing need for intelligent emergency control in large-scale power systems, which are experiencing significant transformations and are operating closer to their limits with more uncertainties. Learning-based control methods are promising and have shown effectiveness for intelligent power system control. However, when they are applied to large-scale power systems, t… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

    Comments: submitted to PSCC 2024

  23. arXiv:2310.03347  [pdf, other

    eess.SY

    Razumikhin-type ISS Lyapunov function and small gain theorem for discrete time time-delay systems with application to a biased min-consensus protocol

    Authors: Yuanqiu Mo, Wenwu Yu, Huazhou Hou, Soura Dasgupta

    Abstract: This paper considers small gain theorems for the global asymptotic and exponential input-to-state stability for discrete time time-delay systems using Razumikhin-type Lyapunov function. Among other things, unlike the existing literature, it provides both necessary and sufficient conditions for exponential input-to-state stability in terms of the Razumikhin-type Lyapunov function and the small gain… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  24. arXiv:2310.02086  [pdf, other

    eess.SY

    Bearing-Based Target Entrap** Control of Multiple Uncertain Agents With Arbitrary Maneuvers

    Authors: Haifan Su, Ziwen Yang, Shanying Zhu, Cailian Chen, Wenbin Yu

    Abstract: This paper is concerned with bearing-based cooperative target entrap** control of multiple uncertain agents with arbitrary maneuvers including shape deformation, rotations, scalings, etc. A leader-follower structure is used, where the leaders move with the predesigned trajectories, and the followers are steered by an estimation-based control method, integrating a distance estimator using bearing… ▽ More

    Submitted 6 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: 13 pages, 6 figures, the paper has been accepted by IFAC WC 2023

  25. arXiv:2309.13963  [pdf, other

    eess.AS cs.CL cs.SD

    Connecting Speech Encoder and Large Language Model for ASR

    Authors: Wenyi Yu, Changli Tang, Guangzhi Sun, Xianzhao Chen, Tian Tan, Wei Li, Lu Lu, Zejun Ma, Chao Zhang

    Abstract: The impressive capability and versatility of large language models (LLMs) have aroused increasing attention in automatic speech recognition (ASR), with several pioneering studies attempting to build integrated ASR models by connecting a speech encoder with an LLM. This paper presents a comparative study of three commonly used structures as connectors, including fully connected layers, multi-head c… ▽ More

    Submitted 26 September, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

  26. arXiv:2309.11717  [pdf, other

    eess.SP

    A class-weighted supervised contrastive learning long-tailed bearing fault diagnosis approach using quadratic neural network

    Authors: Wei-En Yu, **wei Sun, Shi** Zhang, Xiaoge Zhang, **g-Xiao Liao

    Abstract: Deep learning has achieved remarkable success in bearing fault diagnosis. However, its performance oftentimes deteriorates when dealing with highly imbalanced or long-tailed data, while such cases are prevalent in industrial settings because fault is a rare event that occurs with an extremely low probability. Conventional data augmentation methods face fundamental limitations due to the scarcity o… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  27. arXiv:2309.09575  [pdf, other

    eess.SP cs.IT

    AI-Native Transceiver Design for Near-Field Ultra-Massive MIMO: Principles and Techniques

    Authors: Wentao Yu, Yifan Ma, Hengtao He, Shenghui Song, Jun Zhang, Khaled B. Letaief

    Abstract: Ultra-massive multiple-input multiple-output (UMMIMO) is a cutting-edge technology that promises to revolutionize wireless networks by providing an unprecedentedly high spectral and energy efficiency. The enlarged array aperture of UM-MIMO facilitates the accessibility of the near-field region, thereby offering a novel degree of freedom for communications and sensing. Nevertheless, the transceiver… ▽ More

    Submitted 3 January, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 7 pages, 3 figures, 2 tables, magazine manuscript, submitted to IEEE for possible publication

  28. arXiv:2308.01802  [pdf, ps, other

    cs.IT eess.SP

    Multi-Carrier Modulation: An Evolution from Time-Frequency Domain to Delay-Doppler Domain

    Authors: Hai Lin, **hong Yuan, Wei Yu, **gxian Wu, Lajos Hanzo

    Abstract: The recently proposed orthogonal delay-Doppler division multiplexing (ODDM) modulation, which is based on the new delay-Doppler (DD) domain orthogonal pulse (DDOP), is studied. A substantial benefit of the DDOP-based ODDM or general delay-Doppler domain multi-carrier (DDMC) modulation is that it achieves orthogonality with respect to the fine time and frequency resolutions of the DD domain. We fir… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: This paper has been submitted to the IEEE for possible publication. The supplementary material of this work will be posted at https://www.omu.ac.jp/eng/ees-sic/oddm/

  29. arXiv:2307.16865  [pdf, other

    cs.CV eess.IV

    Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models

    Authors: Weikang Yu, Yonghao Xu, Pedram Ghamisi

    Abstract: Deep neural networks (DNNs) have risen to prominence as key solutions in numerous AI applications for earth observation (AI4EO). However, their susceptibility to adversarial examples poses a critical challenge, compromising the reliability of AI4EO algorithms. This paper presents a novel Universal Adversarial Defense approach in Remote Sensing Imagery (UAD-RS), leveraging pre-trained diffusion mod… ▽ More

    Submitted 27 May, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

  30. arXiv:2307.04327  [pdf

    cs.RO eess.SY

    Legal Decision-making for Highway Automated Driving

    Authors: Xiaohan Ma, Wenhao Yu, Chengxiang Zhao, Changjun Wang, Wenhui Zhou, Guangming Zhao, Mingyue Ma, Weida Wang, Lin Yang, Rui Mu, Hong Wang, Jun Li

    Abstract: Compliance with traffic laws is a fundamental requirement for human drivers on the road, and autonomous vehicles must adhere to traffic laws as well. However, current autonomous vehicles prioritize safety and collision avoidance primarily in their decision-making and planning, which will lead to misunderstandings and distrust from human drivers and may even result in accidents in mixed traffic flo… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: 14 pages, 17 figures

  31. arXiv:2307.03394  [pdf, other

    eess.IV cs.MM

    Towards Robust SDRTV-to-HDRTV via Dual Inverse Degradation Network

    Authors: Kepeng Xu, Li Xu, Gang He, Wenxin Yu, Yunsong Li

    Abstract: In this study, we address the emerging necessity of converting Standard Dynamic Range Television (SDRTV) content into High Dynamic Range Television (HDRTV) in light of the limited number of native HDRTV content. A principal technical challenge in this conversion is the exacerbation of coding artifacts inherent in SDRTV, which detrimentally impacts the quality of the resulting HDRTV. To address thi… ▽ More

    Submitted 14 January, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: 13 pages

  32. arXiv:2305.19301  [pdf, other

    eess.IV cs.CV cs.IT cs.LG

    On the Choice of Perception Loss Function for Learned Video Compression

    Authors: Sadaf Salehkalaibar, Buu Phan, Jun Chen, Wei Yu, Ashish Khisti

    Abstract: We study causal, low-latency, sequential video compression when the output is subjected to both a mean squared-error (MSE) distortion loss as well as a perception loss to target realism. Motivated by prior approaches, we consider two different perception loss functions (PLFs). The first, PLF-JD, considers the joint distribution (JD) of all the video frames up to the current one, while the second m… ▽ More

    Submitted 22 August, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  33. arXiv:2305.12423  [pdf, other

    eess.SP cs.IT eess.IV

    Task-Oriented Communication with Out-of-Distribution Detection: An Information Bottleneck Framework

    Authors: Hongru Li, Wentao Yu, Hengtao He, Jiawei Shao, Shenghui Song, Jun Zhang, Khaled B. Letaief

    Abstract: Task-oriented communication is an emerging paradigm for next-generation communication networks, which extracts and transmits task-relevant information, instead of raw data, for downstream applications. Most existing deep learning (DL)-based task-oriented communication systems adopt a closed-world scenario, assuming either the same data distribution for training and testing, or the system could hav… ▽ More

    Submitted 27 January, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: code available in github, accepted by IEEE GLOBECOM2023

  34. arXiv:2305.07130  [pdf, other

    cs.IT eess.SP

    Active Sensing for Two-Sided Beam Alignment and Reflection Design Using **-Pong Pilots

    Authors: Tao Jiang, Foad Sohrabi, Wei Yu

    Abstract: Beam alignment is an important task for millimeter-wave (mmWave) communication, because constructing aligned narrow beams both at the transmitter (Tx) and the receiver (Rx) is crucial in terms of compensating the significant path loss in very high-frequency bands. However, beam alignment is also a highly nontrivial task because large antenna arrays typically have a limited number of radio-frequenc… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: This paper is accepted in IEEE Journal on Selected Areas in Information Theory

  35. arXiv:2305.00273  [pdf, other

    cs.CV eess.IV

    Sparsity-Aware Optimal Transport for Unsupervised Restoration Learning

    Authors: Fei Wen, Wei Wang, Wenxian Yu

    Abstract: Recent studies show that, without any prior model, the unsupervised restoration learning problem can be optimally formulated as an optimal transport (OT) problem, which has shown promising performance on denoising tasks to approach the performance of supervised methods. However, it still significantly lags behind state-of-the-art supervised methods on complex restoration tasks such as super-resolu… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

    Comments: 15 pages, 9 figures

  36. FAN-Net: Fourier-Based Adaptive Normalization For Cross-Domain Stroke Lesion Segmentation

    Authors: Weiyi Yu, Yiming Lei, Hongming Shan

    Abstract: Since stroke is the main cause of various cerebrovascular diseases, deep learning-based stroke lesion segmentation on magnetic resonance (MR) images has attracted considerable attention. However, the existing methods often neglect the domain shift among MR images collected from different sites, which has limited performance improvement. To address this problem, we intend to change style informatio… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: Accepted by IEEE ICASSP 2023

    Journal ref: IEEE ICASSP 2023

  37. arXiv:2304.07748  [pdf

    eess.SY

    Online SOC Estimation of Lithium-ion Battery Based on Improved Adaptive H Infinity Extended Kalman Filter

    Authors: Jierui Wang, Wentao Yu, Guoyang Cheng, Lin Chen

    Abstract: For the battery management system of electric vehicle, accurate estimation of the State of Charge of Lithium-ion battery can effectively avoid structural damage caused by overcharge or over discharge inside the battery. Considering that the lithium-ion battery is a time-varying nonlinear system, which needs real-time State of Charge estimation, a joint algorithm of forgetting factor recursive leas… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

    Comments: 6 pages

  38. arXiv:2303.10823  [pdf, other

    eess.SP

    MF-JMoDL-Net: A Deep Network for Azimuth Undersampling Pattern Design and Ambiguity Suppression for Sparse SAR Imaging

    Authors: Yuwei Wu, Zhe Zhang, Xiaolan Qiu, Yao Zhao, Weidong Yu

    Abstract: repetition frequency (PRF). Given the system complexity and resource constraints, it is often difficult to achieve high imaging performance and low ambiguity without compromising the swath. In this paper, we propose a joint optimization framework for sparse strip SAR imaging algorithms and azimuth undersampling patterns based on a deep convolutional neural network, combined with matched filter (MF… ▽ More

    Submitted 19 March, 2023; originally announced March 2023.

  39. arXiv:2303.01193  [pdf, other

    cs.LG eess.SY math.NA

    Interpretable System Identification and Long-term Prediction on Time-Series Data

    Authors: Xiaoyi Liu, Duxin Chen, Wenjia Wei, Xia Zhu, Wenwu Yu

    Abstract: Time-series prediction has drawn considerable attention during the past decades fueled by the emerging advances of deep learning methods. However, most neural network based methods lack interpretability and fail in extracting the hidden mechanism of the targeted physical system. To overcome these shortcomings, an interpretable sparse system identification method without any prior knowledge is prop… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  40. arXiv:2302.12304  [pdf, other

    cs.LG cs.AI cs.NI eess.SP stat.ML

    Uncertainty Injection: A Deep Learning Method for Robust Optimization

    Authors: Wei Cui, Wei Yu

    Abstract: This paper proposes a paradigm of uncertainty injection for training deep learning model to solve robust optimization problems. The majority of existing studies on deep learning focus on the model learning capability, while assuming the quality and accuracy of the inputs data can be guaranteed. However, in realistic applications of deep learning for solving optimization problems, the accuracy of i… ▽ More

    Submitted 26 February, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: 13 pages, 7 figures. To appear in IEEE Transactions on Wireless Communications

  41. arXiv:2301.09198  [pdf, other

    eess.AS cs.SD

    Estimation of Source and Receiver Positions, Room Geometry and Reflection Coefficients From a Single Room Impulse Response

    Authors: Wangyang Yu, W. Bastiaan Kleijn

    Abstract: We propose an algorithm to estimate source and receiver positions, room geometry and reflection coefficients from a single room impulse response simultaneously. It is based on a symmetry analysis of the room impulse response. The proposed method utilizes the times of arrivals of the direct path, first order reflections and second order reflections. The proposed method is robust to erroneous pulses… ▽ More

    Submitted 22 January, 2023; originally announced January 2023.

  42. arXiv:2301.01474  [pdf, ps, other

    eess.SY cs.LG

    UAV aided Metaverse over Wireless Communications: A Reinforcement Learning Approach

    Authors: Peiyuan Si, Wenhan Yu, Jun Zhao, Kwok-Yan Lam, Qing Yang

    Abstract: Metaverse is expected to create a virtual world closely connected with reality to provide users with immersive experience with the support of 5G high data rate communication technique. A huge amount of data in physical world needs to be synchronized to the virtual world to provide immersive experience for users, and there will be higher requirements on coverage to include more users into Metaverse… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

  43. arXiv:2212.04156  [pdf, other

    eess.SY

    No driver, No Regulation? --Online Legal Driving Behavior Monitoring for Self-driving Vehicles

    Authors: Wenhao Yu, Chengxiang Zhao, Jiaxin Liu, Yingkai Yang, Xiaohan Ma, Jun Li, Weida Wang, Hong Wang, Ding Zhao, Xiaosong Hu

    Abstract: Defined traffic laws must be respected by all vehicles. However, it is essential to know which behaviors violate the current laws, especially when a responsibility issue is involved in an accident. This brings challenges of digitizing human-driver-oriented traffic laws and monitoring vehicles' behaviors continuously. To address these challenges, this paper aims to digitize traffic law comprehensiv… ▽ More

    Submitted 1 June, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: 22 pages, 11 figures

  44. arXiv:2212.02715  [pdf, other

    eess.SY cs.AI cs.LG math.OC

    Efficient Learning of Voltage Control Strategies via Model-based Deep Reinforcement Learning

    Authors: Ramij R. Hossain, Tianzhixi Yin, Yan Du, Renke Huang, Jie Tan, Wenhao Yu, Yuan Liu, Qiuhua Huang

    Abstract: This article proposes a model-based deep reinforcement learning (DRL) method to design emergency control strategies for short-term voltage stability problems in power systems. Recent advances show promising results in model-free DRL-based methods for power systems, but model-free methods suffer from poor sample efficiency and training time, both critical for making state-of-the-art DRL algorithms… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

  45. An Adaptive and Robust Deep Learning Framework for THz Ultra-Massive MIMO Channel Estimation

    Authors: Wentao Yu, Yifei Shen, Hengtao He, Xianghao Yu, Shenghui Song, Jun Zhang, Khaled B. Letaief

    Abstract: Terahertz ultra-massive MIMO (THz UM-MIMO) is envisioned as one of the key enablers of 6G wireless networks, for which channel estimation is highly challenging. Traditional analytical estimation methods are no longer effective, as the enlarged array aperture and the small wavelength result in a mixture of far-field and near-field paths, constituting a hybrid-field channel. Deep learning (DL)-based… ▽ More

    Submitted 8 June, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 15 pages, 11 figures, 5 tables, accepted by IEEE Journal of Selected Topics in Signal Processing (JSTSP)

  46. arXiv:2211.15079  [pdf, other

    cs.IT cs.LG eess.SP

    Lightweight and Flexible Deep Equilibrium Learning for CSI Feedback in FDD Massive MIMO

    Authors: Yifan Ma, Wentao Yu, Xianghao Yu, Jun Zhang, Shenghui Song, Khaled B. Letaief

    Abstract: In frequency-division duplexing (FDD) massive multiple-input multiple-output (MIMO) systems, downlink channel state information (CSI) needs to be sent back to the base station (BS) by the users, which causes prohibitive feedback overhead. In this paper, we propose a lightweight and flexible deep learning-based CSI feedback approach by capitalizing on deep equilibrium models. Different from existin… ▽ More

    Submitted 5 June, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: submitted to IEEE for possible publication

  47. arXiv:2211.07985  [pdf, ps, other

    eess.SP cs.IT

    Blind Performance Prediction for Deep Learning Based Ultra-Massive MIMO Channel Estimation

    Authors: Wentao Yu, Hengtao He, Xianghao Yu, Shenghui Song, Jun Zhang, Khaled B. Letaief

    Abstract: Reliability is of paramount importance for the physical layer of wireless systems due to its decisive impact on end-to-end performance. However, the uncertainty of prevailing deep learning (DL)-based physical layer algorithms is hard to quantify due to the black-box nature of neural networks. This limitation is a major obstacle that hinders their practical deployment. In this paper, we attempt to… ▽ More

    Submitted 5 February, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: 6 pages, 3 figures, 1 table, accepted by IEEE ICC 2023

  48. Scaling Law Analysis for Covariance Based Activity Detection in Cooperative Multi-Cell Massive MIMO

    Authors: Ziyue Wang, Ya-Feng Liu, Zhaorui Wang, Wei Yu

    Abstract: This paper studies the covariance based activity detection problem in a multi-cell massive multiple-input multiple-output (MIMO) system, where the active devices transmit their signature sequences to multiple base stations (BSs), and the BSs cooperatively detect the active devices based on the received signals. The scaling law of covariance based activity detection in the single-cell scenario has… ▽ More

    Submitted 14 March, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: 5 pages, 2 figures, accepted for publication in IEEE ICASSP 2023

  49. arXiv:2210.02596  [pdf, other

    cs.IT eess.SP

    Role of Deep Learning in Wireless Communications

    Authors: Wei Yu, Foad Sohrabi, Tao Jiang

    Abstract: Traditional communication system design has always been based on the paradigm of first establishing a mathematical model of the communication channel, then designing and optimizing the system according to the model. The advent of modern machine learning techniques, specifically deep neural networks, has opened up opportunities for data-driven system design and optimization. This article draws exam… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: 13 pages, 12 figures, To appear in IEEE BITS the Information Theory Magazine

  50. arXiv:2209.13425  [pdf, other

    cs.NI cs.LG eess.SP

    Resource Allocation for Mobile Metaverse with the Internet of Vehicles over 6G Wireless Communications: A Deep Reinforcement Learning Approach

    Authors: Terence Jie Chua, Wenhan Yu, Jun Zhao

    Abstract: Improving the interactivity and interconnectivity between people is one of the highlights of the Metaverse. The Metaverse relies on a core approach, digital twinning, which is a means to replicate physical world objects, people, actions and scenes onto the virtual world. Being able to access scenes and information associated with the physical world, in the Metaverse in real-time and under mobility… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: This paper appears in the Proceedings of 8th IEEE World Forum on the Internet of Things (WFIoT) 2022. Please feel free to contact us for questions or remarks