
Showing 1–50 of 413 results for author: Xue, J

Searching in archive eess.
  1. arXiv:2406.18538 [pdf, other]

    cs.CV cs.AI eess.IV

    VideoQA-SC: Adaptive Semantic Communication for Video Question Answering

    Authors: Jiangyuan Guo, Wei Chen, Yuxuan Sun, Jialong Xu, Bo Ai

    Abstract: Although semantic communication (SC) has shown its potential in efficiently transmitting multi-modal data such as text, speech and images, SC for videos has focused primarily on pixel-level reconstruction. However, these SC systems may be suboptimal for downstream intelligent tasks. Moreover, SC systems without pixel-level video reconstruction present advantages by achieving higher bandwidth eff…

    Submitted 17 May, 2024; originally announced June 2024.

  2. arXiv:2406.16946 [pdf, ps, other]

    eess.SP

    Networked ISAC for Low-Altitude Economy: Coordinated Transmit Beamforming and UAV Trajectory Design

    Authors: Gaoyuan Cheng, Xianxin Song, Zhonghao Lyu, Jie Xu

    Abstract: This paper exploits the networked integrated sensing and communications (ISAC) to support low-altitude economy (LAE), in which a set of networked ground base stations (GBSs) cooperatively transmit joint information and sensing signals to communicate with multiple authorized unmanned aerial vehicles (UAVs) and concurrently detect unauthorized objects over the interested region in the three-dimensio…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2405.07568

  3. arXiv:2406.16868 [pdf, other]

    eess.SP cs.AI

    Neural Network-based Two-Dimensional Filtering for OTFS Symbol Detection

    Authors: Jiarui Xu, Karim Said, Lizhong Zheng, Lingjia Liu

    Abstract: Orthogonal time frequency space (OTFS) is a promising modulation scheme for wireless communication in high-mobility scenarios. Recently, a reservoir computing (RC) based approach has been introduced for online subframe-based symbol detection in the OTFS system, where only the limited over-the-air (OTA) pilot symbols are utilized for training. However, the previous RC-based approach does not design…

    Submitted 8 March, 2024; originally announced June 2024.

    Comments: 6 pages, conference paper. arXiv admin note: substantial text overlap with arXiv:2311.08543

  4. arXiv:2406.14973 [pdf, other]

    cs.CV eess.IV

    LU2Net: A Lightweight Network for Real-time Underwater Image Enhancement

    Authors: Haodong Yang, Jisheng Xu, Zhiliang Lin, Jianping He

    Abstract: Computer vision techniques have empowered underwater robots to effectively undertake a multitude of tasks, including object tracking and path planning. However, underwater optical factors like light refraction and absorption present challenges to underwater vision, which cause degradation of underwater images. A variety of underwater image enhancement methods have been proposed to improve the effe…

    Submitted 21 June, 2024; originally announced June 2024.

  5. arXiv:2406.14664 [pdf, ps, other]

    eess.SP

    Experimental Validation of Cooperative RSS-based Localization with Unknown Transmit Power, Path Loss Exponent, and Precise Anchor Location

    Authors: Yingquan Li, Bodhibrata Mukhopadhyay, Jiajie Xu, Mohamed-Slim Alouini

    Abstract: Received signal strength (RSS)-based cooperative localization has gained significant attention due to its straightforward system architectures and cost-effectiveness. In this paper, we propose Cooperative Localization Techniques (with Unknown Parameters), referred to as CTUP(s), which consider uncertainty in anchor nodes' locations and assume the transmit power and path loss expo…

    Submitted 20 June, 2024; originally announced June 2024.

  6. arXiv:2406.14092 [pdf, other]

    cs.CL eess.AS

    Seamless Language Expansion: Enhancing Multilingual Mastery in Self-Supervised Models

    Authors: Jing Xu, Minglin Wu, Xixin Wu, Helen Meng

    Abstract: Self-supervised (SSL) models have shown great performance in various downstream tasks. However, they are typically developed for limited languages, and may encounter new languages in real-world. Developing a SSL model for each new language is costly. Thus, it is vital to figure out how to efficiently adapt existing SSL models to a new language without impairing its original abilities. We propose ad…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  7. arXiv:2406.12426 [pdf, other]

    cs.IT eess.SP

    Multi-Active-IRS-Assisted Cooperative Sensing: Cramér-Rao Bound and Joint Beamforming Design

    Authors: Yuan Fang, Xianghao Yu, Jie Xu, Ying-Jun Angela Zhang

    Abstract: This paper studies the multi-intelligent reflecting surface (IRS)-assisted cooperative sensing, in which multiple active IRSs are deployed in a distributed manner to facilitate multi-view target sensing at the non-line-of-sight (NLoS) area of the base station (BS). Different from prior works employing passive IRSs, we leverage active IRSs with the capability of amplifying the reflected signals to…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2404.13536

  8. Simultaneously Transmitting and Reflecting Surfaces for Ubiquitous Next Generation Multiple Access in 6G and Beyond

    Authors: Xidong Mu, Jiaqi Xu, Zhaolin Wang, Naofal Al-Dhahir

    Abstract: The ultimate goal of next generation multiple access (NGMA) is to support massive terminals and facilitate multiple functionalities over the limited radio resources of wireless networks in the most efficient manner possible. However, the random and uncontrollable wireless radio environment is a major obstacle to realizing this NGMA vision. Given the prominent feature of achieving 360° smart radio…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 25 pages, 18 figures, 7 tables

  9. arXiv:2406.10276 [pdf, other]

    cs.CL cs.SD eess.AS

    Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation

    Authors: Peidong Wang, Jian Xue, Jinyu Li, Junkun Chen, Aswin Shanmugam Subramanian

    Abstract: Language-agnostic many-to-one end-to-end speech translation models can convert audio signals from different source languages into text in a target language. These models do not need source language identification, which improves user experience. In some cases, the input language can be given or estimated. Our goal is to use this additional language information while preserving the quality of the o…

    Submitted 11 June, 2024; originally announced June 2024.

  10. arXiv:2406.10236 [pdf, other]

    eess.IV cs.AI

    Lightening Anything in Medical Images

    Authors: Ben Fei, Yixuan Li, Weidong Yang, Hengjun Gao, Jingyi Xu, Lipeng Ma, Yatian Yang, **hong Zhou

    Abstract: The development of medical imaging techniques has made a significant contribution to clinical decision-making. However, the existence of suboptimal imaging quality, as indicated by irregular illumination or imbalanced intensity, presents significant obstacles in automating disease screening, analysis, and diagnosis. Existing approaches for natural image enhancement are mostly trained with numerous…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 23 pages, 6 figures

  11. arXiv:2406.09664 [pdf, other]

    cs.SD eess.AS

    Frequency-mix Knowledge Distillation for Fake Speech Detection

    Authors: Cunhang Fan, Shunbo Dong, Jun Xue, Yujie Chen, Jiangyan Yi, Zhao Lv

    Abstract: In the telephony scenarios, the fake speech detection (FSD) task to combat speech spoofing attacks is challenging. Data augmentation (DA) methods are considered effective means to address the FSD task in telephony scenarios, typically divided into time domain and frequency domain stages. While each has its advantages, both can result in information loss. To tackle this issue, we propose a novel DA…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  12. arXiv:2406.07662 [pdf, other]

    eess.IV cs.AI cs.CV cs.LG q-bio.NC

    Progress Towards Decoding Visual Imagery via fNIRS

    Authors: Michel Adamic, Wellington Avelino, Anna Brandenberger, Bryan Chiang, Hunter Davis, Stephen Fay, Andrew Gregory, Aayush Gupta, Raphael Hotter, Grace Jiang, Fiona Leng, Stephen Polcyn, Thomas Ribeiro, Paul Scotti, Michelle Wang, Marley Xiong, Jonathan Xu

    Abstract: We demonstrate the possibility of reconstructing images from fNIRS brain activity and start building a prototype to match the required specs. By training an image reconstruction model on downsampled fMRI data, we discovered that cm-scale spatial resolution is sufficient for image generation. We obtained 71% retrieval accuracy with 1-cm resolution, compared to 93% on the full-resolution fMRI, and 2…

    Submitted 22 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  13. arXiv:2406.06086 [pdf, other]

    cs.SD eess.AS

    RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection

    Authors: Yujie Chen, Jiangyan Yi, Jun Xue, Chenglong Wang, Xiaohui Zhang, Shunbo Dong, Siding Zeng, Jianhua Tao, Lv Zhao, Cunhang Fan

    Abstract: Fake artefacts for discriminating between bonafide and fake audio can exist in both short- and long-range segments. Therefore, combining local and global feature information can effectively discriminate between bonafide and fake audio. This paper proposes an end-to-end bidirectional state space model, named RawBMamba, to capture both short- and long-range discriminative information for audio deepf…

    Submitted 18 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  14. arXiv:2406.03714 [pdf, other]

    cs.SD eess.AS

    Retrieval Augmented Generation in Prompt-based Text-to-Speech Synthesis with Context-Aware Contrastive Language-Audio Pretraining

    Authors: Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li

    Abstract: Recent prompt-based text-to-speech (TTS) models can clone an unseen speaker using only a short speech prompt. They leverage a strong in-context ability to mimic the speech prompts, including speaker style, prosody, and emotion. Therefore, the selection of a speech prompt greatly influences the generated speech, akin to the importance of a prompt in large language models (LLMs). However, current pr…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  15. arXiv:2406.03706 [pdf, other]

    cs.SD cs.CL eess.AS

    Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model

    Authors: Jinlong Xue, Yayue Deng, Yicheng Han, Yingming Gao, Ya Li

    Abstract: Recent advances in large language models (LLMs) and development of audio codecs greatly propel the zero-shot TTS. They can synthesize personalized speech with only a 3-second speech of an unseen speaker as acoustic prompt. However, they only support short speech prompts and cannot leverage longer context information, as required in audiobook and conversational TTS scenarios. In this paper, we intr…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  16. arXiv:2406.01922 [pdf, ps, other]

    eess.SP cs.IT

    Performance Analysis of Hybrid Cellular and Cell-free MIMO Network

    Authors: Zhuoyin Dai, Jingran Xu, Xiaoli Xu, Ruoguang Li, Yong Zeng

    Abstract: Cell-free wireless communication is envisioned as one of the most promising network architectures, which can achieve stable and uniform communication performance while improving the system energy and spectrum efficiency. The deployment of cell-free networks is envisioned to be a long-term evolutionary process, in which cell-free access points (APs) will be gradually introduced into the communicatio…

    Submitted 3 June, 2024; originally announced June 2024.

  17. arXiv:2405.17329 [pdf, other]

    cs.IT eess.SP

    Joint MIMO Transceiver and Reflector Design for Reconfigurable Intelligent Surface-Assisted Communication

    Authors: Yaqiong Zhao, Jindan Xu, Wei Xu, Kezhi Wang, Xinquan Ye, Chau Yuen, Xiaohu You

    Abstract: In this paper, we consider a reconfigurable intelligent surface (RIS)-assisted multiple-input multiple-output communication system with multiple antennas at both the base station (BS) and the user. We plan to maximize the achievable rate through jointly optimizing the transmit precoding matrix, the receive combining matrix, and the RIS reflection matrix under the constraints of the transmit power…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 14 pages, 12 figures

  18. arXiv:2405.17250 [pdf, ps, other]

    cs.RO eess.SY

    "Pass the butter": A study on desktop-classic multitasking robotic arm based on advanced YOLOv7 and BERT

    Authors: Haohua Que, Wenbin Pan, Jie Xu, Hao Luo, Pei Wang, Li Zhang

    Abstract: In recent years, various intelligent autonomous robots have begun to appear in daily life and production. Desktop-level robots are characterized by their flexible deployment, rapid response, and suitability for light workload environments. In order to meet the current societal demand for service robot technology, this study proposes using a miniaturized desktop-level robot (by ROS) as a carrier, l…

    Submitted 27 May, 2024; originally announced May 2024.

  19. arXiv:2405.13634 [pdf, other]

    eess.SP

    Secure Communications in Near-Filed ISCAP Systems with Extremely Large-Scale Antenna Arrays

    Authors: Zixiang Ren, Siyao Zhang, Xinmin Li, Ling Qiu, Jie Xu, Derrick Wing Kwan Ng

    Abstract: This paper investigates secure communications in a near-field multi-functional integrated sensing, communication, and powering (ISCAP) system with an extremely large-scale antenna arrays (ELAA) equipped at the base station (BS). In this system, the BS sends confidential messages to a single communication user (CU), and at the same time wirelessly senses a point target and charges multiple energy r…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 6 pages

  20. arXiv:2405.10514 [pdf, other]

    cs.IT eess.SP

    Secrecy Performance Analysis of Multi-Functional RIS-Assisted NOMA Networks

    Authors: Yingjie Pei, Wanli Ni, Jin Xu, Xinwei Yue, Xiaofeng Tao, Dusit Niyato

    Abstract: Although reconfigurable intelligent surface (RIS) can improve the secrecy communication performance of wireless users, it still faces challenges such as limited coverage and double-fading effect. To address these issues, in this paper, we utilize a novel multi-functional RIS (MF-RIS) to enhance the secrecy performance of wireless users, and investigate the physical layer secrecy problem in non-ort…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 14 pages, 9 figures, submitted to IEEE transactions on wireless communication

  21. arXiv:2405.09353 [pdf, other]

    eess.IV cs.CV

    Large coordinate kernel attention network for lightweight image super-resolution

    Authors: Fangwei Hao, Jiesheng Wu, Haotian Lu, Ji Du, Jing Xu

    Abstract: The multi-scale receptive field and large kernel attention (LKA) module have been shown to significantly improve performance in the lightweight image super-resolution task. However, existing lightweight super-resolution (SR) methods seldom pay attention to designing efficient building block with multi-scale receptive field for local modeling, and their LKA modules face a quadratic increase in comp…

    Submitted 15 May, 2024; originally announced May 2024.

  22. arXiv:2405.07568 [pdf, ps, other]

    eess.SP

    Networked ISAC for Low-Altitude Economy: Transmit Beamforming and UAV Trajectory Design

    Authors: Gaoyuan Cheng, Xianxin Song, Zhonghao Lyu, Jie Xu

    Abstract: This paper studies the exploitation of networked integrated sensing and communications (ISAC) to support low-altitude economy (LAE), in which a set of networked ground base stations (GBSs) transmit wireless signals to cooperatively communicate with multiple authorized unmanned aerial vehicles (UAVs) and concurrently use the echo signals to detect the invasion of unauthorized objects in interested…

    Submitted 13 May, 2024; originally announced May 2024.

  23. arXiv:2405.05007 [pdf, other]

    eess.IV cs.CV

    HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation

    Authors: Jiashu Xu

    Abstract: Automatic medical image segmentation technology has the potential to expedite pathological diagnoses, thereby enhancing the efficiency of patient care. However, medical images often have complex textures and structures, and the models often face the problem of reduced image resolution and information loss due to downsampling. To address this issue, we propose HC-Mamba, a new medical image segmenta…

    Submitted 11 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  24. arXiv:2405.00736 [pdf, other]

    eess.SP cs.LG

    Joint Signal Detection and Automatic Modulation Classification via Deep Learning

    Authors: Huijun Xing, Xuhui Zhang, Shuo Chang, Jinke Ren, Zixun Zhang, Jie Xu, Shuguang Cui

    Abstract: Signal detection and modulation classification are two crucial tasks in various wireless communication systems. Different from prior works that investigate them independently, this paper studies the joint signal detection and automatic modulation classification (AMC) by considering a realistic and complex scenario, in which multiple signals with different modulation schemes coexist at different ca…

    Submitted 29 April, 2024; originally announced May 2024.

  25. arXiv:2404.19265 [pdf, other]

    cs.CV eess.IV

    Mapping New Realities: Ground Truth Image Creation with Pix2Pix Image-to-Image Translation

    Authors: Zhenglin Li, Bo Guan, Yuanzhou Wei, Yiming Zhou, Jingyu Zhang, Jinxin Xu

    Abstract: Generative Adversarial Networks (GANs) have significantly advanced image processing, with Pix2Pix being a notable framework for image-to-image translation. This paper explores a novel application of Pix2Pix to transform abstract map images into realistic ground truth images, addressing the scarcity of such images crucial for domains like urban planning and autonomous vehicle training. We detail th…

    Submitted 30 April, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

  26. arXiv:2404.16289 [pdf, other]

    cs.IT eess.SP

    Deep Joint CSI Feedback and Multiuser Precoding for MIMO OFDM Systems

    Authors: Yiran Guo, Wei Chen, Jialong Xu, Lun Li, Bo Ai

    Abstract: The design of precoding plays a crucial role in achieving a high downlink sum-rate in multiuser multiple-input multiple-output (MIMO) orthogonal frequency-division multiplexing (OFDM) systems. In this correspondence, we propose a deep learning based joint CSI feedback and multiuser precoding method in frequency division duplex systems, aiming at maximizing the downlink sum-rate performance in an e…

    Submitted 24 April, 2024; originally announced April 2024.

  27. arXiv:2404.13536 [pdf, other]

    cs.IT eess.SP

    Joint Transmit and Reflective Beamforming for Multi-Active-IRS-Assisted Cooperative Sensing

    Authors: Yuan Fang, Xianghao Yu, Jie Xu

    Abstract: This paper studies multi-active intelligent-reflecting-surface (IRS) cooperative sensing, in which multiple active IRSs are deployed in a distributed manner to help the base station (BS) provide multi-view sensing. We focus on the scenario where the sensing target is located in the non-line-of-sight (NLoS) area of the BS. Based on the received echo signal, the BS aims to estimate the target's dire…

    Submitted 21 April, 2024; originally announced April 2024.

  28. arXiv:2404.13391 [pdf, other]

    eess.SY cs.LG math.OC

    Online Planning of Power Flows for Power Systems Against Bushfires Using Spatial Context

    Authors: Jianyu Xu, Qiuzhuang Sun, Yang Yang, Huadong Mo, Daoyi Dong

    Abstract: The 2019-20 Australia bushfire incurred numerous economic losses and significantly affected the operations of power systems. A power station or transmission line can be significantly affected due to bushfires, leading to an increase in operational costs. We study a fundamental but challenging problem of planning the optimal power flow (OPF) for power systems subject to bushfires. Considering the s…

    Submitted 20 April, 2024; originally announced April 2024.

  29. arXiv:2404.10233 [pdf, ps, other]

    eess.SP

    Little Pilot is Needed for Channel Estimation with Integrated Super-Resolution Sensing and Communication

    Authors: Jingran Xu, Huizhi Wang, Yong Zeng, Xiaoli Xu

    Abstract: Integrated super-resolution sensing and communication (ISSAC) is a promising technology to achieve extremely high sensing performance for critical parameters, such as the angles of the wireless channels. In this paper, we propose an ISSAC-based channel estimation method, which requires little or even no pilot, yet still achieves accurate channel state information (CSI) estimation. The key idea is…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 6 pages, 5 figures, accepted by IEEE WCNC 2024 workshops

  30. arXiv:2404.09140 [pdf, other]

    cs.LG cs.IT eess.SP

    RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion

    Authors: Guoxuan Chi, Zheng Yang, Chenshu Wu, Jingao Xu, Yuchong Gao, Yunhao Liu, Tony Xiao Han

    Abstract: Along with AIGC shines in CV and NLP, its potential in the wireless domain has also emerged in recent years. Yet, existing RF-oriented generative solutions are ill-suited for generating high-quality, time-series RF data due to limited representation capabilities. In this work, inspired by the stellar achievements of the diffusion model in CV and NLP, we adapt it to the RF domain and propose RF-Dif…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Accepted by MobiCom 2024

    ACM Class: I.2.0

  31. arXiv:2404.07448 [pdf, other]

    cs.CV cs.CL eess.IV

    Transferable and Principled Efficiency for Open-Vocabulary Segmentation

    Authors: Jingxuan Xu, Wuyang Chen, Yao Zhao, Yunchao Wei

    Abstract: Recent success of pre-trained foundation vision-language models makes Open-Vocabulary Segmentation (OVS) possible. Despite the promising performance, this approach introduces heavy computational overheads for two challenges: 1) large model sizes of the backbone; 2) expensive costs during the fine-tuning. These challenges hinder this OVS strategy from being widely applicable and affordable in real-…

    Submitted 3 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  32. arXiv:2404.03915 [pdf, other]

    eess.SY

    Nonlinear Kalman Filtering based on Self-Attention Mechanism and Lattice Trajectory Piecewise Linear Approximation

    Authors: Jiaming Wang, Xinyu Geng, Jun Xu

    Abstract: The traditional Kalman filter (KF) is widely applied in control systems, but it relies heavily on the accuracy of the system model and noise parameters, leading to potential performance degradation when facing inaccuracies. To address this issue, introducing neural networks into the KF framework offers a data-driven solution to compensate for these inaccuracies, improving the filter's performance…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures

  33. arXiv:2403.16353 [pdf, other]

    cs.IT eess.SP

    Energy-Efficient Hybrid Beamforming with Dynamic On-off Control for Integrated Sensing, Communications, and Powering

    Authors: Zeyu Hao, Yuan Fang, Xianghao Yu, Jie Xu, Ling Qiu, Lexi Xu, Shuguang Cui

    Abstract: This paper investigates the energy-efficient hybrid beamforming design for a multi-functional integrated sensing, communications, and powering (ISCAP) system. In this system, a base station (BS) with a hybrid analog-digital (HAD) architecture sends unified wireless signals to communicate with multiple information receivers (IRs), sense multiple point targets, and wirelessly charge multiple energy…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 13 pages, 6 figures, submitted to IEEE Transactions on Communications

  34. arXiv:2403.15971 [pdf, other]

    eess.IV

    PSHop: A Lightweight Feed-Forward Method for 3D Prostate Gland Segmentation

    Authors: Yijing Yang, Vasileios Magoulianitis, Jiaxin Yang, Jintang Xue, Masatomo Kaneko, Giovanni Cacciamani, Andre Abreu, Vinay Duddalwar, C. -C. Jay Kuo, Inderbir S. Gill, Chrysostomos Nikias

    Abstract: Automatic prostate segmentation is an important step in computer-aided diagnosis of prostate cancer and treatment planning. Existing methods of prostate segmentation are based on deep learning models which have a large size and lack of transparency which is essential for physicians. In this paper, a new data-driven 3D prostate segmentation method on MRI is proposed, named PSHop. Different from dee…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 11 pages, 5 figures, 5 tables

  35. arXiv:2403.15969 [pdf, other]

    eess.IV

    PCa-RadHop: A Transparent and Lightweight Feed-forward Method for Clinically Significant Prostate Cancer Segmentation

    Authors: Vasileios Magoulianitis, Jiaxin Yang, Yijing Yang, Jintang Xue, Masatomo Kaneko, Giovanni Cacciamani, Andre Abreu, Vinay Duddalwar, C. -C. Jay Kuo, Inderbir S. Gill, Chrysostomos Nikias

    Abstract: Prostate Cancer is one of the most frequently occurring cancers in men, with a low survival rate if not early diagnosed. PI-RADS reading has a high false positive rate, thus increasing the diagnostic incurred costs and patient discomfort. Deep learning (DL) models achieve a high segmentation performance, although require a large model size and complexity. Also, DL models lack of feature interpreta…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 13 pages, 7 figures, 5 tables

  36. Secure Outage Analysis for RIS-Aided MISO Systems with Randomly Located Eavesdroppers

    Authors: Wei Shi, Jindan Xu, Wei Xu, Chau Yuen, A. Lee Swindlehurst, Xiaohu You, Chunming Zhao

    Abstract: In this paper, we consider the physical layer security of an RIS-assisted multiple-antenna communication system with randomly located eavesdroppers. The exact distributions of the received signal-to-noise-ratios (SNRs) at the legitimate user and the eavesdroppers located according to a Poisson point process (PPP) are derived, and a closed-form expression for the secrecy outage probability (SOP) is…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted by 2023 IEEE Globecom Workshops (GC Wkshps). arXiv admin note: substantial text overlap with arXiv:2312.16814

  37. arXiv:2403.13648 [pdf, other]

    eess.SY

    Priority-based Energy Allocation in Buildings for Distributed Model Predictive Control

    Authors: Hongyi Li, Jun Xu

    Abstract: Many countries are facing energy shortage today and most of the global energy is consumed by HVAC systems in buildings. For the scenarios where the energy system is not sufficiently supplied to HVAC systems, a priority-based allocation scheme based on distributed model predictive control is proposed in this paper, which distributes the energy rationally based on priority order. According to the sc…

    Submitted 20 March, 2024; originally announced March 2024.

  38. arXiv:2403.13601 [pdf, other]

    eess.SY

    Lattice piecewise affine approximation of explicit model predictive control with application to satellite attitude control

    Authors: Zhengqi Xu, Jun Xu, Ai-Guo Wu, Shuning Wang

    Abstract: Satellite attitude control is a crucial part of aerospace technology, and model predictive control (MPC) is one of the most promising controllers in this area, which will be less effective if real-time online optimization cannot be achieved. Explicit MPC converts the online calculation into a table lookup process, however the solution is difficult to obtain if the system dimension is high or the co…

    Submitted 20 March, 2024; originally announced March 2024.

  39. arXiv:2403.11953 [pdf, other]

    eess.IV cs.CV

    Advancing COVID-19 Detection in 3D CT Scans

    Authors: Qingqiu Li, Runtian Yuan, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

    Abstract: To make a more accurate diagnosis of COVID-19, we propose a straightforward yet effective model. Firstly, we analyse the characteristics of 3D CT scans and remove the non-lung parts, facilitating the model to focus on lesion-related areas and reducing computational cost. We use ResNeSt50 as the strong feature extractor, initializing it with pretrained weights which have COVID-19-specific prior kno…

    Submitted 18 March, 2024; originally announced March 2024.

  40. arXiv:2403.11809 [pdf, other]

    cs.IT eess.SP

    Sensing-Enhanced Channel Estimation for Near-Field XL-MIMO Systems

    Authors: Shicong Liu, Xianghao Yu, Zhen Gao, Jie Xu, Derrick Wing Kwan Ng, Shuguang Cui

    Abstract: Future sixth-generation (6G) systems are expected to leverage extremely large-scale multiple-input multiple-output (XL-MIMO) technology, which significantly expands the range of the near-field region. The spherical wavefront characteristics in the near field introduce additional degrees of freedom (DoFs), namely distance and angle, into the channel model, which leads to unique challenges in channe…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 14 pages, 9 figures

  41. arXiv:2403.11498 [pdf, other]

    eess.IV cs.CV

    Domain Adaptation Using Pseudo Labels for COVID-19 Detection

    Authors: Runtian Yuan, Qingqiu Li, Junlin Hou, Jilan Xu, Yuejie Zhang, Rui Feng, Hao Chen

    Abstract: In response to the need for rapid and accurate COVID-19 diagnosis during the global pandemic, we present a two-stage framework that leverages pseudo labels for domain adaptation to enhance the detection of COVID-19 from CT scans. By utilizing annotated data from one domain and non-annotated data from another, the model overcomes the challenge of data scarcity and variability, common in emergent he…

    Submitted 18 March, 2024; originally announced March 2024.

  42. arXiv:2403.08758 [pdf]

    eess.IV cs.CV

    Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI

    Authors: Shihan Qiu, Shaoyan Pan, Yikang Liu, Lin Zhao, Jian Xu, Qi Liu, Terrence Chen, Eric Z. Chen, Xiao Chen, Shanhui Sun

    Abstract: Current deep learning reconstruction for accelerated cardiac cine MRI suffers from spatial and temporal blurring. We aim to improve image sharpness and motion delineation for cine MRI under high undersampling rates. A spatiotemporal diffusion enhancement model conditional on an existing deep learning reconstruction along with a novel paired sampling strategy was developed. The diffusion model prov…

    Submitted 13 March, 2024; originally announced March 2024.

  43. arXiv:2403.08749 [pdf]

    eess.IV cs.CV

    Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI

    Authors: Shihan Qiu, Shaoyan Pan, Yikang Liu, Lin Zhao, Jian Xu, Qi Liu, Terrence Chen, Eric Z. Chen, Xiao Chen, Shanhui Sun

    Abstract: The currently limited quality of accelerated cardiac cine reconstruction may potentially be improved by the emerging diffusion models, but the clinically unacceptable long processing time poses a challenge. We aim to develop a clinically feasible diffusion-model-based reconstruction pipeline to improve the image quality of cine MRI. A multi-in multi-out diffusion enhancement model together with fa…

    Submitted 13 March, 2024; originally announced March 2024.

  44. arXiv:2403.07923  [pdf]

    cs.NI cs.AI cs.LG eess.IV eess.SY

    The Fusion of Deep Reinforcement Learning and Edge Computing for Real-time Monitoring and Control Optimization in IoT Environments

    Authors: Jingyu Xu, Weixiang Wan, Linying Pan, Wenjian Sun, Yuxiang Liu

    Abstract: In response to the demand for real-time performance and control quality in industrial Internet of Things (IoT) environments, this paper proposes an optimization control system based on deep reinforcement learning and edge computing. The system leverages cloud-edge collaboration, deploys lightweight policy networks at the edge, predicts system states, and outputs controls at a high frequency, enabl…

    Submitted 28 February, 2024; originally announced March 2024.

  45. arXiv:2403.06993  [pdf]

    cs.RO cs.AI cs.LG eess.IV eess.SY

    Automatic driving lane change safety prediction model based on LSTM

    Authors: Wenjian Sun, Linying Pan, Jingyu Xu, Weixiang Wan, Yong Wang

    Abstract: Autonomous driving technology can improve traffic safety and reduce traffic accidents. In addition, it improves traffic flow, reduces congestion, saves energy and increases travel efficiency. In the relatively mature automatic driving technology, the automatic driving function is divided into several modules: perception, decision-making, planning and control, and a reasonable division of labor can…

    Submitted 28 February, 2024; originally announced March 2024.

  46. arXiv:2403.05906  [pdf, other]

    eess.IV cs.CV

    Segmentation Guided Sparse Transformer for Under-Display Camera Image Restoration

    Authors: Jingyun Xue, Tao Wang, Jun Wang, Kaihao Zhang, Wenhan Luo, Wenqi Ren, Zikun Liu, Hyunhee Park, Xiaochun Cao

    Abstract: Under-Display Camera (UDC) is an emerging technology that achieves full-screen display via hiding the camera under the display panel. However, the current implementation of UDC causes serious degradation. The incident light required for camera imaging undergoes attenuation and diffraction when passing through the display panel, leading to various artifacts in UDC imaging. Presently, the prevailing…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 13 pages, 10 figures, conference or other essential info

  47. arXiv:2403.05179  [pdf, ps, other]

    eess.SY

    Device Fault Prediction Model based on LSTM and Random Forest

    Authors: Jing Xu, Yongbo Zhang

    Abstract: The quality of power grid equipment forms the material foundation for the safety of the large power grid. Ensuring the quality of equipment entering the grid is a core task in material management. Currently, the inspection of incoming materials involves the generation of sampling plans, sampling, sealing, sample delivery, and testing. Due to the lack of a comprehensive control system and effective…

    Submitted 8 March, 2024; originally announced March 2024.

  48. arXiv:2403.05079  [pdf, ps, other]

    eess.SY

    Sampling Model for Grid Material Inspection Based on Analytic Hierarchy Process with Absolute Measurement

    Authors: Jing Xu, Yongbo Zhang

    Abstract: The quality of power grid equipment forms the material foundation for the safety of the large power grid. Ensuring the quality of equipment entering the grid is a core task in material management. Currently, the inspection of incoming materials involves the generation of sampling plans, sampling, sealing, sample delivery, and testing. Due to the lack of a comprehensive control system and effective…

    Submitted 8 March, 2024; originally announced March 2024.

  49. arXiv:2403.05076  [pdf]

    eess.SY

    Correlation analysis technique of key parameters for transformer material inspection based on FP-tree and knowledge graph

    Authors: Jing Xu, Yongbo Zhang

    Abstract: As one of the key equipment in the distribution system, the distribution transformer directly affects the reliability of the user power supply. The probability of accidents occurring in the operation of transformer equipment is high, so it has become a focus of material inspection in recent years. However, the large amount of raw data from sample testing is not being used effectively. Given the ab…

    Submitted 8 March, 2024; originally announced March 2024.

  50. arXiv:2403.04145  [pdf, other]

    eess.SY

    A Crosstalk-Aware Timing Prediction Method in Routing

    Authors: Leilei Jin, Jiajie Xu, Wenjie Fu, Hao Yan, Longxing Shi

    Abstract: With shrinking interconnect spacing in advanced technology nodes, existing timing predictions become less precise due to the challenging quantification of crosstalk-induced delay. During the routing, the crosstalk effect is typically modeled by predicting coupling capacitance with congestion information. However, the timing estimation tends to be overly pessimistic, as the crosstalk-induced delay…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 6 pages, 8 figures

    ACM Class: I.6.4; B.7.3