Skip to main content

Showing 1–50 of 62 results for author: Luo, H

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.14977  [pdf, other

    cs.AI eess.IV

    Trustworthy Enhanced Multi-view Multi-modal Alzheimer's Disease Prediction with Brain-wide Imaging Transcriptomics Data

    Authors: Shan Cong, Zhoujie Fan, Hongwei Liu, Yinghan Zhang, Xin Wang, Haoran Luo, Xiaohui Yao

    Abstract: Brain transcriptomics provides insights into the molecular mechanisms by which the brain coordinates its functions and processes. However, existing multimodal methods for predicting Alzheimer's disease (AD) primarily rely on imaging and sometimes genetic data, often neglecting the transcriptomic basis of brain. Furthermore, while striving to integrate complementary information between modalities,… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2405.19925  [pdf, other

    eess.SP

    Integrated Sensing and Communications Framework for 6G Networks

    Authors: Hongliang Luo, Tengyu Zhang, Chuanbin Zhao, Yucong Wang, Bo Lin, Yuhua Jiang, Dongqi Luo, Feifei Gao

    Abstract: In this paper, we propose a novel integrated sensing and communications (ISAC) framework for the sixth generation (6G) mobile networks, in which we decompose the real physical world into static environment, dynamic targets, and various object materials. The ubiquitous static environment occupies the vast majority of the physical world, for which we design static environment reconstruction (SER) sc… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  3. arXiv:2405.17250  [pdf, ps, other

    cs.RO eess.SY

    "Pass the butter": A study on desktop-classic multitasking robotic arm based on advanced YOLOv7 and BERT

    Authors: Haohua Que, Wenbin Pan, Jie Xu, Hao Luo, Pei Wang, Li Zhang

    Abstract: In recent years, various intelligent autonomous robots have begun to appear in daily life and production. Desktop-level robots are characterized by their flexible deployment, rapid response, and suitability for light workload environments. In order to meet the current societal demand for service robot technology, this study proposes using a miniaturized desktop-level robot (by ROS) as a carrier, l… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2405.07115  [pdf, other

    eess.SP cs.IT

    Digital Twin Aided Compressive Sensing: Enabling Site-Specific MIMO Hybrid Precoding

    Authors: Hao Luo, Ahmed Alkhateeb

    Abstract: Compressive sensing is a promising solution for the channel estimation in multiple-input multiple-output (MIMO) systems with large antenna arrays and constrained hardware. Utilizing site-specific channel data from real-world systems, deep learning can be employed to learn the compressive sensing measurement vectors with minimum redundancy, thereby focusing sensing power on promising spatial direct… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 7 pages, 5 figures

  5. arXiv:2403.10832  [pdf, other

    cs.IT eess.SP

    Joint Power Allocation and Beamforming for In-band Full-duplex Multi-cell Multi-user Networks

    Authors: Haifeng Luo, Navneet Garg, Mark Holm, Tharmalingam Ratnarajah

    Abstract: This paper investigates a robust joint power allocation and beamforming scheme for in-band full-duplex multi-cell multi-user (IBFD-MCMU) networks. A mean-squared error (MSE) minimization problem is formulated with constraints on the power budgets and residual self-interference (RSI) power. The problem is not convex, so we decompose it into two sub-problems: interference management beamforming and… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  6. arXiv:2403.06720  [pdf, other

    cs.IT eess.SP

    On the Secrecy Rate of In-Band Full-duplex Two-way Wiretap Channel

    Authors: Navneet Garg, Haifeng Luo, Tharmalingam Ratnarajah

    Abstract: In this paper, we consider a two-way wiretap Multi-Input Multi-Output Multi-antenna Eve (MIMOME) channel, where both nodes (Alice and Bob) transmit and receive in an in-band full-duplex (IBFD) manner. For this system with keyless security, we provide a novel artificial noise (AN) based signal design, where the AN is injected in both signal and null spaces. We present an ergodic secrecy rate approx… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  7. arXiv:2402.17268  [pdf, other

    eess.SY

    Reinforcement Learning Based Robust Volt/Var Control in Active Distribution Networks With Imprecisely Known Delay

    Authors: Hong Cheng, Huan Luo, Zhi Liu, Wei Sun, Weitao Li, Qiyue Li

    Abstract: Active distribution networks (ADNs) incorporating massive photovoltaic (PV) devices encounter challenges of rapid voltage fluctuations and potential violations. Due to the fluctuation and intermittency of PV generation, the state gap, arising from time-inconsistent states and exacerbated by imprecisely known system delays, significantly impacts the accuracy of voltage control. This paper addresses… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  8. arXiv:2401.15919  [pdf, other

    eess.SP cs.IT

    Integrated Imaging and Communication with Reconfigurable Intelligent Surfaces

    Authors: Hao Luo, Ahmed Alkhateeb

    Abstract: Reconfigurable intelligent surfaces, with their large number of antennas, offer an interesting opportunity for high spatial-resolution imaging. In this paper, we propose a novel RIS-aided integrated imaging and communication system that can reduce the RIS beam training overhead for communication by leveraging the imaging of the surrounding environment. In particular, using the RIS as a wireless im… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 6 pages, 4 figures. To appear in Asilomar 2023

  9. arXiv:2401.09761  [pdf, other

    eess.SP cs.IT

    ISAC with Backscattering RFID Tags: Joint Beamforming Design

    Authors: Hao Luo, Umut Demirhan, Ahmed Alkhateeb

    Abstract: In this paper, we explore an integrated sensing and communication (ISAC) system with backscattering RFID tags. In this setup, an access point employs a communication beam to serve a user while leveraging a sensing beam to detect an RFID tag. Under the total transmit power constraint of the system, our objective is to design sensing and communication beams by considering the tag detection and commu… ▽ More

    Submitted 31 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 5 pages, 5 figures. To appear in IEEE ICC 2024

  10. arXiv:2312.16441  [pdf, other

    eess.SP

    6D Radar Sensing and Tracking in Monostatic Integrated Sensing and Communications System

    Authors: Hongliang Luo, Feifei Gao, Fan Liu, Shi **

    Abstract: In this paper, we propose a novel scheme for sixdimensional (6D) radar sensing and tracking of dynamic target based on multiple input and multiple output (MIMO) array for monostatic integrated sensing and communications (ISAC) system. Unlike most existing ISAC studies believing that only the radial velocity of far-field dynamic target can be measured based on one single base station (BS), we find… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  11. arXiv:2311.01700  [pdf, other

    eess.SP

    Moving Target Sensing for ISAC Systems in Clutter Environment

    Authors: Dongqi Luo, Huihui Wu, Hongliang Luo, Bo Lin, Feifei Gao

    Abstract: In this paper, we consider the moving target sensing problem for integrated sensing and communication (ISAC) systems in clutter environment. Scatterers produce strong clutter, deteriorating the performance of ISAC systems in practice. Given that scatterers are typically stationary and the targets of interest are usually moving, we here focus on sensing the moving targets. Specifically, we adopt a… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  12. arXiv:2311.01674  [pdf, other

    eess.SP

    Integrated Sensing and Communications in Clutter Environment

    Authors: Hongliang Luo, Yucong Wang, Dongqi Luo, Jianwei Zhao, Huihui Wu, Shaodan Ma, Feifei Gao

    Abstract: In this paper, we propose a practical integrated sensing and communications (ISAC) framework to sense dynamic targets from clutter environment while ensuring users communications quality. To implement communications function and sensing function simultaneously, we design multiple communications beams that can communicate with the users as well as one sensing beam that can rotate and scan the entir… ▽ More

    Submitted 5 February, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  13. arXiv:2310.17997  [pdf

    physics.optics cs.AI eess.IV

    Deep Learning Enables Large Depth-of-Field Images for Sub-Diffraction-Limit Scanning Superlens Microscopy

    Authors: Hui Sun, Hao Luo, Feifei Wang, Qingjiu Chen, Meng Chen, Xiaoduo Wang, Haibo Yu, Guanglie Zhang, Lianqing Liu, Jian** Wang, Dapeng Wu, Wen Jung Li

    Abstract: Scanning electron microscopy (SEM) is indispensable in diverse applications ranging from microelectronics to food processing because it provides large depth-of-field images with a resolution beyond the optical diffraction limit. However, the technology requires coating conductive films on insulator samples and a vacuum environment. We use deep learning to obtain the map** relationship between op… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 13 pages,7 figures

  14. arXiv:2310.03748  [pdf

    eess.SP cs.HC cs.LG

    Phase Synchrony Component Self-Organization in Brain Computer Interface

    Authors: Xu Niu, Na Lu, Huan Luo, Ruofan Yan

    Abstract: Phase synchrony information plays a crucial role in analyzing functional brain connectivity and identifying brain activities. A widely adopted feature extraction pipeline, composed of preprocessing, selection of EEG acquisition channels, and phase locking value (PLV) calculation, has achieved success in motor imagery classification (MI). However, this pipeline is manual and reliant on expert knowl… ▽ More

    Submitted 11 October, 2023; v1 submitted 21 September, 2023; originally announced October 2023.

  15. arXiv:2309.14405  [pdf, other

    cs.SD cs.AI eess.AS

    Joint Audio and Speech Understanding

    Authors: Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James Glass

    Abstract: Humans are surrounded by audio signals that include both speech and non-speech sounds. The recognition and understanding of speech and non-speech audio events, along with a profound comprehension of the relationship between them, constitute fundamental cognitive capabilities. For the first time, we build a machine learning model, called LTU-AS, that has a conceptually similar universal audio perce… ▽ More

    Submitted 10 December, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted at ASRU 2023. Code, dataset, and pretrained models are at https://github.com/yuangongnd/ltu. Interactive demo at https://huggingface.co/spaces/yuangongfdu/ltu-2

  16. arXiv:2309.14012  [pdf, other

    eess.SP

    Beam Squint Assisted User Localization in Near-Field Integrated Sensing and Communications Systems

    Authors: Hongliang Luo, Feifei Gao, Wanmai Yuan, Shun Zhang

    Abstract: Integrated sensing and communication (ISAC) has been regarded as a key technology for 6G wireless communications, in which large-scale multiple input and multiple output (MIMO) array with higher and wider frequency bands will be adopted. However, recent studies show that the beam squint phenomenon can not be ignored in wideband MIMO system, which generally deteriorates the communications performan… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: This paper has been accepted by IEEE Transactions on Wireless Communications (TWC) on 18 September 2023

  17. arXiv:2308.01558  [pdf, other

    eess.SP

    Millimeter Wave V2V Beam Tracking using Radar: Algorithms and Real-World Demonstration

    Authors: Hao Luo, Umut Demirhan, Ahmed Alkhateeb

    Abstract: Utilizing radar sensing for assisting communication has attracted increasing interest thanks to its potential in dynamic environments. A particularly interesting problem for this approach appears in the vehicle-to-vehicle (V2V) millimeter wave and terahertz communication scenarios, where the narrow beams change with the movement of both vehicles. To address this problem, in this work, we develop a… ▽ More

    Submitted 27 October, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: 5 pages, 5 figures. To appear in EUSIPCO 2023. The dataset is available on the DeepSense 6G website http://deepsense6g.net/

  18. arXiv:2305.12064  [pdf, other

    eess.SP

    YOLO: An Efficient Terahertz Band Integrated Sensing and Communications Scheme with Beam Squint

    Authors: Hongliang Luo, Feifei Gao, Hai Lin, Shaodan Ma, H. Vincent Poor

    Abstract: Using communications signals for dynamic target sensing is an important component of integrated sensing and communications (ISAC). In this paper, we propose to utilize the beam squint effect to realize fast non-cooperative dynamic target sensing in massive multiple input and multiple output (MIMO) Terahertz band communications systems. Specifically, we construct a wideband channel model of the ech… ▽ More

    Submitted 5 February, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: This paper has been accepted by IEEE Transactions on Wireless Communications (TWC)

  19. arXiv:2305.11013  [pdf, other

    cs.SD cs.CL eess.AS

    FunASR: A Fundamental End-to-End Speech Recognition Toolkit

    Authors: Zhifu Gao, Zerui Li, Jiaming Wang, Haoneng Luo, Xian Shi, Mengzhe Chen, Yabin Li, Lingyun Zuo, Zhihao Du, Zhangyu Xiao, Shiliang Zhang

    Abstract: This paper introduces FunASR, an open-source speech recognition toolkit designed to bridge the gap between academic research and industrial applications. FunASR offers models trained on large-scale industrial corpora and the ability to deploy them in applications. The toolkit's flagship model, Paraformer, is a non-autoregressive end-to-end speech recognition model that has been trained on a manual… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 5 pages, 3 figures, accepted by INTERSPEECH 2023

  20. arXiv:2305.10790  [pdf, other

    eess.AS cs.SD

    Listen, Think, and Understand

    Authors: Yuan Gong, Hongyin Luo, Alexander H. Liu, Leonid Karlinsky, James Glass

    Abstract: The ability of artificial intelligence (AI) systems to perceive and comprehend audio signals is crucial for many applications. Although significant progress has been made in this area since the development of AudioSet, most existing models are designed to map audio inputs to pre-defined, discrete sound label sets. In contrast, humans possess the ability to not only classify sounds into general cat… ▽ More

    Submitted 19 February, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted at ICLR 2024. Code, dataset, and models are available at https://github.com/YuanGongND/ltu. The interactive demo is at https://huggingface.co/spaces/yuangongfdu/ltu

  21. arXiv:2305.10680  [pdf, other

    cs.SD cs.CL eess.AS

    Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System

    Authors: Xian Shi, Haoneng Luo, Zhifu Gao, Shiliang Zhang, Zhijie Yan

    Abstract: Estimating confidence scores for recognition results is a classic task in ASR field and of vital importance for kinds of downstream tasks and training strategies. Previous end-to-end~(E2E) based confidence estimation models (CEM) predict score sequences of equal length with input transcriptions, leading to unreliable estimation when deletion and insertion errors occur. In this paper we proposed CI… ▽ More

    Submitted 24 May, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: 5 pages, 4 figures, Interspeech2023

  22. Inflation Reduction Act impacts on the economics of clean hydrogen and liquid fuels

    Authors: Fangwei Cheng, Hongxi Luo, Jesse D. Jenkins, Eric D. Larson

    Abstract: The Inflation Reduction Act (IRA) in the United States provides unprecedented incentives for deploying low-carbon hydrogen and liquid fuels, among other low greenhouse gas (GHG) emissions technologies. To better understand the prospective competitiveness of low-carbon or negative-carbon hydrogen and liquid fuels under the IRA in the early 2030s, we examine the impacts of IRA provisions on costs of… ▽ More

    Submitted 14 August, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

  23. arXiv:2304.13244  [pdf

    cs.NI eess.SP

    ESCM: An Efficient and Secure Communication Mechanism for UAV Networks

    Authors: Haoxiang Luo, Yifan Wu, Gang Sun, Hongfang Yu, Mohsen Guizani

    Abstract: UAV (unmanned aerial vehicle) is rapidly gaining traction in various human activities and has become an integral component of the satellite-air-ground-sea (SAGS) integrated network. As high-speed moving objects, UAVs not only have extremely strict requirements for communication delay, but also cannot be maliciously controlled as a weapon by the attacker. Therefore, an efficient and secure communic… ▽ More

    Submitted 16 June, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

  24. arXiv:2304.08697  [pdf

    cs.NI cs.PF eess.SP

    Performance Analysis and Comparison of Non-ideal Wireless PBFT and RAFT Consensus Networks in 6G Communications

    Authors: Haoxiang Luo, Xiangyue Yang, Hongfang Yu, Gang Sun, Bo Lei, Mohsen Guizani

    Abstract: Due to advantages in security and privacy, blockchain is considered a key enabling technology to support 6G communications. Practical Byzantine Fault Tolerance (PBFT) and RAFT are seen as the most applicable consensus mechanisms (CMs) in blockchain-enabled wireless networks. However, previous studies on PBFT and RAFT rarely consider the channel performance of the physical layer, such as path loss… ▽ More

    Submitted 2 August, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2303.15759

  25. arXiv:2303.15759  [pdf

    cs.NI eess.SP

    Performance Analysis of Non-ideal Wireless PBFT Networks with mmWave and Terahertz Signals

    Authors: Haoxiang Luo, Xiangyue Yang, Hongfang Yu, Gang Sun, Shizhong Xu, Long Luo

    Abstract: Due to advantages in security and privacy, blockchain is considered a key enabling technology to support 6G communications. Practical Byzantine Fault Tolerance (PBFT) is seen as the most applicable consensus mechanism in blockchain-enabled wireless networks. However, previous studies on PBFT do not consider the channel performance of the physical layer, such as path loss and channel fading, result… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: IEEE International Conference on Metaverse Computing, Networking and Applications (MetaCom) 2023

  26. arXiv:2302.11249  [pdf, ps, other

    eess.SP

    RIS-Aided Integrated Sensing and Communication: Joint Beamforming and Reflection Design

    Authors: Honghao Luo, Rang Liu, Ming Li, Qian Liu

    Abstract: Integrated sensing and communication (ISAC) has been envisioned as a promising technique to alleviate the spectrum congestion problem. Inspired by the applications of reconfigurable intelligent surface (RIS) in dynamically manipulating wireless propagation environment, in this paper, we investigate to deploy a RIS in an ISAC system to pursue performance improvement. Particularly, we consider a RIS… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: Accepted by IEEE TVT

  27. arXiv:2302.10686  [pdf, other

    cs.SD cs.AI eess.AS

    Interpretable Spectrum Transformation Attacks to Speaker Recognition

    Authors: Jiadi Yao, Hong Luo, Xiao-Lei Zhang

    Abstract: The success of adversarial attacks to speaker recognition is mainly in white-box scenarios. When applying the adversarial voices that are generated by attacking white-box surrogate models to black-box victim models, i.e. \textit{transfer-based} black-box attacks, the transferability of the adversarial voices is not only far from satisfactory, but also lacks interpretable basis. To address these is… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  28. arXiv:2302.09332  [pdf, other

    eess.SP

    Incipient Fault Detection in Power Distribution System: A Time-Frequency Embedded Deep Learning Based Approach

    Authors: Qiyue Li, Huan Luo, Hong Cheng, Yuxing Deng, Wei Sun, Weitao Li, Zhi Liu

    Abstract: Incipient fault detection in power distribution systems is crucial to improve the reliability of the grid. However, the non-stationary nature and the inadequacy of the training dataset due to the self-recovery of the incipient fault signal, make the incipient fault detection in power distribution systems a great challenge. In this paper, we focus on incipient fault detection in power distribution… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: 15 pages

  29. arXiv:2211.12956   

    eess.SY cs.AI cs.LG

    Reinforcement learning for traffic signal control in hybrid action space

    Authors: Haoqing Luo, sheng **

    Abstract: The prevailing reinforcement-learning-based traffic signal control methods are typically staging-optimizable or duration-optimizable, depending on the action spaces. In this paper, we propose a novel control architecture, TBO, which is based on hybrid proximal policy optimization. To the best of our knowledge, TBO is the first RL-based algorithm to implement synchronous optimization of the staging… ▽ More

    Submitted 25 November, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: There are serious problems with the innovation of the paper

  30. arXiv:2211.08210  [pdf, other

    eess.SP cs.IT

    Reconfigurable Intelligent Surface Aided Wireless Sensing for Scene Depth Estimation

    Authors: Abdelrahman Taha, Hao Luo, Ahmed Alkhateeb

    Abstract: Current scene depth estimation approaches mainly rely on optical sensing, which carries privacy concerns and suffers from estimation ambiguity for distant, shiny, and transparent surfaces/objects. Reconfigurable intelligent surfaces (RISs) provide a path for employing a massive number of antennas using low-cost and energy-efficient architectures. This has the potential for realizing RIS-aided wire… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Submitted to IEEE

  31. 3D Matting: A Benchmark Study on Soft Segmentation Method for Pulmonary Nodules Applied in Computed Tomography

    Authors: Lin Wang, Xiufen Ye, Donghao Zhang, Wanji He, Lie Ju, Yi Luo, Huan Luo, Xin Wang, Wei Feng, Kaimin Song, Xin Zhao, Zongyuan Ge

    Abstract: Usually, lesions are not isolated but are associated with the surrounding tissues. For example, the growth of a tumour can depend on or infiltrate into the surrounding tissues. Due to the pathological nature of the lesions, it is challenging to distinguish their boundaries in medical imaging. However, these uncertain regions may contain diagnostic information. Therefore, the simple binarization of… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: Accepted by Computers in Biology and Medicine. arXiv admin note: substantial text overlap with arXiv:2209.07843

  32. arXiv:2208.01854  [pdf, other

    eess.SP

    Joint Beamforming Design for RIS-Assisted Integrated Sensing and Communication Systems

    Authors: Honghao Luo, Rang Liu, Ming Li, Yang Liu, Qian Liu

    Abstract: Integrated sensing and communication (ISAC) has been envisioned as a promising technology to tackle the spectrum congestion problem for future networks. In this correspondence, we investigate to deploy a reconfigurable intelligent surface (RIS) in an ISAC system for achieving better performance. In particular, a multi-antenna base station (BS) simultaneously serves multiple single-antenna users wi… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted by IEEE TVT

  33. arXiv:2207.00474  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Weakly-supervised High-fidelity Ultrasound Video Synthesis with Feature Decoupling

    Authors: Jiamin Liang, Xin Yang, Yuhao Huang, Kai Liu, Xinrui Zhou, Xindi Hu, Zehui Lin, Huanjia Luo, Yuanji Zhang, Yi Xiong, Dong Ni

    Abstract: Ultrasound (US) is widely used for its advantages of real-time imaging, radiation-free and portability. In clinical practice, analysis and diagnosis often rely on US sequences rather than a single image to obtain dynamic anatomical information. This is challenging for novices to learn because practicing with adequate videos from patients is clinically unpractical. In this paper, we propose a novel… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted by MICCAI 2022

  34. arXiv:2206.08518  [pdf, ps, other

    eess.SP

    Integrated Sensing and Communication with Reconfigurable Intelligent Surfaces: Opportunities, Applications, and Future Directions

    Authors: Rang Liu, Ming Li, Honghao Luo, Qian Liu, A. Lee Swindlehurst

    Abstract: Integrated sensing and communication (ISAC) is emerging as a key enabler to address the growing spectrum congestion problem and satisfy increasing demands for ubiquitous sensing and communication. By sharing various resources and information, ISAC achieves much higher spectral, energy, hardware, and economic efficiencies. Concurrently, reconfigurable intelligent surface (RIS) technology has been d… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: submitted to IEEE journal

  35. arXiv:2205.11392  [pdf, other

    eess.SP

    Beam Squint Assisted User Localization in Near-Field Communications Systems

    Authors: Hongliang Luo, Feifei Gao

    Abstract: The beam squint phenomenon in massive multi-input and multi-output wideband communications has been widely concerned recently, which generally deteriorates the beamforming performance. In this paper, we find that with the aid of the time-delay lines (TDs), the range and trajectory of the beam squint of a near-field communications system can be freely controlled, and hence it is possible to reverse… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  36. arXiv:2205.10682  [pdf, other

    cs.LG eess.SY

    A Novel Markov Model for Near-Term Railway Delay Prediction

    Authors: ** Xu, Weiqi Wang, Zheming Gao, Haochen Luo, Qian Wu

    Abstract: Predicting the near-future delay with accuracy for trains is momentous for railway operations and passengers' traveling experience. This work aims to design prediction models for train delays based on Netherlands Railway data. We first develop a chi-square test to show that the delay evolution over stations follows a first-order Markov chain. We then propose a delay prediction model based on non-h… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

    Comments: 36 pages, 3 figures, 4 tables

  37. arXiv:2108.05673  [pdf

    eess.SY

    An Extreme Learning Machine-Based System Frequency Nadir Constraint Linearization Method

    Authors: Likai Liu, Zechun Hu, Nikhil Pathak, Haocheng Luo

    Abstract: Large-scale integration of converter-based renewable energy sources (RESs) into the power system will lead to a higher risk of frequency nadir limit violation and even frequency instability after the large power disturbance. Therefore, it is essential to consider the frequency nadir constraint (FNC) in power system scheduling. Nevertheless, the FNC is highly nonlinear and non-convex. The state-of-… ▽ More

    Submitted 25 October, 2021; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: This paper has been submitted to the CSEE Journal of Power and Energy Systems

  38. arXiv:2105.08985  [pdf, ps, other

    eess.SP

    Integrated Communication and Navigation for Ultra-Dense LEO Satellite Networks: Vision, Challenges and Solutions

    Authors: Yu Wang, Hejia Luo, Ying Chen, Jun Wang, Rong Li, Bin Wang

    Abstract: Next generation beyond 5G networks are expected to provide both Terabits per second data rate communication services and centimeter-level accuracy localization services in an efficient, seamless and cost-effective manner. However, most of the current communication and localization systems are separately designed, leading to an under-utilization of radio resources and network performance degradatio… ▽ More

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 15 pages,5 figures

  39. arXiv:2105.03072  [pdf, other

    eess.IV cs.CV

    NTIRE 2021 Challenge on Perceptual Image Quality Assessment

    Authors: **** Gu, Haoming Cai, Chao Dong, Jimmy S. Ren, Yu Qiao, Shuhang Gu, Radu Timofte, Manri Cheon, Sungjun Yoon, Byungyeon Kang, Junwoo Lee, Qing Zhang, Haiyang Guo, Yi Bin, Yuqing Hou, Hengliang Luo, **gyu Guo, Zirui Wang, Hai Wang, Wenming Yang, Qingyan Bai, Shuwei Shi, Weihao Xia, Mingdeng Cao, Jiahao Wang , et al. (25 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2021 challenge on perceptual image quality assessment (IQA), held in conjunction with the New Trends in Image Restoration and Enhancement workshop (NTIRE) workshop at CVPR 2021. As a new type of image processing technology, perceptual image processing algorithms based on Generative Adversarial Networks (GAN) have produced images with more realistic textures. These o… ▽ More

    Submitted 28 June, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

  40. arXiv:2104.04702  [pdf, other

    cs.SD eess.AS

    Boundary and Context Aware Training for CIF-based Non-Autoregressive End-to-end ASR

    Authors: Fan Yu, Haoneng Luo, Pengcheng Guo, Yuhao Liang, Zhuoyuan Yao, Lei Xie, Yingying Gao, Lei**g Hou, Shilei Zhang

    Abstract: Continuous integrate-and-fire (CIF) based models, which use a soft and monotonic alignment mechanism, have been well applied in non-autoregressive (NAR) speech recognition with competitive performance compared with other NAR methods. However, such an alignment learning strategy may suffer from an erroneous acoustic boundary estimation, severely hindering the convergence speed as well as the system… ▽ More

    Submitted 26 September, 2021; v1 submitted 10 April, 2021; originally announced April 2021.

    Comments: 5 pages,4 figures

  41. arXiv:2103.14502  [pdf, other

    eess.IV cs.CV

    Agent with Warm Start and Adaptive Dynamic Termination for Plane Localization in 3D Ultrasound

    Authors: Xin Yang, Haoran Dou, Ruobing Huang, Wufeng Xue, Yuhao Huang, Jikuan Qian, Yuanji Zhang, Huanjia Luo, Huizhi Guo, Tianfu Wang, Yi Xiong, Dong Ni

    Abstract: Accurate standard plane (SP) localization is the fundamental step for prenatal ultrasound (US) diagnosis. Typically, dozens of US SPs are collected to determine the clinical diagnosis. 2D US has to perform scanning for each SP, which is time-consuming and operator-dependent. While 3D US containing multiple SPs in one shot has the inherent advantages of less user-dependency and more efficiency. Aut… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted by IEEE Transactions on Medical Imaging (12 pages, 8 figures, 11 tabels)

  42. arXiv:2103.01698  [pdf, other

    eess.IV cs.CV

    Super-resolving Compressed Images via Parallel and Series Integration of Artifact Reduction and Resolution Enhancement

    Authors: Hongming Luo, Fei Zhou, Guangsen Liao, Guo** Qiu

    Abstract: In real-world applications, such as sharing photos on social media platforms, images are always not only sub-sampled but also heavily compressed thus often containing various artefacts. Simple methods for enhancing the resolution of such images will exacerbate the artefacts, rendering them visually objectionable. In spite of its high practical values, super-resolving compressed images is not well… ▽ More

    Submitted 21 November, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: This paper have been accepted by Elsevier Signal Processing

  43. arXiv:2101.08039  [pdf, other

    eess.IV cs.CV cs.LG

    Bridge the Vision Gap from Field to Command: A Deep Learning Network Enhancing Illumination and Details

    Authors: Zhuqing Jiang, Chang Liu, Ya'nan Wang, Kai Li, Aidong Men, Haiying Wang, Haiyong Luo

    Abstract: With the goal of tuning up the brightness, low-light image enhancement enjoys numerous applications, such as surveillance, remote sensing and computational photography. Images captured under low-light conditions often suffer from poor visibility and blur. Solely brightening the dark regions will inevitably amplify the blur, thus may lead to detail loss. In this paper, we propose a simple yet effec… ▽ More

    Submitted 20 January, 2021; originally announced January 2021.

  44. arXiv:2101.02384  [pdf, other

    eess.IV cs.CV

    VHS to HDTV Video Translation using Multi-task Adversarial Learning

    Authors: Hongming Luo, Guangsen Liao, Xianxu Hou, Bozhi Liu, Fei Zhou, Guo** Qiu

    Abstract: There are large amount of valuable video archives in Video Home System (VHS) format. However, due to the analog nature, their quality is often poor. Compared to High-definition television (HDTV), VHS video not only has a dull color appearance but also has a lower resolution and often appears blurry. In this paper, we focus on the problem of translating VHS video to HDTV video and have developed a… ▽ More

    Submitted 7 January, 2021; originally announced January 2021.

    Comments: MMM2020 final version

  45. arXiv:2008.02519  [pdf

    eess.AS cs.SD

    Spectral-change enhancement with prior SNR for the hearing impaired

    Authors: Xiang Li, Xin Tian, Henry Luo, **yu Qian, Xihong Wu, Dingsheng Luo, **g Chen

    Abstract: A previous signal processing algorithm that aimed to enhance spectral changes (SCE) over time showed benefit for hearing-impaired (HI) listeners to recognize speech in background noise. In this work, the previous SCE was manipulated to perform on target-dominant segments, rather than treating all frames equally. Instantaneous signal-to-noise ratios (SNRs) were calculated to determine whether the s… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: Accepted by 23rd International Congress on Acoustics (ICA 2019), see http://pub.dega-akustik.de/ICA2019/data/articles/000051.pdf

  46. arXiv:2007.15273  [pdf, other

    cs.CV eess.IV eess.SP

    Searching Collaborative Agents for Multi-plane Localization in 3D Ultrasound

    Authors: Yuhao Huang, Xin Yang, Rui Li, Jikuan Qian, Xiaoqiong Huang, Wenlong Shi, Haoran Dou, Chaoyu Chen, Yuanji Zhang, Huanjia Luo, Alejandro Frangi, Yi Xiong, Dong Ni

    Abstract: 3D ultrasound (US) is widely used due to its rich diagnostic information, portability and low cost. Automated standard plane (SP) localization in US volume not only improves efficiency and reduces user-dependence, but also boosts 3D US interpretation. In this study, we propose a novel Multi-Agent Reinforcement Learning (MARL) framework to localize multiple uterine SPs in 3D US simultaneously. Our… ▽ More

    Submitted 30 July, 2020; originally announced July 2020.

    Comments: Early accepted by MICCAI 2020

  47. arXiv:2006.01712  [pdf, other

    cs.SD eess.AS

    Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition

    Authors: Shiliang Zhang, Zhifu Gao, Haoneng Luo, Ming Lei, Jie Gao, Zhijie Yan, Lei Xie

    Abstract: Recently, streaming end-to-end automatic speech recognition (E2E-ASR) has gained more and more attention. Many efforts have been paid to turn the non-streaming attention-based E2E-ASR system into streaming architecture. In this work, we propose a novel online E2E-ASR system by using Streaming Chunk-Aware Multihead Attention(SCAMA) and a latency control memory equipped self-attention network (LC-SA… ▽ More

    Submitted 20 May, 2020; originally announced June 2020.

    Comments: submitted to INTERSPEECH2020

  48. arXiv:2005.10463  [pdf, other

    cs.SD cs.CL eess.AS

    Simplified Self-Attention for Transformer-based End-to-End Speech Recognition

    Authors: Haoneng Luo, Shiliang Zhang, Ming Lei, Lei Xie

    Abstract: Transformer models have been introduced into end-to-end speech recognition with state-of-the-art performance on various tasks owing to their superiority in modeling long-term dependencies. However, such improvements are usually obtained through the use of very large neural networks. Transformer models mainly include two submodules - position-wise feedforward layers and self-attention (SAN) layers.… ▽ More

    Submitted 17 November, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: Accepted to SLT 2021

  49. arXiv:2003.11232  [pdf, ps, other

    eess.SP cs.IT

    QoS-Based Source and Relay Secure Optimization Design with Presence of Channel Uncertainty

    Authors: Meng Zhang, Jian Huang, Hui Yu, Hanwen Luo, Wen Chen

    Abstract: In this letter, we study relay-aided networks with presence of single eavesdropper. We provide joint beamforming design of the source and relay that can minimize the overall power consumption while satisfying our predefined quality-of-service (QoS) requirements. Additionally, we investigate the case that the channel between relay and eavesdropper suffers from channel uncertainty. Finally, simulati… ▽ More

    Submitted 25 March, 2020; originally announced March 2020.

    Comments: CL

  50. arXiv:2003.10676  [pdf, ps, other

    eess.SP cs.IT

    Robust Beamforming Design for Sum Secrecy Rate Optimization in MU-MISO Networks

    Authors: Pu Zhao, Meng Zhang, Hui Yu, Hanwen Luo, Wen Chen

    Abstract: This paper studies the beamforming design problem of a multi-user downlink network, assuming imperfect channel state information known to the base station. In this scenario, the base station is equipped with multiple antennas, and each user is wiretapped by a specific eavesdropper where each user or eavesdropper is equipped with one antenna. It is supposed that the base station employs transmit be… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: TIFS