Intelligent Reflecting Surface-Aided Radar Spoofing

Authors: Haozhe Wang, Beixiong Zheng, Xiaodan Shao, Rui Zhang

Abstract: Electronic countermeasure (ECM) technology plays a critical role in modern electronic warfare, which can interfere with enemy radar detection systems by noise or deceptive signals. However, the conventional active jamming strategy incurs additional hardware and power costs and has the potential threat of exposing the target itself. To tackle the above challenges, we propose a new intelligent reflecting surface (IRS)-aided radar spoofing strategy in this letter, where IRS is deployed on the surface of a target to help eliminate the signals reflected towards the hostile radar to shield the target, while simultaneously redirecting its reflected signal towards a surrounding clutter to generate deceptive angle-of-arrival (AoA) sensing information for the radar. We optimize the IRS's reflection to maximize the received signal power at the radar from the direction of the selected clutter subject to the constraint that its received power from the direction of the target is lower than a given detection threshold. We first solve this non-convex optimization problem using the semidefinite relaxation (SDR) method and further propose a lower-complexity solution for real-time implementation. Simulation results validate the efficacy of our proposed IRS-aided spoofing system as compared to various benchmark schemes.

Submitted 11 May, 2024; originally announced May 2024.

Comments: 5 pages, 4 figures

arXiv:2404.15946 [pdf]

Mammo-CLIP: Leveraging Contrastive Language-Image Pre-training (CLIP) for Enhanced Breast Cancer Diagnosis with Multi-view Mammography

Authors: Xuxin Chen, Yuheng Li, Mingzhe Hu, Ella Salari, Xiaoqian Chen, Richard L. J. Qiu, Bin Zheng, Xiaofeng Yang

Abstract: Although fusion of information from multiple views of mammograms plays an important role in increasing the accuracy of breast cancer detection, developing multi-view mammogram-based computer-aided diagnosis (CAD) schemes still faces challenges, and no such CAD schemes have been used in clinical practice. To overcome the challenges, we investigate a new approach based on Contrastive Language-Image Pre-training (CLIP), which has sparked interest across various medical imaging tasks. By solving the challenges in (1) effectively adapting the single-view CLIP for multi-view feature fusion and (2) efficiently fine-tuning this parameter-dense model with limited samples and computational resources, we introduce Mammo-CLIP, the first multi-modal framework to process multi-view mammograms and corresponding simple texts. Mammo-CLIP uses an early feature fusion strategy to learn multi-view relationships in four mammograms acquired from the CC and MLO views of the left and right breasts. To enhance learning efficiency, plug-and-play adapters are added into CLIP image and text encoders for fine-tuning parameters and limiting updates to about 1% of the parameters. For framework evaluation, we assembled two datasets retrospectively. The first dataset, comprising 470 malignant and 479 benign cases, was used for few-shot fine-tuning and internal evaluation of the proposed Mammo-CLIP via 5-fold cross-validation. The second dataset, including 60 malignant and 294 benign cases, was used to test generalizability of Mammo-CLIP. Study results show that Mammo-CLIP outperforms the state-of-the-art cross-view transformer in AUC (0.841 vs. 0.817, 0.837 vs. 0.807) on both datasets. It also surpasses two previous CLIP-based methods by 20.3% and 14.3%. This study highlights the potential of applying fine-tuned vision-language models for developing next-generation, image-text-based CAD schemes for breast cancer.

Submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.08366 [pdf, other]

Intelligent Reflecting Surface-Enabled Anti-Detection for Secure Sensing and Communications

Authors: Beixiong Zheng, Xue Xiong, Tiantian Ma, Jie Tang, Derrick Wing Kwan Ng, A. Lee Swindlehurst, Rui Zhang

Abstract: The ever-increasing reliance on wireless communication and sensing has led to growing concerns over the vulnerability of sensitive information to unauthorized detection and interception. Traditional anti-detection methods are often inadequate, suffering from limited adaptability and diminished effectiveness against advanced detection technologies. To overcome these challenges, this article presents the intelligent reflecting surface (IRS) as a groundbreaking technology for enabling flexible electromagnetic manipulation, which has the potential to revolutionize anti-detection in both electromagnetic stealth/spoofing (evading radar detection) and covert communications (facilitating secure information exchange). We explore the fundamental principles of IRS and its advantages over traditional anti-detection techniques and discuss various design challenges associated with implementing IRS-based anti-detection systems. Through the examination of case studies and future research directions, we provide a comprehensive overview of the potential of IRS technology to serve as a formidable shield in the modern wireless landscape.

Submitted 21 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

Comments: 7 pages, 5 figures

arXiv:2404.06830 [pdf, ps, other]

EMF Exposure Mitigation via MAC Scheduling

Authors: Silvio Mandelli, Lorenzo Maggi, Bill Zheng, Christophe Grangeat, Azra Zejnilagic

Abstract: International standards bodies define Electromagnetic field (EMF) emission requirements that can be translated into control of the base station actual Effective Isotropic Radiated Power (EIRP), i.e., averaged over a sliding time window. In this work we show how to comply with such requirements by designing a water-filling power allocation method operating at the MAC scheduler level. Our method ensures throughput fairness across users while constraining the EIRP to a value that is produced by an outer-loop procedure which is not the focus of our paper. The low computational complexity of our technique is appealing given the tight computational requirements of the MAC scheduler. Our proposal is evaluated against the prior art approaches through massive-MIMO system level simulations that include realistic modeling of physical and MAC level cellular procedures. We conclude that our proposal effectively mitigates EMF exposure with considerably less impact on network performance, making it a standout candidate for 5G and future 6G MAC scheduler implementations.

Submitted 19 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

Comments: 5 pages, 3 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2403.17627 [pdf, other]

Waveform Design for Joint Communication and SAR Imaging Under Random Signaling

Authors: Bowen Zheng, Fan Liu

Abstract: Conventional synthetic aperture radar (SAR) imaging systems typically employ deterministic signal designs, which lack the capability to convey communication information and are thus not suitable for integrated sensing and communication (ISAC) scenarios. In this letter, we propose a joint communication and SAR imaging (JCASAR) system based on orthogonal frequency-division multiplexing (OFDM) signal with cyclic prefix (CP), which is capable of reconstructing the target profile while serving a communication user. In contrast to traditional matched filters, we propose a least squares (LS) estimator for range profiling. Then the SAR image is obtained followed by range cell migration correction (RCMC) and azimuth processing. By minimizing the mean squared error (MSE) of the proposed LS estimator, we investigate the optimal waveform design for SAR imaging, and JCASAR under random signaling, where power allocation strategies are conceived for Gaussian-distributed ISAC signals, in an effort to strike a flexible performance tradeoff between the communication and SAR imaging tasks. Numerical results are provided to validate the effectiveness of the proposed ISAC waveform design for JCASAR systems.

Submitted 26 March, 2024; originally announced March 2024.

Comments: 5 pages

arXiv:2403.12352 [pdf, other]

A New Intelligent Reflecting Surface-Aided Electromagnetic Stealth Strategy

Authors: Xue Xiong, Beixiong Zheng, A. Lee Swindlehurst, Jie Tang, Wen Wu

Abstract: Electromagnetic wave absorbing material (EWAM) plays an essential role in manufacturing stealth aircraft, which can achieve the electromagnetic stealth (ES) by reducing the strength of the signal reflected back to the radar system. However, the stealth performance is limited by the coating thickness, incident wave angles, and working frequencies. To tackle these limitations, we propose a new intelligent reflecting surface (IRS)-aided ES system where an IRS is deployed at the target to synergize with EWAM for effectively mitigating the echo signal and thus reducing the radar detection probability. Considering the monotonic relationship between the detection probability and the received signal-to-noise-ratio (SNR) at the radar, we formulate an optimization problem that minimizes the SNR under the reflection constraint of each IRS element, and a semi-closed-form solution is derived by using Karush-Kuhn-Tucker (KKT) conditions. Simulation results validate the superiority of the proposed IRS-aided ES system compared to various benchmarks.

Submitted 18 March, 2024; originally announced March 2024.

Comments: 5 pages, 4 figures

arXiv:2403.11556 [pdf, other]

Hierarchical Frequency-based Upsampling and Refining for Compressed Video Quality Enhancement

Authors: Qianyu Zhang, Bolun Zheng, Xinying Chen, Quan Chen, Zhunjie Zhu, Can** Wang, Zongpeng Li, Chengang Yan

Abstract: Video compression artifacts arise due to the quantization operation in the frequency domain. The goal of video quality enhancement is to reduce compression artifacts and reconstruct a visually pleasant result. In this work, we propose a hierarchical frequency-based upsampling and refining neural network (HFUR) for compressed video quality enhancement. HFUR consists of two modules: an implicit frequency upsampling module (ImpFreqUp) and a hierarchical and iterative refinement module (HIR). ImpFreqUp exploits a DCT-domain prior derived through an implicit DCT transform, and accurately reconstructs the DCT-domain loss via a coarse-to-fine transfer. Consequently, HIR is introduced to facilitate cross-collaboration and information compensation between the scales, thus further refining the feature maps and promoting the visual quality of the final output. We demonstrate the effectiveness of the proposed modules via ablation experiments and visualized results. Extensive experiments on public benchmarks show that HFUR achieves state-of-the-art performance for both constant bit rate and constant QP modes.

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2401.02678 [pdf, other]

MusicAOG: an Energy-Based Model for Learning and Sampling a Hierarchical Representation of Symbolic Music

Authors: Yikai Qian, Tianle Wang, Xinyi Tong, Xin **, Duo Xu, Bo Zheng, Tiezheng Ge, Feng Yu, Song-Chun Zhu

Abstract: In addressing the challenge of interpretability and generalizability of artificial music intelligence, this paper introduces a novel symbolic representation that amalgamates both explicit and implicit musical information across diverse traditions and granularities. Utilizing a hierarchical and-or graph representation, the model employs nodes and edges to encapsulate a broad spectrum of musical elements, including structures, textures, rhythms, and harmonies. This hierarchical approach expands the representability across various scales of music. This representation serves as the foundation for an energy-based model, uniquely tailored to learn musical concepts through a flexible algorithm framework relying on the minimax entropy principle. Utilizing an adapted Metropolis-Hastings sampling technique, the model enables fine-grained control over music generation. A comprehensive empirical evaluation, contrasting this novel approach with existing methodologies, manifests considerable advancements in interpretability and controllability. This study marks a substantial contribution to the fields of music analysis, composition, and computational musicology.

Submitted 5 January, 2024; originally announced January 2024.

arXiv:2312.16918 [pdf, other]

Intelligent Surfaces Empowered Wireless Network: Recent Advances and The Road to 6G

Authors: Qingqing Wu, Beixiong Zheng, Changsheng You, Lipeng Zhu, Kaiming Shen, Xiaodan Shao, Weidong Mei, Boya Di, Hongliang Zhang, Ertugrul Basar, Lingyang Song, Marco Di Renzo, Zhi-Quan Luo, Rui Zhang

Abstract: Intelligent surfaces (ISs) have emerged as a key technology to empower a wide range of appealing applications for wireless networks, due to their low cost, high energy efficiency, flexibility of deployment and capability of constructing favorable wireless channels/radio environments. Moreover, the recent advent of several new IS architectures further expanded their electromagnetic functionalities from passive reflection to active amplification, simultaneous reflection and refraction, as well as holographic beamforming. However, the research on ISs is still in rapid progress and there have been recent technological advances in ISs and their emerging applications that are worthy of a timely review. Thus, we provide in this paper a comprehensive survey on the recent development and advances of IS-aided wireless networks. Specifically, we start with an overview on the anticipated use cases of ISs in future wireless networks such as 6G, followed by a summary of the recent standardization activities related to ISs. Then, the main design issues of the commonly adopted reflection-based IS and their state-of-the-art solutions are presented in detail, including reflection optimization, deployment, signal modulation, wireless sensing, and integrated sensing and communications. Finally, recent progress and new challenges in advanced IS architectures are discussed to inspire future research.

Submitted 24 March, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

arXiv:2312.01940 [pdf, ps, other]

Intelligent Reflecting Surface-Aided Electromagnetic Stealth Against Radar Detection

Authors: Beixiong Zheng, Xue Xiong, Jie Tang, Rui Zhang

Abstract: While traditional electromagnetic stealth materials/metasurfaces can render a target virtually invisible to some extent, they lack flexibility and adaptability, and can only operate within a limited frequency and angle/direction range, making it challenging to ensure the expected stealth performance. In view of this, we propose in this paper a new intelligent reflecting surface (IRS)-aided electromagnetic stealth system mounted on targets to evade radar detection, by utilizing the tunable passive reflecting elements of IRS to achieve flexible and adaptive electromagnetic stealth in a cost-effective manner. Specifically, we optimize the IRS's reflection at the target to minimize the sum received signal power of all adversary radars. We first address the IRS's reflection optimization problem using the Lagrange multiplier method and derive a semi-closed-form optimal solution for the single-radar setup, which is then generalized to the multi-radar case. To meet real-time processing requirements, we further propose low-complexity closed-form solutions based on the reverse alignment/cancellation and minimum mean-square error (MMSE) criteria for the single-radar and multi-radar cases, respectively. Additionally, we propose practical low-complexity estimation schemes at the target to acquire angle-of-arrival (AoA) and/or path gain information via a small number of receive sensing devices. Simulation results validate the performance advantages of our proposed IRS-aided electromagnetic stealth system with the proposed IRS reflection designs.

Submitted 4 December, 2023; originally announced December 2023.

Comments: 13 pages (double-column), 10 figures, submitted in October

arXiv:2310.01342 [pdf, other]

Near-field Integrated Sensing and Communication: Opportunities and Challenges

Authors: Jiayi Cong, Changsheng You, Jiapeng Li, Li Chen, Beixiong Zheng, Yuanwei Liu, Wen Wu, Yi Gong, Shi **, Rui Zhang

Abstract: With the extremely large-scale array (XL-array) deployed in future wireless systems, wireless communication and sensing are expected to operate in the radiative near-field region, which needs to be characterized by spherical rather than planar wavefronts. Unlike most existing works that considered far-field integrated sensing and communication (ISAC), we study in this article the new near-field ISAC, which integrates both functions of sensing and communication in the near-field region. To this end, we first discuss the appealing advantages of near-field communication and sensing over their far-field counterparts, respectively. Then, we introduce three approaches for near-field ISAC, including joint near-field communication and sensing, sensing-assisted near-field communication, and communication-assisted near-field sensing. We discuss their individual research opportunities and new design issues, and propose promising solutions. Finally, several important directions in near-field ISAC are also highlighted to motivate future work.

Submitted 17 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

Comments: This work is submitted to IEEE for possible publication

arXiv:2307.16440 [pdf, other]

Towards Head Computed Tomography Image Reconstruction Standardization with Deep Learning Assisted Automatic Detection

Authors: Bowen Zheng, Chenxi Huang, Yuemei Luo

Abstract: Three-dimensional (3D) reconstruction of head Computed Tomography (CT) images elucidates the intricate spatial relationships of tissue structures, thereby assisting in accurate diagnosis. Nonetheless, securing an optimal head CT scan without deviation is challenging in clinical settings, owing to poor positioning by technicians, patient's physical constraints, or CT scanner tilt angle restrictions. Manual formatting and reconstruction not only introduce subjectivity but also strain time and labor resources. To address these issues, we propose an efficient automatic head CT images 3D reconstruction method, improving accuracy and repeatability, as well as diminishing manual intervention. Our approach employs a deep learning-based object detection algorithm, identifying and evaluating orbitomeatal line landmarks to automatically reformat the images prior to reconstruction. Given the dearth of existing evaluations of object detection algorithms in the context of head CT images, we compared ten methods from both theoretical and experimental perspectives. By exploring their precision, efficiency, and robustness, we singled out the lightweight YOLOv8 as the aptest algorithm for our task, with an mAP of 92.77% and impressive robustness against class imbalance. Our qualitative evaluation of standardized reconstruction results demonstrates the clinical practicability and validity of our method.

Submitted 15 September, 2023; v1 submitted 31 July, 2023; originally announced July 2023.

arXiv:2306.16206 [pdf, other]

Near-Field Beam Management for Extremely Large-Scale Array Communications

Authors: Changsheng You, Yunpu Zhang, Chenyu Wu, Yong Zeng, Beixiong Zheng, Li Chen, Linglong Dai, A. Lee Swindlehurst

Abstract: Extremely large-scale arrays (XL-arrays) have emerged as a promising technology to achieve super-high spectral efficiency and spatial resolution in future wireless systems. The large aperture of XL-arrays means that spherical rather than planar wavefronts must be considered, and a paradigm shift from far-field to near-field communications is necessary. Unlike existing works that have mainly considered far-field beam management, we study the new near-field beam management for XL-arrays. We first provide an overview of near-field communications and introduce various applications of XL-arrays in both outdoor and indoor scenarios. Then, three typical near-field beam management methods for XL-arrays are discussed: near-field beam training, beam tracking, and beam scheduling. We point out their main design issues and propose promising solutions to address them. Moreover, other important directions in near-field communications are also highlighted to motivate future research.

Submitted 28 June, 2023; originally announced June 2023.

Comments: We studied the new near-field beam management for XL-arrays. This paper has been submitted to IEEE for possible publication

arXiv:2303.08019 [pdf, other]

Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection

Authors: **chao Li, Kaitao Song, Junan Li, Bo Zheng, Dongsheng Li, Xixin Wu, Xunying Liu, Helen Meng

Abstract: With the global population aging rapidly, Alzheimer's disease (AD) is particularly prominent in older adults, which has an insidious onset and leads to a gradual, irreversible deterioration in cognitive domains (memory, communication, etc.). Speech-based AD detection opens up the possibility of widespread screening and timely disease intervention. Recent advances in pre-trained models motivate AD detection modeling to shift from low-level features to high-level representations. This paper presents several efficient methods to extract better AD-related cues from high-level acoustic and linguistic features. Based on these features, the paper also proposes a novel task-oriented approach by modeling the relationship between the participants' description and the cognitive task. Experiments are carried out on the ADReSS dataset in a binary classification setup, and models are evaluated on the unseen test set. Results and comparison with recent literature demonstrate the efficiency and superior performance of proposed acoustic, linguistic and task-oriented methods. The findings also show the importance of semantic and syntactic information, and feasibility of automation and generalization with the promising audio-only and task-oriented methods for the AD detection task.

Submitted 14 March, 2023; originally announced March 2023.

Comments: 5 pages, 3 figures, 3 tables

arXiv:2302.12428 [pdf]

A holistically 3D-printed flexible millimeter-wave Doppler radar: Towards fully printed high-frequency multilayer flexible hybrid electronics systems

Authors: Hong Tang, Yingjie Zhang, Bowen Zheng, Sensong An, Mohammad Haerinia, Yunxi Dong, Yi Huang, Wei Guo, Hualiang Zhang

Abstract: Flexible hybrid electronics (FHE) is an emerging technology enabled through the integration of advanced semiconductor devices and 3D printing technology. It unlocks tremendous market potential by realizing low-cost flexible circuits and systems that can be conformally integrated into various applications. However, the operating frequencies of most reported FHE systems are relatively low. It is also worth noting that reported FHE systems have been limited to relatively simple design concepts (since complex systems will impose challenges in aspects such as multilayer interconnections, printing materials, and bonding layers). Here, we report a fully 3D-printed flexible four-layer millimeter-wave Doppler radar (i.e., a millimeter-wave FHE system). The sensing performance and flexibility of the 3D-printed radar are characterized and validated by general field tests and bending tests, respectively. Our results demonstrate the feasibility of developing fully 3D-printed high-frequency multilayer FHE, which can be conformally integrated into irregular surfaces (e.g., vehicle bumpers) for applications such as vehicle radars and wearable electronics.

Submitted 23 February, 2023; originally announced February 2023.

MSC Class: 78-05

arXiv:2301.07277 [pdf, other]

Mixed Near- and Far-Field Communications for Extremely Large-Scale Array: An Interference Perspective

Authors: Yunpu Zhang, Changsheng You, Li Chen, Beixiong Zheng

Abstract: Extremely large-scale array (XL-array) is envisioned to achieve super-high spectral efficiency in future wireless networks. Different from the existing works that mostly focus on the near-field communications, we consider in this paper a new and practical scenario, called mixed near- and far-field communications, where there exist both near- and far-field users in the network. For this scenario, we first obtain a closed-form expression for the inter-user interference at the near-field user caused by the far-field beam by using Fresnel functions, based on which the effects of the number of BS antennas, far-field user (FU) angle, near-field user (NU) angle and distance are analyzed. We show that the strong interference exists when the number of the BS antennas and the NU distance are relatively small, and/or the NU and FU angle-difference is small. Then, we further obtain the achievable rate of the NU as well as its rate loss caused by the FU interference. Last, numerical results are provided to corroborate our analytical results.

Submitted 28 January, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

Comments: We studied the multi-user interference in the mixed near- and far-field communications. This paper has been submitted to IEEE for possible publication

arXiv:2210.16539 [pdf, other]

Exploiting prompt learning with pre-trained language models for Alzheimer's Disease detection

Authors: Yi Wang, Jiajun Deng, Tianzi Wang, Bo Zheng, Shoukang Hu, Xunying Liu, Helen Meng

Abstract: Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care and to delay further progression. Speech based automatic AD screening systems provide a non-intrusive and more scalable alternative to other clinical screening techniques. Textual embedding features produced by pre-trained language models (PLMs) such as BERT are widely used in such systems. However, PLM domain fine-tuning is commonly based on the masked word or sentence prediction costs that are inconsistent with the back-end AD detection task. To this end, this paper investigates the use of prompt-based fine-tuning of PLMs that consistently uses AD classification errors as the training objective function. Disfluency features based on hesitation or pause filler token frequencies are further incorporated into prompt phrases during PLM fine-tuning. The decision voting based combination among systems using different PLMs (BERT and RoBERTa) or systems with different fine-tuning paradigms (conventional masked-language modelling fine-tuning and prompt-based fine-tuning) is further applied. Mean, standard deviation and the maximum among accuracy scores over 15 experiment runs are adopted as performance measurements for the AD detection system. Mean detection accuracy of 84.20% (with std 2.09%, best 87.5%) and 82.64% (with std 4.0%, best 89.58%) were obtained using manual and ASR speech transcripts respectively on the ADReSS20 test set consisting of 48 elderly speakers.

Submitted 31 March, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

Comments: Accepted by ICASSP 2023 (will update with IEEE version later)

arXiv:2209.06390 [pdf, ps, other]

Multi-Active Multi-Passive (MAMP)-IRS Aided Wireless Communication: A Multi-Hop Beam Routing Design

Authors: Yunpu Zhang, Changsheng You, Beixiong Zheng

Abstract: Prior studies on intelligent reflecting surface (IRS) have mostly considered wireless communication systems aided by a single passive IRS, which, however, has limited control over wireless propagation environment and suffers severe product-distance path-loss. To address these issues, we propose in this paper a new multi-active multi-passive (MAMP)-IRS aided wireless communication system, where a number of active and passive IRSs are deployed to assist the downlink communication in complex environment, by establishing a multi-hop reflection path across active and passive IRSs. An optimization problem is formulated to maximize the achievable rate of a typical user by designing the active-and-passive IRS routing path as well as the joint beamforming of the BS and selected active/passive IRSs. To draw useful insights into the optimal design, we first consider a special case of the single-active multi-passive (SAMP)-IRS aided system. For this case, we propose an efficient algorithm to obtain its optimal solution by first optimizing the joint beamforming given any SAMP-IRS routing path, and then optimizing the routing path by using a new path decomposition method and graph theory. Next, for the general MAMP-IRS aided system, we show that its challenging beam routing optimization problem can be efficiently solved by a new two-phase approach. Its key idea is to first optimize the inner passive-IRS beam routing between each two active IRSs for effective channel power gain maximization, followed by an outer active-IRS beam routing optimization for rate maximization. Last, numerical results are provided to demonstrate the effectiveness of the proposed MAMP-IRS beam routing scheme.

Submitted 6 January, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

Comments: In this updated version, we refine some results in the original paper. We studied the multi-hop beam routing design for a new multi-active multi-passive (MAMP)-IRS aided wireless communication system. This paper has been submitted to IEEE for possible publication. arXiv admin note: text overlap with arXiv:2208.11877

arXiv:2207.03157 [pdf, other]

Roadside IRS-Aided Vehicular Communication: Efficient Channel Estimation and Low-Complexity Beamforming Design

Authors: Zixuan Huang, Beixiong Zheng, Rui Zhang

Abstract: Intelligent reflecting surface (IRS) has emerged as a promising technique to control wireless propagation environment for enhancing the communication performance cost-effectively. However, the rapidly time-varying channel in high-mobility communication scenarios such as vehicular communication renders it challenging to obtain the instantaneous channel state information (CSI) efficiently for IRS with a large number of reflecting elements. In this paper, we propose a new roadside IRS-aided vehicular communication system to tackle this challenge. Specifically, by exploiting the symmetrical deployment of IRSs with inter-laced equal intervals on both sides of the road and the cooperation among nearby IRS controllers, we propose a new two-stage channel estimation scheme with off-line and online training, respectively, to obtain the static/time-varying CSI required by the proposed low-complexity passive beamforming scheme efficiently. The proposed IRS beamforming and online channel estimation designs leverage the existing uplink pilots in wireless networks and do not require any change of the existing transmission protocol. Moreover, they can be implemented by each of IRS controllers independently, without the need of any real-time feedback from the user's serving BS. Simulation results show that the proposed designs can efficiently achieve the high IRS passive beamforming gain and thus significantly enhance the achievable communication throughput for high-speed vehicular communications.

Submitted 15 February, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

arXiv:2206.10096 [pdf]

Transformers Improve Breast Cancer Diagnosis from Unregistered Multi-View Mammograms

Authors: Xuxin Chen, Ke Zhang, Neman Abdoli, Patrik W. Gilley, Ximin Wang, Hong Liu, Bin Zheng, Yuchen Qiu

Abstract: Deep convolutional neural networks (CNNs) have been widely used in various medical imaging tasks. However, due to the intrinsic locality of convolution operation, CNNs generally cannot model long-range dependencies well, which are important for accurately identifying or mapping corresponding breast lesion features computed from unregistered multiple mammograms. This motivates us to leverage the architecture of Multi-view Vision Transformers to capture long-range relationships of multiple mammograms from the same patient in one examination. For this purpose, we employ local Transformer blocks to separately learn patch relationships within four mammograms acquired from two-view (CC/MLO) of two-side (right/left) breasts. The outputs from different views and sides are concatenated and fed into global Transformer blocks, to jointly learn patch relationships between four images representing two different views of the left and right breasts. To evaluate the proposed model, we retrospectively assembled a dataset involving 949 sets of mammograms, which include 470 malignant cases and 479 normal or benign cases. We trained and evaluated the model using a five-fold cross-validation method. Without any arduous preprocessing steps (e.g., optimal window cropping, chest wall or pectoral muscle removal, two-view image registration, etc.), our four-image (two-view-two-side) Transformer-based model achieves case classification performance with an area under ROC curve (AUC = 0.818), which significantly outperforms AUC = 0.784 achieved by the state-of-the-art multi-view CNNs (p = 0.009). It also outperforms two one-view-two-side models that achieve AUC of 0.724 (CC view) and 0.769 (MLO view), respectively. The study demonstrates the potential of using Transformers to develop high-performing computer-aided diagnosis schemes that combine four mammograms.

Submitted 20 June, 2022; originally announced June 2022.

arXiv:2203.10219 [pdf, other]

doi 10.1109/TSP.2022.3146791

Efficient DOA Estimation Method for Reconfigurable Intelligent Surfaces Aided UAV Swarm

Authors: Peng Chen, Zhimin Chen, Beixiong Zheng, Xianbin Wang

Abstract: The conventional direction of arrival (DOA) estimation methods are performed with multiple receiving channels. In this paper, a challenging DOA estimation problem is addressed in a different scenario with only one full-functional receiving channel. A new unmanned aerial vehicle (UAV) swarm system using multiple lifted reconfigurable intelligent surface (RIS) is proposed for the DOA estimation. The UAV movement degrades the DOA estimation performance significantly, and the existing atomic norm minimization (ANM) methods cannot be used in the scenario with array perturbation. Specifically, considering the position perturbation of UAVs, a new atomic norm-based DOA estimation method is proposed, where an atomic norm is defined with the parameter of the position perturbation. Then, a customized semi-definite programming (SDP) method is derived to solve the atomic norm-based method, where different from the traditional SDP method, an additional transforming matrix is formulated. Moreover, a gradient descent method is applied to refine the estimated DOA and the position perturbation further. Simulation results show that the proposed method achieves much better DOA estimation performance in the RIS-aided UAV swarm system with only one receiving channel than various benchmark schemes.

Submitted 18 March, 2022; originally announced March 2022.

Journal ref: IEEE Transactions on Signal Processing (2022): 743 - 755

arXiv:2202.04370 [pdf, ps, other]

Simultaneous Transmit Diversity and Passive Beamforming with Large-Scale Intelligent Reflecting Surface: Far-Field or Near-Field?

Authors: Beixiong Zheng, Rui Zhang

Abstract: Intelligent reflecting surface (IRS) has emerged as a cost-effective solution to enhance wireless communication performance via passive signal reflection. Existing works on IRS have mainly focused on investigating IRS's passive beamforming/reflection design to boost the communication rate for users assuming that their channel state information (CSI) is fully or partially known. However, how to exploit IRS to improve the wireless transmission reliability without any CSI, which is typical in high-mobility/delay-sensitive communication scenarios, remains largely open. In this paper, we study a new IRS-aided communication system with the IRS integrated to its aided access point (AP) to achieve both functions of transmit diversity and passive beamforming simultaneously. Specifically, we first show an interesting result that the IRS's passive beamforming gain in any direction is invariant to the common phase-shift applied to all of its reflecting elements. Accordingly, we design the common phase-shift of IRS elements to achieve transmit diversity at the AP side without the need of any CSI of the users. In addition, we propose a practical method for the users to estimate the CSI at the receiver side for information decoding. Meanwhile, we show that the conventional passive beamforming gain of IRS can be retained for the other users with their CSI known at the AP. Furthermore, we derive the asymptotic performance of both IRS-aided transmit diversity and passive beamforming in closed-form, by considering the large-scale IRS with an infinite number of elements. Numerical results validate our analysis and show the performance gains of the proposed IRS-aided simultaneous transmit diversity and passive beamforming scheme over other benchmark schemes.

Submitted 16 July, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

Comments: Large-scale IRS-aided simultaneous "transmit diversity" and "passive beamforming": Far-Field or Near-Field? (31 pages, 9 figures)

arXiv:2202.02550 [pdf, ps, other]

Intelligent Reflecting Surface-Aided Spectrum Sensing for Cognitive Radio

Authors: Shaoe Lin, Beixiong Zheng, Fangjiong Chen, Rui Zhang

Abstract: Spectrum sensing is a key enabling technique for cognitive radio (CR), which provides essential information on the spectrum availability. However, due to severe wireless channel fading and path loss, the primary user (PU) signals received at the CR or secondary user (SU) can be practically too weak for reliable detection. To tackle this issue, we consider in this letter a new intelligent reflecting surface (IRS)-aided spectrum sensing scheme for CR, by exploiting the large aperture and passive beamforming gains of IRS to boost the PU signal strength received at the SU to facilitate its spectrum sensing. Specifically, by dynamically changing the IRS reflection over time according to a given codebook, its reflected signal power varies substantially at the SU, which is utilized for opportunistic signal detection. Furthermore, we propose a weighted energy detection method by combining the received signal power values over different IRS reflections, which significantly improves the detection performance. Simulation results validate the performance gain of the proposed IRS-aided spectrum sensing scheme, as compared to different benchmark schemes.

Submitted 5 February, 2022; originally announced February 2022.

Comments: Accepted by IEEE Wireless Communications Letters (5 pages, 4 figures)

Journal ref: IEEE Wireless Communications Letters, 2022

arXiv:2201.10675 [pdf]

Virtual Adversarial Training for Semi-supervised Breast Mass Classification

Authors: Xuxin Chen, Ximin Wang, Ke Zhang, Kar-Ming Fung, Theresa C. Thai, Kathleen Moore, Robert S. Mannel, Hong Liu, Bin Zheng, Yuchen Qiu

Abstract: This study aims to develop a novel computer-aided diagnosis (CAD) scheme for mammographic breast mass classification using semi-supervised learning. Although supervised deep learning has achieved huge success across various medical image analysis tasks, its success relies on large amounts of high-quality annotations, which can be challenging to acquire in practice. To overcome this limitation, we propose employing a semi-supervised method, i.e., virtual adversarial training (VAT), to leverage and learn useful information underlying in unlabeled data for better classification of breast masses. Accordingly, our VAT-based models have two types of losses, namely supervised and virtual adversarial losses. The former loss acts as in supervised classification, while the latter loss aims at enhancing model robustness against virtual adversarial perturbation, thus improving model generalizability. To evaluate the performance of our VAT-based CAD scheme, we retrospectively assembled a total of 1024 breast mass images, with equal number of benign and malignant masses. A large CNN and a small CNN were used in this investigation, and both were trained with and without the adversarial loss. When the labeled ratios were 40% and 80%, VAT-based CNNs delivered the highest classification accuracy of 0.740 and 0.760, respectively. The experimental results suggest that the VAT-based CAD scheme can effectively utilize meaningful knowledge from unlabeled data to better classify mammographic breast mass images.

Submitted 25 January, 2022; originally announced January 2022.

Comments: To appear in the conference Biophotonics and Immune Responses of SPIE

arXiv:2201.09112 [pdf, other]

Safety-driven Interactive Planning for Neural Network-based Lane Changing

Authors: Xiangguo Liu, Ruochen Jiao, Bowen Zheng, Dave Liang, Qi Zhu

Abstract: Neural network-based driving planners have shown great promise in improving task performance of autonomous driving. However, it is critical and yet very challenging to ensure the safety of systems with neural network based components, especially in dense and highly interactive traffic environments. In this work, we propose a safety-driven interactive planning framework for neural network-based lane changing. To prevent over-conservative planning, we identify the driving behavior of surrounding vehicles and assess their aggressiveness, and then adapt the planned trajectory for the ego vehicle accordingly in an interactive manner. The ego vehicle can proceed to change lanes if a safe evasion trajectory exists even in the predicted worst case; otherwise, it can stay around the current lateral position or return to the original lane. We quantitatively demonstrate the effectiveness of our planner design and its advantage over baseline methods through extensive simulations with diverse and comprehensive experimental settings, as well as in real-world scenarios collected by an autonomous vehicle company.

Submitted 18 September, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

arXiv:2201.02913 [pdf, ps, other]

Intelligent Reflecting Surface-Aided LEO Satellite Communication: Cooperative Passive Beamforming and Distributed Channel Estimation

Authors: Beixiong Zheng, Shaoe Lin, Rui Zhang

Abstract: We consider in this paper a new intelligent reflecting surface (IRS)-aided LEO satellite communication system, by utilizing the controllable phase shifts of massive passive reflecting elements to achieve flexible beamforming, which copes with the time-varying channel between the high-mobility satellite (SAT) and ground node (GN) cost-effectively. In particular, we propose a new architecture for IRS-aided LEO satellite communication where IRSs are deployed at both sides of the SAT and GN, and study their cooperative passive beamforming (CPB) design over line-of-sight (LoS)-dominant single-reflection and double-reflection channels. Specifically, we jointly optimize the active transmit/receive beamforming at the SAT/GN as well as the CPB at two-sided IRSs to maximize the overall channel gain from the SAT to each GN. Interestingly, we show that under LoS channel conditions, the high-dimensional SAT-GN channel can be decomposed into the outer product of two low-dimensional vectors. By exploiting the decomposed SAT-GN channel, we decouple the original beamforming optimization problem into two simpler subproblems corresponding to the SAT and GN sides, respectively, which are both solved in closed-form. Furthermore, we propose an efficient transmission protocol to conduct channel estimation and beam tracking, which only requires independent processing of the SAT and GN in a distributed manner, thus substantially reducing the implementation complexity. Simulation results validate the performance advantages of the proposed IRS-aided LEO satellite communication system with two-sided cooperative IRSs, as compared to various baseline schemes such as the conventional reflect-array and one-sided IRS.

Submitted 8 January, 2022; originally announced January 2022.

Comments: major revision, JSAC

arXiv:2111.04330 [pdf, other]

Characterizing the adversarial vulnerability of speech self-supervised learning

Authors: Haibin Wu, Bo Zheng, Xu Li, Xixin Wu, Hung-yi Lee, Helen Meng

Abstract: A leaderboard named Speech processing Universal PERformance Benchmark (SUPERB), which aims at benchmarking the performance of a shared self-supervised learning (SSL) speech model across various downstream speech tasks with minimal modification of architectures and small amount of data, has fueled the research for speech representation learning. The SUPERB demonstrates speech SSL upstream models improve the performance of various downstream tasks through just minimal adaptation. As the paradigm of the self-supervised learning upstream model followed by downstream tasks arouses more attention in the speech community, characterizing the adversarial robustness of such paradigm is of high priority. In this paper, we make the first attempt to investigate the adversarial vulnerability of such paradigm under the attacks from both zero-knowledge adversaries and limited-knowledge adversaries. The experimental results illustrate that the paradigm proposed by SUPERB is seriously vulnerable to limited-knowledge adversaries, and the attacks generated by zero-knowledge adversaries are with transferability. The XAB test verifies the imperceptibility of crafted adversarial attacks.

Submitted 29 March, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

Comments: Accepted by ICASSP 2022

arXiv:2110.01292 [pdf, other]

A Survey on Channel Estimation and Practical Passive Beamforming Design for Intelligent Reflecting Surface Aided Wireless Communications

Authors: Beixiong Zheng, Changsheng You, Weidong Mei, Rui Zhang

Abstract: Intelligent reflecting surface (IRS) has emerged as a key enabling technology to realize smart and reconfigurable radio environment for wireless communications, by digitally controlling the signal reflection via a large number of passive reflecting elements in real-time. Different from conventional wireless communication techniques that only adapt to but have no or limited control over dynamic wireless channels, IRS provides a new and cost-effective means to combat the wireless channel impairments in a proactive manner. However, despite its great potential, IRS faces new and unique challenges in its efficient integration into wireless communication systems, especially its channel estimation and passive beamforming design under various practical hardware constraints. In this paper, we provide a comprehensive survey on the up-to-date research in IRS-aided wireless communications, with an emphasis on the promising solutions to tackle practical design issues. Furthermore, we discuss new and emerging IRS architectures and applications as well as their practical design problems to motivate future research.

Submitted 1 February, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

Comments: Accepted by IEEE Communications Surveys and Tutorials (76 pages, 17 figures, and 10 tables). In this paper, we provide a comprehensive survey on the up-to-date research in IRS-aided wireless communications, with an emphasis on the promising solutions to tackle practical design issues

Journal ref: IEEE Communications Surveys and Tutorials, 2022

arXiv:2109.13641 [pdf, other]

Intelligent Reflecting Surface Aided Wireless Networks: From Single-Reflection to Multi-Reflection Design and Optimization

Authors: Weidong Mei, Beixiong Zheng, Changsheng You, Rui Zhang

Abstract: Intelligent reflecting surface (IRS) has emerged as a promising technique for wireless communication networks. By dynamically tuning the reflection amplitudes/phase shifts of a large number of passive elements, IRS enables flexible wireless channel control and configuration, and thereby enhances the wireless signal transmission rate and reliability significantly. Despite the vast literature on designing and optimizing assorted IRS-aided wireless systems, prior works have mainly focused on enhancing wireless links with single signal reflection only by one or multiple IRSs, which may be insufficient to boost the wireless link capacity under some harsh propagation conditions (e.g., indoor environment with dense blockages/obstructions). This issue can be tackled by employing two or more IRSs to assist each wireless link and jointly exploiting their single as well as multiple signal reflections over them. However, the resultant double-/multi-IRS aided wireless systems face more complex design issues as well as new practical challenges for implementation as compared to the conventional single-IRS counterpart, in terms of IRS reflection optimization, channel acquisition, as well as IRS deployment and association/selection. As such, a new paradigm for designing multi-IRS cooperative passive beamforming and joint active/passive beam routing arises which calls for innovative design approaches and optimization methods. In this paper, we give a tutorial overview of multi-IRS aided wireless networks, with an emphasis on addressing the new challenges due to multi-IRS signal reflection and routing. Moreover, we point out important directions worthy of research and investigation in the future.

Submitted 25 April, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

Comments: Invited paper. Accepted for publication in the Proceedings of the IEEE

arXiv:2108.01873 [pdf]

doi 10.1109/LPT.2022.3142538

1.71 Tb/s Single-Channel and 56.51 Tb/s DWDM Transmission over 96.5 km Field-Deployed SSMF

Authors: Fabio Pittala, Ralf-Peter Braun, Georg Boecherer, Patrick Schulte, Maximilian Schaedler, Stefano Bettelli, Stefano Calabro, Maxim Kuschnerov, Andreas Gladisch, Fritz-Joachim Westphal, Changsong Xie, Rongfu Chen, Qibing Wang, Bofang Zheng

Abstract: We report an industry leading optical dense wavelength division multiplexing (DWDM) field trial with line rates per channel exceeding 1.66 Tb/s using 130 GBaud dual-polarization probabilistic constellation shaping 256-ary quadrature amplitude modulation (DP-PCS256QAM) in a high capacity data center interconnect (DCI) scenario. This research trial was performed on 96.5 km of field-deployed standard single mode G.652 fiber infrastructure of Deutsche Telekom in Germany employing Erbium-doped fiber amplifier (EDFA)-only amplification. A total of 34 channels were transmitted with 150 GHz spacing for a total fiber capacity of 56.51 Tb/s and a spectral efficiency higher than 11 bit/s/Hz. In the single-channel transmission scenario 1.71 Tb/s was achieved over the same link. In addition, we successfully demonstrate record net bitrates of 1.88 Tb/s in back-to-back (B2B) using 130 GBaud DP-PCS400QAM.

Submitted 4 August, 2021; originally announced August 2021.

Comments: This work has been submitted to the IEEE Photonics Technology Letters (PTL) for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

arXiv:2106.02274 [pdf, ps, other]

Transforming Fading Channel from Fast to Slow: Intelligent Refracting Surface Aided High-Mobility Communication

Authors: Zixuan Huang, Beixiong Zheng, Rui Zhang

Abstract: Intelligent reflecting/refracting surface (IRS) has recently emerged as a promising solution to reconfigure the wireless propagation environment for enhancing communication performance. In this paper, we study a new IRS-aided high-mobility communication system by employing the intelligent refracting surface with a high-speed vehicle to aid its passenger's communication with a remote base station (BS). Due to the environment's random scattering and the vehicle's high mobility, a rapidly time-varying channel typically results between the static BS and the fast-moving IRS/user, which renders channel estimation for an IRS with a large number of elements more challenging. In order to reap the high IRS passive beamforming gain with low channel training overhead, we propose a new and efficient transmission protocol to achieve both IRS channel estimation and refraction optimization for data transmission. Specifically, by exploiting the quasi-static channel between the IRS and user both moving at the same high speed as well as the line-of-sight (LoS) dominant channel between the BS and IRS, the user first estimates the LoS component of the cascaded BS-IRS-user channel, based on which IRS passive refraction is designed to maximize the corresponding IRS-refracted channel gain. Then, the user estimates the resultant IRS-refracted channel as well as the non-IRS-refracted channel for setting an additional common phase shift at all IRS refracting elements so as to align these two channels for maximizing the overall channel gain for data transmission. Simulation results show significant performance improvement of the proposed design as compared to various benchmark schemes. The proposed on-vehicle IRS system is further compared with a baseline scheme of deploying fixed intelligent reflecting surfaces on the roadside to assist high-speed vehicular communications, over which it achieves significant rate improvement.

Submitted 13 December, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

Comments: Conference version (arXiv:2011.03147). Accepted by TWC

arXiv:2105.13381 [pdf]

doi 10.1016/j.media.2022.102444

Recent advances and clinical applications of deep learning in medical image analysis

Authors: Xuxin Chen, Ximin Wang, Ke Zhang, Kar-Ming Fung, Theresa C. Thai, Kathleen Moore, Robert S. Mannel, Hong Liu, Bin Zheng, Yuchen Qiu

Abstract: Deep learning has received extensive research interest in developing new medical image processing algorithms, and deep learning based models have been remarkably successful in a variety of medical imaging tasks to support disease detection and diagnosis. Despite this success, further improvement of deep learning models in medical image analysis is largely bottlenecked by the lack of large-sized and well-annotated datasets. In the past five years, many studies have focused on addressing this challenge. In this paper, we review and summarize these recent studies to provide a comprehensive overview of applying deep learning methods in various medical image analysis tasks. In particular, we emphasize the latest progress and contributions of state-of-the-art unsupervised and semi-supervised deep learning in medical image analysis, which are summarized based on different application scenarios, including classification, segmentation, detection, and image registration. We also discuss the major technical challenges and suggest possible solutions for future research efforts.

Submitted 8 April, 2022; v1 submitted 27 May, 2021; originally announced May 2021.

Comments: To appear in the journal Medical Image Analysis. The registration section was revised

arXiv:2103.11736 [pdf, other]

Automatic Pulmonary Artery-Vein Separation in CT Images using Twin-Pipe Network and Topology Reconstruction

Authors: Lin Pan, Yaoyong Zheng, Liqin Huang, Liuqing Chen, Zhen Zhang, Rongda Fu, Bin Zheng, Shaohua Zheng

Abstract: With the development of medical computer-aided diagnostic systems, pulmonary artery-vein (A/V) separation plays a crucial role in assisting doctors in preoperative planning for lung cancer surgery. However, distinguishing arterial from venous irrigation in chest CT images remains a challenge due to the similarity and complex structure of the arteries and veins. We propose a novel method for automatic separation of pulmonary arteries and veins from chest CT images. The method consists of three parts. First, global connection information and local feature information are used to construct a complete topological tree and ensure the continuity of vessel reconstruction. Second, the proposed Twin-Pipe network automatically learns the differences between arteries and veins at different levels to reduce classification errors caused by changes in terminal vessel characteristics. Finally, the topology optimizer considers interbranch and intrabranch topological relationships to maintain spatial consistency and avoid the misclassification of A/V irrigations. We validate the performance of the method on chest CT images. Compared with manual classification, the proposed method achieves an average accuracy of 96.2% on noncontrast chest CT. In addition, the method generalizes well: accuracies of 93.8% and 94.8% are obtained for CT scans from other devices and other modes, respectively. The pulmonary artery-vein separation results obtained by the proposed method can provide better assistance for preoperative planning of lung cancer surgery.

Submitted 28 May, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

arXiv:2102.12755 [pdf, other]

Coarse-to-fine Airway Segmentation Using Multi information Fusion Network and CNN-based Region Growing

Authors: **quan Guo, Rongda Fu, Lin Pan, Shaohua Zheng, Liqin Huang, Bin Zheng, Bingwei He

Abstract: Automatic airway segmentation from chest computed tomography (CT) scans plays an important role in pulmonary disease diagnosis and computer-assisted therapy. However, low contrast at peripheral branches and complex tree-like structures remain the two main challenges for airway segmentation. Recent research has illustrated that deep learning methods perform well in segmentation tasks. Motivated by these works, a coarse-to-fine segmentation framework is proposed to obtain a complete airway tree. Our framework segments the overall airway and small branches via the multi-information fusion convolution neural network (Mif-CNN) and the CNN-based region growing, respectively. In Mif-CNN, atrous spatial pyramid pooling (ASPP) is integrated into a u-shaped network to expand the receptive field and capture multi-scale information. Meanwhile, boundary and location information are incorporated into semantic information. This information is fused to help Mif-CNN utilize additional context knowledge and useful features. To improve the segmentation result, the CNN-based region growing method is designed to focus on obtaining small branches. A voxel classification network (VCN), which can entirely capture the rich information around each voxel, is applied to classify the voxels into airway and non-airway. In addition, a shape reconstruction method is used to refine the airway tree.

Submitted 25 February, 2021; originally announced February 2021.

arXiv:2102.10919 [pdf, ps, other]

doi 10.1016/j.cmpb.2021.106363

Interpretative Computer-aided Lung Cancer Diagnosis: from Radiology Analysis to Malignancy Evaluation

Authors: Shaohua Zheng, Zhiqiang Shen, Chenhao Peia, Wangbin Ding, Hao** Lin, Jiepeng Zheng, Lin Pan, Bin Zheng, Liqin Huang

Abstract: Background and Objective: Computer-aided diagnosis (CAD) systems promote diagnosis effectiveness and alleviate pressure on radiologists. A CAD system for lung cancer diagnosis includes nodule candidate detection and nodule malignancy evaluation. Recently, deep learning-based pulmonary nodule detection has reached satisfactory performance ready for clinical application. However, deep learning-based nodule malignancy evaluation depends on heuristic inference from low-dose computed tomography volume to malignant probability, which lacks clinical cognition. Methods: In this paper, we propose a joint radiology analysis and malignancy evaluation network (R2MNet) to evaluate pulmonary nodule malignancy via radiology characteristics analysis. Radiological features are extracted as channel descriptors to highlight specific regions of the input volume that are critical for nodule malignancy evaluation. In addition, for model explanations, we propose channel-dependent activation mapping (CDAM) to visualize the features and shed light on the decision process of the deep neural network. Results: Experimental results on the LIDC-IDRI dataset demonstrate that the proposed method achieved an area under the curve (AUC) of 96.27% on nodule radiology analysis and an AUC of 97.52% on nodule malignancy evaluation. In addition, explanations of CDAM features showed that the shape and density of nodule regions were two critical factors that influence a nodule to be inferred as malignant, which conforms with the diagnosis cognition of experienced radiologists. Conclusion: By incorporating radiology analysis with nodule malignancy evaluation, the network inference process conforms to the diagnostic procedure of radiologists and increases the confidence of evaluation results. Besides, model interpretation with CDAM features sheds light on the regions that DNNs focus on when they estimate nodule malignancy probabilities.

Submitted 22 February, 2021; originally announced February 2021.

Comments: 11 pages, 8 figures

arXiv:2011.03147 [pdf, ps, other]

Transforming Fading Channel from Fast to Slow: IRS-Assisted High-Mobility Communication

Authors: Zixuan Huang, Beixiong Zheng, Rui Zhang

Abstract: In this paper, we study a new intelligent refracting surface (IRS)-assisted high-mobility communication with the IRS deployed in a high-speed moving vehicle to assist its passenger's communication with a static base station (BS) on the roadside. The vehicle's high Doppler frequency results in a fast fading channel between the BS and the passenger/user, which renders channel estimation for the IRS with a large number of refracting elements a more challenging task as compared to the conventional case with low-mobility users only. In order to mitigate the Doppler effect and reap the full IRS passive beamforming gain with low training overhead, we propose a new and efficient transmission protocol to execute channel estimation and IRS refraction design for data transmission. Specifically, by exploiting the quasi-static channel between the IRS and user both moving at the same high speed, we first estimate the cascaded BS-IRS-user channel with the Doppler effect compensated. Then, we estimate the instantaneous BS-user fast fading channel (without IRS refraction) and tune the IRS refraction over time accordingly to align the cascaded channel with the BS-user direct channel, thus maximizing the IRS's passive beamforming gain as well as converting their combined channel from fast to slow fading. Simulation results show the effectiveness of the proposed channel estimation scheme and passive beamforming design as compared to various benchmark schemes.

Submitted 9 March, 2021; v1 submitted 5 November, 2020; originally announced November 2020.

Comments: Accepted by ICC 2021

arXiv:2011.02880 [pdf, other]

Covariance Self-Attention Dual Path UNet for Rectal Tumor Segmentation

Authors: Haijun Gao, Bochuan Zheng, Dazhi Pan, Xiangyin Zeng

Abstract: Deep learning algorithms are preferable for rectal tumor segmentation. However, it is still a challenging task to accurately segment and identify the locations and sizes of rectal tumors by using deep learning methods. To increase the capability of extracting enough feature information for rectal tumor segmentation, we propose a Covariance Self-Attention Dual Path UNet (CSA-DPUNet). The proposed network mainly includes two improvements on UNet: 1) we modify UNet, which has only a single path structure, to consist of two contracting paths and two expansive paths (we name the new network DPUNet), which helps extract more feature information from CT images; 2) we employ the criss-cross self-attention module in DPUNet and replace the original correlation operation with a covariance operation, which further enhances the characterization ability of DPUNet and improves the segmentation accuracy of rectal tumors. Experiments illustrate that, compared with the current state-of-the-art results, CSA-DPUNet brings 15.31%, 7.2%, 11.8%, and 9.5% improvement in Dice coefficient, P, R, and F1, respectively, which demonstrates that our proposed CSA-DPUNet is effective for rectal tumor segmentation.

Submitted 5 January, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

arXiv:2009.12157 [pdf, other]

SOUP: Spatial-Temporal Demand Forecasting and Competitive Supply

Authors: Bolong Zheng, Qi Hu, Lingfeng Ming, Jilin Hu, Lu Chen, Kai Zheng, Christian S. Jensen

Abstract: We consider a setting with an evolving set of requests for transportation from an origin to a destination before a deadline and a set of agents capable of servicing the requests. In this setting, an assignment authority is to assign agents to requests such that the average idle time of the agents is minimized. An example is the scheduling of taxis (agents) to meet incoming requests for trips while ensuring that the taxis are empty as little as possible. In this paper, we study the problem of spatial-temporal demand forecasting and competitive supply (SOUP). We address the problem in two steps. First, we build a granular model that provides spatial-temporal predictions of requests. Specifically, we propose a Spatial-Temporal Graph Convolutional Sequential Learning (ST-GCSL) algorithm that predicts the service requests across locations and time slots. Second, we provide means of routing agents to request origins while avoiding competition among the agents. In particular, we develop a demand-aware route planning (DROP) algorithm that considers both the spatial-temporal predictions and the supply-demand state. We report on extensive experiments with real-world and synthetic data that offer insight into the performance of the solution and show that it is capable of outperforming the state-of-the-art proposals.

Submitted 18 January, 2021; v1 submitted 24 September, 2020; originally announced September 2020.

arXiv:2009.09937 [pdf]

doi 10.1109/TBME.2021.3054248

Applying a random projection algorithm to optimize machine learning model for breast lesion classification

Authors: Morteza Heidari, Sivaramakrishnan Lakshmivarahan, Seyedehnafiseh Mirniaharikandehei, Gopichandh Danala, Sai Kiran R. Maryada, Hong Liu, Bin Zheng

Abstract: Machine learning is widely used in developing computer-aided diagnosis (CAD) schemes of medical images. However, CAD usually computes a large number of image features from the targeted regions, which creates the challenge of how to identify a small and optimal feature vector to build robust machine learning models. In this study, we investigate the feasibility of applying a random projection algorithm to build an optimal feature vector from the initially CAD-generated large feature pool and improve the performance of the machine learning model. We assemble a retrospective dataset involving 1,487 cases of mammograms in which 644 cases have confirmed malignant mass lesions and 843 have benign lesions. A CAD scheme is first applied to segment mass regions and initially compute 181 features. Then, support vector machine (SVM) models embedded with several feature dimensionality reduction methods are built to predict the likelihood of lesions being malignant. All SVM models are trained and tested using a leave-one-case-out cross-validation method. SVM generates a likelihood score for each segmented mass region depicted on a one-view mammogram. By fusing the two scores of the same mass depicted on two-view mammograms, a case-based likelihood score is also evaluated. Compared with principal component analysis, nonnegative matrix factorization, and Chi-squared methods, SVM embedded with the random projection algorithm yielded a significantly higher case-based lesion classification performance with an area under the ROC curve of 0.84±0.01 (p<0.02). The study demonstrates that the random projection algorithm is a promising method to generate optimal feature vectors to help improve the performance of machine learning models of medical images.
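For readers unfamiliar with random projection, the following is a minimal sketch of a generic Gaussian random projection, not the authors' exact procedure, which the abstract does not specify. The input width of 181 matches the feature count in the abstract; the target dimension of 10, the seed, and the function names are hypothetical choices for illustration.

```python
import math
import random


def random_projection_matrix(n_features, n_components, seed=0):
    """Build an n_features x n_components matrix of scaled Gaussian entries."""
    rng = random.Random(seed)
    # Scaling by 1/sqrt(k) keeps squared distances unchanged in expectation.
    scale = 1.0 / math.sqrt(n_components)
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(n_components)]
            for _ in range(n_features)]


def project(samples, matrix):
    """Map each sample (a list of n_features floats) to n_components floats."""
    n_components = len(matrix[0])
    return [[sum(x[i] * matrix[i][j] for i in range(len(x)))
             for j in range(n_components)]
            for x in samples]


# Toy data: 3 synthetic samples with 181 features each (not real CAD output).
data_rng = random.Random(42)
samples = [[data_rng.random() for _ in range(181)] for _ in range(3)]
reduced = project(samples, random_projection_matrix(181, 10))
print(len(reduced), len(reduced[0]))  # prints: 3 10
```

The reduced vectors would then be passed to a classifier such as the SVM mentioned in the abstract; that step is omitted here.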

Submitted 9 September, 2020; originally announced September 2020.

Comments: 11 pages, 6 figures

Journal ref: IEEE Transactions on Biomedical Engineering, 2021

arXiv:2009.00675 [pdf]

Applying a random projection algorithm to optimize machine learning model for predicting peritoneal metastasis in gastric cancer patients using CT images

Authors: Seyedehnafiseh Mirniaharikandehei, Morteza Heidari, Gopichandh Danala, Sivaramakrishnan Lakshmivarahan, Bin Zheng

Abstract: Background and Objective: Non-invasively predicting the risk of cancer metastasis before surgery plays an essential role in determining optimal treatment methods for cancer patients (including who can benefit from neoadjuvant chemotherapy). Although developing radiomics-based machine learning (ML) models has attracted broad research interest for this purpose, it often faces the challenge of how to build a high-performing and robust ML model using small and imbalanced image datasets. Methods: In this study, we explore a new approach to build an optimal ML model. A retrospective dataset involving abdominal computed tomography (CT) images acquired from 159 patients diagnosed with gastric cancer is assembled. Among them, 121 cases have peritoneal metastasis (PM), while 38 cases do not have PM. A computer-aided detection (CAD) scheme is first applied to segment primary gastric tumor volumes and initially compute 315 image features. Then, two Gradient Boosting Machine (GBM) models embedded with two different feature dimensionality reduction methods, namely, principal component analysis (PCA) and a random projection algorithm (RPA), and a synthetic minority oversampling technique are built to predict the risk of the patients having PM. All GBM models are trained and tested using a leave-one-case-out cross-validation method. Results: Results show that the GBM embedded with RPA yielded a significantly higher prediction accuracy (71.2%) than using PCA (65.2%) (p<0.05). Conclusions: The study demonstrated that CT images of the primary gastric tumors contain discriminatory information to predict the risk of PM, and RPA is a promising method to generate an optimal feature vector, improving the performance of ML models of medical images.

Submitted 1 September, 2020; originally announced September 2020.

Comments: 24 pages, 7 figures

arXiv:2008.04476 [pdf, ps, other]

Fast Channel Estimation for IRS-Assisted OFDM

Authors: Beixiong Zheng, Changsheng You, Rui Zhang

Abstract: In this letter, we study efficient channel estimation for an intelligent reflecting surface (IRS)-assisted orthogonal frequency division multiplexing (OFDM) system to achieve minimum training time. First, a fast channel estimation scheme with reduced OFDM symbol duration is proposed for arbitrary frequency-selective fading channels. Next, under the typical condition that the IRS-user channel is line-of-sight (LoS) dominant, another fast channel estimation scheme based on the novel concept of sampling-wise IRS reflection variation is proposed. Moreover, the pilot signal and IRS training reflection pattern are jointly optimized for both proposed schemes. Finally, the proposed schemes are compared in terms of training time and channel estimation performance via simulations, as well as against benchmark schemes.

Submitted 10 August, 2020; originally announced August 2020.

Comments: 5 pages, 4 figures

arXiv:2008.02555 [pdf, ps, other]

Reconfigurable Intelligent Surfaces with Reflection Pattern Modulation: Beamforming Design and Performance Analysis

Authors: Shaoe Lin, Beixiong Zheng, George C. Alexandropoulos, Miaowen Wen, Marco Di Renzo, Fangjiong Chen

Abstract: Recent considerations for reconfigurable intelligent surfaces (RISs) assume that RISs can convey information by reflection without the need of transmit radio frequency chains, which, however, is a challenging task. In this paper, we propose an RIS-enhanced multiple-input single-output system with reflection pattern modulation, where the RIS can configure its reflection state for boosting the received signal power via passive beamforming and simultaneously conveying its own information via reflection. We formulate an optimization problem to maximize the average received signal power by jointly optimizing the active beamforming at the access point (AP) and passive beamforming at the RIS for the case where the RIS's state information is statistically known by the AP, and propose a high-quality suboptimal solution based on the alternating optimization technique. We analyze the asymptotic outage probability of the proposed scheme under Rayleigh fading channels, for which a closed-form expression is derived. The achievable rate of the proposed scheme is also investigated for the case where the transmitted symbol is drawn from a finite constellation. Simulation results validate the effectiveness of the proposed scheme and reveal the effect of various system parameters on the achievable rate performance. It is shown that the proposed scheme outperforms the conventional RIS-assisted system without information transfer in terms of achievable rate performance.

Submitted 6 August, 2020; originally announced August 2020.

Comments: 31 pages; 7 figures; under minor revision for an IEEE journal

arXiv:2006.12229 [pdf]

doi 10.1016/j.ijmedinf.2020.104284

Improving performance of CNN to predict likelihood of COVID-19 using chest X-ray images with preprocessing algorithms

Authors: Morteza Heidari, Seyedehnafiseh Mirniaharikandehei, Abolfazl Zargari Khuzani, Gopichandh Danala, Yuchen Qiu, Bin Zheng

Abstract: With the rapid spread of coronavirus disease (COVID-19) worldwide, chest X-ray radiography has also been used to detect COVID-19 infected pneumonia and assess its severity or monitor its prognosis in hospitals due to its low cost, low radiation dose, and wide accessibility. However, how to more accurately and efficiently detect COVID-19 infected pneumonia and distinguish it from other community-acquired pneumonia remains a challenge. To address this challenge, in this study we develop and test a new computer-aided diagnosis (CAD) scheme. It includes several image pre-processing algorithms to remove diaphragms, normalize image contrast-to-noise ratio, and generate three input images, and then links to a transfer learning based convolutional neural network (a VGG16-based CNN model) to classify chest X-ray images into three classes: COVID-19 infected pneumonia, other community-acquired pneumonia, and normal (non-pneumonia) cases. For this purpose, a publicly available dataset of 8,474 chest X-ray images is used, which includes 415 confirmed COVID-19 infected pneumonia, 5,179 community-acquired pneumonia, and 2,880 non-pneumonia cases. The dataset is divided into two subsets with 90% and 10% of the images, respectively, to train and test the CNN-based CAD scheme. The testing results achieve 94.0% overall accuracy in classifying the three classes and 98.6% accuracy in detecting COVID-19 infected cases. Thus, the study demonstrates the feasibility of developing a CAD scheme of chest X-ray images and providing radiologists with useful decision-making support tools for detecting and diagnosing COVID-19 infected pneumonia.

Submitted 11 June, 2020; originally announced June 2020.

Comments: 11 pages, 5 figures, 2 tables

Journal ref: International Journal of Medical Informatics, 104284. 23 Sep. 2020

arXiv:2004.07812 [pdf, other]

doi 10.1007/978-3-030-59416-9_12

Bus Frequency Optimization: When Waiting Time Matters in User Satisfaction

Authors: Songsong Mo, Zhifeng Bao, Baihua Zheng, Zhiyong Peng

Abstract: Reorganizing bus frequency to cater for the actual travel demand can save the cost of the public transport system significantly. Many, if not all, existing studies formulate this as a bus frequency optimization problem which tries to minimize passengers' average waiting time. However, many investigations have confirmed that user satisfaction drops faster as the waiting time increases. Consequently, this paper studies the bus frequency optimization problem considering user satisfaction. Specifically, for the first time to the best of our knowledge, we study how to schedule the buses such that the total number of passengers who could receive their bus services within the waiting time threshold is maximized. We prove that this problem is NP-hard, and present an index-based algorithm with a $(1-1/e)$ approximation ratio. By exploiting the locality property of routes in a bus network, we propose a partition-based greedy method which achieves a $(1-ρ)(1-1/e)$ approximation ratio. Then we propose a progressive partition-based greedy method to further improve the efficiency while achieving a $(1-ρ)(1-1/e-\varepsilon)$ approximation ratio. Experiments on a real city-wide bus dataset in Singapore verify the efficiency, effectiveness, and scalability of our methods.
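A $(1-1/e)$ ratio is the classic guarantee of greedy selection when maximizing a monotone submodular objective under a cardinality constraint. The sketch below shows that generic greedy on a toy coverage objective. It assumes the paper's objective behaves like such a coverage function, and it is not the authors' index-based or partition-based algorithm. The departure names, passenger IDs, and budget are hypothetical.

```python
def greedy_max_coverage(candidates, budget):
    """Pick up to `budget` candidates, each time taking the largest marginal gain.

    `candidates` maps a candidate name to the set of passengers it would serve
    within the waiting-time threshold.
    """
    chosen, covered = [], set()
    remaining = dict(candidates)
    for _ in range(budget):
        # Candidate that serves the most passengers not yet covered.
        best = max(remaining, key=lambda k: len(remaining[k] - covered),
                   default=None)
        if best is None or not (remaining[best] - covered):
            break  # nothing left that adds coverage
        chosen.append(best)
        covered |= remaining.pop(best)
    return chosen, covered


# Toy instance: four possible departures, passengers identified by integers.
departures = {
    "route_a_0800": {1, 2, 3},
    "route_a_0815": {3, 4},
    "route_b_0800": {4, 5, 6, 7},
    "route_b_0830": {1, 7},
}
chosen, covered = greedy_max_coverage(departures, budget=2)
print(chosen, len(covered))
# prints: ['route_b_0800', 'route_a_0800'] 7
```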

Submitted 23 March, 2020; originally announced April 2020.

Journal ref: International Conference on Database Systems for Advanced Applications 2020

arXiv:2003.09669 [pdf, other]

BiCANet: Bi-directional Contextual Aggregating Network for Image Semantic Segmentation

Authors: Quan Zhou, Dechun Cong, Bin Kang, Xiaofu Wu, Baoyu Zheng, Huimin Lu, Longin Jan Latecki

Abstract: Exploring contextual information in convolution neural networks (CNNs) has gained substantial attention in recent years for semantic segmentation. This paper introduces a Bi-directional Contextual Aggregating Network, called BiCANet, for semantic segmentation. Unlike previous approaches that encode context in feature space, BiCANet aggregates contextual cues from a categorical perspective, and mainly consists of three parts: contextual condensed projection block (CCPB), bi-directional context interaction block (BCIB), and multi-scale contextual fusion block (MCFB). More specifically, CCPB learns a category-based mapping through a split-transform-merge architecture, which condenses contextual cues with different receptive fields from intermediate layers. BCIB, on the other hand, employs dense skipped-connections to enhance class-level context exchange. Finally, MCFB integrates multi-scale contextual cues by investigating short- and long-ranged spatial dependencies. To evaluate BiCANet, we have conducted extensive experiments on three semantic segmentation datasets: PASCAL VOC 2012, Cityscapes, and ADE20K. The experimental results demonstrate that BiCANet outperforms recent state-of-the-art networks without any post-processing techniques. In particular, BiCANet achieves mIoU scores of 86.7%, 82.4% and 38.66% on the PASCAL VOC 2012, Cityscapes and ADE20K test sets, respectively.

Submitted 21 March, 2020; originally announced March 2020.

arXiv:1911.03461 [pdf, other]

AIM 2019 Challenge on Image Demoireing: Methods and Results

Authors: Shanxin Yuan, Radu Timofte, Gregory Slabaugh, Ales Leonardis, Bolun Zheng, Xin Ye, Xiang Tian, Yaowu Chen, Xi Cheng, Zhenyong Fu, Jian Yang, Ming Hong, Wenying Lin, Wen** Yang, Yanyun Qu, Hong-Kyu Shin, Joon-Yeon Kim, Sung-Jea Ko, Hang Dong, Yu Guo, Jie Wang, Xuan Ding, Zongyan Han, Sourya Dipta Das, Kuldeep Purohit, et al. (3 additional authors not shown)

Abstract: This paper reviews the first-ever image demoireing challenge that was part of the Advances in Image Manipulation (AIM) workshop, held in conjunction with ICCV 2019. This paper describes the challenge and focuses on the proposed solutions and their results. Demoireing is the difficult task of removing moire patterns from an image to reveal an underlying clean image. A new dataset, called LCDMoire, was created for this challenge and consists of 10,200 synthetically generated image pairs (moire and clean ground truth). The challenge was divided into two tracks. Track 1 targeted fidelity, measuring the ability of demoire methods to obtain a moire-free image compared with the ground truth, while Track 2 examined the perceptual quality of demoire methods. The tracks had 60 and 39 registered participants, respectively. A total of eight teams competed in the final testing phase. The entries span the current state-of-the-art in the image demoireing problem.

Submitted 8 November, 2019; originally announced November 2019.

Comments: arXiv admin note: text overlap with arXiv:1911.02498

arXiv:1911.02750 [pdf, other]

Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework

Authors: Mingbo Ma, Baigong Zheng, Kaibo Liu, Renjie Zheng, Hairong Liu, Kainan Peng, Kenneth Church, Liang Huang

Abstract: Text-to-speech synthesis (TTS) has witnessed rapid progress in recent years, where neural methods became capable of producing audios with high naturalness. However, these efforts still suffer from two types of latencies: (a) the computational latency (synthesizing time), which grows linearly with the sentence length even with parallel approaches, and (b) the input latency in scenarios where the input text is incrementally generated (such as in simultaneous translation, dialog generation, and assistive technologies). To reduce these latencies, we devise the first neural incremental TTS approach based on the recently proposed prefix-to-prefix framework. We synthesize speech in an online fashion, playing a segment of audio while generating the next, resulting in an $O(1)$ rather than $O(n)$ latency.

Submitted 6 October, 2020; v1 submitted 6 November, 2019; originally announced November 2019.

Comments: Findings of EMNLP 2020

arXiv:1909.03272 [pdf, ps, other]

doi 10.1109/LWC.2019.2961357

Intelligent Reflecting Surface-Enhanced OFDM: Channel Estimation and Reflection Optimization

Authors: Beixiong Zheng, Rui Zhang

Abstract: In the intelligent reflecting surface (IRS)-enhanced wireless communication system, channel state information (CSI) is of paramount importance for achieving the passive beamforming gain of the IRS; acquiring it, however, is practically challenging due to the IRS's massive number of passive elements without transmitting/receiving capabilities. In this letter, we propose a practical transmission protocol to execute channel estimation and reflection optimization successively for an IRS-enhanced orthogonal frequency division multiplexing (OFDM) system. Under the unit-modulus constraint, a novel reflection pattern at the IRS is designed to aid the channel estimation at the access point (AP) based on the received pilot signals from the user, for which the channel estimation error is derived in closed form. With the estimated CSI, the reflection coefficients are then optimized by a low-complexity algorithm based on the resolved strongest signal path in the time domain. Simulation results corroborate the effectiveness of the proposed channel estimation and reflection optimization methods.

Submitted 29 January, 2020; v1 submitted 7 September, 2019; originally announced September 2019.

Comments: Early Access in IEEE Wireless Communications Letters. Please refer to "https://ieeexplore.ieee.org/document/8937491/". In this work, we propose a practical transmission protocol to execute optimal channel estimation and reflection optimization successively for an IRS-enhanced OFDM system, which is also applicable to the narrow-band IRS system

Journal ref: IEEE Wireless Communications Letters, 2019

Showing 1–48 of 48 results for author: Zheng, B