Showing 1–50 of 89 results for author: Jiang, H

Searching in archive eess.
  1. arXiv:2406.13165  [pdf, other]

    eess.IV cs.AI cs.CV cs.RO

    Cardiac Copilot: Automatic Probe Guidance for Echocardiography with World Model

    Authors: Haojun Jiang, Zhenguo Sun, Ning Jia, Meng Li, Yu Sun, Shaqi Luo, Shiji Song, Gao Huang

    Abstract: Echocardiography is the only technique capable of real-time imaging of the heart and is vital for diagnosing the majority of cardiac diseases. However, there is a severe shortage of experienced cardiac sonographers, due to the heart's complex structure and significant operational challenges. To mitigate this situation, we present a Cardiac Copilot system capable of providing real-time probe moveme…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Early Accepted by MICCAI 2024

  2. arXiv:2405.14802  [pdf, other]

    eess.IV cs.CV

    Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation

    Authors: Hongxu Jiang, Muhammad Imran, Linhai Ma, Teng Zhang, Yuyin Zhou, Muxuan Liang, Kuang Gong, Wei Shao

    Abstract: Denoising diffusion probabilistic models (DDPMs) have achieved unprecedented success in computer vision. However, they remain underutilized in medical imaging, a field crucial for disease diagnosis and treatment planning. This is primarily due to the high computational cost associated with (1) the use of large number of time steps (e.g., 1,000) in diffusion processes and (2) the increased dimensio…

    Submitted 23 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2405.00542  [pdf, other]

    eess.IV cs.CV

    UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement

    Authors: Ruiquan Ge, Zhaojie Fang, Pengxue Wei, Zhanghao Chen, Hongyang Jiang, Ahmed Elazab, Wangting Li, Xiang Wan, Shaochong Zhang, Changmiao Wang

    Abstract: Fundus photography, in combination with the ultra-wide-angle fundus (UWF) techniques, becomes an indispensable diagnostic tool in clinical settings by offering a more comprehensive view of the retina. Nonetheless, UWF fluorescein angiography (UWF-FA) necessitates the administration of a fluorescent dye via injection into the patient's hand or elbow unlike UWF scanning laser ophthalmoscopy (UWF-SLO…

    Submitted 1 May, 2024; originally announced May 2024.

  4. arXiv:2405.00130  [pdf, other]

    eess.IV cs.CV cs.LG

    A Flexible 2.5D Medical Image Segmentation Approach with In-Slice and Cross-Slice Attention

    Authors: Amarjeet Kumar, Hongxu Jiang, Muhammad Imran, Cyndi Valdes, Gabriela Leon, Dahyun Kang, Parvathi Nataraj, Yuyin Zhou, Michael D. Weiss, Wei Shao

    Abstract: Deep learning has become the de facto method for medical image segmentation, with 3D segmentation models excelling in capturing complex 3D structures and 2D models offering high computational efficiency. However, segmenting 2.5D images, which have high in-plane but low through-plane resolution, is a relatively unexplored challenge. While applying 2D models to individual slices of a 2.5D image is f…

    Submitted 30 April, 2024; originally announced May 2024.

  5. arXiv:2403.12781  [pdf, other]

    eess.SP

    Large-Scale RIS Enabled Air-Ground Channels: Near-Field Modeling and Analysis

    Authors: Hao Jiang, Wangqi Shi, Zaichen Zhang, Cunhua Pan, Qingqing Wu, Feng Shu, Ruiqi Liu, Jiangzhou Wang

    Abstract: Existing works mainly rely on the far-field planar-wave-based channel model to assess the performance of reconfigurable intelligent surface (RIS)-enabled wireless communication systems. However, when the transmitter and receiver are in near-field ranges, this will result in relatively low computing accuracy. To tackle this challenge, we initially develop an analytical framework for sub-array parti…

    Submitted 19 March, 2024; originally announced March 2024.

  6. Human Activity Recognition with Low-Resolution Infrared Array Sensor Using Semi-supervised Cross-domain Neural Networks for Indoor Environment

    Authors: Cunyi Yin, Xiren Miao, Jing Chen, Hao Jiang, Deying Chen, Yixuan Tong, Shaocong Zheng

    Abstract: Low-resolution infrared-based human activity recognition (HAR) attracted enormous interests due to its low-cost and private. In this paper, a novel semi-supervised crossdomain neural network (SCDNN) based on 8 × 8 low-resolution infrared sensor is proposed for accurately identifying human activity despite changes in the environment at a low-cost. The SCDNN consists of feature extractor, dom…

    Submitted 4 March, 2024; originally announced March 2024.

  7. PowerSkel: A Device-Free Framework Using CSI Signal for Human Skeleton Estimation in Power Station

    Authors: Cunyi Yin, Xiren Miao, Jing Chen, Hao Jiang, Jianfei Yang, Yunjiao Zhou, Min Wu, Zhenghua Chen

    Abstract: Safety monitoring of power operations in power stations is crucial for preventing accidents and ensuring stable power supply. However, conventional methods such as wearable devices and video surveillance have limitations such as high cost, dependence on light, and visual blind spots. WiFi-based human pose estimation is a suitable method for monitoring power operations due to its low cost, device-f…

    Submitted 4 March, 2024; originally announced March 2024.

  8. arXiv:2402.15634  [pdf, other]

    cs.IT eess.SP

    Sense-Then-Train: A Novel Beam Training Design for Near-Field MIMO Systems

    Authors: Hao Jiang, Zhaolin Wang, Yuanwei Liu

    Abstract: A novel sense-then-train (STT) scheme is proposed for beam training in near-field multiple-input multiple-output (MIMO) systems. Compared to conventional codebook-based schemes, the proposed STT scheme is capable of not only addressing the complex spherical-wave propagation but also effectively exploiting the additional degrees-of-freedoms (DoFs). The STT scheme is tailored for both single-beam an…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Submitted to IEEE

  9. arXiv:2401.05363  [pdf, other]

    eess.SP cs.LG

    Generalizable Sleep Staging via Multi-Level Domain Alignment

    Authors: Jiquan Wang, Sha Zhao, Haiteng Jiang, Shijian Li, Tao Li, Gang Pan

    Abstract: Automatic sleep staging is essential for sleep assessment and disorder diagnosis. Most existing methods depend on one specific dataset and are limited to be generalized to other unseen datasets, for which the training data and testing data are from the same dataset. In this paper, we introduce domain generalization into automatic sleep staging and propose the task of generalizable sleep staging wh…

    Submitted 27 January, 2024; v1 submitted 13 December, 2023; originally announced January 2024.

    Comments: Accepted by the Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI-24)

  10. arXiv:2401.03626  [pdf, other]

    eess.SP

    Hybrid Vector Message Passing for Generalized Bilinear Factorization

    Authors: Hao Jiang, Xiaojun Yuan, Qinghua Guo

    Abstract: In this paper, we propose a new message passing algorithm that utilizes hybrid vector message passing (HVMP) to solve the generalized bilinear factorization (GBF) problem. The proposed GBF-HVMP algorithm integrates expectation propagation (EP) and variational message passing (VMP) via variational free energy minimization, yielding tractable Gaussian messages. Furthermore, GBF-HVMP enables vector/m…

    Submitted 7 January, 2024; originally announced January 2024.

  11. arXiv:2312.07911  [pdf]

    eess.IV cs.CV

    Projective Parallel Single-Pixel Imaging: 3D Structured Light Scanning Under Global Illumination

    Authors: Yuxi Li, Hongzhi Jiang, Huijie Zhao, Xudong Li

    Abstract: We present projective parallel single-pixel imaging (pPSI), a 3D photography method that provides a robust and efficient way to analyze the light transport behavior and enables separation of light effect due to global illumination, thereby achieving 3D structured light scanning under global illumination. The light transport behavior is described by the light transport coefficients (LTC), which con…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 21 pages, 13 figures

  12. arXiv:2312.05256  [pdf, other]

    eess.IV cs.AI

    Holistic Evaluation of GPT-4V for Biomedical Imaging

    Authors: Zhengliang Liu, Hanqi Jiang, Tianyang Zhong, Zihao Wu, Chong Ma, Yiwei Li, Xiaowei Yu, Yutong Zhang, Yi Pan, Peng Shu, Yanjun Lyu, Lu Zhang, Junjie Yao, Peixin Dong, Chao Cao, Zhenxiang Xiao, Jiaqi Wang, Huan Zhao, Shaochen Xu, Yaonai Wei, Jingyuan Chen, Haixing Dai, Peilong Wang, Hao He, Zewei Wang, et al. (25 additional authors not shown)

    Abstract: In this paper, we present a large-scale evaluation probing GPT-4V's capabilities and limitations for biomedical image analysis. GPT-4V represents a breakthrough in artificial general intelligence (AGI) for computer vision, with applications in the biomedical domain. We assess GPT-4V's performance across 16 medical imaging categories, including radiology, oncology, ophthalmology, pathology, and mor…

    Submitted 10 November, 2023; originally announced December 2023.

  13. arXiv:2311.15292  [pdf, other]

    cs.IT eess.SP

    Active-Sensing-Based Beam Alignment for Near Field MIMO Communications

    Authors: Hao Jiang, Zhaolin Wang, Yuanwei Liu

    Abstract: An active-sensing-based learning algorithm is proposed to solve the near-field beam alignment problem with the aid of wavenumber-domain transform matrices (WTMs). Specifically, WTMs can transform the antenna-domain channel into a sparse representation in the wavenumber domain. The dimensions of WTMs can be further reduced by exploiting the dominance of line-of-sight (LoS) links. By employing these…

    Submitted 26 November, 2023; originally announced November 2023.

  14. arXiv:2311.08075  [pdf, ps, other]

    eess.IV cs.CV cs.HC

    GlanceSeg: Real-time microaneurysm lesion segmentation with gaze-map-guided foundation model for early detection of diabetic retinopathy

    Authors: Hongyang Jiang, Mengdi Gao, Zirong Liu, Chen Tang, Xiaoqing Zhang, Shuai Jiang, Wu Yuan, Jiang Liu

    Abstract: Early-stage diabetic retinopathy (DR) presents challenges in clinical diagnosis due to inconspicuous and minute microangioma lesions, resulting in limited research in this area. Additionally, the potential of emerging foundation models, such as the segment anything model (SAM), in medical scenarios remains rarely explored. In this work, we propose a human-in-the-loop, label-free early DR diagnosis…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 12 pages, 10 figures

  15. arXiv:2311.07096  [pdf, other]

    cs.IT eess.SP

    Optimal Configuration of Reconfigurable Intelligent Surfaces with Arbitrary Discrete Phase Shifts

    Authors: Seyedkhashayar Hashemi, Hai Jiang, Masoud Ardakani

    Abstract: We address the reflection optimization problem for a reconfigurable intelligent surface (RIS), where the RIS elements feature a set of non-uniformly spaced discrete phase shifts. This is motivated by the actual behavior of practical RIS elements, where it is shown that a uniform phase shift assumption is not realistic. A problem is formulated to find the optimal reflection amplitudes and reflection…

    Submitted 13 November, 2023; originally announced November 2023.

  16. arXiv:2309.16075  [pdf, other]

    eess.SY

    A review of variable-pitch propellers and their control strategies in aerospace systems

    Authors: Hanjie Jiang, Ye Zhou, Hann Woei Ho

    Abstract: The relentless pursuit of aircraft flight efficiency has thrust variable-pitch propeller technology into the forefront of aviation innovation. This technology, rooted in the ancient power unit of propellers, has found renewed significance, particularly in the realms of unmanned aerial vehicles and urban air mobility. This underscores the profound interplay between visionary aviation concepts and t…

    Submitted 27 September, 2023; originally announced September 2023.

  17. arXiv:2309.12552  [pdf, other]

    eess.SY

    Adaptive Model Predictive Control for Engine-Driven Ducted Fan Lift Systems using an Associated Linear Parameter Varying Model

    Authors: Hanjie Jiang, Ye Zhou, Hann Woei Ho, Wenjie Hu

    Abstract: Ducted fan lift systems (DFLSs) powered by two-stroke aviation piston engines present a challenging control problem due to their complex multivariable dynamics. Current controllers for these systems typically rely on proportional-integral algorithms combined with data tables, which rely on accurate models and are not adaptive to handle time-varying dynamics or system uncertainties. This paper prop…

    Submitted 21 September, 2023; originally announced September 2023.

  18. arXiv:2308.08313  [pdf, other]

    eess.IV cs.CV

    ECPC-IDS:A benchmark endometrail cancer PET/CT image dataset for evaluation of semantic segmentation and detection of hypermetabolic regions

    Authors: Dechao Tang, Tianming Du, Deguo Ma, Zhiyu Ma, Hongzan Sun, Marcin Grzegorzek, Huiyan Jiang, Chen Li

    Abstract: Endometrial cancer is one of the most common tumors in the female reproductive system and is the third most common gynecological malignancy that causes death after ovarian and cervical cancer. Early diagnosis can significantly improve the 5-year survival rate of patients. With the development of artificial intelligence, computer-assisted diagnosis plays an increasingly important role in improving…

    Submitted 11 October, 2023; v1 submitted 16 August, 2023; originally announced August 2023.

    Comments: 14 pages, 6 figures

  19. arXiv:2307.16518  [pdf, other]

    cs.IT eess.SP

    Continuous-Time Channel Prediction Based on Tensor Neural Ordinary Differential Equation

    Authors: Mingyao Cui, Hao Jiang, Yuhao Chen, Yang Du, Linglong Dai

    Abstract: Channel prediction is critical to address the channel aging issue in mobile scenarios. Existing channel prediction techniques are mainly designed for discrete channel prediction, which can only predict the future channel in a fixed time slot per frame, while the other intra-frame channels are usually recovered by interpolation. However, these approaches suffer from a serious interpolation loss, es…

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: A tensor neural ODE based method is proposed to predict continuous-time wireless channels

  20. arXiv:2307.07995  [pdf, other]

    eess.SP

    Channel Modeling for Heterogeneous Vehicular ISAC System with Shared Clusters

    Authors: Zaichen Zhang, Yingmeng Ge, Haibo Wang, Hao Jiang, Liang Wu, Ziyang Zhang

    Abstract: In this paper, we consider the channel modeling of a heterogeneous vehicular integrated sensing and communication (ISAC) system, where a dual-functional multi-antenna base station (BS) intends to communicate with a multi-antenna vehicular receiver (MR) and sense the surrounding environments simultaneously. The time-varying complex channel impulse responses (CIRs) of the sensing and communication c…

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: 6 pages, 3 figures. This work has been submitted to IEEE for possible publication

  21. arXiv:2307.01665  [pdf]

    eess.SP

    Multicarrier Modulation-Based Digital Radio-over-Fibre System Achieving Unequal Bit Protection with Over 10 dB SNR Gain

    Authors: Yicheng Xu, Yixiao Zhu, Xiaobo Zeng, Mengfan Fu, Hexun Jiang, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: We propose a multicarrier modulation-based digital radio-over-fibre system achieving unequal bit protection by bit and power allocation for subcarriers. A theoretical SNR gain of 16.1 dB is obtained in the AWGN channel and the simulation results show a 13.5 dB gain in the bandwidth-limited case.

    Submitted 4 July, 2023; originally announced July 2023.

  22. arXiv:2306.04202  [pdf, other]

    cs.MM cs.CV eess.IV

    Video Compression with Arbitrary Rescaling Network

    Authors: Mengxi Guo, Shijie Zhao, Hao Jiang, Junlin Li, Li Zhang

    Abstract: Most video platforms provide video streaming services with different qualities, and the quality of the services is usually adjusted by the resolution of the videos. So high-resolution videos need to be downsampled for compression. In order to solve the problem of video coding at different resolutions, we propose a rate-guided arbitrary rescaling network (RARN) for video resizing before encoding. T…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted as a one-page poster by 2023 Data Compression Conference (DCC). This is the full paper

  23. arXiv:2306.02682  [pdf, other]

    cs.CL eess.AS

    End-to-End Word-Level Pronunciation Assessment with MASK Pre-training

    Authors: Yukang Liang, Kaitao Song, Shaoguang Mao, Huiqiang Jiang, Luna Qiu, Yuqing Yang, Dongsheng Li, Linli Xu, Lili Qiu

    Abstract: Pronunciation assessment is a major challenge in the computer-aided pronunciation training system, especially at the word (phoneme)-level. To obtain word (phoneme)-level scores, current methods usually rely on aligning components to obtain acoustic features of each word (phoneme), which limits the performance of assessment to the accuracy of alignments. Therefore, to address this problem, we propo…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by InterSpeech 2023

  24. arXiv:2305.19956  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    MicroSegNet: A Deep Learning Approach for Prostate Segmentation on Micro-Ultrasound Images

    Authors: Hongxu Jiang, Muhammad Imran, Preethika Muralidharan, Anjali Patel, Jake Pensa, Muxuan Liang, Tarik Benidir, Joseph R. Grajo, Jason P. Joseph, Russell Terry, John Michael DiBianco, Li-Ming Su, Yuyin Zhou, Wayne G. Brisbane, Wei Shao

    Abstract: Micro-ultrasound (micro-US) is a novel 29-MHz ultrasound technique that provides 3-4 times higher resolution than traditional ultrasound, potentially enabling low-cost, accurate diagnosis of prostate cancer. Accurate prostate segmentation is crucial for prostate volume measurement, cancer diagnosis, prostate biopsy, and treatment planning. However, prostate segmentation on micro-US is challenging…

    Submitted 25 January, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Computerized Medical Imaging and Graphics (2024): 102326

  25. arXiv:2305.16049  [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    CN-Celeb-AV: A Multi-Genre Audio-Visual Dataset for Person Recognition

    Authors: Lantian Li, Xiaolou Li, Haoyu Jiang, Chen Chen, Ruihai Hou, Dong Wang

    Abstract: Audio-visual person recognition (AVPR) has received extensive attention. However, most datasets used for AVPR research so far are collected in constrained environments, and thus cannot reflect the true performance of AVPR systems in real-world scenarios. To meet the request for research on AVPR in unconstrained conditions, this paper presents a multi-genre AVPR dataset collected 'in the wild', nam…

    Submitted 28 July, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: INTERSPEECH 2023

  26. arXiv:2305.14997  [pdf, other]

    eess.SP

    3GPP-Like GBSM THz Channel Modeling for Indoor Office and Urban Microcellular Scenarios

    Authors: Zhaowei Chang, Jianhua Zhang, Pan Tang, Lei Tian, Hao Jiang, Ximan Liu, Guangyi Liu

    Abstract: Terahertz (THz) communication is envisioned as one of the possible technologies for the sixth-generation (6G) communication system due to its rich spectrum. To evaluate the performance of THz communication, it is essential to propose THz channel models within the common framework of the geometry-based stochastic model (GBSM) in the 3rd Generation Partnership Project (3GPP). This paper focuses on 3…

    Submitted 22 April, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  27. arXiv:2304.12719  [pdf, ps, other]

    eess.IV cs.CV cs.LG

    Eye tracking guided deep multiple instance learning with dual cross-attention for fundus disease detection

    Authors: Hongyang Jiang, Jingqi Huang, Chen Tang, Xiaoqing Zhang, Mengdi Gao, Jiang Liu

    Abstract: Deep neural networks (DNNs) have promoted the development of computer aided diagnosis (CAD) systems for fundus diseases, helping ophthalmologists reduce missed diagnosis and misdiagnosis rate. However, the majority of CAD systems are data-driven but lack of medical prior knowledge which can be performance-friendly. In this regard, we innovatively proposed a human-in-the-loop (HITL) CAD system by l…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 10 pages, 9 figures

    MSC Class: none

  28. arXiv:2303.16024  [pdf, other]

    cs.CV cs.SD eess.AS

    Egocentric Auditory Attention Localization in Conversations

    Authors: Fiona Ryan, Hao Jiang, Abhinav Shukla, James M. Rehg, Vamsi Krishna Ithapu

    Abstract: In a noisy conversation environment such as a dinner party, people often exhibit selective auditory attention, or the ability to focus on a particular speaker while tuning out others. Recognizing who somebody is listening to in a conversation is essential for developing technologies that can understand social behavior and devices that can augment human hearing by amplifying particular sound source…

    Submitted 28 March, 2023; originally announced March 2023.

  29. arXiv:2302.08650  [pdf, other]

    cs.SD cs.CV cs.LG eess.AS

    Gaussian-smoothed Imbalance Data Improves Speech Emotion Recognition

    Authors: Xuefeng Liang, Hexin Jiang, Wenxin Xu, Ying Zhou

    Abstract: In speech emotion recognition tasks, models learn emotional representations from datasets. We find the data distribution in the IEMOCAP dataset is very imbalanced, which may harm models to learn a better representation. To address this issue, we propose a novel Pairwise-emotion Data Distribution Smoothing (PDDS) method. PDDS considers that the distribution of emotional data should be smooth in rea…

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 5 pages

  30. arXiv:2301.10624  [pdf, ps, other]

    cs.IT eess.SP

    Energy-Delay Tradeoff in Helper-Assisted NOMA-MEC Systems: A Four-Sided Matching Algorithm

    Authors: Mengmeng Ren, Long Yang, Hai Jiang, Jian Chen, Yuchen Zhou

    Abstract: This paper designs a helper-assisted resource allocation strategy in non-orthogonal multiple access (NOMA)-enabled mobile edge computing (MEC) systems, in order to guarantee the quality of service (QoS) of the energy/delay-sensitive user equipments (UEs). To achieve a tradeoff between the energy consumption and the delay, we introduce a novel performance metric, called energy-delay tradeoff…

    Submitted 25 January, 2023; originally announced January 2023.

  31. arXiv:2301.02184  [pdf, other]

    cs.CV cs.LG cs.SD eess.AS

    Chat2Map: Efficient Scene Mapping from Multi-Ego Conversations

    Authors: Sagnik Majumder, Hao Jiang, Pierre Moulon, Ethan Henderson, Paul Calamia, Kristen Grauman, Vamsi Krishna Ithapu

    Abstract: Can conversational videos captured from multiple egocentric viewpoints reveal the map of a scene in a cost-efficient way? We seek to answer this question by proposing a new problem: efficiently building the map of a previously unseen 3D environment by exploiting shared information in the egocentric audio-visual observations of participants in a natural conversation. Our hypothesis is that as multi…

    Submitted 20 April, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: Accepted to CVPR 2023

  32. arXiv:2212.08653  [pdf, other]

    cs.CV eess.IV

    Attentive Mask CLIP

    Authors: Yifan Yang, Weiquan Huang, Yixuan Wei, Houwen Peng, Xinyang Jiang, Huiqiang Jiang, Fangyun Wei, Yin Wang, Han Hu, Lili Qiu, Yuqing Yang

    Abstract: Image token removal is an efficient augmentation strategy for reducing the cost of computing image features. However, this efficient augmentation strategy has been found to adversely affect the accuracy of CLIP-based training. We hypothesize that removing a large portion of image tokens may improperly discard the semantic content associated with a given text description, thus constituting an incor…

    Submitted 9 October, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 2771-2781

  33. Rate-Splitting Multiple Access for Uplink Massive MIMO With Electromagnetic Exposure Constraints

    Authors: Hanyu Jiang, Li You, Ahmed Elzanaty, Jue Wang, Wenjin Wang, Xiqi Gao, Mohamed-Slim Alouini

    Abstract: Over the past few years, the prevalence of wireless devices has become one of the essential sources of electromagnetic (EM) radiation to the public. Facing with the swift development of wireless communications, people are skeptical about the risks of long-term exposure to EM radiation. As EM exposure is required to be restricted at user terminals, it is inefficient to blindly decrease the transmit…

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: to appear in IEEE Journal on Selected Areas in Communications

    Journal ref: IEEE Journal on Selected Areas in Communications, vol. 41, no. 5, pp. 1383-1397, May 2023

  34. arXiv:2211.09332  [pdf]

    eess.SY cs.RO

    iNavFIter-M: Matrix Formulation of Functional Iteration for Inertial Navigation Computation

    Authors: Hongyan Jiang, Maoran Zhu, Yanyan Fu, Yuanxin Wu

    Abstract: The acquisition of attitude, velocity, and position is an essential task in the field of inertial navigation, achieved by integrating the measurements from inertial sensors. Recently, the ultra-precision inertial navigation computation has been tackled by the functional iteration approach (iNavFIter) that drives the non-commutativity errors almost to the computer truncation error level. This paper…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 30 pages, 7 figures

  35. arXiv:2209.15351  [pdf]

    cs.NI eess.SP

    Efficient Ambient LoRa Backscatter with On-Off Keying Modulation

    Authors: Xiuzhen Guo, Longfei Shangguan, Yuan He, Jia Zhang, Haotian Jiang, Awais Ahmad Siddiqi, Yunhao Liu

    Abstract: Backscatter communication holds potential for ubiquitous and low-cost connectivity among low-power IoT devices. To avoid interference between the carrier signal and the backscatter signal, recent works propose a frequency-shifting technique to separate these two signals in the frequency domain. Such proposals, however, have to occupy the precious wireless spectrum that is already overcrowded, and…

    Submitted 30 September, 2022; originally announced September 2022.

  36. arXiv:2209.15348  [pdf]

    cs.NI eess.SP

    Saiyan: Design and Implementation of a Low-power Demodulator for LoRa Backscatter Systems

    Authors: Xiuzhen Guo, Longfei Shangguan, Yuan He, Nan Jing, Jiacheng Zhang, Haotian Jiang, Yunhao Liu

    Abstract: The radio range of backscatter systems continues growing as new wireless communication primitives are continuously invented. Nevertheless, both the bit error rate and the packet loss rate of backscatter signals increase rapidly with the radio range, thereby necessitating the cooperation between the access point and the backscatter tags through a feedback loop. Unfortunately, the low-power nature o…

    Submitted 30 September, 2022; originally announced September 2022.

  37. arXiv:2209.06779  [pdf, ps, other]

    cs.RO eess.SY

    Efficient Planar Pose Estimation via UWB Measurements

    Authors: Haodong Jiang, Wentao Wang, Yuan Shen, Xinghan Li, Xiaoqiang Ren, Biqiang Mu, Junfeng Wu

    Abstract: State estimation is an essential part of autonomous systems. Integrating the Ultra-Wideband (UWB) technique has been shown to correct the long-term estimation drift and bypass the complexity of loop closure detection. However, few works on robotics adopt UWB as a stand-alone state estimation solution. The primary purpose of this work is to investigate planar pose estimation using only UWB range mea…

    Submitted 27 February, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Update the content and improve consistency with the ICRA version

  38. arXiv:2208.05122  [pdf, other]

    eess.AS

    Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech

    Authors: Kaitao Song, Teng Wan, Bixia Wang, Huiqiang Jiang, Luna Qiu, Jiahang Xu, Liping Jiang, Qun Lou, Yuqing Yang, Dongsheng Li, Xudong Wang, Lili Qiu

    Abstract: Hypernasality is an abnormal resonance in human speech production, especially in patients with craniofacial anomalies such as cleft palate. In clinical application, hypernasality estimation is crucial in cleft palate diagnosis, as its results determine the subsequent surgery and additional speech therapy. Therefore, designing an automatic hypernasality assessment method will facilitate speech-lang…

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted by InterSpeech 2022

  39. arXiv:2207.05706  [pdf]

    eess.SP physics.optics

    Optical Field Recovery in Jones Space

    Authors: Qi Wu, Yixiao Zhu, Hexun Jiang, Qunbi Zhuge, Weisheng Hu

    Abstract: Optical full-field recovery makes it possible to compensate for fiber impairments such as chromatic dispersion and polarization mode dispersion (PMD) in the digital signal processing. For cost-sensitive short-reach optical networks, some advanced single-polarization (SP) optical field recovery schemes are recently proposed to avoid chromatic dispersion-induced power fading effect, and improve the…

    Submitted 13 July, 2022; v1 submitted 22 June, 2022; originally announced July 2022.

    Comments: 8 pages and 9 figures

  40. arXiv:2207.00215   

    eess.IV cs.CV

    Polarized Color Image Denoising using Pocoformer

    Authors: Zhuoxiao Li, Haiyang Jiang, Yinqiang Zheng

    Abstract: Polarized color photography provides both visual textures and object surficial information in one single snapshot. However, the use of the directional polarizing filter array causes extremely lower photon count and SNR compared to conventional color imaging. Thus, the feature essentially leads to unpleasant noisy images and destroys polarization analysis performance. It is a challenge for traditio…

    Submitted 1 March, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: New version is accepted by CVPR 2023 and great modifications are taken

  41. arXiv:2206.04992  [pdf, other]

    cs.IT eess.SP

    Artificial Intelligence Enabled NOMA Towards Next Generation Multiple Access

    Authors: Xiaoxia Xu, Yuanwei Liu, Xidong Mu, Qimei Chen, Hao Jiang, Zhiguo Ding

    Abstract: This article focuses on the application of artificial intelligence (AI) in non-orthogonal multiple-access (NOMA), which aims to achieve automated, adaptive, and high-efficiency multi-user communications towards next generation multiple access (NGMA). First, the limitations of current scenario-specific multiple-antenna NOMA schemes are discussed, and the importance of AI for NGMA is highlighted. Th…

    Submitted 13 December, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: This article has been accepted by IEEE Wireless Communications Magazine

  42. arXiv:2206.04186  [pdf, other]

    cs.LG eess.SP

    Reinforced Inverse Scattering

    Authors: Hanyang Jiang, Yuehaw Khoo, Haizhao Yang

    Abstract: Inverse wave scattering aims at determining the properties of an object using data on how the object scatters incoming waves. In order to collect information, sensors are put in different locations to send and receive waves from each other. The choice of sensor positions and incident wave frequencies determines the reconstruction quality of scatterer properties. This paper introduces reinforcement…

    Submitted 2 November, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

    MSC Class: 68Txx; 49MXX; 65N21

  43. arXiv:2206.01408  [pdf, other]

    cs.CV cs.LG eess.IV

    MetaLR: Meta-tuning of Learning Rates for Transfer Learning in Medical Imaging

    Authors: Yixiong Chen, Li Liu, Jingxian Li, Hua Jiang, Chris Ding, Zongwei Zhou

    Abstract: In medical image analysis, transfer learning is a powerful method for deep neural networks (DNNs) to generalize well on limited medical data. Prior efforts have focused on developing pre-training algorithms on domains such as lung ultrasound, chest X-ray, and liver CT to bridge domain gaps. However, we find that model fine-tuning also plays a crucial role in adapting medical knowledge to target ta…

    Submitted 29 May, 2023; v1 submitted 3 June, 2022; originally announced June 2022.

    Comments: MICCAI 2023

  44. Hybrid RIS and DMA Assisted Multiuser MIMO Uplink Transmission With Electromagnetic Exposure Constraints

    Authors: Hanyu Jiang, Li You, Jue Wang, Wenjin Wang, Xiqi Gao

    Abstract: In the fifth-generation and beyond era, reconfigurable intelligent surface (RIS) and dynamic metasurface antennas (DMAs) are emerging metamaterials keeping up with the demand for high-quality wireless communication services, which promote the diversification of portable wireless terminals. However, along with the rapid expansion of wireless devices, the electromagnetic (EM) radiation increases unc…

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: 14 pages, 6 figures

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, vol. 16, no. 5, pp. 1055-1069, Aug. 2022

  45. arXiv:2205.04120  [pdf, other]

    cs.SD cs.CL eess.AS

    Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech

    Authors: Yang Li, Cheng Yu, Guangzhi Sun, Hua Jiang, Fanglei Sun, Weiqin Zu, Ying Wen, Yang Yang, Jun Wang

    Abstract: Modelling prosody variation is critical for synthesizing natural and expressive speech in end-to-end text-to-speech (TTS) systems. In this paper, a cross-utterance conditional VAE (CUC-VAE) is proposed to estimate a posterior probability distribution of the latent prosody features for each phoneme by conditioning on acoustic features, speaker information, and text features obtained from both past…

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: ACL 2022 camera ready

  46. arXiv:2204.11840  [pdf, other]

    cs.LG cs.AI eess.SP q-bio.NC

    Dynamic Ensemble Bayesian Filter for Robust Control of a Human Brain-machine Interface

    Authors: Yu Qi, Xinyun Zhu, Kedi Xu, Feixiao Ren, Hongjie Jiang, Junming Zhu, Jianmin Zhang, Gang Pan, Yueming Wang

    Abstract: Objective: Brain-machine interfaces (BMIs) aim to provide direct brain control of devices such as prostheses and computer cursors, which have demonstrated great potential for mobility restoration. One major limitation of current BMIs lies in the unstable performance in online control due to the variability of neural signals, which seriously hinders the clinical availability of BMIs. Method: To dea…

    Submitted 22 April, 2022; originally announced April 2022.

  47. arXiv:2204.02839  [pdf]

    eess.IV cs.CV

    CCAT-NET: A Novel Transformer Based Semi-supervised Framework for Covid-19 Lung Lesion Segmentation

    Authors: Mingyang Liu, Li Xiao, Huiqin Jiang, Qing He

    Abstract: The spread of the novel coronavirus disease 2019 (COVID-19) has claimed millions of lives. Automatic segmentation of lesions from CT images can assist doctors with screening, treatment, and monitoring. However, accurate segmentation of lesions from CT images can be very challenging due to data and model limitations. Recently, Transformer-based networks have attracted a lot of attention in the area…

    Submitted 6 April, 2022; originally announced April 2022.

  48. arXiv:2202.11279  [pdf]

    cs.CV eess.IV

    An End-to-End Cascaded Image Deraining and Object Detection Neural Network

    Authors: Kaige Wang, Tianming Wang, Jianchuang Qu, Huatao Jiang, Qing Li, Lin Chang

    Abstract: While the deep learning-based image deraining methods have made great progress in recent years, there are two major shortcomings in their application in real-world situations. Firstly, the gap between the low-level vision task represented by rain removal and the high-level vision task represented by object detection is significant, and the low-level vision task can hardly contribute to the high-le…

    Submitted 22 February, 2022; originally announced February 2022.

  49. arXiv:2202.05126  [pdf, other]

    eess.IV cs.CV cs.LG

    Deep Learning for Computational Cytology: A Survey

    Authors: Hao Jiang, Yanning Zhou, Yi Lin, Ronald CK Chan, Jiang Liu, Hao Chen

    Abstract: Computational cytology is a critical, rapid-developing, yet challenging topic in the field of medical image computing which analyzes the digitized cytology image by computer-aided technologies for cancer screening. Recently, an increasing number of deep learning (DL) algorithms have made significant progress in medical image analysis, leading to a boost in publications of cytological studies. To…

    Submitted 16 February, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Journal ref: Medical Image Analysis, Nov 2022

  50. arXiv:2201.03186  [pdf, other]

    eess.IV cs.CV

    MyoPS: A Benchmark of Myocardial Pathology Segmentation Combining Three-Sequence Cardiac Magnetic Resonance Images

    Authors: Lei Li, Fuping Wu, Sihan Wang, Xinzhe Luo, Carlos Martin-Isla, Shuwei Zhai, Jianpeng Zhang, Yanfei Liu, Zhen Zhang, Markus J. Ankenbrand, Haochuan Jiang, Xiaoran Zhang, Linhong Wang, Tewodros Weldebirhan Arega, Elif Altunok, Zhou Zhao, Feiyan Li, Jun Ma, ** Yang, Elodie Puybareau, Ilkay Oksuz, Stephanie Bricq, Weisheng Li, Kumaradevan Punithakumar, Sotirios A. Tsaftaris, et al. (7 additional authors not shown)

    Abstract: Assessment of myocardial viability is essential in diagnosis and treatment management of patients suffering from myocardial infarction, and classification of pathology on myocardium is the key to this assessment. This work defines a new task of medical image analysis, i.e., to perform myocardial pathology segmentation (MyoPS) combining three-sequence cardiac magnetic resonance (CMR) images, which…

    Submitted 10 January, 2022; originally announced January 2022.