Skip to main content

Showing 1–50 of 195 results for author: He, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.14973  [pdf, other

    cs.CV eess.IV

    LU2Net: A Lightweight Network for Real-time Underwater Image Enhancement

    Authors: Haodong Yang, Jisheng Xu, Zhiliang Lin, Jian** He

    Abstract: Computer vision techniques have empowered underwater robots to effectively undertake a multitude of tasks, including object tracking and path planning. However, underwater optical factors like light refraction and absorption present challenges to underwater vision, which cause degradation of underwater images. A variety of underwater image enhancement methods have been proposed to improve the effe… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2406.12783  [pdf, ps, other

    cs.NE cs.DC eess.SY math.NA

    Zeroing neural dynamics solving time-variant complex conjugate matrix equation

    Authors: Jiakuang He, Dongqing Wu

    Abstract: Complex conjugate matrix equations (CCME) have aroused the interest of many researchers because of computations and antilinear systems. Existing research is dominated by its time-invariant solving methods, but lacks proposed theories for solving its time-variant version. Moreover, artificial neural networks are rarely studied for solving CCME. In this paper, starting with the earliest CCME, zeroin… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.12186  [pdf, ps, other

    eess.IV cs.CV

    Unlocking the Potential of Early Epochs: Uncertainty-aware CT Metal Artifact Reduction

    Authors: Xinquan Yang, Guanqun Zhou, Wei Sun, Youjian Zhang, Zhongya Wang, Jiahui He, Zhicheng Zhang

    Abstract: In computed tomography (CT), the presence of metallic implants in patients often leads to disruptive artifacts in the reconstructed images, hindering accurate diagnosis. Recently, a large amount of supervised deep learning-based approaches have been proposed for metal artifact reduction (MAR). However, these methods neglect the influence of initial training weights. In this paper, we have discover… ▽ More

    Submitted 20 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  4. arXiv:2405.15830  [pdf, other

    eess.IV

    Diff-DTI: Fast Diffusion Tensor Imaging Using A Feature-Enhanced Joint Diffusion Model

    Authors: Lang Zhang, **ling He, Dong Liang, Hairong Zheng, Yanjie Zhu

    Abstract: Magnetic resonance diffusion tensor imaging (DTI) is a critical tool for neural disease diagnosis. However, long scan time greatly hinders the widespread clinical use of DTI. To accelerate image acquisition, a feature-enhanced joint diffusion model (Diff-DTI) is proposed to obtain accurate DTI parameter maps from a limited number of diffusion-weighted images (DWIs). Diff-DTI introduces a joint dif… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 11 pages, 7 figures

  5. arXiv:2405.15241  [pdf, other

    eess.IV cs.CV

    Blaze3DM: Marry Triplane Representation with Diffusion for 3D Medical Inverse Problem Solving

    Authors: Jia He, Bonan Li, Ge Yang, Ziwen Liu

    Abstract: Solving 3D medical inverse problems such as image restoration and reconstruction is crucial in modern medical field. However, the curse of dimensionality in 3D medical data leads mainstream volume-wise methods to suffer from high resource consumption and challenges models to successfully capture the natural distribution, resulting in inevitable volume inconsistency and artifacts. Some recent works… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2405.04258  [pdf, other

    eess.SY

    A Weighted Least-Squares Method for Non-Asymptotic Identification of Markov Parameters from Multiple Trajectories

    Authors: Jiabao He, Cristian R. Rojas, Håkan Hjalmarsson

    Abstract: Markov parameters play a key role in system identification. There exists many algorithms where these parameters are estimated using least-squares in a first, pre-processing, step, including subspace identification and multi-step least-squares algorithms, such as Weighted Null-Space Fitting. Recently, there has been an increasing interest in non-asymptotic analysis of estimation algorithms. In this… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  7. arXiv:2405.04250  [pdf, other

    eess.SY

    Weighted Least-Squares PARSIM

    Authors: Jiabao He, Cristian R. Rojas, Håkan Hjalmarsson

    Abstract: Subspace identification methods (SIMs) have proven very powerful for estimating linear state-space models. To overcome the deficiencies of classical SIMs, a significant number of algorithms has appeared over the last two decades, where most of them involve a common intermediate step, that is to estimate the range space of the extended observability matrix. In this contribution, an optimized versio… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  8. arXiv:2404.19500  [pdf, other

    cs.CV cs.AI cs.MM eess.IV

    Towards Real-world Video Face Restoration: A New Benchmark

    Authors: Ziyan Chen, **gwen He, Xinqi Lin, Yu Qiao, Chao Dong

    Abstract: Blind face restoration (BFR) on images has significantly progressed over the last several years, while real-world video face restoration (VFR), which is more challenging for more complex face motions such as moving gaze directions and facial orientations involved, remains unsolved. Typical BFR methods are evaluated on privately synthesized datasets or self-collected real-world low-quality face ima… ▽ More

    Submitted 4 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: Project page: https://ziyannchen.github.io/projects/VFRxBenchmark/

  9. arXiv:2404.17411  [pdf, ps, other

    eess.SP

    Low-Complexity Near-Field Channel Estimation for Hybrid RIS Assisted Systems

    Authors: Rafaela Schroeder, Jiguang He, Hamza Djelouat, Markku Juntti

    Abstract: We investigate the channel estimation (CE) problem for hybrid RIS assisted systems and focus on the near-field (NF) regime. Different from their far-field counterparts, NF channels possess a block-sparsity property, which is leveraged in the two developed CE algorithms: (i) boundary estimation and sub-vector recovery (BESVR) and (ii) linear total variation regularization (TVR). In addition, we ado… ▽ More

    Submitted 30 April, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: 5 pages, 5 figures

  10. arXiv:2404.17331  [pdf, ps, other

    eess.SY

    Finite Sample Analysis for a Class of Subspace Identification Methods

    Authors: Jiabao He, Ingvar Ziemann, Cristian R. Rojas, Håkan Hjalmarsson

    Abstract: While subspace identification methods (SIMs) are appealing due to their simple parameterization for MIMO systems and robust numerical realizations, a comprehensive statistical analysis of SIMs remains an open problem, especially in the non-asymptotic regime. In this work, we provide a finite sample analysis for a class of SIMs, which reveals that the convergence rates for estimating Markov paramet… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  11. arXiv:2404.14879  [pdf, other

    eess.SP

    Device-Free 3D Drone Localization in RIS-Assisted mmWave MIMO Networks

    Authors: Jiguang He, Charles Vanwynsberghe, Hui Chen, Chongwen Huang, Aymen Fakhreddine

    Abstract: In this paper, we investigate the potential of reconfigurable intelligent surfaces (RISs) in facilitating passive/device-free three-dimensional (3D) drone localization within existing cellular infrastructure operating at millimeter-wave (mmWave) frequencies and employing multiple antennas at the transceivers. The developed localization system operates in the bi-static mode without requiring direct… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 6 pages, 5 figures, submitted to IEEE GLOBECOM 2024

  12. arXiv:2404.12257  [pdf, other

    cs.CV cs.AI cs.LG cs.MM eess.IV

    Food Portion Estimation via 3D Object Scaling

    Authors: Gautham Vinod, Jiangpeng He, Zeman Shao, Fengqing Zhu

    Abstract: Image-based methods to analyze food images have alleviated the user burden and biases associated with traditional methods. However, accurate portion estimation remains a major challenge due to the loss of 3D information in the 2D representation of foods captured by smartphone cameras or wearable devices. In this paper, we propose a new framework to estimate both food volume and energy from 2D imag… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  13. arXiv:2404.07507  [pdf, other

    eess.IV cs.CV

    Learning to Classify New Foods Incrementally Via Compressed Exemplars

    Authors: Justin Yang, Zhihao Duan, Jiangpeng He, Fengqing Zhu

    Abstract: Food image classification systems play a crucial role in health monitoring and diet tracking through image-based dietary assessment techniques. However, existing food recognition systems rely on static datasets characterized by a pre-defined fixed number of food classes. This contrasts drastically with the reality of food consumption, which features constantly changing data. Therefore, food image… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  14. arXiv:2403.08343  [pdf, ps, other

    cs.IT eess.SP

    Coverage and Rate Analysis for Integrated Sensing and Communication Networks

    Authors: Xu Gan, Chongwen Huang, Zhaohui Yang, Xiaoming Chen, Jiguang He, Zhaoyang Zhang, Chau Yuen, Yong Liang Guan, Mérouane Debbah

    Abstract: Integrated sensing and communication (ISAC) is increasingly recognized as a pivotal technology for next-generation cellular networks, offering mutual benefits in both sensing and communication capabilities. This advancement necessitates a re-examination of the fundamental limits within networks where these two functions coexist via shared spectrum and infrastructures. However, traditional stochast… ▽ More

    Submitted 22 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  15. arXiv:2403.06074  [pdf, other

    cs.IT eess.SP

    Hashing Beam Training for Near-Field Communications

    Authors: Yuan Xu, Li Wei, Chongwen Huang, Chen Zhu, Zhaohui Yang, Jun Yang, Jiguang He, Zhaoyang Zhang, Mérouane Debbah

    Abstract: In this paper, we investigate the millimeter-wave (mmWave) near-field beam training problem to find the correct beam direction. In order to address the high complexity and low identification accuracy of existing beam training techniques, we propose an efficient hashing multi-arm beam (HMB) training scheme for the near-field scenario. Specifically, we first design a set of sparse bases based on the… ▽ More

    Submitted 9 April, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.04913

  16. arXiv:2403.06073  [pdf, other

    cs.IT eess.SP

    Stochastic Geometry Analysis for Distributed RISs-Assisted mmWave Communications

    Authors: Yuan Xu, Li Wei, Chongwen Huang, Yongxu Zhu, Zhaohui Yang, Jun Yang, Jiguang He, Zhaoyang Zhang, Mérouane Debbah

    Abstract: Millimeter wave (mmWave) has attracted considerable attention due to its wide bandwidth and high frequency. However, it is highly susceptible to blockages, resulting in significant degradation of the coverage and the sum rate. A promising approach is deploying distributed reconfigurable intelligent surfaces (RISs), which can establish extra communication links. In this paper, we investigate the im… ▽ More

    Submitted 9 April, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.06154

  17. arXiv:2403.05970  [pdf, other

    cs.IT eess.SP

    Electromagnetic Hybrid Beamforming for Holographic Communications

    Authors: Ran Ji, Chongwen Huang, Xiaoming Chen, Wei E. I. Sha, Linglong Dai, Jiguang He, Zhaoyang Zhang, Chau Yuen, Mérouane Debbah

    Abstract: It is well known that there is inherent radiation pattern distortion for the commercial base station antenna array, which usually needs three antenna sectors to cover the whole space. To eliminate pattern distortion and further enhance beamforming performance, we propose an electromagnetic hybrid beamforming (EHB) scheme based on a three-dimensional (3D) superdirective holographic antenna array. S… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 13 pages

  18. arXiv:2402.18862  [pdf, other

    eess.IV

    Towards Backward-Compatible Continual Learning of Image Compression

    Authors: Zhihao Duan, Ming Lu, Justin Yang, Jiangpeng He, Zhan Ma, Fengqing Zhu

    Abstract: This paper explores the possibility of extending the capability of pre-trained neural image compressors (e.g., adapting to new data or target bitrates) without breaking backward compatibility, the ability to decode bitstreams encoded by the original model. We refer to this problem as continual learning of image compression. Our initial findings show that baseline solutions, such as end-to-end fine… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to CVPR 2024

  19. arXiv:2402.16619  [pdf

    eess.IV cs.CV physics.med-ph

    Magnetic resonance delta radiomics to track radiation response in lung tumors receiving stereotactic MRI-guided radiotherapy

    Authors: Yining Zha, Benjamin H. Kann, Zezhong Ye, Anna Zapaishchykova, John He, Shu-Hui Hsu, Jonathan E. Leeman, Kelly J. Fitzgerald, David E. Kozono, Raymond H. Mak, Hugo J. W. L. Aerts

    Abstract: Introduction: Lung cancer is a leading cause of cancer-related mortality, and stereotactic body radiotherapy (SBRT) has become a standard treatment for early-stage lung cancer. However, the heterogeneous response to radiation at the tumor level poses challenges. Currently, standardized dosage regimens lack adaptation based on individual patient or tumor characteristics. Thus, we explore the potent… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  20. arXiv:2402.16129  [pdf, other

    eess.SP

    Localization in Reconfigurable Intelligent Surface Aided mmWave Systems: A Multiple Measurement Vector Based Channel Estimation Method

    Authors: Kunlun Li, Jiguang He, Mohammed El-Hajjar, Lie-Liang Yang

    Abstract: The sparsity of millimeter wave (mmWave) channels in the angular and temporal domains is beneficial to channel estimation, while the associated channel parameters can be utilized for localization. However, line-of-sight (LoS) blockage poses a significant challenge on the localization in mmWave systems, potentially leading to substantial positioning errors. A promising solution is to employ reconfi… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  21. arXiv:2402.15857  [pdf, other

    eess.SP

    ELAA Near-Field Localization and Sensing with Partial Blockage Detection

    Authors: Hui Chen, Pinjun Zheng, Yu Ge, Ahmed Elzanaty, Jiguang He, Tareq Y. Al-Naffouri, Henk Wymeersch

    Abstract: High-frequency communication systems bring extremely large aperture arrays (ELAA) and large bandwidths, integrating localization and (bi-static) sensing functions without extra infrastructure. Such systems are likely to operate in the near-field (NF), where the performance of localization and sensing is degraded if a simplified far-field channel model is considered. However, when taking advantage… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  22. arXiv:2402.09752  [pdf

    physics.optics eess.SY physics.app-ph quant-ph

    Vector spectrometer with Hertz-level resolution and super-recognition capability

    Authors: Ting Qing, Shupeng Li, Huashan Yang, Lihan Wang, Yijie Fang, Xiaohu Tang, Meihui Cao, Jianming Lu, Jijun He, Junqiu Liu, Yueguang Lyu, Shilong Pan

    Abstract: High-resolution optical spectrometers are crucial in revealing intricate characteristics of signals, determining laser frequencies, measuring physical constants, identifying substances, and advancing biosensing applications. Conventional spectrometers, however, often grapple with inherent trade-offs among spectral resolution, wavelength range, and accuracy. Furthermore, even at high resolution, re… ▽ More

    Submitted 6 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 21 pages, 6 figures

  23. arXiv:2402.09372  [pdf, other

    eess.IV cs.AI cs.CV

    Deep Rib Fracture Instance Segmentation and Classification from CT on the RibFrac Challenge

    Authors: Jiancheng Yang, Rui Shi, Liang **, Xiaoyang Huang, Kaiming Kuang, Donglai Wei, Shixuan Gu, Jianying Liu, Pengfei Liu, Zhizhong Chai, Yongjie Xiao, Hao Chen, Liming Xu, Bang Du, Xiangyi Yan, Hao Tang, Adam Alessio, Gregory Holste, Jiapeng Zhang, Xiaoming Wang, Jianye He, Lixuan Che, Hanspeter Pfister, Ming Li, Bingbing Ni

    Abstract: Rib fractures are a common and potentially severe injury that can be challenging and labor-intensive to detect in CT scans. While there have been efforts to address this field, the lack of large-scale annotated datasets and evaluation benchmarks has hindered the development and validation of deep learning algorithms. To address this issue, the RibFrac Challenge was introduced, providing a benchmar… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Challenge paper for MICCAI RibFrac Challenge (https://ribfrac.grand-challenge.org/)

  24. arXiv:2402.09181  [pdf, other

    eess.IV cs.CV

    OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM

    Authors: Yutao Hu, Tianbin Li, Quanfeng Lu, Wenqi Shao, Junjun He, Yu Qiao, ** Luo

    Abstract: Large Vision-Language Models (LVLMs) have demonstrated remarkable capabilities in various multimodal tasks. However, their potential in the medical domain remains largely unexplored. A significant challenge arises from the scarcity of diverse medical images spanning various modalities and anatomical regions, which is essential in real-world medical applications. To solve this problem, in this pape… ▽ More

    Submitted 21 April, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  25. arXiv:2402.07259  [pdf, ps, other

    eess.SP

    RIS-Augmented Millimeter-Wave MIMO Systems for Passive Drone Detection

    Authors: Jiguang He, Aymen Fakhreddine, George C. Alexandropoulos

    Abstract: In the past decade, the number of amateur drones is increasing, and this trend is expected to continue in the future. The security issues brought by abuse and misconduct of drones become more and more severe and may incur a negative impact to the society. In this paper, we leverage existing cellular multiple-input multiple-output (MIMO) base station (BS) infrastructure, operating at millimeter wav… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: 6 pages, 6 figures, submitted to IEEE PIMRC 2024

  26. arXiv:2402.06154  [pdf, other

    cs.IT eess.SP

    Coverage and Rate Analysis for Distributed RISs-Assisted mmWave Communications

    Authors: Yuan Xu, Chongwen Huang, Wei Li, Yongxu Zhu, Zhaohui Yang, Jiguang He, Jun Yang, Zhaoyang Zhang, Chau Yuen, Merouane Debbah

    Abstract: The millimeter wave (mmWave) has received considerable interest due to its expansive bandwidth and high frequency. However, a noteworthy challenge arises from its vulnerability to blockages, leading to reduced coverage and achievable rates. To address these limitations, a potential solution is to deploy distributed reconfigurable intelligent surfaces (RISs), which comprise many low-cost and passiv… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  27. arXiv:2401.15619  [pdf, ps, other

    eess.SP

    A semidefinite programming approach for robust elliptic localization

    Authors: Wenxin Xiong, Jiajun He, Zhang-Lei Shi, Keyuan Hu, Hing Cheung So, Chi-Sing Leung

    Abstract: This short communication addresses the problem of elliptic localization with outlier measurements, whose occurrences are prevalent in various location-enabled applications and can significantly compromise the positioning performance if not adequately handled. In contrast to the reliance on $M$-estimation adopted in the majority of existing solutions, we take a different path, specifically explorin… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  28. arXiv:2401.13260  [pdf, other

    cs.CL cs.MM cs.SD eess.AS

    MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, Asr Error Detection, and Asr Error Correction

    Authors: Jiajun He, Xiaohan Shi, Xingfeng Li, Tomoki Toda

    Abstract: The prevalent approach in speech emotion recognition (SER) involves integrating both audio and textual information to comprehensively identify the speaker's emotion, with the text generally obtained through automatic speech recognition (ASR). An essential issue of this approach is that ASR errors from the text modality can worsen the performance of SER. Previous studies have proposed using an auxi… ▽ More

    Submitted 28 May, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  29. arXiv:2312.16946  [pdf, other

    eess.SP eess.SY

    LEO Satellite and RIS: Two Keys to Seamless Indoor and Outdoor Localization

    Authors: Pinjun Zheng, Xing Liu, Jiguang He, Gonzalo Seco-Granados, Tareq Y. Al-Naffouri

    Abstract: The contemporary landscape of wireless technology underscores the critical role of precise localization services. Traditional global navigation satellite systems (GNSS)-based solutions, however, fall short when it comes to indoor environments, and existing indoor localization techniques such as electromagnetic fingerprinting methods face challenges of high implementation costs and limited coverage… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  30. arXiv:2312.16572  [pdf, other

    eess.SY

    Observation-based Optimal Control Law Learning with LQR Reconstruction

    Authors: Chendi Qu, Jian** He, Xiaoming Duan

    Abstract: Designing controllers to generate various trajectories has been studied for years, while recently, recovering an optimal controller from trajectories receives increasing attention. In this paper, we reveal that the inherent linear quadratic regulator (LQR) problem of a moving agent can be reconstructed based on its trajectory observations only, which enables one to learn the optimal control law of… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

  31. arXiv:2312.15481  [pdf, other

    eess.SP

    A Novel Field-Free SOT Magnetic Tunnel Junction With Local VCMA-Induced Switching

    Authors: Rui Zhou, Haiyang Zhang, Hao Wang, ** He, Qijun Huang, Sheng Chang

    Abstract: By integrating the local voltage-controlled magnetic anisotropy (VCMA) effect, Dzyaloshinskii-Moriya interaction (DMI) effect, and spin-orbit torque (SOT) effect, we propose a novel device structure for field-free magnetic tunnel junction (MTJ). Micromagnetic simulation shows that the device utilizes the chiral symmetry breaking caused by the DMI effect to induce a non-collinear spin texture under… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  32. arXiv:2312.10741  [pdf, other

    eess.AS cs.CL cs.SD

    StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis

    Authors: Yu Zhang, Rongjie Huang, Ruiqi Li, **Zheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao

    Abstract: Style transfer for out-of-domain (OOD) singing voice synthesis (SVS) focuses on generating high-quality singing voices with unseen styles (such as timbre, emotion, pronunciation, and articulation skills) derived from reference singing voice samples. However, the endeavor to model the intricate nuances of singing voice styles is an arduous task, as singing voices possess a remarkable degree of expr… ▽ More

    Submitted 2 January, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI 2024

  33. arXiv:2312.09576  [pdf, other

    eess.IV cs.CV

    SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma

    Authors: Xiangde Luo, Jia Fu, Yunxin Zhong, Shuolin Liu, Bing Han, Mehdi Astaraki, Simone Bendazzoli, Iuliana Toma-Dasu, Yiwen Ye, Ziyang Chen, Yong Xia, Yanzhou Su, ** Ye, Junjun He, Zhaohu Xing, Hongqiu Wang, Lei Zhu, Kaixiang Yang, Xin Fang, Zhiwei Wang, Chan Woong Lee, Sang Joon Park, Jaehee Chun, Constantin Ulrich, Klaus H. Maier-Hein , et al. (17 additional authors not shown)

    Abstract: Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC) treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting patient prognosis. Previously, the delineation of GTVs and OARs was performed by experienced radiation oncologists. Recently, deep learning has achieved promising results… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: A challenge report of SegRap2023 (organized in conjunction with MICCAI2023)

  34. arXiv:2312.09420  [pdf, other

    eess.SP cs.AI cs.IT

    Fairness-Driven Optimization of RIS-Augmented 5G Networks for Seamless 3D UAV Connectivity Using DRL Algorithms

    Authors: Yu Tian, Ahmed Alhammadi, Jiguang He, Aymen Fakhreddine, Faouzi Bader

    Abstract: In this paper, we study the problem of joint active and passive beamforming for reconfigurable intelligent surface (RIS)-assisted massive multiple-input multiple-output systems towards the extension of the wireless cellular coverage in 3D, where multiple RISs, each equipped with an array of passive elements, are deployed to assist a base station (BS) to simultaneously serve multiple unmanned aeria… ▽ More

    Submitted 14 November, 2023; originally announced December 2023.

  35. arXiv:2312.07846  [pdf, other

    eess.IV

    Prompted Contextual Transformer for Incomplete-View CT Reconstruction

    Authors: Chenglong Ma, Zilong Li, Junjun He, Jun** Zhang, Yi Zhang, Hongming Shan

    Abstract: Incomplete-view computed tomography (CT) can shorten the data acquisition time and allow scanning of large objects, including sparse-view and limited-angle scenarios, each with various settings, such as different view numbers or angular ranges. However, the reconstructed images present severe, varying artifacts due to different missing projection data patterns. Existing methods tackle these scenar… ▽ More

    Submitted 11 March, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  36. arXiv:2311.13622  [pdf, other

    cs.CV eess.IV

    TDiffDe: A Truncated Diffusion Model for Remote Sensing Hyperspectral Image Denoising

    Authors: Jiang He, Yajie Li, Jie L, Qiangqiang Yuan

    Abstract: Hyperspectral images play a crucial role in precision agriculture, environmental monitoring or ecological analysis. However, due to sensor equipment and the imaging environment, the observed hyperspectral images are often inevitably corrupted by various noise. In this study, we proposed a truncated diffusion model, called TDiffDe, to recover the useful information in hyperspectral images gradually… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  37. arXiv:2311.11969  [pdf, other

    eess.IV cs.CV

    SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks

    Authors: ** Ye, Junlong Cheng, Jianpin Chen, Zhongying Deng, Tianbin Li, Haoyu Wang, Yanzhou Su, Ziyan Huang, Jilong Chen, Lei Jiang, Hui Sun, Min Zhu, Shaoting Zhang, Junjun He, Yu Qiao

    Abstract: Segment Anything Model (SAM) has achieved impressive results for natural image segmentation with input prompts such as points and bounding boxes. Its success largely owes to massive labeled training data. However, directly applying SAM to medical image segmentation cannot perform well because SAM lacks medical knowledge -- it does not use medical images for training. To incorporate medical knowled… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  38. arXiv:2311.07093  [pdf, other

    cs.SD cs.CL eess.AS

    On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition

    Authors: Xiaohan Shi, Jiajun He, Xingfeng Li, Tomoki Toda

    Abstract: This paper proposes an efficient attempt to noisy speech emotion recognition (NSER). Conventional NSER approaches have proven effective in mitigating the impact of artificial noise sources, such as white Gaussian noise, but are limited to non-stationary noises in real-world environments due to their complexity and uncertainty. To overcome this limitation, we introduce a new method for NSER by adop… ▽ More

    Submitted 14 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Submitted to ICASSP 2024

  39. arXiv:2311.03653  [pdf, ps, other

    cs.IT eess.SP

    On the Performance of LoRa Empowered Communication for Wireless Body Area Networks

    Authors: Minling Zhang, Guofa Cai, Zhi** Xu, Jiguang He, Markku Juntti

    Abstract: To remotely monitor the physiological status of the human body, long range (LoRa) communication has been considered as an eminently suitable candidate for wireless body area networks (WBANs). Typically, a Rayleigh-lognormal fading channel is encountered by the LoRa links of the WBAN. In this context, we characterize the performance of the LoRa system in WBAN scenarios with an emphasis on the physi… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  40. arXiv:2310.19288  [pdf, other

    eess.IV cs.CV

    EDiffSR: An Efficient Diffusion Probabilistic Model for Remote Sensing Image Super-Resolution

    Authors: Yi Xiao, Qiangqiang Yuan, Kui Jiang, Jiang He, Xianyu **, Liangpei Zhang

    Abstract: Recently, convolutional networks have achieved remarkable development in remote sensing image Super-Resoltuion (SR) by minimizing the regression objectives, e.g., MSE loss. However, despite achieving impressive performance, these methods often suffer from poor visual quality with over-smooth issues. Generative adversarial networks have the potential to infer intricate details, but they are easy to… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Submitted to IEEE TGRS

  41. arXiv:2310.14217  [pdf, ps, other

    cs.IT eess.SP

    On the Sum Secrecy Rate of Multi-User Holographic MIMO Networks

    Authors: Arthur S. de Sena, Jiguang He, Ahmed Al Hammadi, Chongwen Huang, Faouzi Bader, Merouane Debbah, Mathias Fink

    Abstract: The emerging concept of extremely-large holographic multiple-input multiple-output (HMIMO), beneficial from compactly and densely packed cost-efficient radiating meta-atoms, has been demonstrated for enhanced degrees of freedom even in pure line-of-sight conditions, enabling tremendous multiplexing gain for the next-generation communication systems. Most of the reported works focus on energy and s… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 7 pages, 7 figures, submitted to IEEE ICC 2024

  42. arXiv:2310.10300  [pdf, other

    cs.SD cs.IR eess.AS

    BeatDance: A Beat-Based Model-Agnostic Contrastive Learning Framework for Music-Dance Retrieval

    Authors: Kaixing Yang, Xukun Zhou, Xulong Tang, Ran Diao, Hongyan Liu, Jun He, Zhaoxin Fan

    Abstract: Dance and music are closely related forms of expression, with mutual retrieval between dance videos and music being a fundamental task in various fields like education, art, and sports. However, existing methods often suffer from unnatural generation effects or fail to fully explore the correlation between music and dance. To overcome these challenges, we propose BeatDance, a novel beat-based mode… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  43. arXiv:2310.08021  [pdf, other

    eess.SP

    Channel-robust Automatic Modulation Classification Using Spectral Quotient Cumulants

    Authors: Sai Huang, Yuting Chen, Jiashuo He, Shuo Chang, Zhiyong Feng

    Abstract: Automatic modulation classification (AMC) is to identify the modulation format of the received signal corrupted by the channel effects and noise. Most existing works focus on the impact of noise while relatively little attention has been paid to the impact of channel effects. However, the instability posed by multipath fading channels leads to significant performance degradation. To mitigate the a… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: THIS WORK HAS BEEN SUBMITTED TO THE IEEE FOR POSSIBLE PUBLICATION. COPYRIGHT MAY BE TRANSFERRED WITHOUT NOTICE, AFTER WHICH THIS VERSION MAY NO LONGER BE ACCESSIBLE,5 Pages

  44. arXiv:2309.16389  [pdf, other

    cs.IT eess.SP

    A Universal Framework for Holographic MIMO Sensing

    Authors: Charles Vanwynsberghe, Jiguang He, Mérouane Debbah

    Abstract: This paper addresses the sensing space identification of arbitrarily shaped continuous antennas. In the context of holographic multiple-input multiple-output (MIMO), a.k.a. large intelligent surfaces, these antennas offer benefits such as super-directivity and near-field operability. The sensing space reveals two key aspects: (a) its dimension specifies the maximally achievable spatial degrees of… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  45. arXiv:2309.15462  [pdf, other

    cs.RO cs.LG eess.SY

    DTC: Deep Tracking Control

    Authors: Fabian Jenelten, Junzhe He, Farbod Farshidian, Marco Hutter

    Abstract: Legged locomotion is a complex control problem that requires both accuracy and robustness to cope with real-world challenges. Legged systems have traditionally been controlled using trajectory optimization with inverse dynamics. Such hierarchical model-based methods are appealing due to intuitive cost function tuning, accurate planning, generalization, and most importantly, the insightful understa… ▽ More

    Submitted 22 January, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  46. arXiv:2309.11992  [pdf, other

    eess.SP cs.NI

    UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning

    Authors: Jia He, Ziye Jia, Chao Dong, Junyu Liu, Qihui Wu, **gxian Liu

    Abstract: Unmanned aerial vehicles (UAVs) are recognized as promising technologies for area coverage due to the flexibility and adaptability. However, the ability of a single UAV is limited, and as for the large-scale three-dimensional (3D) scenario, UAV swarms can establish seamless wireless communication services. Hence, in this work, we consider a scenario of UAV swarm deployment and trajectory to satisf… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  47. Symbol Detection for Coarsely Quantized OTFS

    Authors: Junwei He, Haochuan Zhang, Chao Dong, Huimin Zhu

    Abstract: This paper explicitly models a coarse and noisy quantization in a communication system empowered by orthogonal time frequency space (OTFS) for cost and power efficiency. We first point out, with coarse quantization, the effective channel is imbalanced and thus no longer able to circularly shift the transmitted symbols along the delay-Doppler domain. Meanwhile, the effective channel is non-isotropi… ▽ More

    Submitted 20 January, 2024; v1 submitted 20 September, 2023; originally announced September 2023.

  48. arXiv:2309.09565  [pdf, other

    eess.SP

    A Covariance Adaptive Student's t Based Kalman Filter

    Authors: Benyang Gong, Jiacheng He, Gang Wang, Bei Peng

    Abstract: In the classical Kalman filter(KF), the estimated state is a linear combination of the one-step predicted state and measurement state, their confidence level change when the prediction mean square error matrix and covariance matrix of measurement noise vary. The existing student's t based Kalman filter(TKF) works similarly to the way KF works, they both work well with impulse noise, but when it co… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  49. arXiv:2309.08088  [pdf, ps, other

    eess.SY

    Interactive Model Fusion-Based GM-PHD Filter

    Authors: Jiacheng He, Shan Zhong, Bei Peng, Gang Wang, Qizhen Wang

    Abstract: In multi-target tracking (MTT), non-Gaussian measurement noise from sensors can diminish the performance of the Gaussian-assumed Gaussian mixture probability hypothesis density (GM-PHD) filter. In this paper, an approach that transforms the MTT problem under non-Gaussian conditions into an MTT problem under Gaussian conditions is developed. Specifically, measurement noise with a non-Gaussian distr… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: conference

  50. arXiv:2309.04084  [pdf, other

    cs.CV cs.MM eess.IV

    Towards Efficient SDRTV-to-HDRTV by Learning from Image Formation

    Authors: Xiangyu Chen, Zheyuan Li, Zhengwen Zhang, Jimmy S. Ren, Yihao Liu, **gwen He, Yu Qiao, Jiantao Zhou, Chao Dong

    Abstract: Modern displays are capable of rendering video content with high dynamic range (HDR) and wide color gamut (WCG). However, the majority of available resources are still in standard dynamic range (SDR). As a result, there is significant value in transforming existing SDR content into the HDRTV standard. In this paper, we define and analyze the SDRTV-to-HDRTV task by modeling the formation of SDRTV/H… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: Extended version of HDRTVNet