Skip to main content

Showing 1–50 of 114 results for author: Xu, K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19856  [pdf

    eess.SP

    LUT-boosted CDR and Equalization for Burst-mode 50/100 Gbit/s Bandwidth-limited Flexible PON

    Authors: Yanlu Huang, Liyan Wu, Shangya Han, Kai **, Kun Xu, Yanni Ou

    Abstract: We proposed and experimentally demonstrated a look-up table boosted fast CDR and equalization scheme for the burst-mode 50/100 Gbps bandwidth-limited flexible PON, requiring no preamble for convergence and achieved the same bit error rate performance as in the case of long preambles.

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.14794  [pdf, other

    eess.IV cs.CV cs.LG

    ImageFlowNet: Forecasting Multiscale Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical Images

    Authors: Chen Liu, Ke Xu, Liangbo L. Shen, Guillaume Huguet, Zilong Wang, Alexander Tong, Danilo Bzdok, Jay Stewart, Jay C. Wang, Lucian V. Del Priore, Smita Krishnaswamy

    Abstract: The forecasting of disease progression from images is a holy grail for clinical decision making. However, this task is complicated by the inherent high dimensionality, temporal sparsity and sampling irregularity in longitudinal image acquisitions. Existing methods often rely on extracting hand-crafted features and performing time-series analysis in this vector space, leading to a loss of rich spat… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2405.18435  [pdf, other

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  4. arXiv:2405.16011  [pdf, ps, other

    eess.SP

    Semantic Importance-Aware Communications with Semantic Correction Using Large Language Models

    Authors: Shuaishuai Guo, Yanhu Wang, Jia Ye, Anbang Zhang, Kun Xu

    Abstract: Semantic communications, a promising approach for agent-human and agent-agent interactions, typically operate at a feature level, lacking true semantic understanding. This paper explores understanding-level semantic communications (ULSC), transforming visual data into human-intelligible semantic content. We employ an image caption neural network (ICNN) to derive semantic representations from visua… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  5. arXiv:2405.01961  [pdf, other

    eess.SP

    Rescale-Invariant Federated Reinforcement Learning for Resource Allocation in V2X Networks

    Authors: Kaidi Xu, Shenglong Zhou, Geoffrey Ye Li

    Abstract: Federated Reinforcement Learning (FRL) offers a promising solution to various practical challenges in resource allocation for vehicle-to-everything (V2X) networks. However, the data discrepancy among individual agents can significantly degrade the performance of FRL-based algorithms. To address this limitation, we exploit the node-wise invariance property of ReLU-activated neural networks, with th… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  6. arXiv:2404.16223  [pdf, other

    cs.CV eess.IV

    Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey

    Authors: Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte, Jianxing Zhang, Jia Li, Fan Wang, Xiaopeng Li, Zikun Liu, Hyunhee Park, Sejun Song, Changho Kim, Zhijuan Huang, Hongyuan Yu, Cheng Wan, Wending Xiang, Jiamin Lin, Hang Zhong, Qiaosong Zhang, Yue Sun, Xuanwu Yin, Kunlong Zuo, Senyan Xu, Siyuan Jiang, Zhi**g Sun, Jiaying Zhu , et al. (10 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 RAW Image Super-Resolution Challenge, highlighting the proposed solutions and results. New methods for RAW Super-Resolution could be essential in modern Image Signal Processing (ISP) pipelines, however, this problem is not as explored as in the RGB domain. Th goal of this challenge is to upscale RAW Bayer images by 2x, considering unknown degradations such as nois… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 - NTIRE Workshop

  7. arXiv:2404.13640  [pdf, other

    cs.MM cs.CV eess.IV

    Beyond Alignment: Blind Video Face Restoration via Parsing-Guided Temporal-Coherent Transformer

    Authors: Kepeng Xu, Li Xu, Gang He, Wenxin Yu, Yunsong Li

    Abstract: Multiple complex degradations are coupled in low-quality video faces in the real world. Therefore, blind video face restoration is a highly challenging ill-posed problem, requiring not only hallucinating high-fidelity details but also enhancing temporal coherence across diverse pose variations. Restoring each frame independently in a naive manner inevitably introduces temporal incoherence and arti… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 9 pages

  8. arXiv:2404.11313  [pdf, other

    eess.IV cs.AI

    NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

    Authors: Xin Li, Kun Yuan, Ya**g Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo , et al. (43 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 Challenge on Shortform UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions are submitted and evaluated on the collected dataset KVQ from popular short-form video platform, i.e., Kuaishou/Kwai Platform. The KVQ database is divided into three parts, including 2926 videos for training, 420 videos for validation, and 854 videos for testing. The… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR2024 Workshop. The challenge report for CVPR NTIRE2024 Short-form UGC Video Quality Assessment Challenge

  9. arXiv:2404.10343  [pdf, other

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang , et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  10. arXiv:2403.15853  [pdf

    eess.IV cs.CV

    An edge detection-based deep learning approach for tear meniscus height measurement

    Authors: Kesheng Wang, Kunhui Xu, Xiaoyu Chen, Chunlei He, Jianfeng Zhang, Dexing Kong, Qi Dai, Shoujun Huang

    Abstract: Automatic measurements of tear meniscus height (TMH) have been achieved by using deep learning techniques; however, annotation is significantly influenced by subjective factors and is both time-consuming and labor-intensive. In this paper, we introduce an automatic TMH measurement technique based on edge detection-assisted annotation within a deep learning framework. This method generates mask lab… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 22 pages, 5 figures

  11. arXiv:2403.10573  [pdf, other

    eess.IV cs.CR cs.CV cs.LG

    Medical Unlearnable Examples: Securing Medical Data from Unauthorized Traning via Sparsity-Aware Local Masking

    Authors: Weixiang Sun, Yixin Liu, Zhiling Yan, Kaidi Xu, Lichao Sun

    Abstract: With the rapid growth of artificial intelligence (AI) in healthcare, there has been a significant increase in the generation and storage of sensitive medical data. This abundance of data, in turn, has propelled the advancement of medical AI technologies. However, concerns about unauthorized data exploitation, such as training commercial AI models, often deter researchers from making their invaluab… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  12. arXiv:2403.09923  [pdf, other

    eess.SY

    Optimal Sequencing and Motion Control in a Roundabout with Safety Guarantees

    Authors: Yingqing Chen, Christos G. Cassandras, Kaiyuan Xu

    Abstract: This paper develops a controller for Connected and Automated Vehicles (CAVs) traversing a single-lane roundabout. The controller simultaneously determines the optimal sequence and associated optimal motion control jointly minimizing travel time and energy consumption while providing speed-dependent safety guarantees, as well as satisfying velocity and acceleration constraints. This is achieved by… ▽ More

    Submitted 19 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  13. arXiv:2403.07274  [pdf, other

    cs.IT eess.SP

    Achievable Rate Analysis and Optimization of Double-RIS Assisted Spatially Correlated MIMO with Statistical CSI

    Authors: Kaizhe Xu, Jiajia Guo, Jun Zhang, Shi **, Shaodan Ma

    Abstract: Reconfigurable intelligent surface (RIS) is a novel meta-material which can form a smart radio environment by dynamically altering reflection directions of the im**ing electromagnetic waves. In the prior literature, the inter-RIS links which also contribute to the performance of the whole system are usually neglected when multiple RISs are deployed. In this paper we investigate a general double-… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  14. arXiv:2402.13276  [pdf, other

    eess.AS cs.AI cs.SD

    When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection

    Authors: Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps

    Abstract: Depression is a critical concern in global mental health, prompting extensive research into AI-based detection methods. Among various AI technologies, Large Language Models (LLMs) stand out for their versatility in mental healthcare applications. However, their primary limitation arises from their exclusive dependence on textual input, which constrains their overall capabilities. Furthermore, the… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  15. arXiv:2401.14248  [pdf

    eess.IV cs.CV

    On generalisability of segment anything model for nuclear instance segmentation in histology images

    Authors: Kesi Xu, Lea Goetz, Nasir Rajpoot

    Abstract: Pre-trained on a large and diverse dataset, the segment anything model (SAM) is the first promptable foundation model in computer vision aiming at object segmentation tasks. In this work, we evaluate SAM for the task of nuclear instance segmentation performance with zero-shot learning and finetuning. We compare SAM with other representative methods in nuclear instance segmentation, especially in t… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  16. arXiv:2312.05763  [pdf, ps, other

    cs.IT eess.SP

    Fluid Antennas-Enabled Multiuser Uplink: A Low-Complexity Gradient Descent for Total Transmit Power Minimization

    Authors: Guojie Hu, Qingqing Wu, Kui Xu, Jian Ouyang, Jiangbo Si, Yunlong Cai, Naofal Al-Dhahir

    Abstract: We investigate multiuser uplink communication from multiple single-antenna users to a base station (BS), which is equipped with a movable-antenna (MA) array and adopts zero-forcing receivers to decode multiple signals. We aim to optimize the MAs' positions at the BS, to minimize the total transmit power of all users subject to the minimum rate requirement. After applying transformations, we show t… ▽ More

    Submitted 8 January, 2024; v1 submitted 9 December, 2023; originally announced December 2023.

  17. arXiv:2312.00857  [pdf, other

    cs.LG cs.AI cs.HC eess.SP

    Latent Space Explorer: Visual Analytics for Multimodal Latent Space Exploration

    Authors: Bum Chul Kwon, Samuel Friedman, Kai Xu, Steven A Lubitz, Anthony Philippakis, Puneet Batra, Patrick T Ellinor, Kenney Ng

    Abstract: Machine learning models built on training data with multiple modalities can reveal new insights that are not accessible through unimodal datasets. For example, cardiac magnetic resonance images (MRIs) and electrocardiograms (ECGs) are both known to capture useful information about subjects' cardiovascular health status. A multimodal machine learning model trained from large datasets can potentiall… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 7 pages, 5 figures

  18. arXiv:2311.12947  [pdf, other

    cs.AI eess.SY

    PINNs-Based Uncertainty Quantification for Transient Stability Analysis

    Authors: Ren Wang, Ming Zhong, Kaidi Xu, Lola Giráldez Sánchez-Cortés, Ignacio de Cominges Guerra

    Abstract: This paper addresses the challenge of transient stability in power systems with missing parameters and uncertainty propagation in swing equations. We introduce a novel application of Physics-Informed Neural Networks (PINNs), specifically an Ensemble of PINNs (E-PINNs), to estimate critical parameters like rotor angle and inertia coefficient with enhanced accuracy and reduced computational load. E-… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  19. arXiv:2311.11814  [pdf, ps, other

    cs.IT eess.SP

    Movable-Antenna-Array-Enabled Communications with CoMP Reception

    Authors: Guojie Hu, Qingqing Wu, Jian Ouyang, Kui Xu, Yunlong Cai, Naofal Al-Dhahir

    Abstract: We consider the movable-antenna (MA) arrayenabled wireless communication with coordinate multi-point (CoMP) reception, where multiple destinations adopt the maximal ratio combination technique to jointly decode the common message sent from the transmitter equipped with the MA array. Our goal is to maximize the effective received signal-to-noise ratio, by jointly optimizing the transmit beamforming… ▽ More

    Submitted 25 January, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

  20. arXiv:2311.07104  [pdf, ps, other

    cs.IT eess.SP

    Secure Wireless Communication via Movable-Antenna Array

    Authors: Guojie Hu, Qingqing Wu, Kui Xu, Jiangbo Si, Naofal Al-Dhahir

    Abstract: Movable antenna (MA) array is a novel technology recently developed where positions of transmit/receive antennas can be flexibly adjusted in the specified region to reconfigure the wireless channel and achieve a higher capacity. In this letter, we, for the first time, investigate the MA array-assisted physical-layer security where the confidential information is transmitted from a MA array-enabled… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  21. arXiv:2311.02376  [pdf, ps, other

    cs.IT eess.SP

    Intelligent Reflecting Surface-Aided Wireless Communication with Movable Elements

    Authors: Guojie Hu, Qingqing Wu, Dognhui Xu, Kui Xu, Jiangbo Si, Yunlong Cai, Naofal Al-Dhahir

    Abstract: Intelligent reflecting surface (IRS) has been recognized as a powerful technology for boosting communication performance. To reduce manufacturing and control costs, it is preferable to consider discrete phase shifts (DPSs) for IRS, which are set by default as uniformly distributed in the range of $[ - Ï€,Ï€)$ in the literature. Such setting, however, cannot achieve a desirable performance over the g… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

  22. arXiv:2310.19656  [pdf, other

    eess.IV cs.CV cs.LG

    Domain Generalization in Computational Pathology: Survey and Guidelines

    Authors: Mostafa Jahanifar, Manahil Raza, Kesi Xu, Trinh Vuong, Rob Jewsbury, Adam Shephard, Neda Zamanitajeddin, ** Tae Kwak, Shan E Ahmed Raza, Fayyaz Minhas, Nasir Rajpoot

    Abstract: Deep learning models have exhibited exceptional effectiveness in Computational Pathology (CPath) by tackling intricate tasks across an array of histology image analysis applications. Nevertheless, the presence of out-of-distribution data (stemming from a multitude of sources such as disparate imaging devices and diverse tissue preparation methods) can cause \emph{domain shift} (DS). DS decreases t… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Extended Version

  23. arXiv:2310.13993  [pdf, other

    eess.SP

    Green Beamforming Design for Integrated Sensing and Communication Systems: A Practical Approach Using Beam-Matching Error Metrics

    Authors: Ke Xu, Jie Hu, Kun Yang

    Abstract: In this paper, we propose a green beamforming design for the integrated sensing and communication (ISAC) system, using beam-matching error to assess radar performance. The beam-matching error metric, which considers the mean square error between the desired and designed beam patterns, provides a more practical evaluation approach. To tackle the non-convex challenge inherent in beamforming design,… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  24. arXiv:2310.13984  [pdf, other

    eess.SP

    Robust NOMA-assisted OTFS-ISAC Network Design with 3D Motion Prediction Topology

    Authors: Ke Xu, Jie Hu, Christos Masouros, Kun Yang

    Abstract: This paper proposes a novel non-orthogonal multiple access (NOMA)-assisted orthogonal time-frequency space (OTFS)-integrated sensing and communication (ISAC) network, which uses unmanned aerial vehicles (UAVs) as air base stations to support multiple users. By employing ISAC, the UAV extracts position and velocity information from the user's echo signals, and non-orthogonal power allocation is con… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

  25. arXiv:2310.09858  [pdf, other

    cs.LG cs.AI eess.SP

    Federated Reinforcement Learning for Resource Allocation in V2X Networks

    Authors: Kaidi Xu, Shenglong Zhou, Geoffrey Ye Li

    Abstract: Resource allocation significantly impacts the performance of vehicle-to-everything (V2X) networks. Most existing algorithms for resource allocation are based on optimization or machine learning (e.g., reinforcement learning). In this paper, we explore resource allocation in a V2X network under the framework of federated reinforcement learning (FRL). On one hand, the usage of RL overcomes many chal… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: Submitted to TWC

  26. arXiv:2310.06328  [pdf, other

    cs.LG eess.SP

    Antenna Response Consistency Driven Self-supervised Learning for WIFI-based Human Activity Recognition

    Authors: Ke Xu, Jiangtao Wang, Hongyuan Zhu, Dingchang Zheng

    Abstract: Self-supervised learning (SSL) for WiFi-based human activity recognition (HAR) holds great promise due to its ability to address the challenge of insufficient labeled data. However, directly transplanting SSL algorithms, especially contrastive learning, originally designed for other domains to CSI data, often fails to achieve the expected performance. We attribute this issue to the inappropriate a… ▽ More

    Submitted 28 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

  27. arXiv:2310.05369  [pdf, other

    cs.SD eess.AS

    AdvSV: An Over-the-Air Adversarial Attack Dataset for Speaker Verification

    Authors: Li Wang, Jiaqi Li, Yuhao Luo, Jiahao Zheng, Lei Wang, Hao Li, Ke Xu, Chengfang Fang, Jie Shi, Zhizheng Wu

    Abstract: It is known that deep neural networks are vulnerable to adversarial attacks. Although Automatic Speaker Verification (ASV) built on top of deep neural networks exhibits robust performance in controlled scenarios, many studies confirm that ASV is vulnerable to adversarial attacks. The lack of a standard dataset is a bottleneck for further research, especially reproducible research. In this study, w… ▽ More

    Submitted 16 January, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: Accepted by ICASSP2024

  28. arXiv:2309.12953  [pdf

    eess.IV cs.CV

    Inter-vendor harmonization of Computed Tomography (CT) reconstruction kernels using unpaired image translation

    Authors: Aravind R. Krishnan, Kaiwen Xu, Thomas Li, Chenyu Gao, Lucas W. Remedios, Praitayini Kanakaraj, Ho Hin Lee, Shunxing Bao, Kim L. Sandler, Fabien Maldonado, Ivana Isgum, Bennett A. Landman

    Abstract: The reconstruction kernel in computed tomography (CT) generation determines the texture of the image. Consistency in reconstruction kernels is important as the underlying CT texture can impact measurements during quantitative image analysis. Harmonization (i.e., kernel conversion) minimizes differences in measurements due to inconsistent reconstruction kernels. Existing methods investigate harmoni… ▽ More

    Submitted 26 January, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 10 pages, 6 figures, 1 table, Submitted to SPIE Medical Imaging : Image Processing. San Diego, CA. February 2024

  29. arXiv:2309.10510  [pdf, other

    eess.SY cs.NE

    Logic Design of Neural Networks for High-Throughput and Low-Power Applications

    Authors: Kangwei Xu, Grace Li Zhang, Ulf Schlichtmann, Bing Li

    Abstract: Neural networks (NNs) have been successfully deployed in various fields. In NNs, a large number of multiplyaccumulate (MAC) operations need to be performed. Most existing digital hardware platforms rely on parallel MAC units to accelerate these MAC operations. However, under a given area constraint, the number of MAC units in such platforms is limited, so MAC units have to be reused to perform MAC… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: accepted by ASPDAC 2024

  30. arXiv:2308.15942  [pdf

    eess.IV cs.CV

    Stage-by-stage Wavelet Optimization Refinement Diffusion Model for Sparse-View CT Reconstruction

    Authors: Kai Xu, Shiyu Lu, Bin Huang, Weiwen Wu, Qiegen Liu

    Abstract: Diffusion models have emerged as potential tools to tackle the challenge of sparse-view CT reconstruction, displaying superior performance compared to conventional methods. Nevertheless, these prevailing diffusion models predominantly focus on the sinogram or image domains, which can lead to instability during model training, potentially culminating in convergence towards local minimal solutions.… ▽ More

    Submitted 3 September, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  31. arXiv:2308.03182  [pdf, other

    eess.SY

    Scaling up the Optimal Safe Control of Connected and Automated Vehicles to a Traffic Network: A Hierarchical Framework of Modular Control Zones

    Authors: Kaiyuan Xu, Christos G. Cassandras

    Abstract: We consider the problem of scaling up optimal and safe controllers for Connected and Automated Vehicles (CAVs) from a single Control Zone (CZ) around a traffic conflict area to an entire network. The goal is to jointly minimize travel time and energy consumption for all CAVs, while providing speed-dependent safety guarantees within a CZ and satisfying velocity and acceleration constraints. A hiera… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.04348

  32. arXiv:2308.02412  [pdf, other

    eess.SP cs.AI cs.HC cs.LG

    Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study

    Authors: Ke Xu, Jiangtao Wang, Hongyuan Zhu, Dingchang Zheng

    Abstract: Recently, with the advancement of the Internet of Things (IoT), WiFi CSI-based HAR has gained increasing attention from academic and industry communities. By integrating the deep learning technology with CSI-based HAR, researchers achieve state-of-the-art performance without the need of expert knowledge. However, the scarcity of labeled CSI data remains the most prominent challenge when applying d… ▽ More

    Submitted 19 July, 2023; originally announced August 2023.

  33. arXiv:2307.03394  [pdf, other

    eess.IV cs.MM

    Towards Robust SDRTV-to-HDRTV via Dual Inverse Degradation Network

    Authors: Kepeng Xu, Li Xu, Gang He, Wenxin Yu, Yunsong Li

    Abstract: In this study, we address the emerging necessity of converting Standard Dynamic Range Television (SDRTV) content into High Dynamic Range Television (HDRTV) in light of the limited number of native HDRTV content. A principal technical challenge in this conversion is the exacerbation of coding artifacts inherent in SDRTV, which detrimentally impacts the quality of the resulting HDRTV. To address thi… ▽ More

    Submitted 14 January, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: 13 pages

  34. arXiv:2306.15320  [pdf, ps, other

    eess.SP

    Pulse Shape-Aided Multipath Delay Estimation for Fine-Grained WiFi Sensing

    Authors: Ke Xu, He Chen, Chenshu Wu

    Abstract: Due to the finite bandwidth of practical wireless systems, one multipath component can manifest itself as a discrete pulse consisting of multiple taps in the digital delay domain. This effect is called channel leakage, which complicates the multipath delay estimation problem. In this paper, we develop a new algorithm to estimate multipath delays of leaked channels by leveraging the knowledge of pu… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  35. arXiv:2306.15310  [pdf, other

    eess.SP

    RF-Based Simultaneous Localization and Source Seeking for Multi-Robot Systems

    Authors: Ke Xu, Rui Zhang, He Chen

    Abstract: This paper considers a radio-frequency (RF)-based simultaneous localization and source-seeking (SLASS) problem in multi-robot systems, where multiple robots jointly localize themselves and an RF source using distance-only measurements extracted from RF signals and then control themselves to approach the source. We design a Rao-Blackwellized particle filter-based algorithm to realize the joint loca… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  36. arXiv:2305.18355  [pdf, other

    cs.SD cs.AI cs.LG eess.AS

    An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization

    Authors: Fei Kong, **hao Duan, RuiPeng Ma, Hengtao Shen, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu

    Abstract: Recently, diffusion models have achieved remarkable success in generating tasks, including image and audio generation. However, like other generative models, diffusion models are prone to privacy issues. In this paper, we propose an efficient query-based membership inference attack (MIA), namely Proximal Initialization Attack (PIA), which utilizes groundtruth trajectory obtained by $ε$ initialized… ▽ More

    Submitted 9 October, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  37. arXiv:2304.03760  [pdf, other

    eess.IV cs.CV

    Zero-shot CT Field-of-view Completion with Unconditional Generative Diffusion Prior

    Authors: Kaiwen Xu, Aravind R. Krishnan, Thomas Z. Li, Yuankai Huo, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman

    Abstract: Anatomically consistent field-of-view (FOV) completion to recover truncated body sections has important applications in quantitative analyses of computed tomography (CT) with limited FOV. Existing solution based on conditional generative models relies on the fidelity of synthetic truncation patterns at training phase, which poses limitations for the generalizability of the method to potential unkn… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: Submitted to MIDL 2023, short paper track

  38. arXiv:2304.02836  [pdf, other

    eess.IV cs.CV cs.LG

    Longitudinal Multimodal Transformer Integrating Imaging and Latent Clinical Signatures From Routine EHRs for Pulmonary Nodule Classification

    Authors: Thomas Z. Li, John M. Still, Kaiwen Xu, Ho Hin Lee, Leon Y. Cai, Aravind R. Krishnan, Riqiang Gao, Mirza S. Khan, Sanja Antic, Michael Kammer, Kim L. Sandler, Fabien Maldonado, Bennett A. Landman, Thomas A. Lasko

    Abstract: The accuracy of predictive models for solitary pulmonary nodule (SPN) diagnosis can be greatly increased by incorporating repeat imaging and medical context, such as electronic health records (EHRs). However, clinically routine modalities such as imaging and diagnostic codes can be asynchronous and irregularly sampled over different time scales which are obstacles to longitudinal multimodal learni… ▽ More

    Submitted 29 June, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to MICCAI 2023

  39. arXiv:2303.15826  [pdf, other

    eess.IV cs.AI cs.CV

    MS-MT: Multi-Scale Mean Teacher with Contrastive Unpaired Translation for Cross-Modality Vestibular Schwannoma and Cochlea Segmentation

    Authors: Ziyuan Zhao, Kaixin Xu, Huai Zhe Yeo, Xulei Yang, Cuntai Guan

    Abstract: Domain shift has been a long-standing issue for medical image segmentation. Recently, unsupervised domain adaptation (UDA) methods have achieved promising cross-modality segmentation performance by distilling knowledge from a label-rich source domain to a target domain without labels. In this work, we propose a multi-scale self-ensembling based UDA framework for automatic segmentation of two key b… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

    Comments: Accepted by BrainLes MICCAI proceedings (5th solution for MICCAI 2022 Cross-Modality Domain Adaptation (crossMoDA) Challenge)

  40. Orthogonal-Time-Frequency-Space Signal Design for Integrated Data and Energy Transfer: Benefits from Doppler Offsets

    Authors: Jie Hu, Ke Xu, Kun Yang

    Abstract: Integrated data and energy transfer (IDET) is an advanced technology for enabling energy sustainability for massively deployed low-power electronic consumption components. However, the existing work of IDET using the orthogonal-frequency-division-multiplexing (OFDM) waveforms is designed for static scenarios, which would be severely affected by the destructive Doppler offset in high-mobility scena… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

  41. arXiv:2212.14739  [pdf

    eess.SP cs.AI

    Semantic optical fiber communication system

    Authors: Zhenming Yu, Hongyu Huang, Liming Cheng, Wei Zhang, Yueqiu Mu, Kun Xu

    Abstract: The current optical communication systems minimize bit or symbol errors without considering the semantic meaning behind digital bits, thus transmitting a lot of unnecessary information. We propose and experimentally demonstrate a semantic optical fiber communication (SOFC) system. Instead of encoding information into bits for transmission, semantic information is extracted from the source using de… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

  42. arXiv:2212.07967  [pdf, ps, other

    eess.SY cs.LG cs.MA

    Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNet

    Authors: Kaidi Xu, Nguyen Van Huynh, Geoffrey Ye Li

    Abstract: In heterogeneous networks (HetNets), the overlap of small cells and the macro cell causes severe cross-tier interference. Although there exist some approaches to address this problem, they usually require global channel state information, which is hard to obtain in practice, and get the sub-optimal power allocation policy with high computational complexity. To overcome these limitations, we propos… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  43. arXiv:2212.02078  [pdf, other

    eess.IV cs.AI cs.CV

    LE-UDA: Label-efficient unsupervised domain adaptation for medical image segmentation

    Authors: Ziyuan Zhao, Fangcheng Zhou, Kaixin Xu, Zeng Zeng, Cuntai Guan, S. Kevin Zhou

    Abstract: While deep learning methods hitherto have achieved considerable success in medical image segmentation, they are still hampered by two limitations: (i) reliance on large-scale well-labeled datasets, which are difficult to curate due to the expert-driven and time-consuming nature of pixel-level annotations in clinical practices, and (ii) failure to generalize from one domain to another, especially w… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted by IEEE Transactions on Medical Imaging, 2022

  44. arXiv:2212.00059  [pdf, other

    eess.IV cs.CV

    Single Slice Thigh CT Muscle Group Segmentation with Domain Adaptation and Self-Training

    Authors: Qi Yang, Xin Yu, Ho Hin Lee, Leon Y. Cai, Kaiwen Xu, Shunxing Bao, Yuankai Huo, Ann Zenobia Moore, Sokratis Makrogiannis, Luigi Ferrucci, Bennett A. Landman

    Abstract: Objective: Thigh muscle group segmentation is important for assessment of muscle anatomy, metabolic disease and aging. Many efforts have been put into quantifying muscle tissues with magnetic resonance (MR) imaging including manual annotation of individual muscles. However, leveraging publicly available annotations in MR images to achieve muscle group segmentation on single slice computed tomograp… ▽ More

    Submitted 30 November, 2022; originally announced December 2022.

  45. arXiv:2211.16666  [pdf, ps, other

    cs.IT eess.SP

    Secrecy Rate Maximization of RIS-assisted SWIPT Systems: A Two-Timescale Beamforming Design Approach

    Authors: Ming-Min Zhao, Kaidi Xu, Yunlong Cai, Yong Niu, Lajos Hanzo

    Abstract: Reconfigurable intelligent surfaces (RISs) achieve high passive beamforming gains for signal enhancement or interference nulling by dynamically adjusting their reflection coefficients. Their employment is particularly appealing for improving both the wireless security and the efficiency of radio frequency (RF)-based wireless power transfer. Motivated by this, we conceive and investigate a RIS-assi… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 16 pages, 12 figures, accepted for publication in IEEE Transactions on Wireless Communications

  46. arXiv:2211.05962  [pdf

    eess.IV cs.CV

    Feature-aggregated spatiotemporal spine surface estimation for wearable patch ultrasound volumetric imaging

    Authors: Baichuan Jiang, Keshuai Xu, Ahbay Moghekar, Peter Kazanzides, Emad Boctor

    Abstract: Clear identification of bone structures is crucial for ultrasound-guided lumbar interventions, but it can be challenging due to the complex shapes of the self-shadowing vertebra anatomy and the extensive background speckle noise from the surrounding soft tissue structures. Therefore, we propose to use a patch-like wearable ultrasound solution to capture the reflective bone surfaces from multiple i… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

  47. arXiv:2211.05256  [pdf, other

    eess.IV cs.CV

    Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Cheng-Ming Chiang, Hsien-Kai Kuo, Yu-Syuan Xu, Man-Yu Lee, Allen Lu, Chia-Ming Cheng, Chih-Cheng Chen, Jia-Ying Yong, Hong-Han Shuai, Wen-Huang Cheng, Zhuang Jia, Tianyu Xu, Yijian Zhang, Long Bao, Heng Sun, Diankai Zhang, Si Gao, Shaoli Liu, Biao Wu, Xiaofeng Zhang, Chengjian Zheng, Kaidi Lu, Ning Wang , et al. (29 additional authors not shown)

    Abstract: Video super-resolution is one of the most popular tasks on mobile devices, being widely used for an automatic improvement of low-bitrate and low-resolution video streams. While numerous solutions have been proposed for this problem, they are usually quite computationally demanding, demonstrating low FPS rates and power efficiency on mobile devices. In this Mobile AI challenge, we address this prob… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.08826, arXiv:2105.07809, arXiv:2211.04470, arXiv:2211.03885

  48. arXiv:2211.03885  [pdf, other

    cs.CV eess.IV

    Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Ziyao Yi, Yan Xiang, Zibin Liu, Shaoqing Li, Keming Shi, Dehui Kong, Ke Xu, Minsu Kwon, Yaqi Wu, Jiesi Zheng, Zhihao Fan, Xun Wu, Feng Zhang, Albert No, Minhyeok Cho, Zewen Chen, Xiaze Zhang, Ran Li , et al. (13 additional authors not shown)

    Abstract: The role of mobile cameras increased dramatically over the past few years, leading to more and more research in automatic image quality enhancement and RAW photo processing. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based image signal processing (ISP) pipeline replacing the standard mobile ISPs that can run on modern smartphone GPUs using TensorFlow Lite. Th… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

  49. arXiv:2211.02297  [pdf, other

    eess.IV

    SDRTV-to-HDRTV Conversion via Spatial-Temporal Feature Fusion

    Authors: Kepeng Xu, Li Xu, Gang He, Chang Wu, Zijia Ma, Ming Sun, Yu-Wing Tai

    Abstract: HDR(High Dynamic Range) video can reproduce realistic scenes more realistically, with a wider gamut and broader brightness range. HDR video resources are still scarce, and most videos are still stored in SDR (Standard Dynamic Range) format. Therefore, SDRTV-to-HDRTV Conversion (SDR video to HDR video) can significantly enhance the user's video viewing experience. Since the correlation between adja… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: 8 pages

  50. arXiv:2209.08326  [pdf, other

    eess.AS cs.CL

    Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition

    Authors: Ye Bai, Jie Li, Wen**g Han, Hao Ni, Kaituo Xu, Zhuo Zhang, Cheng Yi, Xiaorui Wang

    Abstract: While transformers and their variant conformers show promising performance in speech recognition, the parameterized property leads to much memory cost during training and inference. Some works use cross-layer weight-sharing to reduce the parameters of the model. However, the inevitable loss of capacity harms the model performance. To address this issue, this paper proposes a parameter-efficient co… ▽ More

    Submitted 17 September, 2022; originally announced September 2022.

    Comments: accepted in INTERSPEECH 2022