Skip to main content

Showing 1–50 of 107 results for author: Shi, Z

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19043  [pdf

    eess.IV cs.AI cs.CV cs.DB

    CMRxRecon2024: A Multi-Modality, Multi-View K-Space Dataset Boosting Universal Machine Learning for Accelerated Cardiac MRI

    Authors: Zi Wang, Fanwen Wang, Chen Qin, Jun Lyu, Ouyang Cheng, Shuo Wang, Yan Li, Mengyao Yu, Haoyu Zhang, Kunyuan Guo, Zhang Shi, Qirong Li, Ziqiang Xu, Ya**g Zhang, Hao Li, Sha Hua, Binghua Chen, Longyu Sun, Mengting Sun, Qin Li, Ying-Hua Chu, Wenjia Bai, **g Qin, Xiahai Zhuang, Claudia Prieto , et al. (7 additional authors not shown)

    Abstract: Cardiac magnetic resonance imaging (MRI) has emerged as a clinically gold-standard technique for diagnosing cardiac diseases, thanks to its ability to provide diverse information with multiple modalities and anatomical views. Accelerated cardiac MRI is highly expected to achieve time-efficient and patient-friendly imaging, and then advanced image reconstruction approaches are required to recover h… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures, 2 tables

  2. arXiv:2406.18993  [pdf, ps, other

    eess.SP

    Interference Cancellation Based Neural Receiver for Superimposed Pilot in Multi-Layer Transmission

    Authors: Han Xiao, Wenqiang Tian, Shi **, Wendong Liu, Jia Shen, Zhihua Shi, Zhi Zhang

    Abstract: In this paper, an interference cancellation based neural receiver for superimposed pilot (SIP) in multi-layer transmission is proposed, where the data and pilot are non-orthogonally superimposed in the same time-frequency resource. Specifically, to deal with the intra-layer and inter-layer interference of SIP under multi-layer transmission, the interference cancellation with superimposed symbol ai… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2404.15284  [pdf, other

    eess.SP cs.AI

    Global 4D Ionospheric STEC Prediction based on DeepONet for GNSS Rays

    Authors: Dijia Cai, Zenghui Shi, Haiyang Fu, Huan Liu, Hongyi Qian, Yun Sui, Feng Xu, Ya-Qiu **

    Abstract: The ionosphere is a vitally dynamic charged particle region in the Earth's upper atmosphere, playing a crucial role in applications such as radio communication and satellite navigation. The Slant Total Electron Contents (STEC) is an important parameter for characterizing wave propagation, representing the integrated electron density along the ray of radio signals passing through the ionosphere. Th… ▽ More

    Submitted 12 March, 2024; originally announced April 2024.

  4. arXiv:2404.07956  [pdf, other

    cs.LG cs.AI cs.RO eess.SY math.OC

    Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation

    Authors: Lujie Yang, Hongkai Dai, Zhouxing Shi, Cho-Jui Hsieh, Russ Tedrake, Huan Zhang

    Abstract: Learning-based neural network (NN) control policies have shown impressive empirical performance in a wide range of tasks in robotics and control. However, formal (Lyapunov) stability guarantees over the region-of-attraction (ROA) for NN controllers with nonlinear dynamical systems are challenging to obtain, and most existing approaches rely on expensive solvers such as sums-of-squares (SOS), mixed… ▽ More

    Submitted 4 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: Paper accepted by ICML 2024

  5. arXiv:2404.01082  [pdf, other

    eess.IV

    The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023

    Authors: Jun Lyu, Chen Qin, Shuo Wang, Fanwen Wang, Yan Li, Zi Wang, Kunyuan Guo, Cheng Ouyang, Michael Tänzer, Meng Liu, Longyu Sun, Mengting Sun, Qin Li, Zhang Shi, Sha Hua, Hao Li, Zhensen Chen, Zhenlin Zhang, Bingyu Xin, Dimitris N. Metaxas, George Yiasemis, Jonas Teuwen, Li** Zhang, Weitian Chen, Yidong Zhao , et al. (25 additional authors not shown)

    Abstract: Cardiac MRI, crucial for evaluating heart structure and function, faces limitations like slow imaging and motion artifacts. Undersampling reconstruction, especially data-driven algorithms, has emerged as a promising solution to accelerate scans and enhance imaging performance using highly under-sampled data. Nevertheless, the scarcity of publicly available cardiac k-space datasets and evaluation p… ▽ More

    Submitted 16 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 25 pages, 17 figures

  6. arXiv:2404.00863  [pdf, other

    eess.AS

    Voice Conversion Augmentation for Speaker Recognition on Defective Datasets

    Authors: Ruijie Tao, Zhan Shi, Yidi Jiang, Tianchi Liu, Haizhou Li

    Abstract: Modern speaker recognition system relies on abundant and balanced datasets for classification training. However, diverse defective datasets, such as partially-labelled, small-scale, and imbalanced datasets, are common in real-world applications. Previous works usually studied specific solutions for each scenario from the algorithm perspective. However, the root cause of these problems lies in data… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 5 pages

  7. arXiv:2403.20198  [pdf, other

    cs.IT eess.SY

    Minimizing End-to-End Latency for Joint Source-Channel Coding Systems

    Authors: Kaiyi Chi, Qianqian Yang, Yuanchao Shu, Zhaohui Yang, Zhiguo Shi

    Abstract: While existing studies have highlighted the advantages of deep learning (DL)-based joint source-channel coding (JSCC) schemes in enhancing transmission efficiency, they often overlook the crucial aspect of resource management during the deployment phase. In this paper, we propose an approach to minimize the transmission latency in an uplink JSCC-based system. We first analyze the correlation betwe… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 7 Pages, 5 Figures, accepted by 2024 IEEE ICC Workshop

  8. arXiv:2403.18134  [pdf, other

    eess.IV cs.CV

    Integrative Graph-Transformer Framework for Histopathology Whole Slide Image Representation and Classification

    Authors: Zhan Shi, **gwei Zhang, Jun Kong, Fusheng Wang

    Abstract: In digital pathology, the multiple instance learning (MIL) strategy is widely used in the weakly supervised histopathology whole slide image (WSI) classification task where giga-pixel WSIs are only labeled at the slide level. However, existing attention-based MIL approaches often overlook contextual information and intrinsic spatial relationships between neighboring tissue tiles, while graph-based… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  9. arXiv:2403.13562  [pdf, other

    eess.SY

    Augmented Labeled Random Finite Sets and Its Application to Group Target Tracking

    Authors: Chaoqun Yang, Mengdie Xu, Xiaowei Liang, Zhiguo Shi, Heng Zhang, Xianghui Cao

    Abstract: This paper addresses the problem of group target tracking (GTT), wherein multiple closely spaced targets within a group pose a coordinated motion. To improve the tracking performance, the labeled random finite sets (LRFSs) theory is adopted, and this paper develops a new kind of LRFSs, i.e., augmented LRFSs, which introduces group information into the definition of LRFSs. Specifically, for each el… ▽ More

    Submitted 16 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  10. arXiv:2403.01093  [pdf, other

    eess.SP

    Variational Bayesian Learning Based Localization and Channel Reconstruction in RIS-aided Systems

    Authors: Yunfei Li, Yiting Luo, Xianda Wu, Zheng Shi, Shaodan Ma, Guanghua Yang

    Abstract: The emerging immersive and autonomous services have posed stringent requirements on both communications and localization. By considering the great potential of reconfigurable intelligent surface (RIS), this paper focuses on the joint channel estimation and localization for RIS-aided wireless systems. As opposed to existing works that treat channel estimation and localization independently, this pa… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  11. arXiv:2401.15619  [pdf, ps, other

    eess.SP

    A semidefinite programming approach for robust elliptic localization

    Authors: Wenxin Xiong, Jiajun He, Zhang-Lei Shi, Keyuan Hu, Hing Cheung So, Chi-Sing Leung

    Abstract: This short communication addresses the problem of elliptic localization with outlier measurements, whose occurrences are prevalent in various location-enabled applications and can significantly compromise the positioning performance if not adequately handled. In contrast to the reliance on $M$-estimation adopted in the majority of existing solutions, we take a different path, specifically explorin… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  12. arXiv:2401.15564  [pdf

    eess.SY cs.AI

    Design of UAV flight state recognition and trajectory prediction system based on trajectory feature construction

    Authors: Xingyu Zhou, Zhuoyong Shi

    Abstract: With the impact of artificial intelligence on the traditional UAV industry, autonomous UAV flight has become a current hot research field. Based on the demand for research on critical technologies for autonomous flying UAVs, this paper addresses the field of flight state recognition and trajectory prediction of UAVs. This paper proposes a method to improve the accuracy of UAV trajectory prediction… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  13. arXiv:2401.11960  [pdf, other

    cs.CV eess.IV

    Observation-Guided Meteorological Field Downscaling at Station Scale: A Benchmark and a New Method

    Authors: Zili Liu, Hao Chen, Lei Bai, Wenyuan Li, Keyan Chen, Zhengyi Wang, Wanli Ouyang, Zhengxia Zou, Zhenwei Shi

    Abstract: Downscaling (DS) of meteorological variables involves obtaining high-resolution states from low-resolution meteorological fields and is an important task in weather forecasting. Previous methods based on deep learning treat downscaling as a super-resolution task in computer vision and utilize high-resolution gridded meteorological fields as supervision to improve resolution at specific grid scales… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  14. arXiv:2312.15575  [pdf, other

    eess.IV cs.CV cs.LG

    Neural Born Series Operator for Biomedical Ultrasound Computed Tomography

    Authors: Zhijun Zeng, Yihang Zheng, Youjia Zheng, Yubing Li, Zuoqiang Shi, He Sun

    Abstract: Ultrasound Computed Tomography (USCT) provides a radiation-free option for high-resolution clinical imaging. Despite its potential, the computationally intensive Full Waveform Inversion (FWI) required for tissue property reconstruction limits its clinical utility. This paper introduces the Neural Born Series Operator (NBSO), a novel technique designed to speed up wave simulations, thereby facilita… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    ACM Class: I.4.5; J.3

  15. arXiv:2312.04377  [pdf, other

    cs.IT eess.SP

    HARQ-IR Aided Short Packet Communications: BLER Analysis and Throughput Maximization

    Authors: Fuchao He, Zheng Shi, Guanghua Yang, Xiaofan Li, Xinrong Ye, Shaodan Ma

    Abstract: This paper introduces hybrid automatic repeat request with incremental redundancy (HARQ-IR) to boost the reliability of short packet communications. The finite blocklength information theory and correlated decoding events tremendously preclude the analysis of average block error rate (BLER). Fortunately, the recursive form of average BLER motivates us to calculate its value through the trapezoidal… ▽ More

    Submitted 9 January, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 13 pages, 10 figures

  16. arXiv:2311.02389  [pdf, other

    eess.SY cs.GT cs.RO

    Multiplayer Homicidal Chauffeur Reach-Avoid Games: A Pursuit Enclosure Function Approach

    Authors: Rui Yan, Xiaoming Duan, Rui Zou, Xin He, Zongying Shi, Francesco Bullo

    Abstract: This paper presents a multiplayer Homicidal Chauffeur reach-avoid differential game, which involves Dubins-car pursuers and simple-motion evaders. The goal of the pursuers is to cooperatively protect a planar convex region from the evaders, who strive to reach the region. We propose a cooperative strategy for the pursuers based on subgames for multiple pursuers against one evader and optimal task… ▽ More

    Submitted 22 December, 2023; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: 17 pages, 5 figures

  17. arXiv:2310.15548  [pdf, ps, other

    eess.SP

    Knowledge-driven Meta-learning for CSI Feedback

    Authors: Han Xiao, Wenqiang Tian, Wendong Liu, Jiajia Guo, Zhi Zhang, Shi **, Zhihua Shi, Li Guo, Jia Shen

    Abstract: Accurate and effective channel state information (CSI) feedback is a key technology for massive multiple-input and multiple-output systems. Recently, deep learning (DL) has been introduced for CSI feedback enhancement through massive collected training data and lengthy training time, which is quite costly and impractical for realistic deployment. In this article, a knowledge-driven meta-learning a… ▽ More

    Submitted 25 October, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2301.13475

  18. arXiv:2310.10964  [pdf, other

    cs.IT eess.SP

    Spectral-Efficiency and Energy-Efficiency of Variable-Length XP-HARQ

    Authors: Jiahui Feng, Zheng Shi, Yaru Fu, Hong Wang, Guanghua Yang, Shaodan Ma

    Abstract: A variable-length cross-packet hybrid automatic repeat request (VL-XP-HARQ) is proposed to boost the spectral efficiency (SE) and the energy efficiency (EE) of communications. The SE is firstly derived in terms of the outage probabilities, with which the SE is proved to be upper bounded by the ergodic capacity (EC). Moreover, to facilitate the maximization of the SE, the asymptotic outage probabil… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  19. arXiv:2310.06259  [pdf, other

    eess.IV cs.SD eess.AS

    Cross-modal Cognitive Consensus guided Audio-Visual Segmentation

    Authors: Zhaofeng Shi, Qingbo Wu, Fanman Meng, Linfeng Xu, Hongliang Li

    Abstract: Audio-Visual Segmentation (AVS) aims to extract the sounding object from a video frame, which is represented by a pixel-wise segmentation mask for application scenarios such as multi-modal video editing, augmented reality, and intelligent robot systems. The pioneering work conducts this task through dense feature-level audio-visual interaction, which ignores the dimension gap between different mod… ▽ More

    Submitted 8 May, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: 14 pages

    MSC Class: 68U10 ACM Class: I.4.6

  20. arXiv:2309.16372  [pdf, other

    cs.CV eess.IV

    Aperture Diffraction for Compact Snapshot Spectral Imaging

    Authors: Tao Lv, Hao Ye, Quan Yuan, Zhan Shi, Yibo Wang, Shuming Wang, Xun Cao

    Abstract: We demonstrate a compact, cost-effective snapshot spectral imaging system named Aperture Diffraction Imaging Spectrometer (ADIS), which consists only of an imaging lens with an ultra-thin orthogonal aperture mask and a mosaic filter sensor, requiring no additional physical footprint compared to common RGB cameras. Then we introduce a new optical design that each point in the object space is multip… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: accepted by International Conference on Computer Vision (ICCV) 2023

  21. arXiv:2309.07141  [pdf

    eess.SP cs.AI cs.LG

    Design of Recognition and Evaluation System for Table Tennis Players' Motor Skills Based on Artificial Intelligence

    Authors: Zhuo-yong Shi, Ye-tao Jia, Ke-xin Zhang, Ding-han Wang, Long-meng Ji, Yong Wu

    Abstract: With the rapid development of electronic science and technology, the research on wearable devices is constantly updated, but for now, it is not comprehensive for wearable devices to recognize and analyze the movement of specific sports. Based on this, this paper improves wearable devices of table tennis sport, and realizes the pattern recognition and evaluation of table tennis players' motor skill… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    Comments: 34pages, 16figures

    MSC Class: 93-01 ACM Class: G.1; H.4

  22. arXiv:2308.04304  [pdf, other

    cs.IT cs.CR cs.LG eess.IV

    The Model Inversion Eavesdrop** Attack in Semantic Communication Systems

    Authors: Yuhao Chen, Qianqian Yang, Zhiguo Shi, Jiming Chen

    Abstract: In recent years, semantic communication has been a popular research topic for its superiority in communication efficiency. As semantic communication relies on deep learning to extract meaning from raw messages, it is vulnerable to attacks targeting deep learning models. In this paper, we introduce the model inversion eavesdrop** attack (MIEA) to reveal the risk of privacy leaks in the semantic c… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted by 2023 IEEE Global Communications Conference (GLOBECOM)

  23. arXiv:2308.02140  [pdf, ps, other

    cs.IT eess.SP

    Deep Reinforcement Learning Empowered Rate Selection of XP-HARQ

    Authors: Da Wu, Jiahui Feng, Zheng Shi, Hongjiang Lei, Guanghua Yang, Shaodan Ma

    Abstract: The complex transmission mechanism of cross-packet hybrid automatic repeat request (XP-HARQ) hinders its optimal system design. To overcome this difficulty, this letter attempts to use the deep reinforcement learning (DRL) to solve the rate selection problem of XP-HARQ over correlated fading channels. In particular, the long term average throughput (LTAT) is maximized by properly choosing the incr… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  24. arXiv:2308.02131  [pdf, other

    cs.IT eess.SP

    Graph Convolutional Network Enabled Power-Constrained HARQ Strategy for URLLC

    Authors: Yi Chen, Zheng Shi, Hong Wang, Yaru Fu, Guanghua Yang, Shaodan Ma, Haichuan Ding

    Abstract: In this paper, a power-constrained hybrid automatic repeat request (HARQ) transmission strategy is developed to support ultra-reliable low-latency communications (URLLC). In particular, we aim to minimize the delivery latency of HARQ schemes over time-correlated fading channels, meanwhile ensuring the high reliability and limited power consumption. To ease the optimization, the simple asymptotic o… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  25. arXiv:2307.13220  [pdf

    eess.IV cs.AI physics.med-ph

    One for Multiple: Physics-informed Synthetic Data Boosts Generalizable Deep Learning for Fast MRI Reconstruction

    Authors: Zi Wang, Xiaotong Yu, Chengyan Wang, Weibo Chen, Jiazheng Wang, Ying-Hua Chu, Hongwei Sun, Rushuai Li, Peiyong Li, Fan Yang, Haiwei Han, Taishan Kang, Jianzhong Lin, Chen Yang, Shufu Chang, Zhang Shi, Sha Hua, Yan Li, Juan Hu, Liuhong Zhu, Jianjun Zhou, Mei**g Lin, Jiefeng Guo, Congbo Cai, Zhong Chen , et al. (3 additional authors not shown)

    Abstract: Magnetic resonance imaging (MRI) is a widely used radiological modality renowned for its radiation-free, comprehensive insights into the human body, facilitating medical diagnoses. However, the drawback of prolonged scan times hinders its accessibility. The k-space undersampling offers a solution, yet the resultant artifacts necessitate meticulous removal during image reconstruction. Although Deep… ▽ More

    Submitted 28 February, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 38 pages, 19 figures, 5 tables

  26. arXiv:2306.13296  [pdf, other

    eess.SP

    Semantic-aware Transmission for Robust Point Cloud Classification

    Authors: Tianxiao Han, Kaiyi Chi, Qianqian Yang, Zhiguo Shi

    Abstract: As three-dimensional (3D) data acquisition devices become increasingly prevalent, the demand for 3D point cloud transmission is growing. In this study, we introduce a semantic-aware communication system for robust point cloud classification that capitalizes on the advantages of pre-trained Point-BERT models. Our proposed method comprises four main components: the semantic encoder, channel encoder,… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: submitted to globecom 2023

  27. arXiv:2305.03546  [pdf, other

    eess.IV cs.CV

    Breast Cancer Immunohistochemical Image Generation: a Benchmark Dataset and Challenge Review

    Authors: Chuang Zhu, Shengjie Liu, Zekuan Yu, Feng Xu, Arpit Aggarwal, Germán Corredor, Anant Madabhushi, Qixun Qu, Hongwei Fan, Fangda Li, Yueheng Li, Xianchao Guan, Yongbing Zhang, Vivek Kumar Singh, Farhan Akram, Md. Mostafa Kamal Sarker, Zhongyue Shi, Mulan **

    Abstract: For invasive breast cancer, immunohistochemical (IHC) techniques are often used to detect the expression level of human epidermal growth factor receptor-2 (HER2) in breast tissue to formulate a precise treatment plan. From the perspective of saving manpower, material and time costs, directly generating IHC-stained images from Hematoxylin and Eosin (H&E) stained images is a valuable research direct… ▽ More

    Submitted 22 September, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: 12 pages, 12 figures, 2tables

  28. arXiv:2305.01871  [pdf

    physics.med-ph eess.IV

    Convolutional neural network-based single-shot speckle tracking for x-ray phase-contrast imaging

    Authors: Serena Qinyun Z. Shi, Nadav Shapira, Peter B. Noël, Sebastian Meyer

    Abstract: X-ray phase-contrast imaging offers enhanced sensitivity for weakly-attenuating materials, such as breast and brain tissue, but has yet to be widely implemented clinically due to high coherence requirements and expensive x-ray optics. Speckle-based phase contrast imaging has been proposed as an affordable and simple alternative; however, obtaining high-quality phase-contrast images requires accura… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  29. arXiv:2304.12184  [pdf, other

    eess.SP cs.AI cs.IT cs.LG

    Active RIS-aided EH-NOMA Networks: A Deep Reinforcement Learning Approach

    Authors: Zhaoyuan Shi, Huabing Lu, Xianzhong Xie, Helin Yang, Chongwen Huang, Jun Cai, Zhiguo Ding

    Abstract: An active reconfigurable intelligent surface (RIS)-aided multi-user downlink communication system is investigated, where non-orthogonal multiple access (NOMA) is employed to improve spectral efficiency, and the active RIS is powered by energy harvesting (EH). The problem of joint control of the RIS's amplification matrix and phase shift matrix is formulated to maximize the communication success ra… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  30. arXiv:2304.11341  [pdf, ps, other

    cs.IT eess.SP

    Performance Analysis and Optimal Design of HARQ-IR-Aided Terahertz Communications

    Authors: Ziyang Song, Zheng Shi, Jiaji Su, Qing** Dou, Guanghua Yang, Haichuan Ding, Shaodan Ma

    Abstract: Terahertz (THz) communications are envisioned to be a promising technology for 6G thanks to its broad bandwidth. However, the large path loss, antenna misalignment, and atmospheric influence of THz communications severely deteriorate its reliability. To address this, hybrid automatic repeat request (HARQ) is recognized as an effective technique to ensure reliable THz communications. This paper del… ▽ More

    Submitted 22 April, 2023; originally announced April 2023.

    Comments: Blockage, hybrid automatic repeat request (HARQ), outage probability, terahertz (THz) communications

  31. arXiv:2303.14095  [pdf, other

    cs.CV cs.RO eess.IV

    PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View

    Authors: Ze Shi, Hao Shi, Kailun Yang, Zhe Yin, Yining Lin, Kaiwei Wang

    Abstract: Visual place recognition has gained significant attention in recent years as a crucial technology in autonomous driving and robotics. Currently, the two main approaches are the perspective view retrieval (P2P) paradigm and the equirectangular image retrieval (E2E) paradigm. However, it is practical and natural to assume that users only have consumer-grade pinhole cameras to obtain query perspectiv… ▽ More

    Submitted 28 July, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Accepted to ITSC 2023. Code and datasets will be made available at https://github.com/zafirshi/PanoVPR

  32. arXiv:2302.12662  [pdf, other

    eess.IV cs.CV

    FedDBL: Communication and Data Efficient Federated Deep-Broad Learning for Histopathological Tissue Classification

    Authors: Tianpeng Deng, Yanqi Huang, Guoqiang Han, Zhenwei Shi, Jiatai Lin, Qi Dou, Zaiyi Liu, Xiao-**g Guo, C. L. Philip Chen, Chu Han

    Abstract: Histopathological tissue classification is a fundamental task in computational pathology. Deep learning-based models have achieved superior performance but centralized training with data centralization suffers from the privacy leakage problem. Federated learning (FL) can safeguard privacy by kee** training samples locally, but existing FL-based frameworks require a large number of well-annotated… ▽ More

    Submitted 17 December, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

  33. arXiv:2302.12004  [pdf

    cs.LG eess.SP

    Knowledge Distillation-based Information Sharing for Online Process Monitoring in Decentralized Manufacturing System

    Authors: Zhangyue Shi, Yuxuan Li, Chenang Liu

    Abstract: In advanced manufacturing, the incorporation of sensing technology provides an opportunity to achieve efficient in-situ process monitoring using machine learning methods. Meanwhile, the advances of information technologies also enable a connected and decentralized environment for manufacturing systems, making different manufacturing units in the system collaborate more closely. In a decentralized… ▽ More

    Submitted 25 July, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

  34. arXiv:2302.02608  [pdf, ps, other

    cs.IT eess.SP

    Cooperative Task-Oriented Communication for Multi-Modal Data with Transmission Control

    Authors: Shiqi Wang, Qianqian Yang, Zhiguo Shi, Zhaohui Yang, Zhaoyang Zhang

    Abstract: Real-time intelligence applications in Internet of Things (IoT) environment depend on timely data communication. However, it is challenging to transmit and analyse massive data of various modalities. Recently proposed task-oriented communication methods based on deep learning have showed its superiority in communication efficiency. In this paper, we propose a cooperative task-oriented communicatio… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

  35. arXiv:2301.13475  [pdf, ps, other

    eess.SP

    A Knowledge-Driven Meta-Learning Method for CSI Feedback

    Authors: Han Xiao, Wenqiang Tian, Wendong Liu, Zhi Zhang, Zhihua Shi, Li Guo, Jia Shen

    Abstract: Accurate and effective channel state information (CSI) feedback is a key technology for massive multiple-input and multiple-output (MIMO) systems. Recently, deep learning (DL) has been introduced to enhance CSI feedback in massive MIMO application, where the massive collected training data and lengthy training time are costly and impractical for realistic deployment. In this paper, a knowledge-dri… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

  36. arXiv:2211.10287  [pdf, other

    eess.IV

    Generative Model Based Highly Efficient Semantic Communication Approach for Image Transmission

    Authors: Tianxiao Han, Jiancheng Tang, Qianqian Yang, Yi** Duan, Zhaoyang Zhang, Zhiguo Shi

    Abstract: Deep learning (DL) based semantic communication methods have been explored to transmit images efficiently in recent years. In this paper, we propose a generative model based semantic communication to further improve the efficiency of image transmission and protect private information. In particular, the transmitter extracts the interpretable latent representation from the original image by a gener… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: submitted to ICASSP 2023

  37. arXiv:2211.00648  [pdf

    eess.IV physics.optics

    Non-line-of-sight imaging with arbitrary illumination and detection pattern

    Authors: Xintong Liu, Jianyu Wang, Zuoqiang Shi, Xing Fu, Lingyun Qiu

    Abstract: Non-line-of-sight (NLOS) imaging aims at reconstructing targets obscured from the direct line of sight. Existing NLOS imaging algorithms require dense measurements at rectangular grid points in a large area of the relay surface, which severely hinders their availability to variable relay scenarios in practical applications such as robotic vision, autonomous driving, rescue operations and remote se… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: main article: 32 pages with 8 figures; supplementary information: 49 pages with 26 figures

  38. arXiv:2210.15903  [pdf, other

    eess.AS cs.SD eess.SP

    Speaker recognition with two-step multi-modal deep cleansing

    Authors: Ruijie Tao, Kong Aik Lee, Zhan Shi, Haizhou Li

    Abstract: Neural network-based speaker recognition has achieved significant improvement in recent years. A robust speaker representation learns meaningful knowledge from both hard and easy samples in the training set to achieve good performance. However, noisy samples (i.e., with wrong labels) in the training set induce confusion and cause the network to learn the incorrect representation. In this paper, we… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 figures

  39. arXiv:2210.06385  [pdf, other

    eess.IV cs.CV physics.med-ph

    The Extreme Cardiac MRI Analysis Challenge under Respiratory Motion (CMRxMotion)

    Authors: Shuo Wang, Chen Qin, Chengyan Wang, Kang Wang, Haoran Wang, Chen Chen, Cheng Ouyang, Xutong Kuang, Chengliang Dai, Yuanhan Mo, Zhang Shi, Chenchen Dai, Xinrong Chen, He Wang, Wenjia Bai

    Abstract: The quality of cardiac magnetic resonance (CMR) imaging is susceptible to respiratory motion artifacts. The model robustness of automated segmentation techniques in face of real-world respiratory motion artifacts is unclear. This manuscript describes the design of extreme cardiac MRI analysis challenge under respiratory motion (CMRxMotion Challenge). The challenge aims to establish a public benchm… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Summary of CMRxMotion Challenge Design

  40. arXiv:2209.13638  [pdf, ps, other

    cs.IT eess.SP

    Outage Probability Analysis of HARQ-Aided Terahertz Communications

    Authors: Ziyang Song, Zheng Shi, Qing** Dou, Guanghua Yang, Yunfei Li, Shaodan Ma

    Abstract: Although terahertz (THz) communications can provide mobile broadband services, it usually has a large path loss and is vulnerable to antenna misalignment. This significantly degrades the reception reliability. To address this issue, the hybrid automatic repeat request (HARQ) is proposed to further enhance the reliability of THz communications. This paper provides an in-depth investigation on the o… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

  41. arXiv:2209.11382  [pdf, ps, other

    cs.IT eess.SP

    Zero-Forcing Based Downlink Virtual MIMO-NOMA Communications in IoT Networks

    Authors: Zheng Shi, Hong Wang, Yaru Fu, Guanghua Yang, Shaodan Ma, Fen Hou, Theodoros A. Tsiftsis

    Abstract: To support massive connectivity and boost spectral efficiency for internet of things (IoT), a downlink scheme combining virtual multiple-input multiple-output (MIMO) and nonorthogonal multiple access (NOMA) is proposed. All the single-antenna IoT devices in each cluster cooperate with each other to establish a virtual MIMO entity, and multiple independent data streams are requested by each cluster… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  42. Ziv-Zakai Bound for DOAs Estimation

    Authors: Zongyu Zhang, Zhiguo Shi, Yujie Gu

    Abstract: Lower bounds on the mean square error (MSE) play an important role in evaluating the direction-of-arrival (DOA) estimation performance. Among numerous bounds for DOA estimation, the local Cramer-Rao bound (CRB) is only tight asymptotically. By contrast, the existing global tight Ziv-Zakai bound (ZZB) is appropriate for evaluating the single source estimation only. In this paper, we derive an expli… ▽ More

    Submitted 6 December, 2022; v1 submitted 8 September, 2022; originally announced September 2022.

  43. arXiv:2209.01424  [pdf, ps, other

    eess.SP

    Dynamic Write-Voltage Design and Read-Voltage Optimization for MLC NAND Flash Memory

    Authors: Runbin Cai, Yi Fang, Zhifang Shi, Lin Dai, Guojun Han

    Abstract: To mitigate the impact of noise and interference on multi-level-cell (MLC) flash memory with the use of low-density parity-check (LDPC) codes, we propose a dynamic write-voltage design scheme considering the asymmetric property of raw bit error rate (RBER), which can obtain the optimal write voltage by minimizing a cost function. In order to further improve the decoding performance of flash memory… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: 12 pages, 6 figures, submitted to China Communication

  44. arXiv:2207.07370  [pdf, other

    eess.IV cs.CV

    CKD-TransBTS: Clinical Knowledge-Driven Hybrid Transformer with Modality-Correlated Cross-Attention for Brain Tumor Segmentation

    Authors: Jianwei Lin, Jiatai Lin, Cheng Lu, Hao Chen, Huan Lin, Bingchao Zhao, Zhenwei Shi, Bingjiang Qiu, Xipeng Pan, Zeyan Xu, Biao Huang, Changhong Liang, Guoqiang Han, Zaiyi Liu, Chu Han

    Abstract: Brain tumor segmentation (BTS) in magnetic resonance image (MRI) is crucial for brain tumor diagnosis, cancer management and research purposes. With the great success of the ten-year BraTS challenges as well as the advances of CNN and Transformer algorithms, a lot of outstanding BTS models have been proposed to tackle the difficulties of BTS in different technical aspects. However, existing studie… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  45. arXiv:2206.09867  [pdf, other

    eess.SP cs.CV

    WiFi-based Spatiotemporal Human Action Perception

    Authors: Yanling Hao, Zhiyuan Shi, Yuanwei Liu

    Abstract: WiFi-based sensing for human activity recognition (HAR) has recently become a hot topic as it brings great benefits when compared with video-based HAR, such as eliminating the demands of line-of-sight (LOS) and preserving privacy. Making the WiFi signals to 'see' the action, however, is quite coarse and thus still in its infancy. An end-to-end spatiotemporal WiFi signal neural network (STWNN) is p… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  46. arXiv:2205.12727  [pdf, other

    eess.AS cs.SD

    Semantic-preserved Communication System for Highly Efficient Speech Transmission

    Authors: Tianxiao Han, Qianqian Yang, Zhiguo Shi, Shibo He, Zhaoyang Zhang

    Abstract: Deep learning (DL) based semantic communication methods have been explored for the efficient transmission of images, text, and speech in recent years. In contrast to traditional wireless communication methods that focus on the transmission of abstract symbols, semantic communication approaches attempt to achieve better transmission efficiency by only sending the semantic-related information of the… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.03211

  47. arXiv:2205.11962  [pdf, other

    cs.CV eess.IV eess.SP

    A Wireless-Vision Dataset for Privacy Preserving Human Activity Recognition

    Authors: Yanling Hao, Zhiyuan Shi, Yuanwei Liu

    Abstract: Human Activity Recognition (HAR) has recently received remarkable attention in numerous applications such as assisted living and remote monitoring. Existing solutions based on sensors and vision technologies have obtained achievements but still suffering from considerable limitations in the environmental requirement. Wireless signals like WiFi-based sensing have emerged as a new paradigm since it… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  48. arXiv:2205.11945  [pdf, other

    cs.CV eess.SP

    GraSens: A Gabor Residual Anti-aliasing Sensing Framework for Action Recognition using WiFi

    Authors: Yanling Hao, Zhiyuan Shi, Xidong Mu, Yuanwei Liu

    Abstract: WiFi-based human action recognition (HAR) has been regarded as a promising solution in applications such as smart living and remote monitoring due to the pervasive and unobtrusive nature of WiFi signals. However, the efficacy of WiFi signals is prone to be influenced by the change in the ambient environment and varies over different sub-carriers. To remedy this issue, we propose an end-to-end Gabo… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  49. arXiv:2205.08390  [pdf, other

    eess.IV cs.CV

    HoVer-Trans: Anatomy-aware HoVer-Transformer for ROI-free Breast Cancer Diagnosis in Ultrasound Images

    Authors: Yuhao Mo, Chu Han, Yu Liu, Min Liu, Zhenwei Shi, Jiatai Lin, Bingchao Zhao, Chunwang Huang, Bingjiang Qiu, Yanfen Cui, Lei Wu, Xipeng Pan, Zeyan Xu, Xiaomei Huang, Zaiyi Liu, Ying Wang, Changhong Liang

    Abstract: Ultrasonography is an important routine examination for breast cancer diagnosis, due to its non-invasive, radiation-free and low-cost properties. However, the diagnostic accuracy of breast cancer is still limited due to its inherent limitations. It would be a tremendous success if we can precisely diagnose breast cancer by breast ultrasound images (BUS). Many learning-based computer-aided diagnost… ▽ More

    Submitted 15 July, 2022; v1 submitted 17 May, 2022; originally announced May 2022.

  50. arXiv:2204.14154  [pdf, other

    cs.IT eess.SP

    Outage Performance of Uplink Rate Splitting Multiple Access with Randomly Deployed Users

    Authors: Huabing Lu, Xianzhong Xie, Zhaoyuan Shi, Hongjian Lei, Nan Zhao, Jun Cai

    Abstract: With the rapid proliferation of smart devices in wireless networks, more powerful technologies are expected to fulfill the network requirements of high throughput, massive connectivity, and diversify quality of service. To this end, rate splitting multiple access (RSMA) is proposed as a promising solution to improve spectral efficiency and provide better fairness for the next-generation mobile net… ▽ More

    Submitted 10 April, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: 38 pages,8 figures