Showing 1–50 of 92 results for author: Jiang, S

  1. arXiv:2406.17225  [pdf, other]

    eess.IV cs.CV

    Multimodal Cross-Task Interaction for Survival Analysis in Whole Slide Pathological Images

    Authors: Songhan Jiang, Zhengyu Gan, Linghan Cai, Yifeng Wang, Yongbing Zhang

    Abstract: Survival prediction, utilizing pathological images and genomic profiles, is increasingly important in cancer analysis and prognosis. Despite significant progress, precise survival analysis still faces two main challenges: (1) The massive number of pixels in whole slide images (WSIs) complicates the processing of pathological images, making it difficult to generate an effective representation of the tu…

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.07823  [pdf, other]

    cs.CL cs.SD eess.AS

    PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding

    Authors: Trang Le, Daniel Lazar, Suyoun Kim, Shan Jiang, Duc Le, Adithya Sagar, Aleksandr Livshits, Ahmed Aly, Akshat Shrivastava

    Abstract: Spoken Language Understanding (SLU) is a critical component of voice assistants; it consists of converting speech to semantic parses for task execution. Previous works have explored end-to-end models to improve the quality and robustness of SLU models with Deliberation, however these models have remained autoregressive, resulting in higher latencies. In this work we introduce PRoDeliberation, a no… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2404.16223  [pdf, other]

    cs.CV eess.IV

    Deep RAW Image Super-Resolution. A NTIRE 2024 Challenge Survey

    Authors: Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte, Jianxing Zhang, Jia Li, Fan Wang, Xiaopeng Li, Zikun Liu, Hyunhee Park, Sejun Song, Changho Kim, Zhijuan Huang, Hongyuan Yu, Cheng Wan, Wending Xiang, Jiamin Lin, Hang Zhong, Qiaosong Zhang, Yue Sun, Xuanwu Yin, Kunlong Zuo, Senyan Xu, Siyuan Jiang, Zhijing Sun, Jiaying Zhu, et al. (10 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 RAW Image Super-Resolution Challenge, highlighting the proposed solutions and results. New methods for RAW Super-Resolution could be essential in modern Image Signal Processing (ISP) pipelines; however, this problem is not as explored as in the RGB domain. The goal of this challenge is to upscale RAW Bayer images by 2x, considering unknown degradations such as nois…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 - NTIRE Workshop

  4. arXiv:2403.18339  [pdf, other]

    eess.IV cs.CV

    H2ASeg: Hierarchical Adaptive Interaction and Weighting Network for Tumor Segmentation in PET/CT Images

    Authors: Jinpeng Lu, Jingyun Chen, Linghan Cai, Songhan Jiang, Yongbing Zhang

    Abstract: Positron emission tomography (PET) combined with computed tomography (CT) imaging is routinely used in cancer diagnosis and prognosis by providing complementary information. Automatically segmenting tumors in PET/CT images can significantly improve examination efficiency. Traditional multi-modal segmentation solutions mainly rely on concatenation operations for modality fusion, which fail to effec… ▽ More

    Submitted 28 March, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures

  5. arXiv:2403.06798  [pdf, other]

    eess.IV cs.CV cs.LG

    Dynamic Perturbation-Adaptive Adversarial Training on Medical Image Classification

    Authors: Shuai Li, Xiaoguang Ma, Shancheng Jiang, Lu Meng

    Abstract: Remarkable successes were made in Medical Image Classification (MIC) recently, mainly due to wide applications of convolutional neural networks (CNNs). However, adversarial examples (AEs) exhibited imperceptible similarity with raw data, raising serious concerns on network robustness. Although adversarial training (AT), in responding to malevolent AEs, was recognized as an effective approach to im… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, 2 tables

  6. arXiv:2402.19434  [pdf, other]

    cs.IT eess.SP

    Digital Twin Aided Massive MIMO: CSI Compression and Feedback

    Authors: Shuaifeng Jiang, Ahmed Alkhateeb

    Abstract: Deep learning (DL) approaches have demonstrated high performance in compressing and reconstructing the channel state information (CSI) and reducing the CSI feedback overhead in massive MIMO systems. One key challenge, however, with the DL approaches is the demand for extensive training data. Collecting this real-world CSI data incurs significant overhead that hinders the DL approaches from scaling… ▽ More

    Submitted 29 February, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted in ICC 2024. Dataset and code files will be available soon on the DeepMIMO website https://www.deepmimo.net/

  7. arXiv:2312.17266  [pdf]

    eess.IV cs.AI cs.CV cs.RO

    Automatic laminectomy cutting plane planning based on artificial intelligence in robot assisted laminectomy surgery

    Authors: Zhuofu Li, Yonghong Zhang, Chengxia Wang, Shanshan Liu, Xiongkang Song, Xuquan Ji, Shuai Jiang, Woquan Zhong, Lei Hu, Weishi Li

    Abstract: Objective: This study aims to use artificial intelligence to realize the automatic planning of laminectomy, and verify the method. Methods: We propose a two-stage approach for automatic laminectomy cutting plane planning. The first stage was the identification of key points. 7 key points were manually marked on each CT image. The Spatial Pyramid Upsampling Network (SPU-Net) algorithm developed by… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  8. arXiv:2312.05829  [pdf, other]

    cs.IT eess.SP

    EM Based p-norm-like Constraint RLS Algorithm for Sparse System Identification

    Authors: Shuyang Jiang, Kung Yao

    Abstract: In this paper, the recursive least squares (RLS) algorithm is considered in the sparse system identification setting. The cost function of RLS algorithm is regularized by a $p$-norm-like ($0 \leq p \leq 1$) constraint of the estimated system parameters. In order to minimize the regularized cost function, we transform it into a penalized maximum likelihood (ML) problem, which is solved by the expec… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 11 pages, 3 figures, journal manuscript

  9. arXiv:2312.03225  [pdf, other]

    cs.RO eess.SY

    Snake Robot with Tactile Perception Navigates on Large-scale Challenging Terrain

    Authors: Shuo Jiang, Adarsh Salagame, Alireza Ramezani, Lawson Wong

    Abstract: Along with the advancement of robot skin technology, there has been notable progress in the development of snake robots featuring body-surface tactile perception. In this study, we proposed a locomotion control framework for snake robots that integrates tactile perception to augment their adaptability to various terrains. Our approach embraces a hierarchical reinforcement learning (HRL) architectu… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  10. arXiv:2312.03223  [pdf, other]

    cs.RO eess.SY

    Hierarchical RL-Guided Large-scale Navigation of a Snake Robot

    Authors: Shuo Jiang, Adarsh Salagame, Alireza Ramezani, Lawson Wong

    Abstract: Classical snake robot control leverages mimicking snake-like gaits tuned for specific environments. However, to operate adaptively in unstructured environments, gait generation must be dynamically scheduled. In this work, we present a four-layer hierarchical control scheme to enable the snake robot to navigate freely in large-scale environments. The proposed model decomposes navigation into global… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: text overlap with arXiv:2311.14878

  11. arXiv:2311.08075  [pdf, ps, other]

    eess.IV cs.CV cs.HC

    GlanceSeg: Real-time microaneurysm lesion segmentation with gaze-map-guided foundation model for early detection of diabetic retinopathy

    Authors: Hongyang Jiang, Mengdi Gao, Zirong Liu, Chen Tang, Xiaoqing Zhang, Shuai Jiang, Wu Yuan, Jiang Liu

    Abstract: Early-stage diabetic retinopathy (DR) presents challenges in clinical diagnosis due to inconspicuous and minute microangioma lesions, resulting in limited research in this area. Additionally, the potential of emerging foundation models, such as the segment anything model (SAM), in medical scenarios remains rarely explored. In this work, we propose a human-in-the-loop, label-free early DR diagnosis… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 12 pages, 10 figures

  12. arXiv:2311.05432  [pdf, other]

    cs.CV eess.IV

    Dual Pipeline Style Transfer with Input Distribution Differentiation

    Authors: ShiQi Jiang, JunJie Kang, YuJian Li

    Abstract: The color and texture dual pipeline architecture (CTDP) suppresses texture representation and artifacts through masked total variation loss (Mtv), and further experiments have shown that smooth input can almost completely eliminate texture representation. We have demonstrated through experiments that smooth input is not the key reason for removing texture representations, but rather the distributi… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  13. arXiv:2311.00483  [pdf, other]

    eess.IV cs.CV

    DEFN: Dual-Encoder Fourier Group Harmonics Network for Three-Dimensional Indistinct-Boundary Object Segmentation

    Authors: Xiaohua Jiang, Yihao Guo, Jian Huang, Yuting Wu, Meiyi Luo, Zhaoyang Xu, Qianni Zhang, Xingru Huang, Hong He, Shaowei Jiang, Jing Ye, Mang Xiao

    Abstract: The precise spatial and quantitative delineation of indistinct-boundary medical objects is paramount for the accuracy of diagnostic protocols, efficacy of surgical interventions, and reliability of postoperative assessments. Despite their significance, the effective segmentation and instantaneous three-dimensional reconstruction are significantly impeded by the paucity of representative samples in… ▽ More

    Submitted 19 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 36 pages, 16 figures, 7 tables

    MSC Class: 68; 92 ACM Class: I.4; J.3

  14. arXiv:2310.20427  [pdf, other]

    eess.IV cs.CV cs.LG

    Assessing and Enhancing Robustness of Deep Learning Models with Corruption Emulation in Digital Pathology

    Authors: Peixiang Huang, Songtao Zhang, Yulu Gan, Rui Xu, Rongqi Zhu, Wenkang Qin, Limei Guo, Shan Jiang, Lin Luo

    Abstract: Deep learning in digital pathology brings intelligence and automation as substantial enhancements to pathological analysis, the gold standard of clinical diagnosis. However, multiple steps from tissue preparation to slide imaging introduce various image corruptions, making it difficult for deep neural network (DNN) models to achieve stable diagnostic results for clinical use. In order to assess an… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  15. arXiv:2310.06339  [pdf, other]

    eess.IV cs.LG

    Automatic nodule identification and differentiation in ultrasound videos to facilitate per-nodule examination

    Authors: Siyuan Jiang, Yan Ding, Yuling Wang, Lei Xu, Wenli Dai, Wanru Chang, Jianfeng Zhang, Jie Yu, Jianqiao Zhou, Chunquan Zhang, Ping Liang, Dexing Kong

    Abstract: Ultrasound is a vital diagnostic technique in health screening, with the advantages of being non-invasive, cost-effective, and radiation-free, and is therefore widely applied in the diagnosis of nodules. However, it relies heavily on the expertise and clinical experience of the sonographer. In ultrasound images, a single nodule might present heterogeneous appearances in different cross-sectional views w…

    Submitted 10 October, 2023; originally announced October 2023.

  16. arXiv:2310.04992  [pdf, other]

    eess.IV cs.CV

    VisionFM: a Multi-Modal Multi-Task Vision Foundation Model for Generalist Ophthalmic Artificial Intelligence

    Authors: Jianing Qiu, Jian Wu, Hao Wei, Peilun Shi, Minqing Zhang, Yunyun Sun, Lin Li, Hanruo Liu, Hongyi Liu, Simeng Hou, Yuyang Zhao, Xuehui Shi, Junfang Xian, Xiaoxia Qu, Sirui Zhu, Lijie Pan, Xiaoniao Chen, Xiaojia Zhang, Shuai Jiang, Kebing Wang, Chenlong Yang, Mingqiang Chen, Sujie Fan, Jianhua Hu, Aiguo Lv, et al. (17 additional authors not shown)

    Abstract: We present VisionFM, a foundation model pre-trained with 3.4 million ophthalmic images from 560,457 individuals, covering a broad range of ophthalmic diseases, modalities, imaging devices, and demography. After pre-training, VisionFM provides a foundation to foster multiple ophthalmic artificial intelligence (AI) applications, such as disease screening and diagnosis, disease prognosis, subclassifi… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  17. arXiv:2306.16846  [pdf, other]

    cs.CV eess.IV

    Lightweight texture transfer based on texture feature preset

    Authors: ShiQi Jiang

    Abstract: In the task of texture transfer, reference texture images typically exhibit highly repetitive texture features, and the texture transfer results from different content images under the same style also share remarkably similar texture patterns. Encoding such highly similar texture features often requires deep layers and a large number of channels, making it also the main source of the entire mod…

    Submitted 1 January, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

  18. arXiv:2306.10515  [pdf, other]

    eess.SP cs.CV

    Vision Guided MIMO Radar Beamforming for Enhanced Vital Signs Detection in Crowds

    Authors: Shuaifeng Jiang, Ahmed Alkhateeb, Daniel W. Bliss, Yu Rong

    Abstract: Radar as a remote sensing technology has been used to analyze human activity for decades. Despite all the great features such as motion sensitivity, privacy preservation, penetrability, and more, radar has limited spatial degrees of freedom compared to optical sensors and thus makes it challenging to sense crowded environments without prior information. In this paper, we develop a novel dual-sensi… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

  19. Zero-shot Medical Image Translation via Frequency-Guided Diffusion Models

    Authors: Yunxiang Li, Hua-Chieh Shao, Xiao Liang, Liyuan Chen, Ruiqi Li, Steve Jiang, Jing Wang, You Zhang

    Abstract: Recently, the diffusion model has emerged as a superior generative model that can produce high quality and realistic images. However, for medical image translation, the existing diffusion models are deficient in accurately retaining structural information since the structure details of source domain images are lost during the forward diffusion process and cannot be fully recovered through learned… ▽ More

    Submitted 27 October, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Journal ref: IEEE Transactions on Medical Imaging, 2023

  20. arXiv:2303.08140  [pdf, other]

    eess.IV cs.LG physics.bio-ph

    Digital staining in optical microscopy using deep learning -- a review

    Authors: Lucas Kreiss, Shaowei Jiang, Xiang Li, Shiqi Xu, Kevin C. Zhou, Alexander Mühlberg, Kyung Chul Lee, Kanghyun Kim, Amey Chaware, Michael Ando, Laura Barisoni, Seung Ah Lee, Guoan Zheng, Kyle Lafata, Oliver Friedrich, Roarke Horstmeyer

    Abstract: Until recently, conventional biochemical staining had the undisputed status as well-established benchmark for most biomedical problems related to clinical diagnostics, fundamental research and biotechnology. Despite this role as gold-standard, staining protocols face several challenges, such as a need for extensive, manual processing of samples, substantial time delays, altered tissue homeostasis,… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Review article, 4 main Figures, 3 Tables, 2 supplementary figures

  21. arXiv:2303.00334  [pdf, other]

    eess.IV cs.CV

    Online Streaming Video Super-Resolution with Convolutional Look-Up Table

    Authors: Guanghao Yin, Zefan Qu, Xinyang Jiang, Shan Jiang, Zhenhua Han, Ningxin Zheng, Xiaohong Liu, Huan Yang, Yuqing Yang, Dongsheng Li, Lili Qiu

    Abstract: Online video streaming has fundamental limitations on the transmission bandwidth and computational capacity and super-resolution is a promising potential solution. However, applying existing video super-resolution methods to online streaming is non-trivial. Existing video codecs and streaming protocols (e.g., WebRTC) dynamically change the video quality both spatially and temporally, which leads to…

    Submitted 25 July, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

  22. arXiv:2302.01493  [pdf]

    eess.IV cs.CV physics.med-ph

    Deep Learning (DL)-based Automatic Segmentation of the Internal Pudendal Artery (IPA) for Reduction of Erectile Dysfunction in Definitive Radiotherapy of Localized Prostate Cancer

    Authors: Anjali Balagopal, Michael Dohopolski, Young Suk Kwon, Steven Montalvo, Howard Morgan, Ti Bai, Dan Nguyen, Xiao Liang, Xinran Zhong, Mu-Han Lin, Neil Desai, Steve Jiang

    Abstract: Background and purpose: Radiation-induced erectile dysfunction (RiED) is commonly seen in prostate cancer patients. Clinical trials have been developed in multiple institutions to investigate whether dose-sparing to the internal-pudendal-arteries (IPA) will improve retention of sexual potency. The IPA is usually not considered a conventional organ-at-risk (OAR) due to segmentation difficulty. In t… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

  23. arXiv:2301.11283  [pdf, other]

    eess.SP cs.IT cs.LG

    Real-Time Digital Twins: Vision and Research Directions for 6G and Beyond

    Authors: Ahmed Alkhateeb, Shuaifeng Jiang, Gouranga Charan

    Abstract: This article presents a vision where real-time digital twins of the physical wireless environments are continuously updated using multi-modal sensing data from the distributed infrastructure and user devices, and are used to make communication and sensing decisions. This vision is mainly enabled by the advances in precise 3D maps, multi-modal sensing, ray-tracing computations, and machine…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: The 6G digital twin research platform will be available soon on https://deepverse6g.net/

  24. arXiv:2301.07682  [pdf, other]

    eess.SP cs.IT

    Digital Twin Based Beam Prediction: Can we Train in the Digital World and Deploy in Reality?

    Authors: Shuaifeng Jiang, Ahmed Alkhateeb

    Abstract: Realizing the potential gains of large-scale MIMO systems requires the accurate estimation of their channels or the fine adjustment of their narrow beams. This, however, is typically associated with high channel acquisition/beam sweeping overhead that scales with the number of antennas. Machine and deep learning represent promising approaches to overcome these challenges thanks to their powerful a…

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: The dataset is available on the DeepSense 6G website

  25. arXiv:2211.16806  [pdf, other]

    eess.IV cs.CV cs.LG

    Toward Robust Diagnosis: A Contour Attention Preserving Adversarial Defense for COVID-19 Detection

    Authors: Kun Xiang, Xing Zhang, Jinwen She, Jinpeng Liu, Haohan Wang, Shiqi Deng, Shancheng Jiang

    Abstract: As the COVID-19 pandemic puts pressure on healthcare systems worldwide, the computed tomography image based AI diagnostic system has become a sustainable solution for early diagnosis. However, the model-wise vulnerability under adversarial perturbation hinders its deployment in practical situations. The existing adversarial training strategies are difficult to generalize to the medical imaging field…

    Submitted 30 November, 2022; originally announced November 2022.

    Comments: Accepted to AAAI 2023

  26. arXiv:2211.13487  [pdf, other]

    eess.SP cs.IT

    Sensing Aided Reconfigurable Intelligent Surfaces for 3GPP 5G Transparent Operation

    Authors: Shuaifeng Jiang, Ahmed Hindy, Ahmed Alkhateeb

    Abstract: Can reconfigurable intelligent surfaces (RISs) operate in a standalone mode that is completely transparent to the 3GPP 5G initial access process? Realizing that may greatly simplify the deployment and operation of these surfaces and reduce the infrastructure control overhead. This paper investigates the feasibility of building standalone/transparent RIS systems and shows that one key challenge lie… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: The RIS dataset and script files will be available soon. arXiv admin note: text overlap with arXiv:2211.07563

  27. arXiv:2211.07563  [pdf, other]

    eess.SP cs.IT

    Camera Aided Reconfigurable Intelligent Surfaces: Computer Vision Based Fast Beam Selection

    Authors: Shuaifeng Jiang, Ahmed Hindy, Ahmed Alkhateeb

    Abstract: Reconfigurable intelligent surfaces (RISs) have attracted increasing interest due to their ability to improve the coverage, reliability, and energy efficiency of millimeter wave (mmWave) communication systems. However, designing the RIS beamforming typically requires large channel estimation or beam training overhead, which degrades the efficiency of these systems. In this paper, we propose to equ… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: 6 pages, 6 figures. The RIS dataset and code files will be available soon!

  28. arXiv:2210.05673  [pdf]

    eess.IV cs.CV stat.AP

    Performance Deterioration of Deep Learning Models after Clinical Deployment: A Case Study with Auto-segmentation for Definitive Prostate Cancer Radiotherapy

    Authors: Biling Wang, Michael Dohopolski, Ti Bai, Junjie Wu, Raquibul Hannan, Neil Desai, Aurelie Garant, Daniel Yang, Dan Nguyen, Mu-Han Lin, Robert Timmerman, Xinlei Wang, Steve Jiang

    Abstract: We evaluated the temporal performance of a deep learning (DL) based artificial intelligence (AI) model for auto segmentation in prostate radiotherapy, seeking to correlate its efficacy with changes in clinical landscapes. Our study involved 1328 prostate cancer patients who underwent definitive radiotherapy from January 2006 to August 2022 at the University of Texas Southwestern Medical Center. We… ▽ More

    Submitted 16 November, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

  29. arXiv:2209.11321  [pdf, other]

    cs.IT eess.SP

    Sensing Aided OTFS Channel Estimation for Massive MIMO Systems

    Authors: Shuaifeng Jiang, Ahmed Alkhateeb

    Abstract: Orthogonal time frequency space (OTFS) modulation has the potential to enable robust communications in highly-mobile scenarios. Estimating the channels for OTFS systems, however, is associated with high pilot signaling overhead that scales with the maximum delay and Doppler spreads. This becomes particularly challenging for massive MIMO systems where the overhead also scales with the number of ant… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

    Comments: submitted to IEEE

  30. arXiv:2207.04829  [pdf, ps, other]

    cs.IT eess.SP

    Low-complexity Joint Phase Adjustment and Receive Beamforming for Directional Modulation Networks via IRS

    Authors: Rongen Dong, Shaohua Jiang, Xinhai Hua, Yin Teng, Feng Shu, Jiangzhou Wang

    Abstract: Intelligent reflecting surface (IRS) is a revolutionary and low-cost technology for boosting the spectrum and energy efficiencies in future wireless communication networks. In order to create controllable multipath transmission in the conventional line-of-sight (LOS) wireless communication environment, an IRS-aided directional modulation (DM) network is considered. In this paper, to improve the tra…

    Submitted 11 July, 2022; originally announced July 2022.

  31. arXiv:2207.00172  [pdf, other]

    cs.CV eess.SY

    Turbo: Opportunistic Enhancement for Edge Video Analytics

    Authors: Yan Lu, Shiqi Jiang, Ting Cao, Yuanchao Shu

    Abstract: Edge computing is being widely used for video analytics. To alleviate the inherent tension between accuracy and cost, various video analytics pipelines have been proposed to optimize the usage of GPU on edge nodes. Nonetheless, we find that GPU compute resources provisioned for edge nodes are commonly under-utilized due to video content variations, subsampling and filtering at different places of… ▽ More

    Submitted 29 June, 2022; originally announced July 2022.

  32. arXiv:2205.09420  [pdf, other]

    cs.IT eess.SP

    Multicast Scheduling over Multiple Channels: A Distribution-Embedding Deep Reinforcement Learning Method

    Authors: Ran Li, Chuan Huang, Xiaoqi Qin, Shengpei Jiang

    Abstract: Multicasting is an efficient technique for simultaneously transmitting common messages from the base station (BS) to multiple mobile users (MUs). Multicast scheduling over multiple channels, which aims to jointly minimize the energy consumption of the BS and the latency of serving asynchronized requests from the MUs, is formulated as an infinite-horizon Markov decision process (MDP) problem with a… ▽ More

    Submitted 21 August, 2023; v1 submitted 19 May, 2022; originally announced May 2022.

  33. arXiv:2205.09377  [pdf, other]

    cs.IT eess.SP

    Coexistence between Task- and Data-Oriented Communications: A Whittle's Index Guided Multi-Agent Reinforcement Learning Approach

    Authors: Ran Li, Chuan Huang, Xiaoqi Qin, Shengpei Jiang, Nan Ma, Shuguang Cui

    Abstract: We investigate the coexistence of task-oriented and data-oriented communications in an IoT system that shares a group of channels, and study the scheduling problem to jointly optimize the weighted age of incorrect information (AoII) and throughput, which are the performance metrics of the two types of communications, respectively. This problem is formulated as a Markov decision problem, which is di…

    Submitted 19 May, 2022; originally announced May 2022.

  34. arXiv:2204.03947  [pdf, other]

    physics.optics eess.IV

    Lensless coherent diffraction imaging based on spatial light modulator with unknown modulation curve

    Authors: Hao Sha, Chao He, Shaowei Jiang, Pengming Song, Shuai Liu, Wenzhen Zou, Peiwu Qin, Haoqian Wang, Yongbing Zhang

    Abstract: Lensless imaging is a popular research field for the advantages of small size, wide field-of-view and low aberration in recent years. However, some traditional lensless imaging methods suffer from slow convergence, mechanical errors and conjugate solution interference, which limit its further application and development. In this work, we proposed a lensless imaging method based on spatial light mo… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

  35. arXiv:2203.08921  [pdf, other]

    eess.IV cs.CV

    Hybrid Pixel-Unshuffled Network for Lightweight Image Super-Resolution

    Authors: Bin Sun, Yulun Zhang, Songyao Jiang, Yun Fu

    Abstract: Convolutional neural network (CNN) has achieved great success on image super-resolution (SR). However, most deep CNN-based SR models take massive computations to obtain high performance. Downsampling features for multi-resolution fusion is an efficient and effective way to improve the performance of visual recognition. Still, it is counter-intuitive in the SR task, which needs to project a low-res… ▽ More

    Submitted 29 November, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

  36. arXiv:2203.05548  [pdf, other]

    eess.SP cs.IT

    LiDAR Aided Future Beam Prediction in Real-World Millimeter Wave V2I Communications

    Authors: Shuaifeng Jiang, Gouranga Charan, Ahmed Alkhateeb

    Abstract: This paper presents the first large-scale real-world evaluation for using LiDAR data to guide the mmWave beam prediction task. A machine learning (ML) model that leverages the LiDAR sensory data to predict the current and future beams was developed. Based on the large-scale real-world dataset, DeepSense 6G, this model was evaluated in a vehicle-to-infrastructure communication scenario with highly-… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: The dataset and code files will be available on the DeepSense 6G website https://deepsense6g.net/

  37. arXiv:2203.04295  [pdf, other]

    eess.IV cs.CV

    Region Specific Optimization (RSO)-based Deep Interactive Registration

    Authors: Ti Bai, Muhan Lin, Xiao Liang, Biling Wang, Michael Dohopolski, Bin Cai, Dan Nguyen, Steve Jiang

    Abstract: Medical image registration is a fundamental and vital task which will affect the efficacy of many downstream clinical tasks. Deep learning (DL)-based deformable image registration (DIR) methods have been investigated, showing state-of-the-art performance. A test time optimization (TTO) technique was proposed to further improve the DL models' performance. Despite the substantial accuracy improvemen… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  38. arXiv:2202.04567  [pdf]

    cs.LG eess.SY math.OC

    Optimal Hyperparameters and Structure Setting of Multi-Objective Robust CNN Systems via Generalized Taguchi Method and Objective Vector Norm

    Authors: Sheng-Guo Wang, Shanshan Jiang

    Abstract: Recently, Machine Learning (ML), Artificial Intelligence (AI), and Convolutional Neural Network (CNN) have made huge progress with broad applications, where their systems have deep learning structures and a large number of hyperparameters that determine the quality and performance of the CNNs and AI systems. These systems may have multi-objective ML and AI performance needs. There is a key require… ▽ More

    Submitted 10 February, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: 10 pages. Corresponding Author: Sheng-Guo Wang, [email protected]. To add the arXiv stamp to the first page

    MSC Class: 68T20 ACM Class: I.2.m

  39. arXiv:2201.09376  [pdf, other]

    eess.IV cs.CV

    ReconFormer: Accelerated MRI Reconstruction Using Recurrent Transformer

    Authors: Pengfei Guo, Yiqun Mei, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel

    Abstract: Accelerating the magnetic resonance image (MRI) reconstruction process is a challenging ill-posed inverse problem due to the excessive under-sampling operation in k-space. In this paper, we propose a recurrent transformer model, namely ReconFormer, for MRI reconstruction which can iteratively reconstruct high-fidelity magnetic resonance images from highly under-sampled k-space data. In particular, th…

    Submitted 27 January, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

  40. arXiv:2112.08133  [pdf]

    physics.ins-det eess.IV physics.optics

    Ptychographic sensor for large-scale lensless microbial monitoring with high spatiotemporal resolution

    Authors: Shaowei Jiang, Chengfei Guo, Zichao Bian, Ruihai Wang, Jiakai Zhu, Pengming Song, Patrick Hu, Derek Hu, Zibang Zhang, Kazunori Hoshino, Bin Feng, Guoan Zheng

    Abstract: Traditional microbial detection methods often rely on the overall property of microbial cultures and cannot resolve individual growth events at high spatiotemporal resolution. As a result, they require bacteria to grow to confluence and then interpret the results. Here, we demonstrate the application of an integrated ptychographic sensor for lensless cytometric analysis of microbial cultures over a…

    Submitted 15 December, 2021; originally announced December 2021.

    Comments: 18 pages, 6 figures

  41. A Survey: Deep Learning for Hyperspectral Image Classification with Few Labeled Samples

    Authors: Sen Jia, Shuguo Jiang, Zhijie Lin, Nanying Li, Meng Xu, Shiqi Yu

    Abstract: With the rapid development of deep learning technology and improvement in computing capability, deep learning has been widely used in the field of hyperspectral image (HSI) classification. In general, deep learning models often contain many trainable parameters and require a massive number of labeled samples to achieve optimal performance. However, in regard to HSI classification, a large number o…

    Submitted 3 December, 2021; originally announced December 2021.

    Journal ref: Neurocomputing, Volume 448, 2021, Pages 179-204

  42. arXiv:2111.14803  [pdf, other]

    eess.SP cs.IT

    Computer Vision Aided Beam Tracking in A Real-World Millimeter Wave Deployment

    Authors: Shuaifeng Jiang, Ahmed Alkhateeb

    Abstract: Millimeter-wave (mmWave) and terahertz (THz) communications require beamforming to acquire adequate receive signal-to-noise ratio (SNR). To find the optimal beam, current beam management solutions perform beam training over a large number of beams in pre-defined codebooks. The beam training overhead increases the access latency and can become infeasible for high-mobility applications. To reduce or…

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: Submitted to IEEE. The dataset and code files will be available on the DeepSense 6G website http://deepsense6g.net/

  43. arXiv:2110.11558  [pdf]

    eess.IV cs.CV q-bio.QM

    MHAttnSurv: Multi-Head Attention for Survival Prediction Using Whole-Slide Pathology Images

    Authors: Shuai Jiang, Arief A. Suriawinata, Saeed Hassanpour

    Abstract: In pathology, whole-slide images (WSI) based survival prediction has attracted increasing interest. However, given the large size of WSIs and the lack of pathologist annotations, extracting the prognostic information from WSIs remains a challenging task. Previous studies have used multiple instance learning approaches to combine the information from multiple randomly sampled patches, but different…

    Submitted 21 October, 2021; originally announced October 2021.

  44. High-throughput lensless whole slide imaging via continuous height-varying modulation of tilted sensor

    Authors: Shaowei Jiang, Chengfei Guo, Patrick Hu, Derek Hu, Pengming Song, Tianbo Wang, Zichao Bian, Zibang Zhang, Guoan Zheng

    Abstract: We report a new lensless microscopy configuration by integrating the concepts of transverse translational ptychography and defocus multi-height phase retrieval. In this approach, we place a tilted image sensor under the specimen for linearly-increasing phase modulation along one lateral direction. Similar to the operation of ptychography, we laterally translate the specimen and acquire the diffrac…

    Submitted 28 September, 2021; originally announced October 2021.

  45. arXiv:2108.13816  [pdf]

    eess.AS

    Maximum F1-score training for end-to-end mispronunciation detection and diagnosis of L2 English speech

    Authors: Bi-Cheng Yan, Shao-Wei Fan Jiang, Fu-An Chao, Berlin Chen

    Abstract: End-to-end (E2E) neural models are increasingly attracting attention as a promising modeling approach for mispronunciation detection and diagnosis (MDD). Typically, these models are trained by optimizing a cross-entropy criterion, which corresponds to improving the log-likelihood of the training data. However, there is a discrepancy between the objectives of model training and the MDD evaluation,…

    Submitted 9 July, 2022; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted by IEEE International Conference on Multimedia and Expo (ICME 2022)

  46. arXiv:2108.11627  [pdf]

    cs.MM cs.SD eess.AS

    Towards Robust Mispronunciation Detection and Diagnosis for L2 English Learners with Accent-Modulating Methods

    Authors: Shao-Wei Fan Jiang, Bi-Cheng Yan, Tien-Hong Lo, Fu-An Chao, Berlin Chen

    Abstract: With the acceleration of globalization, more and more people are willing or required to learn second languages (L2). One of the major remaining challenges facing current mispronunciation detection and diagnosis (MDD) models for use in computer-assisted pronunciation training (CAPT) is to handle speech from L2 learners with a diverse set of accents. In this paper, we set out to mitigate the adverse effects o…

    Submitted 3 October, 2021; v1 submitted 26 August, 2021; originally announced August 2021.

    Comments: Accepted by ASRU 2021

  47. arXiv:2108.03783  [pdf]

    eess.SY

    Dynamic Modelling of Combined Cycle Power Plant for Load Frequency Control With Large Penetration of Renewable Energy

    Authors: Songyao Jiang, Takeyoshi Kato

    Abstract: As concerns about climate change and energy shortage grow stronger, the incorporation of renewable energy in the power system in the future is foreseeable. In a hybrid power system with a large penetration of PV generation, a PV panel is regarded as a negative load in the power system. With the accurate prediction of PV output power, load frequency control could be done by controlling the thermal…

    Submitted 8 August, 2021; originally announced August 2021.

    Comments: 2014 7th JUACEP Research Workshop at Nagoya University, Japan

  48. arXiv:2107.13465  [pdf]

    cs.CV eess.IV

    A Proof-of-Concept Study of Artificial Intelligence Assisted Contour Revision

    Authors: Ti Bai, Anjali Balagopal, Michael Dohopolski, Howard E. Morgan, Rafe McBeth, Jun Tan, Mu-Han Lin, David J. Sher, Dan Nguyen, Steve Jiang

    Abstract: Automatic segmentation of anatomical structures is critical for many medical applications. However, the results are not always clinically acceptable and require tedious manual revision. Here, we present a novel concept called artificial intelligence assisted contour revision (AIACR) and demonstrate its feasibility. The proposed clinical workflow of AIACR is as follows: given an initial contour that…

    Submitted 28 July, 2021; originally announced July 2021.

  49. arXiv:2107.01531  [pdf]

    eess.AS cs.SD eess.SP

    TENET: A Time-reversal Enhancement Network for Noise-robust ASR

    Authors: Fu-An Chao, Shao-Wei Fan Jiang, Bi-Cheng Yan, Jeih-weih Hung, Berlin Chen

    Abstract: Due to the unprecedented breakthroughs brought about by deep learning, speech enhancement (SE) techniques have been developed rapidly and play an important role prior to acoustic modeling to mitigate noise effects on speech. To increase the perceptual quality of speech, current state-of-the-art in the SE field adopts adversarial training by connecting an objective metric to the discriminator. Howe…

    Submitted 14 September, 2021; v1 submitted 3 July, 2021; originally announced July 2021.

    Comments: Accepted to ASRU 2021

  50. arXiv:2106.08886  [pdf, other]

    eess.IV cs.CV

    Over-and-Under Complete Convolutional RNN for MRI Reconstruction

    Authors: Pengfei Guo, Jeya Maria Jose Valanarasu, Puyang Wang, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel

    Abstract: Reconstructing magnetic resonance (MR) images from undersampled data is a challenging problem due to various artifacts introduced by the under-sampling operation. Recent deep learning-based methods for MR image reconstruction usually leverage a generic auto-encoder architecture which captures low-level features at the initial layers and high-level features at the deeper layers. Such networks focus…

    Submitted 24 June, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted to MICCAI 2021