Showing 1–50 of 68 results for author: He, S

Searching in archive eess.
  1. arXiv:2407.00987  [pdf, other]

    cs.NI eess.SY

    Exploiting Dependency-Aware Priority Adjustment for Mixed-Criticality TSN Flow Scheduling

    Authors: Miao Guo, Yifei Sun, Chaojie Gu, Shibo He, Zhiguo Shi

    Abstract: Time-Sensitive Networking (TSN) serves as a one-size-fits-all solution for mixed-criticality communication, in which flow scheduling is vital to guarantee real-time transmissions. Traditional approaches statically assign priorities to flows based on their associated applications, resulting in significant queuing delays. In this paper, we observe that assigning different priorities to a flow leads…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by IWQoS'24

  2. arXiv:2406.18548  [pdf]

    eess.IV cs.CV

    Exploration of Multi-Scale Image Fusion Systems in Intelligent Medical Image Analysis

    Authors: Yuxiang Hu, Haowei Yang, Ting Xu, Shuyao He, Jiajie Yuan, Haozhang Deng

    Abstract: The diagnosis of brain cancer relies heavily on medical imaging techniques, with MRI being the most commonly used. It is necessary to perform automatic segmentation of brain tumors on MRI images. This project intends to build an MRI algorithm based on U-Net. The residual network and the module used to enhance the context information are combined, and the void space convolution pooling pyramid is a…

    Submitted 23 May, 2024; originally announced June 2024.

  3. arXiv:2406.00492  [pdf, other]

    eess.IV cs.CV cs.LG

    SAM-VMNet: Deep Neural Networks For Coronary Angiography Vessel Segmentation

    Authors: Xueying Zeng, Baixiang Huang, Yu Luo, Guangyu Wei, Songyan He, Yushuang Shao

    Abstract: Coronary artery disease (CAD) is one of the most prevalent diseases in the cardiovascular field and one of the major contributors to death worldwide. Computed Tomography Angiography (CTA) images are regarded as the authoritative standard for the diagnosis of coronary artery disease, and by performing vessel segmentation and stenosis detection on CTA images, physicians are able to diagnose coronary…

    Submitted 1 June, 2024; originally announced June 2024.

  4. arXiv:2404.16611  [pdf, ps, other]

    cs.IT eess.SP

    Towards Symbiotic SAGIN Through Inter-operator Resource and Service Sharing: Joint Orchestration of User Association and Radio Resources

    Authors: Shizhao He, Jungang Ge, Ying-Chang Liang, Dusit Niyato

    Abstract: The space-air-ground integrated network (SAGIN) is a pivotal architecture to support ubiquitous connectivity in the upcoming 6G era. Inter-operator resource and service sharing is a promising way to realize such a huge network, utilizing resources efficiently and reducing construction costs. Given the rationality of operators, the configuration of resources and services in SAGIN should focus on bo…

    Submitted 25 April, 2024; originally announced April 2024.

  5. arXiv:2404.14700  [pdf, other]

    eess.AS cs.AI cs.CL cs.LG cs.SD

    FlashSpeech: Efficient Zero-Shot Speech Synthesis

    Authors: Zhen Ye, Zeqian Ju, Haohe Liu, Xu Tan, Jianyi Chen, Yiwen Lu, Peiwen Sun, Jiahao Pan, Weizhen Bian, Shulin He, Qifeng Liu, Yike Guo, Wei Xue

    Abstract: Recent progress in large-scale zero-shot speech synthesis has been significantly advanced by language models and diffusion models. However, the generation process of both methods is slow and computationally intensive. Efficient speech synthesis using a lower computing budget to achieve quality on par with previous work remains a significant challenge. In this paper, we present FlashSpeech, a large…

    Submitted 24 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Efficient zero-shot speech synthesis

  6. arXiv:2404.12170  [pdf, other]

    eess.SP cs.IT

    Secure Semantic Communication for Image Transmission in the Presence of Eavesdroppers

    Authors: Shunpu Tang, Chen Liu, Qianqian Yang, Shibo He, Dusit Niyato

    Abstract: Semantic communication (SemCom) has emerged as a key technology for the forthcoming sixth-generation (6G) network, attributed to its enhanced communication efficiency and robustness against channel noise. However, the open nature of wireless channels renders them vulnerable to eavesdropping, posing a serious threat to privacy. To address this issue, we propose a novel secure semantic communication…

    Submitted 18 April, 2024; originally announced April 2024.

  7. arXiv:2404.11313  [pdf, other]

    eess.IV cs.AI

    NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

    Authors: Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, et al. (43 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment (S-UGC VQA), where various excellent solutions are submitted and evaluated on the collected dataset KVQ from a popular short-form video platform, i.e., Kuaishou/Kwai Platform. The KVQ database is divided into three parts, including 2926 videos for training, 420 videos for validation, and 854 videos for testing. The…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR2024 Workshop. The challenge report for CVPR NTIRE2024 Short-form UGC Video Quality Assessment Challenge

  8. arXiv:2404.10365  [pdf, other]

    cs.NI cs.LG eess.SP

    Learning Wireless Data Knowledge Graph for Green Intelligent Communications: Methodology and Experiments

    Authors: Yongming Huang, Xiaohu You, Hang Zhan, Shiwen He, Ningning Fu, Wei Xu

    Abstract: Intelligent communications have played a pivotal role in shaping the evolution of 6G networks. Native artificial intelligence (AI) within green communication systems must meet stringent real-time requirements. To achieve this, deploying lightweight and resource-efficient AI models is necessary. However, as wireless networks generate a multitude of data fields and indicators during operation, only…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 12 pages,11 figures

  9. arXiv:2404.08943  [pdf, other]

    math.OC eess.SY

    A Novel State-Centric Necessary Condition for Time-Optimal Control of Controllable Linear Systems Based on Augmented Switching Laws

    Authors: Yunan Wang, Chuxiong Hu, Yujie Lin, Zeyang Li, Shize Lin, Suqin He

    Abstract: Most existing necessary conditions for optimal control based on adjoining methods require both state information and costate information, yet the lack of costates for a given feasible trajectory in practice impedes the determination of optimality. This paper establishes a novel theoretical framework for time-optimal control of controllable linear systems, proposing the augmented switching law that…

    Submitted 13 April, 2024; originally announced April 2024.

  10. arXiv:2403.17675  [pdf, other]

    math.OC eess.SY

    Chattering Phenomena in Time-Optimal Control for High-Order Chain-of-Integrators Systems with Full State Constraints

    Authors: Yunan Wang, Chuxiong Hu, Zeyang Li, Yujie Lin, Shize Lin, Suqin He

    Abstract: Time-optimal control for high-order chain-of-integrators systems with full state constraints remains an open and challenging problem in the optimal control theory domain. The behaviors of optimal control in high-order problems lack precision characterization, even where the existence of the chattering phenomenon remains unknown and overlooked. This paper establishes a theoretical framework for cha…

    Submitted 29 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  11. arXiv:2402.14225  [pdf, other]

    eess.AS cs.SD

    SICRN: Advancing Speech Enhancement through State Space Model and Inplace Convolution Techniques

    Authors: Changjiang Zhao, Shulin He, Xueliang Zhang

    Abstract: Speech enhancement aims to improve speech quality and intelligibility, especially in noisy environments where background noise degrades speech signals. Currently, deep learning methods achieve great success in speech enhancement, e.g. the representative convolutional recurrent neural network (CRN) and its variants. However, CRN typically employs consecutive downsampling and upsampling convolution…

    Submitted 21 February, 2024; originally announced February 2024.

  12. arXiv:2312.15633  [pdf, other]

    cs.CV eess.IV

    MuLA-GAN: Multi-Level Attention GAN for Enhanced Underwater Visibility

    Authors: Ahsan Baidar Bakht, Zikai Jia, Muhayy ud Din, Waseem Akram, Lyes Saad Soud, Lakmal Seneviratne, Defu Lin, Shaoming He, Irfan Hussain

    Abstract: The underwater environment presents unique challenges, including color distortions, reduced contrast, and blurriness, hindering accurate analysis. In this work, we introduce MuLA-GAN, a novel approach that leverages the synergistic power of Generative Adversarial Networks (GANs) and Multi-Level Attention mechanisms for comprehensive underwater image enhancement. The integration of Multi-Level Atte…

    Submitted 25 December, 2023; originally announced December 2023.

  13. arXiv:2312.10979  [pdf, ps, other]

    cs.SD eess.AS

    3S-TSE: Efficient Three-Stage Target Speaker Extraction for Real-Time and Low-Resource Applications

    Authors: Shulin He, Jinjiang Liu, Hao Li, Yang Yang, Fei Chen, Xueliang Zhang

    Abstract: Target speaker extraction (TSE) aims to isolate a specific voice from multiple mixed speakers relying on a registered sample. Since voiceprint features usually vary greatly, current end-to-end neural networks require large model parameters which are computationally intensive and impractical for real-time applications, especially on resource-constrained platforms. In this paper, we address the TSE tas…

    Submitted 4 January, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024

  14. arXiv:2312.05062  [pdf, ps, other]

    eess.IV

    Deep Learning Enabled Semantic Communication Systems for Video Transmission

    Authors: Zhenguo Zhang, Qianqian Yang, Shibo He, Jiming Chen

    Abstract: Semantic communication has emerged as a promising approach for improving efficient transmission in the next generation of wireless networks. Inspired by the success of semantic communication in different areas, we aim to provide a new semantic communication scheme from the semantic level. In this paper, we propose a novel DL-based semantic communication system for video transmission, which compact…

    Submitted 8 December, 2023; originally announced December 2023.

  15. arXiv:2311.07039  [pdf, other]

    eess.SY

    Time-Optimal Control for High-Order Chain-of-Integrators Systems with Full State Constraints and Arbitrary Terminal States (Extended Version)

    Authors: Yunan Wang, Chuxiong Hu, Zeyang Li, Shize Lin, Suqin He, Yu Zhu

    Abstract: Time-optimal control for high-order chain-of-integrators systems with full state constraints and arbitrarily given terminal states remains a challenging problem in the optimal control theory domain, yet to be resolved. To enhance further comprehension of the problem, this paper establishes a novel notation system and theoretical framework, providing the switching manifold for high-order problems i…

    Submitted 28 March, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

  16. arXiv:2309.10393  [pdf, ps, other]

    cs.SD eess.AS

    Hierarchical Modeling of Spatial Cues via Spherical Harmonics for Multi-Channel Speech Enhancement

    Authors: Jiahui Pan, Shulin He, Hui Zhang, Xueliang Zhang

    Abstract: Multi-channel speech enhancement utilizes spatial information from multiple microphones to extract the target speech. However, most existing methods do not explicitly model spatial cues, instead relying on implicit learning from multi-channel spectra. To better leverage spatial information, we propose explicitly incorporating spatial modeling by applying spherical harmonic transforms (SHT) to the…

    Submitted 19 September, 2023; originally announced September 2023.

  17. arXiv:2309.10379  [pdf, ps, other]

    cs.SD eess.AS

    PDPCRN: Parallel Dual-Path CRN with Bi-directional Inter-Branch Interactions for Multi-Channel Speech Enhancement

    Authors: Jiahui Pan, Shulin He, Tianci Wu, Hui Zhang, Xueliang Zhang

    Abstract: Multi-channel speech enhancement seeks to utilize spatial information to distinguish target speech from interfering signals. While deep learning approaches like the dual-path convolutional recurrent network (DPCRN) have made strides, challenges persist in effectively modeling inter-channel correlations and amalgamating multi-level information. In response, we introduce the Parallel Dual-Path Convo…

    Submitted 19 September, 2023; originally announced September 2023.

  18. arXiv:2307.16228  [pdf, other]

    cs.MA cs.AI cs.LG eess.SY

    Robust Electric Vehicle Balancing of Autonomous Mobility-On-Demand System: A Multi-Agent Reinforcement Learning Approach

    Authors: Sihong He, Shuo Han, Fei Miao

    Abstract: Electric autonomous vehicles (EAVs) are getting attention in future autonomous mobility-on-demand (AMoD) systems due to their economic and societal benefits. However, EAVs' unique charging patterns (long charging time, high charging frequency, unpredictable charging behaviors, etc.) make it challenging to accurately predict the EAVs supply in E-AMoD systems. Furthermore, the mobility demand's pred…

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: accepted to International Conference on Intelligent Robots and Systems (IROS2023)

  19. arXiv:2307.16212  [pdf, other]

    cs.LG cs.AI cs.GT cs.MA eess.SY

    Robust Multi-Agent Reinforcement Learning with State Uncertainty

    Authors: Sihong He, Songyang Han, Sanbao Su, Shuo Han, Shaofeng Zou, Fei Miao

    Abstract: In real-world multi-agent reinforcement learning (MARL) applications, agents may not have perfect state information (e.g., due to inaccurate measurement or malicious attacks), which challenges the robustness of agents' policies. Though robustness is getting important in MARL deployment, little prior work has studied state uncertainties in MARL, neither in problem formulation nor algorithm design.…

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: 50 pages, Published in TMLR, Transactions on Machine Learning Research (06/2023)

  20. arXiv:2306.16250  [pdf, other]

    cs.SD eess.AS

    MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation

    Authors: Jun Chen, Wei Rao, Zilin Wang, Jiuxin Lin, Yukai Ju, Shulin He, Yannan Wang, Zhiyong Wu

    Abstract: The previous SpEx+ has yielded outstanding performance in speaker extraction and attracted much attention. However, it still encounters inadequate utilization of multi-scale information and speaker embedding. To this end, this paper proposes a new effective speaker extraction system with multi-scale interfusion and conditional speaker modulation (ConSM), which is called MC-SpEx. First of all, we d…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted by InterSpeech 2023

  21. arXiv:2306.08454  [pdf, other]

    cs.SD eess.AS

    Gesper: A Restoration-Enhancement Framework for General Speech Reconstruction

    Authors: Wenzhe Liu, Yupeng Shi, Jun Chen, Wei Rao, Shulin He, Andong Li, Yannan Wang, Zhiyong Wu

    Abstract: This paper describes a real-time General Speech Reconstruction (Gesper) system submitted to the ICASSP 2023 Speech Signal Improvement (SSI) Challenge. This novel proposed system is a two-stage architecture, in which the speech restoration is performed, and then cascaded by speech enhancement. We propose a complex spectral mapping-based generative adversarial network (CSM-GAN) as the speech restora…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted by InterSpeech 2023

  22. arXiv:2304.09324  [pdf, other]

    eess.IV cs.CV

    Computer-Vision Benchmark Segment-Anything Model (SAM) in Medical Images: Accuracy in 12 Datasets

    Authors: Sheng He, Rina Bao, Jingpeng Li, Jeffrey Stout, Atle Bjornerud, P. Ellen Grant, Yangming Ou

    Abstract: Background: The segment-anything model (SAM), introduced in April 2023, shows promise as a benchmark model and a universal solution to segment various natural images. It comes without previously-required re-training or fine-tuning specific to each new dataset. Purpose: To test SAM's accuracy in various medical image segmentation tasks and investigate potential factors that may affect its accurac…

    Submitted 5 May, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: Technical Report

  23. arXiv:2304.09156  [pdf, other]

    cs.RO eess.SY

    Using simulation to design an MPC policy for field navigation using GPS sensing

    Authors: Harry Zhang, Stefan Caldararu, Ishaan Mahajan, Shouvik Chatterjee, Thomas Hansen, Abhiraj Dashora, Sriram Ashokkumar, Luning Fang, Xiangru Xu, Shen He, Dan Negrut

    Abstract: Modeling a robust control system with a precise GPS-based state estimation capability in simulation can be useful in field navigation applications as it allows for testing and validation in a controlled environment. This testing process would enable navigation systems to be developed and optimized in simulation with direct transferability to real-world scenarios. The multi-physics simulation engin…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 10 pages,5 figures,submitted to ECCOMAS Thematic Conference on Multibody Dynamics

  24. arXiv:2304.07036  [pdf, other]

    eess.IV cs.CV cs.LG

    Hierarchical Agent-based Reinforcement Learning Framework for Automated Quality Assessment of Fetal Ultrasound Video

    Authors: Sijing Liu, Qilong Ying, Shuangchi He, Xin Yang, Dong Ni, Ruobing Huang

    Abstract: Ultrasound is the primary modality to examine fetal growth during pregnancy, while the image quality could be affected by various factors. Quality assessment is essential for controlling the quality of ultrasound images to guarantee both the perceptual and diagnostic values. Existing automated approaches often require heavy structural annotations and the predictions may not necessarily be consiste…

    Submitted 14 April, 2023; originally announced April 2023.

  25. arXiv:2304.01401  [pdf, other]

    eess.IV cs.CV

    U-Netmer: U-Net meets Transformer for medical image segmentation

    Authors: Sheng He, Rina Bao, P. Ellen Grant, Yangming Ou

    Abstract: The combination of the U-Net based deep learning models and Transformer is a new trend for medical image segmentation. U-Net can extract the detailed local semantic and texture information and Transformer can learn the long-range dependencies among pixels in the input image. However, directly adapting the Transformer for segmentation has a "token-flatten" problem (flattens the local patches into 1D…

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 10 pages, 5 figures, under review

  26. arXiv:2303.07704  [pdf, other]

    eess.AS cs.SD

    TEA-PSE 3.0: Tencent-Ethereal-Audio-Lab Personalized Speech Enhancement System For ICASSP 2023 DNS Challenge

    Authors: Yukai Ju, Jun Chen, Shimin Zhang, Shulin He, Wei Rao, Weixin Zhu, Yannan Wang, Tao Yu, Shidong Shang

    Abstract: This paper introduces the Unbeatable Team's submission to the ICASSP 2023 Deep Noise Suppression (DNS) Challenge. We expand our previous work, TEA-PSE, to its upgraded version -- TEA-PSE 3.0. Specifically, TEA-PSE 3.0 incorporates a residual LSTM after squeezed temporal convolution network (S-TCN) to enhance sequence modeling capabilities. Additionally, the local-global representation (LGR) struct…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted by ICASSP 2023

  27. arXiv:2303.06593  [pdf, other]

    eess.SP

    Domain-Knowledge-Aided Airborne Ground Moving Targets Tracking

    Authors: Jianduo Chai, Shaoming He, Hyo-Sang Shin

    Abstract: This paper investigates the problem of traffic surveillance using an unmanned aerial vehicle (UAV) and proposes a domain-knowledge-aided airborne ground moving targets tracking algorithm. To improve the accuracy of multiple targets tracking, the proposed algorithm incorporates domain knowledge into the joint probabilistic data association (JPDA) filter as state constraints. The domain knowledge co…

    Submitted 12 March, 2023; originally announced March 2023.

  28. arXiv:2302.14246  [pdf, other]

    eess.SY cs.RO math.OC

    i2LQR: Iterative LQR for Iterative Tasks in Dynamic Environments

    Authors: Yifan Zeng, Suiyi He, Han Hoang Nguyen, Yihan Li, Zhongyu Li, Koushil Sreenath, Jun Zeng

    Abstract: This work introduces a novel control strategy called Iterative Linear Quadratic Regulator for Iterative Tasks (i2LQR), which aims to improve closed-loop performance with local trajectory optimization for iterative tasks in a dynamic environment. The proposed algorithm is reference-free and utilizes historical data from previous iterations to enhance the performance of the autonomous system. Unlike…

    Submitted 6 September, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted by 2023 62nd IEEE Conference on Decision and Control (CDC)

  29. arXiv:2302.13770  [pdf, other]

    cs.CV eess.IV

    Mask Reference Image Quality Assessment

    Authors: Pengxiang Xiao, Shuai He, Limin Liu, Anlong Ming

    Abstract: Understanding semantic information is an essential step in knowing what is being learned in both full-reference (FR) and no-reference (NR) image quality assessment (IQA) methods. However, especially for many severely distorted images, even if there is an undistorted image as a reference (FR-IQA), it is difficult to perceive the lost semantic and texture information of distorted images directly. In…

    Submitted 19 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 10 pages, 6 figures

  30. Federated Multi-Agent Deep Reinforcement Learning Approach via Physics-Informed Reward for Multi-Microgrid Energy Management

    Authors: Yuanzheng Li, Shangyang He, Yang Li, Yang Shi, Zhigang Zeng

    Abstract: The utilization of large-scale distributed renewable energy promotes the development of the multi-microgrid (MMG), which raises the need of developing an effective energy management method to minimize economic costs and keep self energy-sufficiency. The multi-agent deep reinforcement learning (MADRL) has been widely used for the energy management problem because of its real-time scheduling ability…

    Submitted 29 December, 2022; originally announced January 2023.

    Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems 35 (2024) 5902-5914

  31. arXiv:2212.09206  [pdf, other]

    eess.IV cs.CV

    Segmentation Ability Map: Interpret deep features for medical image segmentation

    Authors: Sheng He, Yanfang Feng, P. Ellen Grant, Yangming Ou

    Abstract: Deep convolutional neural networks (CNNs) have been widely used for medical image segmentation. In most studies, only the output layer is exploited to compute the final segmentation results and the hidden representations of the deep learned features have not been well understood. In this paper, we propose a prototype segmentation (ProtoSeg) method to compute a binary segmentation map based on deep…

    Submitted 18 December, 2022; originally announced December 2022.

    Journal ref: Medical Image Analysis, 2023

  32. arXiv:2212.02764  [pdf, other]

    eess.IV cs.CV cs.LG

    A Trustworthy Framework for Medical Image Analysis with Deep Learning

    Authors: Kai Ma, Siyuan He, Pengcheng Xi, Ashkan Ebadi, Stéphane Tremblay, Alexander Wong

    Abstract: Computer vision and machine learning are playing an increasingly important role in computer-assisted diagnosis; however, the application of deep learning to medical imaging has challenges in data availability and data imbalance, and it is especially important that models for medical imaging are built to be trustworthy. Therefore, we propose TRUDLMIA, a trustworthy deep learning framework for medic…

    Submitted 6 December, 2022; originally announced December 2022.

  33. arXiv:2212.01106   

    eess.AS eess.SP

    ExARN: self-attending RNN for target speaker extraction

    Authors: Pengjie Shen, Shulin He, Xueliang Zhang

    Abstract: Target speaker extraction is to extract the target speaker, specified by enrollment utterance, in an environment with other competing speakers. Therefore, the task needs to solve two problems, speaker identification and separation, at the same time. In this paper, we combine self-attention and Recurrent Neural Networks (RNN). Further, we exploit various ways to combine different auxiliary inform…

    Submitted 12 March, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: The overall quality of the article is not good enough

  34. arXiv:2211.13797  [pdf, other]

    math.OC cs.RO eess.SY

    Data-Driven Distributionally Robust Electric Vehicle Balancing for Autonomous Mobility-on-Demand Systems under Demand and Supply Uncertainties

    Authors: Sihong He, Zhili Zhang, Shuo Han, Lynn Pepin, Guang Wang, Desheng Zhang, John Stankovic, Fei Miao

    Abstract: Electric vehicles (EVs) are being rapidly adopted due to their economic and societal benefits. Autonomous mobility-on-demand (AMoD) systems also embrace this trend. However, the long charging time and high recharging frequency of EVs pose challenges to efficiently managing EV AMoD systems. The complicated dynamic charging and mobility process of EV AMoD systems makes the demand and supply uncertai…

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: 16 pages

  35. arXiv:2211.12340  [pdf, other]

    eess.IV cs.CV

    DOLCE: A Model-Based Probabilistic Diffusion Framework for Limited-Angle CT Reconstruction

    Authors: Jiaming Liu, Rushil Anirudh, Jayaraman J. Thiagarajan, Stewart He, K. Aditya Mohan, Ulugbek S. Kamilov, Hyojin Kim

    Abstract: Limited-Angle Computed Tomography (LACT) is a non-destructive evaluation technique used in a variety of applications ranging from security to medicine. The limited angle coverage in LACT is often a dominant source of severe artifacts in the reconstructed images, making it a challenging inverse problem. We present DOLCE, a new deep model-based framework for LACT that uses a conditional diffusion mo…

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 29 pages, 21 figures

  36. arXiv:2210.15853  [pdf, other]

    cs.SD eess.AS

    Speech Enhancement with Intelligent Neural Homomorphic Synthesis

    Authors: Shulin He, Wei Rao, Jinjiang Liu, Jun Chen, Yukai Ju, Xueliang Zhang, Yannan Wang, Shidong Shang

    Abstract: Most neural network speech enhancement models ignore speech production mathematical models by directly mapping Fourier transform spectrums or waveforms. In this work, we propose a neural source filter network for speech enhancement. Specifically, we use homomorphic signal processing and cepstral analysis to obtain noisy speech's excitation and vocal tract. Unlike traditional signal processing, we…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: Submitted to ICASSP 2023

  37. arXiv:2210.15849  [pdf, ps, other]

    cs.SD eess.AS

    Hierarchical speaker representation for target speaker extraction

    Authors: Shulin He, Huaiwen Zhang, Wei Rao, Kanghao Zhang, Yukai Ju, Yang Yang, Xueliang Zhang

    Abstract: Target speaker extraction aims to isolate a specific speaker's voice from a composite of multiple sound sources, guided by an enrollment utterance, also called an anchor. Current methods predominantly derive speaker embeddings from the anchor and integrate them into the separation network to separate the voice of the target speaker. However, the representation of the speaker embedding is too simplistic,…

    Submitted 4 January, 2024; v1 submitted 27 October, 2022; originally announced October 2022.

    Comments: Accepted to ICASSP 2024

  38. arXiv:2210.04747  [pdf, other]

    cs.IT eess.SP

    An NLoS-based Enhanced Sensing Method for MmWave Communication System

    Authors: Shiwen He, Kangli Cai, Shiyue Huang, Zhenyu Anz, Wei Huang, Ning Gao

    Abstract: The millimeter-wave (mmWave)-based Wi-Fi sensing technology has recently attracted extensive attention since it provides a possibility to realize higher sensing accuracy. However, current works mainly concentrate on sensing scenarios where the line-of-sight (LoS) path exists, which significantly limits their applications. To address the problem, we propose an enhanced mmWave sensing algorithm in t…

    Submitted 10 October, 2022; originally announced October 2022.

  39. arXiv:2209.08230  [pdf, other]

    cs.MA cs.LG cs.RO eess.SY

    A Robust and Constrained Multi-Agent Reinforcement Learning Electric Vehicle Rebalancing Method in AMoD Systems

    Authors: Sihong He, Yue Wang, Shuo Han, Shaofeng Zou, Fei Miao

    Abstract: Electric vehicles (EVs) play critical roles in autonomous mobility-on-demand (AMoD) systems, but their unique charging patterns increase the model uncertainties in AMoD systems (e.g. state transition probability). Since there usually exists a mismatch between the training and test/true environments, incorporating model uncertainty into system design is of critical importance in real-world applicat…

    Submitted 27 September, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: 8 pages, accepted to IROS2023

  40. arXiv:2208.13113  [pdf, other]

    eess.IV cs.CV

    Accurate and Robust Lesion RECIST Diameter Prediction and Segmentation with Transformers

    Authors: Youbao Tang, Ning Zhang, Yirui Wang, Shenghua He, Mei Han, Jing Xiao, Ruei-Sung Lin

    Abstract: Automatically measuring lesion/tumor size with RECIST (Response Evaluation Criteria In Solid Tumors) diameters and segmentation is important for computer-aided diagnosis. Although it has been studied in recent years, there is still space to improve its accuracy and robustness, such as (1) enhancing features by incorporating rich contextual information while keeping a high spatial resolution and (2…

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: One of a series of works about lesion RECIST diameter prediction and weakly-supervised lesion segmentation (MICCAI 2022)

  41. arXiv:2205.12727  [pdf, other]

    eess.AS cs.SD

    Semantic-preserved Communication System for Highly Efficient Speech Transmission

    Authors: Tianxiao Han, Qianqian Yang, Zhiguo Shi, Shibo He, Zhaoyang Zhang

    Abstract: Deep learning (DL) based semantic communication methods have been explored for the efficient transmission of images, text, and speech in recent years. In contrast to traditional wireless communication methods that focus on the transmission of abstract symbols, semantic communication approaches attempt to achieve better transmission efficiency by only sending the semantic-related information of the…

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.03211

  42. Leveraging RIS-Enabled Smart Signal Propagation for Solving Infeasible Localization Problems

    Authors: Kamran Keykhosravi, Benoit Denis, George C. Alexandropoulos, Zhongxia Simon He, Antonio Albanese, Vincenzo Sciancalepore, Henk Wymeersch

    Abstract: Reconfigurable intelligent surfaces (RISs) have tremendous potential for both communication and localization. While communication benefits are now well-understood, the breakthrough nature of the technology may well lie in its capability to provide location estimates when conventional approaches fail (e.g., due to insufficient available infrastructure). A limited number of example scenarios have b…

    Submitted 25 April, 2022; originally announced April 2022.

  43. Probabilistic Charging Power Forecast of EVCS: Reinforcement Learning Assisted Deep Learning Approach

    Authors: Yuanzheng Li, Shangyang He, Yang Li, Leijiao Ge, Suhua Lou, Zhigang Zeng

    Abstract: The electric vehicle (EV) and electric vehicle charging station (EVCS) have been widely deployed with the development of large-scale transportation electrifications. However, since charging behaviors of EVs show large uncertainties, the forecasting of EVCS charging power is non-trivial. This paper tackles this issue by proposing a reinforcement learning assisted deep learning framework for the pro…

    Submitted 16 April, 2022; originally announced April 2022.

    Comments: Accepted by IEEE Transactions on Intelligent Vehicles

    Journal ref: IEEE Transactions on Intelligent Vehicles 8 (2023) 344-357

  44. arXiv:2204.06929  [pdf, other]

    eess.IV cs.CV cs.LG

    Sketch guided and progressive growing GAN for realistic and editable ultrasound image synthesis

    Authors: Jiamin Liang, Xin Yang, Yuhao Huang, Haoming Li, Shuangchi He, Xindi Hu, Zejian Chen, Wufeng Xue, Jun Cheng, Dong Ni

    Abstract: Ultrasound (US) imaging is widely used for anatomical structure inspection in clinical diagnosis. The training of new sonographers and deep learning based algorithms for US image analysis usually requires a large amount of data. However, obtaining and labeling large-scale US imaging data are not easy tasks, especially for diseases with low incidence. Realistic US image synthesis can alleviate this…

    Submitted 25 May, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted by Medical Image Analysis (13 figures, 4 tables)

  45. arXiv:2202.04754  [pdf, other]

    eess.IV cs.CV

    Wireless Transmission of Images With The Assistance of Multi-level Semantic Information

    Authors: Zhenguo Zhang, Qianqian Yang, Shibo He, Mingyang Sun, Jiming Chen

    Abstract: Semantic-oriented communication has been considered a promising approach to boost bandwidth efficiency by transmitting only the semantics of the data. In this paper, we propose a multi-level semantic aware communication system for wireless image transmission, named MLSC-image, which is based on deep learning techniques and trained in an end-to-end manner. In particular, the proposed model includ…

    Submitted 8 December, 2023; v1 submitted 8 February, 2022; originally announced February 2022.

  46. arXiv:2202.03211  [pdf, other]

    eess.AS cs.SD

    Semantic-aware Speech to Text Transmission with Redundancy Removal

    Authors: Tianxiao Han, Qianqian Yang, Zhiguo Shi, Shibo He, Zhaoyang Zhang

    Abstract: Deep learning (DL) based semantic communication methods have been explored for the efficient transmission of images, text, and speech in recent years. In contrast to traditional wireless communication methods that focus on the transmission of abstract symbols, semantic communication approaches attempt to achieve better transmission efficiency by only sending the semantic-related information of the…

    Submitted 7 February, 2022; originally announced February 2022.

  47. arXiv:2112.08363  [pdf, other]

    cs.LG cs.CV eess.IV

    Performance or Trust? Why Not Both. Deep AUC Maximization with Self-Supervised Learning for COVID-19 Chest X-ray Classifications

    Authors: Siyuan He, Pengcheng Xi, Ashkan Ebadi, Stephane Tremblay, Alexander Wong

    Abstract: Effective representation learning is the key in improving model performance for medical image analysis. In training deep learning models, a compromise often must be made between performance and trust, both of which are essential for medical applications. Moreover, models optimized with cross-entropy loss tend to suffer from unwarranted overconfidence in the majority class and over-cautiousness in…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 3 pages

    Journal ref: Published at CVIS 2021: 7th Annual Conference on Vision and Intelligent Systems

  48. arXiv:2112.06435  [pdf, other]

    cs.RO cs.MA eess.SY

    Autonomous Racing with Multiple Vehicles using a Parallelized Optimization with Safety Guarantee using Control Barrier Functions

    Authors: Suiyi He, Jun Zeng, Koushil Sreenath

    Abstract: This paper presents a novel planning and control strategy for competing with multiple vehicles in a car racing scenario. The proposed racing strategy switches between two modes. When there are no surrounding vehicles, a learning-based model predictive control (MPC) trajectory planner is used to guarantee that the ego vehicle achieves better lap timing performance. When the ego vehicle is competing…

    Submitted 27 March, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Accepted to IEEE International Conference on Robotics and Automation (ICRA 2022)

  49. arXiv:2112.01738  [pdf, ps, other]

    cs.IT eess.SP

    Joint User Scheduling and Beamforming Design for Multiuser MISO Downlink Systems

    Authors: S. He, J. Yuan, Z. An, W. Huang, Y. Huang, Y. Zhang

    Abstract: In multiuser communication systems, user scheduling and beamforming (US-BF) design are two fundamental problems that are usually studied separately in the existing literature. In this work, we focus on the joint US-BF design with the goal of maximizing the set cardinality of scheduled users, which is computationally challenging due to the non-convex objective function and the coupled constraints w…

    Submitted 4 July, 2022; v1 submitted 3 December, 2021; originally announced December 2021.

    Comments: 31 pages, 9 figures, submitted to IEEE Transactions on Wireless Communications

  50. arXiv:2110.03775  [pdf]

    eess.IV cs.CV physics.med-ph

    Proposing a System Level Machine Learning Hybrid Architecture and Approach for a Comprehensive Autism Spectrum Disorder Diagnosis

    Authors: Ryan Liu, Spencer He

    Abstract: Autism Spectrum Disorder (ASD) is a severe neuropsychiatric disorder that affects intellectual development, social behavior, and facial features, and the number of cases is still significantly increasing. Due to the variety of symptoms ASD displays, the diagnosis process remains challenging, with numerous misdiagnoses as well as lengthy and expensive diagnoses. Fortunately, if ASD is diagnosed and…

    Submitted 18 September, 2021; originally announced October 2021.