Skip to main content

Showing 1–50 of 137 results for author: Yu, Z

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.00933  [pdf, other

    cs.DC eess.SP

    Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks: Design Optimization and Analysis

    Authors: Xueyao Zhang, Bo Yang, Zhiwen Yu, Xuelin Cao, George C. Alexandropoulos, Yan Zhang, Merouane Debbah, Chau Yuen

    Abstract: This paper investigates autonomous driving safety improvement via task offloading from cellular vehicles (CVs) to a multi-access edge computing (MEC) server using vehicle-to-infrastructure (V2I) links. Considering that the latter links can be reused by vehicle-to-vehicle (V2V) communications to improve spectrum utilization, the receiver of the V2I link may suffer from severe interference that can… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.18055  [pdf, other

    cs.IT eess.SP

    Filtering Reconfigurable Intelligent Computational Surface for RF Spectrum Purification

    Authors: Kaining Wang, Bo Yang, Zhiwen Yu, Xuelin Cao, Mérouane Debbah, Chau Yuen

    Abstract: The increasing demand for communication is degrading the electromagnetic (EM) transmission environment due to severe EM interference, significantly reducing the efficiency of the radio frequency (RF) spectrum. Metasurfaces, a promising technology for controlling desired EM waves, have recently received significant attention from both academia and industry. However, the potential impact of out-of-b… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2406.16012  [pdf

    eess.IV cs.CV

    Wound Tissue Segmentation in Diabetic Foot Ulcer Images Using Deep Learning: A Pilot Study

    Authors: Mrinal Kanti Dhar, Chuanbo Wang, Yash Patel, Taiyu Zhang, Jeffrey Niezgoda, Sandeep Gopalakrishnan, Keke Chen, Zeyun Yu

    Abstract: Identifying individual tissues, so-called tissue segmentation, in diabetic foot ulcer (DFU) images is a challenging task and little work has been published, largely due to the limited availability of a clinical image dataset. To address this gap, we have created a DFUTissue dataset for the research community to evaluate wound tissue segmentation algorithms. The dataset contains 110 images with tis… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  4. arXiv:2406.13335  [pdf, other

    cs.NI eess.SP

    AI-Empowered Multiple Access for 6G: A Survey of Spectrum Sensing, Protocol Designs, and Optimizations

    Authors: Xuelin Cao, Bo Yang, Kaining Wang, Xinghua Li, Zhiwen Yu, Chau Yuen, Yan Zhang, Zhu Han

    Abstract: With the rapidly increasing number of bandwidth-intensive terminals capable of intelligent computing and communication, such as smart devices equipped with shallow neural network models, the complexity of multiple access for these intelligent terminals is increasing due to the dynamic network environment and ubiquitous connectivity in 6G systems. Traditional multiple access (MA) design and optimiz… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  5. Interpretable modulated differentiable STFT and physics-informed balanced spectrum metric for freight train wheelset bearing cross-machine transfer fault diagnosis under speed fluctuations

    Authors: Chao He, Hongmei Shi, Ruixin Li, Jianbo Li, ZuJun Yu

    Abstract: The service conditions of wheelset bearings has a direct impact on the safe operation of railway heavy haul freight trains as the key components. However, speed fluctuation of the trains and few fault samples are the two main problems that restrict the accuracy of bearing fault diagnosis. Therefore, a cross-machine transfer diagnosis (pyDSN) network coupled with interpretable modulated differentia… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Journal ref: Advanced Engineering Informatics, 2024

  6. arXiv:2406.09546  [pdf, other

    cs.CV eess.IV

    Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment

    Authors: Fengbin Guan, Xin Li, Zihao Yu, Yiting Lu, Zhibo Chen

    Abstract: In this work, we take the first exploration of the recently popular foundation model, i.e., State Space Model/Mamba, in image quality assessment, aiming at observing and excavating the perception potential in vision Mamba. A series of works on Mamba has shown its significant potential in various fields, e.g., segmentation and classification. However, the perception capability of Mamba has been und… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 17 pages,3 figures

  7. arXiv:2406.09058  [pdf, ps, other

    cs.IT eess.SP

    Environment-Aware Codebook Design for RIS-Assisted MU-MISO Communications: Implementation and Performance Analysis

    Authors: Zhiheng Yu, Jiancheng An, Ertugrul Basar, Lu Gan, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) provides a new electromagnetic response control solution, which can proactively reshape the characteristics of wireless channel environments. In RIS-assisted communication systems, the acquisition of channel state information (CSI) and the optimization of reflecting coefficients constitute major design challenges. To address these issues, codebook-based sol… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 36 pages, 12 figures, 2 tables, accepted by IEEE TCOM. arXiv admin note: text overlap with arXiv:2404.00265

  8. arXiv:2405.17295  [pdf, other

    eess.SP

    In-sensor Computing ANN Capacitive Sensors

    Authors: Guihua Zhao, Yating Peng, Jiaxin Zhu, Xin Tang, Zhiyi Yu

    Abstract: This letter proposes an in-sensor computing multiply-and-accumulate (MAC) circuit based on capacitance. The MAC circuits can constitute an artificial neural network(ANN) layer and be operated as ANN classifiers and autoencoders. The proposed circuit is a promising scheme for capacitive ANN image sensors, showing competitively high efficiency and lower power.

    Submitted 27 May, 2024; originally announced May 2024.

  9. arXiv:2405.08800  [pdf

    eess.SY

    Estimation of Participation Factors for Power System Oscillation from Measurements

    Authors: Tianwei Xia, Zhe Yu, Kai Sun, Di Shi, Kaiyang Huang

    Abstract: In a power system, when the participation factors of generators are computed to rank their participations into an oscillatory mode, a model-based approach is conventionally used on the linearized system model by means of the corresponding right and left eigenvectors. This paper proposes a new approach for estimating participation factors directly from measurement data on generator responses under… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  10. arXiv:2405.06995  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Benchmarking Cross-Domain Audio-Visual Deception Detection

    Authors: Xiaobao Guo, Zitong Yu, Nithish Muthuchamy Selvaraj, Bingquan Shen, Adams Wai-Kin Kong, Alex C. Kot

    Abstract: Automated deception detection is crucial for assisting humans in accurately assessing truthfulness and identifying deceptive behavior. Conventional contact-based techniques, like polygraph devices, rely on physiological signals to determine the authenticity of an individual's statements. Nevertheless, recent developments in automated deception detection have demonstrated that multimodal features d… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 10 pages

  11. arXiv:2404.07215  [pdf, other

    cs.NI cs.AI eess.SP

    Computation Offloading for Multi-server Multi-access Edge Vehicular Networks: A DDQN-based Method

    Authors: Siyu Wang, Bo Yang, Zhiwen Yu, Xuelin Cao, Yan Zhang, Chau Yuen

    Abstract: In this paper, we investigate a multi-user offloading problem in the overlap** domain of a multi-server mobile edge computing system. We divide the original problem into two stages: the offloading decision making stage and the request scheduling stage. To prevent the terminal from going out of service area during offloading, we consider the mobility parameter of the terminal according to the hum… ▽ More

    Submitted 20 February, 2024; originally announced April 2024.

  12. arXiv:2404.05217  [pdf, other

    eess.SY

    Network-Constrained Unit Commitment with Flexible Temporal Resolution

    Authors: Zekuan Yu, Haiwang Zhong, Guangchun Ruan, Xinfei Yan

    Abstract: Modern network-constrained unit commitment (NCUC) bears a heavy computational burden due to the ever-growing model scale. This situation becomes more challenging when detailed operational characteristics, complicated constraints, and multiple objectives are considered. We propose a novel simplification method to determine the flexible temporal resolution for acceleration and near-optimal solutions… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 11 pages, 10 figures. Accepted by IEEE Transactions on Power Systems

  13. arXiv:2404.01170  [pdf, other

    cs.RO eess.IV

    Force-EvT: A Closer Look at Robotic Gripper Force Measurement with Event-based Vision Transformer

    Authors: Qianyu Guo, Ziqing Yu, Jiaming Fu, Yawen Lu, Yahya Zweiri, Dongming Gan

    Abstract: Robotic grippers are receiving increasing attention in various industries as essential components of robots for interacting and manipulating objects. While significant progress has been made in the past, conventional rigid grippers still have limitations in handling irregular objects and can damage fragile objects. We have shown that soft grippers offer deformability to adapt to a variety of objec… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 6 pages, 5 figures

  14. arXiv:2404.00265  [pdf, ps, other

    cs.IT eess.SP

    Environment-Aware Codebook for RIS-Assisted MU-MISO Communications: Implementation and Performance Analysis

    Authors: Zhiheng Yu, Jiancheng An, Lu Gan, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) provides a new electromagnetic response control solution, which can reshape the characteristics of wireless channels. In this paper, we propose a novel environment-aware codebook protocol for RIS-assisted multi-user multiple-input single-output (MU-MISO) systems. Specifically, we first introduce a channel training protocol which consists of off-line and on-… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures, accepted by VTC2024-Spring

  15. arXiv:2403.14250  [pdf, other

    eess.IV cs.CR cs.CV

    Safeguarding Medical Image Segmentation Datasets against Unauthorized Training via Contour- and Texture-Aware Perturbations

    Authors: Xun Lin, Yi Yu, Song Xia, Jue Jiang, Haoran Wang, Zitong Yu, Yizhong Liu, Ying Fu, Shuai Wang, Wenzhong Tang, Alex Kot

    Abstract: The widespread availability of publicly accessible medical images has significantly propelled advancements in various research and clinical fields. Nonetheless, concerns regarding unauthorized training of AI systems for commercial purposes and the duties of patient privacy protection have led numerous institutions to hesitate to share their images. This is particularly true for medical image segme… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  16. arXiv:2403.11061  [pdf, other

    eess.SP

    Beamforming Design for Double-Active-RIS-aided Communication Systems with Inter-Excitation

    Authors: Boshi Wang, Cunhua Pan, Hong Ren, Zhiyuan Yu, Yang Zhang, Mengyu Liu, Gui Zhou

    Abstract: In this paper, we investigate a double-active-reconfigurable intelligent surface (RIS)-aided downlink wireless communication system, where a multi-antenna base station (BS) serves multiple single-antenna users with both double reflection and single reflection links. Due to the signal amplification capability of active RISs, the mutual influence between active RISs, which is termed as the "inter-ex… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  17. arXiv:2403.09612  [pdf, other

    physics.optics cs.CV cs.LG eess.IV

    Compute-first optical detection for noise-resilient visual perception

    Authors: Jungmin Kim, Nanfang Yu, Zongfu Yu

    Abstract: In the context of visual perception, the optical signal from a scene is transferred into the electronic domain by detectors in the form of image data, which are then processed for the extraction of visual information. In noisy and weak-signal environments such as thermal imaging for night vision applications, however, the performance of neural computing tasks faces a significant bottleneck due to… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Main 9 pages, 5 figures, Supplementary information 5 pages

  18. arXiv:2403.04228  [pdf, other

    cs.CV eess.IV

    Single-Image HDR Reconstruction Assisted Ghost Suppression and Detail Preservation Network for Multi-Exposure HDR Imaging

    Authors: Huafeng Li, Zhenmei Yang, Yafei Zhang, Dapeng Tao, Zhengtao Yu

    Abstract: The reconstruction of high dynamic range (HDR) images from multi-exposure low dynamic range (LDR) images in dynamic scenes presents significant challenges, especially in preserving and restoring information in oversaturated regions and avoiding ghosting artifacts. While current methods often struggle to address these challenges, our work aims to bridge this gap by develo** a multi-exposure HDR i… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: IEEE Transactions on Computational Imaging

  19. arXiv:2402.13692  [pdf, other

    eess.SP

    Reconfigurable Intelligent Surface assisted Integrated Communication, Sensing, and Computation Systems

    Authors: Jiahua Wan, Hong Ren, Zhiyuan Yu, Zhenkun Zhang, Yang Zhang, Cunhua Pan, Jiangzhou Wang

    Abstract: This paper studies a mobile edge computing (MEC) assisted integrated sensing and communication (ISAC), where reconfigurable intelligent surface (RIS) is used to alleviate the attenuation of communication links during computational offloading. In this paradigm, the dual function radar and communication (DFRC)-enabled user equipments (UEs) simultaneously perform radar sensing and communication tasks… ▽ More

    Submitted 14 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  20. arXiv:2402.07485  [pdf, other

    cs.SD eess.AS

    MINT: Boosting Audio-Language Model via Multi-Target Pre-Training and Instruction Tuning

    Authors: Hang Zhao, Yifei Xin, Zhesong Yu, Bilei Zhu, Lu Lu, Zejun Ma

    Abstract: In the realm of audio-language pre-training (ALP), the challenge of achieving cross-modal alignment is significant. Moreover, the integration of audio inputs with diverse distributions and task variations poses challenges in develo** generic audio-language models. In this study, we present MINT, a novel ALP framework boosting audio-language models through multi-target pre-training and instructio… ▽ More

    Submitted 11 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  21. arXiv:2402.05847  [pdf, other

    eess.SP

    Reconfigurable Intelligent Surface-Aided Dual-Function Radar and Communication Systems With MU-MIMO Communication

    Authors: Yasheng **, Hong Ren, Cunhua Pan, Zhiyuan Yu, Ruisong Weng, Boshi Wang, Gui Zhou, Yongchao He, Maged Elkashlan

    Abstract: In this paper, we investigate an reconfigurable intelligent surface (RIS)-aided integrated sensing and communication (ISAC) system. Our objective is to maximize the achievable sum rate of the multi-antenna communication users through the joint active and passive beamforming. {Specifically}, the weighted minimum mean-square error (WMMSE) method is { first} used to reformulate the original problem i… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  22. arXiv:2402.04532  [pdf, other

    eess.SP

    Joint Beamforming Design for Double Active RIS-assisted Radar-Communication Coexistence Systems

    Authors: Mengyu Liu, Hong Ren, Cunhua Pan, Boshi Wang, Zhiyuan Yu, Ruisong Weng, Kangda Zhi, Yongchao He

    Abstract: Integrated sensing and communication (ISAC) technology has been considered as one of the key candidate technologies in the next-generation wireless communication systems. However, when radar and communication equipment coexist in the same system, i.e. radar-communication coexistence (RCC), the interference from communication systems to radar can be large and cannot be ignored. Recently, reconfigur… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  23. arXiv:2402.02122  [pdf, other

    eess.SP

    Secure Wireless Communication in Active RIS-Assisted DFRC System

    Authors: Yang Zhang, Hong Ren, Cunhua Pan, Boshi Wang, Zhiyuan Yu, Ruisong Weng, Tuo Wu, Yongchao He

    Abstract: This work considers a dual-functional radar and communication (DFRC) system with an active reconfigurable intelligent surface (RIS) and a potential eavesdropper. Our purpose is to maximize the secrecy rate (SR) of the system by jointly designing the beamforming matrix at the DFRC base station (BS) and the reflecting coefficients at the active RIS, subject to the signal-to-interference-plus-noise-r… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 13 pages, 9 figures

  24. arXiv:2402.00398  [pdf, other

    cs.DC eess.SP

    Reconfigurable Intelligent Computational Surfaces for MEC-Assisted Autonomous Driving Networks

    Authors: Bo Yang, Xueyao Zhang, Zhiwen Yu, Xuelin Cao, Chongwen Huang, George C. Alexandropoulos, Yan Zhang, Merouane Debbah, Chau Yuen

    Abstract: In this paper, we focus on improving autonomous driving safety via task offloading from cellular vehicles (CVs), using vehicle-to-infrastructure (V2I) links, to an multi-access edge computing (MEC) server. Considering that the frequencies used for V2I links can be reused for vehicle-to-vehicle (V2V) communications to improve spectrum utilization, the receiver of each V2I link may suffer from sever… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  25. arXiv:2401.15803  [pdf, other

    cs.RO cs.AI cs.CV eess.SY

    GarchingSim: An Autonomous Driving Simulator with Photorealistic Scenes and Minimalist Workflow

    Authors: Liguo Zhou, Yinglei Song, Yichao Gao, Zhou Yu, Michael Sodamin, Hongshen Liu, Liang Ma, Lian Liu, Hao Liu, Yang Liu, Haichuan Li, Guang Chen, Alois Knoll

    Abstract: Conducting real road testing for autonomous driving algorithms can be expensive and sometimes impractical, particularly for small startups and research institutes. Thus, simulation becomes an important method for evaluating these algorithms. However, the availability of free and open-source simulators is limited, and the installation and configuration process can be daunting for beginners and inte… ▽ More

    Submitted 30 January, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  26. arXiv:2401.08522  [pdf, other

    cs.CV cs.LG eess.IV

    Video Quality Assessment Based on Swin TransformerV2 and Coarse to Fine Strategy

    Authors: Zihao Yu, Fengbin Guan, Yiting Lu, Xin Li, Zhibo Chen

    Abstract: The objective of non-reference video quality assessment is to evaluate the quality of distorted video without access to reference high-definition references. In this study, we introduce an enhanced spatial perception module, pre-trained on multiple image quality assessment datasets, and a lightweight temporal fusion module to address the no-reference visual quality assessment (NR-VQA) task. This m… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  27. UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction

    Authors: Jiaxin Guo, Minghan Wang, Xiaosong Qiao, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhengzhe Yu, Yinglu Li, Chang Su, Min Zhang, Shimin Tao, Hao Yang

    Abstract: Error correction techniques have been used to refine the output sentences from automatic speech recognition (ASR) models and achieve a lower word error rate (WER). Previous works usually adopt end-to-end models and has strong dependency on Pseudo Paired Data and Original Paired Data. But when only pre-training on Pseudo Paired Data, previous models have negative effect on correction. While fine-tu… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted in ICASSP 2023

  28. arXiv:2401.03002  [pdf, other

    eess.IV cs.CV

    Prompt-driven Latent Domain Generalization for Medical Image Classification

    Authors: Siyuan Yan, Chi Liu, Zhen Yu, Lie Ju, Dwarikanath Mahapatra, Brigid Betz-Stablein, Victoria Mar, Monika Janda, Peter Soyer, Zongyuan Ge

    Abstract: Deep learning models for medical image analysis easily suffer from distribution shifts caused by dataset artifacts bias, camera variations, differences in the imaging station, etc., leading to unreliable diagnoses in real-world clinical settings. Domain generalization (DG) methods, which aim to train models on multiple domains to perform well on unseen domains, offer a promising direction to solve… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 10 pages

  29. arXiv:2312.05707  [pdf, other

    eess.IV cs.CV cs.LG eess.SP physics.med-ph

    Non-Cartesian Self-Supervised Physics-Driven Deep Learning Reconstruction for Highly-Accelerated Multi-Echo Spiral fMRI

    Authors: Hongyi Gu, Chi Zhang, Zidan Yu, Christoph Rettenmeier, V. Andrew Stenger, Mehmet Akçakaya

    Abstract: Functional MRI (fMRI) is an important tool for non-invasive studies of brain function. Over the past decade, multi-echo fMRI methods that sample multiple echo times has become popular with potential to improve quantification. While these acquisitions are typically performed with Cartesian trajectories, non-Cartesian trajectories, in particular spiral acquisitions, hold promise for denser sampling… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: Submitted to 2024 ISBI

  30. arXiv:2312.03490  [pdf, other

    eess.IV cs.CV

    PneumoLLM: Harnessing the Power of Large Language Model for Pneumoconiosis Diagnosis

    Authors: Meiyue Song, Zhihua Yu, Jiaxin Wang, Jiarui Wang, Yuting Lu, Baicun Li, Xiaoxu Wang, Qinghua Huang, Zhijun Li, Nikolaos I. Kanellakis, Jiangfeng Liu, **g Wang, Binglu Wang, Juntao Yang

    Abstract: The conventional pretraining-and-finetuning paradigm, while effective for common diseases with ample data, faces challenges in diagnosing data-scarce occupational diseases like pneumoconiosis. Recently, large language models (LLMs) have exhibits unprecedented ability when conducting multiple tasks in dialogue, bringing opportunities to diagnosis. A common strategy might involve using adapter layer… ▽ More

    Submitted 28 June, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Medical Image Analysis

  31. arXiv:2312.01586  [pdf, ps, other

    math.OC eess.SY

    On the Maximization of Long-Run Reward CVaR for Markov Decision Processes

    Authors: Li Xia, Zhihui Yu, Peter W. Glynn

    Abstract: This paper studies the optimization of Markov decision processes (MDPs) from a risk-seeking perspective, where the risk is measured by conditional value-at-risk (CVaR). The objective is to find a policy that maximizes the long-run CVaR of instantaneous rewards over an infinite horizon across all history-dependent randomized policies. By establishing two optimality inequalities of opposing directio… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: Risk-seeking optimization of CVaR in MDP

  32. arXiv:2311.15164  [pdf

    physics.optics eess.IV

    Neural-Optic Co-Designed Polarization-Multiplexed Metalens for Compact Computational Spectral Imaging

    Authors: Qiangbo Zhang, Peicheng Lin, Chang Wang, Yang Zhang, Zeqing Yu, Xinyu Liu, Ting Xu, Zhenrong Zheng

    Abstract: As the realm of spectral imaging applications extends its reach into the domains of mobile technology and augmented reality, the demands for compact yet high-fidelity systems become increasingly pronounced. Conventional methodologies, exemplified by coded aperture snapshot spectral imaging systems, are significantly limited by their cumbersome physical dimensions and form factors. To address this… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

  33. arXiv:2311.12461  [pdf, other

    eess.IV cs.CV

    HiFi-Syn: Hierarchical Granularity Discrimination for High-Fidelity Synthesis of MR Images with Structure Preservation

    Authors: Ziqi Yu, Botao Zhao, Shengjie Zhang, Xiang Chen, Jianfeng Feng, Tingying Peng, Xiao-Yong Zhang

    Abstract: Synthesizing medical images while preserving their structural information is crucial in medical research. In such scenarios, the preservation of anatomical content becomes especially important. Although recent advances have been made by incorporating instance-level information to guide translation, these methods overlook the spatial coherence of structural-level representation and the anatomical i… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  34. arXiv:2311.03756  [pdf, other

    cs.LG cs.AI eess.SY

    Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning

    Authors: Yao Zhang, Zhiwen Yu, Jun Zhang, Liang Wang, Tom H. Luan, Bin Guo, Chau Yuen

    Abstract: This paper considers optimal traffic signal control in smart cities, which has been taken as a complex networked system control problem. Given the interacting dynamics among traffic lights and road networks, attaining controller adaptivity and scalability stands out as a primary challenge. Capturing the spatial-temporal correlation among traffic lights under the framework of Multi-Agent Reinforcem… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  35. arXiv:2311.01653  [pdf

    eess.IV cs.CV

    INeAT: Iterative Neural Adaptive Tomography

    Authors: Bo Xiong, Changqing Su, Zihan Lin, You Zhou, Zhaofei Yu

    Abstract: Computed Tomography (CT) with its remarkable capability for three-dimensional imaging from multiple projections, enjoys a broad range of applications in clinical diagnosis, scientific observation, and industrial detection. Neural Adaptive Tomography (NeAT) is a recently proposed 3D rendering method based on neural radiance field for CT, and it demonstrates superior performance compared to traditio… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  36. Numerical Derivative-based Flexible Integration Algorithm for Power Electronic Systems Simulation Considering Nonlinear Components

    Authors: Han Xu, Bochen Shi, Zhujun Yu, Jialin Zheng, Zhengming Zhao

    Abstract: Simulation is an efficient tool in the design and control of power electronic systems. However, quick and accurate simulation of them is still challenging, especially when the system contains a large number of switches and state variables. Conventional general-purpose integration algorithms assume nonlinearity within systems but face inefficiency in handling the piecewise characteristics of power… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 10 pages, 8 figures

  37. arXiv:2310.10159  [pdf, other

    cs.SD cs.CL eess.AS

    Joint Music and Language Attention Models for Zero-shot Music Tagging

    Authors: Xingjian Du, Zhesong Yu, Jiaju Lin, Bilei Zhu, Qiuqiang Kong

    Abstract: Music tagging is a task to predict the tags of music recordings. However, previous music tagging research primarily focuses on close-set music tagging tasks which can not be generalized to new tags. In this work, we propose a zero-shot music tagging system modeled by a joint music and language attention (JMLA) model to address the open-set music tagging problem. The JMLA model consists of an audio… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: \begin{keywords} Music tagging, joint music and language attention models, Music Foundation Model. \end{keywords}

  38. arXiv:2310.04231  [pdf, other

    cs.RO eess.SY

    Indoor Positioning based on Active Radar Sensing and Passive Reflectors: Concepts & Initial Results

    Authors: Pascal Schlachter, Zhibin Yu, Naveed Iqbal, Xiaofeng Wu, Sven Hinderer, Bin Yang

    Abstract: To navigate reliably in indoor environments, an industrial autonomous vehicle must know its position. However, current indoor vehicle positioning technologies either lack accuracy, usability or are too expensive. Thus, we propose a novel concept called local reference point assisted active radar positioning, which is able to overcome these drawbacks. It is based on distributing passive retroreflec… ▽ More

    Submitted 31 January, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted as a work-in-progress paper at the 13th International Conference on Indoor Positioning and Indoor Navigation (IPIN 2023)

    Journal ref: Proceedings of the Work-in-Progress Papers at the 13th International Conference on Indoor Positioning and Indoor Navigation (IPIN-WiP 2023), September 25 - 28, 2023, Nuremberg, Germany (https://ceur-ws.org/Vol-3581/)

  39. arXiv:2309.04672  [pdf, other

    eess.IV cs.CV

    SSHNN: Semi-Supervised Hybrid NAS Network for Echocardiographic Image Segmentation

    Authors: Renqi Chen, **g**g Luo, Fan Nian, Yuhui Cen, Yiheng Peng, Zekuan Yu

    Abstract: Accurate medical image segmentation especially for echocardiographic images with unmissable noise requires elaborate network design. Compared with manual design, Neural Architecture Search (NAS) realizes better segmentation results due to larger search space and automatic optimization, but most of the existing methods are weak in layer-wise feature aggregation and adopt a ``strong encoder, weak de… ▽ More

    Submitted 27 December, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: Accepted by ICASSP2024

  40. arXiv:2309.04204  [pdf, ps, other

    cs.NI eess.SP

    Task Offloading Optimization in Mobile Edge Computing under Uncertain Processing Cycles and Intermittent Communications

    Authors: Tao Deng, Zhanwei Yu, Di Yuan

    Abstract: Mobile edge computing (MEC) has been regarded as a promising approach to deal with explosive computation requirements by enabling cloud computing capabilities at the edge of networks. Existing models of MEC impose some strong assumptions on the known processing cycles and unintermittent communications. However, practical MEC systems are constrained by various uncertainties and intermittent communi… ▽ More

    Submitted 7 October, 2023; v1 submitted 8 September, 2023; originally announced September 2023.

  41. arXiv:2309.00907  [pdf, other

    eess.SP cs.LG

    A Multi-Head Ensemble Multi-Task Learning Approach for Dynamical Computation Offloading

    Authors: Ruihuai Liang, Bo Yang, Zhiwen Yu, Xuelin Cao, Derrick Wing Kwan Ng, Chau Yuen

    Abstract: Computation offloading has become a popular solution to support computationally intensive and latency-sensitive applications by transferring computing tasks to mobile edge servers (MESs) for execution, which is known as mobile/multi-access edge computing (MEC). To improve the MEC performance, it is required to design an optimal offloading strategy that includes offloading decision (i.e., whether o… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

  42. arXiv:2308.16476  [pdf, other

    eess.SY

    Multi-Stage Expansion Planning for Decarbonizing Thermal Generation Supported Renewable Power Systems Using Hydrogen and Ammonia Storage

    Authors: Zhipeng Yu, ** Lin, Feng Liu, Jiarong Li, Yingtian Chi, Yonghua Song, Zhengwei Ren

    Abstract: Large-scale centralized development of wind and solar energy and peer-to-grid transmission of renewable energy source (RES) via high voltage direct current (HVDC) has been regarded as one of the most promising ways to achieve goals of peak carbon and carbon neutrality in China. Traditionally, large-scale thermal generation is needed to economically support the load demand of HVDC with a given prof… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 10 pages, 8 figures

  43. arXiv:2308.12490  [pdf, other

    cs.CL cs.SD eess.AS

    MultiPA: A Multi-task Speech Pronunciation Assessment Model for Open Response Scenarios

    Authors: Yu-Wen Chen, Zhou Yu, Julia Hirschberg

    Abstract: Pronunciation assessment models designed for open response scenarios enable users to practice language skills in a manner similar to real-life communication. However, previous open-response pronunciation assessment models have predominantly focused on a single pronunciation task, such as sentence-level accuracy, rather than offering a comprehensive assessment in various aspects. We propose MultiPA… ▽ More

    Submitted 4 June, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: INTERSPEECH 2024

  44. arXiv:2308.08847  [pdf, other

    eess.AS cs.SD

    META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection

    Authors: **bo Hu, Yin Cao, Ming Wu, Feiran Yang, Ziying Yu, Wenwu Wang, Mark D. Plumbley, Jun Yang

    Abstract: For learning-based sound event localization and detection (SELD) methods, different acoustic environments in the training and test sets may result in large performance differences in the validation and evaluation stages. Different environments, such as different sizes of rooms, different reverberation times, and different background noise, may be reasons for a learning-based system to fail. On the… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Submitted to DCASE 2023 Workshop

  45. arXiv:2307.12264  [pdf, ps, other

    cs.NI eess.SP

    QoE-Driven Video Transmission: Energy-Efficient Multi-UAV Network Optimization

    Authors: Kesong Wu, Xianbin Cao, Peng Yang, Zongyang Yu, Dapeng Oliver Wu, Tony Q. S. Quek

    Abstract: This paper is concerned with the issue of improving video subscribers' quality of experience (QoE) by deploying a multi-unmanned aerial vehicle (UAV) network. Different from existing works, we characterize subscribers' QoE by video bitrates, latency, and frame freezing and propose to improve their QoE by energy-efficiently and dynamically optimizing the multi-UAV network in terms of serving UAV se… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

  46. arXiv:2307.05799  [pdf

    eess.IV cs.CV

    3D Medical Image Segmentation based on multi-scale MPU-Net

    Authors: Zeqiu. Yu, Shuo. Han, Ziheng. Song

    Abstract: The high cure rate of cancer is inextricably linked to physicians' accuracy in diagnosis and treatment, therefore a model that can accomplish high-precision tumor segmentation has become a necessity in many applications of the medical industry. It can effectively lower the rate of misdiagnosis while considerably lessening the burden on clinicians. However, fully automated target organ segmentation… ▽ More

    Submitted 24 July, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: 37 pages

  47. arXiv:2306.15686  [pdf, other

    eess.AS cs.CL

    Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning

    Authors: Zhongzhi Yu, Yang Zhang, Kaizhi Qian, Yonggan Fu, Yingyan Lin

    Abstract: Despite the impressive performance recently achieved by automatic speech recognition (ASR), we observe two primary challenges that hinder its broader applications: (1) The difficulty of introducing scalability into the model to support more languages with limited training, inference, and storage overhead; (2) The low-resource adaptation ability that enables effective low-resource adaptation while… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  48. arXiv:2306.13093  [pdf, ps, other

    eess.SP cs.ET

    Robust Divergence Angle for Inter-satellite Laser Communications under Target Deviation Uncertainty

    Authors: Zhanwei Yu, Yi Zhao, Di Yuan

    Abstract: Performance degradation due to target deviation by, for example, drift or jitter, presents a significant issue to inter-satellite laser communications. In particular, with periodic acquisition for positioning the satellite receiver, deviation may arise in the time period between two consecutive acquisition operations. One solution to mitigate the issue is to use a divergence angle at the transmitt… ▽ More

    Submitted 13 May, 2023; originally announced June 2023.

  49. arXiv:2306.04249  [pdf, other

    physics.med-ph cs.CV eess.IV

    DEMIST: A deep-learning-based task-specific denoising approach for myocardial perfusion SPECT

    Authors: Md Ashequr Rahman, Zitong Yu, Richard Laforest, Craig K. Abbey, Barry A. Siegel, Abhinav K. Jha

    Abstract: There is an important need for methods to process myocardial perfusion imaging (MPI) SPECT images acquired at lower radiation dose and/or acquisition time such that the processed images improve observer performance on the clinical task of detecting perfusion defects. To address this need, we build upon concepts from model-observer theory and our understanding of the human visual system to propose… ▽ More

    Submitted 25 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

  50. arXiv:2305.03546  [pdf, other

    eess.IV cs.CV

    Breast Cancer Immunohistochemical Image Generation: a Benchmark Dataset and Challenge Review

    Authors: Chuang Zhu, Shengjie Liu, Zekuan Yu, Feng Xu, Arpit Aggarwal, Germán Corredor, Anant Madabhushi, Qixun Qu, Hongwei Fan, Fangda Li, Yueheng Li, Xianchao Guan, Yongbing Zhang, Vivek Kumar Singh, Farhan Akram, Md. Mostafa Kamal Sarker, Zhongyue Shi, Mulan **

    Abstract: For invasive breast cancer, immunohistochemical (IHC) techniques are often used to detect the expression level of human epidermal growth factor receptor-2 (HER2) in breast tissue to formulate a precise treatment plan. From the perspective of saving manpower, material and time costs, directly generating IHC-stained images from Hematoxylin and Eosin (H&E) stained images is a valuable research direct… ▽ More

    Submitted 22 September, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: 12 pages, 12 figures, 2tables