Skip to main content

Showing 1–50 of 96 results for author: He, Z

Searching in archive eess. Search in all archives.
.
  1. Generative Iris Prior Embedded Transformer for Iris Restoration

    Authors: Yubo Huang, Jia Wang, Peipei Li, Liuyu Xiang, Peigang Li, Zhaofeng He

    Abstract: Iris restoration from complexly degraded iris images, aiming to improve iris recognition performance, is a challenging problem. Due to the complex degradation, directly training a convolutional neural network (CNN) without prior cannot yield satisfactory results. In this work, we propose a generative iris prior embedded Transformer model (Gformer), in which we build a hierarchical encoder-decoder… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: Our code is available at https://github.com/sawyercharlton/Gformer

    Journal ref: 2023 IEEE International Conference on Multimedia and Expo (ICME), Brisbane, Australia, 2023, pp. 510-515

  2. arXiv:2407.00014  [pdf

    cs.RO eess.SY

    Simplifying Kinematic Parameter Estimation in sEMG Prosthetic Hands: A Two-Point Approach

    Authors: Gang Liu, Zhenxiang Wang, Ziyang He, Shanshan Guo, Rui Zhang, Dezhong Yao

    Abstract: Regression-based sEMG prosthetic hands are widely used for their ability to provide continuous kinematic parameters. However, establishing these models traditionally requires complex kinematic sensor systems to collect corresponding kinematic data in synchronization with EMG, which is cumbersome and user-unfriendly. This paper presents a simplified approach utilizing only two data points to depict… ▽ More

    Submitted 1 May, 2024; originally announced July 2024.

    Comments: 13 pages

  3. arXiv:2406.13705  [pdf, other

    eess.IV cs.AI cs.CV

    EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy

    Authors: Long Bai, Qiaozhi Tan, Tong Chen, Wan Jun Nah, Yanheng Li, Zhicheng He, Sishen Yuan, Zhen Chen, **lin Wu, Mobarakol Islam, Zhen Li, Hongbin Liu, Hongliang Ren

    Abstract: Wireless Capsule Endoscopy (WCE) is highly valued for its non-invasive and painless approach, though its effectiveness is compromised by uneven illumination from hardware constraints and complex internal dynamics, leading to overexposed or underexposed images. While researchers have discussed the challenges of low-light enhancement in WCE, the issue of correcting for different exposure levels rema… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: To appear in MICCAI 2024. Code and dataset availability: https://github.com/longbai1006/EndoUIC

  4. arXiv:2406.03888  [pdf, ps, other

    cs.IT eess.SP

    MSE-Based Training and Transmission Optimization for MIMO ISAC Systems

    Authors: Zhenyao He, Wei Xu, Hong Shen, Yonina C. Eldar, Xiaohu You

    Abstract: In this paper, we investigate a multiple-input multiple-output (MIMO) integrated sensing and communication (ISAC) system under typical block-fading channels. As a non-trivial extension to most existing works on ISAC, both the training and transmission signals sent by the ISAC transmitter are exploited for sensing. Specifically, we develop two training and transmission design schemes to minimize a… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2404.19547  [pdf, other

    eess.SY cs.MA math.OC

    Distributed Traffic Signal Control via Coordinated Maximum Pressure-plus-Penalty

    Authors: Vinzenz Tütsch, Zhiyu He, Florian Dörfler, Kenan Zhang

    Abstract: This paper develops an adaptive traffic control policy inspired by Maximum Pressure (MP) while imposing coordination across intersections. The proposed Coordinated Maximum Pressure-plus-Penalty (CMPP) control policy features a local objective for each intersection that consists of the total pressure within the neighborhood and a penalty accounting for the queue capacities and continuous green time… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  6. arXiv:2404.06265  [pdf, other

    cs.CV eess.IV

    Spatial-Temporal Multi-level Association for Video Object Segmentation

    Authors: Deshui Miao, Xin Li, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang

    Abstract: Existing semi-supervised video object segmentation methods either focus on temporal feature matching or spatial-temporal feature modeling. However, they do not address the issues of sufficient target interaction and efficient parallel processing simultaneously, thereby constraining the learning of dynamic, target-aware features. To tackle these limitations, this paper proposes a spatial-temporal m… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  7. arXiv:2404.04483  [pdf

    eess.IV cs.CV

    FastHDRNet: A new efficient method for SDR-to-HDR Translation

    Authors: Siyuan Tian, Hao Wang, Yiren Rong, Junhao Wang, Renjie Dai, Zhengxiao He

    Abstract: Modern displays nowadays possess the capability to render video content with a high dynamic range (HDR) and an extensive color gamut .However, the majority of available resources are still in standard dynamic range (SDR). Therefore, we need to identify an effective methodology for this objective.The existing deep neural networks (DNN) based SDR to HDR conversion methods outperforms conventional me… ▽ More

    Submitted 11 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: 16 pages, 4 figures

  8. arXiv:2404.04355  [pdf, other

    math.OC eess.SY

    Gray-Box Nonlinear Feedback Optimization

    Authors: Zhiyu He, Saverio Bolognani, Michael Muehlebach, Florian Dörfler

    Abstract: Feedback optimization enables autonomous optimality seeking of a dynamical system through its closed-loop interconnection with iterative optimization algorithms. Among various iteration structures, model-based approaches require the input-output sensitivity of the system to construct gradients, whereas model-free approaches bypass this need by estimating gradients from real-time evaluations of the… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  9. arXiv:2404.00481  [pdf, other

    stat.ML cs.LG eess.SY

    Convolutional Bayesian Filtering

    Authors: Wenhan Cao, Shiqi Liu, Chang Liu, Zeyu He, Stephen S. -T. Yau, Shengbo Eben Li

    Abstract: Bayesian filtering serves as the mainstream framework of state estimation in dynamic systems. Its standard version utilizes total probability rule and Bayes' law alternatively, where how to define and compute conditional probability is critical to state distribution inference. Previously, the conditional probability is assumed to be exactly known, which represents a measure of the occurrence proba… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  10. arXiv:2403.16252  [pdf, other

    cs.RO eess.SY

    Legged Robot State Estimation within Non-inertial Environments

    Authors: Zijian He, Sangli Teng, Tzu-Yuan Lin, Maani Ghaffari, Yan Gu

    Abstract: This paper investigates the robot state estimation problem within a non-inertial environment. The proposed state estimation approach relaxes the common assumption of static ground in the system modeling. The process and measurement models explicitly treat the movement of the non-inertial environments without requiring knowledge of its motion in the inertial frame or relying on GPS or sensing envir… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  11. arXiv:2403.06463  [pdf, other

    eess.SY

    A prediction-based forward-looking vehicle dispatching strategy for dynamic ride-pooling

    Authors: Xiaolei Wang, Chen Yang, Yuzhen Feng, Luohan Hu, Zhengbing He

    Abstract: For on-demand dynamic ride-pooling services, e.g., Uber Pool and Didi Pinche, a well-designed vehicle dispatching strategy is crucial for platform profitability and passenger experience. Most existing dispatching strategies overlook incoming pairing opportunities, therefore suffer from short-sighted limitations. In this paper, we propose a forward-looking vehicle dispatching strategy, which first… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  12. arXiv:2403.01153  [pdf, other

    eess.SP

    Transfer Learning-Enhanced Instantaneous Multi-Person Indoor Localization by CSI

    Authors: Zhiyuan He, Ke Deng, Jiangchao Gong, Yi Zhou, Desheng Wang

    Abstract: Passive indoor localization, integral to smart buildings, emergency response, and indoor navigation, has traditionally been limited by a focus on single-target localization and reliance on multi-packet CSI. We introduce a novel Multi-target loss, notably enhancing multi-person localization. Utilizing this loss function, our instantaneous CSI-ResNet achieves an impressive 99.21% accuracy at 0.6m pr… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  13. arXiv:2402.01104  [pdf, other

    eess.SY

    Simulation Framework for Vehicle and Electric Scooter Interaction

    Authors: Zhitong He, Lingxi Li

    Abstract: The number of shared micro-mobility services such as electric scooters (e-scooters) has an increasing trend due to the advantages of high efficiency and low cost in short-range travel in urban areas. However, due to the unique characteristics of moving behavior, it is commonly seen that e-scooters may share the road with other motor vehicles. The lack of protection may lead to severe injury for e-… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: The paper has been accepted by 26th IEEE International Conference on Intelligent Transportation Systems ITSC 2023

  14. arXiv:2401.14029  [pdf, other

    math.OC cs.LG eess.SY

    Towards a Systems Theory of Algorithms

    Authors: Florian Dörfler, Zhiyu He, Giuseppe Belgioioso, Saverio Bolognani, John Lygeros, Michael Muehlebach

    Abstract: Traditionally, numerical algorithms are seen as isolated pieces of code confined to an {\em in silico} existence. However, this perspective is not appropriate for many modern computational approaches in control, learning, or optimization, wherein {\em in vivo} algorithms interact with their environment. Examples of such {\em open algorithms} include various real-time optimization-based control str… ▽ More

    Submitted 30 April, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  15. arXiv:2401.09127  [pdf, other

    cs.IT eess.SP

    AI Empowered Channel Semantic Acquisition for 6G Integrated Sensing and Communication Networks

    Authors: Yifei Zhang, Zhen Gao, **g**g Zhao, Ziming He, Yunsheng Zhang, Chen Lu, Pei Xiao

    Abstract: Motivated by the need for increased spectral efficiency and the proliferation of intelligent applications, the sixth-generation (6G) mobile network is anticipated to integrate the dual-functions of communication and sensing (C&S). Although the millimeter wave (mmWave) communication and mmWave radar share similar multiple-input multiple-output (MIMO) architecture for integration, the full potential… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: 9 pages, 5 figures, accepted by the IEEE journal

  16. arXiv:2312.11302  [pdf, other

    cs.IT eess.SP

    AFDM-SCMA: A Promising Waveform for Massive Connectivity over High Mobility Channels

    Authors: Qu Luo, Pei Xiao, Zilong Liu, Ziwei Wan, Thomos Nikolaos, Zhen Gao, Ziming He

    Abstract: This paper studies the affine frequency division multiplexing (AFDM)-empowered sparse code multiple access (SCMA) system, referred to as AFDM-SCMA, for supporting massive connectivity in high-mobility environments. First, by placing the sparse codewords on the AFDM chirp subcarriers, the input-output (I/O) relation of AFDM-SCMA systems is presented. Next, we delve into the generalized receiver des… ▽ More

    Submitted 11 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  17. arXiv:2312.01679  [pdf, other

    eess.IV cs.CV cs.LG

    Adversarial Medical Image with Hierarchical Feature Hiding

    Authors: Qingsong Yao, Zecheng He, Yuexiang Li, Yi Lin, Kai Ma, Yefeng Zheng, S. Kevin Zhou

    Abstract: Deep learning based methods for medical images can be easily compromised by adversarial examples (AEs), posing a great security flaw in clinical decision-making. It has been discovered that conventional adversarial attacks like PGD which optimize the classification logits, are easy to distinguish in the feature space, resulting in accurate reactive defenses. To better understand this phenomenon an… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Our code is available at \url{https://github.com/qsyao/Hierarchical_Feature_Constraint}. arXiv admin note: text overlap with arXiv:2012.09501

  18. arXiv:2311.09408  [pdf, other

    math.OC eess.SY

    Decentralized Feedback Optimization via Sensitivity Decoupling: Stability and Sub-optimality

    Authors: Wenbin Wang, Zhiyu He, Giuseppe Belgioioso, Saverio Bolognani, Florian Dörfler

    Abstract: Online feedback optimization is a controller design paradigm for optimizing the steady-state behavior of a dynamical system. It employs an optimization algorithm as a dynamic feedback controller and utilizes real-time measurements to bypass knowing exact plant dynamics and disturbances. Different from existing centralized settings, we present a fully decentralized feedback optimization controller… ▽ More

    Submitted 28 March, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  19. arXiv:2307.14262  [pdf, other

    eess.IV cs.CV

    Artifact Restoration in Histology Images with Diffusion Probabilistic Models

    Authors: Zhenqi He, Junjun He, ** Ye, Yiqing Shen

    Abstract: Histological whole slide images (WSIs) can be usually compromised by artifacts, such as tissue folding and bubbles, which will increase the examination difficulty for both pathologists and Computer-Aided Diagnosis (CAD) systems. Existing approaches to restoring artifact images are confined to Generative Adversarial Networks (GANs), where the restoration process is formulated as an image-to-image t… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Accepted by MICCAI2023

  20. arXiv:2307.08051  [pdf, other

    eess.IV cs.CV

    TransNuSeg: A Lightweight Multi-Task Transformer for Nuclei Segmentation

    Authors: Zhenqi He, Mathias Unberath, **g Ke, Yiqing Shen

    Abstract: Nuclei appear small in size, yet, in real clinical practice, the global spatial information and correlation of the color or brightness contrast between nuclei and background, have been considered a crucial component for accurate nuclei segmentation. However, the field of automatic nuclei segmentation is dominated by Convolutional Neural Networks (CNNs), meanwhile, the potential of the recently pre… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: Early accepted by MICCAI2023

  21. arXiv:2307.07445  [pdf, other

    cs.NI cs.AI cs.LG eess.SY

    TSNet-SAC: Leveraging Transformers for Efficient Task Scheduling

    Authors: Ke Deng, Zhiyuan He, Hao Zhang, Haohan Lin, Desheng Wang

    Abstract: In future 6G Mobile Edge Computing (MEC), autopilot systems require the capability of processing multimodal data with strong interdependencies. However, traditional heuristic algorithms are inadequate for real-time scheduling due to their requirement for multiple iterations to derive the optimal scheme. We propose a novel TSNet-SAC based on Transformer, that utilizes heuristic algorithms solely to… ▽ More

    Submitted 16 June, 2023; originally announced July 2023.

  22. arXiv:2306.08417  [pdf, other

    cs.NI eess.SY

    A Novel Channel-Constrained Model for 6G Vehicular Networks with Traffic Spikes

    Authors: Ke Deng, Zhiyuan He, Haohan Lin, Hao Zhang, Desheng Wang

    Abstract: Mobile Edge Computing (MEC) holds excellent potential in Congestion Management (CM) of 6G vehicular networks. A reasonable schedule of MEC ensures a more reliable and efficient CM system. Unfortunately, existing parallel and sequential models cannot cope with scarce computing resources and constrained channels, especially during traffic rush hour. In this paper, we propose a channel-constrained mu… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  23. arXiv:2306.01210  [pdf

    eess.SP cs.CV

    A new method using deep transfer learning on ECG to predict the response to cardiac resynchronization therapy

    Authors: Zhuo He, Hong** Si, Xinwei Zhang, Qing-Hui Chen, Jiangang Zou, Weihua Zhou

    Abstract: Background: Cardiac resynchronization therapy (CRT) has emerged as an effective treatment for heart failure patients with electrical dyssynchrony. However, accurately predicting which patients will respond to CRT remains a challenge. This study explores the application of deep transfer learning techniques to train a predictive model for CRT response. Methods: In this study, the short-time Fourier… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  24. Synthetic Datasets for Autonomous Driving: A Survey

    Authors: Zhihang Song, Zimin He, Xingyu Li, Qiming Ma, Ruibo Ming, Zhiqi Mao, Huaxin Pei, Lihui Peng, Jianming Hu, Danya Yao, Yi Zhang

    Abstract: Autonomous driving techniques have been flourishing in recent years while thirsting for huge amounts of high-quality data. However, it is difficult for real-world datasets to keep up with the pace of changing requirements due to their expensive and time-consuming experimental and labeling costs. Therefore, more and more researchers are turning to synthetic datasets to easily generate rich and chan… ▽ More

    Submitted 27 February, 2024; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: 19 pages, 5 figures

    Journal ref: in IEEE Transactions on Intelligent Vehicles, vol. 9, no. 1, pp. 1847-1864, Jan. 2024

  25. arXiv:2304.07497  [pdf, other

    eess.SY

    Globally Composite-Learning-Based Intelligent Fast Finite-Time Control for Uncertain Strict-Feedback Systems with Nonlinearly Periodic Disturbances

    Authors: Xidong Wang, Zhan Li, Zhen He

    Abstract: This brief aims at the issue of globally composite-learning-based neural fast finite-time (F-FnT) tracking control for a class of uncertain systems in strict-feedback form subject to nonlinearly periodic disturbances. First, uncertain dynamics with periodic parameters are identified by incorporating Fourier series expansion (FSE) into an intelligent estimator, which leverages the feedback of newly… ▽ More

    Submitted 21 September, 2023; v1 submitted 15 April, 2023; originally announced April 2023.

    Comments: 5 pages, 3 figures

  26. arXiv:2302.09469  [pdf, ps, other

    cs.IT eess.SP

    Integrated sensing and full-duplex communication: Joint transceiver beamforming and power allocation

    Authors: Zhenyao He, Wei Xu, Hong Shen, Derrick Wing Kwan Ng, Yonina C. Eldar, Xiaohu You

    Abstract: Beamforming design has been widely investigated for integrated sensing and communication (ISAC) systems with full-duplex (FD) sensing and half-duplex (HD) communication. To achieve higher spectral efficiency, in this paper, we extend existing ISAC beamforming design by considering the FD capability for both radar and communication. Specifically, we consider an ISAC system, where the base station (… ▽ More

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2211.00229

  27. arXiv:2212.10901  [pdf, other

    cs.SD cs.CL cs.IR cs.MM eess.AS

    ALCAP: Alignment-Augmented Music Captioner

    Authors: Zihao He, Weituo Hao, Wei-Tsung Lu, Changyou Chen, Kristina Lerman, Xuchen Song

    Abstract: Music captioning has gained significant attention in the wake of the rising prominence of streaming media platforms. Traditional approaches often prioritize either the audio or lyrics aspect of the music, inadvertently ignoring the intricate interplay between the two. However, a comprehensive understanding of music necessitates the integration of both these elements. In this study, we delve into t… ▽ More

    Submitted 21 October, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

  28. arXiv:2212.00661  [pdf, other

    quant-ph eess.SY

    Hybrid Gate-Pulse Model for Variational Quantum Algorithms

    Authors: Zhiding Liang, Zhixin Song, **glei Cheng, Zichang He, Ji Liu, Hanrui Wang, Ruiyang Qin, Yiru Wang, Song Han, Xuehai Qian, Yiyu Shi

    Abstract: Current quantum programs are mostly synthesized and compiled on the gate-level, where quantum circuits are composed of quantum gates. The gate-level workflow, however, introduces significant redundancy when quantum gates are eventually transformed into control signals and applied on quantum devices. For superconducting quantum computers, the control signals are microwave pulses. Therefore, pulse-l… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 8 pages, 6 figures

  29. arXiv:2211.07472  [pdf

    physics.med-ph eess.SP

    A new method using machine learning to integrate ECG and gated SPECT MPI for Cardiac Resynchronization Therapy Decision Support on behalf of the VISION-CRT

    Authors: Fernando de A. Fernandes, Kristoffer Larsen, Zhuo He, Erivelton Nascimento, Amalia Peix, Qiuying Sha, Diana Paez, Ernest V. Garcia, Weihua Zhou, Claudio T Mesquita

    Abstract: Cardiac resynchronization therapy (CRT) has been established as an important therapy for heart failure. Mechanical dyssynchrony has the potential to predict responders to CRT. The aim of this study was to report the development and the validation of machine learning (ML) models which integrates ECG, gated SPECT MPI (GMPS) and clinical variables to predict patients' response to CRT. This analysis i… ▽ More

    Submitted 6 November, 2022; originally announced November 2022.

  30. arXiv:2211.05622  [pdf, other

    eess.IV cs.CV

    InstantGroup: Instant Template Generation for Scalable Group of Brain MRI Registration

    Authors: Ziyi He, Albert C. S. Chung

    Abstract: Template generation is a critical step in groupwise image registration, which involves aligning a group of subjects into a common space. While existing methods can generate high-quality template images, they often incur substantial time costs or are limited by fixed group scales. In this paper, we present InstantGroup, an efficient groupwise template generation framework based on variational autoe… ▽ More

    Submitted 26 June, 2024; v1 submitted 10 November, 2022; originally announced November 2022.

  31. arXiv:2211.00229  [pdf, ps, other

    cs.IT eess.SP

    Full-Duplex Communication for ISAC: Joint Beamforming and Power Optimization

    Authors: Zhenyao He, Wei Xu, Hong Shen, Derrick Wing Kwan Ng, Yonina C. Eldar, Xiaohu You

    Abstract: Beamforming design has been widely investigated for integrated sensing and communication (ISAC) systems with full-duplex (FD) sensing and half-duplex (HD) communication. To achieve higher spectral efficiency, in this paper, we extend existing ISAC beamforming design by considering the FD capability for both radar and communication. Specifically, we consider an ISAC system, where the BS performs ta… ▽ More

    Submitted 18 April, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

    Comments: Accepted to an IEEE Journal

  32. Electromagnetic Effective-Degree-of-Freedom Limit of a MIMO System in 2-D Inhomogeneous Environment

    Authors: Shuai S. A. Yuan, Zi He, Sheng Sun, Xiaoming Chen, Chongwen Huang, Wei E. I. Sha

    Abstract: Compared with a single-input-single-output (SISO) wireless communication system, the benefit of multiple-input-multiple-output (MIMO) technology originates from its extra degree of freedom (DOF), also referred as scattering channels or spatial electromagnetic (EM) modes, brought by spatial multiplexing. When the physical sizes of transmitting and receiving arrays are fixed, and there are sufficien… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Journal ref: Electronics 2022, 11(19), 3232

  33. arXiv:2210.01272  [pdf, ps, other

    cs.CV cs.LG eess.IV

    A systematic review of the use of Deep Learning in Satellite Imagery for Agriculture

    Authors: Brandon Victor, Zhen He, Aiden Nibali

    Abstract: Agricultural research is essential for increasing food production to meet the requirements of an increasing population in the coming decades. Recently, satellite technology has been improving rapidly and deep learning has seen much success in generic computer vision tasks and many application areas which presents an important opportunity to improve analysis of agricultural land. Here we present a… ▽ More

    Submitted 14 December, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 23 pages, 5 figures and 10 tables in main paper. Supplementary materials section also included in main pdf. Update: All tables with specific references have been moved to supplementary. Main text now uses only aggregated information

  34. arXiv:2209.12702  [pdf, other

    eess.AS cs.SD

    End-to-End Lyrics Recognition with Self-supervised Learning

    Authors: Xiangyu Zhang, Shuyue Stella Li, Zhanhong He, Roberto Togneri, Leibny Paola Garcia

    Abstract: Lyrics recognition is an important task in music processing. Despite traditional algorithms such as the hybrid HMM- TDNN model achieving good performance, studies on applying end-to-end models and self-supervised learning (SSL) are limited. In this paper, we first establish an end-to-end baseline for lyrics recognition and then explore the performance of SSL models on lyrics recognition task. We e… ▽ More

    Submitted 26 October, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: 4 pages, 2 figures, 3 tables

  35. Automatic reorientation by deep learning to generate short axis SPECT myocardial perfusion images

    Authors: Fubao Zhu, Guojie Wang, Chen Zhao, Saurabh Malhotra, Min Zhao, Zhuo He, Jianzhou Shi, Zhixin Jiang, Weihua Zhou

    Abstract: Single photon emission computed tomography (SPECT) myocardial perfusion images (MPI) can be displayed both in traditional short-axis (SA) cardiac planes and polar maps for interpretation and quantification. It is essential to reorient the reconstructed transaxial SPECT MPI into standard SA slices. This study is aimed to develop a deep-learning-based approach for automatic reorientation of MPI. Met… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: 27 pages,7 figures

  36. arXiv:2206.02425  [pdf, other

    eess.IV cs.CV

    mmFormer: Multimodal Medical Transformer for Incomplete Multimodal Learning of Brain Tumor Segmentation

    Authors: Yao Zhang, Nanjun He, Jiawei Yang, Yuexiang Li, Dong Wei, Yawen Huang, Yang Zhang, Zhiqiang He, Yefeng Zheng

    Abstract: Accurate brain tumor segmentation from Magnetic Resonance Imaging (MRI) is desirable to joint learning of multimodal images. However, in clinical practice, it is not always possible to acquire a complete set of MRIs, and the problem of missing modalities causes severe performance degradation in existing multimodal segmentation methods. In this work, we present the first attempt to exploit the Tran… ▽ More

    Submitted 4 August, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted to MICCAI 2022

  37. arXiv:2205.10651  [pdf, other

    eess.IV cs.LG cs.NE

    Tensor Shape Search for Optimum Data Compression

    Authors: Ryan Solgi, Zichang He, William Jiahua Liang, Zheng Zhang

    Abstract: Various tensor decomposition methods have been proposed for data compression. In real world applications of the tensor decomposition, selecting the tensor shape for the given data poses a challenge and the shape of the tensor may affect the error and the compression ratio. In this work, we study the effect of the tensor shape on the tensor decomposition and propose an optimization model to find an… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

  38. arXiv:2205.10605  [pdf, other

    q-bio.NC cs.CV eess.IV

    Brain Cortical Functional Gradients Predict Cortical Folding Patterns via Attention Mesh Convolution

    Authors: Li Yang, Zhibin He, Changhe Li, Junwei Han, Dajiang Zhu, Tianming Liu, Tuo Zhang

    Abstract: Since gyri and sulci, two basic anatomical building blocks of cortical folding patterns, were suggested to bear different functional roles, a precise map** from brain function to gyro-sulcal patterns can provide profound insights into both biological and artificial neural networks. However, there lacks a generic theory and effective computational model so far, due to the highly nonlinear relatio… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

  39. arXiv:2204.12264  [pdf, ps, other

    cs.IT eess.SP

    Energy Efficient Beamforming Optimization for Integrated Sensing and Communication

    Authors: Zhenyao He, Wei Xu, Hong Shen, Yongming Huang, Huahua Xiao

    Abstract: This paper investigates the optimization of beamforming design in a system with integrated sensing and communication (ISAC), where the base station (BS) sends signals for simultaneous multiuser communication and radar sensing. We aim at maximizing the energy efficiency (EE) of the multiuser communication while guaranteeing the sensing requirement in terms of individual radar beampattern gains. The… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: Accepted by IEEE WCL

  40. Leveraging RIS-Enabled Smart Signal Propagation for Solving Infeasible Localization Problems

    Authors: Kamran Keykhosravi, Benoit Denis, George C. Alexandropoulos, Zhongxia Simon He, Antonio Albanese, Vincenzo Sciancalepore, Henk Wymeersch

    Abstract: Reconfigurable intelligent surfaces (RISs) have tremendous potential for both communication and localization. While communication benefits are now well-understood, the breakthrough nature of the technology may well lie in its capability to provide location estimates when conventional approaches fail, (e.g., due to insufficient available infrastructure). A limited number of example scenarios have b… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

  41. arXiv:2202.10814  [pdf, other

    eess.SY

    Resilient Average Consensus: A Detection and Compensation Approach

    Authors: Wenzhe Zheng, Zhiyu He, Jian** He, Chengcheng Zhao, Chongrong Fang

    Abstract: We study the problem of resilient average consensus for multi-agent systems with misbehaving nodes. To protect consensus valuefrom being influenced by misbehaving nodes, we address this problem by detecting misbehaviors, mitigating the corresponding adverse impact and achieving the resilient average consensus. In this paper, general types of misbehaviors are considered,including deception attacks,… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  42. Model-Free Nonlinear Feedback Optimization

    Authors: Zhiyu He, Saverio Bolognani, Jian** He, Florian Dörfler, ** Guan

    Abstract: Feedback optimization is a control paradigm that enables physical systems to autonomously reach efficient operating points. Its central idea is to interconnect optimization iterations in closed-loop with the physical plant. Since iterative gradient-based methods are extensively used to achieve optimality, feedback optimization controllers typically require the knowledge of the steady-state sensiti… ▽ More

    Submitted 22 December, 2023; v1 submitted 7 January, 2022; originally announced January 2022.

    Comments: Published on IEEE Transactions on Automatic Control

  43. Electromagnetic Effective Degree of Freedom of a MIMO System in Free Space

    Authors: Shuai S. A. Yuan, Zi He, Xiaoming Chen, Chongwen Huang, Wei E. I. Sha

    Abstract: Effective degree of freedom (EDOF) of a multiple-input-multiple-output (MIMO) system represents its equivalent number of independent single-input-single-output (SISO) systems, which directly characterizes the communication performance. Traditional EDOF only considers single polarization, where the full polarized components degrade into two independent transverse components under the far-field appr… ▽ More

    Submitted 1 January, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: 5 pages, 5 figures

    Journal ref: IEEE Antennas and Wireless Propagation Letters, 2021

  44. Joint Sensing, Communication, and Computation Resource Allocation for Cooperative Perception in Fog-Based Vehicular Networks

    Authors: Xinran Zhang, Zhimin He, Yaohua Sun, Shuo Yuan, Mugen Peng

    Abstract: To enlarge the perception range and reliability of individual autonomous vehicles, cooperative perception has been received much attention. However, considering the high volume of shared messages, limited bandwidth and computation resources in vehicular networks become bottlenecks. In this paper, we investigate how to balance the volume of shared messages and constrained resources in fog-based veh… ▽ More

    Submitted 12 December, 2021; originally announced December 2021.

    Comments: Accepted by WCSP 2021

  45. arXiv:2111.15102  [pdf, other

    eess.SP

    Manifold Optimization Methods for Hybrid beamforming in mmWave Dual-Function Radar-Communication System

    Authors: Bowen Wang, Ziyang Cheng, Zishu He

    Abstract: As a cost-effective alternative, hybrid analog and digital beamforming architecture is a promising scheme for millimeter wave (mmWave) system. This paper considers two hybrid beamforming architectures, i.e. the partially-connected and fully-connected structures, for mmWave dual-function radar communication (DFRC) system, where the transmitter communicates with the downlink users and detects radar… ▽ More

    Submitted 4 September, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

  46. arXiv:2111.08380  [pdf, other

    cs.MM cs.SD eess.AS

    Video Background Music Generation with Controllable Music Transformer

    Authors: Shangzhe Di, Zeren Jiang, Si Liu, Zhaokai Wang, Leyan Zhu, Zexin He, Hongming Liu, Shuicheng Yan

    Abstract: In this work, we address the task of video background music generation. Some previous works achieve effective music generation but are unable to generate melodious music tailored to a particular video, and none of them considers the video-music rhythmic consistency. To generate the background music that matches the given video, we first establish the rhythmic relations between video and background… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted to ACM Multimedia 2021. Project website at https://wzk1015.github.io/cmt/

  47. One-Bit ADCs/DACs based MIMO Radar: Performance Analysis and Joint Design

    Authors: Minglong Deng, Ziyang Cheng, Linlong Wu, Bhavani Shankar, Zishu He

    Abstract: Extremely low-resolution (e.g. one-bit) analog-to-digital converters (ADCs) and digital-to-analog converters (DACs) can substantially reduce hardware cost and power consumption for MIMO radar especially with large scale antennas. In this paper, we focus on the detection performance analysis and joint design for the MIMO radar with one-bit ADCs and DACs. Specifically, under the assumption of low si… ▽ More

    Submitted 24 December, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

  48. arXiv:2110.05443  [pdf

    eess.IV cs.CV

    Spatial-temporal V-Net for automatic segmentation and quantification of right ventricles in gated myocardial perfusion SPECT images

    Authors: Chen Zhao, Shi Shi, Zhuo He, Cheng Wang, Zhongqiang Zhao, Xinli Li, Yanli Zhou, Weihua Zhou

    Abstract: Background. Functional assessment of right ventricle (RV) using gated myocardial perfusion single-photon emission computed tomography (MPS) heavily relies on the precise extraction of right ventricular contours. In this paper, we present a new deep-learning-based model integrating both the spatial and temporal features in gated MPS images to perform the segmentation of the RV epicardium and endoca… ▽ More

    Submitted 26 December, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: 15 pages, 8 figures

  49. arXiv:2109.05056  [pdf, other

    cs.CL cs.SD eess.AS

    Speaker Turn Modeling for Dialogue Act Classification

    Authors: Zihao He, Leili Tavabi, Kristina Lerman, Mohammad Soleymani

    Abstract: Dialogue Act (DA) classification is the task of classifying utterances with respect to the function they serve in a dialogue. Existing approaches to DA classification model utterances without incorporating the turn changes among speakers throughout the dialogue, therefore treating it no different than non-interactive written text. In this paper, we propose to integrate the turn changes in conversa… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

  50. arXiv:2107.09842  [pdf, other

    eess.IV cs.CV

    Modality-aware Mutual Learning for Multi-modal Medical Image Segmentation

    Authors: Yao Zhang, Jiawei Yang, Jiang Tian, Zhongchao Shi, Cheng Zhong, Yang Zhang, Zhiqiang He

    Abstract: Liver cancer is one of the most common cancers worldwide. Due to inconspicuous texture changes of liver tumor, contrast-enhanced computed tomography (CT) imaging is effective for the diagnosis of liver cancer. In this paper, we focus on improving automated liver tumor segmentation by integrating multi-modal CT images. To this end, we propose a novel mutual learning (ML) strategy for effective and… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.