Showing 1–25 of 25 results for author: Zeng, S

Searching in archive eess.
  1. arXiv:2406.14052  [pdf, other]

    eess.IV cs.CV

    Perspective+ Unet: Enhancing Segmentation with Bi-Path Fusion and Efficient Non-Local Attention for Superior Receptive Fields

    Authors: Jintong Hu, Siyan Chen, Zhiyi Pan, Sen Zeng, Wenming Yang

    Abstract: Precise segmentation of medical images is fundamental for extracting critical clinical information, which plays a pivotal role in enhancing the accuracy of diagnoses, formulating effective treatment plans, and improving patient outcomes. Although Convolutional Neural Networks (CNNs) and non-local attention methods have achieved notable success in medical image segmentation, they either struggle to…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 13 pages, 5 figures

  2. arXiv:2406.06086  [pdf, other]

    cs.SD eess.AS

    RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection

    Authors: Yujie Chen, Jiangyan Yi, Jun Xue, Chenglong Wang, Xiaohui Zhang, Shunbo Dong, Siding Zeng, Jianhua Tao, Lv Zhao, Cunhang Fan

    Abstract: Fake artefacts for discriminating between bonafide and fake audio can exist in both short- and long-range segments. Therefore, combining local and global feature information can effectively discriminate between bonafide and fake audio. This paper proposes an end-to-end bidirectional state space model, named RawBMamba, to capture both short- and long-range discriminative information for audio deepf…

    Submitted 18 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  3. arXiv:2404.12060  [pdf, other]

    eess.SP

    Environment-aware UAV Communications: CKM Construction and Predictive Beamforming

    Authors: Shiqi Zeng, Xiaoli Xu, Yong Zeng

    Abstract: Predictive millimeter-wave (mmWave) beamforming is a promising technique to enable low-latency and high-rate ground-air communications for cellular-connected unmanned aerial vehicles (UAVs). However, the high vulnerability of mmWave to blockages poses practical challenges to the implementation of such a technology. In this paper, we tackle the challenges by proposing a channel knowledge map (CKM)-…

    Submitted 18 April, 2024; originally announced April 2024.

  4. arXiv:2401.08149  [pdf, ps, other]

    cs.IT eess.SP

    Channel Estimation for Holographic Communications in Hybrid Near-Far Field

    Authors: Shaohua Yue, Shuhao Zeng, Liang Liu, Boya Di

    Abstract: To realize holographic communications, a potential technology for spectrum efficiency improvement in the future sixth-generation (6G) network, antenna arrays inlaid with numerous antenna elements will be deployed. However, the increase in antenna aperture size makes some users lie in the Fresnel region, leading to the hybrid near-field and far-field communication mode, where the conventional far-f…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 6 pages, 5 figures

  5. arXiv:2401.07791  [pdf, other]

    eess.SP

    Near-Far Field Channel Modeling for Holographic MIMO Using Expectation-Maximization Methods

    Authors: Houfeng Chen, Shuhao Zeng, Hao Guo, Tommy Svensson, Hongliang Zhang

    Abstract: Holographic Multiple-Input Multiple-Output (HMIMO), which densely integrates numerous antennas into a limited space, is anticipated to provide higher rates for future 6G wireless communications. The increase in antenna aperture size makes the near-field region enlarge, causing some users to be located in the near-field region. Thus, we are facing a hybrid near-field and far-field communication pro…

    Submitted 15 January, 2024; originally announced January 2024.

  6. arXiv:2312.09651  [pdf, other]

    cs.SD cs.CR cs.LG eess.AS

    What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection

    Authors: Xiaohui Zhang, Jiangyan Yi, Chenglong Wang, Chuyuan Zhang, Siding Zeng, Jianhua Tao

    Abstract: The rapid evolution of speech synthesis and voice conversion has raised substantial concerns due to the potential misuse of such technology, prompting a pressing need for effective audio deepfake detection mechanisms. Existing detection models have shown remarkable success in discriminating known deepfake audio, but struggle when encountering new attack types. To address this challenge, one of the…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by the main track of the 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024)

  7. arXiv:2310.08045  [pdf, other]

    cs.RO eess.SY

    Model Predictive Inferential Control of Neural State-Space Models for Autonomous Vehicle Motion Planning

    Authors: Iman Askari, Xumein Tu, Shen Zeng, Huazhen Fang

    Abstract: Model predictive control (MPC) has proven useful in enabling safe and optimal motion planning for autonomous vehicles. In this paper, we investigate how to achieve MPC-based motion planning when a neural state-space model represents the vehicle dynamics. As the neural state-space model will lead to highly complex, nonlinear and nonconvex optimization landscapes, mainstream gradient-based MPC metho…

    Submitted 19 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  8. arXiv:2310.07464  [pdf]

    eess.IV cs.LG q-bio.QM

    Deep Learning Predicts Biomarker Status and Discovers Related Histomorphology Characteristics for Low-Grade Glioma

    Authors: Zijie Fang, Yihan Liu, Yifeng Wang, Xiangyang Zhang, Yang Chen, Changjing Cai, Yiyang Lin, Ying Han, Zhi Wang, Shan Zeng, Hong Shen, Jun Tan, Yongbing Zhang

    Abstract: Biomarker detection is an indispensable part in the diagnosis and treatment of low-grade glioma (LGG). However, current LGG biomarker detection methods rely on expensive and complex molecular genetic testing, for which professionals are required to analyze the results, and intra-rater variability is often reported. To overcome these challenges, we propose an interpretable deep learning pipeline, a…

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 47 pages, 6 figures

  9. arXiv:2309.02106  [pdf, other]

    cs.CL cs.AI cs.LG eess.AS

    Leveraging Label Information for Multimodal Emotion Recognition

    Authors: Peiying Wang, Sunlu Zeng, Junqing Chen, Lu Fan, Meng Chen, Youzheng Wu, Xiaodong He

    Abstract: Multimodal emotion recognition (MER) aims to detect the emotional status of a given expression by combining the speech and text information. Intuitively, label information should be capable of helping the model locate the salient tokens/frames relevant to the specific emotion, which finally facilitates the MER task. Inspired by this, we propose a novel approach for MER by leveraging label informat…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted by Interspeech 2023

  10. arXiv:2305.06511  [pdf, other]

    eess.IV cs.CV

    ParamNet: A Parameter-variable Network for Fast Stain Normalization

    Authors: Hongtao Kang, Die Luo, Li Chen, Junbo Hu, Shenghua Cheng, Tingwei Quan, Shaoqun Zeng, Xiuli Liu

    Abstract: In practice, digital pathology images are often affected by various factors, resulting in very large differences in color and brightness. Stain normalization can effectively reduce the differences in color and brightness of digital pathology images, thus improving the performance of computer-aided diagnostic systems. Conventional stain normalization methods rely on one or several reference images,…

    Submitted 10 May, 2023; originally announced May 2023.

  11. arXiv:2211.03059  [pdf, other]

    eess.SP

    Intelligent Omni-Surfaces Aided Wireless Communications: Does the Reciprocity Hold?

    Authors: Shaohua Yue, Shuhao Zeng, Hongliang Zhang, Fenghan Lin, Liang Liu, Boya Di

    Abstract: Intelligent omni-surfaces (IOS) have attracted great attention recently due to their potential to achieve full-dimensional communications by simultaneously reflecting and refracting signals toward both sides of the surface. However, it remains an open question whether reciprocity holds between the uplink and downlink channels in IOS-aided wireless communications. In this work, we first…

    Submitted 6 November, 2022; originally announced November 2022.

    Comments: 5 pages, 6 figures

  12. arXiv:2207.02662  [pdf, other]

    cs.IT eess.SP

    Reconfigurable Refractive Surfaces: An Energy-Efficient Way to Holographic MIMO

    Authors: Shuhao Zeng, Hongliang Zhang, Boya Di, Haichao Qin, Xin Su, Lingyang Song

    Abstract: Holographic Multiple Input Multiple Output (HMIMO), which integrates massive antenna elements into a compact space to achieve a spatially continuous aperture, plays an important role in future wireless networks. With numerous antenna elements, it is hard to implement the HMIMO via phased arrays due to unacceptable power consumption. To address this issue, reconfigurable refractive surface (RRS) is…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: 5 pages, 4 figures

  13. Sampling-Based Nonlinear MPC of Neural Network Dynamics with Application to Autonomous Vehicle Motion Planning

    Authors: Iman Askari, Babak Badnava, Thomas Woodruff, Shen Zeng, Huazhen Fang

    Abstract: Control of machine learning models has emerged as an important paradigm for a broad range of robotics applications. In this paper, we present a sampling-based nonlinear model predictive control (NMPC) approach for control of neural network dynamics. We show its design in two parts: 1) formulating conventional optimization-based NMPC as a Bayesian state estimation problem, and 2) using particle fil…

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: To appear in 2022 American Control Conference (ACC)

    Journal ref: 2022 American Control Conference (ACC), 2022, pp. 2084-2090

  14. Nonlinear Model Predictive Control Based on Constraint-Aware Particle Filtering/Smoothing

    Authors: Iman Askari, Shen Zeng, Huazhen Fang

    Abstract: Nonlinear model predictive control (NMPC) has gained widespread use in many applications. Its formulation traditionally involves repetitively solving a nonlinear constrained optimization problem online. In this paper, we investigate NMPC through the lens of Bayesian estimation and highlight that the Monte Carlo sampling method can offer a favorable way to implement NMPC. We develop a constraint-aw…

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: Published in 2021 American Control Conference (ACC)

  15. arXiv:2110.11991  [pdf, other]

    eess.SY cs.LG

    A Reinforcement Learning Approach to Parameter Selection for Distributed Optimal Power Flow

    Authors: Sihan Zeng, Alyssa Kody, Youngdae Kim, Kibaek Kim, Daniel K. Molzahn

    Abstract: With the increasing penetration of distributed energy resources, distributed optimization algorithms have attracted significant attention for power systems applications due to their potential for superior scalability, privacy, and robustness to a single point-of-failure. The Alternating Direction Method of Multipliers (ADMM) is a popular distributed optimization algorithm; however, its convergence…

    Submitted 5 May, 2022; v1 submitted 22 October, 2021; originally announced October 2021.

  16. arXiv:2104.11091  [pdf, other]

    cs.IT eess.SP

    Trajectory Optimization and Resource Allocation for OFDMA UAV Relay Networks

    Authors: Shuhao Zeng, Hongliang Zhang, Boya Di, Lingyang Song

    Abstract: In this paper, we consider a single-cell multi-user orthogonal frequency division multiple access (OFDMA) network with one unmanned aerial vehicle (UAV), which works as an amplify-and-forward relay to improve the quality-of-service (QoS) of the user equipments (UEs) in the cell edge. Aiming to improve the throughput while guaranteeing the user fairness, we jointly optimize the communication mode,…

    Submitted 22 April, 2021; originally announced April 2021.

    Comments: 33 pages, 6 figures, to be published in IEEE Transactions on Wireless Communications

  17. StainNet: a fast and robust stain normalization network

    Authors: Hongtao Kang, Die Luo, Weihua Feng, Junbo Hu, Shaoqun Zeng, Tingwei Quan, Xiuli Liu

    Abstract: Stain normalization often refers to transferring the color distribution of the source image to that of the target image and has been widely used in biomedical image analysis. The conventional stain normalization is regarded as constructing a pixel-by-pixel color mapping model, which only depends on one reference image, and cannot accurately achieve the style transformation between image datasets.…

    Submitted 23 July, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

    Comments: 14 pages, 8 figures

    Journal ref: Front. Med. 8:746307 (2021)

  18. arXiv:2012.00909  [pdf, other]

    cs.CV eess.IV

    Visually Imperceptible Adversarial Patch Attacks on Digital Images

    Authors: Yaguan Qian, Jiamin Wang, Bin Wang, Shaoning Zeng, Zhaoquan Gu, Shouling Ji, Wassim Swaileh

    Abstract: The vulnerability of deep neural networks (DNNs) to adversarial examples has attracted more attention. Many algorithms have been proposed to craft powerful adversarial examples. However, most of these algorithms modified the global or local region of pixels without taking network explanations into account. Hence, the perturbations are redundant, which are easily detected by human eyes. In this pap…

    Submitted 27 April, 2021; v1 submitted 1 December, 2020; originally announced December 2020.

  19. arXiv:2009.09574  [pdf, other]

    eess.IV cs.CV

    Reconstruct high-resolution multi-focal plane images from a single 2D wide field image

    Authors: Jiabo Ma, Sibo Liu, Shenghua Cheng, Xiuli Liu, Li Cheng, Shaoqun Zeng

    Abstract: High-resolution 3D medical images are important for analysis and diagnosis, but axial scanning to acquire them is very time-consuming. In this paper, we propose a fast end-to-end multi-focal plane imaging network (MFPINet) to reconstruct high-resolution multi-focal plane images from a single 2D low-resolution wide field image without relying on scanning. To acquire realistic MFP images fast, the p…

    Submitted 20 September, 2020; originally announced September 2020.

    Comments: 9 pages, 4 figures, 3 tables

  20. arXiv:2007.01056  [pdf, other]

    eess.IV cs.LG

    Hyperspectral Image Denoising with Partially Orthogonal Matrix Vector Tensor Factorization

    Authors: Zhen Long, Yipeng Liu, Sixing Zeng, Jiani Liu, Fei Wen, Ce Zhu

    Abstract: Hyperspectral image (HSI) has some advantages over natural image for various applications due to the extra spectral information. During the acquisition, it is often contaminated by severe noises including Gaussian noise, impulse noise, deadlines, and stripes. The image quality degeneration would badly affect some applications. In this paper, we present an HSI restoration method named smooth and rob…

    Submitted 28 June, 2020; originally announced July 2020.

  21. arXiv:2001.00692  [pdf]

    cs.CV eess.IV q-bio.QM

    FFusionCGAN: An end-to-end fusion method for few-focus images using conditional GAN in cytopathological digital slides

    Authors: Xiebo Geng, Sibo Liu, Wei Han, Xu Li, Jiabo Ma, Jingya Yu, Xiuli Liu, Shaoqun Zeng, Li Chen, Shenghua Cheng

    Abstract: Multi-focus image fusion technologies compress different focus depth images into an image in which most objects are in focus. However, although existing image fusion techniques, including traditional algorithms and deep learning-based algorithms, can generate high-quality fused images, they need multiple images with different focus depths in the same field of view. This criterion may not be met in…

    Submitted 2 January, 2020; originally announced January 2020.

  22. arXiv:1909.12111  [pdf]

    cs.CV cs.LG eess.IV

    Two-stage Image Classification Supervised by a Single Teacher Single Student Model

    Authors: Jianhang Zhou, Shaoning Zeng, Bob Zhang

    Abstract: The two-stage strategy has been widely used in image classification. However, these methods barely take the classification criteria of the first stage into consideration in the second prediction stage. In this paper, we propose a novel two-stage representation method (TSR), and convert it to a Single-Teacher Single-Student (STSS) problem in our two-stage image classification framework. We seek the…

    Submitted 26 September, 2019; originally announced September 2019.

    Comments: Accepted by 30th British Machine Vision Conference (BMVC2019)

    Report number: #155

  23. arXiv:1909.05184  [pdf]

    eess.IV cs.CV

    Multi-stage domain adversarial style reconstruction for cytopathological image stain normalization

    Authors: Xihao Chen, Jingya Yu, Li Chen, Shaoqun Zeng, Xiuli Liu, Shenghua Cheng

    Abstract: The different stain styles of cytopathological images have a negative effect on the generalization ability of automated image analysis algorithms. This article proposes a new framework that normalizes the stain style for cytopathological images through a stain removal module and a multi-stage domain adversarial style reconstruction module. We convert colorful images into grayscale images with a co…

    Submitted 11 September, 2019; originally announced September 2019.

  24. arXiv:1904.11419  [pdf]

    stat.ML cs.LG eess.IV

    Time Series Simulation by Conditional Generative Adversarial Net

    Authors: Rao Fu, Jie Chen, Shutian Zeng, Yiping Zhuang, Agus Sudjianto

    Abstract: Generative Adversarial Net (GAN) has been proven to be a powerful machine learning tool in image data analysis and generation. In this paper, we propose to use Conditional Generative Adversarial Net (CGAN) to learn and simulate time series data. The conditions can be both categorical and continuous variables containing different kinds of auxiliary information. Our simulation studies show that CGAN…

    Submitted 25 April, 2019; originally announced April 2019.

  25. arXiv:1812.02800  [pdf, ps, other]

    eess.SP

    On Lossless Causal Compression of Periodic Signals

    Authors: Jan Maximilian Montenbruck, Shen Zeng

    Abstract: We present and study a scheme for lossless causal compression of periodic real-valued signals. In particular, our technique compresses a vector-valued signal to a scalar-valued signal by mixing it with another periodic signal. The conditions for being able to reconstruct the original signal then amount to certain non-resonances between the periods of the two signals. The proposed compression schem…

    Submitted 6 December, 2018; originally announced December 2018.