Skip to main content

Showing 1–50 of 67 results for author: Zeng, Z

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.14964  [pdf

    eess.SY

    Black Start Operation of Grid-Forming Converters Based on Generalized Three-phase Droop Control Under Unbalanced Conditions

    Authors: Zexian Zeng, Prajwal Bhagwat, Maryam Saeedifard, Dominic Groß

    Abstract: This paper focuses on the challenging task of bottom-up restoration in a complete blackout system using Grid-forming (GFM) converters. Challenges arise due to the limited current capability of power converters, resulting in distinct dynamic responses and fault current characteristics compared to synchronous generators. Additionally, GFM control needs to address the presence of unbalanced condition… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2402.17780  [pdf, other

    eess.SP cs.LG physics.med-ph

    Constraint Latent Space Matters: An Anti-anomalous Waveform Transformation Solution from Photoplethysmography to Arterial Blood Pressure

    Authors: Cheng Bian, Xiaoyu Li, Qi Bi, Guangpu Zhu, Jiegeng Lyu, Weile Zhang, Yelei Li, Zi**g Zeng

    Abstract: Arterial blood pressure (ABP) holds substantial promise for proactive cardiovascular health management. Notwithstanding its potential, the invasive nature of ABP measurements confines their utility primarily to clinical environments, limiting their applicability for continuous monitoring beyond medical facilities. The conversion of photoplethysmography (PPG) signals into ABP equivalents has garner… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted by AAAI-2024, main track

  3. arXiv:2312.15575  [pdf, other

    eess.IV cs.CV cs.LG

    Neural Born Series Operator for Biomedical Ultrasound Computed Tomography

    Authors: Zhijun Zeng, Yihang Zheng, Youjia Zheng, Yubing Li, Zuoqiang Shi, He Sun

    Abstract: Ultrasound Computed Tomography (USCT) provides a radiation-free option for high-resolution clinical imaging. Despite its potential, the computationally intensive Full Waveform Inversion (FWI) required for tissue property reconstruction limits its clinical utility. This paper introduces the Neural Born Series Operator (NBSO), a novel technique designed to speed up wave simulations, thereby facilita… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    ACM Class: I.4.5; J.3

  4. arXiv:2312.10052  [pdf, other

    eess.SP cs.LG

    ESTformer: Transformer Utilizing Spatiotemporal Dependencies for EEG Super-resolution

    Authors: Dongdong Li, Zhongliang Zeng, Zhe Wang, Hai Yang

    Abstract: Towards practical applications of Electroencephalography (EEG) data, lightweight acquisition devices, equipped with a few electrodes, result in a predicament where analysis methods can only leverage EEG data with extremely low spatial resolution. Recent methods mainly focus on using mathematical interpolation methods and Convolutional Neural Networks for EEG super-resolution (SR), but they suffer… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  5. arXiv:2311.12840  [pdf, other

    cs.CV cs.AI eess.IV

    Wafer Map Defect Patterns Semi-Supervised Classification Using Latent Vector Representation

    Authors: Qiyu Wei, Wei Zhao, Xiaoyan Zheng, Zeng Zeng

    Abstract: As the globalization of semiconductor design and manufacturing processes continues, the demand for defect detection during integrated circuit fabrication stages is becoming increasingly critical, playing a significant role in enhancing the yield of semiconductor products. Traditional wafer map defect pattern detection methods involve manual inspection using electron microscopes to collect sample i… ▽ More

    Submitted 6 October, 2023; originally announced November 2023.

    Comments: 6 pages, 2 figures, CIS confernece

  6. A Demand-Supply Cooperative Responding Strategy in Power System with High Renewable Energy Penetration

    Authors: Yuanzheng Li, Xinxin Long, Yang Li, Yizhou Ding, Tao Yang, Zhigang Zeng

    Abstract: Industrial demand response (IDR) plays an important role in promoting the utilization of renewable energy (RE) in power systems. However, it will lead to power adjustments on the supply side, which is also a non-negligible factor in affecting RE utilization. To comprehensively analyze this impact while enhancing RE utilization, this paper proposes a power demand-supply cooperative response (PDSCR)… ▽ More

    Submitted 1 December, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE Transactions on Control Systems Technology

    Journal ref: IEEE Transactions on Control Systems Technology 32 (2024) 874-890

  7. arXiv:2308.01040  [pdf, other

    cs.CR cs.SD eess.AS

    Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time

    Authors: Xinfeng Li, Chen Yan, Xuancun Lu, Zihan Zeng, Xiaoyu Ji, Wenyuan Xu

    Abstract: Automatic speech recognition (ASR) systems have been shown to be vulnerable to adversarial examples (AEs). Recent success all assumes that users will not notice or disrupt the attack process despite the existence of music/noise-like sounds and spontaneous responses from voice assistants. Nonetheless, in practical user-present scenarios, user awareness may nullify existing attack attempts that laun… ▽ More

    Submitted 12 September, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: Accepted by NDSS Symposium 2024. Please cite this paper as "Xinfeng Li, Chen Yan, Xuancun Lu, Zihan Zeng, Xiaoyu Ji, Wenyuan Xu. Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time. In Network and Distributed System Security (NDSS) Symposium 2024."

  8. Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Image Segmentation

    Authors: Ziyuan Zhao, Fangcheng Zhou, Zeng Zeng, Cuntai Guan, S. Kevin Zhou

    Abstract: Domain shift and label scarcity heavily limit deep learning applications to various medical image analysis tasks. Unsupervised domain adaptation (UDA) techniques have recently achieved promising cross-modality medical image segmentation by transferring knowledge from a label-rich source domain to an unlabeled target domain. However, it is also difficult to collect annotations from the source domai… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: Accepted by MICCAI 2022 (top 13% paper; early accept)

    Journal ref: Medical Image Computing and Computer Assisted Intervention, MICCAI 2022. Lecture Notes in Computer Science, vol 13435. Springer, Cham

  9. arXiv:2303.01927  [pdf, other

    cs.IT eess.SP

    A Generalized Nyquist-Shannon Sampling Theorem Using the Koopman Operator

    Authors: Zhexuan Zeng, Ye Yuan

    Abstract: The sampling theorem plays a fundamental role for the recovery of continuous-time signals from discrete-time samples in the field of signal processing. The sampling theorem of non-band-limited signals has evolved into one of the most challenging problems. In this work, a generalized sampling theorem -- which builds on the Koopman operator -- is proved for signals in generator-bounded space (Theore… ▽ More

    Submitted 6 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

  10. arXiv:2301.07414  [pdf

    eess.SY

    A Smart Adaptively Reconfigurable DC Battery for Higher Efficiency of Electric Vehicle Drive Trains

    Authors: Zhongxi Li, Aobo Yang, Gerry Chen, Nima Tashakor, Zhiyong Zeng, Angel V. Peterchev, Stefan M. Goetz

    Abstract: This paper proposes a drive train topology with a dynamically reconfigurable dc battery, which breaks hard-wired batteries into smaller subunits. It can rapidly control the output voltage and even contribute to voltage sha** of the inverter. Based upon the rapid development of low-voltage transistors and modular circuit topologies in the recent years, the proposed technology uses recent 48 V pow… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: 11 pages, 9 figures

  11. Federated Multi-Agent Deep Reinforcement Learning Approach via Physics-Informed Reward for Multi-Microgrid Energy Management

    Authors: Yuanzheng Li, Shangyang He, Yang Li, Yang Shi, Zhigang Zeng

    Abstract: The utilization of large-scale distributed renewable energy promotes the development of the multi-microgrid (MMG), which raises the need of develo** an effective energy management method to minimize economic costs and keep self energy-sufficiency. The multi-agent deep reinforcement learning (MADRL) has been widely used for the energy management problem because of its real-time scheduling ability… ▽ More

    Submitted 29 December, 2022; originally announced January 2023.

    Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems 35 (2024) 5902-5914

  12. arXiv:2212.03814  [pdf, other

    cs.CV cs.MM cs.SD eess.AS

    iQuery: Instruments as Queries for Audio-Visual Sound Separation

    Authors: Jiaben Chen, Renrui Zhang, Dongze Lian, Jiaqi Yang, Ziyao Zeng, Jianbo Shi

    Abstract: Current audio-visual separation methods share a standard architecture design where an audio encoder-decoder network is fused with visual encoding features at the encoder bottleneck. This design confounds the learning of multi-modal feature encoding with robust sound decoding for audio separation. To generalize to a new instrument: one must finetune the entire visual and audio network for all music… ▽ More

    Submitted 8 December, 2022; v1 submitted 7 December, 2022; originally announced December 2022.

  13. arXiv:2212.02078  [pdf, other

    eess.IV cs.AI cs.CV

    LE-UDA: Label-efficient unsupervised domain adaptation for medical image segmentation

    Authors: Ziyuan Zhao, Fangcheng Zhou, Kaixin Xu, Zeng Zeng, Cuntai Guan, S. Kevin Zhou

    Abstract: While deep learning methods hitherto have achieved considerable success in medical image segmentation, they are still hampered by two limitations: (i) reliance on large-scale well-labeled datasets, which are difficult to curate due to the expert-driven and time-consuming nature of pixel-level annotations in clinical practices, and (ii) failure to generalize from one domain to another, especially w… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted by IEEE Transactions on Medical Imaging, 2022

  14. arXiv:2211.05910  [pdf, other

    eess.IV cs.CV

    Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Maurizio Denna, Abdel Younes, Ganzorig Gankhuyag, **gang Huh, Myeong Kyun Kim, Kihwan Yoon, Hyeon-Cheol Moon, Seungho Lee, Yoonsik Choe, **woo Jeong, Sungjei Kim, Maciej Smyl, Tomasz Latkowski, Pawel Kubik, Michal Sokolski, Yujie Ma, Jiahao Chao, Zhou Zhou, Hongfan Gao, Zhengfeng Yang, Zhenbing Zeng, Zhengyang Zhuge, Chenghua Li , et al. (71 additional authors not shown)

    Abstract: Image super-resolution is a common task on mobile and IoT devices, where one often needs to upscale and enhance low-resolution images and video frames. While numerous solutions have been proposed for this problem in the past, they are usually not compatible with low-power mobile NPUs having many computational and memory constraints. In this Mobile AI challenge, we address this problem and propose… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: arXiv admin note: text overlap with arXiv:2105.07825, arXiv:2105.08826, arXiv:2211.04470, arXiv:2211.03885, arXiv:2211.05256

  15. TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training

    Authors: Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Zhen Zeng, Edward Xiao, **g Xiao

    Abstract: Non-parallel many-to-many voice conversion remains an interesting but challenging speech processing task. Recently, AutoVC, a conditional autoencoder based method, achieved excellent conversion results by disentangling the speaker identity and the speech content using information-constraining bottlenecks. However, due to the pure autoencoder training method, it is difficult to evaluate the separat… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: ASRU 6 pages

    Journal ref: 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2021, pp. 938-945

  16. arXiv:2207.10284  [pdf, other

    cs.LG cs.CL eess.SP

    Multi Resolution Analysis (MRA) for Approximate Self-Attention

    Authors: Zhanpeng Zeng, Sourav Pal, Jeffery Kline, Glenn M Fung, Vikas Singh

    Abstract: Transformers have emerged as a preferred model for many tasks in natural langugage processing and vision. Recent efforts on training and deploying Transformers more efficiently have identified many strategies to approximate the self-attention matrix, a key module in a Transformer architecture. Effective ideas include various prespecified sparsity patterns, low-rank basis expansions and combination… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: ICML2022

  17. ACT-Net: Asymmetric Co-Teacher Network for Semi-supervised Memory-efficient Medical Image Segmentation

    Authors: Ziyuan Zhao, Andong Zhu, Zeng Zeng, Bharadwaj Veeravalli, Cuntai Guan

    Abstract: While deep models have shown promising performance in medical image segmentation, they heavily rely on a large amount of well-annotated data, which is difficult to access, especially in clinical practice. On the other hand, high-accuracy deep models usually come in large model sizes, limiting their employment in real scenarios. In this work, we propose a novel asymmetric co-teacher framework, ACT-… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Journal ref: 2022 IEEE International Conference on Image Processing (ICIP)

  18. MMGL: Multi-Scale Multi-View Global-Local Contrastive learning for Semi-supervised Cardiac Image Segmentation

    Authors: Ziyuan Zhao, **xuan Hu, Zeng Zeng, Xulei Yang, Peisheng Qian, Bharadwaj Veeravalli, Cuntai Guan

    Abstract: With large-scale well-labeled datasets, deep learning has shown significant success in medical image segmentation. However, it is challenging to acquire abundant annotations in clinical practice due to extensive expertise requirements and costly labeling efforts. Recently, contrastive learning has shown a strong capacity for visual representation learning on unlabeled data, achieving impressive pe… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted by IEEE International Conference on Image Processing (ICIP 2022)

    Journal ref: 2022 IEEE International Conference on Image Processing (ICIP)

  19. Bio-inspired Intelligence with Applications to Robotics: A Survey

    Authors: Junfei Li, Zhe Xu, Danjie Zhu, Kevin Dong, Tao Yan, Zhu Zeng, Simon X. Yang

    Abstract: In the past decades, considerable attention has been paid to bio-inspired intelligence and its applications to robotics. This paper provides a comprehensive survey of bio-inspired intelligence, with a focus on neurodynamics approaches, to various robotic applications, particularly to path planning and control of autonomous robotic systems. Firstly, the bio-inspired shunting model and its variants… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  20. Residual Channel Attention Network for Brain Glioma Segmentation

    Authors: Yiming Yao, Peisheng Qian, Ziyuan Zhao, Zeng Zeng

    Abstract: A glioma is a malignant brain tumor that seriously affects cognitive functions and lowers patients' life quality. Segmentation of brain glioma is challenging because of interclass ambiguities in tumor regions. Recently, deep learning approaches have achieved outstanding performance in the automatic segmentation of brain glioma. However, existing algorithms fail to exploit channel-wise feature inte… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: Accepted by the 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2022)

    Journal ref: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  21. arXiv:2205.10757  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Deep Feature Fusion via Graph Convolutional Network for Intracranial Artery Labeling

    Authors: Yaxin Zhu, Peisheng Qian, Ziyuan Zhao, Zeng Zeng

    Abstract: Intracranial arteries are critical blood vessels that supply the brain with oxygenated blood. Intracranial artery labels provide valuable guidance and navigation to numerous clinical applications and disease diagnoses. Various machine learning algorithms have been carried out for automation in the anatomical labeling of cerebral arteries. However, the task remains challenging because of the high c… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: Accepted by the 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2022)

    Journal ref: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  22. Self-supervised Assisted Active Learning for Skin Lesion Segmentation

    Authors: Ziyuan Zhao, Wen**g Lu, Zeng Zeng, Kaixin Xu, Bharadwaj Veeravalli, Cuntai Guan

    Abstract: Label scarcity has been a long-standing issue for biomedical image segmentation, due to high annotation costs and professional requirements. Recently, active learning (AL) strategies strive to reduce annotation costs by querying a small portion of data for annotation, receiving much traction in the field of medical imaging. However, most of the existing AL methods have to initialize models with so… ▽ More

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: Accepted by the 44th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2022)

    Journal ref: 2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  23. arXiv:2204.14021  [pdf, ps, other

    eess.SY math.DS math.OC

    A Sampling Theorem for Exact Identification of Continuous-time Nonlinear Dynamical Systems

    Authors: Zhexuan Zeng, Zuogong Yue, Alexandre Mauroy, Jorge Goncalves, Ye Yuan

    Abstract: Low sampling frequency challenges the exact identification of the continuous-time (CT) dynamical system from sampled data, even when its model is identifiable. The necessary and sufficient condition is proposed -- which is built from Koopman operator -- to the exact identification of the CT system from sampled data. The condition gives a Nyquist-Shannon-like critical frequency for exact identifica… ▽ More

    Submitted 29 April, 2022; originally announced April 2022.

  24. arXiv:2204.13851  [pdf, other

    eess.IV cs.CV cs.LG

    COVID-Net US-X: Enhanced Deep Neural Network for Detection of COVID-19 Patient Cases from Convex Ultrasound Imaging Through Extended Linear-Convex Ultrasound Augmentation Learning

    Authors: E. Zhixuan Zeng, Adrian Florea, Alexander Wong

    Abstract: As the global population continues to face significant negative impact by the on-going COVID-19 pandemic, there has been an increasing usage of point-of-care ultrasound (POCUS) imaging as a low-cost and effective imaging modality of choice in the COVID-19 clinical workflow. A major barrier with widespread adoption of POCUS in the COVID-19 clinical workflow is the scarcity of expert clinicians that… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: 6 pages

  25. arXiv:2204.08661  [pdf

    eess.SP

    Dir-MUSIC Algorithm for DOA Estimation of Partial Discharge Based on Signal Strength represented by Antenna Gain Array Manifold

    Authors: Wencong Xu, Yandong Li, Bingshu Chen, Yue Hu, Jianxu Li, Zi**g Zeng

    Abstract: Inspection robots are widely used in the field of smart grid monitoring in substations, and partial discharge (PD) is an important sign of the insulation state of equipments. PD direction of arrival (DOA) algorithms using conventional beamforming and time difference of arrival (TDOA) require large-scale antenna arrays and high computational complexity, which make them difficult to implement on ins… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: 8 pages,13 figures,24 references

  26. Probabilistic Charging Power Forecast of EVCS: Reinforcement Learning Assisted Deep Learning Approach

    Authors: Yuanzheng Li, Shangyang He, Yang Li, Leijiao Ge, Suhua Lou, Zhigang Zeng

    Abstract: The electric vehicle (EV) and electric vehicle charging station (EVCS) have been widely deployed with the development of large-scale transportation electrifications. However, since charging behaviors of EVs show large uncertainties, the forecasting of EVCS charging power is non-trivial. This paper tackles this issue by proposing a reinforcement learning assisted deep learning framework for the pro… ▽ More

    Submitted 16 April, 2022; originally announced April 2022.

    Comments: Accepted by IEEE Transactions on Intelligent Vehicles

    Journal ref: IEEE Transactions on Intelligent Vehicles 8 (2023) 344-357

  27. arXiv:2204.03329  [pdf

    cs.RO eess.SY

    Information-driven Path Planning for Hybrid Aerial Underwater Vehicles

    Authors: Zheng Zeng, Chengke Xiong, Xinyi Yuan, Yulin Bai, Yufei **, Di Lu, Lian Lian

    Abstract: This paper presents a novel Rapidly-exploring Adaptive Sampling Tree (RAST) algorithm for the adaptive sampling mission of a hybrid aerial underwater vehicle (HAUV) in an air-sea 3D environment. This algorithm innovatively combines the tournament-based point selection sampling strategy, the information heuristic search process and the framework of Rapidly-exploring Random Tree (RRT) algorithm. Hen… ▽ More

    Submitted 8 April, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  28. Adaptive Mean-Residue Loss for Robust Facial Age Estimation

    Authors: Ziyuan Zhao, Peisheng Qian, Yubo Hou, Zeng Zeng

    Abstract: Automated facial age estimation has diverse real-world applications in multimedia analysis, e.g., video surveillance, and human-computer interaction. However, due to the randomness and ambiguity of the aging process, age assessment is challenging. Most research work over the topic regards the task as one of age regression, classification, and ranking problems, and cannot well leverage age distribu… ▽ More

    Submitted 31 March, 2022; originally announced March 2022.

    Comments: Accepted by IEEE International Conference on Multimedia and Expo (ICME 2022)

    Journal ref: 2022 IEEE International Conference on Multimedia and Expo (ICME)

  29. MT-UDA: Towards Unsupervised Cross-modality Medical Image Segmentation with Limited Source Labels

    Authors: Ziyuan Zhao, Kaixin Xu, Shumeng Li, Zeng Zeng, Cuntai Guan

    Abstract: The success of deep convolutional neural networks (DCNNs) benefits from high volumes of annotated data. However, annotating medical images is laborious, expensive, and requires human expertise, which induces the label scarcity problem. Especially when encountering the domain shift, the problem becomes more serious. Although deep unsupervised domain adaptation (UDA) can leverage well-established so… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Accept by MICCAI 2021, code at: https://github.com/jacobzhaoziyuan/MT-UDA

    Journal ref: Medical Image Computing and Computer Assisted Intervention, MICCAI 2021. Lecture Notes in Computer Science, vol 12901. Springer, Cham

  30. arXiv:2201.01895  [pdf, ps, other

    eess.SY

    Event-based EV Charging Scheduling in A Microgrid of Buildings

    Authors: Qilong Huang, Li Yang, Chen Hou, Zhiyong Zeng, Yaowen Qi

    Abstract: With the popularization of the electric vehicles (EVs), EV charging demand is becoming an important load in the building. Considering the mobility of EVs from building to building and their uncertain charging demand, it is of great practical interest to control the EV charging process in a microgrid of buildings to optimize the total operation cost while ensuring the transmission safety between th… ▽ More

    Submitted 5 September, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

  31. arXiv:2111.05133  [pdf, other

    eess.IV cs.CV

    Approaching the Limit of Image Rescaling via Flow Guidance

    Authors: Shang Li, Guixuan Zhang, Zhengxiong Luo, Jie Liu, Zhi Zeng, Shuwu Zhang

    Abstract: Image downscaling and upscaling are two basic rescaling operations. Once the image is downscaled, it is difficult to be reconstructed via upscaling due to the loss of information. To make these two processes more compatible and improve the reconstruction performance, some efforts model them as a joint encoding-decoding task, with the constraint that the downscaled (i.e. encoded) low-resolution (LR… ▽ More

    Submitted 8 January, 2023; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: BMVC 2021

  32. DeepGOMIMO: Deep Learning-Aided Generalized Optical MIMO with CSI-Free Blind Detection

    Authors: Xin Zhong, Chen Chen, Shu Fu, Zhihong Zeng, Min Liu

    Abstract: Generalized optical multiple-input multiple-output (GOMIMO) techniques have been recently shown to be promising for high-speed optical wireless communication (OWC) systems. In this paper, we propose a novel deep learning-aided GOMIMO (DeepGOMIMO) framework for GOMIMO systems, where channel state information (CSI)-free blind detection can be enabled by employing a specially designed deep neural net… ▽ More

    Submitted 8 October, 2021; originally announced October 2021.

  33. Two Eyes Are Better Than One: Exploiting Binocular Correlation for Diabetic Retinopathy Severity Grading

    Authors: Peisheng Qian, Ziyuan Zhao, Cong Chen, Zeng Zeng, Xiaoli Li

    Abstract: Diabetic retinopathy (DR) is one of the most common eye conditions among diabetic patients. However, vision loss occurs primarily in the late stages of DR, and the symptoms of visual impairment, ranging from mild to severe, can vary greatly, adding to the burden of diagnosis and treatment in clinical practice. Deep learning methods based on retinal images have achieved remarkable success in automa… ▽ More

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: Accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, IEEE EMBC 2021

    Journal ref: 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  34. arXiv:2108.06761  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Multi-Slice Dense-Sparse Learning for Efficient Liver and Tumor Segmentation

    Authors: Ziyuan Zhao, Zeyu Ma, Yanjie Liu, Zeng Zeng, Pierce KH Chow

    Abstract: Accurate automatic liver and tumor segmentation plays a vital role in treatment planning and disease monitoring. Recently, deep convolutional neural network (DCNNs) has obtained tremendous success in 2D and 3D medical image segmentation. However, 2D DCNNs cannot fully leverage the inter-slice information, while 3D DCNNs are computationally expensive and memory intensive. To address these issues, w… ▽ More

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: Accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, IEEE EMBC 2021

    Journal ref: 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  35. arXiv:2108.06086  [pdf, ps, other

    cs.IT eess.SP eess.SY

    A VCSEL Array Transmission System with Novel Beam Activation Mechanisms

    Authors: Zhihong Zeng, Mohammad Dehghani Soltani, Majid Safari, Harald Haas

    Abstract: Optical wireless communication (OWC) is considered to be a promising technology which will alleviate traffic burden caused by the increasing number of mobile devices. In this study, a novel vertical-cavity surface-emitting laser (VCSEL) array is proposed for indoor OWC systems. To activate the best beam for a mobile user, two beam activation methods are proposed for the system. The method based on… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: 30 pages, 15 figures, journal

  36. arXiv:2108.06025  [pdf, ps, other

    cs.IT eess.SP eess.SY

    Interference Mitigation using Optimized Angle Diversity Receiver in LiFi Cellular Network

    Authors: Zhihong Zeng, Chen Chen, Svetislav Savovi, Mohammad Dehghani Soltani, Cheng Chen, Majid Safari, Harald Haas

    Abstract: Light-fidelity (LiFi) is an emerging technology for high-speed short-range mobile communications. Inter-cell interference (ICI) is an important issue that limits the system performance in an optical attocell network. Angle diversity receivers (ADRs) have been proposed to mitigate ICI. In this paper, the structure of pyramid receivers (PRs) and truncated pyramid receivers (TPRs) are studied. The co… ▽ More

    Submitted 12 June, 2024; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: 15 pages, 16 figures, journal

  37. arXiv:2106.10401  [pdf

    eess.SP cs.LG

    Parallel frequency function-deep neural network for efficient complex broadband signal approximation

    Authors: Zhi Zeng, Pengpeng Shi, Fulei Ma, Peihan Qi

    Abstract: A neural network is essentially a high-dimensional complex map** model by adjusting network weights for feature fitting. However, the spectral bias in network training leads to unbearable training epochs for fitting the high-frequency components in broadband signals. To improve the fitting efficiency of high-frequency components, the PhaseDNN was proposed recently by combining complex frequency… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

  38. Hierarchical Consistency Regularized Mean Teacher for Semi-supervised 3D Left Atrium Segmentation

    Authors: Shumeng Li, Ziyuan Zhao, Kaixin Xu, Zeng Zeng, Cuntai Guan

    Abstract: Deep learning has achieved promising segmentation performance on 3D left atrium MR images. However, annotations for segmentation tasks are expensive, costly and difficult to obtain. In this paper, we introduce a novel hierarchical consistency regularized mean teacher framework for 3D left atrium segmentation. In each iteration, the student model is optimized by multi-scale deep supervision and hie… ▽ More

    Submitted 15 August, 2021; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: Accepted in 43rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, IEEE EMBC 2021

    Journal ref: 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  39. arXiv:2104.05418  [pdf, other

    cs.LG cs.CV cs.SD eess.AS eess.IV

    Contrastive Learning of Global-Local Video Representations

    Authors: Shuang Ma, Zhaoyang Zeng, Daniel McDuff, Yale Song

    Abstract: Contrastive learning has delivered impressive results for various tasks in the self-supervised regime. However, existing approaches optimize for learning representations specific to downstream scenarios, i.e., \textit{global} representations suitable for tasks such as classification or \textit{local} representations for tasks such as detection and localization. While they produce satisfactory resu… ▽ More

    Submitted 27 October, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

  40. arXiv:2102.10815  [pdf, other

    eess.AS

    LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

    Authors: Zhen Zeng, Jianzong Wang, Ning Cheng, **g Xiao

    Abstract: In this paper, we propose a novel conditional convolution network, named location-variable convolution, to model the dependencies of the waveform sequence. Different from the use of unified convolution kernels in WaveNet to capture the dependencies of arbitrary waveform, the location-variable convolution uses convolution kernels with different coefficients to perform convolution operations on diff… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: Accepted to ICASSP 2021. arXiv admin note: text overlap with arXiv:2012.01684

  41. arXiv:2101.09057  [pdf, other

    cs.CV cs.AI eess.IV

    DSAL: Deeply Supervised Active Learning from Strong and Weak Labelers for Biomedical Image Segmentation

    Authors: Ziyuan Zhao, Zeng Zeng, Kaixin Xu, Cen Chen, Cuntai Guan

    Abstract: Image segmentation is one of the most essential biomedical image processing problems for different imaging modalities, including microscopy and X-ray in the Internet-of-Medical-Things (IoMT) domain. However, annotating biomedical images is knowledge-driven, time-consuming, and labor-intensive, making it difficult to obtain abundant labels with limited costs. Active learning strategies come into ea… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: Published as a journal paper at IEEE J-BHI

  42. arXiv:2012.02626  [pdf, other

    eess.AS cs.CL cs.SD

    GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis

    Authors: Aolan Sun, Jianzong Wang, Ning Cheng, Huayi Peng, Zhen Zeng, Lingwei Kong, **g Xiao

    Abstract: This paper introduces a graphical representation approach of prosody boundary (GraphPB) in the task of Chinese speech synthesis, intending to parse the semantic and syntactic relationship of input sequences in a graphical domain for improving the prosody performance. The nodes of the graph embedding are formed by prosodic words, and the edges are formed by the other prosodic boundaries, namely pro… ▽ More

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: Accepted to SLT 2021

  43. arXiv:2012.01684  [pdf, other

    cs.SD cs.AI eess.AS

    MelGlow: Efficient Waveform Generative Network Based on Location-Variable Convolution

    Authors: Zhen Zeng, Jianzong Wang, Ning Cheng, **g Xiao

    Abstract: Recent neural vocoders usually use a WaveNet-like network to capture the long-term dependencies of the waveform, but a large number of parameters are required to obtain good modeling capabilities. In this paper, an efficient network, named location-variable convolution, is proposed to model the dependencies of waveforms. Different from the use of unified convolution kernels in WaveNet to capture t… ▽ More

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: will be presented in SLT 2021

  44. arXiv:2011.00101  [pdf, ps, other

    cs.CR cs.HC cs.LG eess.SP

    EEG-Based Brain-Computer Interfaces Are Vulnerable to Backdoor Attacks

    Authors: Lubin Meng, Jian Huang, Zhigang Zeng, Xue Jiang, Shan Yu, Tzyy-** Jung, Chin-Teng Lin, Ricardo Chavarriaga, Dongrui Wu

    Abstract: Research and development of electroencephalogram (EEG) based brain-computer interfaces (BCIs) have advanced rapidly, partly due to deeper understanding of the brain and wide adoption of sophisticated machine learning approaches for decoding the EEG signals. However, recent studies have shown that machine learning algorithms are vulnerable to adversarial attacks. This article proposes to use narrow… ▽ More

    Submitted 2 January, 2021; v1 submitted 30 October, 2020; originally announced November 2020.

    Journal ref: IEEE Transactions on Neural Systems and Rehabilitation Engineering, 2023

  45. Sea-Net: Squeeze-And-Excitation Attention Net For Diabetic Retinopathy Grading

    Authors: Ziyuan Zhao, Kartik Chopra, Zeng Zeng, Xiaoli Li

    Abstract: Diabetes is one of the most common disease in individuals. \textit{Diabetic retinopathy} (DR) is a complication of diabetes, which could lead to blindness. Automatic DR grading based on retinal images provides a great diagnostic and prognostic value for treatment planning. However, the subtle differences among severity levels make it difficult to capture important features using conventional metho… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: Accepted to ICIP 2020

    Journal ref: 2020 IEEE International Conference on Image Processing (ICIP), pp. 2496-2500

  46. arXiv:2008.05656  [pdf, other

    eess.AS cs.CL cs.SD

    Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit

    Authors: Zhen Zeng, Jianzong Wang, Ning Cheng, **g Xiao

    Abstract: Recent neural speech synthesis systems have gradually focused on the control of prosody to improve the quality of synthesized speech, but they rarely consider the variability of prosody and the correlation between prosody and semantics together. In this paper, a prosody learning mechanism is proposed to model the prosody of speech based on TTS system, where the prosody information of speech is ext… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: will be presented in INTERSPEECH 2020

  47. arXiv:2007.13284  [pdf

    cs.CV cs.LG eess.IV

    Research Progress of Convolutional Neural Network and its Application in Object Detection

    Authors: Wei Zhang, Zuoxiang Zeng

    Abstract: With the improvement of computer performance and the increase of data volume, the object detection based on convolutional neural network (CNN) has become the main algorithm for object detection. This paper summarizes the research progress of convolutional neural networks and their applications in object detection, and focuses on analyzing and discussing a specific idea and method of applying convo… ▽ More

    Submitted 26 July, 2020; originally announced July 2020.

    Comments: 11 pages, journal paper

    ACM Class: I.2

  48. arXiv:2007.03746  [pdf, ps, other

    eess.SP cs.HC cs.LG stat.ML

    Transfer Learning for Motor Imagery Based Brain-Computer Interfaces: A Complete Pipeline

    Authors: Dongrui Wu, Xue Jiang, Ruimin Peng, Wanzeng Kong, Jian Huang, Zhigang Zeng

    Abstract: Transfer learning (TL) has been widely used in motor imagery (MI) based brain-computer interfaces (BCIs) to reduce the calibration effort for a new subject, and demonstrated promising performance. While a closed-loop MI-based BCI system, after electroencephalogram (EEG) signal acquisition and temporal filtering, includes spatial filtering, feature engineering, and classification blocks before send… ▽ More

    Submitted 22 January, 2021; v1 submitted 3 July, 2020; originally announced July 2020.

    Journal ref: Neural Networks, 153:235-253, 2022

  49. arXiv:2006.01045  [pdf, other

    eess.SP cs.LG stat.ML

    A Hierarchical Deep Convolutional Neural Network and Gated Recurrent Unit Framework for Structural Damage Detection

    Authors: Jianxi Yang, Likai Zhang, Cen Chen, Yangfan Li, Ren Li, Gui** Wang, Shixin Jiang, Zeng Zeng

    Abstract: Structural damage detection has become an interdisciplinary area of interest for various engineering fields, while the available damage detection methods are being in the process of adapting machine learning concepts. Most machine learning based methods heavily depend on extracted ``hand-crafted" features that are manually selected in advance by domain experts and then, fixed. Recently, deep learn… ▽ More

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: The work has been accepted by Information Sciences!

  50. arXiv:2005.10407  [pdf, other

    eess.AS cs.LG cs.SD

    Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning

    Authors: Zhi** Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma

    Abstract: In this work, we study leveraging extra text data to improve low-resource end-to-end ASR under cross-lingual transfer learning setting. To this end, we extend our prior work [1], and propose a hybrid Transformer-LSTM based architecture. This architecture not only takes advantage of the highly effective encoding capacity of the Transformer network but also benefits from extra text data due to the L… ▽ More

    Submitted 28 May, 2020; v1 submitted 20 May, 2020; originally announced May 2020.