Skip to main content

Showing 1–24 of 24 results for author: Zhuang, Z

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.08835  [pdf, other

    cs.SD eess.AS

    A Single-Step Non-Autoregressive Automatic Speech Recognition Architecture with High Accuracy and Inference Speed

    Authors: Ziyang Zhuang, Chenfeng Miao, Kun Zou, Shuai Gong, Ming Fang, Tao Wei, Zijian Li, Wei Hu, Shaojun Wang, **g Xiao

    Abstract: Non-autoregressive (NAR) automatic speech recognition (ASR) models predict tokens independently and simultaneously, bringing high inference speed. However, there is still a gap in the accuracy of the NAR models compared to the autoregressive (AR) models. To further narrow the gap between the NAR and AR models, we propose a single-step NAR ASR architecture with high accuracy and inference speed, ca… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2403.15448  [pdf, other

    eess.SP cs.LG

    What is Wrong with End-to-End Learning for Phase Retrieval?

    Authors: Wenjie Zhang, Yuxiang Wan, Zhong Zhuang, Ju Sun

    Abstract: For nonlinear inverse problems that are prevalent in imaging science, symmetries in the forward model are common. When data-driven deep learning approaches are used to solve such problems, these intrinsic symmetries can cause substantial learning difficulties. In this paper, we explain how such difficulties arise and, more importantly, how to overcome them by preprocessing the training set before… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  3. arXiv:2312.15721  [pdf, ps, other

    eess.SY

    UAV Trajectory Tracking via RNN-enhanced IMM-KF with ADS-B Data

    Authors: Yian Zhu, Ziye Jia, Qihui Wu, Chao Dong, Zirui Zhuang, Huiling Hu, Qi Cai

    Abstract: With the increasing use of autonomous unmanned aerial vehicles (UAVs), it is critical to ensure that they are continuously tracked and controlled, especially when UAVs operate beyond the communication range of ground stations (GSs). Conventional surveillance methods for UAVs, such as satellite communications, ground mobile networks and radars are subject to high costs and latency. The automatic de… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  4. arXiv:2312.00951  [pdf, other

    cs.RO eess.SY

    AV4EV: Open-Source Modular Autonomous Electric Vehicle Platform for Making Mobility Research Accessible

    Authors: Zhijie Qiao, Mingyan Zhou, Zhijun Zhuang, Tejas Agarwal, Felix Jahncke, Po-Jen Wang, Jason Friedman, Hongyi Lai, Divyanshu Sahu, Tomáš Nagy, Martin Endler, Jason Schlessman, Rahul Mangharam

    Abstract: When academic researchers develop and validate autonomous driving algorithms, there is a challenge in balancing high-performance capabilities with the cost and complexity of the vehicle platform. Much of today's research on autonomous vehicles (AV) is limited to experimentation on expensive commercial vehicles that require large skilled teams to retrofit the vehicles and test them in dedicated fac… ▽ More

    Submitted 12 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 pages, 5 figures

  5. arXiv:2309.07152  [pdf

    eess.SP physics.med-ph

    Novel Smart N95 Filtering Facepiece Respirator with Real-time Adaptive Fit Functionality and Wireless Humidity Monitoring for Enhanced Wearable Comfort

    Authors: Kangkyu Kwon, Yoon Jae Lee, Yeongju Jung, Ira Soltis, Chanyeong Choi, Yewon Na, Lissette Romero, Myung Chul Kim, Nathan Rodeheaver, Hodam Kim, Michael S. Lloyd, Ziqing Zhuang, William King, Susan Xu, Seung-Hwan Ko, **woo Lee, Woon-Hong Yeo

    Abstract: The widespread emergence of the COVID-19 pandemic has transformed our lifestyle, and facial respirators have become an essential part of daily life. Nevertheless, the current respirators possess several limitations such as poor respirator fit because they are incapable of covering diverse human facial sizes and shapes, potentially diminishing the effect of wearing respirators. In addition, the cur… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 20 pages, 5 figures, 1 table, submitted for possible publication

    MSC Class: 92C55

  6. arXiv:2307.10316  [pdf, other

    cs.CV eess.IV

    CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation

    Authors: Lizhao Liu, Zhuangwei Zhuang, Shangxin Huang, Xunlong Xiao, Tianhang Xiang, Cen Chen, **gdong Wang, Mingkui Tan

    Abstract: We study the task of weakly-supervised point cloud semantic segmentation with sparse annotations (e.g., less than 0.1% points are labeled), aiming to reduce the expensive cost of dense annotations. Unfortunately, with extremely sparse annotated points, it is very difficult to extract both contextual and object information for scene understanding such as semantic segmentation. Motivated by masked m… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV 2023

  7. arXiv:2306.12098  [pdf, other

    eess.SP cs.LG

    MSW-Transformer: Multi-Scale Shifted Windows Transformer Networks for 12-Lead ECG Classification

    Authors: Renjie Cheng, Zhemin Zhuang, Shuxin Zhuang, Lei Xie, **gfeng Guo

    Abstract: Automatic classification of electrocardiogram (ECG) signals plays a crucial role in the early prevention and diagnosis of cardiovascular diseases. While ECG signals can be used for the diagnosis of various diseases, their pathological characteristics exhibit minimal variations, posing a challenge to automatic classification models. Existing methods primarily utilize convolutional neural networks t… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

  8. arXiv:2305.19931  [pdf, other

    eess.SP

    Asymptotic Performance Analysis of Large-Scale Active IRS-Aided Wireless Network

    Authors: Yan Wang, Feng Shu, Zhihong Zhuang, Rongen Dong, Qi Zhang, Di Wu, Liang Yang, Jiangzhou Wang

    Abstract: In this paper, the dominant factor affecting the performance of active intelligent reflecting surface (IRS) aided wireless communication networks in Rayleigh fading channel, namely the average signal-to-noise ratio (SNR) $γ_0$ at IRS, is studied. Making use of the weak law of large numbers, its simple asymptotic expression is derived as the number $N$ of IRS elements goes to medium-scale and large… ▽ More

    Submitted 5 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  9. arXiv:2211.00799  [pdf, other

    cs.CV cs.LG eess.IV

    Practical Phase Retrieval Using Double Deep Image Priors

    Authors: Zhong Zhuang, David Yang, Felix Hofmann, David Barmherzig, Ju Sun

    Abstract: Phase retrieval (PR) concerns the recovery of complex phases from complex magnitudes. We identify the connection between the difficulty level and the number and variety of symmetries in PR problems. We focus on the most difficult far-field PR (FFPR), and propose a novel method using double deep image priors. In realistic evaluation, our method outperforms all competing methods by large margins. As… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

  10. arXiv:2208.09483  [pdf, other

    eess.IV cs.CV cs.LG eess.SP

    Blind Image Deblurring with Unknown Kernel Size and Substantial Noise

    Authors: Zhong Zhuang, Taihui Li, Hengkang Wang, Ju Sun

    Abstract: Blind image deblurring (BID) has been extensively studied in computer vision and adjacent fields. Modern methods for BID can be grouped into two categories: single-instance methods that deal with individual instances using statistical inference and numerical optimization, and data-driven methods that train deep-learning models to deblur future instances directly. Data-driven methods can be free fr… ▽ More

    Submitted 15 September, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

    Journal ref: International Journal of Computer Vision, 2023

  11. arXiv:2208.01227  [pdf, ps, other

    eess.SP

    Optimal Measurement of Drone Swarm in RSS-based Passive Localization with Region Constraints

    Authors: Xin Cheng, Feng Shu, Yifan Li, Zhihong Zhuang, Di Wu, Jiangzhou Wang

    Abstract: Passive geolocation by multiple unmanned aerial vehicles (UAVs) covers a wide range of military and civilian applications including rescue, wild life tracking and electronic warfare. The sensor-target geometry is known to significantly affect the localization precision. The existing sensor placement strategies mainly work on the cases without any constraints on the sensors locations. However, UAVs… ▽ More

    Submitted 7 August, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

  12. arXiv:2205.11346  [pdf, other

    eess.IV cs.CV

    Spatial Attention-based Implicit Neural Representation for Arbitrary Reduction of MRI Slice Spacing

    Authors: Xin Wang, Sheng Wang, Honglin Xiong, Kai Xuan, Zixu Zhuang, Mengjun Liu, Zhenrong Shen, Xiangyu Zhao, Lichi Zhang, Qian Wang

    Abstract: Magnetic resonance (MR) images collected in 2D clinical protocols typically have large inter-slice spacing, resulting in high in-plane resolution and reduced through-plane resolution. Super-resolution technique can enhance the through-plane resolution of MR images to facilitate downstream visualization and computer-aided diagnosis. However, most existing works train the super-resolution network at… ▽ More

    Submitted 19 March, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

  13. arXiv:2204.09411  [pdf, ps, other

    eess.SP

    Two Low-complexity DOA Estimators for Massive/Ultra-massive MIMO Receive Array

    Authors: Yiwen Chen, Xichao Zhan, Feng Shu, Qijuan Jie, Xin Cheng, Zhihong Zhuang, Jiangzhou Wang

    Abstract: Eigen-decomposition-based direction finding methods of using large-scale/ultra-large-scale fully-digital receive antenna arrays lead to a high or ultra-high complexity. To address the complexity dilemma, in this paper, three low-complexity estimators are proposed: partitioned subarray auto-correlation combining (PSAC), partitioned subarray cross-correlation combining (PSCC) and power iteration max… ▽ More

    Submitted 10 August, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  14. arXiv:2201.04318  [pdf, other

    eess.IV cs.CV

    Knee Cartilage Defect Assessment by Graph Representation and Surface Convolution

    Authors: Zixu Zhuang, Li** Si, Sheng Wang, Kai Xuan, Xi Ouyang, Yiqiang Zhan, Zhong Xue, Lichi Zhang, Dinggang Shen, Weiwu Yao, Qian Wang

    Abstract: Knee osteoarthritis (OA) is the most common osteoarthritis and a leading cause of disability. Cartilage defects are regarded as major manifestations of knee OA, which are visible by magnetic resonance imaging (MRI). Thus early detection and assessment for knee cartilage defects are important for protecting patients from knee OA. In this way, many attempts have been made on knee cartilage defect as… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

    Comments: 10 pages, 4 figures

  15. arXiv:2112.06074  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    Early Stop** for Deep Image Prior

    Authors: Hengkang Wang, Taihui Li, Zhong Zhuang, Tiancong Chen, Hengyue Liang, Ju Sun

    Abstract: Deep image prior (DIP) and its variants have showed remarkable potential for solving inverse problems in computer vision, without any extra training data. Practical DIP models are often substantially overparameterized. During the fitting process, these models learn mostly the desired visual content first, and then pick up the potential modeling and observational noise, i.e., overfitting. Thus, the… ▽ More

    Submitted 11 December, 2023; v1 submitted 11 December, 2021; originally announced December 2021.

    Comments: Published in TMLR (https://openreview.net/forum?id=231ZzrLC8X)

    Journal ref: Transactions on Machine Learning Research (TMLR), 2835-8856 (12/2023)

  16. arXiv:2110.12271  [pdf, other

    cs.CV cs.LG eess.IV eess.SP

    Self-Validation: Early Stop** for Single-Instance Deep Generative Priors

    Authors: Taihui Li, Zhong Zhuang, Hengyue Liang, Le Peng, Hengkang Wang, Ju Sun

    Abstract: Recent works have shown the surprising effectiveness of deep generative models in solving numerous image reconstruction (IR) tasks, even without training data. We call these models, such as deep image prior and deep decoder, collectively as single-instance deep generative priors (SIDGPs). The successes, however, often hinge on appropriate early stop** (ES), which by far has largely been handled… ▽ More

    Submitted 23 October, 2021; originally announced October 2021.

    Comments: To appear in British Machine Vision Conference (BMVC) 2021

  17. arXiv:2109.00154  [pdf, other

    cs.IT eess.SP

    DOA Estimation Using Massive Receive MIMO: Basic Principle and Key Techniques

    Authors: Jiangzhou Wang, Baihua Shi, Feng Shu, Qi Zhang, Di Wu, Qijuan Jie, Zhihong Zhuang, Siling Feng, Yi** Zhang

    Abstract: As massive multiple-input multiple-output (MIMO) becomes popular, direction of arrival (DOA) measurement has been made a real renaissance due to the high-resolution achieved. Thus, there is no doubt about DOA estimation using massive MIMO. The purpose of this paper is to describe its basic principles and key techniques, to present the performance analysis, and to appreciate its engineering applica… ▽ More

    Submitted 15 July, 2023; v1 submitted 31 August, 2021; originally announced September 2021.

  18. arXiv:2106.04812  [pdf, other

    cs.LG eess.IV

    Phase Retrieval using Single-Instance Deep Generative Prior

    Authors: Kshitij Tayal, Raunak Manekar, Zhong Zhuang, David Yang, Vipin Kumar, Felix Hofmann, Ju Sun

    Abstract: Several deep learning methods for phase retrieval exist, but most of them fail on realistic data without precise support information. We propose a novel method based on single-instance deep generative prior that works well on complex-valued crystal data.

    Submitted 22 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

  19. arXiv:1911.09887  [pdf, other

    eess.SP cs.IT

    UAV-enabled Secure Communication with Finite Blocklength

    Authors: Yuntian Wang, Xiaobo Zhou, Zhihong Zhuang, Linlin Sun, Yuwen Qian, **hui Lu, Feng Shu

    Abstract: In the finite blocklength scenario, which is suitable for practical applications, a method of maximizing the average effective secrecy rate (AESR) is proposed for a UAV-enabled secure communication by optimizing the UAV's trajectory and transmit power subject to the UAV's mobility constraints and transmit power constraints. To address the formulated non-convex optimization problem, it is first dec… ▽ More

    Submitted 26 March, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

  20. arXiv:1908.09289  [pdf, ps, other

    eess.SP

    UAV-Enabled Uplink Non-Orthogonal Multiple Access System: Joint Deployment and Power Control

    Authors: **hui Lu, Yuntian Wang, Tingting Liu, Zhihong Zhuang, Xiaobo Zhou, Feng Shu, Zhu Han

    Abstract: In order to overcome the inherent latency in multi-user unmanned aerial vehicle (UAV) networks with orthogonal multiple access (OMA). In this paper, we investigate the UAV enabled uplink non-orthogonal multiple access (NOMA) network, where a UAV is deployed to collect the messages transmitted by ground users. In order to maximize the sum rate of all users and to meet the quality of service (QoS) r… ▽ More

    Submitted 25 August, 2019; originally announced August 2019.

  21. arXiv:1907.06817  [pdf, ps, other

    cs.IT eess.SP

    Energy-efficient Alternating Iterative Secure Structure of Maximizing Secrecy Rate for Directional Modulation Networks

    Authors: Linlin Sun, Jiayu Li, Yu Zhang, Yuntian Wang, Linqing Gui, Fanyuan Li, Haochen Li, Zhihong Zhuang, Feng Shu

    Abstract: In a directional modulation (DM) network, the issues of security and privacy have taken on an increasingly important role. Since the power allocation of confidential message and artificial noise will make a constructive effect on the system performance, it is important to jointly consider the relationship between the beamforming vectors and the power allocation (PA) factors. To maximize the secrec… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

  22. arXiv:1905.04837  [pdf, ps, other

    cs.IT eess.SP

    Secure Hybrid Digital and Analog Precoder for mmWave Systems with low-resolution DACs and finite-quantized phase shifters

    Authors: Ling Xu, Feng Shu, Guiyang Xia, Yi** Zhang, Zhihong Zhuang, Jiangzhou Wang

    Abstract: Millimeter wave (mmWave) communication has been regarded as one of the most promising technologies for the future generation wireless networks because of its advantages of providing a ultra-wide new spectrum and ultra-high data transmission rate. To reduce the power consumption and circuit cost for mmWave systems, hybrid digital and analog (HDA) architecture is preferred in such a scenario. In thi… ▽ More

    Submitted 12 May, 2019; originally announced May 2019.

    Comments: 11 pages,7figures

  23. Power Allocation Strategies for Secure Spatial Modulation

    Authors: Guiyang Xia, Linqiong Jia, Yuwen Qian, Feng Shu, Zhihong Zhuang, Jiangzhou Wang

    Abstract: In secure spatial modulation (SM) networks, power allocation (PA) strategies are investigated in this paper under the total power constraint. Considering that there is no closed-form expression for secrecy rate (SR), an approximate closed-form expression of SR is presented, which is used as an efficient metric to optimize PA factor and can greatly reduce the computation complexity. Based on this e… ▽ More

    Submitted 1 August, 2018; originally announced August 2018.

  24. Underwater Source Localization Using TDOA and FDOA Measurements with Unknown Propagation Speed and Sensor Parameter Errors

    Authors: Bingbing Zhang, Yongchang Hu, Hongyi Wang, Zhaowen Zhuang

    Abstract: Underwater source localization problems are complicated and challenging: a) the sound propagation speed is often unknown and the unpredictable ocean current might lead to the uncertainties of sensor parameters (i.e. position and velocity); b) the underwater acoustic signal travels much slower than the radio one in terrestrial environments, thus resulting in a significantly severe Doppler effect; c… ▽ More

    Submitted 24 July, 2018; v1 submitted 10 July, 2018; originally announced July 2018.

    Journal ref: IEEE Access 6(1):36645-36661, 2018