Skip to main content

Showing 1–50 of 76 results for author: Hu, W

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.08835  [pdf, other

    cs.SD eess.AS

    A Single-Step Non-Autoregressive Automatic Speech Recognition Architecture with High Accuracy and Inference Speed

    Authors: Ziyang Zhuang, Chenfeng Miao, Kun Zou, Shuai Gong, Ming Fang, Tao Wei, Zijian Li, Wei Hu, Shaojun Wang, **g Xiao

    Abstract: Non-autoregressive (NAR) automatic speech recognition (ASR) models predict tokens independently and simultaneously, bringing high inference speed. However, there is still a gap in the accuracy of the NAR models compared to the autoregressive (AR) models. To further narrow the gap between the NAR and AR models, we propose a single-step NAR ASR architecture with high accuracy and inference speed, ca… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2405.20279  [pdf, other

    cs.CV cs.AI eess.IV

    CV-VAE: A Compatible Video VAE for Latent Generative Video Models

    Authors: Sijie Zhao, Yong Zhang, Xiaodong Cun, Shaoshu Yang, Muyao Niu, Xiaoyu Li, Wenbo Hu, Ying Shan

    Abstract: Spatio-temporal compression of videos, utilizing networks such as Variational Autoencoders (VAE), plays a crucial role in OpenAI's SORA and numerous other video generative models. For instance, many LLM-like video models learn the distribution of discrete tokens derived from 3D VAEs within the VQVAE framework, while most diffusion-based video models capture the distribution of continuous latent ex… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Project Page: https://ailab-cvc.github.io/cvvae/index.html

  3. arXiv:2405.18435  [pdf, other

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  4. arXiv:2404.01949  [pdf

    eess.SY

    Heuristic Optimization of Amplifier Reconfiguration Process for Autonomous Driving Optical Networks

    Authors: Qizhi Qiu, Xiaomin Liu, Yihao Zhang, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: We propose a heuristic-based optimization scheme for reliable optical amplifier reconfiguration process in ADON. In the experiment on a commercial testbed, the scheme prevents a 0.48-dB Q-factor degradation and outperforms 97.3% random solutions.

    Submitted 2 April, 2024; originally announced April 2024.

  5. arXiv:2403.10094  [pdf, other

    cs.CV eess.IV

    RangeLDM: Fast Realistic LiDAR Point Cloud Generation

    Authors: Qianjiang Hu, Zhimin Zhang, Wei Hu

    Abstract: Autonomous driving demands high-quality LiDAR data, yet the cost of physical LiDAR sensors presents a significant scaling-up challenge. While recent efforts have explored deep generative models to address this issue, they often consume substantial computational resources with slow generation speeds while suffering from a lack of realism. To address these limitations, we introduce RangeLDM, a novel… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  6. arXiv:2402.01860  [pdf, ps, other

    eess.SY

    Outlier Accommodation for GNSS Precise Point Positioning using Risk-Averse State Estimation

    Authors: Wang Hu, Jean-Bernard Uwineza, Jay A. Farrell

    Abstract: Reliable and precise absolute positioning is necessary in the realm of Connected Automated Vehicles (CAV). Global Navigation Satellite Systems (GNSS) provides the foundation for absolute positioning. Recently enhanced Precise Point Positioning (PPP) technology now offers corrections for GNSS on a global scale, with the potential to achieve accuracy suitable for real-time CAV applications. However,… ▽ More

    Submitted 13 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 7 pages,2 figures, Accepted by 2024 American Control Conference

  7. arXiv:2401.12173  [pdf, other

    eess.SP

    Waveform-Domain Complementary Signal Sets for Interrupted Sampling Repeater Jamming Suppression

    Authors: Hanning Su, Qinglong Bao, Jiameng Pan, Fucheng Guo, Weidong Hu

    Abstract: The interrupted-sampling repeater jamming (ISRJ) is coherent and has the characteristic of suppression and deception to degrade the radar detection capabilities. The study focuses on anti-ISRJ techniques in the waveform domain, primarily capitalizing on waveform design and and anti-jamming signal processing methods in the waveform domain. By exploring the relationship between waveform-domain adapt… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  8. arXiv:2310.04677  [pdf, other

    eess.IV cs.CV

    AG-CRC: Anatomy-Guided Colorectal Cancer Segmentation in CT with Imperfect Anatomical Knowledge

    Authors: Rongzhao Zhang, Zhian Bai, Ruoying Yu, Wenrao Pang, Lingyun Wang, Lifeng Zhu, Xiaofan Zhang, Huan Zhang, Weiguo Hu

    Abstract: When delineating lesions from medical images, a human expert can always keep in mind the anatomical structure behind the voxels. However, although high-quality (though not perfect) anatomical information can be retrieved from computed tomography (CT) scans with modern deep learning algorithms, it is still an open problem how these automatically generated organ masks can assist in addressing challe… ▽ More

    Submitted 30 November, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: under review

  9. arXiv:2309.12552  [pdf, other

    eess.SY

    Adaptive Model Predictive Control for Engine-Driven Ducted Fan Lift Systems using an Associated Linear Parameter Varying Model

    Authors: Hanjie Jiang, Ye Zhou, Hann Woei Ho, Wenjie Hu

    Abstract: Ducted fan lift systems (DFLSs) powered by two-stroke aviation piston engines present a challenging control problem due to their complex multivariable dynamics. Current controllers for these systems typically rely on proportional-integral algorithms combined with data tables, which rely on accurate models and are not adaptive to handle time-varying dynamics or system uncertainties. This paper prop… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  10. Building a digital twin of EDFA: a grey-box modeling approach

    Authors: Yichen Liu, Xiaomin Liu, Yihao Zhang, Meng Cai, Mengfan Fu, Xueying Zhong, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: To enable intelligent and self-driving optical networks, high-accuracy physical layer models are required. The dynamic wavelength-dependent gain effects of non-constant-pump erbium-doped fiber amplifiers (EDFAs) remain a crucial problem in terms of modeling, as it determines optical-to-signal noise ratio as well as the magnitude of fiber nonlinearities. Black-box data-driven models have been widel… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  11. arXiv:2307.03368  [pdf, other

    eess.SP

    Waveform-Domain Adaptive Matched Filtering for Suppressing Interrupted-Sampling Repeater Jamming

    Authors: Hanning Su, Qinglong Bao, Jiameng Pan, Fucheng Guo, Weidong Hu

    Abstract: The inadequate adaptability to flexible interference scenarios remains an unresolved challenge in the majority of techniques utilized for mitigating interrupted-sampling repeater jamming (ISRJ). Matched filtering system based methods is desirable to incorporate anti-ISRJ measures based on prior ISRJ modeling, either preceding or succeeding the matched filtering. Due to the partial matching nature… ▽ More

    Submitted 13 November, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

  12. arXiv:2307.01665  [pdf

    eess.SP

    Multicarrier Modulation-Based Digital Radio-over-Fibre System Achieving Unequal Bit Protection with Over 10 dB SNR Gain

    Authors: Yicheng Xu, Yixiao Zhu, Xiaobo Zeng, Mengfan Fu, Hexun Jiang, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: We propose a multicarrier modulation-based digital radio-over-fibre system achieving unequal bit protection by bit and power allocation for subcarriers. A theoretical SNR gain of 16.1 dB is obtained in the AWGN channel and the simulation results show a 13.5 dB gain in the bandwidth-limited case.

    Submitted 4 July, 2023; originally announced July 2023.

  13. arXiv:2303.15124  [pdf, other

    cs.CV cs.LG eess.IV

    Blind Inpainting with Object-aware Discrimination for Artificial Marker Removal

    Authors: Xuechen Guo, Wenhao Hu, Chiming Ni, Wenhao Chai, Shiyan Li, Gaoang Wang

    Abstract: Medical images often contain artificial markers added by doctors, which can negatively affect the accuracy of AI-based diagnosis. To address this issue and recover the missing visual contents, inpainting techniques are highly needed. However, existing inpainting methods require manual mask input, limiting their application scenarios. In this paper, we introduce a novel blind inpainting method that… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  14. arXiv:2212.00532  [pdf, other

    eess.IV cs.CV

    EBHI-Seg: A Novel Enteroscope Biopsy Histopathological Haematoxylin and Eosin Image Dataset for Image Segmentation Tasks

    Authors: Liyu Shi, Xiaoyan Li, Weiming Hu, Haoyuan Chen, **g Chen, Zizhen Fan, Minghe Gao, Yujie **g, Guotao Lu, Deguo Ma, Zhiyu Ma, Qingtao Meng, Dechao Tang, Hongzan Sun, Marcin Grzegorzek, Shouliang Qi, Yueyang Teng, Chen Li

    Abstract: Background and Purpose: Colorectal cancer is a common fatal malignancy, the fourth most common cancer in men, and the third most common cancer in women worldwide. Timely detection of cancer in its early stages is essential for treating the disease. Currently, there is a lack of datasets for histopathological image segmentation of rectal cancer, which often hampers the assessment accuracy when comp… ▽ More

    Submitted 6 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

  15. arXiv:2210.10349  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    Museformer: Transformer with Fine- and Coarse-Grained Attention for Music Generation

    Authors: Botao Yu, Peiling Lu, Rui Wang, Wei Hu, Xu Tan, Wei Ye, Shikun Zhang, Tao Qin, Tie-Yan Liu

    Abstract: Symbolic music generation aims to generate music scores automatically. A recent trend is to use Transformer or its variants in music generation, which is, however, suboptimal, because the full attention cannot efficiently model the typically long music sequences (e.g., over 10,000 tokens), and the existing models have shortcomings in generating musical repetition structures. In this paper, we prop… ▽ More

    Submitted 30 October, 2022; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted by the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  16. arXiv:2210.02448  [pdf

    cs.LG eess.SP

    TgDLF2.0: Theory-guided deep-learning for electrical load forecasting via Transformer and transfer learning

    Authors: Jiaxin Gao, Wenbo Hu, Dongxiao Zhang, Yuntian Chen

    Abstract: Electrical energy is essential in today's society. Accurate electrical load forecasting is beneficial for better scheduling of electricity generation and saving electrical energy. In this paper, we propose theory-guided deep-learning load forecasting 2.0 (TgDLF2.0) to solve this issue, which is an improved version of the theory-guided deep-learning framework for load forecasting via ensemble long… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

  17. arXiv:2207.13326  [pdf, other

    cs.CV eess.IV

    Point Cloud Attacks in Graph Spectral Domain: When 3D Geometry Meets Graph Signal Processing

    Authors: Daizong Liu, Wei Hu, Xin Li

    Abstract: With the increasing attention in various 3D safety-critical applications, point cloud learning models have been shown to be vulnerable to adversarial attacks. Although existing 3D attack methods achieve high success rates, they delve into the data space with point-wise perturbation, which may neglect the geometric characteristics. Instead, we propose point cloud attacks from a new perspective -- t… ▽ More

    Submitted 7 December, 2023; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). arXiv admin note: substantial text overlap with arXiv:2202.07261

  18. arXiv:2207.05706  [pdf

    eess.SP physics.optics

    Optical Field Recovery in Jones Space

    Authors: Qi Wu, Yixiao Zhu, Hexun Jiang, Qunbi Zhuge, Weisheng Hu

    Abstract: Optical full-field recovery makes it possible to compensate for fiber impairments such as chromatic dispersion and polarization mode dispersion (PMD) in the digital signal processing. For cost-sensitive short-reach optical networks, some advanced single-polarization (SP) optical field recovery schemes are recently proposed to avoid chromatic dispersion-induced power fading effect, and improve the… ▽ More

    Submitted 13 July, 2022; v1 submitted 22 June, 2022; originally announced July 2022.

    Comments: 8 pages and 9 figures

  19. arXiv:2206.13774  [pdf, other

    eess.SY

    Assessment of U.S. Department of Transportation Lane-Level Map for Connected Vehicle Applications

    Authors: Wang Hu, David Oswald, Guoyuan Wu, Jay A. Farrell

    Abstract: High-definition (Hi-Def) digital maps are an indispensable automated driving technology that is develo** rapidly. There are various commercial or governmental map products in the market. It is notable that the U.S. Department of Transportation (USDOT) map tool allows the user to create MAP and Signal Phase and Timing (SPaT) messages with free access. However, an analysis of the accuracy of this… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 6 pages, 6 figures

  20. arXiv:2206.06077  [pdf

    eess.SP

    Physics-informed EDFA Gain Model Based on Active Learning

    Authors: Xiaomin Liu, Yuli Chen, Yihao Zhang, Yichen Liu, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: We propose a physics-informed EDFA gain model based on the active learning method. Experimental results show that the proposed modelling method can reach a higher optimal accuracy and reduce ~90% training data to achieve the same performance compared with the conventional method.

    Submitted 13 June, 2022; originally announced June 2022.

  21. arXiv:2205.12843  [pdf, other

    eess.IV cs.CV

    A Comparative Study of Gastric Histopathology Sub-size Image Classification: from Linear Regression to Visual Transformer

    Authors: Weiming Hu, Haoyuan Chen, Wanli Liu, Xiaoyan Li, Hongzan Sun, Xinyu Huang, Marcin Grzegorzek, Chen Li

    Abstract: Gastric cancer is the fifth most common cancer in the world. At the same time, it is also the fourth most deadly cancer. Early detection of cancer exists as a guide for the treatment of gastric cancer. Nowadays, computer technology has advanced rapidly to assist physicians in the diagnosis of pathological pictures of gastric cancer. Ensemble learning is a way to improve the accuracy of algorithms,… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: text overlap with arXiv:2106.02473

  22. arXiv:2204.10704  [pdf, other

    cs.CV eess.IV

    SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite

    Authors: Runzhe Zhu, Ling Yin, Mingze Yang, Fei Wu, Yuncheng Yang, Wenbo Hu

    Abstract: Cross-view image matching aims to match images of the same target scene acquired from different platforms. With the rapid development of drone technology, cross-view matching by neural network models has been a widely accepted choice for drone position or navigation. However, existing public datasets do not include images obtained by drones at different heights, and the types of scenes are relativ… ▽ More

    Submitted 21 January, 2023; v1 submitted 22 April, 2022; originally announced April 2022.

  23. A3CLNN: Spatial, Spectral and Multiscale Attention ConvLSTM Neural Network for Multisource Remote Sensing Data Classification

    Authors: Heng-Chao Li, Wen-Shuai Hu, Wei Li, Jun Li, Qian Du, Antonio Plaza

    Abstract: The problem of effectively exploiting the information multiple data sources has become a relevant but challenging research topic in remote sensing. In this paper, we propose a new approach to exploit the complementarity of two data sources: hyperspectral images (HSIs) and light detection and ranging (LiDAR) data. Specifically, we develop a new dual-channel spatial, spectral and multiscale attentio… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: 16 pages, 10 figures

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems, vol. 33, no. 2, pp. 747-761, Feb. 2022

  24. arXiv:2202.13526  [pdf, other

    cs.LG eess.SP

    Sparse Graph Learning with Spectrum Prior for Deep Graph Convolutional Networks

    Authors: ** Zeng, Yang Liu, Gene Cheung, Wei Hu

    Abstract: A graph convolutional network (GCN) employs a graph filtering kernel tailored for data with irregular structures. However, simply stacking more GCN layers does not improve performance; instead, the output converges to an uninformative low-dimensional subspace, where the convergence rate is characterized by the graph spectrum -- this is the known over-smoothing problem in GCN. In this paper, we pro… ▽ More

    Submitted 2 November, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

  25. Automated Extraction of Energy Systems Information from Remotely Sensed Data: A Review and Analysis

    Authors: Simiao Ren, Wei Hu, Kyle Bradbury, Dylan Harrison-Atlas, Laura Malaguzzi Valeri, Brian Murray, Jordan M. Malof

    Abstract: High quality energy systems information is a crucial input to energy systems research, modeling, and decision-making. Unfortunately, actionable information about energy systems is often of limited availability, incomplete, or only accessible for a substantial fee or through a non-disclosure agreement. Recently, remotely sensed data (e.g., satellite imagery, aerial photography) have emerged as a po… ▽ More

    Submitted 2 October, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: This is only an Arxived version. For actual publication please refer to https://doi.org/10.1016/j.apenergy.2022.119876

    Journal ref: Applied Energy, 326, 119876 (2022)

  26. arXiv:2202.08552  [pdf, other

    eess.IV cs.CV

    EBHI:A New Enteroscope Biopsy Histopathological H&E Image Dataset for Image Classification Evaluation

    Authors: Weiming Hu, Chen Li, Xiaoyan Li, Md Mamunur Rahaman, Yong Zhang, Haoyuan Chen, Wanli Liu, Yudong Yao, Hongzan Sun, Ning Xu, Xinyu Huang, Marcin Grzegorze

    Abstract: Background and purpose: Colorectal cancer has become the third most common cancer worldwide, accounting for approximately 10% of cancer patients. Early detection of the disease is important for the treatment of colorectal cancer patients. Histopathological examination is the gold standard for screening colorectal cancer. However, the current lack of histopathological image datasets of colorectal c… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  27. arXiv:2202.07261  [pdf, other

    cs.CV eess.IV

    Exploring the Devil in Graph Spectral Domain for 3D Point Cloud Attacks

    Authors: Qianjiang Hu, Daizong Liu, Wei Hu

    Abstract: With the maturity of depth sensors, point clouds have received increasing attention in various applications such as autonomous driving, robotics, surveillance, etc., while deep point cloud learning models have shown to be vulnerable to adversarial attacks. Existing attack methods generally add/delete points or perform point-wise perturbation over point clouds to generate adversarial examples in th… ▽ More

    Submitted 26 December, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

  28. arXiv:2201.12576  [pdf, other

    cs.CV eess.IV

    Scale-arbitrary Invertible Image Downscaling

    Authors: **bo Xing, Wenbo Hu, Tien-Tsin Wong

    Abstract: Conventional social media platforms usually downscale the HR images to restrict their resolution to a specific size for saving transmission/storage cost, which leads to the super-resolution (SR) being highly ill-posed. Recent invertible image downscaling methods jointly model the downscaling/upscaling problems and achieve significant improvements. However, they only consider fixed integer scale fa… ▽ More

    Submitted 9 March, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

  29. Fast and accurate waveform modeling of long-haul multi-channel optical fiber transmission using a hybrid model-data driven scheme

    Authors: Hang Yang, Zekun Niu, Haochen Zhao, Shilin Xiao, Weisheng Hu, Lilin Yi

    Abstract: The modeling of optical wave propagation in optical fiber is a task of fast and accurate solving the nonlinear Schrödinger equation (NLSE), and can enable the optical system design, digital signal processing verification and fast waveform calculation. Traditional waveform modeling of full-time and full-frequency information is the split-step Fourier method (SSFM), which has long been regarded as c… ▽ More

    Submitted 16 May, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

    Comments: 8 pages, 5 figures, 1 table, 30 references

  30. arXiv:2111.15395  [pdf, other

    eess.IV physics.flu-dyn

    Two-dimensional flow field measurement of sediment-laden flow based on ultrasound image velocimetry

    Authors: Weiliang Tao, Yan Liu, Zhimin Ma, Wenbin Hu

    Abstract: This paper proposes a novel particle image velocimetry (PIV) technique to generate an instantaneous two-dimensional velocity field for sediment-laden fluid based on the optical flow algorithm of ultrasound imaging. In this paper, an ultrasonic PIV (UPIV) system is constructed by integrating a medical ultrasound instrument and an ultrasonic particle image velocimetry algorithm. The medical ultrasou… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

    Comments: 18 pages, 11 figures, 2 tables, technology manuscript

  31. arXiv:2111.10990  [pdf, other

    cs.CV eess.IV

    Imperceptible Transfer Attack and Defense on 3D Point Cloud Classification

    Authors: Daizong Liu, Wei Hu

    Abstract: Although many efforts have been made into attack and defense on the 2D image domain in recent years, few methods explore the vulnerability of 3D models. Existing 3D attackers generally perform point-wise perturbation over point clouds, resulting in deformed structures or outliers, which is easily perceivable by humans. Moreover, their adversarial examples are generated under the white-box setting,… ▽ More

    Submitted 24 March, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

  32. Using PPP Information to Implement a Global Real-Time Virtual Network DGNSS Approach

    Authors: Wang Hu, Ashim Neupane, Jay A. Farrell

    Abstract: Differential GNSS (DGNSS) has been demonstrated to provide reliable, high-quality range correction information enabling real-time navigation with centimeter to sub-meter accuracy, which is required for applications such as connected and autonomous vehicles. However, DGNSS requires a local reference station near each user. For a continental or global scale implementation, this information dissemina… ▽ More

    Submitted 28 June, 2022; v1 submitted 22 September, 2021; originally announced October 2021.

    Comments: 14 pages, 8 tables, 4 figures, Code and data are available at https://github.com/Azurehappen/Virtual-Network-DGNSS-Project

    Journal ref: in IEEE Transactions on Vehicular Technology, vol. 71, no. 10, pp. 10337-10349, Oct. 2022

  33. arXiv:2108.06884  [pdf, other

    eess.SP cs.NI

    Seirios: Leveraging Multiple Channels for LoRaWAN Indoor and Outdoor Localization

    Authors: Jun Liu, Jiayao Gao, Sanjay Jha, Wen Hu

    Abstract: Localization is important for a large number of Internet of Things (IoT) endpoint devices connected by LoRaWAN. Due to the bandwidth limitations of LoRaWAN, existing localization methods without specialized hardware (e.g., GPS) produce poor performance. To increase the localization accuracy, we propose a super-resolution localization method, called Seirios, which features a novel algorithm to sync… ▽ More

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: MOBICOM 2021

  34. arXiv:2107.11113  [pdf, ps, other

    cs.CL cs.LG cs.SD eess.AS

    OLR 2021 Challenge: Datasets, Rules and Baselines

    Authors: Binling Wang, Wenxuan Hu, **g Li, Yiming Zhi, Zheng Li, Qingyang Hong, Lin Li, Dong Wang, Liming Song, Cheng Yang

    Abstract: This paper introduces the sixth Oriental Language Recognition (OLR) 2021 Challenge, which intends to improve the performance of language recognition systems and speech recognition systems within multilingual scenarios. The data profile, four tasks, two baselines, and the evaluation principles are introduced in this paper. In addition to the Language Identification (LID) tasks, multilingual Automat… ▽ More

    Submitted 23 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: text overlap with arXiv:2006.03473, arXiv:1907.07626, arXiv:1806.00616, arXiv:1706.09742

  35. arXiv:2107.06374  [pdf, ps, other

    math.OC eess.SY

    Bilinear Control of Convection-Cooling: From Open-Loop to Closed-Loop

    Authors: Weiwei Hu, Jun Liu, Zhu Wang

    Abstract: This paper is concerned with a bilinear control problem for enhancing convection-cooling via an incompressible velocity field. Both optimal open-loop control and closed-loop feedback control designs are addressed. First and second order optimality conditions for characterizing the optimal solution are discussed. In particular, the method of instantaneous control is applied to establish the feedbac… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: 27 pages, 7 figures, 3 tables

    MSC Class: 49M41; 35Q93

  36. arXiv:2105.11689  [pdf, other

    cs.LG cs.SI eess.SP

    Self-Supervised Graph Representation Learning via Topology Transformations

    Authors: Xiang Gao, Wei Hu, Guo-Jun Qi

    Abstract: We present the Topology Transformation Equivariant Representation learning, a general paradigm of self-supervised learning for node representations of graph data to enable the wide applicability of Graph Convolutional Neural Networks (GCNNs). We formalize the proposed model from an information-theoretic perspective, by maximizing the mutual information between topology transformations and node rep… ▽ More

    Submitted 2 December, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: Accepted to IEEE Transactions on Knowledge and Data Engineering (TKDE)

  37. arXiv:2105.08350  [pdf, other

    cs.MM cs.CR eess.IV

    Generic Reversible Visible Watermarking Via Regularized Graph Fourier Transform Coding

    Authors: Wenfa Qi, Sirui Guo, Wei Hu

    Abstract: Reversible visible watermarking (RVW) is an active copyright protection mechanism. It not only transparently superimposes copyright patterns on specific positions of digital images or video frames to declare the copyright ownership information, but also completely erases the visible watermark image and thus enables restoring the original host image without any distortion. However, existing RVW alg… ▽ More

    Submitted 26 November, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

    Comments: This manuscript is accepted to IEEE Transactions on Image Processing on November 21th 2021. It has 15 pages, 12 figures and 4 tables

  38. arXiv:2104.06243  [pdf, other

    eess.IV cs.CV cs.LG

    A State-of-the-art Survey of Artificial Neural Networks for Whole-slide Image Analysis:from Popular Convolutional Neural Networks to Potential Visual Transformers

    Authors: Xintong Li, Weiming Hu, Chen Li, Tao Jiang, Hongzan Sun, Xiaoyan Li, Xinyu Huang, Marcin Grzegorzek

    Abstract: To increase the objectivity and accuracy of pathologists' work, artificial neural network(ANN) methods have been generally needed in the segmentation, classification, and detection of histopathological WSI. In this paper, WSI analysis methods based on ANN are reviewed. Firstly, the development status of WSI and ANN methods is introduced. Secondly, we summarize the common ANN methods. Next, we disc… ▽ More

    Submitted 26 February, 2022; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: 22 pages, 38 figures. arXiv admin note: substantial text overlap with arXiv:2102.10553

  39. First arrival picking using U-net with Lovasz loss and nearest point picking method

    Authors: Pengyu Yuan, Wenyi Hu, Xuqing Wu, Jiefu Chen, Hien Van Nguyen

    Abstract: We proposed a robust segmentation and picking workflow to solve the first arrival picking problem for seismic signal processing. Unlike traditional classification algorithm, image segmentation method can utilize the location information by outputting a prediction map which has the same size of the input image. A parameter-free nearest point picking algorithm is proposed to further improve the accu… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

  40. An Interpretable Map** from a Communication System to a Neural Network for Optimal Transceiver-Joint Equalization

    Authors: Zhiqun Zhai, Hexun Jiang, Mengfan Fu, Lei Liu, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: In this paper, we propose a scheme that utilizes the optimization ability of artificial intelligence (AI) for optimal transceiver-joint equalization in compensating for the optical filtering impairments caused by wavelength selective switches (WSS). In contrast to adding or replacing a certain module of existing digital signal processing (DSP), we exploit the similarity between a communication sys… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

  41. arXiv:2103.09455  [pdf, other

    cs.CV eess.IV

    Prediction-assistant Frame Super-Resolution for Video Streaming

    Authors: Wang Shen, Wenbo Bao, Guangtao Zhai, Charlie L Wang, Jerry W Hu, Zhiyong Gao

    Abstract: Video frame transmission delay is critical in real-time applications such as online video gaming, live show, etc. The receiving deadline of a new frame must catch up with the frame rendering time. Otherwise, the system will buffer a while, and the user will encounter a frozen screen, resulting in unsatisfactory user experiences. An effective approach is to transmit frames in lower-quality under po… ▽ More

    Submitted 17 March, 2021; originally announced March 2021.

  42. arXiv:2103.04530  [pdf, other

    eess.SP cs.LG

    Weather Analogs with a Machine Learning Similarity Metric for Renewable Resource Forecasting

    Authors: Weiming Hu, Guido Cervone, George Young, Luca Delle Monache

    Abstract: The Analog Ensemble (AnEn) technique has been shown effective on several weather problems. Unlike previous weather analogs that are sought within a large spatial domain and an extended temporal window, AnEn strictly confines space and time, and independently generates results at each grid point within a short time window. AnEn can find similar forecasts that lead to accurate and calibrated ensembl… ▽ More

    Submitted 8 March, 2021; v1 submitted 7 March, 2021; originally announced March 2021.

    Comments: 32 pages, 9 figures

  43. arXiv:2102.13400  [pdf, other

    cs.RO cs.CV eess.IV

    Panoramic annular SLAM with loop closure and global optimization

    Authors: Hao Chen, Weijian Hu, Kailun Yang, Jian Bai, Kaiwei Wang

    Abstract: In this paper, we propose panoramic annular simultaneous localization and map** (PA-SLAM), a visual SLAM system based on panoramic annular lens. A hybrid point selection strategy is put forward in the tracking front-end, which ensures repeatability of keypoints and enables loop closure detection based on the bag-of-words approach. Every detected loop candidate is verified geometrically and the… ▽ More

    Submitted 3 June, 2021; v1 submitted 26 February, 2021; originally announced February 2021.

    Comments: Accepted to Applied Optics. 12 pages, 11 figures, 3 tables

  44. arXiv:2101.11442  [pdf

    physics.med-ph cs.LG eess.IV

    Magnetic Resonance Spectroscopy Deep Learning Denoising Using Few In Vivo Data

    Authors: Dicheng Chen, Wanqi Hu, Huiting Liu, Yirong Zhou, Tianyu Qiu, Yihui Huang, Zi Wang, Jiazheng Wang, Liangjie Lin, Zhigang Wu, Hao Chen, Xi Chen, Gen Yan, Di Guo, Jianzhong Lin, Xiaobo Qu

    Abstract: Magnetic Resonance Spectroscopy (MRS) is a noninvasive tool to reveal metabolic information. One challenge of 1H-MRS is the low Signal-Noise Ratio (SNR). To improve the SNR, a typical approach is to perform Signal Averaging (SA) with M repeated samples. The data acquisition time, however, is increased by M times accordingly, and a complete clinical MRS scan takes approximately 10 minutes at a comm… ▽ More

    Submitted 25 October, 2022; v1 submitted 26 January, 2021; originally announced January 2021.

  45. arXiv:2012.12727  [pdf

    eess.SP

    Low Complexity Component Nonlinear Distortions Mitigation Scheme for Probabilistically Shaped 64-QAM Signals

    Authors: Yiwen Wu, Mengfan Fu, Huazhi Lun, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: We propose a degenerated hierarchical look-up table (DH-LUT) scheme to compensate component nonlinearities. For probabilistically shaped 64-QAM signals, it achieves up to 2-dB SNR improvement, while the size of table is only 8.59% compared to the conventional LUT method.

    Submitted 20 December, 2020; originally announced December 2020.

  46. A Data-Fusion-Assisted Telemetry Layer for Autonomous Optical Networks

    Authors: Xiaomin Liu, Huazhi Lun, Ruoxuan Gao, Meng Cai, Lilin Yi, Weisheng Hu, Qunbi Zhuge

    Abstract: For further improving the capacity and reliability of optical networks, a closed-loop autonomous architecture is preferred. Considering a large number of optical components in an optical network and many digital signal processing modules in each optical transceiver, massive real-time data can be collected. However, for a traditional monitoring structure, collecting, storing and processing a large… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

  47. Deep Learning for Radio-based Human Sensing: Recent Advances and Future Directions

    Authors: Isura Nirmal, Abdelwahed Khamis, Mahbub Hassan, Wen Hu, Xiaoqing Zhu

    Abstract: While decade-long research has clearly demonstrated the vast potential of radio frequency (RF) for many human sensing tasks, scaling this technology to large scenarios remained problematic with conventional approaches. Recently, researchers have successfully applied deep learning to take radio-based sensing to a new level. Many different types of deep learning models have been proposed to achieve… ▽ More

    Submitted 7 February, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Journal ref: 23, 2021, 995-1019

  48. arXiv:2009.02752  [pdf, other

    eess.SP cs.HC cs.LG

    Simultaneous Energy Harvesting and Gait Recognition using Piezoelectric Energy Harvester

    Authors: Dong Ma, Guohao Lan, Weitao Xu, Mahbub Hassan, Wen Hu

    Abstract: Piezoelectric energy harvester, which generates electricity from stress or vibrations, is gaining increasing attention as a viable solution to extend battery life in wearables. Recent research further reveals that, besides generating energy, PEH can also serve as a passive sensor to detect human gait power-efficiently because its stress or vibration patterns are significantly influenced by the gai… ▽ More

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: 13 pages, 17 figures, and 2 tables

  49. Mononizing Binocular Videos

    Authors: Wenbo Hu, Menghan Xia, Chi-Wing Fu, Tien-Tsin Wong

    Abstract: This paper presents the idea ofmono-nizingbinocular videos and a frame-work to effectively realize it. Mono-nize means we purposely convert abinocular video into a regular monocular video with the stereo informationimplicitly encoded in a visual but nearly-imperceptible form. Hence, wecan impartially distribute and show the mononized video as an ordinarymonocular video. Unlike ordinary monocular v… ▽ More

    Submitted 2 September, 2020; originally announced September 2020.

    Comments: 16 pages, 17 figures. Accepted in Siggraph Asia 2020

    Journal ref: ACM Transactions on Graphics (SIGGRAPH Asia 2020 issue)

  50. arXiv:2008.05750  [pdf, other

    eess.AS cs.CL cs.SD

    Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition

    Authors: Wenyong Huang, Wenchao Hu, Yu Ting Yeung, Xiao Chen

    Abstract: Transformer has achieved competitive performance against state-of-the-art end-to-end models in automatic speech recognition (ASR), and requires significantly less training time than RNN-based models. The original Transformer, with encoder-decoder architecture, is only suitable for offline ASR. It relies on an attention mechanism to learn alignments, and encodes input audio bidirectionally. The hig… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: Accepted by INTERSPEECH 2020