Skip to main content

Showing 1–50 of 62 results for author: Guo, W

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.15222  [pdf

    eess.IV cs.AI cs.CV

    Rapid and Accurate Diagnosis of Acute Aortic Syndrome using Non-contrast CT: A Large-scale, Retrospective, Multi-center and AI-based Study

    Authors: Yujian Hu, Yilang Xiang, Yan-Jie Zhou, Yangyan He, Shifeng Yang, Xiaolong Du, Chunlan Den, Youyao Xu, Gaofeng Wang, Zhengyao Ding, **gyong Huang, Wenjun Zhao, Xuejun Wu, Donglin Li, Qianqian Zhu, Zhenjiang Li, Chenyang Qiu, Ziheng Wu, Yunjun He, Chen Tian, Yihui Qiu, Zuodong Lin, Xiaolong Zhang, Yuan He, Zhenpeng Yuan , et al. (15 additional authors not shown)

    Abstract: Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed… ▽ More

    Submitted 24 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: under peer review

  2. arXiv:2406.01173  [pdf, other

    eess.SY

    Cascade Network Stability of Synchronized Traffic Load Balancing with Heterogeneous Energy Efficiency Policies

    Authors: Mengbang Zou, Weisi Guo

    Abstract: Cascade stability of load balancing is critical for ensuring high efficiency service delivery and preventing undesirable handovers. In energy efficient networks that employ diverse sleep mode operations, handing over traffic to neighbouring cells' expanded coverage must be done with minimal side effects. Current research is largely concerned with designing distributed and centralized efficient loa… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2406.00320  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching

    Authors: Yongqi Wang, Wenxiang Guo, Rongjie Huang, Jiawei Huang, Zehan Wang, Fuming You, Ruiqi Li, Zhou Zhao

    Abstract: Video-to-audio (V2A) generation aims to synthesize content-matching audio from silent video, and it remains challenging to build V2A models with high generation quality, efficiency, and visual-audio temporal synchrony. We propose Frieren, a V2A model based on rectified flow matching. Frieren regresses the conditional transport vector field from noise to spectrogram latent with straight paths and c… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  4. arXiv:2405.14398  [pdf, other

    cs.HC cs.AI eess.SP

    SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network

    Authors: Weiyu Guo, Ying Sun, Yijie Xu, Ziyue Qiao, Yongkui Yang, Hui Xiong

    Abstract: Surface electromyography (sEMG) based gesture recognition offers a natural and intuitive interaction modality for wearable devices. Despite significant advancements in sEMG-based gesture-recognition models, existing methods often suffer from high computational latency and increased energy consumption. Additionally, the inherent instability of sEMG signals, combined with their sensitivity to distri… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2405.09752  [pdf, other

    eess.SP math.NA math.OC

    Time-Varying Graph Signal Recovery Using High-Order Smoothness and Adaptive Low-rankness

    Authors: Weihong Guo, Yifei Lou, **g Qin, Ming Yan

    Abstract: Time-varying graph signal recovery has been widely used in many applications, including climate change, environmental hazard monitoring, and epidemic studies. It is crucial to choose appropriate regularizations to describe the characteristics of the underlying signals, such as the smoothness of the signal over the graph domain and the low-rank structure of the spatial-temporal signal modeled in a… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  6. arXiv:2405.04867  [pdf, other

    eess.IV cs.CV

    MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results

    Authors: Yaqi Wu, Zhihao Fan, Xiaofeng Chu, Jimmy S. Ren, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangcheng Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Senyan Xu, Zhi**g Sun, Jiaying Zhu, Yurui Zhu, Xueyang Fu, Zheng-Jun Zha, Jun Cao, Cheng Li, Shu Chen, Liang Ma, Shiyang Zhou, Hai** Zeng, Kai Feng , et al. (24 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: MIPI@CVPR2024. Website: https://mipi-challenge.org/MIPI2024/

  7. arXiv:2404.11213  [pdf, other

    eess.SP cs.AI

    Revisiting Noise Resilience Strategies in Gesture Recognition: Short-Term Enhancement in Surface Electromyographic Signal Analysis

    Authors: Weiyu Guo, Ziyue Qiao, Ying Sun, Hui Xiong

    Abstract: Gesture recognition based on surface electromyography (sEMG) has been gaining importance in many 3D Interactive Scenes. However, sEMG is easily influenced by various forms of noise in real-world environments, leading to challenges in providing long-term stable interactions through sEMG. Existing methods often struggle to enhance model noise resilience through various predefined data augmentation t… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  8. arXiv:2404.09131  [pdf, other

    eess.SP

    Design of Artificial Interference Signals for Covert Communication Aided by Multiple Friendly Nodes

    Authors: Xuyang Zhao. Wei Guo, Yongchao Wang

    Abstract: In this paper, we consider a scenario of covert communication aided by multiple friendly interference nodes. The objective is to conceal the legitimate communication link under the surveillance of a warden. The main content is as follows: first, we propose a novel strategy for generating artificial noise signals in the considered covert scenario. Then, we leverage the statistical information of ch… ▽ More

    Submitted 9 May, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

  9. arXiv:2404.02731  [pdf, other

    eess.IV cs.CV cs.MM

    Event Camera Demosaicing via Swin Transformer and Pixel-focus Loss

    Authors: Yunfan Lu, Yijie Xu, Wenzong Ma, Weiyu Guo, Hui Xiong

    Abstract: Recent research has highlighted improvements in high-quality imaging guided by event cameras, with most of these efforts concentrating on the RGB domain. However, these advancements frequently neglect the unique challenges introduced by the inherent flaws in the sensor design of event cameras in the RAW domain. Specifically, this sensor design results in the partial loss of pixel values, posing ne… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted for the CVPR 2024 Workshop on Mobile Intelligent Photography & Imaging

  10. arXiv:2404.00309  [pdf, other

    cs.IT eess.SP

    Model-Driven Deep Learning for Distributed Detection with Binary Quantization

    Authors: Wei Guo, Meng He, Chuan Huang, Hengtao He, Shenghui Song, Jun Zhang, Khaled B. Letaief

    Abstract: Within the realm of rapidly advancing wireless sensor networks (WSNs), distributed detection assumes a significant role in various practical applications. However, critical challenge lies in maintaining robust detection performance while operating within the constraints of limited bandwidth and energy resources. This paper introduces a novel approach that combines model-driven deep learning (DL) w… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  11. arXiv:2402.18527  [pdf, other

    cs.CV cs.LG eess.IV

    Defect Detection in Tire X-Ray Images: Conventional Methods Meet Deep Structures

    Authors: Andrei Cozma, Landon Harris, Hairong Qi, ** Ji, Wenpeng Guo, Song Yuan

    Abstract: This paper introduces a robust approach for automated defect detection in tire X-ray images by harnessing traditional feature extraction methods such as Local Binary Pattern (LBP) and Gray Level Co-Occurrence Matrix (GLCM) features, as well as Fourier and Wavelet-based features, complemented by advanced machine learning techniques. Recognizing the challenges inherent in the complex patterns and te… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 7 pages, 2 figures, 3 tables, submitted to ICIP2024

    ACM Class: I.4.7; I.4.9; I.4.0

  12. arXiv:2311.18539  [pdf, other

    cs.CR eess.SY

    Bridging Both Worlds in Semantics and Time: Domain Knowledge Based Analysis and Correlation of Industrial Process Attacks

    Authors: Moses Ike, Kandy Phan, Anwesh Badapanda, Matthew Landen, Keaton Sadoski, Wanda Guo, Asfahan Shah, Saman Zonouz, Wenke Lee

    Abstract: Modern industrial control systems (ICS) attacks infect supervisory control and data acquisition (SCADA) hosts to stealthily alter industrial processes, causing damage. To detect attacks with low false alarms, recent work detects attacks in both SCADA and process data. Unfortunately, this led to the same problem - disjointed (false) alerts, due to the semantic and time gap in SCADA and process beha… ▽ More

    Submitted 3 December, 2023; v1 submitted 30 November, 2023; originally announced November 2023.

  13. arXiv:2310.06879  [pdf, other

    cs.CV eess.IV

    The Solution for the CVPR2023 NICE Image Captioning Challenge

    Authors: Xiangyu Wu, Yi Gao, Hailiang Zhang, Yang Yang, Weili Guo, Jianfeng Lu

    Abstract: In this paper, we present our solution to the New frontiers for Zero-shot Image Captioning Challenge. Different from the traditional image captioning datasets, this challenge includes a larger new variety of visual concepts from many domains (such as COVID-19) as well as various image types (photographs, illustrations, graphics). For the data level, we collect external training data from Laion-5B,… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  14. arXiv:2309.10935  [pdf, other

    cs.CV eess.IV

    A Geometric Flow Approach for Segmentation of Images with Inhomongeneous Intensity and Missing Boundaries

    Authors: Paramjyoti Mohapatra, Richard Lartey, Weihong Guo, Michael Judkovich, Xiaojuan Li

    Abstract: Image segmentation is a complex mathematical problem, especially for images that contain intensity inhomogeneity and tightly packed objects with missing boundaries in between. For instance, Magnetic Resonance (MR) muscle images often contain both of these issues, making muscle segmentation especially difficult. In this paper we propose a novel intensity correction and a semi-automatic active conto… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Presented at CVIT 2023 Conference. Accepted to Journal of Image and Graphics

  15. arXiv:2309.01112  [pdf

    cs.RO eess.SY

    Swing Leg Motion Strategy for Heavy-load Legged Robot Based on Force Sensing

    Authors: Ze Fu, Yinghui Li, Weizhong Guo

    Abstract: The heavy-load legged robot has strong load carrying capacity and can adapt to various unstructured terrains. But the large weight results in higher requirements for motion stability and environmental perception ability. In order to utilize force sensing information to improve its motion performance, in this paper, we propose a finite state machine model for the swing leg in the static gait by imi… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  16. arXiv:2308.08283  [pdf, other

    eess.IV cs.CV cs.LG

    CARE: A Large Scale CT Image Dataset and Clinical Applicable Benchmark Model for Rectal Cancer Segmentation

    Authors: Hantao Zhang, Weidong Guo, Chenyang Qiu, Shouhong Wan, Bingbing Zou, Wanqin Wang, Peiquan **

    Abstract: Rectal cancer segmentation of CT image plays a crucial role in timely clinical diagnosis, radiotherapy treatment, and follow-up. Although current segmentation methods have shown promise in delineating cancerous tissues, they still encounter challenges in achieving high segmentation precision. These obstacles arise from the intricate anatomical structures of the rectum and the difficulties in perfo… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 8 pages

  17. arXiv:2306.14097  [pdf, other

    eess.IV cs.CV math.NA

    Interpretable Small Training Set Image Segmentation Network Originated from Multi-Grid Variational Model

    Authors: Junying Meng, Weihong Guo, Jun Liu, Mingrui Yang

    Abstract: The main objective of image segmentation is to divide an image into homogeneous regions for further analysis. This is a significant and crucial task in many applications such as medical imaging. Deep learning (DL) methods have been proposed and widely used for image segmentation. However, these methods usually require a large amount of manually segmented data as training data and suffer from poor… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 25 pages, 9 figures, 6 tables

    MSC Class: 94A08; 68U10

  18. arXiv:2306.00303  [pdf, other

    cs.CV eess.IV

    Sea Ice Extraction via Remote Sensed Imagery: Algorithms, Datasets, Applications and Challenges

    Authors: Anzhu Yu, Wenjun Huang, Qing Xu, Qun Sun, Wenyue Guo, Song Ji, Bowei Wen, Chun** Qiu

    Abstract: The deep learning, which is a dominating technique in artificial intelligence, has completely changed the image understanding over the past decade. As a consequence, the sea ice extraction (SIE) problem has reached a new era. We present a comprehensive review of four important aspects of SIE, including algorithms, datasets, applications, and the future trends. Our review focuses on researches publ… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 24 pages, 6 figures

  19. arXiv:2303.01249  [pdf, ps, other

    cs.CL cs.SD eess.AS

    Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition

    Authors: Zhijie Shen, Wu Guo, Bin Gu

    Abstract: In this paper, we propose a language-universal adapter learning framework based on a pre-trained model for end-to-end multilingual automatic speech recognition (ASR). For acoustic modeling, the wav2vec 2.0 pre-trained model is fine-tuned by inserting language-specific and language-universal adapters. An online knowledge distillation is then used to enable the language-universal adapters to learn b… ▽ More

    Submitted 28 February, 2023; originally announced March 2023.

  20. arXiv:2302.12428  [pdf

    eess.SY

    A holistically 3D-printed flexible millimeter-wave Doppler radar: Towards fully printed high-frequency multilayer flexible hybrid electronics systems

    Authors: Hong Tang, Yingjie Zhang, Bowen Zheng, Sensong An, Mohammad Haerinia, Yunxi Dong, Yi Huang, Wei Guo, Hualiang Zhang

    Abstract: Flexible hybrid electronics (FHE) is an emerging technology enabled through the integration of advanced semiconductor devices and 3D printing technology. It unlocks tremendous market potential by realizing low-cost flexible circuits and systems that can be conformally integrated into various applications. However, the operating frequencies of most reported FHE systems are relatively low. It is als… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    MSC Class: 78-05

  21. arXiv:2211.02147  [pdf, other

    eess.SY cs.LG

    A Survey on Reinforcement Learning in Aviation Applications

    Authors: Pouria Razzaghi, Amin Tabrizian, Wei Guo, Shulu Chen, Abenezer Taye, Ellis Thompson, Alexis Bregeon, Ali Baheri, Peng Wei

    Abstract: Compared with model-based control and optimization methods, reinforcement learning (RL) provides a data-driven, learning-based framework to formulate and solve sequential decision-making problems. The RL framework has become promising due to largely improved data availability and computing power in the aviation industry. Many aviation-based applications can be formulated or treated as sequential d… ▽ More

    Submitted 22 November, 2022; v1 submitted 3 November, 2022; originally announced November 2022.

  22. arXiv:2206.10955  [pdf, other

    eess.SP

    Adversarial Reconfigurable Intelligent Surface Against Physical Layer Key Generation

    Authors: Zhuangkun Wei, Bin Li, Weisi Guo

    Abstract: The development of reconfigurable intelligent surfaces (RIS) has recently advanced the research of physical layer security (PLS). Beneficial impacts of RIS include but are not limited to offering a new degree-of-freedom (DoF) for key-less PLS optimization, and increasing channel randomness for physical layer secret key generation (PL-SKG). However, there is a lack of research studying how adversar… ▽ More

    Submitted 6 November, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  23. arXiv:2205.09316  [pdf, other

    cs.IT eess.SP

    Dynamic Clustering and Power Control for Two-Tier Wireless Federated Learning

    Authors: Wei Guo, Chuan Huang, Xiaoqi Qin, Lian Yang, Wei Zhang

    Abstract: Federated learning (FL) has been recognized as a promising distributed learning paradigm to support intelligent applications at the wireless edge, where a global model is trained iteratively through the collaboration of the edge devices without sharing their data. However, due to the relatively large communication cost between the devices and parameter server (PS), direct computing based on the in… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  24. arXiv:2205.09306  [pdf, other

    cs.IT eess.SP

    Joint Device Selection and Power Control for Wireless Federated Learning

    Authors: Wei Guo, Ran Li, Chuan Huang, Xiaoqi Qin, Kaiming Shen, Wei Zhang

    Abstract: This paper studies the joint device selection and power control scheme for wireless federated learning (FL), considering both the downlink and uplink communications between the parameter server (PS) and the terminal devices. In each round of model training, the PS first broadcasts the global model to the terminal devices in an analog fashion, and then the terminal devices perform local training an… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

  25. arXiv:2205.00581  [pdf

    cs.CV cs.LG eess.IV

    Using a novel fractional-order gradient method for CNN back-propagation

    Authors: Mundher Mohammed Taresh, Ningbo Zhu, Talal Ahmed Ali Ali, Mohammed Alghaili, Weihua Guo

    Abstract: Computer-aided diagnosis tools have experienced rapid growth and development in recent years. Among all, deep learning is the most sophisticated and popular tool. In this paper, researchers propose a novel deep learning model and apply it to COVID-19 diagnosis. Our model uses the tool of fractional calculus, which has the potential to improve the performance of gradient methods. To this end, the r… ▽ More

    Submitted 1 May, 2022; originally announced May 2022.

    Comments: 9 pages, 6 figuers

    MSC Class: D.1.2; F.3.1; F.4.1 ACM Class: F.2.2, I.2.7 K.5

  26. arXiv:2201.07210  [pdf

    cs.NE cs.LG eess.SP

    Efficient Training of Spiking Neural Networks with Temporally-Truncated Local Backpropagation through Time

    Authors: Wenzhe Guo, Mohammed E. Fouda, Ahmed M. Eltawil, Khaled Nabil Salama

    Abstract: Directly training spiking neural networks (SNNs) has remained challenging due to complex neural dynamics and intrinsic non-differentiability in firing functions. The well-known backpropagation through time (BPTT) algorithm proposed to train SNNs suffers from large memory footprint and prohibits backward and update unlocking, making it impossible to exploit the potential of locally-supervised train… ▽ More

    Submitted 13 December, 2021; originally announced January 2022.

    Comments: 16

  27. arXiv:2111.00428  [pdf, other

    eess.SP

    Reconfigurable Intelligent Surface-induced Randomness for mmWave Key Generation

    Authors: Shubo Yang, Han Han, Yihong Liu, Weisi Guo, Zhibo Pang, Lei Zhang

    Abstract: Secret key generation in physical layer security exploits the unpredictable random nature of wireless channels. The millimeter-wave (mmWave) channels have limited multipath and channel randomness in static environments. In this paper, for mmWave secret key generation of physical layer security, we use a reconfigurable intelligent surface (RIS) to induce randomness directly in wireless environments… ▽ More

    Submitted 8 August, 2022; v1 submitted 31 October, 2021; originally announced November 2021.

    Comments: Add contents, including continuous group phase shifts and secret key rate analysis

  28. arXiv:2110.12785  [pdf, other

    cs.IT eess.SP

    Random Matrix based Physical Layer Secret Key Generation in Static Channels

    Authors: Zhuangkun Wei, Weisi Guo

    Abstract: Physical layer secret key generation exploits the reciprocal channel randomness for key generation and has proven to be an effective addition security layer in wireless communications. However, static or scarcely random channels require artificially induced dynamics to improve the secrecy performance, e.g., using intelligent reflecting surface (IRS). One key challenge is that the induced random ph… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

  29. arXiv:2110.10435  [pdf, other

    eess.SP

    RSS-based Multiple Sources Localization with Unknown Log-normal Shadow Fading

    Authors: Yueyan Chu, Wenbin Guo, Kangyong You, Lei Zhao, Tao Peng, Wenbo Wang

    Abstract: Multi-source localization based on received signal strength (RSS) has drawn great interest in wireless sensor networks. However, the shadow fading term caused by obstacles cannot be separated from the received signal, which leads to severe error in location estimate. In this paper, we approximate the log-normal sum distribution through Fenton-Wilkinson method to formulate a non-convex maximum like… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: 11 pages, 10 figures. arXiv admin note: substantial text overlap with arXiv:2105.15097

  30. arXiv:2106.08637  [pdf

    cs.CL cs.SD eess.AS

    Topic Classification on Spoken Documents Using Deep Acoustic and Linguistic Features

    Authors: Tan Liu, Wu Guo, Bin Gu

    Abstract: Topic classification systems on spoken documents usually consist of two modules: an automatic speech recognition (ASR) module to convert speech into text and a text topic classification (TTC) module to predict the topic class from the decoded text. In this paper, instead of using the ASR transcripts, the fusion of deep acoustic and linguistic features is used for topic classification on spoken doc… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

  31. arXiv:2105.15097  [pdf, other

    cs.NI eess.SP

    Multiple Sources Localization with Sparse Recovery under Log-normal Shadow Fading

    Authors: Yueyan Chu, Kangyong You, Wenbin Guo

    Abstract: Localization based on received signal strength (RSS) has drawn great interest in the wireless sensor network (WSN). In this paper, we investigate the RSS-based multi-sources localization problem with unknown transmitted power under shadow fading. The log-normal shadowing effect is approximated through Fenton-Wilkinson (F-W) method and maximum likelihood estimation is adopted to optimize the RSS-ba… ▽ More

    Submitted 31 March, 2021; originally announced May 2021.

  32. arXiv:2104.00230  [pdf, other

    eess.AS

    Bidirectional Multiscale Feature Aggregation for Speaker Verification

    Authors: Jiajun Qi, Wu Guo, Bin Gu

    Abstract: In this paper, we propose a novel bidirectional multiscale feature aggregation (BMFA) network with attentional fusion modules for text-independent speaker verification. The feature maps from different stages of the backbone network are iteratively combined and refined in both a bottom-up and top-down manner. Furthermore, instead of simple concatenation or element-wise addition of feature maps from… ▽ More

    Submitted 31 March, 2021; originally announced April 2021.

  33. arXiv:2103.15421  [pdf, other

    eess.AS cs.SD

    Improved Meta-Learning Training for Speaker Verification

    Authors: Yafeng Chen, Wu Guo, Bin Gu

    Abstract: Meta-learning has recently become a research hotspot in speaker verification (SV). We introduce two methods to improve the meta-learning training for SV in this paper. For the first method, a backbone embedding network is first jointly trained with the conventional cross entropy loss and prototypical networks (PN) loss. Then, inspired by speaker adaptive training in speech recognition, additional… ▽ More

    Submitted 2 August, 2023; v1 submitted 29 March, 2021; originally announced March 2021.

  34. arXiv:2011.12722  [pdf, other

    cs.CV cs.LG eess.IV

    Attention Aware Cost Volume Pyramid Based Multi-view Stereo Network for 3D Reconstruction

    Authors: Anzhu Yu, Wenyue Guo, Bing Liu, Xin Chen, Xin Wang, Xuefeng Cao, Bingchuan Jiang

    Abstract: We present an efficient multi-view stereo (MVS) network for 3D reconstruction from multiview images. While previous learning based reconstruction approaches performed quite well, most of them estimate depth maps at a fixed resolution using plane sweep volumes with a fixed depth hypothesis at each plane, which requires densely sampled planes for desired accuracy and therefore is difficult to achiev… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

  35. arXiv:2010.10919   

    eess.AS cs.SD

    Multi-task Metric Learning for Text-independent Speaker Verification

    Authors: Yafeng Chen, Wu Guo, **g**g Shi, Jiajun Qi, Tan Liu

    Abstract: In this work, we introduce metric learning (ML) to enhance the deep embedding learning for text-independent speaker verification (SV). Specifically, the deep speaker embedding network is trained with conventional cross entropy loss and auxiliary pair-based ML loss function. For the auxiliary ML task, training samples of a mini-batch are first arranged into pairs, then positive and negative pairs a… ▽ More

    Submitted 22 March, 2023; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Not a particularly high-quality work, so we request withdrawal

  36. arXiv:2010.10163  [pdf, other

    eess.IV cs.CV cs.LG

    Claw U-Net: A Unet-based Network with Deep Feature Concatenation for Scleral Blood Vessel Segmentation

    Authors: Chang Yao, **gyu Tang, Menghan Hu, Yue Wu, Wenyi Guo, Qingli Li, Xiao-** Zhang

    Abstract: Sturge-Weber syndrome (SWS) is a vascular malformation disease, and it may cause blindness if the patient's condition is severe. Clinical results show that SWS can be divided into two types based on the characteristics of scleral blood vessels. Therefore, how to accurately segment scleral blood vessels has become a significant problem in computer-aided diagnosis. In this research, we propose to co… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Comments: 5 pages,4 figures

  37. arXiv:2010.06248   

    eess.AS

    Exploring Universal Speech Attributes for Speaker Verification with an Improved Cross-stitch Network

    Authors: Jiajun Qi, Wu Guo, **g**g Shi, Yafeng Chen, Tan Liu

    Abstract: The universal speech attributes for x-vector based speaker verification (SV) are addressed in this paper. The manner and place of articulation form the fundamental speech attribute unit (SAU), and then new speech attribute (NSA) units for acoustic modeling are generated by tied tri-SAU states. An improved cross-stitch network is adopted as a multitask learning (MTL) framework for integrating these… ▽ More

    Submitted 31 May, 2023; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: Not a particularly high-quality work, so we request withdrawal

  38. arXiv:2007.13290  [pdf, ps, other

    eess.SP cs.LG eess.IV eess.SY

    Deep Learning Methods for Solving Linear Inverse Problems: Research Directions and Paradigms

    Authors: Yanna Bai, Wei Chen, Jie Chen, Weisi Guo

    Abstract: The linear inverse problem is fundamental to the development of various scientific areas. Innumerable attempts have been carried out to solve different variants of the linear inverse problem in different applications. Nowadays, the rapid development of deep learning provides a fresh perspective for solving the linear inverse problem, which has various well-designed network architectures results in… ▽ More

    Submitted 11 August, 2020; v1 submitted 26 July, 2020; originally announced July 2020.

    Comments: 60 pages; Publication in (Elsevier) Signal Processing, 2020

    Journal ref: Signal Processing, 2020

  39. HDR-GAN: HDR Image Reconstruction from Multi-Exposed LDR Images with Large Motions

    Authors: Yuzhen Niu, Jianbin Wu, Wenxi Liu, Wenzhong Guo, Rynson W. H. Lau

    Abstract: Synthesizing high dynamic range (HDR) images from multiple low-dynamic range (LDR) exposures in dynamic scenes is challenging. There are two major problems caused by the large motions of foreground objects. One is the severe misalignment among the LDR images. The other is the missing content due to the over-/under-saturated regions caused by the moving objects, which may not be easily compensated… ▽ More

    Submitted 3 July, 2020; originally announced July 2020.

  40. arXiv:2006.03568  [pdf, other

    eess.SP cs.CR

    Graph Layer Security: Encrypting Information via Common Networked Physics

    Authors: Zhuangkun Wei, Liang Wang, Schyler Chengyao Sun, Bin Li, Weisi Guo

    Abstract: The proliferation of low-cost Internet of Things (IoT) devices has led to a race between wireless security and channel attacks. Traditional cryptography requires high-computational power and is not suitable for low-power IoT scenarios. Whist, recently developed physical layer security (PLS) can exploit common wireless channel state information (CSI), its sensitivity to channel estimation makes the… ▽ More

    Submitted 23 May, 2022; v1 submitted 5 June, 2020; originally announced June 2020.

  41. Uncertainty of Resilience in Complex Networks with Nonlinear Dynamics

    Authors: Giannis Moutsinas, Mengbang Zou, Weisi Guo

    Abstract: Resilience is a system's ability to maintain its function when perturbations and errors occur. Whilst we understand low-dimensional networked systems' behavior well, our understanding of systems consisting of a large number of components is limited. Recent research in predicting the network level resilience pattern has advanced our understanding of the coupling relationship between global network… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: 8pages, 7figures

  42. Sampling and Inference of Networked Dynamics using Log-Koopman Nonlinear Graph Fourier Transform

    Authors: Zhuangkun Wei, Bin Li, Chengyao Sun, Weisi Guo

    Abstract: Networked nonlinear dynamics underpin the complex functionality of many engineering, social, biological, and ecological systems. Monitoring the networked dynamics via the minimum subset of nodes is essential for a variety of scientific and operational purposes. When there is a lack of a explicit model and networked signal space, traditional evolution analysis and non-convex methods are insufficien… ▽ More

    Submitted 16 October, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  43. arXiv:2002.06049  [pdf

    eess.AS eess.SP

    An Adaptive X-vector Model for Text-independent Speaker Verification

    Authors: Bin Gu, Wu Guo, Lirong Dai, Jun Du

    Abstract: In this paper, adaptive mechanisms are applied in deep neural network (DNN) training for x-vector-based text-independent speaker verification. First, adaptive convolutional neural networks (ACNNs) are employed in frame-level embedding layers, where the parameters of the convolution filters are adjusted based on the input features. Compared with conventional CNNs, ACNNs have more flexibility in cap… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: 6 pages, 3 figures

  44. arXiv:2002.05508  [pdf, other

    cs.LG eess.SP physics.soc-ph stat.ML

    Neural Network Approximation of Graph Fourier Transforms for Sparse Sampling of Networked Flow Dynamics

    Authors: Alessio Pagani, Zhuangkun Wei, Ricardo Silva, Weisi Guo

    Abstract: Infrastructure monitoring is critical for safe operations and sustainability. Water distribution networks (WDNs) are large-scale networked critical systems with complex cascade dynamics which are difficult to predict. Ubiquitous monitoring is expensive and a key challenge is to infer the contaminant dynamics from partial sparse monitoring data. Existing approaches use multi-objective optimisation… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

  45. arXiv:2001.06790  [pdf

    eess.IV

    High-speed and high-efficiency three-dimensional shape measurement based on Gray-coded light

    Authors: Zhoujie Wu, Wenbo Guo, Yueyang Li, Yihang Liu, Qican Zhang

    Abstract: Fringe projection profilometry has been increasingly sought and applied in dynamic three-dimensional (3D) shape measurement. In this work, a robust and high-efficiency 3D measurement based on Gray-code light is proposed. Unlike the traditional method, a novel tripartite phase unwrap** method is proposed to avoid the jump errors on the boundary of code words, which are mainly caused by the defocu… ▽ More

    Submitted 19 January, 2020; originally announced January 2020.

  46. arXiv:2001.04585  [pdf

    eess.AS cs.SD eess.SP

    Gaussian speaker embedding learning for text-independent speaker verification

    Authors: Bin Gu, Wu Guo

    Abstract: The x-vector maps segments of arbitrary duration to vectors of fixed dimension using deep neural network. Combined with the probabilistic linear discriminant analysis (PLDA) backend, the x-vector/PLDA has become the dominant framework in text-independent speaker verification. Nevertheless, how to extract the x-vector appropriate for the PLDA backend is a key problem. In this paper, we propose a Ga… ▽ More

    Submitted 13 January, 2020; originally announced January 2020.

    Comments: 5 pages, 3 figures

  47. arXiv:2001.04584  [pdf

    eess.AS cs.SD eess.SP

    An Improved Deep Neural Network for Modeling Speaker Characteristics at Different Temporal Scales

    Authors: Bin Gu, Wu Guo

    Abstract: This paper presents an improved deep embedding learning method based on convolutional neural network (CNN) for text-independent speaker verification. Two improvements are proposed for x-vector embedding learning: (1) Multi-scale convolution (MSCNN) is adopted in frame-level layers to capture complementary speaker information in different receptive fields. (2) A Baum-Welch statistics attention (BWS… ▽ More

    Submitted 13 January, 2020; originally announced January 2020.

    Comments: 5 pages,2 figures

  48. arXiv:2001.00129  [pdf

    eess.AS cs.SD

    Attentive batch normalization for lstm-based acoustic modeling of speech recognition

    Authors: Fenglin Ding, Wu Guo, Lirong Dai, Jun Du

    Abstract: Batch normalization (BN) is an effective method to accelerate model training and improve the generalization performance of neural networks. In this paper, we propose an improved batch normalization technique called attentive batch normalization (ABN) in Long Short Term Memory (LSTM) based acoustic modeling for automatic speech recognition (ASR). In the proposed method, an auxiliary network is used… ▽ More

    Submitted 31 December, 2019; originally announced January 2020.

    Comments: 5 pages,1 figure, submitted to ICASSP 2020

  49. arXiv:1912.13307  [pdf

    eess.AS

    Attention-based gated scaling adaptative acoustic model for ctc-based speech recognition

    Authors: Fenglin Ding, Wu Guo, Lirong Dai, Jun Du

    Abstract: In this paper, we propose a novel adaptive technique that uses an attention-based gated scaling (AGS) scheme to improve deep feature learning for connectionist temporal classification (CTC) acoustic modeling. In AGS, the outputs of each hidden layer of the main network are scaled by an auxiliary gate matrix extracted from the lower layer by using attention mechanisms. Furthermore, the auxiliary AG… ▽ More

    Submitted 31 December, 2019; originally announced December 2019.

    Comments: 5 pages,2 figures, submitted to ICASSP 2020

  50. Parametric Sparse Bayesian Dictionary Learning for Multiple Sources Localization with Propagation Parameters Uncertainty and Nonuniform Noise

    Authors: Kangyong You, Wenbin Guo, Tao Peng, Yueliang Liu, Peiliang Zuo, Wenbo Wang

    Abstract: Received signal strength (RSS) based source localization method is popular due to its simplicity and low cost. However, this method is highly dependent on the propagation model which is not easy to be captured in practice. Moreover, most existing works only consider the single source and the identical measurement noise scenario, while in practice multiple co-channel sources may transmit simultaneo… ▽ More

    Submitted 22 December, 2019; v1 submitted 18 November, 2019; originally announced November 2019.

    Comments: 12 pages, 9 figures