Skip to main content

Showing 1–6 of 6 results for author: Zhan, Q

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.18549  [pdf

    eess.IV cs.CV

    Advancements in Feature Extraction Recognition of Medical Imaging Systems Through Deep Learning Technique

    Authors: Qishi Zhan, Dan Sun, Erdi Gao, Yuhan Ma, Yaxin Liang, Haowei Yang

    Abstract: This study introduces a novel unsupervised medical image feature extraction method that employs spatial stratification techniques. An objective function based on weight is proposed to achieve the purpose of fast image recognition. The algorithm divides the pixels of the image into multiple subdomains and uses a quadtree to access the image. A technique for threshold optimization utilizing a simple… ▽ More

    Submitted 23 May, 2024; originally announced June 2024.

    Comments: conference

  2. arXiv:2212.07656  [pdf, other

    eess.SY

    Hybrid stability augmentation control of multi-rotor UAV in confined space based on adaptive backstep** control

    Authors: QuanXi Zhan, JunRui Zhang, ChenYang Sun, RunJie Shen, Bin He

    Abstract: This paper applies the UAV to the inspection of water diversion pipelines in hydropower stations. The diversion pipeline is an enclosed space, so the airflow disturbance caused by the rotation of the UAV blades and the strong air convection from the chimney effect have a great impact on the flight control of the UAV. Although the traditional linear control PID flight control algorithm has been wid… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: 7 pages

  3. arXiv:2203.16822  [pdf, other

    eess.AS cs.CL cs.LG

    How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications

    Authors: Juan Zuluaga-Gomez, Amrutha Prasad, Iuliia Nigmatulina, Saeed Sarfjoo, Petr Motlicek, Matthias Kleinert, Hartmut Helmke, Oliver Ohneiser, Qingran Zhan

    Abstract: Recent work on self-supervised pre-training focus on leveraging large-scale unlabeled speech data to build robust end-to-end (E2E) acoustic models (AM) that can be later fine-tuned on downstream tasks e.g., automatic speech recognition (ASR). Yet, few works investigated the impact on performance when the data properties substantially differ between the pre-training and fine-tuning phases, termed d… ▽ More

    Submitted 17 October, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: To be published in the 2022 IEEE Spoken Language Technology Workshop (SLT) (SLT 2022)

  4. arXiv:2203.16025  [pdf

    eess.SP eess.AS

    Multiple Narrow-band signals Direction Finding with TMLA by Nonuniform Period Modulation

    Authors: Kebin Liu, Lening Zhang, Qingkui Zhan, Chong He

    Abstract: A new array signal reconstruction and signal-channel DOA estimation method based on TMLA by nonuniform period modulation are proposed. By using non-uniform period modulation, the harmonic component produced by different elements could be separated. Therefore, the conventional snapshot could be reconstructed by analyzing the spectrum of the combined signal. Then spatial spectrum estimation method i… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

  5. arXiv:2010.07726  [pdf, other

    eess.IV cs.CV

    LiteDepthwiseNet: An Extreme Lightweight Network for Hyperspectral Image Classification

    Authors: Benlei Cui, XueMei Dong, Qiaoqiao Zhan, Jiangtao Peng, Weiwei Sun

    Abstract: Deep learning methods have shown considerable potential for hyperspectral image (HSI) classification, which can achieve high accuracy compared with traditional methods. However, they often need a large number of training samples and have a lot of parameters and high computational overhead. To solve these problems, this paper proposes a new network architecture, LiteDepthwiseNet, for HSI classifica… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

  6. arXiv:2006.10304  [pdf, ps, other

    cs.CL cs.CV cs.LG cs.SD eess.AS

    Automatic Speech Recognition Benchmark for Air-Traffic Communications

    Authors: Juan Zuluaga-Gomez, Petr Motlicek, Qingran Zhan, Karel Vesely, Rudolf Braun

    Abstract: Advances in Automatic Speech Recognition (ASR) over the last decade opened new areas of speech-based automation such as in Air-Traffic Control (ATC) environment. Currently, voice communication and data links communications are the only way of contact between pilots and Air-Traffic Controllers (ATCo), where the former is the most widely used and the latter is a non-spoken method mandatory for ocean… ▽ More

    Submitted 13 August, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Accepted to: 21st INTERSPEECH conference (Shanghai, October 25-29)