Skip to main content

Showing 1–6 of 6 results for author: Mei, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.01605  [pdf, other

    eess.IV cs.CV

    An Enhanced Encoder-Decoder Network Architecture for Reducing Information Loss in Image Semantic Segmentation

    Authors: Zijun Gao, Qi Wang, Taiyuan Mei, Xiaohan Cheng, Yun Zi, Haowei Yang

    Abstract: The traditional SegNet architecture commonly encounters significant information loss during the sampling process, which detrimentally affects its accuracy in image semantic segmentation tasks. To counter this challenge, we introduce an innovative encoder-decoder network structure enhanced with residual connections. Our approach employs a multi-residual connection strategy designed to preserve the… ▽ More

    Submitted 26 May, 2024; originally announced June 2024.

  2. Visual-Aware Text-to-Speech

    Authors: Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei

    Abstract: Dynamically synthesizing talking speech that actively responds to a listening head is critical during the face-to-face interaction. For example, the speaker could take advantage of the listener's facial expression to adjust the tones, stressed syllables, or pauses. In this work, we present a new visual-aware text-to-speech (VA-TTS) task to synthesize speech conditioned on both textual inputs and s… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: accepted as oral and top 3% paper by ICASSP 2023

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023, 1-5

  3. arXiv:2203.02291  [pdf, other

    cs.CV cs.SD eess.AS

    Freeform Body Motion Generation from Speech

    Authors: **g Xu, Wei Zhang, Yalong Bai, Qibin Sun, Tao Mei

    Abstract: People naturally conduct spontaneous body motions to enhance their speeches while giving talks. Body motion generation from speech is inherently difficult due to the non-deterministic map** from speech to body motions. Most existing works map speech to motion in a deterministic way by conditioning on certain styles, leading to sub-optimal results. Motivated by studies in linguistics, we decompos… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

  4. arXiv:2008.08528  [pdf, other

    cs.CV cs.LG eess.IV

    Black Re-ID: A Head-shoulder Descriptor for the Challenging Problem of Person Re-Identification

    Authors: Boqiang Xu, Lingxiao He, Xingyu Liao, Wu Liu, Zhenan Sun, Tao Mei

    Abstract: Person re-identification (Re-ID) aims at retrieving an input person image from a set of images captured by multiple cameras. Although recent Re-ID methods have made great success, most of them extract features in terms of the attributes of clothing (e.g., color, texture). However, it is common for people to wear black clothes or be captured by surveillance systems in low light illumination, in whi… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.

  5. arXiv:1908.05858  [pdf, other

    cs.CV cs.MM eess.IV

    daBNN: A Super Fast Inference Framework for Binary Neural Networks on ARM devices

    Authors: Jianhao Zhang, Yingwei Pan, Ting Yao, He Zhao, Tao Mei

    Abstract: It is always well believed that Binary Neural Networks (BNNs) could drastically accelerate the inference efficiency by replacing the arithmetic operations in float-valued Deep Neural Networks (DNNs) with bit-wise operations. Nevertheless, there has not been open-source implementation in support of this idea on low-end ARM devices (e.g., mobile phones and embedded devices). In this work, we propose… ▽ More

    Submitted 16 August, 2019; originally announced August 2019.

    Comments: Accepted by 2019 ACMMM Open Source Software Competition. Source code: https://github.com/JDAI-CV/dabnn

  6. Gaussian Random Number Generator: Implemented in FPGA for Quantum Key Distribution

    Authors: Yue Hu, Yan Wu, Yi Chen, G. C. Wan, S. T. Mei

    Abstract: Quantum Key Distribution is the process of using quantum communication to establish a shared key between two parties. It has been demonstrated the unconditional security and effective communication of quantum communication system can be guaranteed by an excellent Gaussian random number generator with high speed and an extended random period. In this paper, we propose to construct the Gaussian rand… ▽ More

    Submitted 9 January, 2019; v1 submitted 20 February, 2018; originally announced February 2018.

    Comments: 18 pages, 6 figures, accepted for publication in the International Journal of Numerical Modeling: Electronic Networks, Devices and Fields