Skip to main content

Showing 1–3 of 3 results for author: Galvez, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.06220  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    Label-Loo**: Highly Efficient Decoding for Transducers

    Authors: Vladimir Bataev, Hainan Xu, Daniel Galvez, Vitaly Lavrukhin, Boris Ginsburg

    Abstract: This paper introduces a highly efficient greedy decoding algorithm for Transducer inference. We propose a novel data structure using CUDA tensors to represent partial hypotheses in a batch that supports parallelized hypothesis manipulations. During decoding, our algorithm maximizes GPU parallelism by adopting a nested-loop design, where the inner loop consumes all blank predictions, while non-blan… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2311.04996  [pdf, other

    eess.AS cs.LG

    GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition

    Authors: Daniel Galvez, Tim Kaldewey

    Abstract: While Connectionist Temporal Classification (CTC) models deliver state-of-the-art accuracy in automated speech recognition (ASR) pipelines, their performance has been limited by CPU-based beam search decoding. We introduce a GPU-accelerated Weighted Finite State Transducer (WFST) beam search decoder compatible with current CTC models. It increases pipeline throughput and decreases latency, support… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  3. arXiv:1711.08058  [pdf, other

    cs.LG cs.CL cs.SD eess.AS

    Multiple-Instance, Cascaded Classification for Keyword Spotting in Narrow-Band Audio

    Authors: Ahmad AbdulKader, Kareem Nassar, Mohamed Mahmoud, Daniel Galvez, Chetan Patil

    Abstract: We propose using cascaded classifiers for a keyword spotting (KWS) task on narrow-band (NB), 8kHz audio acquired in non-IID environments --- a more challenging task than most state-of-the-art KWS systems face. We present a model that incorporates Deep Neural Networks (DNNs), cascading, multiple-feature representations, and multiple-instance learning. The cascaded classifiers handle the task's clas… ▽ More

    Submitted 21 November, 2017; originally announced November 2017.

    Comments: To be published in the proceedings of NIPS 2017