Skip to main content

Showing 1–1 of 1 results for author: Amirante, A

.
  1. arXiv:2405.03484  [pdf, other

    cs.SD cs.LG eess.AS

    Whispy: Adapting STT Whisper Models to Real-Time Environments

    Authors: Antonio Bevilacqua, Paolo Saviano, Alessandro Amirante, Simon Pietro Romano

    Abstract: Large general-purpose transformer models have recently become the mainstay in the realm of speech analysis. In particular, Whisper achieves state-of-the-art results in relevant tasks such as speech recognition, translation, language identification, and voice activity detection. However, Whisper models are not designed to be used in real-time conditions, and this limitation makes them unsuitable fo… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.