Skip to main content

Showing 1–1 of 1 results for author: Basoglu, C

Searching in archive eess. Search in all archives.
.
  1. arXiv:2210.14446  [pdf, other

    cs.CL cs.SD eess.AS

    Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead

    Authors: Piyush Behre, Naveen Parihar, Sharman Tan, Amy Shah, Eva Sharma, Geoffrey Liu, Shuangyu Chang, Hosam Khalil, Chris Basoglu, Sayan Pathak

    Abstract: Segmentation for continuous Automatic Speech Recognition (ASR) has traditionally used silence timeouts or voice activity detectors (VADs), which are both limited to acoustic features. This segmentation is often overly aggressive, given that people naturally pause to think as they speak. Consequently, segmentation happens mid-sentence, hindering both punctuation and downstream tasks like machine tr… ▽ More

    Submitted 27 October, 2022; v1 submitted 25 October, 2022; originally announced October 2022.