Skip to main content

Showing 1–4 of 4 results for author: Fuchs, T S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2304.00993  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Unsupervised Word Segmentation Using Temporal Gradient Pseudo-Labels

    Authors: Tzeviya Sylvia Fuchs, Yedid Hoshen

    Abstract: Unsupervised word segmentation in audio utterances is challenging as, in speech, there is typically no gap between words. In a preliminary experiment, we show that recent deep self-supervised features are very effective for word segmentation but require supervision for training the classification head. To extend their effectiveness to unsupervised word segmentation, we propose a pseudo-labeling st… ▽ More

    Submitted 30 March, 2023; originally announced April 2023.

    Comments: ICASSP 2023

  2. arXiv:2204.13094  [pdf, other

    cs.SD eess.AS

    Unsupervised Word Segmentation using K Nearest Neighbors

    Authors: Tzeviya Sylvia Fuchs, Yedid Hoshen, Joseph Keshet

    Abstract: In this paper, we propose an unsupervised kNN-based approach for word segmentation in speech utterances. Our method relies on self-supervised pre-trained speech representations, and compares each audio segment of a given utterance to its K nearest neighbors within the training set. Our main assumption is that a segment containing more than one word would occur less often than a segment containing… ▽ More

    Submitted 27 April, 2022; originally announced April 2022.

    Comments: Submitted to interspeech 2022

  3. arXiv:2103.05468  [pdf, other

    eess.AS cs.LG cs.SD

    CNN-based Spoken Term Detection and Localization without Dynamic Programming

    Authors: Tzeviya Sylvia Fuchs, Yael Segal, Joseph Keshet

    Abstract: In this paper, we propose a spoken term detection algorithm for simultaneous prediction and localization of in-vocabulary and out-of-vocabulary terms within an audio segment. The proposed algorithm infers whether a term was uttered within a given speech signal or not by predicting the word embeddings of various parts of the speech signal and comparing them to the word embedding of the desired term… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

    Journal ref: ICASSP 2021

  4. arXiv:1904.07704  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    SpeechYOLO: Detection and Localization of Speech Objects

    Authors: Yael Segal, Tzeviya Sylvia Fuchs, Joseph Keshet

    Abstract: In this paper, we propose to apply object detection methods from the vision domain on the speech recognition domain, by treating audio fragments as objects. More specifically, we present SpeechYOLO, which is inspired by the YOLO algorithm for object detection in images. The goal of SpeechYOLO is to localize boundaries of utterances within the input signal, and to correctly classify them. Our syste… ▽ More

    Submitted 30 June, 2019; v1 submitted 14 April, 2019; originally announced April 2019.

    Journal ref: Interspeech 2019, pp. 4210-4214