Skip to main content

Showing 1–2 of 2 results for author: McCree, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2104.02469  [pdf, other

    eess.AS cs.LG eess.SP

    Speaker Diarization using Two-pass Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings

    Authors: Kiran Karra, Alan McCree

    Abstract: Many modern systems for speaker diarization, such as the recently-developed VBx approach, rely on clustering of DNN speaker embeddings followed by resegmentation. Two problems with this approach are that the DNN is not directly optimized for this task, and the parameters need significant retuning for different applications. We have recently presented progress in this direction with a Leave-One-Out… ▽ More

    Submitted 14 June, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: 5 pages, 2 figures, accepted at INTERSPEECH 2021

  2. arXiv:2008.03616  [pdf, ps, other

    eess.AS cs.LG eess.SP

    Variable frame rate-based data augmentation to handle speaking-style variability for automatic speaker verification

    Authors: Amber Afshan, **xi Guo, Soo ** Park, Vijay Ravi, Alan McCree, Abeer Alwan

    Abstract: The effects of speaking-style variability on automatic speaker verification were investigated using the UCLA Speaker Variability database which comprises multiple speaking styles per speaker. An x-vector/PLDA (probabilistic linear discriminant analysis) system was trained with the SRE and Switchboard databases with standard augmentation techniques and evaluated with utterances from the UCLA databa… ▽ More

    Submitted 8 August, 2020; originally announced August 2020.

    Comments: Accepted to Interspeech 2020