Showing 1–1 of 1 results for author: Nash, D

  1. arXiv:2103.14583  [pdf, other]

    cs.CL cs.SD eess.AS

    Leveraging pre-trained representations to improve access to untranscribed speech from endangered languages

    Authors: Nay San, Martijn Bartelds, Mitchell Browne, Lily Clifford, Fiona Gibson, John Mansfield, David Nash, Jane Simpson, Myfany Turpin, Maria Vollmer, Sasha Wilmoth, Dan Jurafsky

    Abstract: Pre-trained speech representations like wav2vec 2.0 are a powerful tool for automatic speech recognition (ASR). Yet many endangered languages lack sufficient data for pre-training such models, or are predominantly oral vernaculars without a standardised writing system, precluding fine-tuning. Query-by-example spoken term detection (QbE-STD) offers an alternative for iteratively indexing untranscri…

    Submitted 13 September, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted at ASRU 2021