Skip to main content

Showing 1–4 of 4 results for author: Schatz, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2101.11332  [pdf, other

    cs.CL

    A phonetic model of non-native spoken word processing

    Authors: Yevgen Matusevych, Herman Kamper, Thomas Schatz, Naomi H. Feldman, Sharon Goldwater

    Abstract: Non-native speakers show difficulties with spoken word processing. Many studies attribute these difficulties to imprecise phonological encoding of words in the lexical memory. We test an alternative hypothesis: that some of these difficulties can arise from the non-native speakers' phonetic perception. We train a computational model of phonetic learning, which has no access to phonology, on either… ▽ More

    Submitted 11 March, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted for publication in Proceedings of EACL-2021. 11 pages, 5 figures, 2 tables

  2. arXiv:2008.02888  [pdf, other

    cs.CL cs.SD eess.AS

    Evaluating computational models of infant phonetic learning across languages

    Authors: Yevgen Matusevych, Thomas Schatz, Herman Kamper, Naomi H. Feldman, Sharon Goldwater

    Abstract: In the first year of life, infants' speech perception becomes attuned to the sounds of their native language. Many accounts of this early phonetic learning exist, but computational models predicting the attunement patterns observed in infants from the speech input they hear have been lacking. A recent study presented the first such model, drawing on algorithms proposed for unsupervised learning fr… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: 7 pages, 1 figure

    Journal ref: 2020. In S. Denison, M. Mack, Y. Xu, and B. Armstrong (Eds.), Proceedings of the 42nd Annual Conference of the Cognitive Science Society (pp. 571-577). Austin, TX: Cognitive Science Society

  3. arXiv:1804.11297  [pdf, other

    cs.CL cs.LG

    Sampling strategies in Siamese Networks for unsupervised speech representation learning

    Authors: Rachid Riad, Corentin Dancette, Julien Karadayi, Neil Zeghidour, Thomas Schatz, Emmanuel Dupoux

    Abstract: Recent studies have investigated siamese network architectures for learning invariant speech representations using same-different side information at the word level. Here we investigate systematically an often ignored component of siamese networks: the sampling procedure (how pairs of same vs. different tokens are selected). We show that sampling strategies taking into account Zipf's Law, the dist… ▽ More

    Submitted 23 August, 2018; v1 submitted 30 April, 2018; originally announced April 2018.

    Comments: Conference paper at Interspeech 2018

  4. arXiv:1711.01161  [pdf, other

    cs.CL

    Learning Filterbanks from Raw Speech for Phone Recognition

    Authors: Neil Zeghidour, Nicolas Usunier, Iasonas Kokkinos, Thomas Schatz, Gabriel Synnaeve, Emmanuel Dupoux

    Abstract: We train a bank of complex filters that operates on the raw waveform and is fed into a convolutional neural network for end-to-end phone recognition. These time-domain filterbanks (TD-filterbanks) are initialized as an approximation of mel-filterbanks, and then fine-tuned jointly with the remaining convolutional architecture. We perform phone recognition experiments on TIMIT and show that for seve… ▽ More

    Submitted 4 April, 2018; v1 submitted 3 November, 2017; originally announced November 2017.

    Comments: Accepted at ICASSP 2018