Skip to main content

Showing 1–3 of 3 results for author: Carmantini, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.00898  [pdf, other

    cs.SD cs.CL eess.AS

    Phonetic Error Analysis of Raw Waveform Acoustic Models with Parametric and Non-Parametric CNNs

    Authors: Erfan Loweimi, Andrea Carmantini, Peter Bell, Steve Renals, Zoran Cvetkovic

    Abstract: In this paper, we analyse the error patterns of the raw waveform acoustic models in TIMIT's phone recognition task. Our analysis goes beyond the conventional phone error rate (PER) metric. We categorise the phones into three groups: {affricate, diphthong, fricative, nasal, plosive, semi-vowel, vowel, silence}, {consonant, vowel+, silence}, and {voiced, unvoiced, silence} and, compute the PER for e… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 5 pages, 6 figures, 3 tables

  2. arXiv:2303.18110  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR

    Authors: Ramon Sanabria, Nikolay Bogoychev, Nina Markl, Andrea Carmantini, Ondrej Klejch, Peter Bell

    Abstract: English is the most widely spoken language in the world, used daily by millions of people as a first or second language in many different contexts. As a result, there are many varieties of English. Although the great many advances in English automatic speech recognition (ASR) over the past decades, results are usually reported based on test datasets which fail to represent the diversity of English… ▽ More

    Submitted 31 March, 2023; originally announced March 2023.

    Comments: Accepted to IEEE ICASSP 2023

  3. arXiv:1910.02168  [pdf, other

    eess.AS

    Cross lingual transfer learning for zero-resource domain adaptation

    Authors: Alberto Abad, Peter Bell, Andrea Carmantini, Steve Renals

    Abstract: We propose a method for zero-resource domain adaptation of DNN acoustic models, for use in low-resource situations where the only in-language training data available may be poorly matched to the intended target domain. Our method uses a multi-lingual model in which several DNN layers are shared between languages. This architecture enables domain adaptation transforms learned for one well-resourced… ▽ More

    Submitted 29 October, 2019; v1 submitted 4 October, 2019; originally announced October 2019.

    Comments: Submitted to ICASSP 2020. Main updates wrt previous versions: same network config in all experiments, added Babel/Material LR target language experiments, added comparison with alternative/similar methods of cross-lingual adaptation