
Showing 1–10 of 10 results for author: Matassoni, M

Searching in archive cs.
  1. arXiv:2211.08849  [pdf, other]

    eess.AS cs.CL

    L2 proficiency assessment using self-supervised speech representations

    Authors: Stefano Bannò, Kate M. Knill, Marco Matassoni, Vyas Raina, Mark J. F. Gales

    Abstract: There has been a growing demand for automated spoken language assessment systems in recent years. A standard pipeline for this process is to start with a speech recognition system and derive features, either hand-crafted or based on deep-learning, that exploit the transcription and audio. Though these approaches can yield high performance systems, they require speech recognition systems that can b…

    Submitted 16 November, 2022; originally announced November 2022.

  2. arXiv:2210.13168  [pdf, other]

    cs.CL cs.SD eess.AS

    Proficiency assessment of L2 spoken English using wav2vec 2.0

    Authors: Stefano Bannò, Marco Matassoni

    Abstract: The increasing demand for learning English as a second language has led to a growing interest in methods for automatically assessing spoken language proficiency. Most approaches use hand-crafted features, but their efficacy relies on their particular underlying assumptions and they risk discarding potentially salient information about proficiency. Other approaches rely on transcriptions produced b…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted at SLT 2022

  3. arXiv:2107.09433  [pdf, other]

    cs.CL

    Seed Words Based Data Selection for Language Model Adaptation

    Authors: Roberto Gretter, Marco Matassoni, Daniele Falavigna

    Abstract: We address the problem of language model customization in applications where the ASR component needs to manage domain-specific terminology; although current state-of-the-art speech recognition technology provides excellent results for generic domains, the adaptation to specialized dictionaries or glossaries is still an open issue. In this work we present an approach for automatically selecting sen…

    Submitted 20 July, 2021; originally announced July 2021.

    Comments: 11 pages

    Journal ref: Proceedings of MT Summit 2021 - August 16-20, 2021

  4. Mixtures of Deep Neural Experts for Automated Speech Scoring

    Authors: Sara Papi, Edmondo Trentin, Roberto Gretter, Marco Matassoni, Daniele Falavigna

    Abstract: The paper copes with the task of automatic assessment of second language proficiency from the language learners' spoken responses to test prompts. The task has significant relevance to the field of computer assisted language learning. The approach presented in the paper relies on two separate modules: (1) an automatic speech recognition system that yields text transcripts of the spoken interaction…

    Submitted 23 June, 2021; originally announced June 2021.

    Journal ref: Proceedings of INTERSPEECH 2020

  5. arXiv:2104.05980  [pdf, other]

    cs.CL cs.SD eess.AS

    Experiments of ASR-based mispronunciation detection for children and adult English learners

    Authors: Nina Hosseini-Kivanani, Roberto Gretter, Marco Matassoni, Giuseppe Daniele Falavigna

    Abstract: Pronunciation is one of the fundamentals of language learning, and it is considered a primary factor of spoken language when it comes to understanding and being understood by others. The persistent presence of high error rates in speech recognition domains resulting from mispronunciations motivates us to find alternative techniques for handling mispronunciations. In this study, we develop a mis…

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: Submitted to INTERSPEECH2021

  6. arXiv:2104.02819  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    Learning to Rank Microphones for Distant Speech Recognition

    Authors: Samuele Cornell, Alessio Brutti, Marco Matassoni, Stefano Squartini

    Abstract: Fully exploiting ad-hoc microphone networks for distant speech recognition is still an open issue. Empirical evidence shows that being able to select the best microphone leads to significant improvements in recognition without any additional effort on front-end processing. Current channel selection techniques either rely on signal, decoder or posterior-based features. Signal-based features are ine…

    Submitted 13 April, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

  7. arXiv:2001.08051  [pdf, ps, other]

    cs.CL

    TLT-school: a Corpus of Non Native Children Speech

    Authors: Roberto Gretter, Marco Matassoni, Stefano Bannò, Daniele Falavigna

    Abstract: This paper describes "TLT-school", a corpus of speech utterances collected in schools of northern Italy for assessing the performance of students learning both English and German. The corpus was recorded in the years 2017 and 2018 from students aged between nine and sixteen years, attending primary, middle and high school. All utterances have been scored, in terms of some predefined proficiency ind…

    Submitted 22 January, 2020; originally announced January 2020.

  8. arXiv:1809.09658  [pdf, ps, other]

    cs.CL

    Non-native children speech recognition through transfer learning

    Authors: Marco Matassoni, Roberto Gretter, Daniele Falavigna, Diego Giuliani

    Abstract: This work deals with non-native children's speech and investigates both multi-task and transfer learning approaches to adapt a multi-language Deep Neural Network (DNN) to speakers, specifically children, learning a foreign language. The application scenario is characterized by young students learning English and German and reading sentences in these second-languages, as well as in their mother lan…

    Submitted 25 September, 2018; originally announced September 2018.

  9. Automatic Quality Estimation for ASR System Combination

    Authors: Shahab Jalalvand, Matteo Negri, Daniele Falavigna, Marco Matassoni, Marco Turchi

    Abstract: Recognizer Output Voting Error Reduction (ROVER) has been widely used for system combination in automatic speech recognition (ASR). In order to select the most appropriate words to insert at each position in the output transcriptions, some ROVER extensions rely on critical information such as confidence scores and other ASR decoder features. This information, which is not always available, highly…

    Submitted 22 June, 2017; originally announced June 2017.

  10. arXiv:1702.01714  [pdf, ps, other]

    cs.CL

    DNN adaptation by automatic quality estimation of ASR hypotheses

    Authors: Daniele Falavigna, Marco Matassoni, Shahab Jalalvand, Matteo Negri, Marco Turchi

    Abstract: In this paper we propose to exploit the automatic Quality Estimation (QE) of ASR hypotheses to perform the unsupervised adaptation of a deep neural network modeling acoustic probabilities. Our hypothesis is that significant improvements can be achieved by: i) automatically transcribing the evaluation data we are currently trying to recognise, and ii) selecting from it a subset of "good quality" ins…

    Submitted 6 February, 2017; originally announced February 2017.

    Comments: Computer Speech & Language December 2016