Showing 1–2 of 2 results for author: Kukk, K

  1. arXiv:2205.07083  [pdf, other]

    eess.AS cs.CL

    Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge

    Authors: Tanel Alumäe, Kunnar Kukk

    Abstract: This paper investigates different pretraining approaches to spoken language identification. The paper is based on our submission to the Oriental Language Recognition 2021 Challenge. We participated in two tracks of the challenge: constrained and unconstrained language recognition. For the constrained track, we first trained a Conformer-based encoder-decoder model for multilingual automatic speech…

    Submitted 14 May, 2022; originally announced May 2022.

    Comments: Accepted to Speaker Odyssey 2022

  2. arXiv:2203.16972  [pdf, other]

    eess.AS

    Improving Language Identification of Accented Speech

    Authors: Kunnar Kukk, Tanel Alumäe

    Abstract: Language identification from speech is a common preprocessing step in many spoken language processing systems. In recent years, this field has seen fast progress, mostly due to the use of self-supervised models pretrained on multilingual data and the use of large training corpora. This paper shows that for speech with a non-native or regional accent, the accuracy of spoken language identification…

    Submitted 1 July, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: Accepted to INTERSPEECH 2022