Skip to main content

Showing 1–4 of 4 results for author: Blandón, M A C

Searching in archive eess. Search in all archives.
.
  1. Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System

    Authors: Khazar Khorrami, María Andrea Cruz Blandón, Tuomas Virtanen, Okko Räsänen

    Abstract: Speech representation learning with self-supervised algorithms has resulted in notable performance boosts in many downstream tasks. Recent work combined self-supervised learning (SSL) and visually grounded speech (VGS) processing mechanisms for representation learning. The joint training with SSL and VGS mechanisms provides the opportunity to utilize both unlabeled speech and speech-related visual… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 5 pages, accepted by EUSIPCO 2023

  2. arXiv:2306.01506  [pdf, other

    cs.CL eess.AS stat.ML

    BabySLM: language-acquisition-friendly benchmark of self-supervised spoken language models

    Authors: Marvin Lavechin, Yaya Sy, Hadrien Titeux, María Andrea Cruz Blandón, Okko Räsänen, Hervé Bredin, Emmanuel Dupoux, Alejandrina Cristia

    Abstract: Self-supervised techniques for learning speech representations have been shown to develop linguistic competence from exposure to speech without the need for human labels. In order to fully realize the potential of these approaches and further our understanding of how infants learn language, simulations must closely emulate real-life situations by training on developmentally plausible corpora and b… ▽ More

    Submitted 8 June, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Proceedings of Interspeech 2023

  3. arXiv:2305.01965  [pdf, other

    cs.CL cs.SD eess.AS

    Analysing the Impact of Audio Quality on the Use of Naturalistic Long-Form Recordings for Infant-Directed Speech Research

    Authors: María Andrea Cruz Blandón, Alejandrina Cristia, Okko Räsänen

    Abstract: Modelling of early language acquisition aims to understand how infants bootstrap their language skills. The modelling encompasses properties of the input data used for training the models, the cognitive hypotheses and their algorithmic implementations being tested, and the evaluation methodologies to compare models to human data. Recent developments have enabled the use of more naturalistic traini… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted at the CogSci2023 conference

  4. arXiv:2008.00731  [pdf

    eess.AS cs.CL

    Unsupervised Discovery of Recurring Speech Patterns Using Probabilistic Adaptive Metrics

    Authors: Okko Räsänen, María Andrea Cruz Blandón

    Abstract: Unsupervised spoken term discovery (UTD) aims at finding recurring segments of speech from a corpus of acoustic speech data. One potential approach to this problem is to use dynamic time war** (DTW) to find well-aligning patterns from the speech data. However, automatic selection of initial candidate segments for the DTW-alignment and detection of "sufficiently good" alignments among those requi… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.