Skip to main content

Showing 1–7 of 7 results for author: Möbius, B

.
  1. arXiv:2306.02405  [pdf, other

    cs.CL

    An Information-Theoretic Analysis of Self-supervised Discrete Representations of Speech

    Authors: Badr M. Abdullah, Mohammed Maqsood Shaik, Bernd Möbius, Dietrich Klakow

    Abstract: Self-supervised representation learning for speech often involves a quantization step that transforms the acoustic input into discrete units. However, it remains unclear how to characterize the relationship between these discrete units and abstract phonetic categories such as phonemes. In this paper, we develop an information-theoretic framework whereby we represent each phonetic category as a dis… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted in Interspeech 2023

  2. arXiv:2209.06633  [pdf, other

    cs.CL eess.AS

    Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings

    Authors: Badr M. Abdullah, Bernd Möbius, Dietrich Klakow

    Abstract: Models of acoustic word embeddings (AWEs) learn to map variable-length spoken word segments onto fixed-dimensionality vector representations such that different acoustic exemplars of the same word are projected nearby in the embedding space. In addition to their speech technology applications, AWE models have been shown to predict human performance on a variety of auditory lexical processing tasks… ▽ More

    Submitted 18 September, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted in INTERSPEECH 2022

  3. arXiv:2109.10179  [pdf, other

    cs.CL

    How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings

    Authors: Badr M. Abdullah, Iuliia Zaitova, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

    Abstract: How do neural networks "perceive" speech sounds from unknown languages? Does the typological similarity between the model's training language (L1) and an unknown language (L2) have an impact on the model representations of L2 speech signals? To answer these questions, we present a novel experimental design based on representational similarity analysis (RSA) to analyze acoustic word embeddings (AWE… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: BlackboxNLP 2021

  4. arXiv:2106.08686  [pdf, other

    cs.CL cs.SD eess.AS

    Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study

    Authors: Badr M. Abdullah, Marius Mosbach, Iuliia Zaitova, Bernd Möbius, Dietrich Klakow

    Abstract: Several variants of deep neural networks have been successfully employed for building parametric models that project variable-duration spoken word segments onto fixed-size vector representations, or acoustic word embeddings (AWEs). However, it remains unclear to what degree we can rely on the distance in the emerging AWE space as an estimate of word-form similarity. In this paper, we ask: does the… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted in Interspeech 2021

  5. arXiv:2010.11973  [pdf, other

    cs.CL

    Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language Identification

    Authors: Badr M. Abdullah, Jacek Kudera, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

    Abstract: Deep neural networks have been employed for various spoken language recognition tasks, including tasks that are multilingual by definition such as spoken language identification. In this paper, we present a neural model for Slavic language identification in speech signals and analyze its emergent representations to investigate whether they reflect objective measures of language relatedness and/or… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted in VarDial 2020 Workshop

  6. arXiv:2008.00545  [pdf, other

    eess.AS cs.CL

    Cross-Domain Adaptation of Spoken Language Identification for Related Languages: The Curious Case of Slavic Languages

    Authors: Badr M. Abdullah, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

    Abstract: State-of-the-art spoken language identification (LID) systems, which are based on end-to-end deep neural networks, have shown remarkable success not only in discriminating between distant languages but also between closely-related languages or even different spoken varieties of the same language. However, it is still unclear to what extent neural LID models generalize to speech samples with differ… ▽ More

    Submitted 6 August, 2020; v1 submitted 2 August, 2020; originally announced August 2020.

    Comments: To appear in INTERSPEECH 2020

  7. Studying Mutual Phonetic Influence with a Web-Based Spoken Dialogue System

    Authors: Eran Raveh, Ingmar Steiner, Iona Gessinger, Bernd Möbius

    Abstract: This paper presents a study on mutual speech variation influences in a human-computer setting. The study highlights behavioral patterns in data collected as part of a shadowing experiment, and is performed using a novel end-to-end platform for studying phonetic variation in dialogue. It includes a spoken dialogue system capable of detecting and tracking the state of phonetic features in the user's… ▽ More

    Submitted 13 September, 2018; originally announced September 2018.

    Comments: Proc. 20th International Conference on Speech and Computer (SPECOM)