
Showing 1–3 of 3 results for author: Dolev, E L

  1. arXiv:2404.19310

    cs.CL

    Does Whisper understand Swiss German? An automatic, qualitative, and human evaluation

    Authors: Eyal Liron Dolev, Clemens Fidel Lutz, Noëmi Aepli

    Abstract: Whisper is a state-of-the-art automatic speech recognition (ASR) model (Radford et al., 2022). Although Swiss German dialects are allegedly not part of Whisper's training data, preliminary experiments showed that Whisper can transcribe Swiss German quite well, with the output being a speech translation into Standard German. To gain a better understanding of Whisper's performance on Swiss German, w…

    Submitted 9 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted at VarDial 2024 (the eleventh Workshop on NLP for Similar Languages, Varieties and Dialects 2024), Mexico City

  2. arXiv:2306.08999

    cs.CL cs.AI

    Voting Booklet Bias: Stance Detection in Swiss Federal Communication

    Authors: Eric Egli, Noah Mamié, Eyal Liron Dolev, Mathias Müller

    Abstract: In this study, we use recent stance detection methods to study the stance (for, against or neutral) of statements in official information booklets for voters. Our main goal is to answer the fundamental question: are topics to be voted on presented in a neutral way? To this end, we first train and compare several models for stance detection on a large dataset about Swiss politics. We find that fi…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: 10 pages (including abstract and appendix), 5 figures, Keywords: stance detection, natural language processing, political analysis

  3. arXiv:2306.08702

    cs.CL

    Does mBERT understand Romansh? Evaluating word embeddings using word alignment

    Authors: Eyal Liron Dolev

    Abstract: We test similarity-based word alignment models (SimAlign and awesome-align) in combination with word embeddings from mBERT and XLM-R on parallel sentences in German and Romansh. Since Romansh is an unseen language, we are dealing with a zero-shot setting. Using embeddings from mBERT, both models reach an alignment error rate of 0.22, which outperforms fast_align, a statistical model, and is on par…

    Submitted 17 August, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Journal ref: In Proceedings of the 8th edition of the Swiss Text Analytics Conference, 2023, pages 41-53, Neuchatel, Switzerland. Association for Computational Linguistics