Skip to main content

Showing 1–4 of 4 results for author: Gollan, C

.
  1. arXiv:2310.12648  [pdf, other

    cs.CL

    Towards Real-World Streaming Speech Translation for Code-Switched Speech

    Authors: Belen Alastruey, Matthias Sperber, Christian Gollan, Dominic Telaar, Tim Ng, Aashish Agarwal

    Abstract: Code-switching (CS), i.e. mixing different languages in a single sentence, is a common phenomenon in communication and can be challenging in many Natural Language Processing (NLP) settings. Previous studies on CS speech have shown promising results for end-to-end speech translation (ST), but have been limited to offline scenarios and to translation to one of the languages present in the source (\t… ▽ More

    Submitted 23 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

  2. arXiv:2204.05076  [pdf, other

    cs.CL cs.SD eess.AS

    End-to-End Speech Translation for Code Switched Speech

    Authors: Orion Weller, Matthias Sperber, Telmo Pires, Hendra Setiawan, Christian Gollan, Dominic Telaar, Matthias Paulik

    Abstract: Code switching (CS) refers to the phenomenon of interchangeably using words and phrases from different languages. CS can pose significant accuracy challenges to NLP, due to the often monolingual nature of the underlying systems. In this work, we focus on CS in the context of English/Spanish conversations for the task of speech translation (ST), generating and evaluating both transcript and transla… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted to Findings of ACL 2022

  3. arXiv:2101.09149  [pdf, other

    cs.CL cs.LG

    Streaming Models for Joint Speech Recognition and Translation

    Authors: Orion Weller, Matthias Sperber, Christian Gollan, Joris Kluivers

    Abstract: Using end-to-end models for speech translation (ST) has increasingly been the focus of the ST community. These models condense the previously cascaded systems by directly converting sound waves into translated text. However, cascaded models have the advantage of including automatic speech recognition output, useful for a variety of practical ST systems that often display transcripts to the user al… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: Camera Ready for EACL 2021

  4. arXiv:2007.12741  [pdf, other

    cs.CL

    Consistent Transcription and Translation of Speech

    Authors: Matthias Sperber, Hendra Setiawan, Christian Gollan, Udhyakumar Nallasamy, Matthias Paulik

    Abstract: The conventional paradigm in speech translation starts with a speech recognition step to generate transcripts, followed by a translation step with the automatic transcripts as input. To address various shortcomings of this paradigm, recent work explores end-to-end trainable direct models that translate without transcribing. However, transcripts can be an indispensable output in practical applicati… ▽ More

    Submitted 28 August, 2020; v1 submitted 24 July, 2020; originally announced July 2020.

    Comments: Accepted at TACL (pre-MIT Press publication version); added dataset link