Skip to main content

Showing 1–3 of 3 results for author: Blatt, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.13842  [pdf, other

    cs.CL cs.SD eess.AS

    Joint vs Sequential Speaker-Role Detection and Automatic Speech Recognition for Air-traffic Control

    Authors: Alexander Blatt, Aravind Krishnan, Dietrich Klakow

    Abstract: Utilizing air-traffic control (ATC) data for downstream natural-language processing tasks requires preprocessing steps. Key steps are the transcription of the data via automatic speech recognition (ASR) and speaker diarization, respectively speaker role detection (SRD) to divide the transcripts into pilot and air-traffic controller (ATCO) transcripts. While traditional approaches take on these tas… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  2. arXiv:2211.04054  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

    Authors: Juan Zuluaga-Gomez, Karel Veselý, Igor Szöke, Alexander Blatt, Petr Motlicek, Martin Kocour, Mickael Rigault, Khalid Choukri, Amrutha Prasad, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Claudia Cevenini, Pavel Kolčárek, Allan Tart, Jan Černocký, Dietrich Klakow

    Abstract: Personal assistants, automatic speech recognizers and dialogue understanding systems are becoming more critical in our interconnected digital world. A clear example is air traffic control (ATC) communications. ATC aims at guiding aircraft and controlling the airspace in a safe and optimal manner. These voice-based dialogues are carried between an air traffic controller (ATCO) and pilots via very-h… ▽ More

    Submitted 15 June, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Manuscript under review; The code is available at: https://github.com/idiap/atco2-corpus

  3. arXiv:2204.06309  [pdf, other

    cs.CL cs.SD eess.AS

    Call-sign recognition and understanding for noisy air-traffic transcripts using surveillance information

    Authors: Alexander Blatt, Martin Kocour, Karel Veselý, Igor Szöke, Dietrich Klakow

    Abstract: Air traffic control (ATC) relies on communication via speech between pilot and air-traffic controller (ATCO). The call-sign, as unique identifier for each flight, is used to address a specific pilot by the ATCO. Extracting the call-sign from the communication is a challenge because of the noisy ATC voice channel and the additional noise introduced by the receiver. A low signal-to-noise ratio (SNR)… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: Accepted by ICASSP 2022