
Showing 1–8 of 8 results for author: Koller, O

  1. arXiv:2303.10782  [pdf, ps, other]

    cs.CL cs.CV

    On the Importance of Signer Overlap for Sign Language Detection

    Authors: Abhilash Pal, Stephan Huber, Cyrine Chaabani, Alessandro Manzotti, Oscar Koller

    Abstract: Sign language detection, identifying if someone is signing or not, is becoming crucially important for its applications in remote conferencing software and for selecting useful sign data for training sign language recognition or translation tasks. We argue that the current benchmark data sets for sign language detection estimate overly positive results that do not generalize well due to signer ove…

    Submitted 19 March, 2023; originally announced March 2023.

  2. arXiv:2210.13326  [pdf, other]

    cs.CL cs.CV

    Clean Text and Full-Body Transformer: Microsoft's Submission to the WMT22 Shared Task on Sign Language Translation

    Authors: Subhadeep Dey, Abhilash Pal, Cyrine Chaabani, Oscar Koller

    Abstract: This paper describes Microsoft's submission to the first shared task on sign language translation at WMT 2022, a public competition tackling sign language to spoken language translation for Swiss German sign language. The task is very challenging due to data scarcity and an unprecedented vocabulary size of more than 20k words on the target side. Moreover, the data is taken from real broadcast news…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: accepted for publication at WMT2022

  3. arXiv:2106.08126  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Dialectal Speech Recognition and Translation of Swiss German Speech to Standard German Text: Microsoft's Submission to SwissText 2021

    Authors: Yuriy Arabskyy, Aashish Agarwal, Subhadeep Dey, Oscar Koller

    Abstract: This paper describes the winning approach in the Shared Task 3 at SwissText 2021 on Swiss German Speech to Standard German Text, a public competition on dialect recognition and translation. Swiss German refers to the multitude of Alemannic dialects spoken in the German-speaking parts of Switzerland. Swiss German differs significantly from standard German in pronunciation, word inventory and gramma…

    Submitted 1 July, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: to be published in SwissText 2021

  4. arXiv:2009.00299  [pdf, other]

    cs.CV

    Multi-channel Transformers for Multi-articulatory Sign Language Translation

    Authors: Necati Cihan Camgoz, Oscar Koller, Simon Hadfield, Richard Bowden

    Abstract: Sign languages use multiple asynchronous information channels (articulators), not just the hands but also the face and body, which computational approaches often ignore. In this paper we tackle the multi-articulatory sign language translation task and propose a novel multi-channel transformer architecture. The proposed architecture allows both the inter and intra contextual relationships between d…

    Submitted 1 September, 2020; originally announced September 2020.

  5. arXiv:2008.09918  [pdf, other]

    cs.CV

    Quantitative Survey of the State of the Art in Sign Language Recognition

    Authors: Oscar Koller

    Abstract: This work presents a meta study covering around 300 published sign language recognition papers with over 400 experimental results. It includes most papers between the start of the field in 1983 and 2020. Additionally, it covers a fine-grained analysis on over 25 studies that have compared their recognition approaches on RWTH-PHOENIX-Weather 2014, the standard benchmark task of the field. Research…

    Submitted 29 August, 2020; v1 submitted 22 August, 2020; originally announced August 2020.

  6. arXiv:2003.13830  [pdf, other]

    cs.CV cs.CL cs.HC cs.LG

    Sign Language Transformers: Joint End-to-end Sign Language Recognition and Translation

    Authors: Necati Cihan Camgoz, Oscar Koller, Simon Hadfield, Richard Bowden

    Abstract: Prior work on Sign Language Translation has shown that having a mid-level sign gloss representation (effectively recognizing the individual signs) improves the translation performance drastically. In fact, the current state-of-the-art in translation requires gloss level tokenization in order to work. We introduce a novel transformer based architecture that jointly learns Continuous Sign Language R…

    Submitted 30 March, 2020; originally announced March 2020.

  7. arXiv:1908.08597  [pdf, other]

    cs.CV cs.CL cs.CY cs.GR cs.HC

    Sign Language Recognition, Generation, and Translation: An Interdisciplinary Perspective

    Authors: Danielle Bragg, Oscar Koller, Mary Bellard, Larwan Berke, Patrick Boudrealt, Annelies Braffort, Naomi Caselli, Matt Huenerfauth, Hernisa Kacorri, Tessa Verhoef, Christian Vogler, Meredith Ringel Morris

    Abstract: Developing successful sign language recognition, generation, and translation systems requires expertise in a wide range of fields, including computer vision, computer graphics, natural language processing, human-computer interaction, linguistics, and Deaf culture. Despite the need for deep interdisciplinary knowledge, existing research occurs in separate disciplinary silos, and tackles separate po…

    Submitted 22 August, 2019; originally announced August 2019.

  8. arXiv:1812.01053  [pdf, other]

    cs.CV

    MS-ASL: A Large-Scale Data Set and Benchmark for Understanding American Sign Language

    Authors: Hamid Reza Vaezi Joze, Oscar Koller

    Abstract: Sign language recognition is a challenging and often underestimated problem comprising multi-modal articulators (handshape, orientation, movement, upper body and face) that integrate asynchronously on multiple streams. Learning powerful statistical models in such a scenario requires much data, particularly to apply recent advances of the field. However, labeled data is a scarce resource for sign l…

    Submitted 20 November, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Journal ref: British Machine Vision Conference, September 2019, Cardiff, UK