Skip to main content

Showing 1–2 of 2 results for author: Lekakou, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:1807.10740  [pdf, ps, other

    cs.CL

    A small Griko-Italian speech translation corpus

    Authors: Marcely Zanon Boito, Antonios Anastasopoulos, Marika Lekakou, Aline Villavicencio, Laurent Besacier

    Abstract: This paper presents an extension to a very low-resource parallel corpus collected in an endangered language, Griko, making it useful for computational research. The corpus consists of 330 utterances (about 20 minutes of speech) which have been transcribed and translated in Italian, with annotations for word-level speech-to-transcription and speech-to-translation alignments. The corpus also include… ▽ More

    Submitted 27 July, 2018; originally announced July 2018.

  2. arXiv:1806.03757  [pdf, ps, other

    cs.CL

    Part-of-Speech Tagging on an Endangered Language: a Parallel Griko-Italian Resource

    Authors: Antonis Anastasopoulos, Marika Lekakou, Josep Quer, Eleni Zimianiti, Justin DeBenedetto, David Chiang

    Abstract: Most work on part-of-speech (POS) tagging is focused on high resource languages, or examines low-resource and active learning settings through simulated studies. We evaluate POS tagging techniques on an actual endangered language, Griko. We present a resource that contains 114 narratives in Griko, along with sentence-level translations in Italian, and provides gold annotations for the test set. Ba… ▽ More

    Submitted 10 June, 2018; originally announced June 2018.

    Comments: to be presented at COLING 2018