Skip to main content

Showing 1–4 of 4 results for author: Mahmudi, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.04809  [pdf, other

    cs.CL

    Low-Resource Machine Translation through Retrieval-Augmented LLM Prompting: A Study on the Mambai Language

    Authors: Raphaƫl Merx, Aso Mahmudi, Katrina Langford, Leo Alberto de Araujo, Ekaterina Vylomova

    Abstract: This study explores the use of large language models (LLMs) for translating English into Mambai, a low-resource Austronesian language spoken in Timor-Leste, with approximately 200,000 native speakers. Leveraging a novel corpus derived from a Mambai language manual and additional sentences translated by a native speaker, we examine the efficacy of few-shot LLM prompting for machine translation (MT)… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  2. arXiv:2109.08615  [pdf

    cs.CL

    CKMorph: A Comprehensive Morphological Analyzer for Central Kurdish

    Authors: Morteza Naserzade, Aso Mahmudi, Hadi Veisi, Hawre Hosseini, Mohammad MohammadAmini

    Abstract: A morphological analyzer, which is a significant component of many natural language processing applications especially for morphologically rich languages, divides an input word into all its composing morphemes and identifies their morphological roles. In this paper, we introduce a comprehensive morphological analyzer for Central Kurdish (CK), a low-resourced language with a rich morphology. Buildi… ▽ More

    Submitted 2 March, 2022; v1 submitted 17 September, 2021; originally announced September 2021.

  3. arXiv:2102.12109  [pdf

    cs.CL

    Automatic Meter Classification of Kurdish Poems

    Authors: Aso Mahmudi, Hadi Veisi

    Abstract: Most of the classic texts in Kurdish literature are poems. Knowing the meter of the poems is helpful for correct reading, a better understanding of the meaning, and avoidance of ambiguity. This paper presents a rule-based method for automatic classification of the poem meter for the Central Kurdish language. The metrical system of Kurdish poetry is divided into three classes of quantitative, sylla… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

  4. arXiv:2102.07412  [pdf

    cs.AI

    Jira: a Kurdish Speech Recognition System Designing and Building Speech Corpus and Pronunciation Lexicon

    Authors: Hadi Veisi, Hawre Hosseini, Mohammad Mohammadamini, Wirya Fathy, Aso Mahmudi

    Abstract: In this paper, we introduce the first large vocabulary speech recognition system (LVSR) for the Central Kurdish language, named Jira. The Kurdish language is an Indo-European language spoken by more than 30 million people in several countries, but due to the lack of speech and text resources, there is no speech recognition system for this language. To fill this gap, we introduce the first speech c… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.