Skip to main content

Showing 1–4 of 4 results for author: Mager, E

.
  1. arXiv:2305.19474  [pdf, other

    cs.CL

    Ethical Considerations for Machine Translation of Indigenous Languages: Giving a Voice to the Speakers

    Authors: Manuel Mager, Elisabeth Mager, Katharina Kann, Ngoc Thang Vu

    Abstract: In recent years machine translation has become very successful for high-resource language pairs. This has also sparked new interest in research on the automatic translation of low-resource languages, including Indigenous languages. However, the latter are deeply related to the ethnic and cultural groups that speak (or used to speak) them. The data collection, modeling and deploying machine transla… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL2023 Main Conference

  2. arXiv:2203.08954  [pdf, other

    cs.CL cs.AI

    BPE vs. Morphological Segmentation: A Case Study on Machine Translation of Four Polysynthetic Languages

    Authors: Manuel Mager, Arturo Oncevay, Elisabeth Mager, Katharina Kann, Ngoc Thang Vu

    Abstract: Morphologically-rich polysynthetic languages present a challenge for NLP systems due to data sparsity, and a common strategy to handle this issue is to apply subword segmentation. We investigate a wide variety of supervised and unsupervised morphological segmentation methods for four polysynthetic languages: Nahuatl, Raramuri, Shipibo-Konibo, and Wixarika. Then, we compare the morphologically insp… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of ACL 2022

  3. arXiv:2104.08726  [pdf, other

    cs.CL

    AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages

    Authors: Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Ivan Meza-Ruiz, Gustavo A. Giménez-Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, Ngoc Thang Vu, Katharina Kann

    Abstract: Pretrained multilingual models are able to perform cross-lingual transfer in a zero-shot setting, even for languages unseen during pretraining. However, prior work evaluating performance on unseen languages has largely been limited to low-level, syntactic tasks, and it remains unclear if zero-shot learning of high-level, semantic tasks is possible for unseen languages. To explore this question, we… ▽ More

    Submitted 16 March, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: Accepted to ACL 2022

  4. arXiv:1807.00286  [pdf, ps, other

    cs.CL

    Lost in Translation: Analysis of Information Loss During Machine Translation Between Polysynthetic and Fusional Languages

    Authors: Manuel Mager, Elisabeth Mager, Alfonso Medina-Urrea, Ivan Meza, Katharina Kann

    Abstract: Machine translation from polysynthetic to fusional languages is a challenging task, which gets further complicated by the limited amount of parallel text available. Thus, translation performance is far from the state of the art for high-resource and more intensively studied language pairs. To shed light on the phenomena which hamper automatic translation to and from polysynthetic languages, we stu… ▽ More

    Submitted 1 July, 2018; originally announced July 2018.

    Comments: To appear in "All Together Now? Computational Modeling of Polysynthetic Languages" Workshop, at COLING 2018