Skip to main content

Showing 1–2 of 2 results for author: Liberato, J P

.
  1. arXiv:2407.03032  [pdf, other

    cs.CL

    Strategies for Arabic Readability Modeling

    Authors: Juan Piñeros Liberato, Bashar Alhafni, Muhamed Al Khalil, Nizar Habash

    Abstract: Automatic readability assessment is relevant to building NLP applications for education, content analysis, and accessibility. However, Arabic readability assessment is a challenging task due to Arabic's morphological richness and limited readability resources. In this paper, we present a set of experimental results on Arabic readability assessment using a diverse range of approaches, from rule-bas… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted to ArabicNLP 2024, ACL

  2. arXiv:2404.18615  [pdf, other

    cs.CL

    The SAMER Arabic Text Simplification Corpus

    Authors: Bashar Alhafni, Reem Hazim, Juan Piñeros Liberato, Muhamed Al Khalil, Nizar Habash

    Abstract: We present the SAMER Corpus, the first manually annotated Arabic parallel corpus for text simplification targeting school-aged learners. Our corpus comprises texts of 159K words selected from 15 publicly available Arabic fiction novels most of which were published between 1865 and 1955. Our corpus includes readability level annotations at both the document and word levels, as well as two simplifie… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024. 15 pages, 6 tables, 1 figure