Skip to main content

Showing 1–1 of 1 results for author: Boujou, E

.
  1. arXiv:2102.11000  [pdf, other

    cs.CL cs.LG

    An open access NLP dataset for Arabic dialects : Data collection, labeling, and model construction

    Authors: ElMehdi Boujou, Hamza Chataoui, Abdellah El Mekki, Saad Benjelloun, Ikram Chairi, Ismail Berrada

    Abstract: Natural Language Processing (NLP) is today a very active field of research and innovation. Many applications need however big sets of data for supervised learning, suitably labelled for the training purpose. This includes applications for the Arabic language and its national dialects. However, such open access labeled data sets in Arabic and its dialects are lacking in the Data Science ecosystem a… ▽ More

    Submitted 6 February, 2021; originally announced February 2021.