Skip to main content

Showing 1–2 of 2 results for author: Naski, M

.
  1. arXiv:2111.13138  [pdf, other

    cs.CL cs.LG

    TunBERT: Pretrained Contextualized Text Representation for Tunisian Dialect

    Authors: Abir Messaoudi, Ahmed Cheikhrouhou, Hatem Haddad, Nourchene Ferchichi, Moez BenHajhmida, Abir Korched, Malek Naski, Faten Ghriss, Amine Kerkeni

    Abstract: Pretrained contextualized text representation models learn an effective representation of a natural language to make it machine understandable. After the breakthrough of the attention mechanism, a new generation of pretrained models have been proposed achieving good performances since the introduction of the Transformer. Bidirectional Encoder Representations from Transformers (BERT) has become the… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

  2. arXiv:2104.02516  [pdf, other

    cs.CL

    AI4D -- African Language Program

    Authors: Kathleen Siminyu, Godson Kalipe, Davor Orlic, Jade Abbott, Vukosi Marivate, Sackey Freshia, Prateek Sibal, Bhanu Neupane, David I. Adelani, Amelia Taylor, Jamiil Toure ALI, Kevin Degila, Momboladji Balogoun, Thierno Ibrahima DIOP, Davis David, Chayma Fourati, Hatem Haddad, Malek Naski

    Abstract: Advances in speech and language technologies enable tools such as voice-search, text-to-speech, speech recognition and machine translation. These are however only available for high resource languages like English, French or Chinese. Without foundational digital resources for African languages, which are considered low-resource in the digital context, these advanced tools remain out of reach. This… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.