Skip to main content

Showing 1–11 of 11 results for author: Berrada, I

.
  1. arXiv:2405.16482  [pdf, other

    cs.CL

    DarijaBanking: A New Resource for Overcoming Language Barriers in Banking Intent Detection for Moroccan Arabic Speakers

    Authors: Abderrahman Skiredj, Ferdaous Azhari, Ismail Berrada, Saad Ezzini

    Abstract: Navigating the complexities of language diversity is a central challenge in develo** robust natural language processing systems, especially in specialized domains like banking. The Moroccan Dialect (Darija) serves as the common language that blends cultural complexities, historical impacts, and regional differences. The complexities of Darija present a special set of challenges for language mode… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  2. arXiv:2401.04848  [pdf, other

    cs.CL

    Arabic Text Diacritization In The Age Of Transfer Learning: Token Classification Is All You Need

    Authors: Abderrahman Skiredj, Ismail Berrada

    Abstract: Automatic diacritization of Arabic text involves adding diacritical marks (diacritics) to the text. This task poses a significant challenge with noteworthy implications for computational processing and comprehension. In this paper, we introduce PTCAD (Pre-FineTuned Token Classification for Arabic Diacritization, a novel two-phase approach for the Arabic Text Diacritization task. PTCAD comprises a… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 32 pages, 3 figures, journal

  3. arXiv:2311.14462  [pdf, other

    eess.IV cs.CV

    CT-xCOV: a CT-scan based Explainable Framework for COVid-19 diagnosis

    Authors: Ismail Elbouknify, Afaf Bouhoute, Khalid Fardousse, Ismail Berrada, Abdelmajid Badri

    Abstract: In this work, CT-xCOV, an explainable framework for COVID-19 diagnosis using Deep Learning (DL) on CT-scans is developed. CT-xCOV adopts an end-to-end approach from lung segmentation to COVID-19 detection and explanations of the detection model's prediction. For lung segmentation, we used the well-known U-Net model. For COVID-19 detection, we compared three different CNN architectures: a standard… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  4. arXiv:2310.18778  [pdf, other

    cs.CL

    ProMap: Effective Bilingual Lexicon Induction via Language Model Prompting

    Authors: Abdellah El Mekki, Muhammad Abdul-Mageed, ElMoatez Billah Nagoudi, Ismail Berrada, Ahmed Khoumsi

    Abstract: Bilingual Lexicon Induction (BLI), where words are translated between two languages, is an important NLP task. While noticeable progress on BLI in rich resource languages using static word embeddings has been achieved. The word translation performance can be further improved by incorporating information from contextualized word embeddings. In this paper, we introduce ProMap, a novel approach for B… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: To appear in IJCNLP-AACL 2023

  5. arXiv:2206.08415  [pdf, other

    cs.CL

    CS-UM6P at SemEval-2022 Task 6: Transformer-based Models for Intended Sarcasm Detection in English and Arabic

    Authors: Abdelkader El Mahdaouy, Abdellah El Mekki, Kabil Essefar, Abderrahman Skiredj, Ismail Berrada

    Abstract: Sarcasm is a form of figurative language where the intended meaning of a sentence differs from its literal meaning. This poses a serious challenge to several Natural Language Processing (NLP) applications such as Sentiment Analysis, Opinion Mining, and Author Profiling. In this paper, we present our participating system to the intended sarcasm detection task in English and Arabic languages. Our sy… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  6. arXiv:2206.08407  [pdf, other

    cs.CL

    Deep Multi-Task Models for Misogyny Identification and Categorization on Arabic Social Media

    Authors: Abdelkader El Mahdaouy, Abdellah El Mekki, Ahmed Oumar, Hajar Mousannif, Ismail Berrada

    Abstract: The prevalence of toxic content on social media platforms, such as hate speech, offensive language, and misogyny, presents serious challenges to our interconnected society. These challenging issues have attracted widespread attention in Natural Language Processing (NLP) community. In this paper, we present the submitted systems to the first Arabic Misogyny Identification shared task. We investigat… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

  7. arXiv:2204.13515  [pdf, other

    cs.CL

    UM6P-CS at SemEval-2022 Task 11: Enhancing Multilingual and Code-Mixed Complex Named Entity Recognition via Pseudo Labels using Multilingual Transformer

    Authors: Abdellah El Mekki, Abdelkader El Mahdaouy, Mohammed Akallouch, Ismail Berrada, Ahmed Khoumsi

    Abstract: Building real-world complex Named Entity Recognition (NER) systems is a challenging task. This is due to the complexity and ambiguity of named entities that appear in various contexts such as short input sentences, emerging entities, and complex entities. Besides, real-world queries are mostly malformed, as they can be code-mixed or multilingual, among other scenarios. In this paper, we introduce… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

  8. arXiv:2106.12495  [pdf, other

    cs.CL

    BERT-based Multi-Task Model for Country and Province Level Modern Standard Arabic and Dialectal Arabic Identification

    Authors: Abdellah El Mekki, Abdelkader El Mahdaouy, Kabil Essefar, Nabil El Mamoun, Ismail Berrada, Ahmed Khoumsi

    Abstract: Dialect and standard language identification are crucial tasks for many Arabic natural language processing applications. In this paper, we present our deep learning-based system, submitted to the second NADI shared task for country-level and province-level identification of Modern Standard Arabic (MSA) and Dialectal Arabic (DA). The system is based on an end-to-end deep Multi-Task Learning (MTL) m… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

  9. arXiv:2106.12488  [pdf, other

    cs.CL

    Deep Multi-Task Model for Sarcasm Detection and Sentiment Analysis in Arabic Language

    Authors: Abdelkader El Mahdaouy, Abdellah El Mekki, Kabil Essefar, Nabil El Mamoun, Ismail Berrada, Ahmed Khoumsi

    Abstract: The prominence of figurative language devices, such as sarcasm and irony, poses serious challenges for Arabic Sentiment Analysis (SA). While previous research works tackle SA and sarcasm detection separately, this paper introduces an end-to-end deep Multi-Task Learning (MTL) model, allowing knowledge interaction between the two tasks. Our MTL model's architecture consists of a Bidirectional Encode… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

  10. arXiv:2102.11000  [pdf, other

    cs.CL cs.LG

    An open access NLP dataset for Arabic dialects : Data collection, labeling, and model construction

    Authors: ElMehdi Boujou, Hamza Chataoui, Abdellah El Mekki, Saad Benjelloun, Ikram Chairi, Ismail Berrada

    Abstract: Natural Language Processing (NLP) is today a very active field of research and innovation. Many applications need however big sets of data for supervised learning, suitably labelled for the training purpose. This includes applications for the Arabic language and its national dialects. However, such open access labeled data sets in Arabic and its dialects are lacking in the Data Science ecosystem a… ▽ More

    Submitted 6 February, 2021; originally announced February 2021.

  11. arXiv:1905.09083  [pdf, other

    cs.DM cs.DS cs.LO

    A Hypergraph Based Approach for the 4-Constraint Satisfaction Problem Tractability

    Authors: Rachid Oucheikh, Ismail Berrada, Outman El Hichami

    Abstract: Constraint Satisfaction Problem (CSP) is a framework for modeling and solving a variety of real-world problems. Once the problem is expressed as a finite set of constraints, the goal is to find the variables' values satisfying them. Even though the problem is in general NP-complete, there are some approximation and practical techniques to tackle its intractability. One of the most widely used tech… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.