Showing 1–5 of 5 results for author: Kadaoui, K

  1. arXiv:2407.01257  [pdf, other]

    cs.CL cs.SD eess.AS

    uDistil-Whisper: Label-Free Data Filtering for Knowledge Distillation via Large-Scale Pseudo Labelling

    Authors: Abdul Waheed, Karima Kadaoui, Muhammad Abdul-Mageed

    Abstract: Recent work on distilling Whisper's knowledge into small models using pseudo-labels shows promising performance while reducing the size by up to 50%. This results in small, efficient, and dedicated models. However, a critical step of distillation from pseudo-labels involves filtering high-quality predictions and using only those during training. This step requires ground truth to compare and filt…

    Submitted 3 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: Work in progress

  2. arXiv:2406.04512  [pdf, other]

    cs.CL cs.SD eess.AS

    To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation

    Authors: Abdul Waheed, Karima Kadaoui, Muhammad Abdul-Mageed

    Abstract: Arabic is known to present unique challenges for Automatic Speech Recognition (ASR). On one hand, its rich linguistic diversity and wide range of dialects complicate the development of robust, inclusive models. On the other, current multilingual ASR models are compute-intensive and lack proper comprehensive evaluations. In light of these challenges, we distill knowledge from large teacher models i…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL'24 main

  3. arXiv:2308.03051  [pdf, other]

    cs.CL cs.LG

    TARJAMAT: Evaluation of Bard and ChatGPT on Machine Translation of Ten Arabic Varieties

    Authors: Karima Kadaoui, Samar M. Magdy, Abdul Waheed, Md Tawkat Islam Khondaker, Ahmed Oumar El-Shangiti, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

    Abstract: Despite the purported multilingual proficiency of instruction-finetuned large language models (LLMs) such as ChatGPT and Bard, the linguistic inclusivity of these models remains insufficiently explored. Considering this constraint, we present a thorough assessment of Bard and ChatGPT (encompassing both GPT-3.5 and GPT-4) regarding their machine translation proficiencies across ten varieties of Ara…

    Submitted 23 October, 2023; v1 submitted 6 August, 2023; originally announced August 2023.

    Comments: ArabicNLP 2023

  4. arXiv:2107.13114  [pdf, other]

    cs.CV

    A Thorough Review on Recent Deep Learning Methodologies for Image Captioning

    Authors: Ahmed Elhagry, Karima Kadaoui

    Abstract: Image Captioning is a task that combines computer vision and natural language processing, where it aims to generate descriptive legends for images. It is a two-fold process relying on accurate image understanding and correct language understanding both syntactically and semantically. It is becoming increasingly difficult to keep up with the latest research and findings in the field of image captio…

    Submitted 27 July, 2021; originally announced July 2021.

  5. arXiv:2107.13111  [pdf, other]

    cs.CV

    Experimenting with Self-Supervision using Rotation Prediction for Image Captioning

    Authors: Ahmed Elhagry, Karima Kadaoui

    Abstract: Image captioning is a task in the field of Artificial Intelligence that merges between computer vision and natural language processing. It is responsible for generating legends that describe images, and has various applications like descriptions used by assistive technology or indexing images (for search engines for instance). This makes it a crucial topic in AI that is undergoing a lot of researc…

    Submitted 27 July, 2021; originally announced July 2021.