Showing 1–17 of 17 results for author: Waheed, A

Searching in archive cs.
  1. arXiv:2407.01257  [pdf, other]

    cs.CL cs.SD eess.AS

    uDistil-Whisper: Label-Free Data Filtering for Knowledge Distillation via Large-Scale Pseudo Labelling

    Authors: Abdul Waheed, Karima Kadaoui, Muhammad Abdul-Mageed

    Abstract: Recent work on distilling Whisper's knowledge into small models using pseudo-labels shows promising performance while reducing the size by up to 50%. This results in small, efficient, and dedicated models. However, a critical step of distillation from pseudo-labels involves filtering high-quality predictions and using only those during training. This step requires ground truth to compare and filt…

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Work in progress

  2. arXiv:2406.16751  [pdf, other]

    cs.CL cs.SD eess.AS

    Towards Zero-Shot Text-To-Speech for Arabic Dialects

    Authors: Khai Duy Doan, Abdul Waheed, Muhammad Abdul-Mageed

    Abstract: Zero-shot multi-speaker text-to-speech (ZS-TTS) systems have advanced for English; however, it still lags behind due to insufficient resources. We address this gap for Arabic, a language of more than 450 million native speakers, by first adapting a sizeable existing dataset to suit the needs of speech synthesis. Additionally, we employ a set of Arabic dialect identification models to explore the i…

    Submitted 25 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.04512  [pdf, other]

    cs.CL cs.SD eess.AS

    To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation

    Authors: Abdul Waheed, Karima Kadaoui, Muhammad Abdul-Mageed

    Abstract: Arabic is known to present unique challenges for Automatic Speech Recognition (ASR). On one hand, its rich linguistic diversity and wide range of dialects complicate the development of robust, inclusive models. On the other, current multilingual ASR models are compute-intensive and lack proper comprehensive evaluations. In light of these challenges, we distill knowledge from large teacher models i…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL'24 main

  4. A Novel Defocus-Blur Region Detection Approach Based on DCT Feature and PCNN Structure

    Authors: Sadia Basar, Mushtaq Ali, Abdul Waheed, Muneer Ahmad, Mahdi H. Miraz

    Abstract: The motion or out-of-focus effect in digital images is the main reason for the blurred regions in defocused-blurred images. It may adversely affect various image features such as texture, pixel, and region. Therefore, it is important to detect in-focused objects in defocused-blurred images after the segmentation of blurred and non-blurred regions. The state-of-the-art techniques are prone to noisy…

    Submitted 12 October, 2023; originally announced November 2023.

    Journal ref: IEEE Access, 29 August 2023, Vol. 7, Electronic ISSN: 2169-3536, pp. 94945-94961, https://ieeexplore.ieee.org/document/10233857

  5. arXiv:2310.11069  [pdf, other]

    cs.CL cs.SD eess.AS

    VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System

    Authors: Abdul Waheed, Bashar Talafha, Peter Sullivan, AbdelRahim Elmadany, Muhammad Abdul-Mageed

    Abstract: Arabic is a complex language with many varieties and dialects spoken by over 450 million people around the world. Due to the linguistic diversity and variations, it is challenging to build a robust and generalized ASR system for Arabic. In this work, we address this gap by developing and demoing a system, dubbed VoxArabica, for dialect identification (DID) as well as automatic speech recognition (AS…

    Submitted 27 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted at ArabicNLP conference co-located with EMNLP'23. First three authors contributed equally

  6. arXiv:2308.03051  [pdf, other]

    cs.CL cs.LG

    TARJAMAT: Evaluation of Bard and ChatGPT on Machine Translation of Ten Arabic Varieties

    Authors: Karima Kadaoui, Samar M. Magdy, Abdul Waheed, Md Tawkat Islam Khondaker, Ahmed Oumar El-Shangiti, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

    Abstract: Despite the purported multilingual proficiency of instruction-finetuned large language models (LLMs) such as ChatGPT and Bard, the linguistic inclusivity of these models remains insufficiently explored. Considering this constraint, we present a thorough assessment of Bard and ChatGPT (encompassing both GPT-3.5 and GPT-4) regarding their machine translation proficiencies across ten varieties of Ara…

    Submitted 23 October, 2023; v1 submitted 6 August, 2023; originally announced August 2023.

    Comments: ArabicNLP 2023

  7. arXiv:2306.02902  [pdf, ps, other]

    cs.CL cs.SD eess.AS

    N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition

    Authors: Bashar Talafha, Abdul Waheed, Muhammad Abdul-Mageed

    Abstract: Whisper, the recently developed multilingual weakly supervised model, is reported to perform well on multiple speech recognition benchmarks in both monolingual and multilingual settings. However, it is not clear how Whisper would fare under diverse conditions even on languages it was evaluated on such as Arabic. In this work, we address this gap by comprehensively evaluating Whisper on several var…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 4 pages, INTERSPEECH 2023

  8. arXiv:2305.14976  [pdf, other]

    cs.CL cs.LG

    GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP

    Authors: Md Tawkat Islam Khondaker, Abdul Waheed, El Moatez Billah Nagoudi, Muhammad Abdul-Mageed

    Abstract: ChatGPT's emergence heralds a transformative phase in NLP, particularly demonstrated through its excellent performance on many English benchmarks. However, the model's efficacy across diverse linguistic contexts remains largely uncharted territory. This work aims to bridge this knowledge gap, with a primary focus on assessing ChatGPT's capabilities on Arabic languages and dialectal varieties. Our…

    Submitted 21 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Main Conference

  9. arXiv:2304.14402  [pdf, other]

    cs.CL

    LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions

    Authors: Minghao Wu, Abdul Waheed, Chiyu Zhang, Muhammad Abdul-Mageed, Alham Fikri Aji

    Abstract: Large language models (LLMs) with instruction fine-tuning demonstrate superior generative capabilities. However, these models are resource-intensive. To alleviate this issue, we explore distilling knowledge from instruction-tuned LLMs into much smaller ones. To this end, we carefully develop a large set of 2.58M instructions based on both existing and newly-generated instructions. In addition to b…

    Submitted 28 January, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: 21 pages, 8 figures, 17 tables, accepted by EACL2024 main conference

  10. arXiv:2304.08566  [pdf, other]

    cs.LG cs.CR

    GrOVe: Ownership Verification of Graph Neural Networks using Embeddings

    Authors: Asim Waheed, Vasisht Duddu, N. Asokan

    Abstract: Graph neural networks (GNNs) have emerged as a state-of-the-art approach to model and draw inferences from large scale graph-structured data in various application settings such as social networking. The primary goal of a GNN is to learn an embedding for each graph node in a dataset that encodes both the node features and the local graph structure around the node. Embeddings generated by a GNN for…

    Submitted 1 September, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: To appear in the IEEE Symposium on Security and Privacy, 2024. 12 pages, 5 figures

  11. NLP Workbench: Efficient and Extensible Integration of State-of-the-art Text Mining Tools

    Authors: Peiran Yao, Matej Kosmajac, Abeer Waheed, Kostyantyn Guzhva, Natalie Hervieux, Denilson Barbosa

    Abstract: NLP Workbench is a web-based platform for text mining that allows non-expert users to obtain semantic understanding of large-scale corpora using state-of-the-art text mining models. The platform is built upon latest pre-trained models and open source systems from academia that provide semantic analysis functionalities, including but not limited to entity linking, sentiment analysis, semantic parsi…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Camera-ready version for EACL 2023: System Demonstrations

  12. arXiv:2111.06647  [pdf, other]

    cs.CL

    Speaker and Time-aware Joint Contextual Learning for Dialogue-act Classification in Counselling Conversations

    Authors: Ganeshan Malhotra, Abdul Waheed, Aseem Srivastava, Md Shad Akhtar, Tanmoy Chakraborty

    Abstract: The onset of the COVID-19 pandemic has brought the mental health of people under risk. Social counselling has gained remarkable significance in this environment. Unlike general goal-oriented dialogues, a conversation between a patient and a therapist is considerably implicit, though the objective of the conversation is quite apparent. In such a case, understanding the intent of the patient is impe…

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: 9 pages; Accepted to WSDM 2022

  13. arXiv:2108.07249  [pdf, other]

    cs.CL

    BloomNet: A Robust Transformer based model for Bloom's Learning Outcome Classification

    Authors: Abdul Waheed, Muskan Goyal, Nimisha Mittal, Deepak Gupta, Ashish Khanna, Moolchand Sharma

    Abstract: Bloom taxonomy is a common paradigm for categorizing educational learning objectives into three learning levels: cognitive, affective, and psychomotor. For the optimization of educational programs, it is crucial to design course learning outcomes (CLOs) according to the different cognitive levels of Bloom Taxonomy. Usually, administrators of the institutions manually complete the tedious work of m…

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: Bloom's Taxonomy, Natural Language Processing, Transformer, Robustness and Generalization

  14. arXiv:2103.05094  [pdf]

    eess.IV cs.CV cs.LG

    CovidGAN: Data Augmentation Using Auxiliary Classifier GAN for Improved Covid-19 Detection

    Authors: Abdul Waheed, Muskan Goyal, Deepak Gupta, Ashish Khanna, Fadi Al-Turjman, Placido Rogerio Pinheiro

    Abstract: Coronavirus (COVID-19) is a viral disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The spread of COVID-19 seems to have a detrimental effect on the global economy and health. A positive chest X-ray of infected patients is a crucial step in the battle against COVID-19. Early results suggest that abnormalities exist in chest X-rays of patients suggestive of COVID-19. T…

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Accepted at IEEE Access. Received April 30, 2020, accepted May 11, 2020, date of publication May 14, 2020, date of current version May 28, 2020

    ACM Class: I.2.7

    Journal ref: IEEE Access, vol. 8, pp. 91916-91923, 2020

  15. arXiv:2103.05069  [pdf, ps, other]

    cs.CL

    Domain Controlled Title Generation with Human Evaluation

    Authors: Abdul Waheed, Muskan Goyal, Nimisha Mittal, Deepak Gupta

    Abstract: We study automatic title generation and present a method for generating domain-controlled titles for scientific articles. A good title allows you to get the attention that your research deserves. A title can be interpreted as a high-compression description of a document containing information on the implemented process. For domain-controlled titles, we used the pre-trained text-to-text transformer…

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: Accepted at ICICC-2021 for publication in Springer AISC series

  16. arXiv:2103.04264  [pdf, other]

    cs.CR cs.LG

    T-Miner: A Generative Approach to Defend Against Trojan Attacks on DNN-based Text Classification

    Authors: Ahmadreza Azizi, Ibrahim Asadullah Tahmid, Asim Waheed, Neal Mangaokar, Jiameng Pu, Mobin Javed, Chandan K. Reddy, Bimal Viswanath

    Abstract: Deep Neural Network (DNN) classifiers are known to be vulnerable to Trojan or backdoor attacks, where the classifier is manipulated such that it misclassifies any input containing an attacker-determined Trojan trigger. Backdoors compromise a model's integrity, thereby posing a severe threat to the landscape of DNN-based classification. While multiple defenses against such attacks exist for classif…

    Submitted 10 March, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: Accepted to Usenix Security 2021; First two authors contributed equally to this work; 18 pages, 11 tables

  17. arXiv:2103.00199  [pdf, other]

    cs.CL cs.IR cs.LG

    COVID-19 Tweets Analysis through Transformer Language Models

    Authors: Abdul Hameed Azeemi, Adeel Waheed

    Abstract: Understanding the public sentiment and perception in a healthcare crisis is essential for developing appropriate crisis management techniques. While some studies have used Twitter data for predictive modelling during COVID-19, fine-grained sentiment analysis of the opinion of people on social media during this pandemic has not yet been done. In this study, we perform an in-depth, fine-grained sent…

    Submitted 27 February, 2021; originally announced March 2021.

    Comments: 5 pages, 5 figures