Showing 1–50 of 93 results for author: Saad, M

Searching in archive cs.
  1. arXiv:2406.18776  [pdf, other]

    cs.CL

    Implicit Discourse Relation Classification For Nigerian Pidgin

    Authors: Muhammed Saeed, Peter Bourgonje, Vera Demberg

    Abstract: Despite attempts to make Large Language Models multi-lingual, many of the world's languages are still severely under-resourced. This widens the performance gap between NLP and AI applications aimed at well-financed, and those aimed at less-resourced languages. In this paper, we focus on Nigerian Pidgin (NP), which is spoken by nearly 100 million people, but has comparatively very few NLP resources…

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.09630  [pdf, other]

    cs.CV cs.LG

    Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition

    Authors: Mehreen Saeed, Adrian Chan, Anupam Mijar, Joseph Moukarzel, Georges Habchi, Carlos Younes, Amin Elias, Chau-Wai Wong, Akram Khater

    Abstract: We present the Manuscripts of Handwritten Arabic (Muharaf) dataset, which is a machine learning dataset consisting of more than 1,600 historic handwritten page images transcribed by experts in archival Arabic. Each document image is accompanied by spatial polygonal coordinates of its text lines as well as basic page elements. This dataset was compiled to advance the state of the art in handwritten…

    Submitted 13 June, 2024; originally announced June 2024.

  3. arXiv:2405.20987  [pdf, other]

    cs.CV cs.LG eess.IV

    Early Stopping Criteria for Training Generative Adversarial Networks in Biomedical Imaging

    Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: Generative Adversarial Networks (GANs) have high computational costs to train their complex architectures. Throughout the training process, GANs' output is analyzed qualitatively based on the loss and synthetic images' diversity and quality. Based on this qualitative analysis, training is manually halted once the desired synthetic images are generated. By utilizing an early stopping criterion, the…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: This paper is accepted at the 35th IEEE Irish Signals and Systems Conference (ISSC 2024)

  4. arXiv:2404.19238  [pdf, other]

    cs.IT cs.DC cs.GT cs.LG cs.NI

    Pilot Contamination in Massive MIMO Systems: Challenges and Future Prospects

    Authors: Muhammad Kamran Saeed, Ashfaq Khokhar, Shakil Ahmed

    Abstract: Massive multiple input multiple output (M-MIMO) technology plays a pivotal role in fifth-generation (5G) and beyond communication systems, offering a wide range of benefits, from increased spectral efficiency (SE) to enhanced energy efficiency and higher reliability. However, these advantages are contingent upon precise channel state information (CSI) availability at the base station (BS). Ensurin…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted At IWCMC 2024 Comm & SP Symposium

  5. arXiv:2404.18264  [pdf, other]

    cs.CL cs.AI

    Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin

    Authors: Pin-Jie Lin, Merel Scholman, Muhammed Saeed, Vera Demberg

    Abstract: Nigerian Pidgin is an English-derived contact language and is traditionally an oral language, spoken by approximately 100 million people. No orthographic standard has yet been adopted, and thus the few available Pidgin datasets that exist are characterised by noise in the form of orthographic variations. This contributes to under-performance of models in critical NLP tasks. The current work is the…

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024 Main Conference

  6. arXiv:2404.10188  [pdf, other]

    cs.NI cs.GT cs.IT cs.LG cs.SI

    Smart Pilot Assignment for IoT in Massive MIMO Systems: A Path Towards Scalable IoT Infrastructure

    Authors: Muhammad Kamran Saeed, Ashfaq Khokhar

    Abstract: 5G sets the foundation for an era of creativity with its faster speeds, increased data throughput, reduced latency, and enhanced IoT connectivity, all enabled by Massive MIMO (M-MIMO) technology. M-MIMO boosts network efficiency and enhances user experience by employing intelligent user scheduling. This paper presents a user scheduling scheme and pilot assignment strategy designed for IoT devices,…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted At ICC-2024

  7. arXiv:2404.09342  [pdf, other]

    cs.CV cs.SD eess.AS

    Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Salman Tahir, Rohan Kumar Das, Muhammad Zaigham Zaheer, Marta Moscati, Markus Schedl, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf

    Abstract: The advancements of technology have led to the use of multimodal systems in various real-world applications. Among them, the audio-visual systems are one of the widely used multimodal systems. In the recent years, associating face and voice of a person has gained attention due to presence of unique correlation between them. The Face-voice Association in Multilingual Environments (FAME) Challenge 2…

    Submitted 16 April, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: ACM Multimedia Conference - Grand Challenge

  8. arXiv:2404.06144  [pdf, other]

    cs.LG cs.AI

    Differential Privacy for Anomaly Detection: Analyzing the Trade-off Between Privacy and Explainability

    Authors: Fatima Ezzeddine, Mirna Saad, Omran Ayoub, Davide Andreoletti, Martin Gjoreski, Ihab Sbeity, Marc Langheinrich, Silvia Giordano

    Abstract: Anomaly detection (AD), also referred to as outlier detection, is a statistical process aimed at identifying observations within a dataset that significantly deviate from the expected pattern of the majority of the data. Such a process finds wide application in various fields, such as finance and healthcare. While the primary objective of AD is to yield high detection accuracy, the requirements of…

    Submitted 9 April, 2024; originally announced April 2024.

  9. arXiv:2401.17967  [pdf, other]

    cs.SE cs.LG

    CONCORD: Towards a DSL for Configurable Graph Code Representation

    Authors: Mootez Saad, Tushar Sharma

    Abstract: Deep learning is widely used to uncover hidden patterns in large code corpora. To achieve this, constructing a format that captures the relevant characteristics and features of source code is essential. Graph-based representations have gained attention for their ability to model structural and semantic information. However, existing tools lack flexibility in constructing graphs across different pr…

    Submitted 31 January, 2024; originally announced January 2024.

  10. arXiv:2401.09824  [pdf, other]

    cs.CR

    Conning the Crypto Conman: End-to-End Analysis of Cryptocurrency-based Technical Support Scams

    Authors: Bhupendra Acharya, Muhammad Saad, Antonio Emanuele Cinà, Lea Schönherr, Hoang Dai Nguyen, Adam Oest, Phani Vadrevu, Thorsten Holz

    Abstract: The mainstream adoption of cryptocurrencies has led to a surge in wallet-related issues reported by ordinary users on social media platforms. In parallel, there is an increase in an emerging fraud trend called cryptocurrency-based technical support scam, in which fraudsters offer fake wallet recovery services and target users experiencing wallet-related issues. In this paper, we perform a compre…

    Submitted 18 January, 2024; originally announced January 2024.

  11. ArabIcros: AI-Powered Arabic Crossword Puzzle Generation for Educational Applications

    Authors: Kamyar Zeinalipour, Mohamed Zaky Saad, Marco Maggini, Marco Gori

    Abstract: This paper presents the first Arabic crossword puzzle generator driven by advanced AI technology. Leveraging cutting-edge large language models including GPT4, GPT3-Davinci, GPT3-Curie, GPT3-Babbage, GPT3-Ada, and BERT, the system generates distinctive and challenging clues. Based on a dataset comprising over 50,000 clue-answer pairs, the generator employs fine-tuning, few/zero-shot learning strat…

    Submitted 26 January, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted Paper for ArabicNLP 2023 - The First Arabic Natural Language Processing Conference - Co-located with EMNLP 2023 in Singapore

  12. arXiv:2311.15024  [pdf]

    cs.CR

    A Comparative Study of Watering Hole Attack Detection Using Supervised Neural Network

    Authors: Mst. Nishita Aktar, Sornali Akter, Md. Nusaim Islam Saad, Jakir Hosen Jisun, Kh. Mustafizur Rahman, Md. Nazmus Sakib

    Abstract: The state of security demands innovative solutions to defend against targeted attacks due to the growing sophistication of cyber threats. This study explores the nefarious tactic known as "watering hole attacks" using supervised neural networks to detect and prevent these attacks. The neural network identifies patterns in website behavior and network traffic associated with such attacks. Testing on…

    Submitted 12 February, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

  13. arXiv:2311.13508  [pdf, other]

    cs.SE cs.LG

    Naturalness of Attention: Revisiting Attention in Code Language Models

    Authors: Mootez Saad, Tushar Sharma

    Abstract: Language models for code such as CodeBERT offer the capability to learn advanced source code representation, but their opacity poses barriers to understanding of captured properties. Recent attention analysis studies provide initial interpretability insights by focusing solely on attention weights rather than considering the wider context modeling of Transformers. This study aims to shed some ligh…

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Accepted at ICSE-NIER (2024) track

  14. arXiv:2310.11266  [pdf]

    cs.CL cs.AI cs.NE

    Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models

    Authors: Khushboo Verma, Marina Moore, Stephanie Wottrich, Karla Robles López, Nishant Aggarwal, Zeel Bhatt, Aagamjit Singh, Bradford Unroe, Salah Basheer, Nitish Sachdeva, Prinka Arora, Harmanjeet Kaur, Tanupreet Kaur, Tevon Hood, Anahi Marquez, Tushar Varshney, Nanfu Deng, Azaan Ramani, Pawanraj Ishwara, Maimoona Saeed, Tatiana López Velarde Peña, Bryan Barksdale, Sushovan Guha, Satwant Kumar

    Abstract: In response to the pressing need for advanced clinical problem-solving tools in healthcare, we introduce BooksMed, a novel framework based on a Large Language Model (LLM). BooksMed uniquely emulates human cognitive processes to deliver evidence-based and reliable responses, utilizing the GRADE (Grading of Recommendations, Assessment, Development, and Evaluations) framework to effectively quantify…

    Submitted 17 October, 2023; originally announced October 2023.

  15. arXiv:2310.03278  [pdf, other]

    cs.IT cs.GT cs.LG cs.NI eess.SP

    Mitigating Pilot Contamination and Enabling IoT Scalability in Massive MIMO Systems

    Authors: Muhammad Kamran Saeed, Ahmed E. Kamal, Ashfaq Khokhar

    Abstract: Massive MIMO is expected to play an important role in the development of 5G networks. This paper addresses the issue of pilot contamination and scalability in massive MIMO systems. The current practice of reusing orthogonal pilot sequences in adjacent cells leads to difficulty in differentiating incoming inter- and intra-cell pilot sequences. One possible solution is to increase the number of orth…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted At GLOBECOM 2023

  16. arXiv:2310.02240  [pdf, other]

    cs.RO

    Spherical Rolling Robots Design, Modeling, and Control: A Systematic Literature Review

    Authors: Aminata Diouf, Bruno Belzile, Maarouf Saad, David St-Onge

    Abstract: Spherical robots have garnered increasing interest for their applications in exploration, tunnel inspection, and extraterrestrial missions. Diverse designs have emerged, including barycentric configurations, pendulum-based mechanisms, etc. In addition, a wide spectrum of control strategies has been proposed, ranging from traditional PID approaches to cutting-edge neural networks. Our systematic re…

    Submitted 3 October, 2023; originally announced October 2023.

  17. arXiv:2309.12245  [pdf, other]

    eess.IV cs.CV cs.LG

    Adaptive Input-image Normalization for Solving the Mode Collapse Problem in GAN-based X-ray Images

    Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: Biomedical image datasets can be imbalanced due to the rarity of targeted diseases. Generative Adversarial Networks play a key role in addressing this imbalance by enabling the generation of synthetic images to augment datasets. It is important to generate synthetic images that incorporate a diverse range of features to accurately represent the distribution of features present in the training imag…

    Submitted 29 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Submitted to the Elsevier Journal

  18. arXiv:2308.05247  [pdf, other]

    cs.SI cs.CR

    TUBERAIDER: Attributing Coordinated Hate Attacks on YouTube Videos to their Source Communities

    Authors: Mohammad Hammas Saeed, Kostantinos Papadamou, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini

    Abstract: Alas, coordinated hate attacks, or raids, are becoming increasingly common online. In a nutshell, these are perpetrated by a group of aggressors who organize and coordinate operations on a platform (e.g., 4chan) to target victims on another community (e.g., YouTube). In this paper, we focus on attributing raids to their source community, paving the way for moderation approaches that take the conte…

    Submitted 22 June, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted for publication at the 18th International AAAI Conference on Web and Social Media (ICWSM 2024). Please cite accordingly

  19. arXiv:2308.02505  [pdf, other]

    eess.IV cs.CV cs.LG

    Assessing Intra-class Diversity and Quality of Synthetically Generated Images in a Biomedical and Non-biomedical Setting

    Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: In biomedical image analysis, data imbalance is common across several imaging modalities. Data augmentation is one of the key solutions in addressing this limitation. Generative Adversarial Networks (GANs) are increasingly being relied upon for data augmentation tasks. Biomedical image features are sensitive to evaluating the efficacy of synthetic images. These features can have a significant impa…

    Submitted 23 July, 2023; originally announced August 2023.

    Comments: This work is accepted in 25th Irish Machine Vision and Image Processing (IMVIP) Conference

  20. arXiv:2307.00382  [pdf, other]

    cs.CL

    Low-Resource Cross-Lingual Adaptive Training for Nigerian Pidgin

    Authors: Pin-Jie Lin, Muhammed Saeed, Ernie Chang, Merel Scholman

    Abstract: Developing effective spoken language processing systems for low-resource languages poses several challenges due to the lack of parallel data and limited resources for fine-tuning models. In this work, we target on improving upon both text classification and translation of Nigerian Pidgin (Naija) by collecting a large-scale parallel English-Pidgin corpus and further propose a framework of cross-lin…

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: To appear in INTERSPEECH 2023

  21. arXiv:2306.02630  [pdf, other]

    stat.ML cs.LG

    Covariance Adaptive Best Arm Identification

    Authors: El Mehdi Saad, Gilles Blanchard, Nicolas Verzelen

    Abstract: We consider the problem of best arm identification in the multi-armed bandit model, under fixed confidence. Given a confidence input $δ$, the goal is to identify the arm with the highest mean reward with a probability of at least $1 - δ$, while minimizing the number of arm pulls. While the literature provides solutions to this problem under the assumption of independent arms distributions, we pro…

    Submitted 20 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: New version with some minor corrections

    Journal ref: Neurips 2023

  22. arXiv:2306.02628  [pdf, other]

    stat.ML cs.LG

    Active Ranking of Experts Based on their Performances in Many Tasks

    Authors: El Mehdi Saad, Nicolas Verzelen, Alexandra Carpentier

    Abstract: We consider the problem of ranking n experts based on their performances on d tasks. We make a monotonicity assumption stating that for each pair of experts, one outperforms the other on all tasks. We consider the sequential setting where in each round, the learner has access to noisy evaluations of actively chosen pair of expert-task, given the information available up to the actual round. Given…

    Submitted 5 June, 2023; originally announced June 2023.

  23. arXiv:2304.13253  [pdf, other]

    cs.CR cs.CY cs.LG cs.SE

    Analyzing In-browser Cryptojacking

    Authors: Muhammad Saad, David Mohaisen

    Abstract: Cryptojacking is the permissionless use of a target device to covertly mine cryptocurrencies. With cryptojacking, attackers use malicious JavaScript codes to force web browsers into solving proof-of-work puzzles, thus making money by exploiting the resources of the website visitors. To understand and counter such attacks, we systematically analyze the static, dynamic, and economic aspects of in-br…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 14 pages, 11 tables, 8 figures, and 69 references. arXiv admin note: substantial text overlap with arXiv:1809.02152

  24. arXiv:2304.00472  [pdf, other]

    cs.DB cs.AI

    Querying Large Language Models with SQL

    Authors: Mohammed Saeed, Nicola De Cao, Paolo Papotti

    Abstract: In many use-cases, information is stored in text but not available in structured data. However, extracting data from natural language text to precisely fit a schema, and thus enable querying, is a challenging task. With the rise of pre-trained Large Language Models (LLMs), there is now an effective solution to store and use information extracted from massive corpora of text documents. Thus, we env…

    Submitted 25 October, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Comments: Accepted for presentation at EDBT 2024 as Vision paper

  25. arXiv:2303.13055  [pdf, other]

    cs.HC cs.LG

    Reimagining Application User Interface (UI) Design using Deep Learning Methods: Challenges and Opportunities

    Authors: Subtain Malik, Muhammad Tariq Saeed, Marya Jabeen Zia, Shahzad Rasool, Liaquat Ali Khan, Mian Ilyas Ahmed

    Abstract: In this paper, we present a review of the recent work in deep learning methods for user interface design. The survey encompasses well known deep learning techniques (deep neural networks, convolutional neural networks, recurrent neural networks, autoencoders, and generative adversarial networks) and datasets widely used to design user interface applications. We highlight important problems and eme…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: A review paper on studies of UI design techniques and deep learning

  26. arXiv:2303.08729  [pdf, other]

    cs.SE cs.AI cs.LG cs.PL

    DACOS-A Manually Annotated Dataset of Code Smells

    Authors: Himesh Nandani, Mootez Saad, Tushar Sharma

    Abstract: Researchers apply machine-learning techniques for code smell detection to counter the subjectivity of many code smells. Such approaches need a large, manually annotated dataset for training and benchmarking. Existing literature offers a few datasets; however, they are small in size and, more importantly, do not focus on the subjective code snippets. In this paper, we present DACOS, a manually anno…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 4 pages

  27. arXiv:2303.06129  [pdf, other]

    cs.CV

    Single-branch Network for Multimodal Training

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Muhammad Zaigham Zaheer, Karthik Nandakumar, Muhammad Haroon Yousaf, Arif Mahmood

    Abstract: With the rapid growth of social media platforms, users are sharing billions of multimedia posts containing audio, images, and text. Researchers have focused on building autonomous systems capable of processing such multimedia data to solve challenging multimodal tasks including cross-modal retrieval, matching, and verification. Existing works use separate networks to extract embeddings of each mod…

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted at ICASSP 2023

  28. arXiv:2302.13033  [pdf, other]

    cs.SD cs.CV cs.MM eess.AS

    Speaker Recognition in Realistic Scenario Using Multimodal Data

    Authors: Saqlain Hussain Shah, Muhammad Saad Saeed, Shah Nawaz, Muhammad Haroon Yousaf

    Abstract: In recent years, an association is established between faces and voices of celebrities leveraging large scale audio-visual information from YouTube. The availability of large scale audio-visual datasets is instrumental in developing speaker recognition methods based on standard Convolutional Neural Networks. Thus, the aim of this paper is to leverage large scale audio-visual information to improve…

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: Accepted at the International Conference on Artificial Intelligence (ICAI'2023)

  29. arXiv:2211.12009  [pdf]

    cs.CV cs.AI

    Deep-Learning-Based Computer Vision Approach For The Segmentation Of Ball Deliveries And Tracking In Cricket

    Authors: Kumail Abbas, Muhammad Saeed, M. Imad Khan, Khandakar Ahmed, Hua Wang

    Abstract: There has been a significant increase in the adoption of technology in cricket recently. This trend has created the problem of duplicate work being done in similar computer vision-based research works. Our research tries to solve one of these problems by segmenting ball deliveries in a cricket broadcast using deep learning models, MobileNet and YOLO, thus enabling researchers to use our work as a…

    Submitted 21 November, 2022; originally announced November 2022.

  30. arXiv:2210.06334  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    A Self-attention Guided Multi-scale Gradient GAN for Diversified X-ray Image Synthesis

    Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: Imbalanced image datasets are commonly available in the domain of biomedical image analysis. Biomedical images contain diversified features that are significant in predicting targeted diseases. Generative Adversarial Networks (GANs) are utilized to address the data limitation problem via the generation of synthetic images. Training challenges such as mode collapse, non-convergence, and instability…

    Submitted 12 November, 2022; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: Accepted in AICS-2022 Conference

  31. arXiv:2208.10238  [pdf, other]

    cs.CV

    Learning Branched Fusion and Orthogonal Projection for Face-Voice Association

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Sajid Javed, Muhammad Haroon Yousaf, Alessio Del Bue

    Abstract: Recent years have seen an increased interest in establishing association between faces and voices of celebrities leveraging audio-visual information from YouTube. Prior works adopt metric learning methods to learn an embedding space that is amenable for associated matching and verification tasks. Albeit showing some progress, such formulations are, however, restrictive due to dependency on distanc…

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Submitted: IEEE Transactions on Multimedia. arXiv admin note: substantial text overlap with arXiv:2112.10483

  32. arXiv:2208.09214  [pdf, other]

    cs.IR cs.AI cs.DB

    Crowdsourced Fact-Checking at Twitter: How Does the Crowd Compare With Experts?

    Authors: Mohammed Saeed, Nicolas Traub, Maelle Nicolas, Gianluca Demartini, Paolo Papotti

    Abstract: Fact-checking is one of the effective solutions in fighting online misinformation. However, traditional fact-checking is a process requiring scarce expert human resources, and thus does not scale well on social media because of the continuous flow of new content to be checked. Methods based on crowdsourcing have been proposed to tackle this challenge, as they can scale with a smaller cost, but, wh…

    Submitted 19 August, 2022; originally announced August 2022.

    Journal ref: Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM 2022)

  33. arXiv:2208.08224  [pdf, other]

    cs.CV eess.IV

    Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture

    Authors: Muhammad Muzammel, Mohd Zuki Yusoff, Mohamad Naufal Mohamad Saad, Faryal Sheikh, Muhammad Ahsan Awais

    Abstract: Buses and heavy vehicles have more blind spots compared to cars and other road vehicles due to their large sizes. Therefore, accidents caused by these heavy vehicles are more fatal and result in severe injuries to other road users. These possible blind-spot collisions can be identified early using vision-based object detection approaches. Yet, the existing state-of-the-art vision-based object dete…

    Submitted 19 August, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

  34. arXiv:2208.05593  [pdf, other]

    eess.IV cs.CV

    Evaluating the Quality and Diversity of DCGAN-based Generatively Synthesized Diabetic Retinopathy Imagery

    Authors: Cristina-Madalina Dragan, Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: Publicly available diabetic retinopathy (DR) datasets are imbalanced, containing limited numbers of images with DR. This imbalance contributes to overfitting when training machine learning classifiers. The impact of this imbalance is exacerbated as the severity of the DR stage increases, affecting the classifiers' diagnostic capacity. The imbalance can be addressed using Generative Adversarial Net…

    Submitted 30 August, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: 29 Pages, 8 Figures, submitted to MEDAL23: Advances in Deep Generative Models for Medical Artificial Intelligence (Springer Nature series)

  35. arXiv:2208.04705  [pdf, other]

    cs.CY cs.LG eess.SY

    Classification of Stress via Ambulatory ECG and GSR Data

    Authors: Zachary Dair, Muhammad Muneeb Saad, Urja Pawar, Samantha Dockray, Ruairi O'Reilly

    Abstract: In healthcare, detecting stress and enabling individuals to monitor their mental health and wellbeing is challenging. Advancements in wearable technology now enable continuous physiological data collection. This data can provide insights into mental health and behavioural states through psychophysiological analysis. However, automated analysis is required to provide timely results due to the quant…

    Submitted 8 June, 2023; v1 submitted 19 July, 2022; originally announced August 2022.

    Comments: Associated Code to enable reproducible experimental work - https://github.com/ZacDair/EMBC_Release SMILE dataset provided by Computational Wellbeing Group (COMPWELL) https://compwell.rice.edu/workshops/embc2022/dataset - https://compwell.rice.edu/

    ACM Class: I.2.m; J.3; J.4

    Journal ref: EMBC 2022 Compwell Workshop

  36. Medical Dataset Classification for Kurdish Short Text over Social Media

    Authors: Ari M. Saeed, Shnya R. Hussein, Chro M. Ali, Tarik A. Rashid

    Abstract: The Facebook application is used as a resource for collecting the comments of this dataset. The dataset consists of 6756 comments to create a Medical Kurdish Dataset (MKD). The samples are comments of users, which are gathered from different posts of pages (Medical, News, Economy, Education, and Sport). Six steps as a preprocessing technique are performed on the raw dataset to clean and remove noi…

    Submitted 26 March, 2022; originally announced April 2022.

    Comments: 11 pages

    Journal ref: DIB, 2020

  37. arXiv:2202.02489  [pdf, other]

    cs.CV

    Investigating the Challenges of Class Imbalance and Scale Variation in Object Detection in Aerial Images

    Authors: Ahmed Elhagry, Mohamed Saeed

    Abstract: While object detection is a common problem in computer vision, it is even more challenging when dealing with aerial satellite images. The variety in object scales and orientations can make them difficult to identify. In addition, there can be large amounts of densely packed small objects such as cars. In this project, we propose a few changes to the Faster-RCNN architecture. First, we experiment w…

    Submitted 4 February, 2022; originally announced February 2022.

  38. arXiv:2201.12946  [pdf, ps, other]

    quant-ph cs.ET

    Pauli Error Propagation-Based Gate Rescheduling for Quantum Circuit Error Mitigation

    Authors: Vedika Saravanan, Samah Mohamed Saeed

    Abstract: Noisy Intermediate-Scale Quantum (NISQ) algorithms, which run on noisy quantum computers should be carefully designed to boost the output state fidelity. While several compilation approaches have been proposed to minimize circuit errors, they often omit the detailed circuit structure information that does not affect the circuit depth or the gate count. In the presence of spatial variation in the e…

    Submitted 30 January, 2022; originally announced January 2022.

  39. arXiv:2201.10324  [pdf, other]

    eess.IV cs.CV cs.LG

    Addressing the Intra-class Mode Collapse Problem using Adaptive Input Image Normalization in GAN-based X-ray Images

    Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: Biomedical image datasets can be imbalanced due to the rarity of targeted diseases. Generative Adversarial Networks play a key role in addressing this imbalance by enabling the generation of synthetic images to augment datasets. It is important to generate synthetic images that incorporate a diverse range of features to accurately represent the distribution of features present in the training imag…

    Submitted 12 April, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: Accepted to the IEEE EMBC22 Conference

  40. arXiv:2201.07646  [pdf, other]

    cs.LG cs.CV

    A Survey on Training Challenges in Generative Adversarial Networks for Biomedical Image Analysis

    Authors: Muhammad Muneeb Saad, Ruairi O'Reilly, Mubashir Husain Rehmani

    Abstract: In biomedical image analysis, the applicability of deep learning methods is directly impacted by the quantity of image data available. This is due to deep learning models requiring large image datasets to provide high-level performance. Generative Adversarial Networks (GANs) have been widely utilized to address data limitations through the generation of synthetic biomedical images. GANs consist of…

    Submitted 10 August, 2023; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: Submitted to the AI Review Journal

  41. arXiv:2201.07219  [pdf, other]

    eess.IV cs.CV cs.LG

    Contrastive Pretraining for Echocardiography Segmentation with Limited Data

    Authors: Mohamed Saeed, Rand Muhtaseb, Mohammad Yaqub

    Abstract: Contrastive learning has proven useful in many applications where access to labelled data is limited. The lack of annotated data is particularly problematic in medical image segmentation, as it is difficult to have clinical experts manually annotate large volumes of data such as cardiac structures in ultrasound images of the heart. In this paper, we propose a self-supervised contrastive learning me…

    Submitted 14 July, 2022; v1 submitted 16 January, 2022; originally announced January 2022.

  42. arXiv:2112.10483  [pdf, other]

    cs.CV

    Fusion and Orthogonal Projection for Improved Face-Voice Association

    Authors: Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, Alessio Del Bue

    Abstract: We study the problem of learning association between face and voice, which is gaining interest in the computer vision community lately. Prior works adopt pairwise or triplet loss formulations to learn an embedding space amenable for associated matching and verification tasks. Albeit showing some progress, such loss formulations are, however, restrictive due to dependency on distance-dependent marg…

    Submitted 20 December, 2021; originally announced December 2021.

  43. arXiv:2112.00443  [pdf, other]

    cs.CR cs.CY cs.SI

    TROLLMAGNIFIER: Detecting State-Sponsored Troll Accounts on Reddit

    Authors: Mohammad Hammas Saeed, Shiza Ali, Jeremy Blackburn, Emiliano De Cristofaro, Savvas Zannettou, Gianluca Stringhini

    Abstract: Growing evidence points to recurring influence campaigns on social media, often sponsored by state actors aiming to manipulate public opinion on sensitive political topics. Typically, campaigns are performed through instrumented accounts, known as troll accounts; despite their prominence, however, little work has been done to detect these accounts in the wild. In this paper, we present TROLLMAGNIF…

    Submitted 1 December, 2021; originally announced December 2021.

  44. arXiv:2110.14205  [pdf, other]

    cs.LG

    FedPrune: Towards Inclusive Federated Learning

    Authors: Muhammad Tahir Munir, Muhammad Mustansar Saeed, Mahad Ali, Zafar Ayyub Qazi, Ihsan Ayyub Qazi

    Abstract: Federated learning (FL) is a distributed learning technique that trains a shared model over distributed data in a privacy-preserving manner. Unfortunately, FL's performance degrades when there is (i) variability in client characteristics in terms of computational and memory resources (system heterogeneity) and (ii) non-IID data distribution across clients (statistical heterogeneity). For example,…

    Submitted 27 October, 2021; originally announced October 2021.

  45. arXiv:2109.13006  [pdf, other]

    cs.AI cs.CL cs.LG cs.LO cs.NE

    RuleBert: Teaching Soft Rules to Pre-trained Language Models

    Authors: Mohammed Saeed, Naser Ahmadi, Preslav Nakov, Paolo Papotti

    Abstract: While pre-trained language models (PLMs) are the go-to solution to tackle many natural language processing problems, they are still very limited in their ability to capture and to use common-sense knowledge. In fact, even if information is available in the form of approximate (soft) logical rules, it is not clear how to transfer it to a PLM in order to improve its performance for deductive reasoni…

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Logical reasoning, soft Horn rules, Transformers, pre-trained language models, combining symbolic and probabilistic methods, BERT

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

    Journal ref: EMNLP-2021

  46. arXiv:2107.13643  [pdf]

    cs.CV eess.IV

    Lighter Stacked Hourglass Human Pose Estimation

    Authors: Ahmed Elhagry, Mohamed Saeed, Musie Araia

    Abstract: Human pose estimation (HPE) is one of the most challenging tasks in computer vision as humans are deformable by nature and thus their pose has so much variance. HPE aims to correctly identify the main joint locations of a single person or multiple people in a given image or video. Locating joints of a person in images or videos is an important task that can be applied in action recognition and obj…

    Submitted 28 July, 2021; originally announced July 2021.

  47. arXiv:2105.02816  [pdf, ps, other]

    stat.ML cs.IT cs.LG cs.SI

    Semidefinite Programming for Community Detection with Side Information

    Authors: Mohammad Esmaeili, Hussein Metwaly Saad, Aria Nosratinia

    Abstract: This paper produces an efficient Semidefinite Programming (SDP) solution for community detection that incorporates non-graph data, which in this context is known as side information. SDP is an efficient solution for standard community detection on graphs. We formulate a semi-definite relaxation for the maximum likelihood estimation of node labels, subject to observing both graph and non-graph data…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: 15 pages

  48. An Optimized Framework to Adopt Computer Laboratory Administrations for Operating System and Application Installations

    Authors: Miran Hama Rahim Saeed, Bryar A. Hassan, Shko M. Qader

    Abstract: Nowadays, in most fields, task automation is an area of interest and research because manual execution of a task is error-prone, time-consuming, and demanding of human resources and attention. In the area of computer laboratory administration, old-fashioned administration cannot keep pace with today's growth, where the Operating System (OS) and required applications are installed on all…

    Submitted 5 May, 2021; originally announced May 2021.

  49. arXiv:2102.09099  [pdf]

    eess.IV cs.CV cs.LG q-bio.QM

    NuCLS: A scalable crowdsourcing, deep learning approach and dataset for nucleus classification, localization and segmentation

    Authors: Mohamed Amgad, Lamees A. Atteya, Hagar Hussein, Kareem Hosny Mohammed, Ehab Hafiz, Maha A. T. Elsebaie, Ahmed M. Alhusseiny, Mohamed Atef AlMoslemany, Abdelmagid M. Elmatboly, Philip A. Pappalardo, Rokia Adel Sakr, Pooya Mobadersany, Ahmad Rachid, Anas M. Saad, Ahmad M. Alkashash, Inas A. Ruhban, Anas Alrefai, Nada M. Elgazar, Ali Abdulkarim, Abo-Alela Farag, Amira Etman, Ahmed G. Elsaeed, Yahya Alagha, Yomna A. Amer, Ahmed M. Raslan, et al. (12 additional authors not shown)

    Abstract: High-resolution mapping of cells and tissue structures provides a foundation for developing interpretable machine-learning models for computational pathology. Deep learning algorithms can provide accurate mappings given large numbers of labeled instances for training and validation. Generating adequate volume of quality labels has emerged as a critical barrier in computational pathology given the…

    Submitted 17 February, 2021; originally announced February 2021.

    Journal ref: GigaScience, 11 (2022)

  50. arXiv:2101.00330  [pdf, other]

    cs.CR cs.DC

    e-PoS: Making Proof-of-Stake Decentralized and Fair

    Authors: Muhammad Saad, Zhan Qin, Kui Ren, DaeHun Nyang, David Mohaisen

    Abstract: Blockchain applications that rely on the Proof-of-Work (PoW) have increasingly become energy inefficient with a staggering carbon footprint. In contrast, energy-efficient alternative consensus protocols such as Proof-of-Stake (PoS) may cause centralization and unfairness in the blockchain system. To address these challenges, we propose a modular version of PoS-based blockchain systems called epos…

    Submitted 1 January, 2021; originally announced January 2021.

    Journal ref: IEEE Transactions on Parallel and Distributed Systems, 2021