Showing 1–18 of 18 results for author: Jamal, A

Searching in archive cs.
  1. arXiv:2404.13104  [pdf, other]

    cs.CL cs.AI

    Multi Class Depression Detection Through Tweets using Artificial Intelligence

    Authors: Muhammad Osama Nusrat, Waseem Shahzad, Saad Ahmed Jamal

    Abstract: Depression is a significant issue nowadays. As per the World Health Organization (WHO), in 2023, over 280 million individuals are grappling with depression. This is a huge number; if not taken seriously, these numbers will increase rapidly. About 4.89 billion individuals are social media users. People express their feelings and emotions on platforms like Twitter, Facebook, Reddit, Instagram, etc.…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 33 pages

  2. arXiv:2402.05128  [pdf, other]

    cs.CL cs.AI

    Enhancing Textbook Question Answering Task with Large Language Models and Retrieval Augmented Generation

    Authors: Hessa Abdulrahman Alawwad, Areej Alhothali, Usman Naseem, Ali Alkhathlan, Amani Jamal

    Abstract: Textbook question answering (TQA) is a challenging task in artificial intelligence due to the complex nature of context and multimodal data. Although previous research has significantly improved the task, there are still some limitations including the models' weak reasoning and inability to capture contextual information in the lengthy context. The introduction of large language models (LLMs) has…

    Submitted 14 February, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2401.12421  [pdf, other]

    cs.CV cs.AI

    AdaEmbed: Semi-supervised Domain Adaptation in the Embedding Space

    Authors: Ali Mottaghi, Mohammad Abdullah Jamal, Serena Yeung, Omid Mohareri

    Abstract: Semi-supervised domain adaptation (SSDA) presents a critical hurdle in computer vision, especially given the frequent scarcity of labeled data in real-world settings. This scarcity often causes foundation models, trained on extensive datasets, to underperform when applied to new domains. AdaEmbed, our newly proposed methodology for SSDA, offers a promising solution to these challenges. Leveraging…

    Submitted 22 January, 2024; originally announced January 2024.

  4. arXiv:2312.12250  [pdf, other]

    cs.CV

    ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room

    Authors: Idris Hamoud, Muhammad Abdullah Jamal, Vinkle Srivastav, Didier Mutter, Nicolas Padoy, Omid Mohareri

    Abstract: Surgical robotics holds much promise for improving patient safety and clinician experience in the Operating Room (OR). However, it also comes with new challenges, requiring strong team coordination and effective OR management. Automatic detection of surgical activities is a key requirement for developing AI-based intelligent tools to tackle these challenges. The current state-of-the-art surgical a…

    Submitted 19 December, 2023; originally announced December 2023.

  5. arXiv:2312.10068  [pdf, other]

    eess.SP cs.AI cs.LG

    Estimation of Physical Parameters of Waveforms With Neural Networks

    Authors: Saad Ahmed Jamal, Thomas Corpetti, Dirk Tiede, Mathilde Letard, Dimitri Lague

    Abstract: Light Detection and Ranging (LiDAR) are fast emerging sensors in the field of Earth Observation. It is a remote sensing technology that utilizes laser beams to measure distances and create detailed three-dimensional representations of objects and environments. The potential of Full Waveform LiDAR is much greater than just height estimation and 3D reconstruction only. Overall shape of signal provid…

    Submitted 5 December, 2023; originally announced December 2023.

  6. arXiv:2309.15313  [pdf, other]

    cs.CV

    M$^{3}$3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding

    Authors: Muhammad Abdullah Jamal, Omid Mohareri

    Abstract: We present a new pre-training strategy called M$^{3}$3D ($\underline{M}$ulti-$\underline{M}$odal $\underline{M}$asked $\underline{3D}$) built based on Multi-modal masked autoencoders that can leverage 3D priors and learned cross-modal representations in RGB-D data. We integrate two major self-supervised learning frameworks; Masked Image Modeling (MIM) and contrastive learning; aiming to effectivel…

    Submitted 26 September, 2023; originally announced September 2023.

  7. arXiv:2308.02960  [pdf, other]

    cs.CV cs.AI cs.LG

    Data Fusion for Multi-Task Learning of Building Extraction and Height Estimation

    Authors: Saad Ahmed Jamal, Arioluwa Aribisala

    Abstract: In accordance with the urban reconstruction problem proposed by the DFC23 Track 2 Contest, this paper attempts a multitask-learning method of building extraction and height estimation using both optical and radar satellite imagery. Contrary to the initial goal of multitask learning which could potentially give a superior solution by reusing features and forming implicit constraints between multipl…

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: 5 pages, 5 figures, 3 Tables

  8. arXiv:2307.02054  [pdf, other]

    cs.CL cs.AI

    Emoji Prediction in Tweets using BERT

    Authors: Muhammad Osama Nusrat, Zeeshan Habib, Mehreen Alam, Saad Ahmed Jamal

    Abstract: In recent years, the use of emojis in social media has increased dramatically, making them an important element in understanding online communication. However, predicting the meaning of emojis in a given text is a challenging task due to their ambiguous nature. In this study, we propose a transformer-based approach for emoji prediction using BERT, a widely-used pre-trained language model. We fine-…

    Submitted 26 August, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

    Comments: This paper is focused on predicting emojis corresponding to tweets using BERT

  9. arXiv:2305.11451  [pdf, other]

    cs.CV

    SurgMAE: Masked Autoencoders for Long Surgical Video Analysis

    Authors: Muhammad Abdullah Jamal, Omid Mohareri

    Abstract: There has been a growing interest in using deep learning models for processing long surgical videos, in order to automatically detect clinical/operational activities and extract metrics that can enable workflow efficiency tools and applications. However, training such models require vast amounts of labeled data which is costly and not scalable. Recently, self-supervised learning has been explored…

    Submitted 19 May, 2023; originally announced May 2023.

  10. arXiv:2301.02693  [pdf]

    cs.CV

    Design of Arabic Sign Language Recognition Model

    Authors: Muhammad Al-Barham, Ahmad Jamal, Musa Al-Yaman

    Abstract: Deaf people are using sign language for communication, and it is a combination of gestures, movements, postures, and facial expressions that correspond to alphabets and words in spoken languages. The proposed Arabic sign language recognition model helps deaf and hard hearing people communicate effectively with ordinary people. The recognition has four stages of converting the alphabet into letters…

    Submitted 6 January, 2023; originally announced January 2023.

  11. arXiv:2207.07894  [pdf, other]

    cs.CV

    Multi-Modal Unsupervised Pre-Training for Surgical Operating Room Workflow Analysis

    Authors: Muhammad Abdullah Jamal, Omid Mohareri

    Abstract: Data-driven approaches to assist operating room (OR) workflow analysis depend on large curated datasets that are time consuming and expensive to collect. On the other hand, we see a recent paradigm shift from supervised learning to self-supervised and/or unsupervised learning approaches that can learn representations from unlabeled datasets. In this paper, we leverage the unlabeled data captured i…

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI'22)

  12. arXiv:2205.02805  [pdf, other]

    cs.CV

    An Empirical Study on Activity Recognition in Long Surgical Videos

    Authors: Zhuohong He, Ali Mottaghi, Aidean Sharghi, Muhammad Abdullah Jamal, Omid Mohareri

    Abstract: Activity recognition in surgical videos is a key research area for developing next-generation devices and workflow monitoring systems. Since surgeries are long processes with highly-variable lengths, deep learning models used for surgical videos often consist of a two-stage setup using a backbone and temporal sequence model. In this paper, we investigate many state-of-the-art backbones and tempora…

    Submitted 6 September, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: 9 pages, excluding references

  13. arXiv:2111.05671  [pdf, other]

    cs.CL

    Pre-trained Transformer-Based Approach for Arabic Question Answering : A Comparative Study

    Authors: Kholoud Alsubhi, Amani Jamal, Areej Alhothali

    Abstract: Question answering (QA) is one of the most challenging yet widely investigated problems in Natural Language Processing (NLP). Question-answering (QA) systems try to produce answers for given questions. These answers can be generated from unstructured or structured text. Hence, QA is considered an important research area that can be used in evaluating text understanding systems. A large volume of QA…

    Submitted 10 November, 2021; originally announced November 2021.

  14. arXiv:2101.02663  [pdf, other]

    cs.CV cs.AI

    L2PF -- Learning to Prune Faster

    Authors: Manoj-Rohit Vemparala, Nael Fasfous, Alexander Frickenstein, Mhd Ali Moraly, Aquib Jamal, Lukas Frickenstein, Christian Unger, Naveen-Shankar Nagaraja, Walter Stechele

    Abstract: Various applications in the field of autonomous driving are based on convolutional neural networks (CNNs), especially for processing camera data. The optimization of such CNNs is a major challenge in continuous development. Newly learned features must be brought into vehicles as quickly as possible, and as such, it is not feasible to spend redundant GPU hours during compression. In this context, w…

    Submitted 7 January, 2021; originally announced January 2021.

  15. arXiv:2008.12480  [pdf, other]

    eess.IV cs.CV

    Human Blastocyst Classification after In Vitro Fertilization Using Deep Learning

    Authors: Ali Akbar Septiandri, Ade Jamal, Pritta Ameilia Iffanolida, Oki Riayati, Budi Wiweko

    Abstract: Embryo quality assessment after in vitro fertilization (IVF) is primarily done visually by embryologists. Variability among assessors, however, remains one of the main causes of the low success rate of IVF. This study aims to develop an automated embryo assessment based on a deep learning model. This study includes a total of 1084 images from 1226 embryos. The images were captured by an inverted m…

    Submitted 28 August, 2020; originally announced August 2020.

  16. arXiv:2003.10780  [pdf, other]

    cs.CV cs.LG stat.ML

    Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition from a Domain Adaptation Perspective

    Authors: Muhammad Abdullah Jamal, Matthew Brown, Ming-Hsuan Yang, Liqiang Wang, Boqing Gong

    Abstract: Object frequency in the real world often follows a power law, leading to a mismatch between datasets with long-tailed class distributions seen by a machine learning model and our expectation of the model to perform well on all classes. We analyze this mismatch from a domain adaptation point of view. First of all, we connect existing class-balanced methods for long-tailed classification to target s…

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: Accepted for publication at CVPR2020

  17. arXiv:1805.07722  [pdf, other]

    cs.LG stat.ML

    Task-Agnostic Meta-Learning for Few-shot Learning

    Authors: Muhammad Abdullah Jamal, Guo-Jun Qi, Mubarak Shah

    Abstract: Meta-learning approaches have been proposed to tackle the few-shot learning problem. Typically, a meta-learner is trained on a variety of tasks in the hopes of being generalizable to new tasks. However, the generalizability on new tasks of a meta-learner could be fragile when it is over-trained on existing tasks during meta-training phase. In other words, the initial model of a meta-learner could b…

    Submitted 20 May, 2018; originally announced May 2018.

  18. arXiv:1506.08966  [pdf]

    cs.IR cs.DL

    Classification of Research Citations (CRC)

    Authors: Bilal Hayat Butt, Muhammad Rafi, Arsal Jamal, Raja Sami Ur Rehman, Syed Muhammad Zubair Alam, Muhammad Bilal Alam

    Abstract: Research is a continuous phenomenon. It is recursive in nature. Every research is based on some earlier research outcome. A general approach in reviewing the literature for a problem is to categorize earlier work for the same problem as positive and negative citations. In this paper, we propose a novel automated technique, which classifies whether an earlier work is cited as sentiment positive or…

    Submitted 30 June, 2015; originally announced June 2015.