Skip to main content

Showing 1–9 of 9 results for author: Jamal, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.12421  [pdf, other

    cs.CV cs.AI

    AdaEmbed: Semi-supervised Domain Adaptation in the Embedding Space

    Authors: Ali Mottaghi, Mohammad Abdullah Jamal, Serena Yeung, Omid Mohareri

    Abstract: Semi-supervised domain adaptation (SSDA) presents a critical hurdle in computer vision, especially given the frequent scarcity of labeled data in real-world settings. This scarcity often causes foundation models, trained on extensive datasets, to underperform when applied to new domains. AdaEmbed, our newly proposed methodology for SSDA, offers a promising solution to these challenges. Leveraging… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  2. arXiv:2312.12250  [pdf, other

    cs.CV

    ST(OR)2: Spatio-Temporal Object Level Reasoning for Activity Recognition in the Operating Room

    Authors: Idris Hamoud, Muhammad Abdullah Jamal, Vinkle Srivastav, Didier Mutter, Nicolas Padoy, Omid Mohareri

    Abstract: Surgical robotics holds much promise for improving patient safety and clinician experience in the Operating Room (OR). However, it also comes with new challenges, requiring strong team coordination and effective OR management. Automatic detection of surgical activities is a key requirement for develo** AI-based intelligent tools to tackle these challenges. The current state-of-the-art surgical a… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  3. arXiv:2309.15313  [pdf, other

    cs.CV

    M$^{3}$3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding

    Authors: Muhammad Abdullah Jamal, Omid Mohareri

    Abstract: We present a new pre-training strategy called M$^{3}$3D ($\underline{M}$ulti-$\underline{M}$odal $\underline{M}$asked $\underline{3D}$) built based on Multi-modal masked autoencoders that can leverage 3D priors and learned cross-modal representations in RGB-D data. We integrate two major self-supervised learning frameworks; Masked Image Modeling (MIM) and contrastive learning; aiming to effectivel… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  4. arXiv:2305.11451  [pdf, other

    cs.CV

    SurgMAE: Masked Autoencoders for Long Surgical Video Analysis

    Authors: Muhammad Abdullah Jamal, Omid Mohareri

    Abstract: There has been a growing interest in using deep learning models for processing long surgical videos, in order to automatically detect clinical/operational activities and extract metrics that can enable workflow efficiency tools and applications. However, training such models require vast amounts of labeled data which is costly and not scalable. Recently, self-supervised learning has been explored… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

  5. arXiv:2302.12570  [pdf, other

    cs.NE

    Lasting Diversity and Superior Runtime Guarantees for the $(μ+1)$ Genetic Algorithm

    Authors: Benjamin Doerr, Aymen Echarghaoui, Mohammed Jamal, Martin S. Krejca

    Abstract: Most evolutionary algorithms (EAs) used in practice employ crossover. In contrast, only for few and mostly artificial examples a runtime advantage from crossover could be proven with mathematical means. The most convincing such result shows that the $(μ+1)$ genetic algorithm (GA) with population size $μ=O(n)$ optimizes jump functions with gap size $k \ge 3$ in time $O(n^k / μ+ n^{k-1}\log n)$, bea… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

  6. arXiv:2207.07894  [pdf, other

    cs.CV

    Multi-Modal Unsupervised Pre-Training for Surgical Operating Room Workflow Analysis

    Authors: Muhammad Abdullah Jamal, Omid Mohareri

    Abstract: Data-driven approaches to assist operating room (OR) workflow analysis depend on large curated datasets that are time consuming and expensive to collect. On the other hand, we see a recent paradigm shift from supervised learning to self-supervised and/or unsupervised learning approaches that can learn representations from unlabeled datasets. In this paper, we leverage the unlabeled data captured i… ▽ More

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI'22)

  7. arXiv:2205.02805  [pdf, other

    cs.CV

    An Empirical Study on Activity Recognition in Long Surgical Videos

    Authors: Zhuohong He, Ali Mottaghi, Aidean Sharghi, Muhammad Abdullah Jamal, Omid Mohareri

    Abstract: Activity recognition in surgical videos is a key research area for develo** next-generation devices and workflow monitoring systems. Since surgeries are long processes with highly-variable lengths, deep learning models used for surgical videos often consist of a two-stage setup using a backbone and temporal sequence model. In this paper, we investigate many state-of-the-art backbones and tempora… ▽ More

    Submitted 6 September, 2022; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: 9 pages, excluding references

  8. arXiv:2003.10780  [pdf, other

    cs.CV cs.LG stat.ML

    Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition from a Domain Adaptation Perspective

    Authors: Muhammad Abdullah Jamal, Matthew Brown, Ming-Hsuan Yang, Liqiang Wang, Boqing Gong

    Abstract: Object frequency in the real world often follows a power law, leading to a mismatch between datasets with long-tailed class distributions seen by a machine learning model and our expectation of the model to perform well on all classes. We analyze this mismatch from a domain adaptation point of view. First of all, we connect existing class-balanced methods for long-tailed classification to target s… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: Accepted for publication at CVPR2020

  9. arXiv:1805.07722  [pdf, other

    cs.LG stat.ML

    Task-Agnostic Meta-Learning for Few-shot Learning

    Authors: Muhammad Abdullah Jamal, Guo-Jun Qi, Mubarak Shah

    Abstract: Meta-learning approaches have been proposed to tackle the few-shot learning problem.Typically, a meta-learner is trained on a variety of tasks in the hopes of being generalizable to new tasks. However, the generalizability on new tasks of a meta-learner could be fragile when it is over-trained on existing tasks during meta-training phase. In other words, the initial model of a meta-learner could b… ▽ More

    Submitted 20 May, 2018; originally announced May 2018.