Skip to main content

Showing 1–8 of 8 results for author: Sarraf, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.06018  [pdf, other

    cs.CV cs.LG

    Leveraging Transformers for Weakly Supervised Object Localization in Unconstrained Videos

    Authors: Shakeeb Murtaza, Marco Pedersoli, Aydin Sarraf, Eric Granger

    Abstract: Weakly-Supervised Video Object Localization (WSVOL) involves localizing an object in videos using only video-level labels, also referred to as tags. State-of-the-art WSVOL methods like Temporal CAM (TCAM) rely on class activation map** (CAM) and typically require a pre-trained CNN classifier. However, their localization accuracy is affected by their tendency to minimize the mutual information be… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2312.03723  [pdf, other

    cs.CL cs.AI cs.LG

    ChatGPT Application In Summarizing An Evolution Of Deep Learning Techniques In Imaging: A Qualitative Study

    Authors: Arman Sarraf, Amirabbas Abbaspour

    Abstract: The pursuit of article or text summarization has captured the attention of natural language processing (NLP) practitioners, presenting itself as a formidable challenge. ChatGPT 3.5 exhibits the capacity to condense the content of up to 3000 tokens into a single page, aiming to retain pivotal information from a given text across diverse themes. In a conducted qualitative research endeavor, we selec… ▽ More

    Submitted 26 November, 2023; originally announced December 2023.

  3. DiPS: Discriminative Pseudo-Label Sampling with Self-Supervised Transformers for Weakly Supervised Object Localization

    Authors: Shakeeb Murtaza, Soufiane Belharbi, Marco Pedersoli, Aydin Sarraf, Eric Granger

    Abstract: Self-supervised vision transformers (SSTs) have shown great potential to yield rich localization maps that highlight different objects in an image. However, these maps remain class-agnostic since the model is unsupervised. They often tend to decompose the image into multiple maps containing different objects while being unable to distinguish the object of interest from background noise objects. In… ▽ More

    Submitted 18 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Journal ref: Image and Vision Computing 140C (2023) 104838

  4. arXiv:2303.17708  [pdf, other

    cs.SE cs.LG

    Analysis of Failures and Risks in Deep Learning Model Converters: A Case Study in the ONNX Ecosystem

    Authors: Purvish Jajal, Wenxin Jiang, Arav Tewari, Erik Kocinare, Joseph Woo, Anusha Sarraf, Yung-Hsiang Lu, George K. Thiruvathukal, James C. Davis

    Abstract: Software engineers develop, fine-tune, and deploy deep learning (DL) models using a variety of development frameworks and runtime environments. DL model converters move models between frameworks and to runtime environments. Conversion errors compromise model quality and disrupt deployment. However, the failure characteristics of DL model converters are unknown, adding risk when using DL interopera… ▽ More

    Submitted 24 April, 2024; v1 submitted 30 March, 2023; originally announced March 2023.

  5. arXiv:2209.09209  [pdf, other

    cs.CV

    Discriminative Sampling of Proposals in Self-Supervised Transformers for Weakly Supervised Object Localization

    Authors: Shakeeb Murtaza, Soufiane Belharbi, Marco Pedersoli, Aydin Sarraf, Eric Granger

    Abstract: Drones are employed in a growing number of visual recognition applications. A recent development in cell tower inspection is drone-based asset surveillance, where the autonomous flight of a drone is guided by localizing objects of interest in successive aerial images. In this paper, we propose a method to train deep weakly-supervised object localization (WSOL) models based only on image-class labe… ▽ More

    Submitted 19 November, 2022; v1 submitted 9 September, 2022; originally announced September 2022.

  6. arXiv:2209.09195  [pdf, other

    cs.CV

    Constrained Sampling for Class-Agnostic Weakly Supervised Object Localization

    Authors: Shakeeb Murtaza, Soufiane Belharbi, Marco Pedersoli, Aydin Sarraf, Eric Granger

    Abstract: Self-supervised vision transformers can generate accurate localization maps of the objects in an image. However, since they decompose the scene into multiple maps containing various objects, and they do not rely on any explicit supervisory signal, they cannot distinguish between the object of interest from other objects, as required in weakly-supervised object localization (WSOL). To address this… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: 3 pages, 2 figures

  7. arXiv:2109.07069  [pdf, other

    cs.CV cs.LG

    F-CAM: Full Resolution Class Activation Maps via Guided Parametric Upscaling

    Authors: Soufiane Belharbi, Aydin Sarraf, Marco Pedersoli, Ismail Ben Ayed, Luke McCaffrey, Eric Granger

    Abstract: Class Activation Map** (CAM) methods have recently gained much attention for weakly-supervised object localization (WSOL) tasks. They allow for CNN visualization and interpretation without training on fully annotated image datasets. CAM methods are typically integrated within off-the-shelf CNN backbones, such as ResNet50. Due to convolution and pooling operations, these backbones yield low resol… ▽ More

    Submitted 20 October, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

    Comments: 23pages, WACV 2022

  8. arXiv:2107.04145  [pdf, other

    cs.MA cs.IT eess.SP

    Intelligent Link Adaptation for Grant-Free Access Cellular Networks: A Distributed Deep Reinforcement Learning Approach

    Authors: Joao V. C. Evangelista, Zeeshan Sattar, Georges Kaddoum, Bassant Selim, Aydin Sarraf

    Abstract: With the continuous growth of machine-type devices (MTDs), it is expected that massive machine-type communication (mMTC) will be the dominant form of traffic in future wireless networks. Applications based on this technology, have fundamentally different traffic characteristics from human-to-human (H2H) communication, which involves a relatively small number of devices transmitting large packets c… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

    Comments: 14 pages, 8 Figures