Skip to main content

Showing 1–3 of 3 results for author: Sleiman, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.03413  [pdf, other

    cs.CV

    MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

    Authors: Kirolos Ataallah, Xiaoqian Shen, Eslam Abdelrahman, Essam Sleiman, Deyao Zhu, Jian Ding, Mohamed Elhoseiny

    Abstract: This paper introduces MiniGPT4-Video, a multimodal Large Language Model (LLM) designed specifically for video understanding. The model is capable of processing both temporal visual and textual data, making it adept at understanding the complexities of videos. Building upon the success of MiniGPT-v2, which excelled in translating visual features into the LLM space for single images and achieved imp… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 6 pages,8 figures

  2. arXiv:2310.02544  [pdf, other

    cs.CV

    SlowFormer: Universal Adversarial Patch for Attack on Compute and Energy Efficiency of Inference Efficient Vision Transformers

    Authors: KL Navaneet, Soroush Abbasi Koohpayegani, Essam Sleiman, Hamed Pirsiavash

    Abstract: Recently, there has been a lot of progress in reducing the computation of deep models at inference time. These methods can reduce both the computational needs and power usage of deep models. Some of these approaches adaptively scale the compute based on the input instance. We show that such models can be vulnerable to a universal adversarial patch attack, where the attacker optimizes for a patch t… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Code is available at https://github.com/UCDvision/SlowFormer

  3. arXiv:2211.12999  [pdf, other

    cs.LG

    Mitigating Negative Transfer in Multi-Task Learning with Exponential Moving Average Loss Weighting Strategies

    Authors: Anish Lakkapragada, Essam Sleiman, Saimourya Surabhi, Dennis P. Wall

    Abstract: Multi-Task Learning (MTL) is a growing subject of interest in deep learning, due to its ability to train models more efficiently on multiple tasks compared to using a group of conventional single-task models. However, MTL can be impractical as certain tasks can dominate training and hurt performance in others, thus making some tasks perform better in a single-task model compared to a multi-task on… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: AAAI 2023 Student Abstract, Contains Abstract + Appendix / Supplementary Material