Showing 1–7 of 7 results for author: Abdelhamed, A

  1. arXiv:2405.15668  [pdf, other]

    cs.CV

    What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models

    Authors: Abdelrahman Abdelhamed, Mahmoud Afifi, Alec Go

    Abstract: Large language models (LLMs) have been effectively used for many computer vision tasks, including image classification. In this paper, we present a simple yet effective approach for zero-shot image classification using multimodal LLMs. By employing multimodal LLMs, we generate comprehensive textual representations from input images. These textual representations are then utilized to generate fixed-…

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2206.02715  [pdf, other]

    cs.CV eess.IV

    Day-to-Night Image Synthesis for Training Nighttime Neural ISPs

    Authors: Abhijith Punnappurath, Abdullah Abuolaim, Abdelrahman Abdelhamed, Alex Levinshtein, Michael S. Brown

    Abstract: Many flagship smartphone cameras now use a dedicated neural image signal processor (ISP) to render noisy raw sensor images to the final processed output. Training nightmode ISP networks relies on large-scale datasets of image pairs with: (1) a noisy raw image captured with a short exposure and a high ISO gain; and (2) a ground truth low-noise raw image captured with a long exposure and low ISO tha…

    Submitted 6 June, 2022; originally announced June 2022.

  3. arXiv:2006.12709  [pdf, other]

    cs.CV eess.IV

    CIE XYZ Net: Unprocessing Images for Low-Level Computer Vision Tasks

    Authors: Mahmoud Afifi, Abdelrahman Abdelhamed, Abdullah Abuolaim, Abhijith Punnappurath, Michael S. Brown

    Abstract: Cameras currently allow access to two image states: (i) a minimally processed linear raw-RGB image state (i.e., raw sensor data) or (ii) a highly-processed nonlinear image state (e.g., sRGB). There are many computer vision tasks that work best with a linear image state, such as image deblurring and image dehazing. Unfortunately, the vast majority of images are saved in the nonlinear image state. B…

    Submitted 22 June, 2020; originally announced June 2020.

  4. arXiv:2005.04117  [pdf, other]

    cs.CV eess.IV

    NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results

    Authors: Abdelrahman Abdelhamed, Mahmoud Afifi, Radu Timofte, Michael S. Brown, Yue Cao, Zhilu Zhang, Wangmeng Zuo, Xiaoling Zhang, Jiye Liu, Wendong Chen, Changyuan Wen, Meng Liu, Shuailin Lv, Yunchao Zhang, Zhihong Pan, Baopu Li, Teng Xi, Yanwen Fan, Xiyu Yu, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding, Songhyun Yu, Bumjun Park, et al. (65 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 challenge on real image denoising, with a focus on the newly introduced dataset, the proposed methods, and their results. The challenge is a new version of the previous NTIRE 2019 challenge on real image denoising, which was based on the SIDD benchmark. This challenge is based on newly collected validation and testing image datasets and is hence named SIDD+. This chall…

    Submitted 8 May, 2020; originally announced May 2020.

  5. arXiv:2001.00048  [pdf, other]

    cs.RO

    MIR-Vehicle: Cost-Effective Research Platform for Autonomous Vehicle Applications

    Authors: Ahmed Abdelhamed, Balakrishna Yadav Peddagolla, Girma Tewolde, Jaerock Kwon

    Abstract: This paper illustrates the MIR (Mobile Intelligent Robotics) Vehicle: a feasible option for transforming an electric ride-on-car into a modular Graphics Processing Unit (GPU) powered autonomous platform capable of supporting the testing and deployment of various intelligent autonomous vehicle algorithms. To use a platform for research, two components must be provided: perception and…

    Submitted 31 December, 2019; originally announced January 2020.

    Comments: 20 pages, 16 figures

  6. arXiv:1908.08453  [pdf, other]

    cs.CV cs.LG eess.IV

    Noise Flow: Noise Modeling with Conditional Normalizing Flows

    Authors: Abdelrahman Abdelhamed, Marcus A. Brubaker, Michael S. Brown

    Abstract: Modeling and synthesizing image noise is an important aspect in many computer vision applications. The long-standing additive white Gaussian and heteroscedastic (signal-dependent) noise models widely used in the literature provide only a coarse approximation of real sensor noise. This paper introduces Noise Flow, a powerful and accurate noise model based on recent normalizing flow architectures. N…

    Submitted 22 August, 2019; originally announced August 2019.

  7. arXiv:1706.04277  [pdf, other]

    cs.CV

    AFIF4: Deep Gender Classification based on AdaBoost-based Fusion of Isolated Facial Features and Foggy Faces

    Authors: Mahmoud Afifi, Abdelrahman Abdelhamed

    Abstract: Gender classification aims at recognizing a person's gender. Despite the high accuracy achieved by state-of-the-art methods for this task, there is still room for improvement in generalized and unrestricted datasets. In this paper, we advocate a new strategy inspired by the behavior of humans in gender recognition. Instead of dealing with the face image as a sole feature, we rely on the combinatio…

    Submitted 17 November, 2017; v1 submitted 13 June, 2017; originally announced June 2017.

    Comments: 26 pages, 7 figures, 7 tables