Showing 1–7 of 7 results for author: El-Gaaly, T

  1. arXiv:2004.14491  [pdf, other]

    cs.CV cs.LG cs.MM eess.IV

    Detecting Deep-Fake Videos from Appearance and Behavior

    Authors: Shruti Agarwal, Tarek El-Gaaly, Hany Farid, Ser-Nam Lim

    Abstract: Synthetically-generated audios and videos -- so-called deep fakes -- continue to capture the imagination of the computer-graphics and computer-vision communities. At the same time, the democratization of access to technology that can create sophisticated manipulated video of anybody saying anything continues to be of concern because of its power to disrupt democratic elections, commit small to lar…

    Submitted 29 April, 2020; originally announced April 2020.

    Journal ref: IEEE Workshop on Image Forensics and Security, 2020

  2. arXiv:1904.08159  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    3D Object Recognition with Ensemble Learning --- A Study of Point Cloud-Based Deep Learning Models

    Authors: Daniel Koguciuk, Łukasz Chechliński, Tarek El-Gaaly

    Abstract: In this study, we present an analysis of model-based ensemble learning for 3D point-cloud object classification and detection. An ensemble of multiple model instances is known to outperform a single model instance, but there is little study of the topic of ensemble learning for 3D point clouds. First, an ensemble of multiple model instances trained on the same part of the $\textit{ModelNet40}$ dat…

    Submitted 22 May, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

  3. arXiv:1806.06778  [pdf, other]

    cs.CV

    BinGAN: Learning Compact Binary Descriptors with a Regularized GAN

    Authors: Maciej Zieba, Piotr Semberecki, Tarek El-Gaaly, Tomasz Trzcinski

    Abstract: In this paper, we propose a novel regularization method for Generative Adversarial Networks, which allows the model to learn discriminative yet compact binary representations of image patches (image descriptors). We employ the dimensionality reduction that takes place in the intermediate layers of the discriminator network and train binarized low-dimensional representation of the penultimate layer…

    Submitted 7 November, 2018; v1 submitted 18 June, 2018; originally announced June 2018.

    Comments: Paper accepted to NIPS 2018

  4. arXiv:1511.05175  [pdf, other]

    cs.CV cs.AI cs.LG

    Convolutional Models for Joint Object Categorization and Pose Estimation

    Authors: Mohamed Elhoseiny, Tarek El-Gaaly, Amr Bakry, Ahmed Elgammal

    Abstract: In the task of Object Recognition, there exists a dichotomy between the categorization of objects and estimating object pose, where the former necessitates a view-invariant representation, while the latter requires a representation capable of capturing pose information over different categories of objects. With the rise of deep architectures, the prime focus has been on object category recognition…

    Submitted 19 April, 2016; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: only for workshop presentation at ICLR

  5. arXiv:1508.01983  [pdf, other]

    cs.CV

    Digging Deep into the layers of CNNs: In Search of How CNNs Achieve View Invariance

    Authors: Amr Bakry, Mohamed Elhoseiny, Tarek El-Gaaly, Ahmed Elgammal

    Abstract: This paper is focused on studying the view-manifold structure in the feature spaces implied by the different layers of Convolutional Neural Networks (CNN). There are several questions that this paper aims to answer: Does the learned CNN representation achieve viewpoint invariance? How does it achieve viewpoint invariance? Is it achieved by collapsing the view manifolds, or separating them while pr…

    Submitted 20 June, 2016; v1 submitted 9 August, 2015; originally announced August 2015.

    Comments: This paper was accepted to the ICLR 2016 main conference

  6. arXiv:1503.06813  [pdf, other]

    cs.CV

    Factorization of View-Object Manifolds for Joint Object Recognition and Pose Estimation

    Authors: Haopeng Zhang, Tarek El-Gaaly, Ahmed Elgammal, Zhiguo Jiang

    Abstract: Due to large variations in shape, appearance, and viewing conditions, object recognition is a key precursory challenge in the fields of object manipulation and robotic/AI visual reasoning in general. Recognizing object categories, particular instances of objects and viewpoints/poses of objects are three critical subproblems robots must solve in order to accurately grasp/manipulate objects and reas…

    Submitted 12 April, 2015; v1 submitted 23 March, 2015; originally announced March 2015.

  7. arXiv:1407.3540  [pdf]

    cs.CV

    Measuring Atmospheric Scattering from Digital Images of Urban Scenery using Temporal Polarization-Based Vision

    Authors: Tarek El-Gaaly, Joshua Gluckman

    Abstract: Particulate Matter (PM) is a form of air pollution that visually degrades urban scenery and is hazardous to human health and the environment. Current monitoring devices are limited in measuring average PM over large areas. Quantifying the visual effects of haze in digital images of urban scenery and correlating these effects to PM levels is a vital step in more practically monitoring our environme…

    Submitted 14 July, 2014; originally announced July 2014.

    Comments: Masters in Computer Science Thesis