Skip to main content

Showing 1–38 of 38 results for author: Khan, S H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.12986  [pdf

    eess.IV cs.AI cs.CV

    A Novel Feature Map Enhancement Technique Integrating Residual CNN and Transformer for Alzheimer Diseases Diagnosis

    Authors: Saddam Hussain Khan

    Abstract: Alzheimer diseases (ADs) involves cognitive decline and abnormal brain protein accumulation, necessitating timely diagnosis for effective treatment. Therefore, CAD systems leveraging deep learning advancements have demonstrated success in AD detection but pose computational intricacies and the dataset minor contrast, structural, and texture variations. In this regard, a novel hybrid FME-Residual-H… ▽ More

    Submitted 25 May, 2024; v1 submitted 30 March, 2024; originally announced May 2024.

    Comments: 28 Pages, 11 Figures, 3 Tables

  2. arXiv:2401.11621  [pdf

    q-fin.ST cs.CE cs.LG

    A Novel Decision Ensemble Framework: Customized Attention-BiLSTM and XGBoost for Speculative Stock Price Forecasting

    Authors: Riaz Ud Din, Salman Ahmed, Saddam Hussain Khan

    Abstract: Forecasting speculative stock prices is essential for effective investment risk management that drives the need for the development of innovative algorithms. However, the speculative nature, volatility, and complex sequential dependencies within financial markets present inherent challenges which necessitate advanced techniques. This paper proposes a novel framework, CAB-XDE (customized attention… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 30 pages, 16 Figures, 4 Tables

  3. arXiv:2312.00634  [pdf

    eess.IV cs.CV

    A Recent Survey of Vision Transformers for Medical Image Segmentation

    Authors: Asifullah Khan, Zunaira Rauf, Abdul Rehman Khan, Saima Rathore, Saddam Hussain Khan, Najmus Saher Shah, Umair Farooq, Hifsa Asif, Aqsa Asif, Umme Zahoora, Rafi Ullah Khalil, Suleman Qamar, Umme Hani Asif, Faiza Babar Khan, Abdul Majid, Jeonghwan Gwak

    Abstract: Medical image segmentation plays a crucial role in various healthcare applications, enabling accurate diagnosis, treatment planning, and disease monitoring. Traditionally, convolutional neural networks (CNNs) dominated this domain, excelling at local feature extraction. However, their limitations in capturing long-range dependencies across image regions pose challenges for segmenting complex, inte… ▽ More

    Submitted 18 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

  4. arXiv:2311.10754  [pdf

    eess.IV cs.CV

    A Recent Survey of the Advancements in Deep Learning Techniques for Monkeypox Disease Detection

    Authors: Saddam Hussain Khan, Rashid Iqbal, Saeeda Naz

    Abstract: Monkeypox (MPox) is a zoonotic infectious disease induced by the MPox Virus, part of the poxviridae orthopoxvirus group initially discovered in Africa and gained global attention in mid-2022 with cases reported outside endemic areas. Symptoms include headaches, chills, fever, smallpox, measles, and chickenpox-like skin manifestations and the WHO officially announced MPox as a global public health… ▽ More

    Submitted 23 November, 2023; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: 53 pages, 16 figures, 7 tables

  5. arXiv:2310.10935  [pdf, other

    cs.CL cs.LG

    Intent Detection and Slot Filling for Home Assistants: Dataset and Analysis for Bangla and Sylheti

    Authors: Fardin Ahsan Sakib, A H M Rezaul Karim, Saadat Hasan Khan, Md Mushfiqur Rahman

    Abstract: As voice assistants cement their place in our technologically advanced society, there remains a need to cater to the diverse linguistic landscape, including colloquial forms of low-resource languages. Our study introduces the first-ever comprehensive dataset for intent detection and slot filling in formal Bangla, colloquial Bangla, and Sylheti languages, totaling 984 samples across 10 unique inten… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted at the First Workshop on Bangla Language Processing, 2023

  6. arXiv:2307.08260  [pdf, other

    cs.SE cs.CL

    Extending the Frontier of ChatGPT: Code Generation and Debugging

    Authors: Fardin Ahsan Sakib, Saadat Hasan Khan, A. H. M. Rezaul Karim

    Abstract: Large-scale language models (LLMs) have emerged as a groundbreaking innovation in the realm of question-answering and conversational agents. These models, leveraging different deep learning architectures such as Transformers, are trained on vast corpora to predict sentences based on given queries. Among these LLMs, ChatGPT, developed by OpenAI, has ushered in a new era by utilizing artificial inte… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  7. arXiv:2302.02619  [pdf

    eess.IV cs.CV cs.LG

    COVID-19 Infection Analysis Framework using Novel Boosted CNNs and Radiological Images

    Authors: Saddam Hussain Khan

    Abstract: COVID-19 is a new pathogen that first appeared in the human population at the end of 2019, and it can lead to novel variants of pneumonia after infection. COVID-19 is a rapidly spreading infectious disease that infects humans faster. Therefore, efficient diagnostic systems may accurately identify infected patients and thus help control their spread. In this regard, a new two-stage analysis framewo… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: 26 Pages, 11 Figures, 6 Tables. arXiv admin note: text overlap with arXiv:2209.10963

  8. arXiv:2212.08008  [pdf

    cs.CV cs.AI

    A New Deep Boosted CNN and Ensemble Learning based IoT Malware Detection

    Authors: Saddam Hussain Khan, Wasi Ullah

    Abstract: Security issues are threatened in various types of networks, especially in the Internet of Things (IoT) environment that requires early detection. IoT is the network of real-time devices like home automation systems and can be controlled by open-source android devices, which can be an open ground for attackers. Attackers can access the network credentials, initiate a different kind of security bre… ▽ More

    Submitted 15 January, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 20 pages, 10 figures, 6 tables; Corresponding [email protected]

  9. arXiv:2212.02477  [pdf

    eess.IV cs.CV cs.LG

    Malaria Parasitic Detection using a New Deep Boosted and Ensemble Learning Framework

    Authors: Saddam Hussain Khan, Tahani Jaser Alahmadi

    Abstract: Malaria is a potentially fatal plasmodium parasite injected by female anopheles mosquitoes that infect red blood cells and millions worldwide yearly. However, specialists' manual screening in clinical practice is laborious and prone to error. Therefore, a novel Deep Boosted and Ensemble Learning (DBEL) framework, comprising the stacking of new Boosted-BR-STM convolutional neural networks (CNN) and… ▽ More

    Submitted 19 March, 2024; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 26 pages, 10 figures, 9 Tables

  10. arXiv:2211.16571  [pdf

    eess.IV cs.CV cs.LG

    Brain Tumor MRI Classification using a Novel Deep Residual and Regional CNN

    Authors: Mirza Mumtaz Zahoor, Saddam Hussain Khan

    Abstract: Brain tumor classification is crucial for clinical analysis and an effective treatment plan to cure patients. Deep learning models help radiologists to accurately and efficiently analyze tumors without manual intervention. However, brain tumor analysis is challenging because of its complex structure, texture, size, location, and appearance. Therefore, a novel deep residual and regional-based Res-B… ▽ More

    Submitted 10 December, 2022; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 21 pages, 11 figures, 4 tables

  11. arXiv:2209.10963  [pdf

    eess.IV cs.CV

    COVID-19 Detection and Analysis From Lung CT Images using Novel Channel Boosted CNNs

    Authors: Saddam Hussain Khan

    Abstract: In December 2019, the global pandemic COVID-19 in Wuhan, China, affected human life and the worldwide economy. Therefore, an efficient diagnostic system is required to control its spread. However, the automatic diagnostic system poses challenges with a limited amount of labeled data, minor contrast variation, and high structural similarity between infection and background. In this regard, a new tw… ▽ More

    Submitted 26 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: 13 Figures, 6 Tables, 27 Pages

  12. A Survey of Deep Learning Techniques for the Analysis of COVID-19 and their usability for Detecting Omicron

    Authors: Asifullah Khan, Saddam Hussain Khan, Mahrukh Saif, Asiya Batool, Anabia Sohail, Muhammad Waleed Khan

    Abstract: The Coronavirus (COVID-19) outbreak in December 2019 has become an ongoing threat to humans worldwide, creating a health crisis that infected millions of lives, as well as devastating the global economy. Deep learning (DL) techniques have proved helpful in analysis and delineation of infectious regions in radiological images in a timely manner. This paper makes an in-depth survey of DL techniques… ▽ More

    Submitted 4 April, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

    Comments: Pages: 44, Figures: 7, Tables: 14

  13. Autonomous Drone Swarm Navigation and Multi-target Tracking in 3D Environments with Dynamic Obstacles

    Authors: Suleman Qamar, Saddam Hussain Khan, Muhammad Arif Arshad, Maryam Qamar, Asifullah Khan

    Abstract: Autonomous modeling of artificial swarms is necessary because manual creation is a time intensive and complicated procedure which makes it impractical. An autonomous approach employing deep reinforcement learning is presented in this study for swarm navigation. In this approach, complex 3D environments with static and dynamic obstacles and resistive forces (like linear drag, angular drag, and grav… ▽ More

    Submitted 13 February, 2022; originally announced February 2022.

    Comments: Pages: 19, Figures: 17, Tables: 8

  14. arXiv:2202.04121  [pdf

    cs.CR cs.AI

    IoT Malware Detection Architecture using a Novel Channel Boosted and Squeezed CNN

    Authors: Muhammad Asam, Saddam Hussain Khan, Tauseef Jamal, Asifullah Khan

    Abstract: Interaction between devices, people, and the Internet has given birth to a new digital communication model, the Internet of Things (IoT). The seamless network of these smart devices is the core of this IoT model. However, on the other hand, integrating smart devices to constitute a network introduces many security challenges. These connected devices have created a security blind spot, where cyberc… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  15. arXiv:2201.05373  [pdf

    eess.IV cs.CV cs.LG

    A New Deep Hybrid Boosted and Ensemble Learning-based Brain Tumor Analysis using MRI

    Authors: Mirza Mumtaz Zahoor, Shahzad Ahmad Qureshi, Saddam Hussain Khan, Asifullah Khan

    Abstract: Brain tumors analysis is important in timely diagnosis and effective treatment to cure patients. Tumor analysis is challenging because of tumor morphology like size, location, texture, and heteromorphic appearance in the medical images. In this regard, a novel two-phase deep learning-based framework is proposed to detect and categorize brain tumors in magnetic resonance images (MRIs). In the first… ▽ More

    Submitted 11 February, 2022; v1 submitted 14 January, 2022; originally announced January 2022.

    Comments: 26 pages, 9 figures, 8 tables

  16. arXiv:2108.11720  [pdf

    eess.IV cs.CV

    Segmentation of Shoulder Muscle MRI Using a New Region and Edge based Deep Auto-Encoder

    Authors: Saddam Hussain Khan, Asifullah Khan, Yeon Soo Lee, Mehdi Hassan, Woong Kyo jeong

    Abstract: Automatic segmentation of shoulder muscle MRI is challenging due to the high variation in muscle size, shape, texture, and spatial position of tears. Manual segmentation of tear and muscle portion is hard, time-consuming, and subjective to pathological expertise. This work proposes a new Region and Edge-based Deep Auto-Encoder (RE-DAE) for shoulder muscle MRI segmentation. The proposed RE-DAE harm… ▽ More

    Submitted 26 August, 2021; originally announced August 2021.

    Comments: Pages: 23, 8 Figures, 2 Tables

  17. arXiv:2107.04008  [pdf

    cs.CR cs.CV cs.LG

    Malware Classification Using Deep Boosted Learning

    Authors: Muhammad Asam, Saddam Hussain Khan, Tauseef Jamal, Umme Zahoora, Asifullah Khan

    Abstract: Malicious activities in cyberspace have gone further than simply hacking machines and spreading viruses. It has become a challenge for a nations survival and hence has evolved to cyber warfare. Malware is a key component of cyber-crime, and its analysis is the first line of defence against attack. This work proposes a novel deep boosted hybrid learning-based malware classification framework and na… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  18. toon2real: Translating Cartoon Images to Realistic Images

    Authors: K. M. Arefeen Sultan, Mohammad Imrul Jubair, MD. Nahidul Islam, Sayed Hossain Khan

    Abstract: In terms of Image-to-image translation, Generative Adversarial Networks (GANs) has achieved great success even when it is used in the unsupervised dataset. In this work, we aim to translate cartoon images to photo-realistic images using GAN. We apply several state-of-the-art models to perform this task; however, they fail to perform good quality translations. We observe that the shallow difference… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: Accepted as a short paper at ICTAI 2020

  19. arXiv:2012.05073  [pdf

    eess.IV cs.CV

    COVID-19 Detection in Chest X-Ray Images using a New Channel Boosted CNN

    Authors: Saddam Hussain Khan, Anabia Sohail, Asifullah Khan

    Abstract: COVID-19 is a highly contagious respiratory infection that has affected a large population across the world and continues with its devastating consequences. It is imperative to detect COVID-19 at the earliest to limit the span of infection. In this work, a new classification technique CB-STM-RENet based on deep Convolutional Neural Network (CNN) and Channel Boosting is proposed for the screening o… ▽ More

    Submitted 17 December, 2020; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Pages: 26 Tables: 3 Figures: 10 Equations: 8

  20. arXiv:2009.08864  [pdf

    eess.IV cs.CV

    Classification and Region Analysis of COVID-19 Infection using Lung CT Images and Deep Convolutional Neural Networks

    Authors: Saddam Hussain Khan, Anabia Sohail, Asifullah Khan, Yeon Soo Lee

    Abstract: COVID-19 is a global health problem. Consequently, early detection and analysis of the infection patterns are crucial for controlling infection spread as well as devising a treatment plan. This work proposes a two-stage deep Convolutional Neural Networks (CNNs) based framework for delineation of COVID-19 infected regions in Lung CT images. In the first stage, initially, COVID-19 specific CT image… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

    Comments: Pages: 32, Tables: 6, Figures: 14

  21. arXiv:2001.07059  [pdf, other

    cs.CV cs.CC

    Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models

    Authors: Moshiur R. Farazi, Salman H. Khan, Nick Barnes

    Abstract: Visual Question Answering (VQA) has emerged as a Visual Turing Test to validate the reasoning ability of AI agents. The pivot to existing VQA models is the joint embedding that is learned by combining the visual features from an image and the semantic features from a given question. Consequently, a large body of literature has focused on develo** complex joint embedding strategies coupled with v… ▽ More

    Submitted 20 January, 2020; originally announced January 2020.

  22. arXiv:1912.04251  [pdf

    cs.CV cs.LG

    Cascaded Structure Tensor Framework for Robust Identification of Heavily Occluded Baggage Items from Multi-Vendor X-ray Scans

    Authors: Taimur Hassan, Salman H. Khan, Samet Akcay, Mohammed Bennamoun, Naoufel Werghi

    Abstract: In the last two decades, luggage scanning has globally become one of the prime aviation security concerns. Manual screening of the baggage items is a cumbersome, subjective and inefficient process. Hence, many researchers have developed Xray imagery-based autonomous systems to address these shortcomings. However, to the best of our knowledge, there is no framework, up to now, that can recognize he… ▽ More

    Submitted 21 January, 2020; v1 submitted 9 December, 2019; originally announced December 2019.

  23. Question-Agnostic Attention for Visual Question Answering

    Authors: Moshiur R Farazi, Salman H Khan, Nick Barnes

    Abstract: Visual Question Answering (VQA) models employ attention mechanisms to discover image locations that are most relevant for answering a specific question. For this purpose, several multimodal fusion strategies have been proposed, ranging from relatively simple operations (e.g., linear sum) to more complex ones (e.g., Block). The resulting multimodal representations define an intermediate feature spa… ▽ More

    Submitted 5 September, 2020; v1 submitted 8 August, 2019; originally announced August 2019.

    Comments: To appear in the proceedings of International Conference on Pattern Recognition (ICPR) 2020

  24. arXiv:1906.03650  [pdf, other

    cs.CV

    Unsupervised Primitive Discovery for Improved 3D Generative Modeling

    Authors: Salman H. Khan, Yulan Guo, Munawar Hayat, Nick Barnes

    Abstract: 3D shape generation is a challenging problem due to the high-dimensional output space and complex part configurations of real-world objects. As a result, existing algorithms experience difficulties in accurate generative modeling of 3D shapes. Here, we propose a novel factorized generative model for 3D shape generation that sequentially transitions from coarse to fine scale shape generation. To th… ▽ More

    Submitted 9 June, 2019; originally announced June 2019.

    Comments: CVPR 2019

  25. arXiv:1905.11736  [pdf, other

    cs.CV

    Cross-Domain Transferability of Adversarial Perturbations

    Authors: Muzammal Naseer, Salman H. Khan, Harris Khan, Fahad Shahbaz Khan, Fatih Porikli

    Abstract: Adversarial examples reveal the blind spots of deep neural networks (DNNs) and represent a major concern for security-critical applications. The transferability of adversarial examples makes real-world attacks possible in black-box settings, where the attacker is forbidden to access the internal parameters of the model. The underlying assumption in most adversary generation methods, whether learni… ▽ More

    Submitted 14 October, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Accepted at NeurIPS 2019 (Camera Ready). Source Code along with pretrained adversarial generators is available at https://github.com/Muzammal-Naseer/Cross-domain-perturbations

  26. arXiv:1901.06091  [pdf

    cs.LG stat.ML

    Transfer Learning and Meta Classification Based Deep Churn Prediction System for Telecom Industry

    Authors: Uzair Ahmed, Asifullah Khan, Saddam Hussain Khan, Abdul Basit, Irfan Ul Haq, Yeon Soo Lee

    Abstract: A churn prediction system guides telecom service providers to reduce revenue loss. However, the development of a churn prediction system for a telecom industry is a challenging task, mainly due to the large size of the data, high dimensional features, and imbalanced distribution of the data. In this paper, we present a solution to the inherent problems of churn prediction, using the concept of Tra… ▽ More

    Submitted 5 March, 2019; v1 submitted 18 January, 2019; originally announced January 2019.

    Comments: Number of Pages: 10 Number of Figures:4 Number of Tables: 4

  27. Image Super-Resolution as a Defense Against Adversarial Attacks

    Authors: Aamir Mustafa, Salman H. Khan, Munawar Hayat, Jianbing Shen, Ling Shao

    Abstract: Convolutional Neural Networks have achieved significant success across multiple computer vision tasks. However, they are vulnerable to carefully crafted, human-imperceptible adversarial noise patterns which constrain their deployment in critical security-sensitive systems. This paper proposes a computationally efficient image enhancement approach that provides a strong defense mechanism to effecti… ▽ More

    Submitted 2 September, 2019; v1 submitted 7 January, 2019; originally announced January 2019.

    Comments: Published in IEEE Transactions in Image Processing

  28. arXiv:1811.12772  [pdf, other

    cs.CV

    From Known to the Unknown: Transferring Knowledge to Answer Questions about Novel Visual and Semantic Concepts

    Authors: Moshiur R Farazi, Salman H Khan, Nick Barnes

    Abstract: Current Visual Question Answering (VQA) systems can answer intelligent questions about `Known' visual content. However, their performance drops significantly when questions about visually and linguistically `Unknown' concepts are presented during inference (`Open-world' scenario). A practical VQA system should be able to deal with novel concepts in real world settings. To address this problem, we… ▽ More

    Submitted 30 November, 2018; originally announced November 2018.

  29. arXiv:1811.09020  [pdf, other

    cs.CV

    Task-generalizable Adversarial Attack based on Perceptual Metric

    Authors: Muzammal Naseer, Salman H. Khan, Shafin Rahman, Fatih Porikli

    Abstract: Deep neural networks (DNNs) can be easily fooled by adding human imperceptible perturbations to the images. These perturbed images are known as `adversarial examples' and pose a serious threat to security and safety critical systems. A litmus test for the strength of adversarial examples is their transferability across different DNN models in a black box setting (i.e. when the target model's archi… ▽ More

    Submitted 26 March, 2019; v1 submitted 21 November, 2018; originally announced November 2018.

  30. arXiv:1807.01216  [pdf, other

    cs.CV

    Local Gradients Smoothing: Defense against localized adversarial attacks

    Authors: Muzammal Naseer, Salman H. Khan, Fatih Porikli

    Abstract: Deep neural networks (DNNs) have shown vulnerability to adversarial attacks, i.e., carefully perturbed inputs designed to mislead the network at inference time. Recently introduced localized attacks, Localized and Visible Adversarial Noise (LaVAN) and Adversarial patch, pose a new challenge to deep learning security by adding adversarial noise only within a specific region without affecting the sa… ▽ More

    Submitted 19 November, 2018; v1 submitted 3 July, 2018; originally announced July 2018.

    Comments: Accepted At WACV-2019

  31. arXiv:1805.04247  [pdf, other

    cs.CV cs.AI cs.CL

    Reciprocal Attention Fusion for Visual Question Answering

    Authors: Moshiur R Farazi, Salman H Khan

    Abstract: Existing attention mechanisms either attend to local image grid or object level features for Visual Question Answering (VQA). Motivated by the observation that questions can relate to both object instances and their parts, we propose a novel attention mechanism that jointly considers reciprocal relationships between the two levels of visual details. The bottom-up attention thus generated is furthe… ▽ More

    Submitted 22 July, 2018; v1 submitted 11 May, 2018; originally announced May 2018.

    Comments: To appear in the British Machine Vision Conference (BMVC), September 2018

    Journal ref: Proceedings of the British Machine Vision Conference (250) 2018

  32. arXiv:1804.10323  [pdf, other

    cs.CV

    Adversarial Training of Variational Auto-encoders for High Fidelity Image Generation

    Authors: Salman H. Khan, Munawar Hayat, Nick Barnes

    Abstract: Variational auto-encoders (VAEs) provide an attractive solution to image generation problem. However, they tend to produce blurred and over-smoothed images due to their dependence on pixel-wise reconstruction loss. This paper introduces a new approach to alleviate this problem in the VAE based generative models. Our model simultaneously learns to match the data, reconstruction loss and the latent… ▽ More

    Submitted 26 April, 2018; originally announced April 2018.

  33. Indoor Scene Understanding in 2.5/3D for Autonomous Agents: A Survey

    Authors: Muzammal Naseer, Salman H Khan, Fatih Porikli

    Abstract: With the availability of low-cost and compact 2.5/3D visual sensing devices, computer vision community is experiencing a growing interest in visual scene understanding of indoor environments. This survey paper provides a comprehensive background to this research topic. We begin with a historical perspective, followed by popular 3D data representations and a comparative analysis of available datase… ▽ More

    Submitted 10 January, 2019; v1 submitted 8 March, 2018; originally announced March 2018.

    Comments: IEEE Access

    Journal ref: Year: DECEMBER 2019, Volume: 7, Issue:1, Page(s): 1859-1887,

  34. A Unified approach for Conventional Zero-shot, Generalized Zero-shot and Few-shot Learning

    Authors: Shafin Rahman, Salman H. Khan, Fatih Porikli

    Abstract: Prevalent techniques in zero-shot learning do not generalize well to other related problem scenarios. Here, we present a unified approach for conventional zero-shot, generalized zero-shot and few-shot learning problems. Our approach is based on a novel Class Adapting Principal Directions (CAPD) concept that allows multiple embeddings of image features into a semantic space. Given an image, our met… ▽ More

    Submitted 26 October, 2017; v1 submitted 26 June, 2017; originally announced June 2017.

  35. arXiv:1606.02009  [pdf, other

    cs.CV

    Learning deep structured network for weakly supervised change detection

    Authors: Salman H Khan, Xuming He, Fatih Porikli, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri

    Abstract: Conventional change detection methods require a large number of images to learn background models or depend on tedious pixel-level labeling by humans. In this paper, we present a weakly supervised approach that needs only image-level labels to simultaneously detect and localize changes in a pair of images. To this end, we employ a deep neural network with DAG topology to learn patterns of change f… ▽ More

    Submitted 22 May, 2017; v1 submitted 6 June, 2016; originally announced June 2016.

  36. arXiv:1508.03422  [pdf, other

    cs.CV

    Cost Sensitive Learning of Deep Feature Representations from Imbalanced Data

    Authors: Salman H. Khan, Munawar Hayat, Mohammed Bennamoun, Ferdous Sohel, Roberto Togneri

    Abstract: Class imbalance is a common problem in the case of real-world object detection and classification tasks. Data of some classes is abundant making them an over-represented majority, and data of other classes is scarce, making them an under-represented minority. This imbalance makes it challenging for a classifier to appropriately learn the discriminating boundaries of the majority and minority class… ▽ More

    Submitted 23 March, 2017; v1 submitted 14 August, 2015; originally announced August 2015.

  37. A Spatial Layout and Scale Invariant Feature Representation for Indoor Scene Classification

    Authors: Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Senjian An

    Abstract: Unlike standard object classification, where the image to be classified contains one or multiple instances of the same object, indoor scene classification is quite different since the image consists of multiple distinct objects. Further, these objects can be of varying sizes and are present across numerous spatial locations in different layouts. For automatic indoor scene categorization, large sca… ▽ More

    Submitted 14 August, 2015; v1 submitted 17 June, 2015; originally announced June 2015.

  38. A Discriminative Representation of Convolutional Features for Indoor Scene Recognition

    Authors: Salman H. Khan, Munawar Hayat, Mohammed Bennamoun, Roberto Togneri, Ferdous Sohel

    Abstract: Indoor scene recognition is a multi-faceted and challenging problem due to the diverse intra-class variations and the confusing inter-class similarities. This paper presents a novel approach which exploits rich mid-level convolutional features to categorize indoor scenes. Traditionally used convolutional features preserve the global spatial structure, which is a desirable property for general obje… ▽ More

    Submitted 16 June, 2015; originally announced June 2015.