Skip to main content

Showing 1–42 of 42 results for author: Khan, M H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01440  [pdf, other

    cs.LG

    GAT-Steiner: Rectilinear Steiner Minimal Tree Prediction Using GNNs

    Authors: Bugra Onal, Eren Dogan, Muhammad Hadir Khan, Matthew R. Guthaus

    Abstract: The Rectilinear Steiner Minimum Tree (RSMT) problem is a fundamental problem in VLSI placement and routing and is known to be NP-hard. Traditional RSMT algorithms spend a significant amount of time on finding Steiner points to reduce the total wire length or use heuristics to approximate producing sub-optimal results. We show that Graph Neural Networks (GNNs) can be used to predict optimal Steiner… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Preprint for The 2024 IEEE/ACM International Conference on Computer-Aided Design (ICCAD 2024)

  2. arXiv:2405.14497  [pdf, other

    cs.CV

    Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment

    Authors: Muhammad Sohail Danish, Muhammad Haris Khan, Muhammad Akhtar Munir, M. Saquib Sarfraz, Mohsen Ali

    Abstract: In this work, we tackle the problem of domain generalization for object detection, specifically focusing on the scenario where only a single source domain is available. We propose an effective approach that involves two key steps: diversifying the source domain and aligning detections based on class prediction confidence and localization. Firstly, we demonstrate that by carefully selecting a set o… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2405.13518  [pdf, other

    cs.CV

    PerSense: Personalized Instance Segmentation in Dense Images

    Authors: Muhammad Ibraheem Siddiqui, Muhammad Umer Sheikh, Hassan Abid, Muhammad Haris Khan

    Abstract: Leveraging large-scale pre-training, vision foundational models showcase notable performance benefits. While recent years have witnessed significant advancements in segmentation algorithms, existing models still face challenges to automatically segment personalized instances in dense and crowded scenarios. The primary factor behind this limitation stems from bounding box-based detections, which ar… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Technical report of PerSense

  4. arXiv:2404.09342  [pdf, other

    cs.CV cs.SD eess.AS

    Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Salman Tahir, Rohan Kumar Das, Muhammad Zaigham Zaheer, Marta Moscati, Markus Schedl, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf

    Abstract: The advancements of technology have led to the use of multimodal systems in various real-world applications. Among them, the audio-visual systems are one of the widely used multimodal systems. In the recent years, associating face and voice of a person has gained attention due to presence of unique correlation between them. The Face-voice Association in Multilingual Environments (FAME) Challenge 2… ▽ More

    Submitted 16 April, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: ACM Multimedia Conference - Grand Challenge

  5. arXiv:2403.16194  [pdf, other

    cs.CV

    Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery

    Authors: Siddharth Tourani, Ahmed Alwheibi, Arif Mahmood, Muhammad Haris Khan

    Abstract: Unsupervised landmarks discovery (ULD) for an object category is a challenging computer vision problem. In pursuit of develo** a robust ULD framework, we explore the potential of a recent paradigm of self-supervised learning algorithms, known as diffusion models. Some recent works have shown that these models implicitly contain important correspondence cues. Towards harnessing the potential of d… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted in CVPR 2024

  6. arXiv:2403.11674  [pdf, other

    cs.CV

    Towards Generalizing to Unseen Domains with Few Labels

    Authors: Chamuditha Jayanga Galappaththige, Sanoojan Baliah, Malitha Gunawardhana, Muhammad Haris Khan

    Abstract: We approach the challenge of addressing semi-supervised domain generalization (SSDG). Specifically, our aim is to obtain a model that learns domain-generalizable features by leveraging a limited subset of labelled data alongside a substantially larger pool of unlabeled data. Existing domain generalization (DG) methods which are unable to exploit unlabeled data perform poorly compared to semi-super… ▽ More

    Submitted 7 May, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024

  7. arXiv:2403.02782  [pdf, other

    cs.CV

    Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos

    Authors: Kumaranage Ravindu Yasas Nagasinghe, Honglu Zhou, Malitha Gunawardhana, Martin Renqiang Min, Daniel Harari, Muhammad Haris Khan

    Abstract: In this paper, we explore the capability of an agent to construct a logical sequence of action steps, thereby assembling a strategic procedural plan. This plan is crucial for navigating from an initial visual observation to a target visual outcome, as depicted in real-life instructional videos. Existing works have attained partial success by extensively leveraging various sources of information av… ▽ More

    Submitted 15 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 8 pages, 6 figures, (supplementary material: 9 pages, 5 figures), accepted to CVPR 2024

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024 , Pages 18816-18826

  8. arXiv:2401.13965  [pdf, other

    cs.CV

    Improving Pseudo-labelling and Enhancing Robustness for Semi-Supervised Domain Generalization

    Authors: Adnan Khan, Mai A. Shaaban, Muhammad Haris Khan

    Abstract: Beyond attaining domain generalization (DG), visual recognition models should also be data-efficient during learning by leveraging limited labels. We study the problem of Semi-Supervised Domain Generalization (SSDG) which is crucial for real-world applications like automated healthcare. SSDG requires learning a cross-domain generalizable model when the given training data is only partially labelle… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  9. arXiv:2401.13785  [pdf, other

    cs.CV

    Unified Spatio-Temporal Tri-Perspective View Representation for 3D Semantic Occupancy Prediction

    Authors: Sathira Silva, Savindu Bhashitha Wannigama, Gihan Jayatilaka, Muhammad Haris Khan, Roshan Ragel

    Abstract: Holistic understanding and reasoning in 3D scenes play a vital role in the success of autonomous driving systems. The evolution of 3D semantic occupancy prediction as a pretraining task for autonomous driving and robotic downstream tasks capture finer 3D details compared to methods like 3D detection. Existing approaches predominantly focus on spatial cues such as tri-perspective view embeddings (T… ▽ More

    Submitted 4 April, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  10. arXiv:2311.04815  [pdf, other

    cs.CV

    Domain Adaptive Object Detection via Balancing Between Self-Training and Adversarial Learning

    Authors: Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali

    Abstract: Deep learning based object detectors struggle generalizing to a new target domain bearing significant variations in object and background. Most current methods align domains by using image or instance-level adversarial feature alignment. This often suffers due to unwanted background and lacks class-specific alignment. A straightforward approach to promote class-level alignment is to use high confi… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: Accepted for publication in IEEE Transactions on Pattern Analysis and Machine Intelligence (Volume: 45, Issue: 12, December 2023); Extended version of our conference paper, arXiv link: arXiv:2110.00249

  11. arXiv:2311.03570  [pdf, other

    cs.CV

    Cal-DETR: Calibrated Detection Transformer

    Authors: Muhammad Akhtar Munir, Salman Khan, Muhammad Haris Khan, Mohsen Ali, Fahad Shahbaz Khan

    Abstract: Albeit revealing impressive predictive performance for several computer vision tasks, deep neural networks (DNNs) are prone to making overconfident predictions. This limits the adoption and wider utilization of DNNs in many safety-critical applications. There have been recent efforts toward calibrating DNNs, however, almost all of them focus on the classification task. Surprisingly, very little at… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023

  12. arXiv:2310.17255  [pdf, other

    cs.CV

    Generalizing to Unseen Domains in Diabetic Retinopathy Classification

    Authors: Chamuditha Jayanga Galappaththige, Gayal Kuruppu, Muhammad Haris Khan

    Abstract: Diabetic retinopathy (DR) is caused by long-standing diabetes and is among the fifth leading cause for visual impairments. The process of early diagnosis and treatments could be helpful in curing the disease, however, the detection procedure is rather challenging and mostly tedious. Therefore, automated diabetic retinopathy classification using deep learning techniques has gained interest in the m… ▽ More

    Submitted 27 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted at WACV 2024

  13. arXiv:2309.11301  [pdf, other

    cs.CV

    Generalizing Across Domains in Diabetic Retinopathy via Variational Autoencoders

    Authors: Sharon Chokuwa, Muhammad H. Khan

    Abstract: Domain generalization for Diabetic Retinopathy (DR) classification allows a model to adeptly classify retinal images from previously unseen domains with various imaging conditions and patient demographics, thereby enhancing its applicability in a wide range of clinical environments. In this study, we explore the inherent capacity of variational autoencoders to disentangle the latent space of fundu… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Accepted at MICCAI 2023 1st International Workshop on Foundation Models for General Medical AI (MedAGI)

  14. arXiv:2309.10518  [pdf, other

    cs.CV

    Unsupervised Landmark Discovery Using Consistency Guided Bottleneck

    Authors: Mamona Awan, Muhammad Haris Khan, Sanoojan Baliah, Muhammad Ahmad Waseem, Salman Khan, Fahad Shahbaz Khan, Arif Mahmood

    Abstract: We study a challenging problem of unsupervised discovery of object landmarks. Many recent methods rely on bottlenecks to generate 2D Gaussian heatmaps however, these are limited in generating informed heatmaps while training, presumably due to the lack of effective structural cues. Also, it is assumed that all predicted landmarks are semantically relevant despite having no ground truth supervision… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted ORAL at BMVC 2023 ; Code: https://github.com/MamonaAwan/CGB_ULD

    ACM Class: I.4

  15. arXiv:2309.02636  [pdf, other

    cs.CV cs.LG

    Multiclass Alignment of Confidence and Certainty for Network Calibration

    Authors: Vinith Kugathasan, Muhammad Haris Khan

    Abstract: Deep neural networks (DNNs) have made great strides in pushing the state-of-the-art in several challenging domains. Recent studies reveal that they are prone to making overconfident predictions. This greatly reduces the overall trust in model predictions, especially in safety-critical applications. Early work in improving model calibration employs post-processing techniques which rely on limited p… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted at GCPR 2023

  16. arXiv:2308.14212  [pdf, other

    cs.CV

    Exploring the Transfer Learning Capabilities of CLIP in Domain Generalization for Diabetic Retinopathy

    Authors: Sanoojan Baliah, Fadillah A. Maani, Santosh Sanjeev, Muhammad Haris Khan

    Abstract: Diabetic Retinopathy (DR), a leading cause of vision impairment, requires early detection and treatment. Develo** robust AI models for DR classification holds substantial potential, but a key challenge is ensuring their generalization in unfamiliar domains with varying data distributions. To address this, our paper investigates cross-domain generalization, also known as domain generalization (DG… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

  17. arXiv:2307.08930  [pdf, other

    cs.CV cs.AI

    Unsupervised Deep Graph Matching Based on Cycle Consistency

    Authors: Siddharth Tourani, Carsten Rother, Muhammad Haris Khan, Bogdan Savchynskyy

    Abstract: We contribute to the sparsely populated area of unsupervised deep graph matching with application to keypoint matching in images. Contrary to the standard \emph{supervised} approach, our method does not require ground truth correspondences between keypoint pairs. Instead, it is self-supervised by enforcing consistency of matchings between images of the same object category. As the matching and the… ▽ More

    Submitted 11 February, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: 12 pages, 5 figures, 3 papers

  18. arXiv:2306.08271  [pdf, other

    cs.CV

    Multiclass Confidence and Localization Calibration for Object Detection

    Authors: Bimsara Pathiraja, Malitha Gunawardhana, Muhammad Haris Khan

    Abstract: Albeit achieving high predictive accuracy across many challenging computer vision problems, recent studies suggest that deep neural networks (DNNs) tend to make overconfident predictions, rendering them poorly calibrated. Most of the existing attempts for improving DNN calibration are limited to classification tasks and restricted to calibrating in-domain predictions. Surprisingly, very little to… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: Project page - https://bimsarapathiraja.github.io/mccl-project-page/

  19. arXiv:2303.14404  [pdf, other

    cs.CV

    Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection

    Authors: Muhammad Akhtar Munir, Muhammad Haris Khan, Salman Khan, Fahad Shahbaz Khan

    Abstract: Deep neural networks (DNNs) have enabled astounding progress in several vision-based problems. Despite showing high predictive accuracy, recently, several works have revealed that they tend to provide overconfident predictions and thus are poorly calibrated. The majority of the works addressing the miscalibration of DNNs fall under the scope of classification and consider only in-domain prediction… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  20. arXiv:2303.06129  [pdf, other

    cs.CV

    Single-branch Network for Multimodal Training

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Muhammad Zaigham Zaheer, Karthik Nandakumar, Muhammad Haroon Yousaf, Arif Mahmood

    Abstract: With the rapid growth of social media platforms, users are sharing billions of multimedia posts containing audio, images, and text. Researchers have focused on building autonomous systems capable of processing such multimedia data to solve challenging multimodal tasks including cross-modal retrieval, matching, and verification. Existing works use separate networks to extract embeddings of each mod… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted at ICASSP 2023

  21. arXiv:2303.01954  [pdf, other

    stat.ML cs.AI cs.LG

    Synthetic Data Generator for Adaptive Interventions in Global Health

    Authors: Aditya Rastogi, Juan Francisco Garamendi, Ana Fernández del Río, Anna Guitart, Moiz Hassan Khan, Dexian Tang, África Periáñez

    Abstract: Artificial Intelligence and digital health have the potential to transform global health. However, having access to representative data to test and validate algorithms in realistic production environments is essential. We introduce HealthSyn, an open-source synthetic data generator of user behavior for testing reinforcement learning algorithms in the context of mobile health interventions. The gen… ▽ More

    Submitted 27 April, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

  22. arXiv:2212.04673  [pdf, other

    cs.CV

    MSI: Maximize Support-Set Information for Few-Shot Segmentation

    Authors: Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia

    Abstract: FSS(Few-shot segmentation) aims to segment a target class using a small number of labeled images(support set). To extract information relevant to the target class, a dominant approach in best-performing FSS methods removes background features using a support mask. We observe that this feature excision through a limiting support mask introduces an information bottleneck in several challenging FSS c… ▽ More

    Submitted 10 November, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: ICCV 2023

  23. arXiv:2209.07601  [pdf, other

    cs.CV

    Towards Improving Calibration in Object Detection Under Domain Shift

    Authors: Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali

    Abstract: With deep neural network based solution more readily being incorporated in real-world applications, it has been pressing requirement that predictions by such models, especially in safety-critical environments, be highly accurate and well-calibrated. Although some techniques addressing DNN calibration have been proposed, they are only limited to visual classification applications and in-domain pred… ▽ More

    Submitted 29 October, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: To appear in NeurIPS 2022

  24. arXiv:2208.10238  [pdf, other

    cs.CV

    Learning Branched Fusion and Orthogonal Projection for Face-Voice Association

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Sajid Javed, Muhammad Haroon Yousaf, Alessio Del Bue

    Abstract: Recent years have seen an increased interest in establishing association between faces and voices of celebrities leveraging audio-visual information from YouTube. Prior works adopt metric learning methods to learn an embedding space that is amenable for associated matching and verification tasks. Albeit showing some progress, such formulations are, however, restrictive due to dependency on distanc… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Submitted: IEEE Transactions on Multimedia. arXiv admin note: substantial text overlap with arXiv:2112.10483

  25. arXiv:2207.12392  [pdf, other

    cs.CV cs.AI cs.LG

    Self-Distilled Vision Transformer for Domain Generalization

    Authors: Maryam Sultana, Muzammal Naseer, Muhammad Haris Khan, Salman Khan, Fahad Shahbaz Khan

    Abstract: In the recent past, several domain generalization (DG) methods have been proposed, showing encouraging performance, however, almost all of them build on convolutional neural networks (CNNs). There is little to no progress on studying the DG performance of vision transformers (ViTs), which are challenging the supremacy of CNNs on standard benchmarks, often built on i.i.d assumption. This renders th… ▽ More

    Submitted 4 October, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: 23 pages, 12 figures

    Journal ref: The 16th Asian Conference on Computer Vision (ACCV 2022)

  26. arXiv:2203.13253  [pdf, other

    cs.CV

    Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer

    Authors: Omkar Thawakar, Sanath Narayan, Jiale Cao, Hisham Cholakkal, Rao Muhammad Anwer, Muhammad Haris Khan, Salman Khan, Michael Felsberg, Fahad Shahbaz Khan

    Abstract: State-of-the-art transformer-based video instance segmentation (VIS) approaches typically utilize either single-scale spatio-temporal features or per-frame multi-scale features during the attention computations. We argue that such an attention computation ignores the multi-scale spatio-temporal feature relationships that are crucial to tackle target appearance deformations in videos. To address th… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

  27. arXiv:2203.12826  [pdf, other

    cs.CV

    HM: Hybrid Masking for Few-Shot Segmentation

    Authors: Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia

    Abstract: We study few-shot semantic segmentation that aims to segment a target object from a query image when provided with a few annotated support images of the target class. Several recent methods resort to a feature masking (FM) technique to discard irrelevant feature activations which eventually facilitates the reliable prediction of segmentation mask. A fundamental limitation of FM is the inability to… ▽ More

    Submitted 24 July, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: 14 pages

    MSC Class: 68T45

  28. arXiv:2203.03962  [pdf, other

    cs.CV

    Generative Cooperative Learning for Unsupervised Video Anomaly Detection

    Authors: Muhammad Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Mattia Segu, Fisher Yu, Seung-Ik Lee

    Abstract: Video anomaly detection is well investigated in weakly-supervised and one-class classification (OCC) settings. However, unsupervised video anomaly detection methods are quite sparse, likely because anomalies are less frequent in occurrence and usually not well-defined, which when coupled with the absence of ground truth supervision, could adversely affect the performance of the learning algorithms… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted to the Conference on Computer Vision and Pattern Recognition CVPR 2022

  29. arXiv:2201.09873  [pdf, other

    eess.IV cs.CV

    Transformers in Medical Imaging: A Survey

    Authors: Fahad Shamshad, Salman Khan, Syed Waqas Zamir, Muhammad Haris Khan, Munawar Hayat, Fahad Shahbaz Khan, Huazhu Fu

    Abstract: Following unprecedented success on the natural language tasks, Transformers have been successfully applied to several computer vision problems, achieving state-of-the-art results and prompting researchers to reconsider the supremacy of convolutional neural networks (CNNs) as {de facto} operators. Capitalizing on these advances in computer vision, the medical imaging field has also witnessed growin… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 41 pages, \url{https://github.com/fahadshamshad/awesome-transformers-in-medical-imaging}

  30. arXiv:2112.10483  [pdf, other

    cs.CV

    Fusion and Orthogonal Projection for Improved Face-Voice Association

    Authors: Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, Alessio Del Bue

    Abstract: We study the problem of learning association between face and voice, which is gaining interest in the computer vision community lately. Prior works adopt pairwise or triplet loss formulations to learn an embedding space amenable for associated matching and verification tasks. Albeit showing some progress, such loss formulations are, however, restrictive due to dependency on distance-dependent marg… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

  31. arXiv:2112.02838  [pdf, other

    cs.CV

    Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey and Outlook

    Authors: Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, Jiri Matas

    Abstract: Accurate and robust visual object tracking is one of the most challenging and fundamental computer vision problems. It entails estimating the trajectory of the target in an image sequence, given only its initial location, and segmentation, or its rough approximation in the form of a bounding box. Discriminative Correlation Filters (DCFs) and deep Siamese Networks (SNs) have emerged as dominating t… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Tracking Survey

  32. arXiv:2110.00249  [pdf, other

    cs.CV

    Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection

    Authors: Muhammad Akhtar Munir, Muhammad Haris Khan, M. Saquib Sarfraz, Mohsen Ali

    Abstract: We study adapting trained object detectors to unseen domains manifesting significant variations of object appearance, viewpoints and backgrounds. Most current methods align domains by either using image or instance-level feature alignment in an adversarial fashion. This often suffers due to the presence of unwanted background and as such lacks class-specific alignment. A common remedy to promote c… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: To appear in NeurIPS2021

  33. arXiv:2104.12709  [pdf, other

    cs.CV

    Rich Semantics Improve Few-shot Learning

    Authors: Mohamed Afham, Salman Khan, Muhammad Haris Khan, Muzammal Naseer, Fahad Shahbaz Khan

    Abstract: Human learning benefits from multi-modal inputs that often appear as rich semantics (e.g., description of an object's attributes while learning about it). This enables us to learn generalizable concepts from very limited visual examples. However, current few-shot learning (FSL) methods use numerical class labels to denote object classes which do not provide rich semantic meanings about the learned… ▽ More

    Submitted 12 November, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: Accepted to 32nd British Machine Vision Conference (BMVC 2021)

  34. arXiv:2003.01107  [pdf

    cs.DC cs.NI cs.OS

    Reconfigurable Parallel Architecture of High Speed Round Robin Arbiter

    Authors: Arnab Paul, Mamdudul Haque Khan, M. Muktadir Rahman, Tanvir Zaman Khan, Prajoy Podder, Md. Yeasir Akram Khan

    Abstract: With a view to managing the increasing traffic in computer networks, round robin arbiter has been proposed to work with packet switching system to have increased speed in providing access and scheduling. Round robin arbiter is a doorway to a particular bus based on request along with equal priority and gives turns to devices connected to it in a cyclic order. Considering the rapid growth in comput… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: Published in 2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO)

    Report number: 15438813

    Journal ref: 2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO)

  35. arXiv:1911.03711  [pdf, other

    eess.IV cs.CV

    Unsupervised adulterated red-chili pepper content transformation for hyperspectral classification

    Authors: Muhammad Hussain Khan, Zainab Saleem, Muhammad Ahmad, Ahmed Sohaib, Hamail Ayaz

    Abstract: Preserving red-chili quality is of utmost importance in which the authorities demand the quality techniques to detect, classify and prevent it from the impurities. For example, salt, wheat flour, wheat bran, and rice bran contamination in grounded red chili, which typically a food, are a serious threat to people who are allergic to such items. This work presents the feasibility of utilizing visibl… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.

    Comments: 10 pages,

  36. arXiv:1910.07721  [pdf, other

    cs.CV

    Deep Contextual Attention for Human-Object Interaction Detection

    Authors: Tiancai Wang, Rao Muhammad Anwer, Muhammad Haris Khan, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao, Jorma Laaksonen

    Abstract: Human-object interaction detection is an important and relatively new class of visual relationship detection tasks, essential for deeper scene understanding. Most existing approaches decompose the problem into object localization and interaction recognition. Despite showing progress, these approaches only rely on the appearances of humans and objects and overlook the available context information,… ▽ More

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: Accepted at ICCV 2019

  37. arXiv:1910.06160  [pdf, other

    cs.CV

    Mask-Guided Attention Network for Occluded Pedestrian Detection

    Authors: Yanwei Pang, ** Xie, Muhammad Haris Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, Ling Shao

    Abstract: Pedestrian detection relying on deep convolution neural networks has made significant progress. Though promising results have been achieved on standard pedestrians, the performance on heavily occluded pedestrians remains far from satisfactory. The main culprits are intra-class occlusions involving other pedestrians and inter-class occlusions caused by other objects, such as cars and bicycles. Thes… ▽ More

    Submitted 15 October, 2019; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: Accepted at ICCV 2019

  38. arXiv:1909.04951  [pdf, other

    cs.CV

    AnimalWeb: A Large-Scale Hierarchical Dataset of Annotated Animal Faces

    Authors: Muhammad Haris Khan, John McDonagh, Salman Khan, Muhammad Shahabuddin, Aditya Arora, Fahad Shahbaz Khan, Ling Shao, Georgios Tzimiropoulos

    Abstract: Being heavily reliant on animals, it is our ethical obligation to improve their well-being by understanding their needs. Several studies show that animal needs are often expressed through their faces. Though remarkable progress has been made towards the automatic understanding of human faces, this has regrettably not been the case with animal faces. There exists significant room and appropriate ne… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: 15 pages, 14 figures

  39. arXiv:1811.01194  [pdf, other

    cs.CV

    Pushing the boundaries of audiovisual word recognition using Residual Networks and LSTMs

    Authors: Themos Stafylakis, Muhammad Haris Khan, Georgios Tzimiropoulos

    Abstract: Visual and audiovisual speech recognition are witnessing a renaissance which is largely due to the advent of deep learning methods. In this paper, we present a deep learning architecture for lipreading and audiovisual word recognition, which combines Residual Networks equipped with spatiotemporal input layers and Bidirectional LSTMs. The lipreading architecture attains 11.92% misclassification rat… ▽ More

    Submitted 3 November, 2018; originally announced November 2018.

    Comments: Accepted to Computer Vision and Image Understanding (Elsevier)

  40. arXiv:1609.05708  [pdf

    cs.NI

    Reducing energy consumption of network infrastructure using spectral approach

    Authors: Mohammad Habibullah Khan, Eric Rondeau, Jean-Philippe Georges

    Abstract: The energy consumption by ICT (Information and Communication Technology) equipment is rapidly increasing which causes a significant economic and environmental problem. At present, the network infrastructure is becoming a large portion of the energy footprint in ICT. Thus, the concept of energy efficient or green networking has been introduced. Now one of the main concerns of network industry is to… ▽ More

    Submitted 19 September, 2016; originally announced September 2016.

    Comments: International Sustainable Ecological Engineering Design for Society (SEEDS) Conference, Sep 2016, Leeds, United Kingdom

  41. arXiv:1209.5426  [pdf

    cs.DB

    A Coherent Distributed Grid Service for Assimilation and Unification of Heterogeneous Data Source

    Authors: Tanvir Ahmed, Mohammad Saiedur Rahaman, Mohammad Saidur Rahman, Manzur H. Khan

    Abstract: Grid services are heavily used for handling large distributed computations. They are also very useful to handle heavy data intensive applications where data are distributed in different sites. Most of the data grid services used in such situations are meant for homogeneous data source. In case of Heterogeneous data sources, most of the grid services that are available are designed such a way that… ▽ More

    Submitted 12 October, 2012; v1 submitted 24 September, 2012; originally announced September 2012.

    Comments: 9 pages; ISSN 1608-3679

    ACM Class: H.2.5; H.2.4; H.3.3

    Journal ref: AIUB Journal of Science And Engineering (AJSE) Vol. 9, No. 1, PP 47-55, 2010

  42. arXiv:1001.1966  [pdf

    cs.CV cs.CR

    A New Method to Extract Dorsal Hand Vein Pattern using Quadratic Inference Function

    Authors: Maleika Heenaye Mamode Khan, Naushad Ali Mamode Khan

    Abstract: Among all biometric, dorsal hand vein pattern is attracting the attention of researchers, of late. Extensive research is being carried out on various techniques in the hope of finding an efficient one which can be applied on dorsal hand vein pattern to improve its accuracy and matching time. One of the crucial step in biometric is the extraction of features. In this paper, we propose a method ba… ▽ More

    Submitted 12 January, 2010; originally announced January 2010.

    Comments: 5 pages IEEE format, International Journal of Computer Science and Information Security, IJCSIS December 2009, ISSN 1947 5500, http://sites.google.com/site/ijcsis/

    Report number: ISSN 1947 5500

    Journal ref: International Journal of Computer Science and Information Security, IJCSIS, Vol. 6, No. 3, pp. 026-030, December 2009, USA