Skip to main content

Showing 1–43 of 43 results for author: Awais, M

.
  1. arXiv:2406.19556  [pdf, other

    eess.IV cs.CV cs.LG

    BOrg: A Brain Organoid-Based Mitosis Dataset for Automatic Analysis of Brain Diseases

    Authors: Muhammad Awais, Mehaboobathunnisa Sahul Hameed, Bidisha Bhattacharya, Orly Reiner, Rao Muhammad Anwer

    Abstract: Recent advances have enabled the study of human brain development using brain organoids derived from stem cells. Quantifying cellular processes like mitosis in these organoids offers insights into neurodevelopmental disorders, but the manual analysis is time-consuming, and existing datasets lack specific details for brain organoid studies. We introduce BOrg, a dataset designed to study mitotic eve… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.17460  [pdf, other

    cs.CV

    Investigating Self-Supervised Methods for Label-Efficient Learning

    Authors: Srinivasa Rao Nandam, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais

    Abstract: Vision transformers combined with self-supervised learning have enabled the development of models which scale across large datasets for several downstream tasks like classification, segmentation and detection. The low-shot learning capability of these models, across several low-shot downstream tasks, has been largely under explored. We perform a system level study of different self supervised pret… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.17450  [pdf, other

    cs.CV cs.AI

    Pseudo Labelling for Enhanced Masked Autoencoders

    Authors: Srinivasa Rao Nandam, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais

    Abstract: Masked Image Modeling (MIM)-based models, such as SdAE, CAE, GreenMIM, and MixAE, have explored different strategies to enhance the performance of Masked Autoencoders (MAE) by modifying prediction, loss functions, or incorporating additional architectural components. In this paper, we propose an enhanced approach that boosts MAE performance by integrating pseudo labelling for both class and data t… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.04413  [pdf, other

    cs.CV cs.AI

    Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning

    Authors: Amandeep Kumar, Muhammad Awais, Sanath Narayan, Hisham Cholakkal, Salman Khan, Rao Muhammad Anwer

    Abstract: Drawing upon StyleGAN's expressivity and disentangled latent space, existing 2D approaches employ textual prompting to edit facial images with different attributes. In contrast, 3D-aware approaches that generate faces at different target poses require attribute-specific classifiers, learning separate model weights for each attribute, and are not scalable for novel attributes. In this work, we prop… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2405.00168  [pdf, other

    cs.CV

    Revisiting RGBT Tracking Benchmarks from the Perspective of Modality Validity: A New Benchmark, Problem, and Method

    Authors: Zhangyong Tang, Tianyang Xu, Zhenhua Feng, Xuefeng Zhu, He Wang, Pengcheng Shao, Chunyang Cheng, Xiao-Jun Wu, Muhammad Awais, Sara Atito, Josef Kittler

    Abstract: RGBT tracking draws increasing attention due to its robustness in multi-modality warranting (MMW) scenarios, such as nighttime and bad weather, where relying on a single sensing modality fails to ensure stable tracking results. However, the existing benchmarks predominantly consist of videos collected in common scenarios where both RGB and thermal infrared (TIR) information are of sufficient quali… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  6. arXiv:2404.00509  [pdf, other

    cs.LG cs.CV

    DailyMAE: Towards Pretraining Masked Autoencoders in One Day

    Authors: Jiantao Wu, Shentong Mo, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais

    Abstract: Recently, masked image modeling (MIM), an important self-supervised learning (SSL) method, has drawn attention for its effectiveness in learning data representation from unlabeled data. Numerous studies underscore the advantages of MIM, highlighting how models pretrained on extensive datasets can enhance the performance of downstream tasks. However, the high computational demands of pretraining po… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  7. arXiv:2402.15534  [pdf, other

    eess.IV cs.CV cs.LG

    DiCoM -- Diverse Concept Modeling towards Enhancing Generalizability in Chest X-Ray Studies

    Authors: Abhieet Parida, Daniel Capellan-Martin, Sara Atito, Muhammad Awais, Maria J. Ledesma-Carbayo, Marius G. Linguraru, Syed Muhammad Anwar

    Abstract: Chest X-Ray (CXR) is a widely used clinical imaging modality and has a pivotal role in the diagnosis and prognosis of various lung and heart related conditions. Conventional automated clinical diagnostic tool design strategies relying on radiology reads and supervised learning, entail the cumbersome requirement of high quality annotated training data. To address this challenge, self-supervised pre… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  8. arXiv:2312.01118  [pdf, other

    cs.CV

    Beyond Accuracy: Statistical Measures and Benchmark for Evaluation of Representation from Self-Supervised Learning

    Authors: Jiantao Wu, Shentong Mo, Sara Atito, Josef Kittler, Zhenhua Feng, Muhammad Awais

    Abstract: Recently, self-supervised metric learning has raised attention for the potential to learn a generic distance function. It overcomes the limitations of conventional supervised one, e.g., scalability and label biases. Despite progress in this domain, current benchmarks, incorporating a narrow scope of classes, stop the nuanced evaluation of semantic representations. To bridge this gap, we introduce… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  9. LT-ViT: A Vision Transformer for multi-label Chest X-ray classification

    Authors: Umar Marikkar, Sara Atito, Muhammad Awais, Adam Mahdi

    Abstract: Vision Transformers (ViTs) are widely adopted in medical imaging tasks, and some existing efforts have been directed towards vision-language training for Chest X-rays (CXRs). However, we envision that there still exists a potential for improvement in vision-only training for CXRs using ViTs, by aggregating information from multiple scales, which has been proven beneficial for non-transformer netwo… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 5 pages, 2 figures

  10. arXiv:2309.05834  [pdf, other

    cs.CV

    SCD-Net: Spatiotemporal Clues Disentanglement Network for Self-supervised Skeleton-based Action Recognition

    Authors: Cong Wu, Xiao-Jun Wu, Josef Kittler, Tianyang Xu, Sara Atito, Muhammad Awais, Zhenhua Feng

    Abstract: Contrastive learning has achieved great success in skeleton-based action recognition. However, most existing approaches encode the skeleton sequences as entangled spatiotemporal representations and confine the contrasts to the same level of representation. Instead, this paper introduces a novel contrastive learning framework, namely Spatiotemporal Clues Disentanglement Network (SCD-Net). Specifica… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  11. arXiv:2308.11448  [pdf, other

    cs.CV cs.LG

    Masked Momentum Contrastive Learning for Zero-shot Semantic Understanding

    Authors: Jiantao Wu, Shentong Mo, Muhammad Awais, Sara Atito, Zhenhua Feng, Josef Kittler

    Abstract: Self-supervised pretraining (SSP) has emerged as a popular technique in machine learning, enabling the extraction of meaningful feature representations without labelled data. In the realm of computer vision, pretrained vision transformers (ViTs) have played a pivotal role in advancing transfer learning. Nonetheless, the escalating cost of finetuning these large models has posed a challenge due to… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  12. arXiv:2307.13721  [pdf, other

    cs.CV cs.AI

    Foundational Models Defining a New Era in Vision: A Survey and Outlook

    Authors: Muhammad Awais, Muzammal Naseer, Salman Khan, Rao Muhammad Anwer, Hisham Cholakkal, Mubarak Shah, Ming-Hsuan Yang, Fahad Shahbaz Khan

    Abstract: Vision systems to see and reason about the compositional nature of visual scenes are fundamental to understanding our world. The complex relations between objects and their locations, ambiguities, and variations in the real-world environment can be better described in human language, naturally governed by grammatical rules and other modalities such as audio and depth. The models learned to bridge… ▽ More

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: Project page: https://github.com/awaisrauf/Awesome-CV-Foundational-Models

  13. CAMP: A Context-Aware Cricket Players Performance Metric

    Authors: Muhammad Sohaib Ayub, Naimat Ullah, Sarwan Ali, Imdad Ullah Khan, Mian Muhammad Awais, Muhammad Asad Khan, Safiullah Faizullah

    Abstract: Cricket is the second most popular sport after soccer in terms of viewership. However, the assessment of individual player performance, a fundamental task in team sports, is currently primarily based on aggregate performance statistics, including average runs and wickets taken. We propose Context-Aware Metric of player Performance, CAMP, to quantify individual players' contributions toward a crick… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Journal ref: Journal of the Operational Research Society (2023) 1-27

  14. arXiv:2306.15369  [pdf, other

    cs.SE cs.LG

    A Meta-analytical Comparison of Naive Bayes and Random Forest for Software Defect Prediction

    Authors: Ch Muhammad Awais, Wei Gu, Gcinizwe Dlamini, Zamira Kholmatova, Giancarlo Succi

    Abstract: Is there a statistical difference between Naive Bayes and Random Forest in terms of recall, f-measure, and precision for predicting software defects? By utilizing systematic literature review and meta-analysis, we are answering this question. We conducted a systematic literature review by establishing criteria to search and choose papers, resulting in five studies. After that, using the meta-data… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 11 pages, 8 figures, Conference Paper

    Journal ref: Intelligent Systems Design and Applications. ISDA 2022. Lecture Notes in Networks and Systems, vol 716

  15. arXiv:2303.12959  [pdf, other

    cs.LG cs.AI

    Variantional autoencoder with decremental information bottleneck for disentanglement

    Authors: Jiantao Wu, Shentong Mo, Xiang Yang, Muhammad Awais, Sara Atito, Xingshen Zhang, Lin Wang, Xiang Yang

    Abstract: One major challenge of disentanglement learning with variational autoencoders is the trade-off between disentanglement and reconstruction fidelity. Previous studies, which increase the information bottleneck during training, tend to lose the constraint of disentanglement, leading to the information diffusion problem. In this paper, we present a novel framework for disentangled representation learn… ▽ More

    Submitted 4 October, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  16. arXiv:2211.13189  [pdf, other

    cs.SD cs.CV eess.AS

    ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification

    Authors: Sara Atito, Muhammad Awais, Wenwu Wang, Mark D Plumbley, Josef Kittler

    Abstract: Transformers, which were originally developed for natural language processing, have recently generated significant interest in the computer vision and audio communities due to their flexibility in learning long-range relationships. Constrained by the data hungry nature of transformers and the limited amount of labelled data, most transformer-based models for audio tasks are finetuned from ImageNet… ▽ More

    Submitted 10 March, 2024; v1 submitted 23 November, 2022; originally announced November 2022.

  17. arXiv:2211.12944  [pdf, other

    eess.IV cs.CV

    SPCXR: Self-supervised Pretraining using Chest X-rays Towards a Domain Specific Foundation Model

    Authors: Syed Muhammad Anwar, Abhijeet Parida, Sara Atito, Muhammad Awais, Gustavo Nino, Josef Kitler, Marius George Linguraru

    Abstract: Chest X-rays (CXRs) are a widely used imaging modality for the diagnosis and prognosis of lung disease. The image analysis tasks vary. Examples include pathology detection and lung segmentation. There is a large body of work where machine learning algorithms are developed for specific tasks. A significant recent example is Coronavirus disease (covid-19) detection using CXR data. However, the tradi… ▽ More

    Submitted 18 May, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

  18. arXiv:2210.13217  [pdf, other

    physics.app-ph

    Piezoelectric PVDF-TrFE/ PET Energy Harvesters for Structural Health Monitoring (SHM) Applications

    Authors: Berkay Kullukçu, Mohammad Bathaei, Muhammad Awais, Hadi Mirzajani, Levent Beker

    Abstract: This research describes a piezoelectric Poly(vinylidene fluoride-co-trifluoroethylene)/ Polyethylene Terephthalate energy harvester for structural health monitoring of wind turbines. The piezoelectric energy harvester was made of a polyvinylidene fluoride-trifluoroethylene (PVDF-TrFE) layer. In addition, PET sheets, double-sided micron-thick tapes, and PVDF sheets were used for device fabrication.… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 8 pages

  19. arXiv:2208.13923  [pdf, other

    eess.IV cs.CV cs.LG

    SB-SSL: Slice-Based Self-Supervised Transformers for Knee Abnormality Classification from MRI

    Authors: Sara Atito, Syed Muhammad Anwar, Muhammad Awais, Josef Kitler

    Abstract: The availability of large scale data with high quality ground truth labels is a challenge when develo** supervised machine learning solutions for healthcare domain. Although, the amount of digital data in clinical workflows is increasing, most of this data is distributed on clinical sites and protected to ensure patient privacy. Radiological readings and dealing with large-scale clinical data pu… ▽ More

    Submitted 29 August, 2022; originally announced August 2022.

    Comments: Accepted at MICCAI MILLAND workshop

  20. arXiv:2208.08224  [pdf, other

    cs.CV eess.IV

    Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture

    Authors: Muhammad Muzammel, Mohd Zuki Yusoff, Mohamad Naufal Mohamad Saad, Faryal Sheikh, Muhammad Ahsan Awais

    Abstract: Buses and heavy vehicles have more blind spots compared to cars and other road vehicles due to their large sizes. Therefore, accidents caused by these heavy vehicles are more fatal and result in severe injuries to other road users. These possible blind-spot collisions can be identified early using vision-based object detection approaches. Yet, the existing state-of-the-art vision-based object dete… ▽ More

    Submitted 19 August, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

  21. Blockchain based Secure Energy Marketplace Scheme to Motivate Peer to Peer Microgrids

    Authors: Muhammad Awais, Qamar Abbas, Shehbaz Tariq, Sayyaf Haider Warraich

    Abstract: In the past years trend of microgrids is increasing very fast to reduce peak-hour costs. However, in these systems, third parties are still involved in selling surplus energy. This results in increased cost of energy and there are many operational and security barriers in such systems. These issues can be solved by the decentralized distributed system of microgrids where a consumer can locally sel… ▽ More

    Submitted 20 May, 2024; v1 submitted 14 June, 2022; originally announced June 2022.

    Journal ref: International Journal of Informatics and Communication Technology 11, 177-184 (2022)

  22. arXiv:2205.14986  [pdf, other

    cs.CV

    GMML is All you Need

    Authors: Sara Atito, Muhammad Awais, Josef Kittler

    Abstract: Vision transformers have generated significant interest in the computer vision community because of their flexibility in exploiting contextual information, whether it is sharply confined local, or long range global. However, they are known to be data hungry. This has motivated the research in self-supervised transformer pretraining, which does not need to decode the semantic information conveyed b… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

  23. arXiv:2205.02108  [pdf, other

    cs.LG cs.AI

    Using Deep Reinforcement Learning to solve Optimal Power Flow problem with generator failures

    Authors: Muhammad Usman Awais

    Abstract: Deep Reinforcement Learning (DRL) is being used in many domains. One of the biggest advantages of DRL is that it enables the continuous improvement of a learning agent. Secondly, the DRL framework is robust and flexible enough to be applicable to problems of varying nature and domain. Presented work is evidence of using the DRL technique to solve an Optimal Power Flow (OPF) problem. Two classical… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

  24. arXiv:2111.15340  [pdf, other

    cs.CV cs.LG

    MC-SSL0.0: Towards Multi-Concept Self-Supervised Learning

    Authors: Sara Atito, Muhammad Awais, Ammarah Farooq, Zhenhua Feng, Josef Kittler

    Abstract: Self-supervised pretraining is the method of choice for natural language processing models and is rapidly gaining popularity in many vision tasks. Recently, self-supervised pretraining has shown to outperform supervised pretraining for many downstream vision applications, marking a milestone in the area. This superiority is attributed to the negative impact of incomplete labelling of the training… ▽ More

    Submitted 30 November, 2021; originally announced November 2021.

  25. arXiv:2111.13156  [pdf, other

    cs.CV

    Global Interaction Modelling in Vision Transformer via Super Tokens

    Authors: Ammarah Farooq, Muhammad Awais, Sara Ahmed, Josef Kittler

    Abstract: With the popularity of Transformer architectures in computer vision, the research focus has shifted towards develo** computationally efficient designs. Window-based local attention is one of the major techniques being adopted in recent works. These methods begin with very small patch size and small embedding dimensions and then perform strided convolution (patch merging) in order to reduce the f… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

  26. arXiv:2111.05073  [pdf, other

    cs.LG cs.AI cs.CV

    MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps

    Authors: Muhammad Awais, Fengwei Zhou, Chuanlong Xie, Jiawei Li, Sung-Ho Bae, Zhenguo Li

    Abstract: Deep neural networks are susceptible to adversarially crafted, small and imperceptible changes in the natural inputs. The most effective defense mechanism against these examples is adversarial training which constructs adversarial examples during training by iterative maximization of loss. The model is then trained to minimize the loss on these constructed examples. This min-max optimization requi… ▽ More

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Accepted by NeurIPS 2021

  27. arXiv:2111.03861  [pdf, other

    cs.CV cs.AI cs.LG

    What augmentations are sensitive to hyper-parameters and why?

    Authors: Ch Muhammad Awais, Imad Eddine Ibrahim Bekkouch

    Abstract: We apply augmentations to our dataset to enhance the quality of our predictions and make our final models more resilient to noisy data and domain drifts. Yet the question remains, how are these augmentations going to perform with different hyper-parameters? In this study we evaluate the sensitivity of augmentations with regards to the model's hyper parameters along with their consistency and influ… ▽ More

    Submitted 6 November, 2021; originally announced November 2021.

    Comments: 10 pages, 17 figures

  28. arXiv:2109.00946  [pdf, other

    cs.LG cs.CV

    Adversarial Robustness for Unsupervised Domain Adaptation

    Authors: Muhammad Awais, Fengwei Zhou, Hang Xu, Lanqing Hong, ** Luo, Sung-Ho Bae, Zhenguo Li

    Abstract: Extensive Unsupervised Domain Adaptation (UDA) studies have shown great success in practice by learning transferable representations across a labeled source domain and an unlabeled target domain with deep models. However, previous works focus on improving the generalization ability of UDA models on clean examples without considering the adversarial robustness, which is crucial in real-world applic… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: Accepted by ICCV 2021

  29. arXiv:2104.03602  [pdf, other

    cs.CV cs.LG

    SiT: Self-supervised vIsion Transformer

    Authors: Sara Atito, Muhammad Awais, Josef Kittler

    Abstract: Self-supervised learning methods are gaining increasing traction in computer vision due to their recent success in reducing the gap with supervised learning. In natural language processing (NLP) self-supervised learning and transformers are already the methods of choice. The recent literature suggests that the transformers are becoming increasingly popular also in computer vision. So far, the visi… ▽ More

    Submitted 26 December, 2022; v1 submitted 8 April, 2021; originally announced April 2021.

  30. arXiv:2103.03503  [pdf

    cs.CV cs.LG

    NPT-Loss: A Metric Loss with Implicit Mining for Face Recognition

    Authors: Syed Safwan Khalid, Muhammad Awais, Chi-Ho Chan, Zhenhua Feng, Ammarah Farooq, Ali Akbari, Josef Kittler

    Abstract: Face recognition (FR) using deep convolutional neural networks (DCNNs) has seen remarkable success in recent years. One key ingredient of DCNN-based FR is the appropriate design of a loss function that ensures discrimination between various identities. The state-of-the-art (SOTA) solutions utilise normalised Softmax loss with additive and/or multiplicative margins. Despite being popular, these Sof… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

  31. arXiv:2101.08238  [pdf, other

    cs.CV cs.LG

    AXM-Net: Implicit Cross-Modal Feature Alignment for Person Re-identification

    Authors: Ammarah Farooq, Muhammad Awais, Josef Kittler, Syed Safwan Khalid

    Abstract: Cross-modal person re-identification (Re-ID) is critical for modern video surveillance systems. The key challenge is to align cross-modality representations induced by the semantic information present for a person and ignore background information. This work presents a novel convolutional neural network (CNN) based architecture designed to learn semantically aligned cross-modal visual and textual… ▽ More

    Submitted 20 July, 2022; v1 submitted 19 January, 2021; originally announced January 2021.

    Comments: AAAI-2022 (Oral Paper)

  32. arXiv:2010.10368  [pdf, other

    cs.CV cs.AI

    A Flatter Loss for Bias Mitigation in Cross-dataset Facial Age Estimation

    Authors: Ali Akbari, Muhammad Awais, Zhen-Hua Feng, Ammarah Farooq, Josef Kittler

    Abstract: The most existing studies in the facial age estimation assume training and test images are captured under similar shooting conditions. However, this is rarely valid in real-world applications, where training and test sets usually have different characteristics. In this paper, we advocate a cross-dataset protocol for age estimation benchmarking. In order to improve the cross-dataset age estimation… ▽ More

    Submitted 26 October, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

  33. Deep Convolutional Neural Network Ensembles using ECOC

    Authors: Sara Atito Ali Ahmed, Cemre Zor, Berrin Yanikoglu, Muhammad Awais, Josef Kittler

    Abstract: Deep neural networks have enhanced the performance of decision making systems in many applications including image understanding, and further gains can be achieved by constructing ensembles. However, designing an ensemble of deep networks is often not very beneficial since the time needed to train the networks is very high or the performance gain obtained is not very significant. In this paper, we… ▽ More

    Submitted 7 March, 2021; v1 submitted 7 September, 2020; originally announced September 2020.

    Comments: 13 pages double column IEEE transactions style

    MSC Class: 68T07; ACM Class: I.5.2; I.2.0

  34. arXiv:2006.11007  [pdf, other

    cs.LG stat.ML

    Towards an Adversarially Robust Normalization Approach

    Authors: Muhammad Awais, Fahad Shamshad, Sung-Ho Bae

    Abstract: Batch Normalization (BatchNorm) is effective for improving the performance and accelerating the training of deep neural networks. However, it has also shown to be a cause of adversarial vulnerability, i.e., networks without it are more robust to adversarial attacks. In this paper, we investigate how BatchNorm causes this vulnerability and proposed new normalization that is robust to adversarial at… ▽ More

    Submitted 19 June, 2020; originally announced June 2020.

  35. arXiv:2005.07765  [pdf

    cs.NI

    SDN Enabled and OpenFlow Compatible Network Performance Monitoring System

    Authors: S. H. Warraich, Z. Aziz, H. Khurshid, R. Hameed, A. Saboor, M. Awais

    Abstract: Network performance monitoring holds a pivotal role in improving the overall network performance. It is essential to monitor the traffic statistics at Internet eXchange Points (IXPs) to optimize the traffic flows. The existing monitoring system either lacks usability or programmability. We present a Software Defined Networking (SDN) enabled and OpenFlow (OF) compatible network performance monitori… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

    Comments: 10 pages, 18 figures

  36. arXiv:2004.04775  [pdf, other

    cs.CV

    Early Disease Diagnosis for Rice Crop

    Authors: M. Hammad Masood, Habiba Saim, Murtaza Taj, Mian M. Awais

    Abstract: Many existing techniques provide automatic estimation of crop damage due to various diseases. However, early detection can prevent or reduce the extend of damage itself. The limited performance of existing techniques in early detection is lack of localized information. We instead propose a dataset with annotations for each diseased segment in each image. Unlike existing approaches, instead of clas… ▽ More

    Submitted 9 April, 2020; originally announced April 2020.

    Comments: Paper presented at the ICLR 2020 Workshop on Computer Vision for Agriculture (CV4A)

  37. arXiv:2003.00808  [pdf, other

    cs.CV cs.LG stat.ML

    A Convolutional Baseline for Person Re-Identification Using Vision and Language Descriptions

    Authors: Ammarah Farooq, Muhammad Awais, Fei Yan, Josef Kittler, Ali Akbari, Syed Safwan Khalid

    Abstract: Classical person re-identification approaches assume that a person of interest has appeared across different cameras and can be queried by one of the existing images. However, in real-world surveillance scenarios, frequently no visual information will be available about the queried person. In such scenarios, a natural language description of the person by a witness will provide the only source of… ▽ More

    Submitted 20 February, 2020; originally announced March 2020.

    Comments: 12 pages including references, currently under review in IEEE transactions on Image Processing

  38. arXiv:1811.12488  [pdf, other

    cs.CV cs.LG stat.ML

    Leveraging Deep Stein's Unbiased Risk Estimator for Unsupervised X-ray Denoising

    Authors: Fahad Shamshad, Muhammad Awais, Muhammad Asim, Zain ul Aabidin Lodhi, Muhammad Umair, Ali Ahmed

    Abstract: Among the plethora of techniques devised to curb the prevalence of noise in medical images, deep learning based approaches have shown the most promise. However, one critical limitation of these deep learning based denoisers is the requirement of high-quality noiseless ground truth images that are difficult to obtain in many medical imaging applications such as X-rays. To circumvent this issue, we… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/223

  39. arXiv:1711.06753  [pdf, other

    cs.CV

    Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks

    Authors: Zhen-Hua Feng, Josef Kittler, Muhammad Awais, Patrik Huber, Xiao-Jun Wu

    Abstract: We present a new loss function, namely Wing loss, for robust facial landmark localisation with Convolutional Neural Networks (CNNs). We first compare and analyse different loss functions including L2, L1 and smooth L1. The analysis of these loss functions suggests that, for the training of a CNN-based localisation model, more attention should be paid to small and medium range errors. To this end,… ▽ More

    Submitted 23 October, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Comments: 11 pages, 6 figures, 6 tables

  40. Medical Image Analysis using Convolutional Neural Networks: A Review

    Authors: Syed Muhammad Anwar, Muhammad Majid, Adnan Qayyum, Muhammad Awais, Majdi Alnowami, Muhammad Khurram Khan

    Abstract: The science of solving clinical problems by analyzing images generated in clinical practice is known as medical image analysis. The aim is to extract information in an effective and efficient manner for improved clinical diagnosis. The recent advances in the field of biomedical engineering has made medical image analysis one of the top research and development area. One of the reason for this adva… ▽ More

    Submitted 21 May, 2019; v1 submitted 4 September, 2017; originally announced September 2017.

    Journal ref: Journal of Medical Systems (2018)

  41. 3D Morphable Models as Spatial Transformer Networks

    Authors: Anil Bas, Patrik Huber, William A. P. Smith, Muhammad Awais, Josef Kittler

    Abstract: In this paper, we show how a 3D Morphable Model (i.e. a statistical model of the 3D shape of a class of objects such as faces) can be used to spatially transform input data as a module (a 3DMM-STN) within a convolutional neural network. This is an extension of the original spatial transformer network in that we are able to interpret and normalise 3D pose changes and self-occlusions. The trained lo… ▽ More

    Submitted 23 August, 2017; originally announced August 2017.

    Comments: Accepted to ICCV 2017 2nd Workshop on Geometry Meets Deep Learning

    MSC Class: 68T45 ACM Class: I.4.8; I.2.10

  42. arXiv:1705.02402  [pdf, other

    cs.CV

    Face Detection, Bounding Box Aggregation and Pose Estimation for Robust Facial Landmark Localisation in the Wild

    Authors: Zhen-Hua Feng, Josef Kittler, Muhammad Awais, Patrik Huber, Xiao-Jun Wu

    Abstract: We present a framework for robust face detection and landmark localisation of faces in the wild, which has been evaluated as part of `the 2nd Facial Landmark Localisation Competition'. The framework has four stages: face detection, bounding box aggregation, pose estimation and landmark localisation. To achieve a high detection rate, we use two publicly available CNN-based face detectors and two pr… ▽ More

    Submitted 1 June, 2017; v1 submitted 5 May, 2017; originally announced May 2017.

  43. Medical Image Retrieval using Deep Convolutional Neural Network

    Authors: Adnan Qayyum, Syed Muhammad Anwar, Muhammad Awais, Muhammad Majid

    Abstract: With a widespread use of digital imaging data in hospitals, the size of medical image repositories is increasing rapidly. This causes difficulty in managing and querying these large databases leading to the need of content based medical image retrieval (CBMIR) systems. A major challenge in CBMIR systems is the semantic gap that exists between the low level visual information captured by imaging de… ▽ More

    Submitted 24 March, 2017; originally announced March 2017.

    Comments: Submitted to Neurocomputing

    Journal ref: Neurocomputing 2017