Search | arXiv e-print repository

Investigation of UAV Detection in Images with Complex Backgrounds and Rainy Artifacts

Authors: Adnan Munir, Abdul Jabbar Siddiqui, Saeed Anwar

Abstract: To detect unmanned aerial vehicles (UAVs) in real-time, computer vision and deep learning approaches are evolving research areas. Interest in this problem has grown due to concerns regarding the possible hazards and misuse of employing UAVs in many applications. These include potential privacy violations. To address the concerns, vision-based object detection methods have been developed for UAV de… ▽ More To detect unmanned aerial vehicles (UAVs) in real-time, computer vision and deep learning approaches are evolving research areas. Interest in this problem has grown due to concerns regarding the possible hazards and misuse of employing UAVs in many applications. These include potential privacy violations. To address the concerns, vision-based object detection methods have been developed for UAV detection. However, UAV detection in images with complex backgrounds and weather artifacts like rain has yet to be reasonably studied. Hence, for this purpose, we prepared two training datasets. The first dataset has the sky as its background and is called the Sky Background Dataset (SBD). The second training dataset has more complex scenes (with diverse backgrounds) and is named the Complex Background Dataset (CBD). Additionally, two test sets were prepared: one containing clear images and the other with images with three rain artifacts, named the Rainy Test Set (RTS). This work also focuses on benchmarking state-of-the-art object detection models, and to the best of our knowledge, it is the first to investigate the performance of recent and popular vision-based object detection methods for UAV detection under challenging conditions such as complex backgrounds, varying UAV sizes, and low-to-heavy rainy conditions. The findings presented in the paper shall help provide insights concerning the performance of the selected models for UAV detection under challenging conditions and pave the way to develop more robust UAV detection methods. The codes and datasets are available at: https://github.com/AdnanMunir294/UAVD-CBRA. △ Less

Submitted 5 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

Comments: Accepted at the Real-World Surveillance Workshop, IEEE/CVF Winter Conference on Applications of Computer Vision 2024

arXiv:2203.05975 [pdf, other]

FExGAN-Meta: Facial Expression Generation with Meta Humans

Authors: J. Rafid Siddiqui

Abstract: The subtleness of human facial expressions and a large degree of variation in the level of intensity to which a human expresses them is what makes it challenging to robustly classify and generate images of facial expressions. Lack of good quality data can hinder the performance of a deep learning model. In this article, we have proposed a Facial Expression Generation method for Meta-Humans (FExGAN… ▽ More The subtleness of human facial expressions and a large degree of variation in the level of intensity to which a human expresses them is what makes it challenging to robustly classify and generate images of facial expressions. Lack of good quality data can hinder the performance of a deep learning model. In this article, we have proposed a Facial Expression Generation method for Meta-Humans (FExGAN-Meta) that works robustly with the images of Meta-Humans. We have prepared a large dataset of facial expressions exhibited by ten Meta-Humans when placed in a studio environment and then we have evaluated FExGAN-Meta on the collected images. The results show that FExGAN-Meta robustly generates and classifies the images of Meta-Humans for the simple as well as the complex facial expressions. △ Less

Submitted 16 February, 2022; originally announced March 2022.

arXiv:2201.09061 [pdf, other]

Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial Network

Authors: J. Rafid Siddiqui

Abstract: Facial expressions are a form of non-verbal communication that humans perform seamlessly for meaningful transfer of information. Most of the literature addresses the facial expression recognition aspect however, with the advent of Generative Models, it has become possible to explore the affect space in addition to mere classification of a set of expressions. In this article, we propose a generativ… ▽ More Facial expressions are a form of non-verbal communication that humans perform seamlessly for meaningful transfer of information. Most of the literature addresses the facial expression recognition aspect however, with the advent of Generative Models, it has become possible to explore the affect space in addition to mere classification of a set of expressions. In this article, we propose a generative model architecture which robustly generates a set of facial expressions for multiple character identities and explores the possibilities of generating complex expressions by combining the simple ones. △ Less

Submitted 8 February, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

arXiv:2105.10724 [pdf, other]

Crawling Twitter data through API: A technical/legal perspective

Authors: Shahab Saquib Sohail, Mohammad Muzammil Khan, Mohd Arsalan, Aslam Khan, Jamshed Siddiqui, Syed Hamid Hasan, M. Afshar Alam

Abstract: The popularity of the online media-driven social network relation is proven in today's digital era. The many challenges that these emergence has created include a huge growing network of social relations, and the large amount of data which is continuously been generated via the different platform of social networking sites, viz. Facebook, Twitter, LinkedIn, Instagram, etc. These data are Personall… ▽ More The popularity of the online media-driven social network relation is proven in today's digital era. The many challenges that these emergence has created include a huge growing network of social relations, and the large amount of data which is continuously been generated via the different platform of social networking sites, viz. Facebook, Twitter, LinkedIn, Instagram, etc. These data are Personally Identifiable Information (PII) of the users which are also publicly available for some platform, and others allow with some restricted permission to download it for research purposes. The users' accessible data help in providing with better recommendation services to users, however, the PII can be used to embezzle the users and cause severe detriment to them. Hence, it is crucial to maintain the users' privacy while providing their PII accessible for various services. Therefore, it is a burning issue to come up with an approach that can help the users in getting better recommendation services without their privacy being harmed. In this paper, a framework is suggested for the same. Further, how data through Twitter API can be crawled and used has been extensively discussed. In addition to this, various security and legal perspectives regarding PII while crawling the data is highlighted. We believe the presented approach in this paper can serve as a benchmark for future research in the field of data privacy. △ Less

Submitted 22 May, 2021; originally announced May 2021.

arXiv:2105.00199 [pdf]

Can we aggregate human intelligence? an approach for human centric aggregation using ordered weighted averaging operators

Authors: Shahab Saquib Sohail, Jamshed Siddiqui, Rashid Ali, S. Hamid Hasan, M. Afshar Alam

Abstract: The primary objective of this paper is to present an approach for recommender systems that can assimilate ranking to the voters or rankers so that recommendation can be made by giving priority to experts suggestion over usual recommendation. To accomplish this, we have incorporated the concept of human-centric aggregation via Ordered Weighted Aggregation (OWA). Here, we are advocating ranked recom… ▽ More The primary objective of this paper is to present an approach for recommender systems that can assimilate ranking to the voters or rankers so that recommendation can be made by giving priority to experts suggestion over usual recommendation. To accomplish this, we have incorporated the concept of human-centric aggregation via Ordered Weighted Aggregation (OWA). Here, we are advocating ranked recommendation where rankers are assigned weights according to their place in the ranking. Further, the recommendation process which is presented here for the recommendation of books to university students exploits linguistic data summaries and Ordered Weighted Aggregation (OWA) technique. In the suggested approach, the weights are assigned in a way that it associates higher weights to best ranked university. The approach has been evaluated over eight different parameters. The superiority of the proposed approach is evident from the evaluation results. We claim that proposed scheme saves storage spaces required in traditional recommender systems as well as it does not need users prior preferences and hence produce a solution for cold start problem. This envisaged that the proposed scheme can be very useful in decision making problems, especially for recommender systems. In addition, it emphasizes on how human-centric aggregation can be useful in recommendation researches, and also it gives a new direction about how various human specific tasks can be numerically aggregated. △ Less

Submitted 1 May, 2021; originally announced May 2021.

arXiv:2007.04226 [pdf, other]

Labelling imaging datasets on the basis of neuroradiology reports: a validation study

Authors: David A. Wood, Sina Kafiabadi, Aisha Al Busaidi, Emily Guilhem, Jeremy Lynch, Matthew Townend, Antanas Montvila, Juveria Siddiqui, Naveen Gadapa, Matthew Benger, Gareth Barker, Sebastian Ourselin, James H. Cole, Thomas C. Booth

Abstract: Natural language processing (NLP) shows promise as a means to automate the labelling of hospital-scale neuroradiology magnetic resonance imaging (MRI) datasets for computer vision applications. To date, however, there has been no thorough investigation into the validity of this approach, including determining the accuracy of report labels compared to image labels as well as examining the performan… ▽ More Natural language processing (NLP) shows promise as a means to automate the labelling of hospital-scale neuroradiology magnetic resonance imaging (MRI) datasets for computer vision applications. To date, however, there has been no thorough investigation into the validity of this approach, including determining the accuracy of report labels compared to image labels as well as examining the performance of non-specialist labellers. In this work, we draw on the experience of a team of neuroradiologists who labelled over 5000 MRI neuroradiology reports as part of a project to build a dedicated deep learning-based neuroradiology report classifier. We show that, in our experience, assigning binary labels (i.e. normal vs abnormal) to images from reports alone is highly accurate. In contrast to the binary labels, however, the accuracy of more granular labelling is dependent on the category, and we highlight reasons for this discrepancy. We also show that downstream model performance is reduced when labelling of training reports is performed by a non-specialist. To allow other researchers to accelerate their research, we make our refined abnormality definitions and labelling rules available, as well as our easy-to-use radiology report labelling app which helps streamline this process. △ Less

Submitted 8 March, 2021; v1 submitted 8 July, 2020; originally announced July 2020.

arXiv:2002.06588 [pdf, other]

Automated Labelling using an Attention model for Radiology reports of MRI scans (ALARM)

Authors: David A. Wood, Jeremy Lynch, Sina Kafiabadi, Emily Guilhem, Aisha Al Busaidi, Antanas Montvila, Thomas Varsavsky, Juveria Siddiqui, Naveen Gadapa, Matthew Townend, Martin Kiik, Keena Patel, Gareth Barker, Sebastian Ourselin, James H. Cole, Thomas C. Booth

Abstract: Labelling large datasets for training high-capacity neural networks is a major obstacle to the development of deep learning-based medical imaging applications. Here we present a transformer-based network for magnetic resonance imaging (MRI) radiology report classification which automates this task by assigning image labels on the basis of free-text expert radiology reports. Our model's performance… ▽ More Labelling large datasets for training high-capacity neural networks is a major obstacle to the development of deep learning-based medical imaging applications. Here we present a transformer-based network for magnetic resonance imaging (MRI) radiology report classification which automates this task by assigning image labels on the basis of free-text expert radiology reports. Our model's performance is comparable to that of an expert radiologist, and better than that of an expert physician, demonstrating the feasibility of this approach. We make code available online for researchers to label their own MRI datasets for medical imaging applications. △ Less

Submitted 16 February, 2020; originally announced February 2020.

arXiv:1802.10588 [pdf]

IntLIM: Integration using Linear Models of metabolomics and gene expression data

Authors: Jalal K. Siddiqui, Elizabeth Baskin, Mingrui Liu, Carmen Z. Cantemir-Stone, Bofei Zhang, Russell Bonneville, Joseph P. McElroy, Kevin R. Coombes, Ewy A. Mathé

Abstract: Integration of transcriptomic and metabolomic data improves functional interpretation of disease-related metabolomic phenotypes, and facilitates discovery of putative metabolite biomarkers and gene targets. For this reason, these data are increasingly collected in large cohorts, driving a need for the development of novel methods for their integration. Of note, clinical/translational studies typic… ▽ More Integration of transcriptomic and metabolomic data improves functional interpretation of disease-related metabolomic phenotypes, and facilitates discovery of putative metabolite biomarkers and gene targets. For this reason, these data are increasingly collected in large cohorts, driving a need for the development of novel methods for their integration. Of note, clinical/translational studies typically provide snapshot gene and metabolite profiles and, oftentimes, most metabolites are not identified. Thus, in these types of studies, pathway/network approaches that take into account the complexity of gene-metabolite relationships may neither be applicable nor readily uncover novel relationships. With this in mind, we propose a simple linear modeling approach to capture phenotype-specific gene-metabolite associations, with the assumption that co-regulation patterns reflect functionally related genes and metabolites. The proposed linear model, metabolite ~ gene + phenotype + gene:phenotype, specifically evaluates whether gene-metabolite relationships differ by phenotype, by testing whether the relationship in one phenotype is significantly different from the relationship in another phenotype (via an interaction gene:phenotype p-value). Interaction p-values for all possible gene-metabolite pairs are computed and significant pairs are clustered by the directionality of associations. We implemented our approach as an R package, IntLIM, which includes a user-friendly Shiny app. We applied IntLIM to two published datasets, collected in NCI-60 cell lines and in human breast tumor and non-tumor tissue. We demonstrate that IntLIM captures relevant tumor-specific gene-metabolite associations involved in cancer-related pathways. and also uncover novel relationships that could be tested experimentally. The IntLIM R package is publicly available in GitHub (https://github.com/mathelab/IntLIM). △ Less

Submitted 28 February, 2018; originally announced February 2018.

arXiv:1801.09373 [pdf, other]

Search Based Code Generation for Machine Learning Programs

Authors: Muhammad Zubair Malik, Muhammad Nawaz, Nimrah Mustafa, Junaid Haroon Siddiqui

Abstract: Machine Learning (ML) has revamped every domain of life as it provides powerful tools to build complex systems that learn and improve from experience and data. Our key insight is that to solve a machine learning problem, data scientists do not invent a new algorithm each time, but evaluate a range of existing models with different configurations and select the best one. This task is laborious, err… ▽ More Machine Learning (ML) has revamped every domain of life as it provides powerful tools to build complex systems that learn and improve from experience and data. Our key insight is that to solve a machine learning problem, data scientists do not invent a new algorithm each time, but evaluate a range of existing models with different configurations and select the best one. This task is laborious, error-prone, and drains a large chunk of project budget and time. In this paper we present a novel framework inspired by programming by Sketching and Partial Evaluation to minimize human intervention in develo** ML solutions. We templatize machine learning algorithms to expose configuration choices as holes to be searched. We share code and computation between different algorithms, and only partially evaluate configuration space of algorithms based on information gained from initial algorithm evaluations. We also employ hierarchical and heuristic based pruning to reduce the search space. Our initial findings indicate that our approach can generate highly accurate ML models. Interviews with data scientists show that they feel our framework can eliminate sources of common errors and significantly reduce development time. △ Less

Submitted 6 February, 2018; v1 submitted 29 January, 2018; originally announced January 2018.

Comments: Search Based Software Engineering, Generating Machine Learning Code, Partial Evaluation, Futamura Projection, Sketching

arXiv:1611.08815 [pdf, other]

Did Evolution get it right? An evaluation of Near-Infrared imaging in semantic scene segmentation using deep learning

Authors: J. Rafid Siddiqui

Abstract: Animals have evolved to restrict their sensing capabilities to certain region of electromagnetic spectrum. This is surprisingly a very narrow band on a vast scale which makes one think if there is a systematic bias underlying such selective filtration. The situation becomes even more intriguing when we find a sharp cutoff point at Near-infrared point whereby almost all animal vision systems seem t… ▽ More Animals have evolved to restrict their sensing capabilities to certain region of electromagnetic spectrum. This is surprisingly a very narrow band on a vast scale which makes one think if there is a systematic bias underlying such selective filtration. The situation becomes even more intriguing when we find a sharp cutoff point at Near-infrared point whereby almost all animal vision systems seem to have a lower bound. This brings us to an interesting question: did evolution "intentionally" performed such a restriction in order to evolve higher visual cognition? In this work this question is addressed by experimenting with Near-infrared images for their potential applicability in higher visual processing such as semantic segmentation. A modified version of Fully Convolutional Networks are trained on NIR images and RGB images respectively and compared for their respective effectiveness in the wake of semantic segmentation. The results from the experiments show that visible part of the spectrum alone is sufficient for the robust semantic segmentation of the indoor as well as outdoor scenes. △ Less

Submitted 27 November, 2016; originally announced November 2016.

Showing 1–10 of 10 results for author: Siddiqui, J