-
Harnessing The Power of Attention For Patch-Based Biomedical Image Classification
Authors:
Gousia Habib,
Shaima Qureshi,
Malik ishfaq
Abstract:
Biomedical image analysis is of paramount importance for the advancement of healthcare and medical research. Although conventional convolutional neural networks (CNNs) are frequently employed in this domain, facing limitations in capturing intricate spatial and temporal relationships at the pixel level due to their reliance on fixed-sized windows and immutable filter weights post-training. These c…
▽ More
Biomedical image analysis is of paramount importance for the advancement of healthcare and medical research. Although conventional convolutional neural networks (CNNs) are frequently employed in this domain, facing limitations in capturing intricate spatial and temporal relationships at the pixel level due to their reliance on fixed-sized windows and immutable filter weights post-training. These constraints impede their ability to adapt to input fluctuations and comprehend extensive long-range contextual information. To overcome these challenges, a novel architecture based on self-attention mechanisms as an alternative to conventional CNNs.The proposed model utilizes attention-based mechanisms to surpass the limitations of CNNs. The key component of our strategy is the combination of non-overlap** (vanilla patching) and novel overlapped Shifted Patching Techniques (S.P.T.s), which enhances the model's capacity to capture local context and improves generalization. Additionally, we introduce the Lancoz5 interpolation technique, which adapts variable image sizes to higher resolutions, facilitating better analysis of high-resolution biomedical images. Our methods address critical challenges faced by attention-based vision models, including inductive bias, weight sharing, receptive field limitations, and efficient data handling. Experimental evidence shows the effectiveness of proposed model in generalizing to various biomedical imaging tasks. The attention-based model, combined with advanced data augmentation methodologies, exhibits robust modeling capabilities and superior performance compared to existing approaches. The integration of S.P.T.s significantly enhances the model's ability to capture local context, while the Lancoz5 interpolation technique ensures efficient handling of high-resolution images.
△ Less
Submitted 9 June, 2024; v1 submitted 1 April, 2024;
originally announced April 2024.
-
Exploring the Efficacy of Group-Normalization in Deep Learning Models for Alzheimer's Disease Classification
Authors:
Gousia Habib,
Ishfaq Ahmed Malik,
Jameel Ahmad,
Imtiaz Ahmed,
Shaima Qureshi
Abstract:
Batch Normalization is an important approach to advancing deep learning since it allows multiple networks to train simultaneously. A problem arises when normalizing along the batch dimension because B.N.'s error increases significantly as batch size shrinks because batch statistics estimates are inaccurate. As a result, computer vision tasks like detection, segmentation, and video, which require t…
▽ More
Batch Normalization is an important approach to advancing deep learning since it allows multiple networks to train simultaneously. A problem arises when normalizing along the batch dimension because B.N.'s error increases significantly as batch size shrinks because batch statistics estimates are inaccurate. As a result, computer vision tasks like detection, segmentation, and video, which require tiny batches based on memory consumption, aren't suitable for using Batch Normalization for larger model training and feature transfer. Here, we explore Group Normalization as an easy alternative to using Batch Normalization A Group Normalization is a channel normalization method in which each group is divided into different channels, and the corresponding mean and variance are calculated for each group. Group Normalization computations are accurate across a wide range of batch sizes and are independent of batch size. When trained using a large ImageNet database on ResNet-50, GN achieves a very low error rate of 10.6% compared to Batch Normalization. when a smaller batch size of only 2 is used. For usual batch sizes, the performance of G.N. is comparable to that of Batch Normalization, but at the same time, it outperforms other normalization techniques. Implementing Group Normalization as a direct alternative to B.N to combat the serious challenges faced by the Batch Normalization in deep learning models with comparable or improved classification accuracy. Additionally, Group Normalization can be naturally transferred from the pre-training to the fine-tuning phase. .
△ Less
Submitted 1 April, 2024;
originally announced April 2024.
-
A New Deep Hybrid Boosted and Ensemble Learning-based Brain Tumor Analysis using MRI
Authors:
Mirza Mumtaz Zahoor,
Shahzad Ahmad Qureshi,
Saddam Hussain Khan,
Asifullah Khan
Abstract:
Brain tumors analysis is important in timely diagnosis and effective treatment to cure patients. Tumor analysis is challenging because of tumor morphology like size, location, texture, and heteromorphic appearance in the medical images. In this regard, a novel two-phase deep learning-based framework is proposed to detect and categorize brain tumors in magnetic resonance images (MRIs). In the first…
▽ More
Brain tumors analysis is important in timely diagnosis and effective treatment to cure patients. Tumor analysis is challenging because of tumor morphology like size, location, texture, and heteromorphic appearance in the medical images. In this regard, a novel two-phase deep learning-based framework is proposed to detect and categorize brain tumors in magnetic resonance images (MRIs). In the first phase, a novel deep boosted features and ensemble classifiers (DBF-EC) scheme is proposed to detect tumor MRI images from healthy individuals effectively. The deep boosted feature space is achieved through the customized and well-performing deep convolutional neural networks (CNNs), and consequently, fed into the ensemble of machine learning (ML) classifiers. While in the second phase, a new hybrid features fusion-based brain tumor classification approach is proposed, comprised of dynamic-static feature and ML classifier to categorize different tumor types. The dynamic features are extracted from the proposed BRAIN-RENet CNN, which carefully learns heteromorphic and inconsistent behavior of various tumors, while the static features are extracted using HOG. The effectiveness of the proposed two-phase brain tumor analysis framework is validated on two standard benchmark datasets; collected from Kaggle and Figshare containing different types of tumor, including glioma, meningioma, pituitary, and normal images. Experimental results proved that the proposed DBF-EC detection scheme outperforms and achieved accuracy (99.56%), precision (0.9991), recall (0.9899), F1-Score (0.9945), MCC (0.9892), and AUC-PR (0.9990). While the classification scheme, the joint employment of the deep features fusion of proposed BRAIN-RENet and HOG features improves performance significantly in terms of recall (0.9913), precision (0.9906), F1-Score (0.9909), and accuracy (99.20%) on diverse datasets.
△ Less
Submitted 11 February, 2022; v1 submitted 14 January, 2022;
originally announced January 2022.
-
Sustainable Development Through a Mobile Application for a Community Clinic
Authors:
Martina A. Clarke,
Sajda Qureshi,
Timi Barone,
John R. Windle
Abstract:
Implementing Information and Communication Technology (ICT) solutions can alleviate pressing problems in society and are a central component of sustainable development. Often, healthcare addresses the symptoms without approaching the socioeconomic limiters that can lead to reduced individual economic freedoms from receiving healthcare. This paper investigates the question: How can technology and t…
▽ More
Implementing Information and Communication Technology (ICT) solutions can alleviate pressing problems in society and are a central component of sustainable development. Often, healthcare addresses the symptoms without approaching the socioeconomic limiters that can lead to reduced individual economic freedoms from receiving healthcare. This paper investigates the question: How can technology and training interventions enable clinicians to offer care that addresses the socioeconomic limitations of their patients? This paper observes the implementation of a mobile app designed to offer people who cannot access health resources in Omaha, Nebraska, a city in the Midwestern United States. This study follows the design science and action research approach, with clinicians participating in develo** the mobile app. As a result of COVID, patients no longer have access to the free clinic because it was shut down. The app is available to the broader community needing basic resources to stay healthy. Through sets of application revisions and observations of usage, this paper arrives at insights into how such applications can support multi-ethnic and underserved communities. The contribution of this paper is to provide contextually specific and rich descriptions of how to implement sustainable ICT solutions to meet the information needs of patients in underserved communities.
△ Less
Submitted 22 August, 2021;
originally announced August 2021.
-
Assessing the Effectiveness of Syntactic Structure to Learn Code Edit Representations
Authors:
Syed Arbaaz Qureshi,
Sonu Mehta,
Ranjita Bhagwan,
Rahul Kumar
Abstract:
In recent times, it has been shown that one can use code as data to aid various applications such as automatic commit message generation, automatic generation of pull request descriptions and automatic program repair. Take for instance the problem of commit message generation. Treating source code as a sequence of tokens, state of the art techniques generate commit messages using neural machine tr…
▽ More
In recent times, it has been shown that one can use code as data to aid various applications such as automatic commit message generation, automatic generation of pull request descriptions and automatic program repair. Take for instance the problem of commit message generation. Treating source code as a sequence of tokens, state of the art techniques generate commit messages using neural machine translation models. However, they tend to ignore the syntactic structure of programming languages.
Previous work, i.e., code2seq has used structural information from Abstract Syntax Tree (AST) to represent source code and they use it to automatically generate method names. In this paper, we elaborate upon this state of the art approach and modify it to represent source code edits. We determine the effect of using such syntactic structure for the problem of classifying code edits. Inspired by the code2seq approach, we evaluate how using structural information from AST, i.e., paths between AST leaf nodes can help with the task of code edit classification on two datasets of fine-grained syntactic edits.
Our experiments shows that attempts of adding syntactic structure does not result in any improvements over less sophisticated methods. The results suggest that techniques such as code2seq, while promising, have a long way to go before they can be generically applied to learning code edit representations. We hope that these results will benefit other researchers and inspire them to work further on this problem.
△ Less
Submitted 10 June, 2021;
originally announced June 2021.
-
Transfer Learning with Ensembles of Deep Neural Networks for Skin Cancer Detection in Imbalanced Data Sets
Authors:
Aqsa Saeed Qureshi,
Teemu Roos
Abstract:
Several machine learning techniques for accurate detection of skin cancer from medical images have been reported. Many of these techniques are based on pre-trained convolutional neural networks (CNNs), which enable training the models based on limited amounts of training data. However, the classification accuracy of these models still tends to be severely limited by the scarcity of representative…
▽ More
Several machine learning techniques for accurate detection of skin cancer from medical images have been reported. Many of these techniques are based on pre-trained convolutional neural networks (CNNs), which enable training the models based on limited amounts of training data. However, the classification accuracy of these models still tends to be severely limited by the scarcity of representative images from malignant tumours. We propose a novel ensemble-based CNN architecture where multiple CNN models, some of which are pre-trained and some are trained only on the data at hand, along with auxiliary data in the form of metadata associated with the input images, are combined using a meta-learner. The proposed approach improves the model's ability to handle limited and imbalanced data. We demonstrate the benefits of the proposed technique using a dataset with 33126 dermoscopic images from 2056 patients. We evaluate the performance of the proposed technique in terms of the F1-measure, area under the ROC curve (AUC-ROC), and area under the PR-curve (AUC-PR), and compare it with that of seven different benchmark methods, including two recent CNN-based techniques. The proposed technique compares favourably in terms of all the evaluation metrics.
△ Less
Submitted 17 May, 2021; v1 submitted 22 March, 2021;
originally announced March 2021.
-
Extracting Signals of Higgs Boson From Background Noise Using Deep Neural Networks
Authors:
Muhammad Abbas,
Asifullah Khan,
Aqsa Saeed Qureshi,
Muhammad Waleed Khan
Abstract:
Higgs boson is a fundamental particle, and the classification of Higgs signals is a well-known problem in high energy physics. The identification of the Higgs signal is a challenging task because its signal has a resemblance to the background signals. This study proposes a Higgs signal classification using a novel combination of random forest, auto encoder and deep auto encoder to build a robust a…
▽ More
Higgs boson is a fundamental particle, and the classification of Higgs signals is a well-known problem in high energy physics. The identification of the Higgs signal is a challenging task because its signal has a resemblance to the background signals. This study proposes a Higgs signal classification using a novel combination of random forest, auto encoder and deep auto encoder to build a robust and generalized Higgs boson prediction system to discriminate the Higgs signal from the background noise. The proposed ensemble technique is based on achieving diversity in the decision space, and the results show good discrimination power on the private leaderboard; achieving an area under the Receiver Operating Characteristic curve of 0.9 and an Approximate Median Significance score of 3.429.
△ Less
Submitted 16 October, 2020;
originally announced October 2020.
-
Scalable Blocking for Very Large Databases
Authors:
Andrew Borthwick,
Stephen Ash,
Bin Pang,
Shehzad Qureshi,
Timothy Jones
Abstract:
In the field of database deduplication, the goal is to find approximately matching records within a database. Blocking is a typical stage in this process that involves cheaply finding candidate pairs of records that are potential matches for further processing. We present here Hashed Dynamic Blocking, a new approach to blocking designed to address datasets larger than those studied in most prior w…
▽ More
In the field of database deduplication, the goal is to find approximately matching records within a database. Blocking is a typical stage in this process that involves cheaply finding candidate pairs of records that are potential matches for further processing. We present here Hashed Dynamic Blocking, a new approach to blocking designed to address datasets larger than those studied in most prior work. Hashed Dynamic Blocking (HDB) extends Dynamic Blocking, which leverages the insight that rare matching values and rare intersections of values are predictive of a matching relationship. We also present a novel use of Locality Sensitive Hashing (LSH) to build blocking key values for huge databases with a convenient configuration to control the trade-off between precision and recall. HDB achieves massive scale by minimizing data movement, using compact block representation, and greedily pruning ineffective candidate blocks using a Count-min Sketch approximate counting data structure. We benchmark the algorithm by focusing on real-world datasets in excess of one million rows, demonstrating that the algorithm displays linear time complexity scaling in this range. Furthermore, we execute HDB on a 530 million row industrial dataset, detecting 68 billion candidate pairs in less than three hours at a cost of $307 on a major cloud service.
△ Less
Submitted 19 August, 2020;
originally announced August 2020.
-
Insight into Potential Well Based Nanoscale FDSOI MOSFET Using Doped Silicon Tubs- A Simulation and Device Physics Based Study: Part II: Scalability to 10 nm Gate Length
Authors:
Shruti Mehrotra,
S. Qureshi
Abstract:
The doped silicon regions (tubs) in PWFDSOI MOSFET cause significant reduction in OFF current by reducing the number of carriers contributing to the OFF current. The emphasis of the simulation and device physics study on PWFDSOI MOSFET presented in this paper is on the scalability of the device to 10 nm gate length and its related information. A high ION /IOFF ratio of 7.6 x 10^5 and subthreshold…
▽ More
The doped silicon regions (tubs) in PWFDSOI MOSFET cause significant reduction in OFF current by reducing the number of carriers contributing to the OFF current. The emphasis of the simulation and device physics study on PWFDSOI MOSFET presented in this paper is on the scalability of the device to 10 nm gate length and its related information. A high ION /IOFF ratio of 7.6 x 10^5 and subthreshold swing of 87 mV/decade were achieved in 10 nm gate length PWFDSOI MOSFET. The study was performed on devices with unstrained silicon channel.
△ Less
Submitted 5 July, 2020;
originally announced July 2020.
-
Insight into Potential Well Based Nanoscale FDSOI MOSFET Using Doped Silicon Tubs- A Simulation and Device Physics Based Study: Part I: Theory and Methodology
Authors:
Shruti Mehrotra,
S. Qureshi
Abstract:
A novel planar device having doped silicon regions (tubs) under the source and drain of an FDSOI MOSFET is reported at 20 nm gate length. The doped silicon regions result in formation of potential wells (PW) in the source and drain regions of FDSOI MOSFET and thus, the device being called as Potential Well Based FDSOI MOSFET (PWFDSOI MOSFET). Simulation and device physics study on PWFDSOI MOSFET s…
▽ More
A novel planar device having doped silicon regions (tubs) under the source and drain of an FDSOI MOSFET is reported at 20 nm gate length. The doped silicon regions result in formation of potential wells (PW) in the source and drain regions of FDSOI MOSFET and thus, the device being called as Potential Well Based FDSOI MOSFET (PWFDSOI MOSFET). Simulation and device physics study on PWFDSOI MOSFET showed reduction in the OFF current of the device by orders of magnitude. A low IOF F of 22 pA/um, high ION /IOF F ratio of 1.5 x 107 and subthreshold swing of 76 mV/decade were achieved in 20 nm gate length PWFDSOI MOSFET. The study was performed on devices with unstrained silicon channel.
△ Less
Submitted 5 July, 2020;
originally announced July 2020.
-
Wind Speed Prediction using Deep Ensemble Learning with a Jet-like Architecture
Authors:
Aqsa Saeed Qureshi,
Asifullah Khan,
Muhammad Waleed Khan
Abstract:
The wind is one of the most increasingly used renewable energy resources. Accurate and reliable forecast of wind speed is necessary for efficient power production; however, it is not an easy task because it depends upon meteorological features of the surrounding region. Deep learning is extensively used these days for performing feature extraction. It has also been observed that the integration of…
▽ More
The wind is one of the most increasingly used renewable energy resources. Accurate and reliable forecast of wind speed is necessary for efficient power production; however, it is not an easy task because it depends upon meteorological features of the surrounding region. Deep learning is extensively used these days for performing feature extraction. It has also been observed that the integration of several learning models, known as ensemble learning, generally gives better performance compared to a single model. The design of wings, tail, and nose of a jet improves the aerodynamics resulting in a smooth and controlled flight of the jet against the variations of the air currents. Inspired by the shape and working of a jet, a novel Deep Ensemble Learning using Jet-like Architecture (DEL-Jet) technique is proposed to enhance the diversity and robustness of a learning system against the variations in the input space. The diverse feature spaces of the base-regressors are exploited using the jet-like ensemble architecture. Two Convolutional Neural Networks (as jet wings) and one deep Auto-Encoder (as jet tail) are used to extract the diverse feature spaces from the input data. After that, nonlinear PCA (as jet main body) is employed to reduce the dimensionality of extracted feature space. Finally, both the reduced and the original feature spaces are exploited to train the meta-regressor (as jet nose) for forecasting the wind speed. The performance of the proposed DEL-Jet technique is evaluated for ten independent runs and shows that the deep and jet-like architecture helps in improving the robustness and generalization of the learning system.
△ Less
Submitted 20 March, 2020; v1 submitted 28 February, 2020;
originally announced February 2020.
-
Malware Classification using Deep Learning based Feature Extraction and Wrapper based Feature Selection Technique
Authors:
Muhammad Furqan Rafique,
Muhammad Ali,
Aqsa Saeed Qureshi,
Asifullah Khan,
Anwar Majid Mirza
Abstract:
In the case of malware analysis, categorization of malicious files is an essential part after malware detection. Numerous static and dynamic techniques have been reported so far for categorizing malware. This research presents a deep learning-based malware detection (DLMD) technique based on static methods for classifying different malware families. The proposed DLMD technique uses both the byte a…
▽ More
In the case of malware analysis, categorization of malicious files is an essential part after malware detection. Numerous static and dynamic techniques have been reported so far for categorizing malware. This research presents a deep learning-based malware detection (DLMD) technique based on static methods for classifying different malware families. The proposed DLMD technique uses both the byte and ASM files for feature engineering, thus classifying malware families. First, features are extracted from byte files using two different Deep Convolutional Neural Networks (CNN). After that, essential and discriminative opcode features are selected using a wrapper-based mechanism, where Support Vector Machine (SVM) is used as a classifier. The idea is to construct a hybrid feature space by combining the different feature spaces to overcome the shortcoming of particular feature space and thus, reduce the chances of missing a malware. Finally, the hybrid feature space is used to train a Multilayer Perceptron, which classifies all nine different malware families. Experimental results show that proposed DLMD technique achieves log-loss of 0.09 for ten independent runs. Moreover, the proposed DLMD technique's performance is compared against different classifiers and shows its effectiveness in categorizing malware. The relevant code and database can be found at https://github.com/cyberhunters/Malware-Detection-Using-Machine-Learning.
△ Less
Submitted 26 December, 2020; v1 submitted 24 October, 2019;
originally announced October 2019.
-
Performance Considerations of Thin Ferroelectrics (~10 nm HfO2, ~20 nm PZT) FDSOI NCFETs for Digital Circuits at Reduced Power Consumption
Authors:
Shruti Mehrotra,
S. Qureshi
Abstract:
The paper presents simulation study of thin ferroelectrics (Si doped HfO2, PZT) PGP FDSOI NCFETs at circuit level for high performance, low VDD low-power digital circuits. The baseline PGP FDSOI MOSFET has 20 nm metal gate length with supply voltage varying from 0.5 V to 0.9 V. The circuits studied were 3-stage CMOS ring oscillator, NAND-2 and NOR-2 gates at a frequency of 20 GHz. The paper shows…
▽ More
The paper presents simulation study of thin ferroelectrics (Si doped HfO2, PZT) PGP FDSOI NCFETs at circuit level for high performance, low VDD low-power digital circuits. The baseline PGP FDSOI MOSFET has 20 nm metal gate length with supply voltage varying from 0.5 V to 0.9 V. The circuits studied were 3-stage CMOS ring oscillator, NAND-2 and NOR-2 gates at a frequency of 20 GHz. The paper shows that HfO2 FDSOI NCFET based NAND-2 gates can provide significant reduction in average power consumption, which was ~66% that of baseline FDSOI MOSFET based NAND-2 gates for comparable performance. For the same performance, the average power consumption for PZT FDSOI NCFET based NAND-2 gate was ~86% that of baseline FDSOI MOSFET based NAND-2 gate. The power-delay product of HfO2 FDSOI NCFET based gates was found to be ~24% lower than baseline FDSOI MOSFET based gates and that of PZT FDSOI NCFET based gates was found to be ~21% less than that of baseline FDSOI MOSFET based gates. The performance of HfO2 FDSOI NCFET based gates with increased fan-in and fan-out was also found to be superior to PZT FDSOI NCFET based gates and baseline FDSOI MOSFET based gates.
△ Less
Submitted 12 June, 2019;
originally announced June 2019.
-
The Verbal and Non Verbal Signals of Depression -- Combining Acoustics, Text and Visuals for Estimating Depression Level
Authors:
Syed Arbaaz Qureshi,
Mohammed Hasanuzzaman,
Sriparna Saha,
Gaƫl Dias
Abstract:
Depression is a serious medical condition that is suffered by a large number of people around the world. It significantly affects the way one feels, causing a persistent lowering of mood. In this paper, we propose a novel attention-based deep neural network which facilitates the fusion of various modalities. We use this network to regress the depression level. Acoustic, text and visual modalities…
▽ More
Depression is a serious medical condition that is suffered by a large number of people around the world. It significantly affects the way one feels, causing a persistent lowering of mood. In this paper, we propose a novel attention-based deep neural network which facilitates the fusion of various modalities. We use this network to regress the depression level. Acoustic, text and visual modalities have been used to train our proposed network. Various experiments have been carried out on the benchmark dataset, namely, Distress Analysis Interview Corpus - a Wizard of Oz (DAIC-WOZ). From the results, we empirically justify that the fusion of all three modalities helps in giving the most accurate estimation of depression level. Our proposed approach outperforms the state-of-the-art by 7.17% on root mean squared error (RMSE) and 8.08% on mean absolute error (MAE).
△ Less
Submitted 2 April, 2019;
originally announced April 2019.
-
A Recent Survey on the Applications of Genetic Programming in Image Processing
Authors:
Asifullah Khan,
Aqsa Saeed Qureshi,
Noorul Wahab,
Mutawara Hussain,
Muhammad Yousaf Hamza
Abstract:
Genetic Programming (GP) has been primarily used to tackle optimization, classification, and feature selection related tasks. The widespread use of GP is due to its flexible and comprehensible tree-type structure. Similarly, research is also gaining momentum in the field of Image Processing, because of its promising results over vast areas of applications ranging from medical Image Processing to m…
▽ More
Genetic Programming (GP) has been primarily used to tackle optimization, classification, and feature selection related tasks. The widespread use of GP is due to its flexible and comprehensible tree-type structure. Similarly, research is also gaining momentum in the field of Image Processing, because of its promising results over vast areas of applications ranging from medical Image Processing to multispectral imaging. Image Processing is mainly involved in applications such as computer vision, pattern recognition, image compression, storage, and medical diagnostics. This universal nature of images and their associated algorithm, i.e., complexities, gave an impetus to the exploration of GP. GP has thus been used in different ways for Image Processing since its inception. Many interesting GP techniques have been developed and employed in the field of Image Processing, and consequently, we aim to provide the research community an extensive view of these techniques. This survey thus presents the diverse applications of GP in Image Processing and provides useful resources for further research. Also, the comparison of different parameters used in different applications of Image Processing is summarized in tabular form. Moreover, analysis of the different parameters used in Image Processing related tasks is carried-out to save the time needed in the future for evaluating the parameters of GP. As more advancement is made in GP methodologies, its success in solving complex tasks, not only in Image Processing but also in other fields, may increase. Additionally, guidelines are provided for applying GP in Image Processing related tasks, the pros and cons of GP techniques are discussed, and some future directions are also set.
△ Less
Submitted 25 June, 2020; v1 submitted 18 January, 2019;
originally announced January 2019.
-
A Survey of the Recent Architectures of Deep Convolutional Neural Networks
Authors:
Asifullah Khan,
Anabia Sohail,
Umme Zahoora,
Aqsa Saeed Qureshi
Abstract:
Deep Convolutional Neural Network (CNN) is a special type of Neural Networks, which has shown exemplary performance on several competitions related to Computer Vision and Image Processing. Some of the exciting application areas of CNN include Image Classification and Segmentation, Object Detection, Video Processing, Natural Language Processing, and Speech Recognition. The powerful learning ability…
▽ More
Deep Convolutional Neural Network (CNN) is a special type of Neural Networks, which has shown exemplary performance on several competitions related to Computer Vision and Image Processing. Some of the exciting application areas of CNN include Image Classification and Segmentation, Object Detection, Video Processing, Natural Language Processing, and Speech Recognition. The powerful learning ability of deep CNN is primarily due to the use of multiple feature extraction stages that can automatically learn representations from the data. The availability of a large amount of data and improvement in the hardware technology has accelerated the research in CNNs, and recently interesting deep CNN architectures have been reported. Several inspiring ideas to bring advancements in CNNs have been explored, such as the use of different activation and loss functions, parameter optimization, regularization, and architectural innovations. However, the significant improvement in the representational capacity of the deep CNN is achieved through architectural innovations. Notably, the ideas of exploiting spatial and channel information, depth and width of architecture, and multi-path information processing have gained substantial attention. Similarly, the idea of using a block of layers as a structural unit is also gaining popularity. This survey thus focuses on the intrinsic taxonomy present in the recently reported deep CNN architectures and, consequently, classifies the recent innovations in CNN architectures into seven different categories. These seven categories are based on spatial exploitation, depth, multi-path, width, feature-map exploitation, channel boosting, and attention. Additionally, the elementary understanding of CNN components, current challenges, and applications of CNN are also provided.
△ Less
Submitted 10 May, 2020; v1 submitted 17 January, 2019;
originally announced January 2019.
-
Adaptive Transfer Learning in Deep Neural Networks: Wind Power Prediction using Knowledge Transfer from Region to Region and Between Different Task Domains
Authors:
Aqsa Saeed Qureshi,
Asifullah Khan
Abstract:
Transfer Learning (TL) in Deep Neural Networks is gaining importance because in most of the applications, the labeling of data is costly and time-consuming. Additionally, TL also provides an effective weight initialization strategy for Deep Neural Networks . This paper introduces the idea of Adaptive Transfer Learning in Deep Neural Networks (ATL-DNN) for wind power prediction. Specifically, we sh…
▽ More
Transfer Learning (TL) in Deep Neural Networks is gaining importance because in most of the applications, the labeling of data is costly and time-consuming. Additionally, TL also provides an effective weight initialization strategy for Deep Neural Networks . This paper introduces the idea of Adaptive Transfer Learning in Deep Neural Networks (ATL-DNN) for wind power prediction. Specifically, we show in case of wind power prediction that adaptive TL of Deep Neural Networks system can be adaptively modified as regards training on a different wind farm is concerned. The proposed ATL-DNN technique is tested for short-term wind power prediction, where continuously arriving information has to be exploited. Adaptive TL not only helps in providing good weight initialization, but is also helpful to utilize the incoming data for effective learning. Additionally, the proposed ATL-DNN technique is shown to transfer knowledge between different task domains (wind power to wind speed prediction) and from one region to another region. The simulation results show that the proposed ATL-DNN technique achieves average values of 0.0637,0.0986, and 0.0984 for the Mean-Absolute-Error, Root-Mean-Squared-Error, and Standard-Deviation-Error, respectively.
△ Less
Submitted 20 December, 2018; v1 submitted 30 October, 2018;
originally announced October 2018.
-
Solar wind driven electrostatic instabilities with generalized r,q distribution function
Authors:
Warda Nasir,
Zahida Ehsan,
M. N. S. Qureshi,
H. A. Shah
Abstract:
Using Boltzmann Vlasov kinetic model with the flat-top non-Maxwellian distributed electrons and ions, a velocity power law energetic tail, known as the generalized r, q distribution, a current-less electrostatic instability namely ion acoustic which is driven by a stream of solar wind plasma is studied here. The instability threshold is affected and depends upon the spectral indices r and q. It is…
▽ More
Using Boltzmann Vlasov kinetic model with the flat-top non-Maxwellian distributed electrons and ions, a velocity power law energetic tail, known as the generalized r, q distribution, a current-less electrostatic instability namely ion acoustic which is driven by a stream of solar wind plasma is studied here. The instability threshold is affected and depends upon the spectral indices r and q. It is found that the growth rate increases with the decrease of spectral index. Moreover, such kinetic instability has also been discussed for a three species electron-ion-dust plasma using the generalized r, q distribution function. Such case is of interest when the solar wind is streaming through the cometary plasma in the presence of interstellar dust and excites electrostatic instabilities. In the limits of phase velocity of the waves larger and smaller than the thermal velocity of dust particles, the dispersion properties and growth rate of dust-acoustic mode are calculated analytically and has been plotted for different values of the spectral indices.
△ Less
Submitted 27 November, 2017;
originally announced November 2017.
-
MODLEACH: A Variant of LEACH for WSNs
Authors:
D. Mahmood,
N. Javaid,
S. Mahmood,
S. Qureshi,
A. M. Memon,
T. Zaman
Abstract:
Wireless sensor networks are appearing as an emerging need for mankind. Though, Such networks are still in research phase however, they have high potential to be applied in almost every field of life. Lots of research is done and a lot more is awaiting to be standardized. In this work, cluster based routing in wireless sensor networks is studied precisely. Further, we modify one of the most promin…
▽ More
Wireless sensor networks are appearing as an emerging need for mankind. Though, Such networks are still in research phase however, they have high potential to be applied in almost every field of life. Lots of research is done and a lot more is awaiting to be standardized. In this work, cluster based routing in wireless sensor networks is studied precisely. Further, we modify one of the most prominent wireless sensor network's routing protocol "LEACH" as modified LEACH (MODLEACH) by introducing \emph{efficient cluster head replacement scheme} and \emph{dual transmitting power levels}. Our modified LEACH, in comparison with LEACH out performs it using metrics of cluster head formation, through put and network life. Afterwards, hard and soft thresholds are implemented on modified LEACH (MODLEACH) that boast the performance even more. Finally a brief performance analysis of LEACH, Modified LEACH (MODLEACH), MODLEACH with hard threshold (MODLEACHHT) and MODLEACH with soft threshold (MODLEACHST) is undertaken considering metrics of throughput, network life and cluster head replacements.
△ Less
Submitted 26 July, 2013;
originally announced July 2013.