Search | arXiv e-print repository

arXiv:2406.07716 [pdf]

Unleashing the Power of Transfer Learning Model for Sophisticated Insect Detection: Revolutionizing Insect Classification

Authors: Md. Mahmudul Hasan, SM Shaqib, Ms. Sharmin Akter, Rabiul Alam, Afraz Ul Haque, Shahrun akter khushbu

Abstract: The purpose of the Insect Detection System for Crop and Plant Health is to keep an eye out for and identify insect infestations in farming areas. By utilizing cutting-edge technology like computer vision and machine learning, the system seeks to identify hazardous insects early and accurately. This would enable prompt response to save crops and maintain optimal plant health. The method of this study includes data acquisition, preprocessing, data splitting, model implementation, and model evaluation. Four models were used in this study: MobileNetV2, ResNet152V2, Xception, and a custom CNN. In order to categorize insect photos, a Convolutional Neural Network (CNN) based on the ResNet152V2 architecture is constructed and evaluated in this work. Achieving 99% training accuracy and 97% testing accuracy, ResNet152V2 demonstrates superior performance among the four implemented models. The results highlight its potential for real-world applications in insect classification and entomology studies, emphasizing efficiency and accuracy. To ensure food security and sustain agricultural output globally, finding insects is crucial. Cutting-edge technology, such as ResNet152V2 models, greatly influences automating and improving the accuracy of insect identification. Efficient insect detection not only minimizes crop losses but also enhances agricultural productivity, contributing to sustainable food production. This underscores the pivotal role of technology in addressing challenges related to global food security.

Submitted 11 June, 2024; originally announced June 2024.

arXiv:2405.07349 [pdf, other]

WeedScout: Real-Time Autonomous blackgrass Classification and Mapping using dedicated hardware

Authors: Matthew Gazzard, Helen Hicks, Isibor Kennedy Ihianle, Jordan J. Bird, Md Mahmudul Hasan, Pedro Machado

Abstract: Blackgrass (Alopecurus myosuroides) is a competitive weed that has wide-ranging impacts on food security by reducing crop yields and increasing cultivation costs. In addition to the financial burden on agriculture, the application of herbicides as a preventive to blackgrass can negatively affect access to clean water and sanitation. The WeedScout project introduces a Real-Time Autonomous Black-Grass Classification and Mapping (RT-ABGCM), a cutting-edge solution tailored for real-time detection of blackgrass, for precision weed management practices. Leveraging Artificial Intelligence (AI) algorithms, the system processes live image feeds, infers blackgrass density, and covers two stages of maturation. The research investigates the deployment of You Only Look Once (YOLO) models, specifically the streamlined YOLOv8 and YOLO-NAS, accelerated at the edge with the NVIDIA Jetson Nano (NJN). By optimising inference speed and model performance, the project advances the integration of AI into agricultural practices, offering potential solutions to challenges such as herbicide resistance and environmental impact. Additionally, two datasets and model weights are made available to the research community, facilitating further advancements in weed detection and precision farming technologies.

Submitted 12 May, 2024; originally announced May 2024.

arXiv:2404.18062 [pdf, other]

Compressed Image Captioning using CNN-based Encoder-Decoder Framework

Authors: Md Alif Rahman Ridoy, M Mahmud Hasan, Shovon Bhowmick

Abstract: In today's world, image processing plays a crucial role across various fields, from scientific research to industrial applications. But one particularly exciting application is image captioning. The potential impact of effective image captioning is vast. It can significantly boost the accuracy of search engines, making it easier to find relevant information. Moreover, it can greatly enhance accessibility for visually impaired individuals, providing them with a more immersive experience of digital content. However, despite its promise, image captioning presents several challenges. One major hurdle is extracting meaningful visual information from images and transforming it into coherent language. This requires bridging the gap between the visual and linguistic domains, a task that demands sophisticated algorithms and models. Our project is focused on addressing these challenges by developing an automatic image captioning architecture that combines the strengths of convolutional neural networks (CNNs) and encoder-decoder models. The CNN model is used to extract the visual features from images, and later, with the help of the encoder-decoder framework, captions are generated. We also did a performance comparison where we delved into the realm of pre-trained CNN models, experimenting with multiple architectures to understand their performance variations. In our quest for optimization, we also explored the integration of frequency regularization techniques to compress the "AlexNet" and "EfficientNetB0" models. We aimed to see if the compressed models could maintain their effectiveness in generating image captions while being more resource-efficient.

Submitted 27 April, 2024; originally announced April 2024.

arXiv:2404.15168 [pdf, other]

Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech

Authors: Hasmot Ali, Md. Fahad Hossain, Md. Mehedi Hasan, Sheikh Abujar, Sheak Rashed Haider Noori

Abstract: Voice-based applications are ruling over the era of automation because speech has a lot of factors that determine a speaker's information as well as speech. Modern Automatic Speech Recognition (ASR) is a blessing in the field of Human-Computer Interaction (HCI) for efficient communication among humans and devices using Artificial Intelligence technology. Speech is one of the easiest mediums of communication because it has a lot of identical features for different speakers. Nowadays it is possible to determine speakers and their identity using their speech in terms of speaker recognition. In this paper, we present a method that provides a speaker's geographical identity in a certain region using continuous Bengali speech. We consider eight different divisions of Bangladesh as the geographical region. We applied the Mel Frequency Cepstral Coefficient (MFCC) and Delta features on an Artificial Neural Network to classify a speaker's division. We performed some preprocessing tasks like noise reduction and 8-10 second segmentation of raw audio before feature extraction. We used our dataset of more than 45 hours of audio data from 633 individual male and female speakers. We recorded the highest accuracy of 85.44%.

Submitted 18 April, 2024; originally announced April 2024.

arXiv:2404.13639 [pdf]

Performance Analysis for Deterministic System using Time Sensitive Network

Authors: Md Mehedi Hasan, He Feng

Abstract: Modern technology necessitates the use of dependable, fast, and inexpensive networks as the backbone for data transmission. Switched Ethernet coupled with the Time Sensitive Networking

Submitted 21 April, 2024; originally announced April 2024.

Comments: No Comment

MSC Class: Review

arXiv:2402.17028 [pdf]

Separation of biocrude produced from hydrothermal liquefaction of faecal sludge without any solvent

Authors: H M Fairooz Adnan, Md Khalekuzzaman, Md. Atik Fayshal, Md. Mehedi Hasan

Abstract: In this study, faecal sludge is used as raw biomass due to its abundance, low cost, and easy availability. After hydrothermal liquefaction (HTL), product separation becomes challenging. Existing studies typically separate the aqueous and biocrude oil products of the HTL process using an organic solvent, which is quite expensive. Focusing on this critical issue, this study aims to separate the biocrude and aqueous phase without using any solvent by gravity separation technique. FTIR analysis showed that centrifuging at 6000 rpm gave only partial separation of the biocrude and aqueous phase (AP). However, at 9000 rpm, FTIR analysis showed that biocrude samples included aliphatic hydrocarbons, phenols, and esters, while no signs of any carbon chain were found in the AP, which indicated the products were successfully separated. The separated crude portion had the higher A-Factor (0.68) and lower C-Factor (0.58) value, which indicates the oil quality was immature grade of lower kerogen type II (i.e., moderate oil-prone). This low-cost technique can be economically advantageous for commercial-scale biocrude production.

Submitted 26 February, 2024; originally announced February 2024.

Comments: Conference: WasteSafe 2023; At: Khulna University of Engineering & Technology, Khulna, Bangladesh; Volume: 8

arXiv:2402.15992 [pdf, other]

A Machine Learning Approach to Detect Customer Satisfaction From Multiple Tweet Parameters

Authors: Md Mahmudul Hasan, Dr. Shaikh Anowarul Fattah

Abstract: Since internet technologies have advanced, one of the primary factors in company development is customer happiness. Online platforms have become prominent places for sharing reviews. Twitter is one of these platforms where customers frequently post their thoughts. Reviews of flights on these platforms have become a concern for the airline business. A positive review can help the company grow, while a negative one can quickly ruin its revenue and reputation. So it's vital for airline businesses to examine the feedback and experiences of their customers and enhance their services to remain competitive. But studying thousands of tweets and analyzing them to find the satisfaction of the customer is quite a difficult task. This tedious process can be made easier by using a machine learning approach to analyze tweets to determine client satisfaction levels. Some work has already been done on this strategy to automate the procedure using machine learning and deep learning techniques. However, they are all purely concerned with assessing the text's sentiment. In addition to the text, the tweet also includes the time, location, username, airline name, and so on. This additional information can be crucial for improving the model's outcome. To provide a machine learning based solution, this work has broadened its perspective to include these qualities. And it has come as no surprise that the additional features beyond text sentiment analysis produce better outcomes in machine learning based models.

Submitted 25 February, 2024; originally announced February 2024.

arXiv:2402.05736 [pdf, ps, other]

Numerical solution of the Newtonian plane Couette flow with linear dynamic wall slip

Authors: Muner M. A. Hasan, Ethar A. A. Ahmed, Ahmed F. Ghaleb, Moustafa S. Abou-Dina, Georgios C. Georgiou

Abstract: An efficient numerical approach based on weighted average finite differences is used to solve the Newtonian plane Couette flow with wall slip, obeying a dynamic slip law that generalizes the Navier slip law with the inclusion of a relaxation term. Slip is exhibited only along the fixed plate, and the motion is triggered by the motion of the other plate. Three different cases are considered for the motion of the moving plate, i.e., constant speed, oscillating speed, and a single-period sinusoidal speed. The velocity and the volumetric flow rate are calculated in all cases and comparisons are made with the results of other methods and available results in the literature. The numerical outcomes confirm the damping with time and the lagging effects arising from the Navier and dynamic wall slip conditions and demonstrate the hysteretic behavior of the slip velocity in following the harmonic boundary motion.

Submitted 8 February, 2024; originally announced February 2024.

Comments: 21 pages, 15 figures

arXiv:2401.12340 [pdf, other]

doi 10.1109/TAES.2023.3337768

Contrastive Learning and Cycle Consistency-based Transductive Transfer Learning for Target Annotation

Authors: Shoaib Meraj Sami, Md Mahedi Hasan, Nasser M. Nasrabadi, Raghuveer Rao

Abstract: Annotating automatic target recognition (ATR) is a highly challenging task, primarily due to the unavailability of labeled data in the target domain. Hence, it is essential to construct an optimal target domain classifier by utilizing the labeled information of the source domain images. The transductive transfer learning (TTL) method that incorporates a CycleGAN-based unpaired domain translation network has been previously proposed in the literature for effective ATR annotation. Although this method demonstrates great potential for ATR, it severely suffers from lower annotation performance, higher Fréchet Inception Distance (FID) score, and the presence of visual artifacts in the synthetic images. To address these issues, we propose a hybrid contrastive learning-based unpaired domain translation (H-CUT) network that achieves a significantly lower FID score. It incorporates both attention and entropy to emphasize the domain-specific region, a noisy feature mixup module to generate high variational synthetic negative patches, and a modulated noise contrastive estimation (MoNCE) loss to reweight all negative patches using optimal transport for better performance. Our proposed contrastive learning and cycle-consistency-based TTL (C3TTL) framework consists of two H-CUT networks and two classifiers. It simultaneously optimizes cycle-consistency, MoNCE, and identity losses. In C3TTL, two H-CUT networks have been employed through a bijection mapping to feed the reconstructed source domain images into a pretrained classifier to guide the optimal target domain classifier. Extensive experimental analysis conducted on three ATR datasets demonstrates that the proposed C3TTL method is effective in annotating civilian and military vehicles, as well as ship targets.

Submitted 22 January, 2024; originally announced January 2024.

Comments: This paper is accepted in IEEE Transactions on Aerospace and Electronic Systems. This arXiv version is older than the reviewed version.

arXiv:2401.06157 [pdf, other]

UDEEP: Edge-based Computer Vision for In-Situ Underwater Crayfish and Plastic Detection

Authors: Dennis Monari, Jack Larkin, Pedro Machado, Jordan J. Bird, Isibor Kennedy Ihianle, Salisu Wada Yahaya, Farhad Fassihi Tash, Md Mahmudul Hasan, Ahmad Lotfi

Abstract: Invasive signal crayfish have a detrimental impact on ecosystems. They spread the fungal-type crayfish plague disease (Aphanomyces astaci) that is lethal to the native white-clawed crayfish, the only native crayfish species in Britain. Invasive signal crayfish extensively burrow, causing habitat destruction, erosion of river banks and adverse changes in water quality, while also competing with native species for resources and leading to declines in native populations. Moreover, pollution exacerbates the vulnerability of white-clawed crayfish, with their populations declining by over 90% in certain English counties, making them highly susceptible to extinction. To safeguard aquatic ecosystems, it is imperative to address the challenges posed by invasive species and discarded plastics in the United Kingdom's river ecosystems. The UDEEP platform can play a crucial role in environmental monitoring by performing on-the-fly classification of signal crayfish and plastic debris while leveraging the efficacy of AI, IoT devices and the power of edge computing (i.e., NJN). By providing accurate data on the presence, spread and abundance of these species, the UDEEP platform can contribute to monitoring efforts and aid in mitigating the spread of invasive species.

Submitted 21 December, 2023; originally announced January 2024.

arXiv:2312.10701 [pdf, other]

Bengali License Plate Recognition: Unveiling Clarity with CNN and GFP-GAN

Authors: Noushin Afrin, Md Mahamudul Hasan, Mohammed Fazlay Elahi Safin, Khondakar Rifat Amin, Md Zahidul Haque, Farzad Ahmed, Md. Tanvir Rouf Shawon

Abstract: Automated License Plate Recognition (ALPR) is a system that automatically reads and extracts data from vehicle license plates using image processing and computer vision techniques. The goal of ALPR is to identify and read the license plate number accurately and quickly, even under challenging conditions such as poor lighting, angled or obscured plates, and different plate fonts and layouts. The proposed method consists of processing low-resolution, blurred Bengali license plates and identifying the plate's characters. The processes include image restoration using GFPGAN, contrast maximization, morphological image processing such as dilation, and feature extraction; character segmentation and recognition are accomplished using Convolutional Neural Networks (CNN). A dataset of 1292 images of Bengali digits and characters was prepared for this project.

Submitted 17 December, 2023; originally announced December 2023.

arXiv:2312.09367 [pdf, other]

Text-Guided Face Recognition using Multi-Granularity Cross-Modal Contrastive Learning

Authors: Md Mahedi Hasan, Shoaib Meraj Sami, Nasser Nasrabadi

Abstract: State-of-the-art face recognition (FR) models often experience a significant performance drop when dealing with facial images in surveillance scenarios where images are of low quality and often corrupted with noise. Leveraging facial characteristics, such as freckles, scars, gender, and ethnicity, becomes highly beneficial in improving FR performance in such scenarios. In this paper, we introduce text-guided face recognition (TGFR) to analyze the impact of integrating facial attributes in the form of natural language descriptions. We hypothesize that adding semantic information into the loop can significantly improve the image understanding capability of an FR algorithm compared to other soft biometrics. However, learning a discriminative joint embedding within the multimodal space poses a considerable challenge due to the semantic gap in the unaligned image-text representations, along with the complexities arising from ambiguous and incoherent textual descriptions of the face. To address these challenges, we introduce a face-caption alignment module (FCAM), which incorporates cross-modal contrastive losses across multiple granularities to maximize the mutual information between local and global features of the face-caption pair. Within FCAM, we refine both facial and textual features for learning aligned and discriminative features. We also design a face-caption fusion module (FCFM) that applies fine-grained interactions and coarse-grained associations among cross-modal features. Through extensive experiments conducted on three face-caption datasets, the proposed TGFR demonstrates remarkable improvements, particularly on low-quality images, over existing FR models and outperforms other related methods and benchmarks.

Submitted 14 December, 2023; originally announced December 2023.

Comments: Accepted at IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024

arXiv:2311.16180 [pdf]

Aiming to Minimize Alcohol-Impaired Road Fatalities: Utilizing Fairness-Aware and Domain Knowledge-Infused Artificial Intelligence

Authors: Tejas Venkateswaran, Sheikh Rabiul Islam, Md Golam Moula Mehedi Hasan, Mohiuddin Ahmed

Abstract: Approximately 30% of all traffic fatalities in the United States are attributed to alcohol-impaired driving. This means that, despite stringent laws against this offense in every state, the frequency of drunk driving accidents is alarming, resulting in approximately one person being killed every 45 minutes. The process of charging individuals with Driving Under the Influence (DUI) is intricate and can sometimes be subjective, involving multiple stages such as observing the vehicle in motion, interacting with the driver, and conducting Standardized Field Sobriety Tests (SFSTs). Biases have been observed through racial profiling, leading to some groups and geographical areas facing fewer DUI tests, resulting in many actual DUI incidents going undetected, ultimately leading to a higher number of fatalities. To tackle this issue, our research introduces an Artificial Intelligence-based predictor that is both fairness-aware and incorporates domain knowledge to analyze DUI-related fatalities in different geographic locations. Through this model, we gain intriguing insights into the interplay between various demographic groups, including age, race, and income. By utilizing the provided information to allocate policing resources in a more equitable and efficient manner, there is potential to reduce DUI-related fatalities and have a significant impact on road safety.

Submitted 24 November, 2023; originally announced November 2023.

Comments: IEEE Big Data 2023

arXiv:2308.06866 [pdf, other]

Improving Face Recognition from Caption Supervision with Multi-Granular Contextual Feature Aggregation

Authors: Md Mahedi Hasan, Nasser Nasrabadi

Abstract: We introduce caption-guided face recognition (CGFR) as a new framework to improve the performance of commercial-off-the-shelf (COTS) face recognition (FR) systems. In contrast to combining soft biometrics (e.g., facial marks, gender, and age) with face images, in this work, we use facial descriptions provided by face examiners as a piece of auxiliary information. However, due to the heterogeneity of the modalities, improving the performance by directly fusing the textual and facial features is very challenging, as both lie in different embedding spaces. In this paper, we propose a contextual feature aggregation module (CFAM) that addresses this issue by effectively exploiting the fine-grained word-region interaction and global image-caption association. Specifically, CFAM adopts a self-attention and a cross-attention scheme for improving the intra-modality and inter-modality relationship between the image and textual features, respectively. Additionally, we design a textual feature refinement module (TFRM) that refines the textual features of the pre-trained BERT encoder by updating the contextual embeddings. This module enhances the discriminative power of textual features with a cross-modal projection loss and realigns the word and caption embeddings with visual features by incorporating a visual-semantic alignment loss. We implemented the proposed CGFR framework on two face recognition models (ArcFace and AdaFace) and evaluated its performance on the Multi-Modal CelebA-HQ dataset. Our framework significantly improves the performance of ArcFace in both 1:1 verification and 1:N identification protocols.

Submitted 13 August, 2023; originally announced August 2023.

Comments: This article has been accepted for publication in the IEEE International Joint Conference on Biometrics (IJCB), 2023

arXiv:2307.07732 [pdf, other]

Prawn Morphometrics and Weight Estimation from Images using Deep Learning for Landmark Localization

Authors: Alzayat Saleh, Md Mehedi Hasan, Herman W Raadsma, Mehar S Khatkar, Dean R Jerry, Mostafa Rahimi Azghadi

Abstract: Accurate weight estimation and morphometric analyses are useful in aquaculture for optimizing feeding, predicting harvest yields, identifying desirable traits for selective breeding, grading processes, and monitoring the health status of production animals. However, the collection of phenotypic data through traditional manual approaches at industrial scales and in real-time is time-consuming, labour-intensive, and prone to errors. Digital imaging of individuals and subsequent training of prediction models using Deep Learning (DL) has the potential to rapidly and accurately acquire phenotypic data from aquaculture species. In this study, we applied a novel DL approach to automate weight estimation and morphometric analysis using the black tiger prawn (Penaeus monodon) as a model crustacean. The DL approach comprises two main components: a feature extraction module that efficiently combines low-level and high-level features using the Kronecker product operation; followed by a landmark localization module that then uses these features to predict the coordinates of key morphological points (landmarks) on the prawn body. Once these landmarks were extracted, weight was estimated using a weight regression module based on the extracted landmarks using a fully connected network. For morphometric analyses, we utilized the detected landmarks to derive five important prawn traits. Principal Component Analysis (PCA) was also used to identify landmark-derived distances, which were found to be highly correlated with shape features such as body length, and width. We evaluated our approach on a large dataset of 8164 images of the Black tiger prawn (Penaeus monodon) collected from Australian farms. Our experimental results demonstrate that the novel DL approach outperforms existing DL methods in terms of accuracy, robustness, and efficiency.

Submitted 15 July, 2023; originally announced July 2023.

Comments: 33 pages, 8 figures. Submitted to the Computers and Electronics in Agriculture journal

arXiv:2306.11884 [pdf, other]

Protecting the Decentralized Future: An Exploration of Common Blockchain Attacks and their Countermeasures

Authors: Bilash Saha, Md Mehedi Hasan, Nafisa Anjum, Sharaban Tahora, Aiasha Siddika, Hossain Shahriar

Abstract: Blockchain technology transformed the digital sphere by providing a transparent, secure, and decentralized platform for data security across a range of industries, including cryptocurrencies and supply chain management. Blockchain's integrity and dependability have been jeopardized by the rising number of security threats, as the technology has become a target for cybercriminals. By summarizing suggested fixes, this research aims to offer a thorough analysis of mitigating blockchain attacks. The objectives of the paper include identifying weak blockchain attacks, evaluating various solutions, and determining how effective they are at preventing these attacks. The study also highlights how crucial it is to take into account the particular needs of every blockchain application. This study provides beneficial perspectives and insights for blockchain researchers and practitioners, making it essential reading for those interested in current and future trends in blockchain security research.

Submitted 20 June, 2023; originally announced June 2023.

Comments: Submitted to BSTIA 2023 (Book - Blockchain and Smart-Contract Technologies for Innovative Applications)

arXiv:2305.04402 [pdf, other]

TaLU: A Hybrid Activation Function Combining Tanh and Rectified Linear Unit to Enhance Neural Networks

Authors: Md. Mehedi Hasan, Md. Ali Hossain, Azmain Yakin Srizon, Abu Sayeed

Abstract: The application of the deep learning model in classification plays an important role in the accurate detection of the target objects. However, the accuracy is affected by the activation function in the hidden and output layer. In this paper, an activation function called TaLU, which is a combination of Tanh and Rectified Linear Units (ReLU), is used to improve the prediction. The ReLU activation function is used by many deep learning researchers for its computational efficiency, ease of implementation, intuitive nature, etc. However, it suffers from a dying gradient problem. For instance, when the input is negative, its output is always zero because its gradient is zero. A number of researchers used different approaches to solve this issue. Some of the most notable are LeakyReLU, Softplus, Softsign, ELU, ThresholdedReLU, etc. This research developed TaLU, a modified activation function combining Tanh and ReLU, which mitigates the dying gradient problem of ReLU. The deep learning model with the proposed activation function was tested on MNIST and CIFAR-10, and it outperforms ReLU and some other studied activation functions in terms of accuracy (up to 6% in most cases, when used with Batch Normalization and a reasonable learning rate).

Submitted 19 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

arXiv:2305.01044

Venn Diagram Multi-label Class Interpretation of Diabetic Foot Ulcer with Color and Sharpness Enhancement

Authors: Md Mahamudul Hasan, Moi Hoon Yap, Md Kamrul Hasan

Abstract: DFU is a severe complication of diabetes that can lead to amputation of the lower limb if not treated properly. Inspired by the 2021 Diabetic Foot Ulcer Grand Challenge, researchers designed automated multi-class classification of DFU, including infection, ischaemia, both of these conditions, and none of these conditions. However, it remains a challenge as classification accuracy is still not satisfactory. This paper proposes a Venn Diagram interpretation of multi-label CNN-based method, utilizing different image enhancement strategies, to improve the multi-class DFU classification. We propose to reduce the four classes into two since both class wounds can be interpreted as the simultaneous occurrence of infection and ischaemia and none class wounds as the absence of infection and ischaemia. We introduce a novel Venn Diagram representation block in the classifier to interpret all four classes from these two classes. To make our model more resilient, we propose enhancing the perceptual quality of DFU images, particularly blurry or inconsistently lit DFU images, by performing color and sharpness enhancements on them. We also employ a fine-tuned optimization technique, adaptive sharpness aware minimization, to improve the CNN model generalization performance. The proposed method is evaluated on the test dataset of DFUC2021, containing 5,734 images and the results are compared with the top-3 winning entries of DFUC2021. Our proposed approach outperforms these existing approaches and achieves Macro-Average F1, Recall and Precision scores of 0.6592, 0.6593, and 0.6652, respectively. Additionally, we perform ablation studies and image quality measurements to further interpret our proposed method. This proposed method will benefit patients with DFUs since it tackles the inconsistencies in captured images and can be employed for a more robust remote DFU wound classification.

Submitted 5 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

Comments: The Paper is not complete, more modifications are needed
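
The two-label reading described in the abstract above (infection and ischaemia predicted as separate labels, with "both" and "none" recovered from their combination) can be illustrated with a minimal sketch. The function name `venn_decode`, the 0.5 threshold, and the class strings are assumptions made for illustration; this is not the authors' Venn Diagram representation block.

```python
def venn_decode(p_infection: float, p_ischaemia: float, threshold: float = 0.5) -> str:
    """Map two independent label probabilities to one of four DFU classes.

    The threshold of 0.5 is an illustrative assumption, not a value from the paper.
    """
    infected = p_infection >= threshold
    ischaemic = p_ischaemia >= threshold
    if infected and ischaemic:
        return "both"        # overlap of the two sets
    if infected:
        return "infection"   # infection only
    if ischaemic:
        return "ischaemia"   # ischaemia only
    return "none"            # outside both sets


if __name__ == "__main__":
    for probs in [(0.9, 0.8), (0.9, 0.1), (0.2, 0.7), (0.1, 0.3)]:
        print(probs, venn_decode(*probs))
```

Under this reading the classifier only needs two sigmoid outputs, and the four-way decision is a deterministic function of them.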

arXiv:2303.02310 [pdf, other]

doi 10.1109/ICIP49359.2023.10221899

IKD+: Reliable Low Complexity Deep Models For Retinopathy Classification

Authors: Shreyas Bhat Brahmavar, Rohit Rajesh, Tirtharaj Dash, Lovekesh Vig, Tanmay Tulsidas Verlekar, Md Mahmudul Hasan, Tariq Khan, Erik Meijering, Ashwin Srinivasan

Abstract: Deep neural network (DNN) models for retinopathy have estimated predictive accuracies in the mid-to-high 90%. However, the following aspects remain unaddressed: State-of-the-art models are complex and require substantial computational infrastructure to train and deploy; The reliability of predictions can vary widely. In this paper, we focus on these aspects and propose a form of iterative knowledge distillation (IKD), called IKD+, that incorporates a tradeoff between size, accuracy and reliability. We investigate the functioning of IKD+ using two widely used techniques for estimating model calibration (Platt-scaling and temperature-scaling), using the best-performing model available, which is an ensemble of EfficientNets with approximately 100M parameters. We demonstrate that IKD+ equipped with temperature-scaling results in models that show up to approximately 500-fold decreases in the number of parameters than the original ensemble without a significant loss in accuracy. In addition, calibration scores (reliability) for the IKD+ models are as good as or better than the base mode

Submitted 3 March, 2023; originally announced March 2023.

Comments: Submitted to IEEE International Conference on Image Processing (ICIP 2023)

Journal ref: IEEE International Conference on Image Processing (ICIP 2023)
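
Temperature-scaling, one of the two calibration techniques named in the abstract above, divides a model's logits by a scalar temperature before the softmax. The sketch below shows only that generic operation; the function name and the example logits are hypothetical, and it does not reproduce the IKD+ procedure or how the paper selects the temperature.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Return softmax probabilities of logits / temperature.

    temperature > 1 softens the distribution, temperature < 1 sharpens it;
    the predicted class (argmax) is unchanged for any positive temperature.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    example_logits = [4.0, 1.0, 0.5]  # hypothetical model outputs
    print(softmax_with_temperature(example_logits, temperature=1.0))
    print(softmax_with_temperature(example_logits, temperature=2.0))
```

In common practice the temperature is a single parameter fitted on held-out data, which is why this form of calibration adds almost nothing to model size.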

arXiv:2302.11559 [pdf, other]

Word level Bangla Sign Language Dataset for Continuous BSL Recognition

Authors: Md Shamimul Islam, A. J. M. Akhtarujjaman Joha, Md Nur Hossain, Sohaib Abdullah, Ibrahim Elwarfalli, Md Mahedi Hasan

Abstract: A robust sign language recognition system can greatly alleviate communication barriers, particularly for people who struggle with verbal communication. This is crucial for human growth and progress as it enables the expression of thoughts, feelings, and ideas. However, sign recognition is a complex task that faces numerous challenges such as same gesture patterns for multiple signs, lighting, clothing, carrying conditions, and the presence of large poses, as well as illumination discrepancies across different views. Additionally, the absence of an extensive Bangla sign language video dataset makes it even more challenging to operate recognition systems, particularly when utilizing deep learning techniques. In order to address this issue, firstly, we created a large-scale dataset called the MVBSL-W50, which comprises 50 isolated words across 13 categories. Secondly, we developed an attention-based Bi-GRU model that captures the temporal dynamics of pose information for individuals communicating through sign language. The proposed model utilizes human pose information, which has shown to be successful in analyzing sign language patterns. By focusing solely on movement information and disregarding body appearance and environmental factors, the model is simplified and can achieve a speedier performance. The accuracy of the model is reported to be 85.64%.

Submitted 9 April, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

arXiv:2302.02201 [pdf]

First-principles Studies on Structural, Electronic, Optical and Mechanical Properties of Inorganic Cs2NaTlX6 (X = F, Cl, Br) Double Halide Perovskites

Authors: Mohammed Mehedi Hasan, Nazmul Hasan, Alamgir Kabir

Abstract: The structural, electrical, optical, and mechanical characteristics of the lead-free halide double perovskites Cs2NaTlX6 (X = F, Cl, Br) are calculated by utilizing the PBE functional within the generalized gradient approximation (GGA) under the context of density functional theory (DFT). The structural properties such as lattice parameter, cell volume, total energy, bulk modulus, pressure derivative, and tolerance factor are computed at equilibrium. The electronic density of states reveals the semiconducting nature of the compound and the band structure exhibits the nature of the band gap to be direct. The HSE06 functional is introduced to correct the underestimated band gap as obtained in the GGA-PBE functional. The real and imaginary components of the dielectric function, absorption coefficient, energy loss function, reflectivity, refractive index, and extinction coefficient are analyzed and explained by electronic structures.

Submitted 4 February, 2023; originally announced February 2023.

arXiv:2212.14744

A Comparison Study of Deep CNN Architecture in Detecting of Pneumonia

Authors: Al Mohidur Rahman Porag, Md. Mahedi Hasan, Dr. Md Taimur Ahad

Abstract: Pneumonia, a respiratory infection brought on by bacteria or viruses, affects a large number of people, especially in developing and impoverished countries where high levels of pollution, unclean living conditions, and overcrowding are frequently observed, along with insufficient medical infrastructure. Pleural effusion, a condition in which fluids fill the lung and complicate breathing, is brought on by pneumonia. Early detection of pneumonia is essential for ensuring curative care and boosting survival rates. The approach most usually used to diagnose pneumonia is chest X-ray imaging. The purpose of this work is to develop a method for the automatic diagnosis of bacterial and viral pneumonia in digital x-ray pictures. This article first presents the authors' technique, and then gives a comprehensive report on recent developments in the field of reliable diagnosis of pneumonia. In this study, we tuned a state-of-the-art deep convolutional neural network to classify plant diseases based on images and tested its performance. Deep learning architecture is compared empirically. VGG19, ResNet with 152v2, Resnext101, Seresnet152, MobileNetV2, and DenseNet with 201 layers are among the architectures tested. Experiment data consists of two groups, sick and healthy X-ray pictures. To take appropriate action against plant diseases as soon as possible, rapid disease identification models are preferred. DenseNet201 has shown no overfitting or performance degradation in our experiments, and its accuracy tends to increase as the number of epochs increases. Further, DenseNet201 achieves state-of-the-art performance with a significantly smaller number of parameters and within a reasonable computing time. This architecture outperforms the competition in terms of testing accuracy, scoring 95%. Each architecture was trained using Keras, using Theano as the backend.

Submitted 14 February, 2023; v1 submitted 30 December, 2022; originally announced December 2022.

Comments: I have to remake the artical. Case there was some accuracy problem

arXiv:2212.13599 [pdf]

Brain Cancer Segmentation Using YOLOv5 Deep Neural Network

Authors: Sudipto Paul, Dr. Md Taimur Ahad, Md. Mahedi Hasan

Abstract: An expansion of aberrant brain cells is referred to as a brain tumor. The brain's architecture is extremely intricate, with several regions controlling various nervous system processes. Any portion of the brain or skull can develop a brain tumor, including the brain's protective coating, the base of the skull, the brainstem, the sinuses, the nasal cavity, and many other places. Over the past ten years, numerous developments in the field of computer-aided brain tumor diagnosis have been made. Recently, instance segmentation has attracted a lot of interest in numerous computer vision applications. It seeks to assign various IDs to various scene objects, even if they are members of the same class. Typically, a two-stage pipeline is used to perform instance segmentation. This study shows brain cancer segmentation using YOLOv5. YOLO takes the dataset in picture format with a corresponding text file. You Only Look Once (YOLO) is a viral and widely used algorithm. YOLO is famous for its object recognition properties. You Only Look Once (YOLO) is a popular algorithm that has gone viral. YOLO is well known for its ability to identify objects. YOLO V2, V3, V4, and V5 are some of the latest YOLO versions that experts have published in recent years. Early brain tumor detection is one of the most important jobs that neurologists and radiologists have. However, it can be difficult and error-prone to manually identify and segment brain tumors from Magnetic Resonance Imaging (MRI) data. For making an early diagnosis of the condition, an automated brain tumor detection system is necessary. The model of the research paper has three classes. They are respectively Meningioma, Pituitary, Glioma. The results show that our model achieves competitive accuracy, in terms of runtime usage of M2 10 core GPU.

Submitted 27 December, 2022; originally announced December 2022.

arXiv:2207.12999 [pdf, other]

A Bayesian hierarchical framework for emulating a complex crop yield simulator

Authors: Muhammad Mahmudul Hasan, Jonathan Andrew Cumming

Abstract: Emulation of complex computer simulations has become an effective tool in the exploration of the behaviour of the simulated processes. Agriculture is one such area where the simulation of crop growth, nutrition, soil condition and pollution could be invaluable in any land management decisions. In this paper, we study output from the EPIC simulation model to investigate the behaviour of crop yield in response to changes in inputs such as fertilizer levels, soil, steepness, and other environmental covariates. We build a model for crop yield around a non-linear Mitscherlich-Baule growth model to make inferences about the response of crop yield to changes in continuous input variables (fertiliser levels), as well as exploring the impact of categorical factor inputs such as land steepness and soil type. A Bayesian hierarchical approach to the modelling was taken for mixed inputs, requiring Markov Chain Monte Carlo simulations to obtain samples from the posterior distributions, to validate and illustrate the results, and to carry out model selection. Our results highlight a strong response of yield to nitrogen, but surprisingly a weak response to phosphorus, and also show the substantial improvement of the model after adding factor effects response to maximum yield for this particular simulator configuration and catchment.

Submitted 26 July, 2022; originally announced July 2022.

Comments: Submitted to the Statistical Modelling Journal

arXiv:2207.09627 [pdf, other]

EVHA: Explainable Vision System for Hardware Testing and Assurance -- An Overview

Authors: Md Mahfuz Al Hasan, Mohammad Tahsin Mostafiz, Thomas An Le, Jake Julia, Nidish Vashistha, Shayan Taheri, Navid Asadizanjani

Abstract: Due to the ever-growing demands for electronic chips in different sectors the semiconductor companies have been mandated to offshore their manufacturing processes. This unwanted matter has made security and trustworthiness of their fabricated chips concerning and caused creation of hardware attacks. In this condition, different entities in the semiconductor supply chain can act maliciously and execute an attack on the design computing layers, from devices to systems. Our attack is a hardware Trojan that is inserted during mask generation/fabrication in an untrusted foundry. The Trojan leaves a footprint in the fabricated through addition, deletion, or change of design cells. In order to tackle this problem, we propose Explainable Vision System for Hardware Testing and Assurance (EVHA) in this work that can detect the smallest possible change to a design in a low-cost, accurate, and fast manner. The inputs to this system are Scanning Electron Microscopy (SEM) images acquired from the Integrated Circuits (ICs) under examination. The system output is determination of IC status in terms of having any defect and/or hardware Trojan through addition, deletion, or change in the design cells at the cell-level. This article provides an overview on the design, development, implementation, and analysis of our defense system.

Submitted 19 July, 2022; originally announced July 2022.

Comments: Please contact Dr. Shayan Taheri for any questions and/or comments regarding the paper arXiv submission at: "www.shayan-taheri.com". The Paper Initial Submission: The ACM Journal on Emerging Technologies in Computing Systems (JETC)

arXiv:2205.12322 [pdf]

Optimizing Return and Secure Disposal of Prescription Opioids to Reduce the Diversion to Secondary Users and Black Market

Authors: Md Mahmudul Hasan, Tasnim Ibn Faiz, Alicia Sasser Modestino, Gary J. Young, Md. Noor-E-Alam

Abstract: Opioid Use Disorder (OUD) has reached an epidemic level in the US. Diversion of unused prescription opioids to secondary users and black market significantly contributes to the abuse and misuse of these highly addictive drugs, leading to the increased risk of OUD and accidental opioid overdose within communities. Hence, it is critical to design effective strategies to reduce the non-medical use of opioids that can occur via diversion at the patient level. In this paper, we aim to address this critical public health problem by designing strategies for the return and safe disposal of unused prescription opioids. We propose a data-driven optimization framework to determine the optimal incentive disbursement plans and locations of easily accessible opioid disposal kiosks to motivate prescription opioid users of diverse profiles in returning their unused opioids. We develop a Mixed-Integer Non-Linear Programming (MINLP) model to solve the decision problem, followed by a reformulation scheme using Benders Decomposition that results in a computationally efficient solution. We present a case study to show the benefits and usability of the model using a dataset created from Massachusetts All Payer Claims Data (MA APCD). Our proposed model allows the policymakers to estimate and include a penalty cost considering the economic and healthcare burden associated with prescription opioid diversion. Our numerical experiments demonstrate the ability and usefulness of the model in determining optimal locations of opioid disposal kiosks and incentive disbursement plans for maximizing the disposal of unused opioids. The proposed optimization framework offers various trade-off strategies that can help government agencies design pragmatic policies for reducing the diversion of unused prescription opioids.

Submitted 14 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

arXiv:2202.10459 [pdf]

Towards technological adaptation of advanced farming through AI, IoT, and Robotics: A Comprehensive overview

Authors: Md. Mahadi Hasan, Muhammad Usama Islam, Muhammad Jafar Sadeq

Abstract: The population explosion of the 21st century has adversely affected the natural resources with restricted availability of cultivable land, increased average temperatures due to global warming, and carbon footprint resulting in a drastic increase in floods as well as droughts thus making food security significant anxiety for most countries. The traditional methods were no longer sufficient which paved the way for technological ascents such as a substantial rise in Artificial Intelligence (AI), Internet of Things (IoT), as well as Robotics that provides high productivity, functional efficiency, flexibility, cost-effectiveness in the domain of agriculture. AI, IoT, and Robotics-based devices and methods have produced new paradigms and opportunities in agriculture. AI's existing approaches are soil management, crop diseases identification, weed identification, and management in collaboration with IoT devices. IoT has utilized automatic agricultural operations and real-time monitoring with few personnel employed in real-time. The major existing applications of agricultural robotics are for the function of soil preparation, planting, monitoring, harvesting, and storage. In this paper, researchers have explored a comprehensive overview of recent implementation, scopes, opportunities, challenges, limitations, and future research instructions of AI, IoT, and Robotics based methodology in the agriculture sector.

Submitted 21 February, 2022; originally announced February 2022.

Comments: 27 pages, 4 figures, book chapter, https://www.routledge.com/Artificial-Intelligence-and-Smart-Agriculture-Technology/Kose-Prasath-Mondal-Podder-Bharat/p/book/9781032120799

arXiv:2112.07819 [pdf, other]

Weed Recognition using Deep Learning Techniques on Class-imbalanced Imagery

Authors: A S M Mahmudul Hasan, Ferdous Sohel, Dean Diepeveen, Hamid Laga, Michael G. K. Jones

Abstract: Most weed species can adversely impact agricultural productivity by competing for nutrients required by high-value crops. Manual weeding is not practical for large cropping areas. Many studies have been undertaken to develop automatic weed management systems for agricultural crops. In this process, one of the major tasks is to recognise the weeds from images. However, weed recognition is a challenging task. It is because weed and crop plants can be similar in colour, texture and shape which can be exacerbated further by the imaging conditions, geographic or weather conditions when the images are recorded. Advanced machine learning techniques can be used to recognise weeds from imagery. In this paper, we have investigated five state-of-the-art deep neural networks, namely VGG16, ResNet-50, Inception-V3, Inception-ResNet-v2 and MobileNetV2, and evaluated their performance for weed recognition. We have used several experimental settings and multiple dataset combinations. In particular, we constructed a large weed-crop dataset by combining several smaller datasets, mitigating class imbalance by data augmentation, and using this dataset in benchmarking the deep neural networks. We investigated the use of transfer learning techniques by preserving the pre-trained weights for extracting the features and fine-tuning them using the images of crop and weed datasets. We found that VGG16 performed better than others on small-scale datasets, while ResNet-50 performed better than other deep networks on the large combined dataset.

Submitted 14 December, 2021; originally announced December 2021.

Comments: The paper is accepted by Crop and Pasture Science journal (https://www.publish.csiro.au/CP/justaccepted/CP21626)

arXiv:2112.02241 [pdf]

Underexpanded Supersonic Jet in Imposed Oscillating Condition

Authors: Md. Elius, Md. Mahmudul Hasan, A. B. M. Toufiqe Hasan

Abstract: In the present study, a computational study is performed to investigate the effect of imposed oscillation of nozzle pressure ratio (NPR) on the flow structure in a two-dimensional, axisymmetric supersonic converging nozzle. In this study, the underexpanded flow conditions are considered which are dominated by diamond shock-cell structure. The computational results are well validated with the available experimental measurements. The flow is initially computed to be fully developed and then oscillations are imposed. NPR is increased from 1.6 to 2.6 and then decreased again to 1.6 and thus completes a cycle. Results showed that the external flow structure of the nozzle is dependent on the process of change of pressure ratio during the oscillation. Distinct flow structures are observed during increasing and decreasing processes of the change of pressure ratio even when the nozzle is at the same NPR. Irreversible behaviors in the locations of jet centreline axis and off-axis as well as expansion, compression and neutral zones, are observed at the same NPRs during this oscillation. Further, the effect of oscillation frequency is explored on this irreversible behavior at 100 Hz, 200 Hz, 500 Hz and 1000 Hz frequencies.

Submitted 3 December, 2021; originally announced December 2021.

Comments: 16th Asian Congress of Fluid Mechanics (16ACFM), Bengaluru, India, 13-17 December 2019

arXiv:2111.03890 [pdf, other]

doi 10.1109/CSDE53843.2021.9718400

Demystifying Deep Learning Models for Retinal OCT Disease Classification using Explainable AI

Authors: Tasnim Sakib Apon, Mohammad Mahmudul Hasan, Abrar Islam, MD. Golam Rabiul Alam

Abstract: In the world of medical diagnostics, the adoption of various deep learning techniques is quite common as well as effective, and its statement is equally true when it comes to implementing it into the retina Optical Coherence Tomography (OCT) sector, but (i) these techniques have the black box characteristics that prevent the medical professionals from completely trusting the results generated from them; (ii) lack of precision of these methods restricts their implementation in clinical and complex cases; (iii) the existing works and models on the OCT classification are substantially large and complicated and they require a considerable amount of memory and computational power, reducing the quality of classifiers in real-time applications. To meet these problems, in this paper a self-developed CNN model has been proposed which is comparatively smaller and simpler along with the use of Lime that introduces Explainable AI to the study and helps to increase the interpretability of the model. This addition will be an asset to the medical experts for getting major and detailed information and will help them in making final decisions and will also reduce the opacity and vulnerability of the conventional deep learning models.

Submitted 6 November, 2021; originally announced November 2021.

arXiv:2109.10208 [pdf, other]

doi 10.1007/978-3-030-86133-9_7

Bayes Linear Emulation of Simulated Crop Yield

Authors: Muhammad Mahmudul Hasan, Jonathan A. Cumming

Abstract: The analysis of the output from a large scale computer simulation experiment can pose a challenging problem in terms of size and computation. We consider output in the form of simulated crop yields from the Environmental Policy Integrated Climate (EPIC) model, which requires a large number of inputs such as fertiliser levels, weather conditions, and crop rotations inducing a high dimensional input space. In this paper, we adopt a Bayes linear approach to efficiently emulate crop yield as a function of the simulator fertiliser inputs. We explore emulator diagnostics and present the results from emulation of a subset of the simulated EPIC data output.

Submitted 21 September, 2021; originally announced September 2021.

Comments: Conference Paper, Canadian Statistics Conference

Report number: Paper 138
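
For readers unfamiliar with the Bayes linear approach named in the abstract above, the standard adjustment formulas are sketched below. The symbols are assumptions chosen for illustration and are not taken from the paper: $B$ stands for the quantities to be predicted (for example, simulator output at untried inputs) and $D$ for the observed simulator runs.

$$
\mathrm{E}_D(B) = \mathrm{E}(B) + \mathrm{Cov}(B, D)\,\mathrm{Var}(D)^{-1}\,\bigl(D - \mathrm{E}(D)\bigr)
$$

$$
\mathrm{Var}_D(B) = \mathrm{Var}(B) - \mathrm{Cov}(B, D)\,\mathrm{Var}(D)^{-1}\,\mathrm{Cov}(D, B)
$$

Only prior means, variances, and covariances are needed, which is one reason this kind of emulation can stay tractable for large simulation experiments.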

arXiv:2108.12500 [pdf, other]

Positive Planar Satisfiability Problems under 3-Connectivity Constraints

Authors: Md. Manzurul Hasan, Debajyoti Mondal, Md. Saidur Rahman

Abstract: A 3-SAT problem is called positive and planar if all the literals are positive and the clause-variable incidence graph (i.e., SAT graph) is planar. The NAE 3-SAT and 1-in-3-SAT are two variants of 3-SAT that remain NP-complete even when they are positive. The positive 1-in-3-SAT problem remains NP-complete under the planarity constraint, but planar NAE 3-SAT is solvable in $O(n^{1.5}\log n)$ time. In this paper we prove that a positive planar NAE 3-SAT is always satisfiable when the underlying SAT graph is 3-connected, and a satisfying assignment can be obtained in linear time. We also show that without the 3-connectivity constraint, the existence of a linear-time algorithm for the positive planar NAE 3-SAT problem is unlikely, as it would imply a linear-time algorithm for finding a spanning 2-matching in a planar subcubic graph. We then prove that positive planar 1-in-3-SAT remains NP-complete under the 3-connectivity constraint, even when each variable appears in at most 4 clauses. However, we show that the 3-connected planar 1-in-3-SAT is always satisfiable when each variable appears in an even number of clauses.

Submitted 27 August, 2021; originally announced August 2021.

MSC Class: 05C85

ACM Class: F.2
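
To make the two satisfiability variants in the abstract above concrete, here is a minimal Python sketch of the clause conditions for positive NAE 3-SAT and positive 1-in-3-SAT, with a brute-force search. The function names and the toy instance are hypothetical, the instance is not claimed to be planar or 3-connected, and the exhaustive search is unrelated to the linear-time algorithm the paper describes.

```python
from itertools import product


def nae_satisfied(clauses, assignment):
    """Not-all-equal: every clause needs at least one true and one false variable."""
    return all(len({assignment[v] for v in clause}) == 2 for clause in clauses)


def one_in_three_satisfied(clauses, assignment):
    """1-in-3: every clause needs exactly one true variable."""
    return all(sum(assignment[v] for v in clause) == 1 for clause in clauses)


def brute_force(clauses, check):
    """Try all assignments (exponential time); return the first that passes `check`."""
    variables = sorted({v for clause in clauses for v in clause})
    for values in product([False, True], repeat=len(variables)):
        assignment = dict(zip(variables, values))
        if check(clauses, assignment):
            return assignment
    return None


# Hypothetical toy instance: all literals are positive (no negations).
clauses = [("a", "b", "c"), ("b", "c", "d"), ("a", "c", "d")]

nae_solution = brute_force(clauses, nae_satisfied)
one_in_three_solution = brute_force(clauses, one_in_three_satisfied)
all_true = {v: True for v in "abcd"}
```

Because every literal is positive, setting all variables to true satisfies plain 3-SAT trivially; the NAE and 1-in-3 conditions are what make the positive versions non-trivial.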

arXiv:2103.14115 [pdf, other]

Training Neural Networks Using the Property of Negative Feedback to Inverse a Function

Authors: Md Munir Hasan, Jeremy Holleman

Abstract: With high forward gain, a negative feedback system has the ability to perform the inverse of a linear or nonlinear function that is in the feedback path. This property of negative feedback systems has been widely used in analog circuits to construct precise closed-loop functions. This paper describes how the property of a negative feedback system to perform the inverse of a function can be used for training neural networks. This method does not require that the cost or activation functions be differentiable. Hence, it is able to learn a class of non-differentiable functions as well, where a gradient descent-based method fails. We also show that gradient descent emerges as a special case of the proposed method. We have applied this method to the MNIST dataset and obtained results that show the method is viable for neural network training. This method, to the best of our knowledge, is novel in machine learning.

Submitted 25 March, 2021; originally announced March 2021.
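
The inversion property that the abstract above relies on can be illustrated with a small simulation. This is only a sketch under stated assumptions: the forward path is modelled as an integrator with a hypothetical gain, the loop is stepped with explicit Euler, and the example functions are made up. It demonstrates the feedback property itself and is not the training method proposed in the paper.

```python
def feedback_inverse(f, x, gain=50.0, dt=1e-3, steps=2000, y0=0.0):
    """Simulate dy/dt = gain * (x - f(y)); the loop settles where f(y) = x.

    `f` sits in the feedback path and must be monotonically increasing
    for this simple loop to be stable. No derivative of `f` is used.
    """
    y = y0
    for _ in range(steps):
        error = x - f(y)          # difference between input and fed-back signal
        y += dt * gain * error    # integrator in the forward path
    return y


def cubic(y):
    """Smooth monotonic function; cubic(2) == 10."""
    return y ** 3 + y


def kinked(y):
    """Piecewise-linear function with a kink (not differentiable) at y = 0."""
    return 2.0 * y if y > 0 else 0.5 * y


y_cubic = feedback_inverse(cubic, 10.0)    # settles near 2
y_kinked = feedback_inverse(kinked, -1.0)  # settles near -2
```

The loop only ever evaluates `f`, never its derivative, which is why the kinked function poses no difficulty here.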

arXiv:2103.01415 [pdf, other]

A Survey of Deep Learning Techniques for Weed Detection from Images

Authors: A S M Mahmudul Hasan, Ferdous Sohel, Dean Diepeveen, Hamid Laga, Michael G. K. Jones

Abstract: The rapid advances in Deep Learning (DL) techniques have enabled rapid detection, localisation, and recognition of objects from images or videos. DL techniques are now being used in many applications related to agriculture and farming. Automatic detection and classification of weeds can play an important role in weed management and so contribute to higher yields. Weed detection in crops from imagery is inherently a challenging problem because both weeds and crops have similar colours ('green-on-green'), and their shapes and texture can be very similar at the growth phase. Also, a crop in one setting can be considered a weed in another. In addition to their detection, the recognition of specific weed species is essential so that targeted controlling mechanisms (e.g. appropriate herbicides and correct doses) can be applied. In this paper, we review existing deep learning-based weed detection and classification techniques. We cover the detailed literature on four main procedures, i.e., data acquisition, dataset preparation, DL techniques employed for detection, location and classification of weeds in crops, and evaluation metrics approaches. We found that most studies applied supervised learning techniques, they achieved high classification accuracy by fine-tuning pre-trained models on any plant dataset, and past experiments have already achieved high accuracy when a large amount of labelled data is available.

Submitted 1 March, 2021; originally announced March 2021.

arXiv:2006.07799 [pdf, other]

On the Stability of Explicit Finite Difference Methods for Advection-Diffusion Equations

Authors: Xianyi Zeng, Md Mahmudul Hasan

Abstract: In this paper we study the stability of explicit finite difference discretizations of linear advection-diffusion equations (ADE) with arbitrary order of accuracy in the context of the method of lines. The analysis first focuses on the stability of the system of ordinary differential equations (ODE) that is obtained by discretizing the ADE in space and then extends to fully discretized methods where explicit Runge-Kutta methods are used for integrating the ODE system. In particular, it is proved that every stable semi-discretization of the ADE gives rise to a conditionally stable fully discretized method if the time-integrator is at least first-order accurate, whereas high-order spatial discretization of the advection equation cannot yield a stable method if the temporal order is too low. In the second half of this paper, we extend the analysis to a partially dissipative wave system and obtain the stability results for both semi-discretized and fully-discretized methods. Finally, the major theoretical predictions are verified numerically.

Submitted 15 June, 2020; v1 submitted 14 June, 2020; originally announced June 2020.

Comments: 27 pages, 11 figures

MSC Class: 65M06; 65M12
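
As a concrete instance of the kind of stability question studied in the abstract above, the sketch below performs a textbook von Neumann check for the simplest explicit scheme: forward Euler in time with second-order central differences in space, applied to $u_t + a u_x = \nu u_{xx}$. This is an illustrative assumption, not the paper's analysis, which covers arbitrary spatial order and Runge-Kutta time integrators. Here `c` is the Courant number $a\,\Delta t/\Delta x$ and `d` is the diffusion number $\nu\,\Delta t/\Delta x^2$; all names are hypothetical.

```python
import math


def amplification_factor(c, d, theta):
    """Per-step growth factor of the Fourier mode exp(i*j*theta) for the
    forward-Euler / central-difference scheme."""
    return 1.0 - 2.0 * d * (1.0 - math.cos(theta)) - 1j * c * math.sin(theta)


def max_amplification(c, d, samples=720):
    """Largest |g| over a uniform sample of wavenumbers in [0, 2*pi]."""
    return max(
        abs(amplification_factor(c, d, 2.0 * math.pi * k / samples))
        for k in range(samples + 1)
    )


def is_stable(c, d, tol=1e-12):
    """Numerical von Neumann test: no sampled mode may grow."""
    return max_amplification(c, d) <= 1.0 + tol


stable_case = is_stable(c=0.5, d=0.4)         # satisfies c**2 <= 2*d <= 1
too_much_diffusion = is_stable(c=0.5, d=0.6)  # violates 2*d <= 1
too_much_advection = is_stable(c=1.0, d=0.4)  # violates c**2 <= 2*d
```

For this particular scheme the sampled test agrees with the classical condition $c^2 \le 2d \le 1$, so the scheme is only conditionally stable and needs enough diffusion relative to advection.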

arXiv:2002.03130 [pdf]

doi 10.5120/17195-7390

Design and Implementation of Butterworth, Chebyshev-I and Elliptic Filter for Speech Signal Analysis

Authors: Prajoy Podder, Md. Mehedi Hasan, Md. Rafiqul Islam, Mursalin Sayeed

Abstract: In the field of digital signal processing, the function of a filter is to remove unwanted parts of the signal, such as random noise. Filters are necessary to remove noise from a transmitted speech signal or to extract useful parts of the signal, such as the components lying within a certain frequency range. Filters are broadly used in signal processing and communication systems in applications such as channel equalization, noise reduction, radar, audio processing, speech signal processing, video processing, biomedical signal processing (e.g. filtering of noisy ECG, EEG, and EMG signals), electrical circuit analysis, and analysis of economic and financial data. In this paper, three types of infinite impulse response filter, i.e. Butterworth, Chebyshev type I and elliptic filters, are discussed theoretically and experimentally. Butterworth, Chebyshev type I and elliptic low pass, high pass, band pass and band stop filters have been designed using MATLAB. The impulse responses, magnitude responses, and phase responses of the Butterworth, Chebyshev type I and elliptic filters for filtering the speech signal have been observed. The speech signal is also analysed to obtain its sampling rate and spectrum.

Submitted 27 May, 2020; v1 submitted 8 February, 2020; originally announced February 2020.

Journal ref: International Journal of Computer Applications 98(7):12-18, July 2014
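
Two of the filter families named in the abstract above have closed-form magnitude responses, which can be compared in a few lines. The sketch below uses the standard analog low-pass prototype formulas in plain Python; it is not the MATLAB design from the paper, the elliptic filter is omitted because its response needs elliptic rational functions, and the function names and parameter values are hypothetical.

```python
import math


def butterworth_gain(omega, cutoff, order):
    """|H(j*omega)| of an analog Butterworth low-pass filter."""
    return 1.0 / math.sqrt(1.0 + (omega / cutoff) ** (2 * order))


def chebyshev_polynomial(order, x):
    """Chebyshev polynomial of the first kind T_n(x), for x >= 0."""
    if x <= 1.0:
        return math.cos(order * math.acos(x))
    return math.cosh(order * math.acosh(x))


def chebyshev1_gain(omega, cutoff, order, ripple_db):
    """|H(j*omega)| of an analog Chebyshev type I low-pass filter."""
    epsilon = math.sqrt(10.0 ** (ripple_db / 10.0) - 1.0)
    t_n = chebyshev_polynomial(order, omega / cutoff)
    return 1.0 / math.sqrt(1.0 + (epsilon * t_n) ** 2)


def to_db(gain):
    """Convert a linear magnitude to decibels."""
    return 20.0 * math.log10(gain)


# Normalised cutoff of 1 rad/s, 4th order, 1 dB passband ripple (illustrative values).
butter_at_cutoff_db = to_db(butterworth_gain(1.0, 1.0, 4))
cheby_at_cutoff_db = to_db(chebyshev1_gain(1.0, 1.0, 4, 1.0))
butter_stopband = butterworth_gain(2.0, 1.0, 4)
cheby_stopband = chebyshev1_gain(2.0, 1.0, 4, 1.0)
```

At the cutoff the Butterworth response is about 3 dB down regardless of order, while the Chebyshev type I response is down by exactly the chosen ripple; one octave above the cutoff the Chebyshev filter attenuates more, which is the usual trade of passband ripple for a steeper roll-off.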

arXiv:2001.09494 [pdf, other]

Efficient, Effective and Well Justified Estimation of Active Nodes within a Cluster

Authors: Md Mahmudul Hasan, Shuangqing Wei, Ramachandran Vaidyanathan

Abstract: Reliable and efficient estimation of the size of a dynamically changing cluster in an IoT network is critical in its nominal operation. Most previous estimation schemes worked with a relatively small frame size and a large number of rounds. Here we propose a new estimator named "Gaussian Estimator of Active Nodes" (GEAN), which works with a frame size large enough that the test statistic is well approximated as a Gaussian variable, thereby requiring fewer frames, and thus fewer total channel slots, to attain a desired accuracy in estimation. More specifically, the selection of the frame size is done according to the Triangular Array Central Limit Theorem, which also enables us to quantify the approximation error. A larger frame size helps the statistical average converge faster to the ensemble mean of the estimator, and the quantification of the approximation error helps to determine the number of rounds needed to keep up with the accuracy requirements. We present the analysis of our scheme under two different channel models, i.e. $ \{0,1 \} $ and $ \{0,1,e \} $, whereas all previous schemes worked only under the $ \{0,1 \} $ channel model. The overall performance of GEAN is better than that of the previously proposed schemes in terms of the number of slots required to achieve a given level of estimation accuracy.

Submitted 26 January, 2020; originally announced January 2020.

Comments: 15 pages, 11 figures. arXiv admin note: text overlap with arXiv:1701.05952

arXiv:2001.02712 [pdf, other]

Latent Factor Analysis of Gaussian Distributions under Graphical Constraints

Authors: Md Mahmudul Hasan, Shuangqing Wei, Ali Moharrer

Abstract: We explore the algebraic structure of the solution space of the convex optimization problem Constrained Minimum Trace Factor Analysis (CMTFA), when the population covariance matrix $\Sigma_x$ has an additional latent graphical constraint, namely, a latent star topology. In particular, we have shown that CMTFA can have either a rank $ 1 $ or a rank $ n-1 $ solution and nothing in between. The special case of a rank $ 1 $ solution corresponds to the case where just one latent variable captures all the dependencies among the observables, giving rise to a star topology. We found explicit conditions for both rank $ 1 $ and rank $ n-1 $ CMTFA solutions of $\Sigma_x$. As a basic attempt towards building a more general Gaussian tree, we have found a necessary and a sufficient condition for multiple clusters, each having a rank $ 1 $ CMTFA solution, to satisfy a minimum probability to combine together to build a Gaussian tree. To support our analytical findings we have presented some numerical results demonstrating the usefulness of the contributions of our work.

Submitted 11 January, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

Comments: 9 pages, 4 figures

arXiv:1912.03356 [pdf, other]

Cognitive Internet of Vehicles: Motivation, Layered Architecture and Security Issues

Authors: Khondokar Fida Hasan, Tarandeep Kaur, Md. Mhedi Hasan, Yanming Feng

Abstract: Over the past few years, we have experienced great technological advancements in the information and communication field, which have significantly contributed to reshaping the Intelligent Transportation System (ITS) concept. Evolving from a platform consisting of a collection of sensors that collect data, the data exchange paradigm among vehicles has shifted from the local network to the cloud. With the introduction of cloud and edge computing along with the ubiquitous 5G mobile network, Artificial Intelligence (AI) is expected to play an imminent role in data processing and smart decision making. To fully understand the future automobile scenario on the verge of Industrial Revolution 4.0, it is first necessary to get a clear understanding of the cutting-edge technologies that are going to take their place in the automotive ecosystem, so that the cyber-physical impact on the transportation system can be measured. CIoV, short for Cognitive Internet of Vehicles, is one of the recently proposed architectures of the technological evolution in transportation, and it has attracted great attention. It introduces cloud-based artificial intelligence and machine learning into the transportation system. What are the future expectations of CIoV? To fully contemplate this architecture's future potential and the milestones set for it, it is crucial to understand all the technologies that feed into it, as well as the security issues that must be addressed to meet the security requirements of its practical implementation.
To that end, this paper presents the evolution of CIoV along with the layer abstractions to outline the distinctive functional parts of the proposed architecture. It also investigates the prime security and privacy issues associated with this technological evolution so that measures can be taken.

Submitted 20 November, 2019; originally announced December 2019.

Comments: 6 pages

arXiv:1904.03524 [pdf]

A Big Data Analytics Framework to Predict the Risk of Opioid Use Disorder

Authors: Md Mahmudul Hasan, Md. Noor-E-Alam, Mehul Rakeshkumar Patel, Alicia Sasser Modestino, Leon D. Sanchez, Gary Young

Abstract: Overdoses related to prescription opioids have reached an epidemic level in the US, creating an unprecedented national crisis. This has been exacerbated partly by the lack of tools to help physicians predict the risk that a patient will develop opioid use disorder. Little is known about how machine learning can be applied to a big-data platform to ensure informed, sustained and judicious prescribing of opioids, in particular for the commercially insured population. This study explores Massachusetts All Payer Claims Data, a de-identified healthcare dataset, and proposes a machine learning framework to examine how naïve users develop opioid use disorder. We apply several feature selection techniques to identify influential demographic and clinical features associated with opioid use disorder from a class-imbalanced analytic sample. We then compare the predictive power of four well-known machine learning algorithms: Logistic Regression, Random Forest, Decision Tree, and Gradient Boosting, to predict the risk of opioid use disorder. The study results show that the Random Forest model outperforms the other three algorithms while determining the features, some of which are consistent with prior clinical findings. Moreover, alongside the higher predictive accuracy, the proposed framework is capable of extracting some risk factors that will add significant knowledge to what is already known in the extant literature.
We anticipate that this study will help healthcare practitioners improve the current prescribing practice of opioids and contribute to curbing the increasing rate of opioid addiction and overdose.

Submitted 30 May, 2020; v1 submitted 6 April, 2019; originally announced April 2019.

arXiv:1903.02189 [pdf, other]

Grid-Connected Emergency Back-Up Power Supply

Authors: Dhiman Chowdhury, Mohammad Sharif Miah, Md. Feroz Hossain, Md. Mostafijur Rahman, Md. Marzan Hossain, Md. Nazim Uddin Sheikh, Md. Mehedi Hasan, Uzzal Sarker, Abu Shahir Md. Khalid Hasan

Abstract: This paper documents a design and modelling of a grid-connected emergency back-up power supply for medium power applications. There are a rectifier-link boost derived battery charging circuit and a 4-switch push-pull power inverter circuit which are controlled by pulse width modulation (PWM) signals. This paper presents a state averaging model and Laplace domain transfer function of the charging circuit and a switching converter model of the power inverter circuit. A changeover relay based transfer switch controls the power flow towards the utility loads. During off-grid situations, loads are fed power by the proposed inverter circuit and during on-grid situations, battery is charged by an ac-link rectifier-fed boost converter. There is a relay switching circuit to control the charging phenomenon of the battery. The proposed design has been simulated in PLECS and the simulation results corroborate the reliability of the presented framework.

Submitted 6 March, 2019; originally announced March 2019.

arXiv:1901.06466 [pdf, other]

Algebraic Properties of Wyner Common Information Solution under Graphical Constraints

Authors: Md Mahmudul Hasan, Shuangqing Wei, Ali Moharrer

Abstract: The Constrained Minimum Determinant Factor Analysis (CMDFA) setting was motivated by Wyner's common information problem where we seek a latent representation of a given Gaussian vector distribution with the minimum mutual information under certain generative constraints. In this paper, we explore the algebraic structures of the solution space of the CMDFA, when the underlying covariance matrix $\Sigma_x$ has an additional latent graphical constraint, namely, a latent star topology. In particular, sufficient and necessary conditions in terms of the relationships between edge weights of the star graph have been found. Under such conditions and constraints, we have shown that the CMDFA problem has either a rank one solution or a rank $n-1$ solution where $n$ is the dimension of the observable vector. Further results are given regarding the solution to the CMDFA with $n-1$ latent factors.

Submitted 18 January, 2019; originally announced January 2019.

Comments: 9 pages, 2 figures. arXiv admin note: substantial text overlap with arXiv:1701.05952

arXiv:1812.10595 [pdf, other]

Deep Learning based Early Detection and Grading of Diabetic Retinopathy Using Retinal Fundus Images

Authors: Sheikh Muhammad Saiful Islam, Md Mahedi Hasan, Sohaib Abdullah

Abstract: Diabetic Retinopathy (DR) is a constantly deteriorating disease, being one of the leading causes of vision impairment and blindness. Subtle distinctions among different grades and the existence of many significant small features make the task of recognition very challenging. In addition, the present approach of retinopathy detection is a very laborious and time-intensive task, which heavily relies on the skill of a physician. Automated detection of diabetic retinopathy is essential to tackle these problems. Early-stage detection of diabetic retinopathy is also very important for diagnosis, which can prevent blindness with proper treatment. In this paper, we developed a novel deep convolutional neural network, which performs the early-stage detection by identifying all microaneurysms (MAs), the first signs of DR, along with correctly assigning labels to retinal fundus images which are graded into five categories. We have tested our network on the largest publicly available Kaggle diabetic retinopathy dataset, and achieved a quadratic weighted kappa score of 0.851 and an AUC score of 0.844, which is state-of-the-art performance on severity grading. In the early-stage detection, we have achieved a sensitivity of 98% and specificity of above 94%, which demonstrates the effectiveness of our proposed method. Our proposed architecture is at the same time very simple and efficient with respect to computational time and space.

Submitted 26 December, 2018; originally announced December 2018.

Comments: Accepted in MIND 2019
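
The quadratic weighted kappa score reported in the abstract above is a standard agreement measure for ordinal labels such as severity grades. The following is a minimal pure-Python sketch of the usual definition, with hypothetical function and variable names and made-up labels; it is not the authors' evaluation code and none of the values below come from the paper.

```python
def quadratic_weighted_kappa(rater_a, rater_b, num_classes):
    """Quadratic weighted kappa between two integer label sequences in [0, num_classes).

    Assumes at least one pair of distinct classes has non-zero expected count,
    otherwise the denominator is zero.
    """
    n = len(rater_a)
    # Observed confusion matrix: observed[i][j] counts items labelled i by A and j by B.
    observed = [[0] * num_classes for _ in range(num_classes)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1
    hist_a = [sum(row) for row in observed]
    hist_b = [sum(observed[i][j] for i in range(num_classes)) for j in range(num_classes)]

    numerator = 0.0
    denominator = 0.0
    for i in range(num_classes):
        for j in range(num_classes):
            weight = (i - j) ** 2 / (num_classes - 1) ** 2  # quadratic penalty
            expected = hist_a[i] * hist_b[j] / n            # count expected by chance
            numerator += weight * observed[i][j]
            denominator += weight * expected
    return 1.0 - numerator / denominator


# Made-up labels on a five-grade scale (0 to 4).
perfect = quadratic_weighted_kappa([0, 1, 2, 3, 4], [0, 1, 2, 3, 4], 5)
# Two classes, agreement no better than chance.
chance = quadratic_weighted_kappa([0, 0, 1, 1], [0, 1, 0, 1], 2)
```

Because the penalty grows with the squared distance between grades, confusing grade 0 with grade 4 costs far more than confusing adjacent grades, which suits ordinal severity scales.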

arXiv:1811.00053 [pdf, other]

DEEPGONET: Multi-label Prediction of GO Annotation for Protein from Sequence Using Cascaded Convolutional and Recurrent Network

Authors: Sheikh Muhammad Saiful Islam, Md Mahedi Hasan

Abstract: The gap between the amount of available protein sequence, driven by the development of next generation sequencing (NGS) technology, and the slow and expensive experimental extraction of useful information, such as annotation of protein sequences in different functional aspects, is ever widening. It can be reduced by employing automatic function prediction (AFP) approaches. Gene Ontology (GO), comprising more than 40,000 classes, defines three aspects of protein function, namely Biological Process (BP), Cellular Component (CC), and Molecular Function (MF). The multiple functions of a single protein have made automatic function prediction a large-scale, multi-class, multi-label task. In this paper, we present DEEPGONET, a novel cascaded convolutional and recurrent neural network, to predict the top-level hierarchy of GO ontology. The network takes the primary sequence of a protein as input, which makes it more useful than other prevailing state-of-the-art deep learning based methods whose multi-modal input makes them less applicable for proteins where only the primary sequence is available. All predictions of different protein functions of our network are performed by the same architecture, a proof of better generalization as demonstrated by promising performance on a variety of organisms while trained on Homo sapiens only, which is made possible by efficient exploration of the vast output space by leveraging the hierarchical relationship among GO classes.
The promising performance of our model makes it a potential avenue for directing experimental exploration of protein functions efficiently, by eliminating most possible routes and exploring only the routes suggested by our model. Our proposed model is also very simple and efficient in terms of computational time and space compared to other architectures in the literature.

Submitted 31 October, 2018; originally announced November 2018.

Comments: Accepted in ICCIT 2018

arXiv:1809.07793 [pdf]

Survey on Error Concealment Strategies and Subjective Testing of 3D Videos

Authors: Md Mehedi Hasan, Michael Frater, John Arnold

Abstract: Over the last decade, different technologies to visualize 3D scenes have been introduced and improved. These technologies include stereoscopic, multi-view, integral imaging and holographic types. Despite increasing consumer interest, poor image quality, crosstalk or side effects of 3D displays, and the lack of defined broadcast standards have hampered the advancement of 3D displays to the mass consumer market. Also, real-time transmission of 3DTV sequences over packet-based networks may result in visual quality degradation due to packet loss and other factors. In conventional 2D videos, different extrapolation and directional interpolation strategies have been used for concealing the missing blocks, but in 3D this is still an emerging field of research. Few studies have been carried out to define the assessment methods of stereoscopic images and videos. From an industrial and commercial perspective, however, subjective quality evaluation is the most direct way to evaluate human perception of 3DTV systems. This paper reviews the state-of-the-art error concealment strategies and the subjective evaluation of 3D videos and proposes a low complexity frame loss concealment method for the video decoder. Subjective testing on videos from prominent datasets and comparison with existing concealment methods show that the proposed method conceals errors in stereoscopic videos efficiently in terms of computation time, comfort and distortion.

Submitted 29 August, 2018; originally announced September 2018.

arXiv:1809.07792 [pdf, ps, other]

Binocular Rivalry - Psychovisual Challenge in Stereoscopic Video Error Concealment

Authors: Md Mehedi Hasan, John F. Arnold, Michael R. Frater

Abstract: During Stereoscopic 3D (S3D) video transmission, one or both views can be affected by bit errors and packet losses caused by adverse channel conditions, delay or jitter. Typically, the Human Visual System (HVS) is incapable of aligning and fusing stereoscopic content if one view is affected by artefacts caused by compression, transmission and rendering. The distorted patterns are perceived as alterations of the original, which presents a shimmering effect known as binocular rivalry and is detrimental to a user's Quality of Experience (QoE). This study attempts to quantify the effects of binocular rivalry for stereoscopic videos. Existing approaches, in which one or more frames lost in one or both views undergo error concealment, are implemented. Then, subjective testing is carried out on the error concealed 3D video sequences. The evaluations provided by these subjects were then combined and analysed using a standard Student t-test, thus quantifying the impact of binocular rivalry and allowing the impact to be compared with that of monocular viewing. The main focus is implementing error-resilient video communication, avoiding the detrimental effects of binocular rivalry and improving the overall QoE of viewers.

Submitted 28 August, 2018; originally announced September 2018.

Comments: 11 pages, 9 Figures

arXiv:1808.10086 [pdf, other]

Artifacts Detection and Error Block Analysis from Broadcasted Videos

Authors: Md Mehedi Hasan, Tasneem Rahman, Kiok Ahn, Oksam Chae

Abstract: With the advancement of IPTV and HDTV technology, previously subtle errors in videos are now becoming more prominent because of structure-oriented and compression-based artifacts. In this paper, we focus on the development of a real-time video quality check system. Lightweight edge gradient magnitude information is incorporated to acquire the statistical information, and the distorted frames are then estimated based on the characteristics of their surrounding frames. Then we apply the prominent texture patterns to classify them into different block errors and analyze them not only for video error detection but also for error concealment, restoration and retrieval. Finally, experiments on prominent datasets and broadcasted videos show that the proposed algorithm detects errors efficiently for video broadcast and surveillance applications in terms of computation time and analysis of distorted frames.

Submitted 29 August, 2018; originally announced August 2018.

arXiv:1804.02533 [pdf, other]

doi 10.1145/3197231.3197234

MobiCoMonkey - Context Testing of Android Apps

Authors: Amit Seal Ami, Md. Mehedi Hasan, Md. Rayhanur Rahman, Kazi Sakib

Abstract: The functionality of many mobile applications is dependent on various contextual, external factors. Depending on unforeseen scenarios, mobile apps can even malfunction or crash. In this paper, we have introduced MobiCoMonkey - automated tool that allows a developer to test app against custom or auto generated contextual scenarios and help detect possible bugs through the emulator. Moreover, it reports the connection between the bugs and contextual factors so that the bugs can later be reproduced. It utilizes the tools offered by Android SDK and logcat to inject events and capture traces of the app execution.

Submitted 7 April, 2018; originally announced April 2018.

Comments: 4 pages

MSC Class: 68N01

arXiv:1803.03143 [pdf, ps, other]

Efficient method for fractional Lévy-Feller advection-dispersion equation using Jacobi polynomials

Authors: N. H. Sweilam, M. M. Abou Hasan

Abstract: In this paper, a novel formula expressing explicitly the fractional-order derivatives, in the sense of Riesz-Feller operator, of Jacobi polynomials is presented. Jacobi spectral collocation method together with trapezoidal rule are used to reduce the fractional Lévy-Feller advection-dispersion equation (LFADE) to a system of algebraic equations which greatly simplifies solving like this fractional differential equation. Numerical simulations with some comparisons are introduced to confirm the effectiveness and reliability of the proposed technique for the Lévy-Feller fractional partial differential equations.

Submitted 29 March, 2018; v1 submitted 8 March, 2018; originally announced March 2018.

Comments: 23 pages, 4 figures

arXiv:1801.03481 [pdf, ps, other]

Latent Factor Analysis of Gaussian Distributions under Graphical Constraints

Authors: Md Mahmudul Hasan, Shuangqing Wei, Ali Moharrer

Abstract: In this paper, we explore the algebraic structures of solution spaces for Gaussian latent factor analysis when the population covariance matrix $Σ_x$ has an additional latent graphical constraint, namely, a latent star topology. In particular, we give sufficient and necessary conditions under which the solutions to constrained minimum trace factor analysis (CMTFA) is still star. We further show that the solution to CMTFA under the star constraint can only have two cases, i.e. the number of latent variable can be only one (star) or $n-1$ where $n$ is the dimension of the observable vector, and characterize the solution for both the cases.

Submitted 9 July, 2018; v1 submitted 10 January, 2018; originally announced January 2018.

Comments: 7 pages

Showing 1–50 of 65 results for author: Hasan, M M