Search | arXiv e-print repository

Wind Power Prediction across Different Locations using Deep Domain Adaptive Learning

Authors: Md Saiful Islam Sajol, Md Shazid Islam, A S M Jahid Hasan, Md Saydur Rahman, Jubair Yusuf

Abstract: Accurate prediction of wind power is essential for the grid integration of this intermittent renewable source and aiding grid planners in forecasting available wind capacity. Spatial differences lead to discrepancies in climatological data distributions between two geographically dispersed regions, consequently making the prediction task more difficult. Thus, a prediction model that learns from th… ▽ More Accurate prediction of wind power is essential for the grid integration of this intermittent renewable source and aiding grid planners in forecasting available wind capacity. Spatial differences lead to discrepancies in climatological data distributions between two geographically dispersed regions, consequently making the prediction task more difficult. Thus, a prediction model that learns from the data of a particular climatic region can suffer from being less robust. A deep neural network (DNN) based domain adaptive approach is proposed to counter this drawback. Effective weather features from a large set of weather parameters are selected using a random forest approach. A pre-trained model from the source domain is utilized to perform the prediction task, assuming no source data is available during target domain prediction. The weights of only the last few layers of the DNN model are updated throughout the task, kee** the rest of the network unchanged, making the model faster compared to the traditional approaches. The proposed approach demonstrates higher accuracy ranging from 6.14% to even 28.44% compared to the traditional non-adaptive method. △ Less

Submitted 18 May, 2024; originally announced May 2024.

arXiv:2401.14422 [pdf, other]

Location Agnostic Source-Free Domain Adaptive Learning to Predict Solar Power Generation

Authors: Md Shazid Islam, A S M Jahid Hasan, Md Saydur Rahman, Jubair Yusuf, Md Saiful Islam Sajol, Farhana Akter Tumpa

Abstract: The prediction of solar power generation is a challenging task due to its dependence on climatic characteristics that exhibit spatial and temporal variability. The performance of a prediction model may vary across different places due to changes in data distribution, resulting in a model that works well in one region but not in others. Furthermore, as a consequence of global warming, there is a no… ▽ More The prediction of solar power generation is a challenging task due to its dependence on climatic characteristics that exhibit spatial and temporal variability. The performance of a prediction model may vary across different places due to changes in data distribution, resulting in a model that works well in one region but not in others. Furthermore, as a consequence of global warming, there is a notable acceleration in the alteration of weather patterns on an annual basis. This phenomenon introduces the potential for diminished efficacy of existing models, even within the same geographical region, as time progresses. In this paper, a domain adaptive deep learning-based framework is proposed to estimate solar power generation using weather features that can solve the aforementioned challenges. A feed-forward deep convolutional network model is trained for a known location dataset in a supervised manner and utilized to predict the solar power of an unknown location later. This adaptive data-driven approach exhibits notable advantages in terms of computing speed, storage efficiency, and its ability to improve outcomes in scenarios where state-of-the-art non-adaptive methods fail. Our method has shown an improvement of $10.47 \%$, $7.44 \%$, $5.11\%$ in solar power prediction accuracy compared to best performing non-adaptive method for California (CA), Florida (FL) and New York (NY), respectively. △ Less

Submitted 6 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

arXiv:2311.14012 [pdf, other]

Shadow: A Novel Loss Function for Efficient Training in Siamese Networks

Authors: Alif Elham Khan, Mohammad Junayed Hasan, Humayra Anjum, Nabeel Mohammed

Abstract: Despite significant recent advances in similarity detection tasks, existing approaches pose substantial challenges under memory constraints. One of the primary reasons for this is the use of computationally expensive metric learning loss functions such as Triplet Loss in Siamese networks. In this paper, we present a novel loss function called Shadow Loss that compresses the dimensions of an embedd… ▽ More Despite significant recent advances in similarity detection tasks, existing approaches pose substantial challenges under memory constraints. One of the primary reasons for this is the use of computationally expensive metric learning loss functions such as Triplet Loss in Siamese networks. In this paper, we present a novel loss function called Shadow Loss that compresses the dimensions of an embedding space during loss calculation without loss of performance. The distance between the projections of the embeddings is learned from inputs on a compact projection space where distances directly correspond to a measure of class similarity. Projecting on a lower-dimension projection space, our loss function converges faster, and the resulting classified image clusters have higher inter-class and smaller intra-class distances. Shadow Loss not only reduces embedding dimensions favoring memory constraint devices but also consistently performs better than the state-of-the-art Triplet Margin Loss by an accuracy of 5\%-10\% across diverse datasets. The proposed loss function is also model agnostic, upholding its performance across several tested models. Its effectiveness and robustness across balanced, imbalanced, medical, and non-medical image datasets suggests that it is not specific to a particular model or dataset but demonstrates superior performance consistently while using less memory and computation. △ Less

Submitted 23 November, 2023; originally announced November 2023.

arXiv:2311.13810 [pdf, other]

Bridging Classical and Quantum Machine Learning: Knowledge Transfer From Classical to Quantum Neural Networks Using Knowledge Distillation

Authors: Mohammad Junayed Hasan, M. R. C. Mahdy

Abstract: Very recently, studies have shown that quantum neural networks surpass classical neural networks in tasks like image classification when a similar number of learnable parameters are used. However, the development and optimization of quantum models are currently hindered by issues such as qubit instability and limited qubit availability, leading to error-prone systems with weak performance. In cont… ▽ More Very recently, studies have shown that quantum neural networks surpass classical neural networks in tasks like image classification when a similar number of learnable parameters are used. However, the development and optimization of quantum models are currently hindered by issues such as qubit instability and limited qubit availability, leading to error-prone systems with weak performance. In contrast, classical models can exhibit high-performance owing to substantial resource availability. As a result, more studies have been focusing on hybrid classical-quantum integration. A line of research particularly focuses on transfer learning through classical-quantum integration or quantum-quantum approaches. Unlike previous studies, this paper introduces a new method to transfer knowledge from classical to quantum neural networks using knowledge distillation, effectively bridging the gap between classical machine learning and emergent quantum computing techniques. We adapt classical convolutional neural network (CNN) architectures like LeNet and AlexNet to serve as teacher networks, facilitating the training of student quantum models by sending supervisory signals during backpropagation through KL-divergence. The approach yields significant performance improvements for the quantum models by solely depending on classical CNNs, with quantum models achieving an average accuracy improvement of 0.80% on the MNIST dataset and 5.40% on the more complex Fashion MNIST dataset. Applying this technique eliminates the cumbersome training of huge quantum models for transfer learning in resource-constrained settings and enables re-using existing pre-trained classical models to improve performance.Thus, this study paves the way for future research in quantum machine learning (QML) by positioning knowledge distillation as a core technique for advancing QML applications. △ Less

Submitted 23 November, 2023; originally announced November 2023.

Comments: 19 pages, 7 figures and 17 equations

arXiv:2311.01571 [pdf, other]

Preserving the knowledge of long clinical texts using aggregated ensembles of large language models

Authors: Mohammad Junayed Hasan, Suhra Noor, Mohammad Ashrafuzzaman Khan

Abstract: Clinical texts, such as admission notes, discharge summaries, and progress notes, contain rich and valuable information that can be used for various clinical outcome prediction tasks. However, applying large language models, such as BERT-based models, to clinical texts poses two major challenges: the limitation of input length and the diversity of data sources. This paper proposes a novel method t… ▽ More Clinical texts, such as admission notes, discharge summaries, and progress notes, contain rich and valuable information that can be used for various clinical outcome prediction tasks. However, applying large language models, such as BERT-based models, to clinical texts poses two major challenges: the limitation of input length and the diversity of data sources. This paper proposes a novel method to preserve the knowledge of long clinical texts using aggregated ensembles of large language models. Unlike previous studies which use model ensembling or text aggregation methods separately, we combine ensemble learning with text aggregation and train multiple large language models on two clinical outcome tasks: mortality prediction and length of stay prediction. We show that our method can achieve better results than baselines, ensembling, and aggregation individually, and can improve the performance of large language models while handling long inputs and diverse datasets. We conduct extensive experiments on the admission notes from the MIMIC-III clinical database by combining multiple unstructured and high-dimensional datasets, demonstrating our method's effectiveness and superiority over existing approaches. We also provide a comprehensive analysis and discussion of our results, highlighting our method's applications and limitations for future research in the domain of clinical healthcare. The results and analysis of this study is supportive of our method assisting in clinical healthcare systems by enabling clinical decision-making with robust performance overcoming the challenges of long text inputs and varied datasets. △ Less

Submitted 2 November, 2023; originally announced November 2023.

Comments: 17 pages, 4 figures, 4 tables, 9 equations and 1 algorithm

ACM Class: I.2.7

arXiv:2310.13483 [pdf]

Application of deep learning for livestock behaviour recognition: A systematic literature review

Authors: Ali Rohan, Muhammad Saad Rafaq, Md. Junayed Hasan, Furqan Asghar, Ali Kashif Bashir, Tania Dottorini

Abstract: Livestock health and welfare monitoring has traditionally been a labor-intensive task performed manually. Recent advances have led to the adoption of AI and computer vision techniques, particularly deep learning models, as decision-making tools within the livestock industry. These models have been employed for tasks like animal identification, tracking, body part recognition, and species classific… ▽ More Livestock health and welfare monitoring has traditionally been a labor-intensive task performed manually. Recent advances have led to the adoption of AI and computer vision techniques, particularly deep learning models, as decision-making tools within the livestock industry. These models have been employed for tasks like animal identification, tracking, body part recognition, and species classification. In the past decade, there has been a growing interest in using these models to explore the connection between livestock behaviour and health issues. While previous review studies have been rather generic, there is currently no review study specifically focusing on DL for livestock behaviour recognition. Hence, this systematic literature review (SLR) was conducted. The SLR involved an initial search across electronic databases, resulting in 1101 publications. After applying defined selection criteria, 126 publications were shortlisted. These publications were further filtered based on quality criteria, resulting in the selection of 44 high-quality primary studies. These studies were analysed to address the research questions. The results showed that DL successfully addressed 13 behaviour recognition problems encompassing 44 different behaviour classes. A variety of DL models and networks were employed, with CNN, Faster R-CNN, YOLOv5, and YOLOv4 being among the most common models, and VGG16, CSPDarknet53, GoogLeNet, ResNet101, and ResNet50 being popular networks. Performance evaluation involved ten different matrices, with precision and accuracy being the most frequently used. Primary studies identified challenges, including occlusion, adhesion, data imbalance, and the complexities of the livestock environment. The SLR study also discussed potential solutions and research directions to facilitate the development of autonomous livestock behaviour recognition systems. △ Less

Submitted 20 October, 2023; originally announced October 2023.

arXiv:2308.03631 [pdf]

Segmentation Framework for Heat Loss Identification in Thermal Images: Empowering Scottish Retrofitting and Thermographic Survey Companies

Authors: Md Junayed Hasan, Eyad Elyan, Yijun Yan, **chang Ren, Md Mostafa Kamal Sarker

Abstract: Retrofitting and thermographic survey (TS) companies in Scotland collaborate with social housing providers to tackle fuel poverty. They employ ground-level infrared (IR) camera-based-TSs (GIRTSs) for collecting thermal images to identi-fy the heat loss sources resulting from poor insulation. However, this identifica-tion process is labor-intensive and time-consuming, necessitating extensive data p… ▽ More Retrofitting and thermographic survey (TS) companies in Scotland collaborate with social housing providers to tackle fuel poverty. They employ ground-level infrared (IR) camera-based-TSs (GIRTSs) for collecting thermal images to identi-fy the heat loss sources resulting from poor insulation. However, this identifica-tion process is labor-intensive and time-consuming, necessitating extensive data processing. To automate this, an AI-driven approach is necessary. Therefore, this study proposes a deep learning (DL)-based segmentation framework using the Mask Region Proposal Convolutional Neural Network (Mask RCNN) to validate its applicability to these thermal images. The objective of the framework is to au-tomatically identify, and crop heat loss sources caused by weak insulation, while also eliminating obstructive objects present in those images. By doing so, it min-imizes labor-intensive tasks and provides an automated, consistent, and reliable solution. To validate the proposed framework, approximately 2500 thermal imag-es were collected in collaboration with industrial TS partner. Then, 1800 repre-sentative images were carefully selected with the assistance of experts and anno-tated to highlight the target objects (TO) to form the final dataset. Subsequently, a transfer learning strategy was employed to train the dataset, progressively aug-menting the training data volume and fine-tuning the pre-trained baseline Mask RCNN. As a result, the final fine-tuned model achieved a mean average precision (mAP) score of 77.2% for segmenting the TO, demonstrating the significant po-tential of proposed framework in accurately quantifying energy loss in Scottish homes. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: 9 Pages, 3 Figures, Accepted from the conference - BICS 2023: 2023 International Conference on Brain-Inspired Cognitive Systems Kuala Lumpur, Malaysia, August 5-6, 2023 [peer-reviewed]

arXiv:2104.03498 [pdf]

A Centralized Optimization Approach for Bidirectional PEV Impacts Analysis in a Commercial Building-Integrated Microgrid

Authors: Jubair Yusuf, A S M Jahid Hasan, Luis Fernando Enriquez-Contreras, Sadrul Ula

Abstract: Building sector is the largest energy user in the United States. Conventional building energy studies mostly involve Heating, Ventilation, and Air Conditioning (HVAC), and lighting energy consumptions. Recent additions of solar Photovoltaics (PV) along with other Distributed Energy Resources (DER), particularly Plug-in Electric Vehicles (PEV) have added a new dimension to this problem and made it… ▽ More Building sector is the largest energy user in the United States. Conventional building energy studies mostly involve Heating, Ventilation, and Air Conditioning (HVAC), and lighting energy consumptions. Recent additions of solar Photovoltaics (PV) along with other Distributed Energy Resources (DER), particularly Plug-in Electric Vehicles (PEV) have added a new dimension to this problem and made it more complex. This paper presents an avant-garde framework for selecting the best charging/discharging level of PEV for a commercial building-integrated microgrid. A typical commercial building is used as a microgrid testbed incorporating all the DERs presented in a smart building. A Mixed Integer Linear Programming (MILP) problem is formulated to optimize the energy and demand cost associated with this building operation. The cost function is solved in conjunction with real data and modified to assess the bidirectional PEV impacts on the flexible building loads that are contributing factors in making energy usage decisions. Finally, the impacts of optimized DERs are investigated on a Distribution System (DS) to show the necessity of a holistic approach for selecting the suitable PEV strategies. The results show that bidirectional fast PEV activities can provide higher cost reduction and less voltage deviation in comparison to slow PEV activities. △ Less

Submitted 8 April, 2021; originally announced April 2021.

Comments: 38 pages single column double spaced, 11 figures, 5 tables, preprint

arXiv:2103.09381 [pdf]

A Comprehensive Optimization Method for Commercial Building Loads with Renewable Generation and Energy Storage from Utility Rate Structure Perspective

Authors: A S M Jahid Hasan, Jubair Yusuf, Sadrul Ula

Abstract: To accommodate the changes in the nature and pattern of electricity consumption with the available resources, utility companies have introduced a variety of rate structures over the years. This paper develops a comprehensive optimization method that addresses the diversity of utility rate structure of a commercial building. It includes a general set of constraints that can be used for any system w… ▽ More To accommodate the changes in the nature and pattern of electricity consumption with the available resources, utility companies have introduced a variety of rate structures over the years. This paper develops a comprehensive optimization method that addresses the diversity of utility rate structure of a commercial building. It includes a general set of constraints that can be used for any system with a building load, a renewable source, and a battery energy storage system (BESS). A cost function is formulated for each type of rate structure that can be exercised by a utility on a commercial building. A novel algorithm is developed to apply the optimization model and generate the desired optimal outputs by using the appropriate cost function. The results for several building loads and rate structure types were obtained and compared. A sensitivity analysis was done on the optimization model based on the changes in the rates using historical data. The results exhibit that adding BESS is more effective for buildings with lower load factor and CPP rate structures in comparison to the buildings with flat energy rates. Savings from adding renewables such as solar is primarily influenced by energy charges whereas additional benefits from BESS are dominated by demand charges. These results can help a customer with deciding on the different rate structure options and resource planning of their renewable generation and energy storage. Utilities may also benefit from this work by designing a unified rate structure, considering the increasing renewable penetration and BESS deployment in the grid. △ Less

Submitted 19 April, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

Showing 1–9 of 9 results for author: Hasan, M J