Search | arXiv e-print repository

C-MP: A decentralized adaptive-coordinated traffic signal control using the Max Pressure framework

Authors: Tanveer Ahmed, Hao Liu, Vikash V. Gayah

Abstract: Coordinated traffic signals seek to provide uninterrupted flow through a series of closely spaced intersections, typically using pre-defined fixed signal timings and offsets. Adaptive traffic signals dynamically change signal timings based on observed traffic conditions in a way that might disrupt coordinated movements, particularly when these decisions are made independently at each intersection.… ▽ More Coordinated traffic signals seek to provide uninterrupted flow through a series of closely spaced intersections, typically using pre-defined fixed signal timings and offsets. Adaptive traffic signals dynamically change signal timings based on observed traffic conditions in a way that might disrupt coordinated movements, particularly when these decisions are made independently at each intersection. To alleviate this issue, this paper introduces a novel Max Pressure-based traffic signal framework that can provide coordination even under decentralized decision-making. The proposed Coordinated Max Pressure (C-MP) algorithm uses the space mean speeds of vehicles to explicitly detect freely flowing platoons of vehicles and prioritizes their movement along a corridor. Specifically, upstream platoons are detected and their weight in the MP framework increased to provide priority, while downstream platoons are detected and their weight reduced to ensure smooth traffic flow across corridors. The study analytically proves that C-MP maintains the desirable maximum stability property, while micro-simulation analyses conducted on an arterial network demonstrate its ability to achieve a larger stable region compared to benchmark MP control policies. Simulation results also reveal that the proposed control algorithm can effectively coordinate traffic signals in both directions along an arterial without explicitly assigned offsets or constraints. The results also reveal C-MP's superiority to benchmark coordination strategies in reducing travel time, and fuel consumption both at the corridor level and the network level by balancing the negative impact imparted to vehicles in the minor direction. △ Less

Submitted 1 July, 2024; originally announced July 2024.

Comments: Submitted to Transportation Research Part C: Emerging Technologies

arXiv:2407.00828 [pdf]

doi 10.1109/VTC2023-Spring57618.2023.10199400

DRL-Based RAT Selection in a Hybrid Vehicular Communication Network

Authors: Badreddine Yacine Yacheur, Toufik Ahmed, Mohamed Mosbah

Abstract: Cooperative intelligent transport systems rely on a set of Vehicle-to-Everything (V2X) applications to enhance road safety. Emerging new V2X applications like Advanced Driver Assistance Systems (ADASs) and Connected Autonomous Driving (CAD) applications depend on a significant amount of shared data and require high reliability, low end-to-end (E2E) latency, and high throughput. However, present V2… ▽ More Cooperative intelligent transport systems rely on a set of Vehicle-to-Everything (V2X) applications to enhance road safety. Emerging new V2X applications like Advanced Driver Assistance Systems (ADASs) and Connected Autonomous Driving (CAD) applications depend on a significant amount of shared data and require high reliability, low end-to-end (E2E) latency, and high throughput. However, present V2X communication technologies such as ITS-G5 and C-V2X (Cellular V2X) cannot satisfy these requirements alone. In this paper, we propose an intelligent, scalable hybrid vehicular communication architecture that leverages the performance of multiple Radio Access Technologies (RATs) to meet the needs of these applications. Then, we propose a communication mode selection algorithm based on Deep Reinforcement Learning (DRL) to maximize the network's reliability while limiting resource consumption. Finally, we assess our work using the platooning scenario that requires high reliability. Numerical results reveal that the hybrid vehicular communication architecture has the potential to enhance the packet reception rate (PRR) by up to 30% compared to both the static RAT selection strategy and the multi-criteria decision-making (MCDM) selection algorithm. Additionally, it improves the efficiency of the redundant communication mode by 20% regarding resource consumption △ Less

Submitted 3 April, 2024; originally announced July 2024.

Journal ref: 2023 IEEE 97th Vehicular Technology Conference (VTC2023-Spring), Jun 2023, Florence, Italy. pp.1-5

arXiv:2406.19269 [pdf, other]

OCC-MP: A Max-Pressure framework to prioritize transit and high occupancy vehicles

Authors: Tanveer Ahmed, Hao Liu, Vikash V. Gayah

Abstract: Max-pressure (MP) is a decentralized adaptive traffic signal control approach that has been shown to maximize throughput for private vehicles. However, MP-based signal control algorithms do not differentiate the movement of transit vehicles from private vehicles or between high and single-occupancy private vehicles. Prioritizing the movement of transit or other high occupancy vehicles (HOVs) is vi… ▽ More Max-pressure (MP) is a decentralized adaptive traffic signal control approach that has been shown to maximize throughput for private vehicles. However, MP-based signal control algorithms do not differentiate the movement of transit vehicles from private vehicles or between high and single-occupancy private vehicles. Prioritizing the movement of transit or other high occupancy vehicles (HOVs) is vital to reduce congestion and improve the reliability and efficiency of transit operations. This study proposes OCC-MP: a novel MP-based algorithm that considers both vehicle queues and passenger occupancies in computing the weights of movements. By weighing movements with higher passenger occupancies more heavily, transit and other HOVs are implicitly provided with priority, while accounting for any negative impacts of that priority on single occupancy vehicles. And, unlike rule-based transit signal priority (TSP) strategies, OCC-MP more naturally also accommodates conflicting transit routes at a signalized intersection and facilitates their movement, even in mixed traffic without dedicated lanes. Simulations on a grid network under varying demands and transit configurations demonstrate the effectiveness of OCC-MP at providing TSP while simultaneously reducing the negative impact imparted onto lower occupancy private vehicles. Furthermore, OCC-MP is shown to have a larger stable region for demand compared to rule-based TSP strategies integrated into the MP framework. The performance of OCC-MP is also shown to be robust to errors in passenger occupancy information from transit vehicles and can be applied when passenger occupancies of private vehicles are not available. Finally, OCC-MP can be applied in a partially connected vehicle (CV) environment when a subset of vehicles is able to provide information to the signal controller, outperforming baseline methods at low CV penetration rates. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2406.15914 [pdf, ps, other]

Three weak solutions for a $(p, q)$-Schrödinger-Kirchhoff type equation

Authors: Ahmed Ahmed- Taghi Ahmedatt- Aberqi Ahmed

Abstract: In this manuscript, we investigate a $(p, q)$-Schrödinger-Kirchhoff equation involving a continuous positive potential that meets the del Pino-Felmer type conditions. Using Recceri's classical variational approach, we prove the existence of three weak solutions. In this manuscript, we investigate a $(p, q)$-Schrödinger-Kirchhoff equation involving a continuous positive potential that meets the del Pino-Felmer type conditions. Using Recceri's classical variational approach, we prove the existence of three weak solutions. △ Less

Submitted 22 June, 2024; originally announced June 2024.

Comments: 15 pages, 0 figures

MSC Class: 35Q55; 35J60; 35D30; 35J20

arXiv:2406.06651 [pdf, other]

Short-Term Electricity Demand Forecasting of Dhaka City Using CNN with Stacked BiLSTM

Authors: Kazi Fuad Bin Akhter, Sadia Mobasshira, Saief Nowaz Haque, Mahjub Alam Khan Hesham, Tanvir Ahmed

Abstract: The precise forecasting of electricity demand also referred to as load forecasting, is essential for both planning and managing a power system. It is crucial for many tasks, including choosing which power units to commit to, making plans for future power generation capacity, enhancing the power network, and controlling electricity consumption. As Bangladesh is a develo** country, the electricity… ▽ More The precise forecasting of electricity demand also referred to as load forecasting, is essential for both planning and managing a power system. It is crucial for many tasks, including choosing which power units to commit to, making plans for future power generation capacity, enhancing the power network, and controlling electricity consumption. As Bangladesh is a develo** country, the electricity infrastructure is critical for economic growth and employment in this country. Accurate forecasting of electricity demand is crucial for ensuring that this country has a reliable and sustainable electricity supply to meet the needs of its growing population and economy. The complex and nonlinear behavior of such energy systems inhibits the creation of precise algorithms. Within this context, this paper aims to propose a hybrid model of Convolutional Neural Network (CNN) and stacked Bidirectional Long-short Term Memory (BiLSTM) architecture to perform an accurate short-term forecast of the electricity demand of Dhaka city. Short-term forecasting is ordinarily done to anticipate load for the following few hours to a few weeks. Normalization techniques have been also investigated because of the sensitivity of these models towards the input range. The proposed approach produced the best prediction results in comparison to the other benchmark models (LSTM, CNN- BiLSTM and CNN-LSTM) used in the study, with MAPE 1.64%, MSE 0.015, RMSE 0.122 and MAE 0.092. The result of the proposed model also outperformed some of the existing works on load-forecasting. △ Less

Submitted 10 June, 2024; originally announced June 2024.

arXiv:2405.18894 [pdf, other]

Few-Shot Testing: Estimating Uncertainty of Memristive Deep Neural Networks Using One Bayesian Test Vector

Authors: Soyed Tuhin Ahmed, Mehdi Tahoori

Abstract: The performance of deep learning algorithms such as neural networks (NNs) has increased tremendously recently, and they can achieve state-of-the-art performance in many domains. However, due to memory and computation resource constraints, implementing NNs on edge devices is a challenging task. Therefore, hardware accelerators such as computation-in-memory (CIM) with memristive devices have been de… ▽ More The performance of deep learning algorithms such as neural networks (NNs) has increased tremendously recently, and they can achieve state-of-the-art performance in many domains. However, due to memory and computation resource constraints, implementing NNs on edge devices is a challenging task. Therefore, hardware accelerators such as computation-in-memory (CIM) with memristive devices have been developed to accelerate the most common operations, i.e., matrix-vector multiplication. However, due to inherent device properties, external environmental factors such as temperature, and an immature fabrication process, memristors suffer from various non-idealities, including defects and variations occurring during manufacturing and runtime. Consequently, there is a lack of complete confidence in the predictions made by the model. To improve confidence in NN predictions made by hardware accelerators in the presence of device non-idealities, in this paper, we propose a Bayesian test vector generation framework that can estimate the model uncertainty of NNs implemented on memristor-based CIM hardware. Compared to the conventional point estimate test vector generation method, our method is more generalizable across different model dimensions and requires storing only one test Bayesian vector in the hardware. Our method is evaluated on different model dimensions, tasks, fault rates, and variation noise to show that it can consistently achieve $100\%$ coverage with only $0.024$ MB of memory overhead. △ Less

Submitted 29 May, 2024; originally announced May 2024.

arXiv:2405.13843 [pdf]

Hyperspectral Image Reconstruction for Predicting Chick Embryo Mortality Towards Advancing Egg and Hatchery Industry

Authors: Md. Toukir Ahmed, Md Wadud Ahmed, Ocean Monjur, Jason Lee Emmert, Girish Chowdhary, Mohammed Kamruzzaman

Abstract: As the demand for food surges and the agricultural sector undergoes a transformative shift towards sustainability and efficiency, the need for precise and proactive measures to ensure the health and welfare of livestock becomes paramount. In the context of the broader agricultural landscape outlined, the application of Hyperspectral Imaging (HSI) takes on profound significance. HSI has emerged as… ▽ More As the demand for food surges and the agricultural sector undergoes a transformative shift towards sustainability and efficiency, the need for precise and proactive measures to ensure the health and welfare of livestock becomes paramount. In the context of the broader agricultural landscape outlined, the application of Hyperspectral Imaging (HSI) takes on profound significance. HSI has emerged as a cutting-edge, non-destructive technique for fast and accurate egg quality analysis, including the detection of chick embryo mortality. However, the high cost and operational complexity compared to conventional RGB imaging are significant bottlenecks in the widespread adoption of HSI technology. To overcome these hurdles and unlock the full potential of HSI, a promising solution is hyperspectral image reconstruction from standard RGB images. This study aims to reconstruct hyperspectral images from RGB images for non-destructive early prediction of chick embryo mortality. Firstly, the performance of different image reconstruction algorithms, such as HRNET, MST++, Restormer, and EDSR were compared to reconstruct the hyperspectral images of the eggs in the early incubation period. Later, the reconstructed spectra were used to differentiate live from dead chick-producing eggs using the XGBoost and Random Forest classification methods. Among the reconstruction methods, HRNET showed impressive reconstruction performance with MRAE of 0.0955, RMSE of 0.0159, and PSNR of 36.79 dB. This study motivated that harnessing imaging technology integrated with smart sensors and data analytics has the potential to improve automation, enhance biosecurity, and optimize resource management towards sustainable agriculture 4.0. △ Less

Submitted 22 May, 2024; originally announced May 2024.

Comments: Under review

arXiv:2405.13331 [pdf]

Comparative Analysis of Hyperspectral Image Reconstruction Using Deep Learning for Agricultural and Biological Applications

Authors: Md. Toukir Ahmed, Arthur Villordon, Mohammed Kamruzzaman

Abstract: Hyperspectral imaging (HSI) has become a key technology for non-invasive quality evaluation in various fields, offering detailed insights through spatial and spectral data. Despite its efficacy, the complexity and high cost of HSI systems have hindered their widespread adoption. This study addressed these challenges by exploring deep learning-based hyperspectral image reconstruction from RGB (Red,… ▽ More Hyperspectral imaging (HSI) has become a key technology for non-invasive quality evaluation in various fields, offering detailed insights through spatial and spectral data. Despite its efficacy, the complexity and high cost of HSI systems have hindered their widespread adoption. This study addressed these challenges by exploring deep learning-based hyperspectral image reconstruction from RGB (Red, Green, Blue) images, particularly for agricultural products. Specifically, different hyperspectral reconstruction algorithms, such as Hyperspectral Convolutional Neural Network - Dense (HSCNN-D), High-Resolution Network (HRNET), and Multi-Scale Transformer Plus Plus (MST++), were compared to assess the dry matter content of sweet potatoes. Among the tested reconstruction methods, HRNET demonstrated superior performance, achieving the lowest mean relative absolute error (MRAE) of 0.07, root mean square error (RMSE) of 0.03, and the highest peak signal-to-noise ratio (PSNR) of 32.28 decibels (dB). Some key features were selected using the genetic algorithm (GA), and their importance was interpreted using explainable artificial intelligence (XAI). Partial least squares regression (PLSR) models were developed using the RGB, reconstructed, and ground truth (GT) data. The visual and spectra quality of these reconstructed methods was compared with GT data, and predicted maps were generated. The results revealed the prospect of deep learning-based hyperspectral image reconstruction as a cost-effective and efficient quality assessment tool for agricultural and biological applications. △ Less

Submitted 2 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

Comments: Under review

arXiv:2405.12313 [pdf]

Deep learning-based hyperspectral image reconstruction for quality assessment of agro-product

Authors: Md. Toukir Ahmed, Ocean Monjur, Mohammed Kamruzzaman

Abstract: Hyperspectral imaging (HSI) has recently emerged as a promising tool for many agricultural applications; however, the technology cannot be directly used in a real-time system due to the extensive time needed to process large volumes of data. Consequently, the development of a simple, compact, and cost-effective imaging system is not possible with the current HSI systems. Therefore, the overall goa… ▽ More Hyperspectral imaging (HSI) has recently emerged as a promising tool for many agricultural applications; however, the technology cannot be directly used in a real-time system due to the extensive time needed to process large volumes of data. Consequently, the development of a simple, compact, and cost-effective imaging system is not possible with the current HSI systems. Therefore, the overall goal of this study was to reconstruct hyperspectral images from RGB images through deep learning for agricultural applications. Specifically, this study used Hyperspectral Convolutional Neural Network - Dense (HSCNN-D) to reconstruct hyperspectral images from RGB images for predicting soluble solid content (SSC) in sweet potatoes. The algorithm accurately reconstructed the hyperspectral images from RGB images, with the resulting spectra closely matching the ground-truth. The partial least squares regression (PLSR) model based on reconstructed spectra outperformed the model using the full spectral range, demonstrating its potential for SSC prediction in sweet potatoes. These findings highlight the potential of deep learning-based hyperspectral image reconstruction as a low-cost, efficient tool for various agricultural uses. △ Less

Submitted 20 May, 2024; originally announced May 2024.

Comments: Under review

arXiv:2405.05286 [pdf, other]

Tiny Deep Ensemble: Uncertainty Estimation in Edge AI Accelerators via Ensembling Normalization Layers with Shared Weights

Authors: Soyed Tuhin Ahmed, Michael Hefenbrock, Mehdi B. Tahoori

Abstract: The applications of artificial intelligence (AI) are rapidly evolving, and they are also commonly used in safety-critical domains, such as autonomous driving and medical diagnosis, where functional safety is paramount. In AI-driven systems, uncertainty estimation allows the user to avoid overconfidence predictions and achieve functional safety. Therefore, the robustness and reliability of model pr… ▽ More The applications of artificial intelligence (AI) are rapidly evolving, and they are also commonly used in safety-critical domains, such as autonomous driving and medical diagnosis, where functional safety is paramount. In AI-driven systems, uncertainty estimation allows the user to avoid overconfidence predictions and achieve functional safety. Therefore, the robustness and reliability of model predictions can be improved. However, conventional uncertainty estimation methods, such as the deep ensemble method, impose high computation and, accordingly, hardware (latency and energy) overhead because they require the storage and processing of multiple models. Alternatively, Monte Carlo dropout (MC-dropout) methods, although having low memory overhead, necessitate numerous ($\sim 100$) forward passes, leading to high computational overhead and latency. Thus, these approaches are not suitable for battery-powered edge devices with limited computing and memory resources. In this paper, we propose the Tiny-Deep Ensemble approach, a low-cost approach for uncertainty estimation on edge devices. In our approach, only normalization layers are ensembled $M$ times, with all ensemble members sharing common weights and biases, leading to a significant decrease in storage requirements and latency. Moreover, our approach requires only one forward pass in a hardware architecture that allows batch processing for inference and uncertainty estimation. Furthermore, it has approximately the same memory overhead compared to a single model. Therefore, latency and memory overhead are reduced by a factor of up to $\sim M\times$. Nevertheless, our method does not compromise accuracy, with an increase in inference accuracy of up to $\sim 1\%$ and a reduction in RMSE of $17.17\%$ in various benchmark datasets, tasks, and state-of-the-art architectures. △ Less

Submitted 7 May, 2024; originally announced May 2024.

arXiv:2405.02828 [pdf, other]

Trojans in Large Language Models of Code: A Critical Review through a Trigger-Based Taxonomy

Authors: Aftab Hussain, Md Rafiqul Islam Rabin, Toufique Ahmed, Bowen Xu, Premkumar Devanbu, Mohammad Amin Alipour

Abstract: Large language models (LLMs) have provided a lot of exciting new capabilities in software development. However, the opaque nature of these models makes them difficult to reason about and inspect. Their opacity gives rise to potential security risks, as adversaries can train and deploy compromised models to disrupt the software development process in the victims' organization. This work presents… ▽ More Large language models (LLMs) have provided a lot of exciting new capabilities in software development. However, the opaque nature of these models makes them difficult to reason about and inspect. Their opacity gives rise to potential security risks, as adversaries can train and deploy compromised models to disrupt the software development process in the victims' organization. This work presents an overview of the current state-of-the-art trojan attacks on large language models of code, with a focus on triggers -- the main design point of trojans -- with the aid of a novel unifying trigger taxonomy framework. We also aim to provide a uniform definition of the fundamental concepts in the area of trojans in Code LLMs. Finally, we draw implications of findings on how code models learn on trigger design. △ Less

Submitted 5 May, 2024; originally announced May 2024.

Comments: arXiv admin note: substantial text overlap with arXiv:2305.03803

arXiv:2404.19318 [pdf, other]

Enhancing Trust in LLM-Generated Code Summaries with Calibrated Confidence Scores

Authors: Yuvraj Virk, Premkumar Devanbu, Toufique Ahmed

Abstract: A good summary can often be very useful during program comprehension. While a brief, fluent, and relevant summary can be helpful, it does require significant human effort to produce. Often, good summaries are unavailable in software projects, thus making maintenance more difficult. There has been a considerable body of research into automated AI-based methods, using Large Language models (LLMs), t… ▽ More A good summary can often be very useful during program comprehension. While a brief, fluent, and relevant summary can be helpful, it does require significant human effort to produce. Often, good summaries are unavailable in software projects, thus making maintenance more difficult. There has been a considerable body of research into automated AI-based methods, using Large Language models (LLMs), to generate summaries of code; there also has been quite a bit work on ways to measure the performance of such summarization methods, with special attention paid to how closely these AI-generated summaries resemble a summary a human might have produced. Measures such as BERTScore and BLEU have been suggested and evaluated with human-subject studies. However, LLMs often err and generate something quite unlike what a human might say. Given an LLM-produced code summary, is there a way to gauge whether it's likely to be sufficiently similar to a human produced summary, or not? In this paper, we study this question, as a calibration problem: given a summary from an LLM, can we compute a confidence measure, which is a good indication of whether the summary is sufficiently similar to what a human would have produced in this situation? We examine this question using several LLMs, for several languages, and in several different settings. We suggest an approach which provides well-calibrated predictions of likelihood of similarity to human summaries. △ Less

Submitted 30 April, 2024; originally announced April 2024.

arXiv:2404.12986 [pdf, other]

Nuclei Instance Segmentation of Cryosectioned H&E Stained Histological Images using Triple U-Net Architecture

Authors: Zarif Ahmed, Chowdhury Nur E Alam Siddiqi, Fardifa Fathmiul Alam, Tasnim Ahmed, Tareque Mohmud Chowdhury

Abstract: Nuclei instance segmentation is crucial in oncological diagnosis and cancer pathology research. H&E stained images are commonly used for medical diagnosis, but pre-processing is necessary before using them for image processing tasks. Two principal pre-processing methods are formalin-fixed paraffin-embedded samples (FFPE) and frozen tissue samples (FS). While FFPE is widely used, it is time-consumi… ▽ More Nuclei instance segmentation is crucial in oncological diagnosis and cancer pathology research. H&E stained images are commonly used for medical diagnosis, but pre-processing is necessary before using them for image processing tasks. Two principal pre-processing methods are formalin-fixed paraffin-embedded samples (FFPE) and frozen tissue samples (FS). While FFPE is widely used, it is time-consuming, while FS samples can be processed quickly. Analyzing H&E stained images derived from fast sample preparation, staining, and scanning can pose difficulties due to the swift process, which can result in the degradation of image quality. This paper proposes a method that leverages the unique optical characteristics of H&E stained images. A three-branch U-Net architecture has been implemented, where each branch contributes to the final segmentation results. The process includes applying watershed algorithm to separate overlap** regions and enhance accuracy. The Triple U-Net architecture comprises an RGB branch, a Hematoxylin branch, and a Segmentation branch. This study focuses on a novel dataset named CryoNuSeg. The results obtained through robust experiments outperform the state-of-the-art results across various metrics. The benchmark score for this dataset is AJI 52.5 and PQ 47.7, achieved through the implementation of U-Net Architecture. However, the proposed Triple U-Net architecture achieves an AJI score of 67.41 and PQ of 50.56. The proposed architecture improves more on AJI than other evaluation metrics, which further justifies the superiority of the Triple U-Net architecture over the baseline U-Net model, as AJI is a more strict evaluation metric. The use of the three-branch U-Net model, followed by watershed post-processing, significantly surpasses the benchmark scores, showing substantial improvement in the AJI score △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: To be published in "6th IVPR & 11th ICIEV"

arXiv:2403.01342 [pdf, ps, other]

LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems

Authors: Tasnim Ahmed, Salimur Choudhury

Abstract: In the rapidly evolving field of natural language processing, the translation of linguistic descriptions into mathematical formulation of optimization problems presents a formidable challenge, demanding intricate understanding and processing capabilities from Large Language Models (LLMs). This study compares prominent LLMs, including GPT-3.5, GPT-4, and Llama-2-7b, in zero-shot and one-shot settin… ▽ More In the rapidly evolving field of natural language processing, the translation of linguistic descriptions into mathematical formulation of optimization problems presents a formidable challenge, demanding intricate understanding and processing capabilities from Large Language Models (LLMs). This study compares prominent LLMs, including GPT-3.5, GPT-4, and Llama-2-7b, in zero-shot and one-shot settings for this task. Our findings show GPT-4's superior performance, particularly in the one-shot scenario. A central part of this research is the introduction of `LM4OPT,' a progressive fine-tuning framework for Llama-2-7b that utilizes noisy embeddings and specialized datasets. However, this research highlights a notable gap in the contextual understanding capabilities of smaller models such as Llama-2-7b compared to larger counterparts, especially in processing lengthy and complex input contexts. Our empirical investigation, utilizing the NL4Opt dataset, unveils that GPT-4 surpasses the baseline performance established by previous research, achieving an F1-score of 0.63, solely based on the problem description in natural language, and without relying on any additional named entity information. GPT-3.5 follows closely, both outperforming the fine-tuned Llama-2-7b. These findings not only benchmark the current capabilities of LLMs in a novel application area but also lay the groundwork for future improvements in mathematical formulation of optimization problems from natural language input. △ Less

Submitted 2 March, 2024; originally announced March 2024.

arXiv:2402.18594 [pdf]

Analyzing the Dynamics of COVID-19 Lockdown Success: Insights from Regional Data and Public Health Measures

Authors: Md. Motaleb Hossen Manik, Md. Ahsan Habib, Md. Zabirul Islam, Tanim Ahmed, Fabliha Haque

Abstract: The COVID-19 pandemic caused by the coronavirus had a significant effect on social, economic, and health systems globally. The virus emerged in Wuhan, China, and spread worldwide resulting in severe disease, death, and social interference. Countries implemented lockdowns in various regions to limit the spread of the virus. Some of them were successful and some failed. Here, several factors played… ▽ More The COVID-19 pandemic caused by the coronavirus had a significant effect on social, economic, and health systems globally. The virus emerged in Wuhan, China, and spread worldwide resulting in severe disease, death, and social interference. Countries implemented lockdowns in various regions to limit the spread of the virus. Some of them were successful and some failed. Here, several factors played a vital role in their success. But mostly these factors and their correlations remained unidentified. In this paper, we unlocked those factors that contributed to the success of lockdown during the COVID-19 pandemic and explored the correlations among them. Moreover, this paper proposes several strategies to control any pandemic situation in the future. Here, it explores the relationships among variables, such as population density, number of infected, death, recovered patients, and the success or failure of the lockdown in different regions of the world. The findings suggest a strong correlation among these factors and indicate that the spread of similar kinds of viruses can be reduced in the future by implementing several safety measures. △ Less

Submitted 24 February, 2024; originally announced February 2024.

arXiv:2402.15818 [pdf, other]

Linguistic Intelligence in Large Language Models for Telecommunications

Authors: Tasnim Ahmed, Nicola Piovesan, Antonio De Domenico, Salimur Choudhury

Abstract: Large Language Models (LLMs) have emerged as a significant advancement in the field of Natural Language Processing (NLP), demonstrating remarkable capabilities in language generation and other language-centric tasks. Despite their evaluation across a multitude of analytical and reasoning tasks in various scientific domains, a comprehensive exploration of their knowledge and understanding within th… ▽ More Large Language Models (LLMs) have emerged as a significant advancement in the field of Natural Language Processing (NLP), demonstrating remarkable capabilities in language generation and other language-centric tasks. Despite their evaluation across a multitude of analytical and reasoning tasks in various scientific domains, a comprehensive exploration of their knowledge and understanding within the realm of natural language tasks in the telecommunications domain is still needed. This study, therefore, seeks to evaluate the knowledge and understanding capabilities of LLMs within this domain. To achieve this, we conduct an exhaustive zero-shot evaluation of four prominent LLMs-Llama-2, Falcon, Mistral, and Zephyr. These models require fewer resources than ChatGPT, making them suitable for resource-constrained environments. Their performance is compared with state-of-the-art, fine-tuned models. To the best of our knowledge, this is the first work to extensively evaluate and compare the understanding of LLMs across multiple language-centric tasks in this domain. Our evaluation reveals that zero-shot LLMs can achieve performance levels comparable to the current state-of-the-art fine-tuned models. This indicates that pretraining on extensive text corpora equips LLMs with a degree of specialization, even within the telecommunications domain. We also observe that no single LLM consistently outperforms others, and the performance of different LLMs can fluctuate. Although their performance lags behind fine-tuned models, our findings underscore the potential of LLMs as a valuable resource for understanding various aspects of this field that lack large annotated data. △ Less

Submitted 24 February, 2024; originally announced February 2024.

arXiv:2402.15100 [pdf, other]

Studying LLM Performance on Closed- and Open-source Data

Authors: Toufique Ahmed, Christian Bird, Premkumar Devanbu, Saikat Chakraborty

Abstract: Large Language models (LLMs) are finding wide use in software engineering practice. These models are extremely data-hungry, and are largely trained on open-source (OSS) code distributed with permissive licenses. In terms of actual use however, a great deal of software development still occurs in the for-profit/proprietary sphere, where the code under development is not, and never has been, in the… ▽ More Large Language models (LLMs) are finding wide use in software engineering practice. These models are extremely data-hungry, and are largely trained on open-source (OSS) code distributed with permissive licenses. In terms of actual use however, a great deal of software development still occurs in the for-profit/proprietary sphere, where the code under development is not, and never has been, in the public domain; thus, many developers, do their work, and use LLMs, in settings where the models may not be as familiar with the code under development. In such settings, do LLMs work as well as they do for OSS code? If not, what are the differences? When performance differs, what are the possible causes, and are there work-arounds? In this paper, we examine this issue using proprietary, closed-source software data from Microsoft, where most proprietary code is in C# and C++. We find that performance for C# changes little from OSS --> proprietary code, but does significantly reduce for C++; we find that this difference is attributable to differences in identifiers. We also find that some performance degradation, in some cases, can be ameliorated efficiently by in-context learning. △ Less

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.11817 [pdf, other]

EMU/GAMA: A Technique for Detecting Active Galactic Nuclei in Low Mass Systems

Authors: Jahang Prathap, Andrew M. Hopkins, Aaron S. G. Robotham, Sabine Bellstedt, José Afonso, Ummee T. Ahmed, Maciej Bilicki, Malcolm N. Bremer, Sarah Brough, Michael J. I. Brown, Yjan Gordon, Benne W. Holwerda, Denis Leahy, Ángel R. López-Sánchez, Joshua R. Marvil, Tamal Mukherjee, Isabella Prandoni, Stanislav S. Shabala, Tessa Vernstrom, Tayyaba Zafar

Abstract: We propose a new method for identifying active galactic nuclei (AGN) in low mass ($\rm M_*\leq10^{10}M_\odot$) galaxies. This method relies on spectral energy distribution (SED) fitting to identify galaxies whose radio flux density has an excess over that expected from star formation alone. Combining data in the Galaxy and Mass Assembly (GAMA) G23 region from GAMA, Evolutionary Map of the Universe… ▽ More We propose a new method for identifying active galactic nuclei (AGN) in low mass ($\rm M_*\leq10^{10}M_\odot$) galaxies. This method relies on spectral energy distribution (SED) fitting to identify galaxies whose radio flux density has an excess over that expected from star formation alone. Combining data in the Galaxy and Mass Assembly (GAMA) G23 region from GAMA, Evolutionary Map of the Universe (EMU) early science observations, and Wide-field Infrared Survey Explorer (WISE), we compare this technique with a selection of different AGN diagnostics to explore the similarities and differences in AGN classification. We find that diagnostics based on optical and near-infrared criteria (the standard BPT diagram, the WISE colour criterion, and the mass-excitation, or MEx diagram) tend to favour detection of AGN in high mass, high luminosity systems, while the ``ProSpect'' SED fitting tool can identify AGN efficiently in low mass systems. We investigate an explanation for this result in the context of proportionally lower mass black holes in lower mass galaxies compared to higher mass galaxies and differing proportions of emission from AGN and star formation dominating the light at optical and infrared wavelengths as a function of galaxy stellar mass. We conclude that SED-derived AGN classification is an efficient approach to identify low mass hosts with low radio luminosity AGN. △ Less

Submitted 18 February, 2024; originally announced February 2024.

Comments: 16 pages, 12 figures, 5 tables. Accepted for publication in PASA

arXiv:2402.10605 [pdf, other]

Studying the Impact of Quantum-Specific Hyperparameters on Hybrid Quantum-Classical Neural Networks

Authors: Kamila Zaman, Tasnim Ahmed, Muhammad Kashif, Muhammad Abdullah Hanif, Alberto Marchisio, Muhammad Shafique

Abstract: In current noisy intermediate-scale quantum devices, hybrid quantum-classical neural networks (HQNNs) represent a promising solution that combines the strengths of classical machine learning with quantum computing capabilities. Compared to classical deep neural networks (DNNs), HQNNs present an additional set of hyperparameters, which are specific to quantum circuits. These quantum-specific hyperp… ▽ More In current noisy intermediate-scale quantum devices, hybrid quantum-classical neural networks (HQNNs) represent a promising solution that combines the strengths of classical machine learning with quantum computing capabilities. Compared to classical deep neural networks (DNNs), HQNNs present an additional set of hyperparameters, which are specific to quantum circuits. These quantum-specific hyperparameters, such as quantum circuit depth, number of qubits, type of entanglement, number of shots, and measurement observables, can significantly impact the behavior of the HQNNs and their capabilities to learn the given task. In this paper, we investigate the impact of these variations on different HQNN models for image classification tasks, implemented on the PennyLane framework. We aim to uncover intuitive and counter-intuitive learning patterns of HQNN models within granular levels of controlled quantum perturbations, to form a sound basis for their correlation to accuracy and training time. The outcome of our study opens new avenues for designing efficient HQNN algorithms and builds a foundational base for comprehending and identifying tunable hyperparameters of HQNN models that can lead to useful design implementation and usage. △ Less

Submitted 25 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

Comments: Accepted at the 3rd International Conference on Emergent Quantum Technologies (ICEQT'24), July 2024

arXiv:2402.10540 [pdf, other]

A Comparative Analysis of Hybrid-Quantum Classical Neural Networks

Authors: Kamila Zaman, Tasnim Ahmed, Muhammad Abdullah Hanif, Alberto Marchisio, Muhammad Shafique

Abstract: Hybrid Quantum-Classical Machine Learning (ML) is an emerging field, amalgamating the strengths of both classical neural networks and quantum variational circuits on the current noisy intermediate-scale quantum devices. This paper performs an extensive comparative analysis between different hybrid quantum-classical machine learning algorithms, namely Quantum Convolution Neural Network, Quanvolutio… ▽ More Hybrid Quantum-Classical Machine Learning (ML) is an emerging field, amalgamating the strengths of both classical neural networks and quantum variational circuits on the current noisy intermediate-scale quantum devices. This paper performs an extensive comparative analysis between different hybrid quantum-classical machine learning algorithms, namely Quantum Convolution Neural Network, Quanvolutional Neural Network and Quantum ResNet, for image classification. The experiments designed in this paper focus on different Quantum ML (QML) algorithms to better understand the accuracy variation across the different quantum architectures by implementing interchangeable quantum circuit layers, varying the repetition of such layers and their efficient placement. Such variations enable us to compare the accuracy across different architectural permutations of a given hybrid QML algorithm. The performance comparison of the hybrid models, based on the accuracy, provides us with an understanding of hybrid quantum-classical convergence in correlation with the quantum layer count and the qubit count variations in the circuit. △ Less

Submitted 25 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

Comments: Accepted at the 3rd International Conference on Emergent Quantum Technologies (ICEQT'24), July 2024

arXiv:2402.07311 [pdf, other]

Two-loop non-planar four-point topology with massive internal loop

Authors: Taushif Ahmed, Ekta Chaubey, Mandeep Kaur, Sara Maggio

Abstract: We study a set of two-loop non-planar master integrals needed for the NNLO QCD corrections to diphoton and dijet production at hadron colliders. The top-sector topology contains an internal massive fermion loop and is known to contain elliptic curves. Leveraging the method of differential equations, we provide a comprehensive discussion for deriving an $ε$-factorized differential equation related… ▽ More We study a set of two-loop non-planar master integrals needed for the NNLO QCD corrections to diphoton and dijet production at hadron colliders. The top-sector topology contains an internal massive fermion loop and is known to contain elliptic curves. Leveraging the method of differential equations, we provide a comprehensive discussion for deriving an $ε$-factorized differential equation related to the most intricate sector within the Feynman integral family. Despite the dependence on multiple scales and the presence of two elliptic sectors, we demonstrate how to leverage the properties of their maximal cuts and the factorization of the Picard-Fuchs operator to deal with the complexity of the analytic computation. In particular, we construct a transformation matrix that brings the differential equations into a format enabling the convenient expression of analytic results in terms of Chen's iterated integrals. △ Less

Submitted 11 February, 2024; originally announced February 2024.

Comments: 19 pages, 2 figures

arXiv:2402.02047 [pdf, other]

Calibration and Correctness of Language Models for Code

Authors: Claudio Spiess, David Gros, Kunal Suresh Pai, Michael Pradel, Md Rafiqul Islam Rabin, Amin Alipour, Susmit Jha, Prem Devanbu, Toufique Ahmed

Abstract: Machine learning models are widely used but can also often be wrong. Users would benefit from a reliable indication of whether a given output from a given model should be trusted, so a rational decision can be made whether to use the output or not. For example, outputs can be associated with a confidence measure; if this confidence measure is strongly associated with likelihood of correctness, the… ▽ More Machine learning models are widely used but can also often be wrong. Users would benefit from a reliable indication of whether a given output from a given model should be trusted, so a rational decision can be made whether to use the output or not. For example, outputs can be associated with a confidence measure; if this confidence measure is strongly associated with likelihood of correctness, then the model is said to be well-calibrated. In this case, for example, high-confidence outputs could be safely accepted, and low-confidence outputs rejected. Calibration has so far been studied in mostly non-generative (e.g., classification) settings, especially in Software Engineering. However, generated code can quite often be wrong: Developers need to know when they should e.g., directly use, use after careful review, or discard model-generated code; thus Calibration is vital in generative settings. However, the notion of correctness of generated code is non-trivial, and thus so is Calibration. In this paper we make several contributions. We develop a framework for evaluating the Calibration of code-generating models. We consider several tasks, correctness criteria, datasets, and approaches, and find that by and large generative code models are not well-calibrated out of the box. We then show how Calibration can be improved, using standard methods such as Platt scaling. Our contributions will lead to better-calibrated decision-making in the current use of code generated by language models, and offers a framework for future research to further improve calibration methods for generative models in Software Engineering. △ Less

Submitted 16 February, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

arXiv:2401.12416 [pdf, other]

Enhancing Reliability of Neural Networks at the Edge: Inverted Normalization with Stochastic Affine Transformations

Authors: Soyed Tuhin Ahmed, Kamal Danouchi, Guillaume Prenat, Lorena Anghel, Mehdi B. Tahoori

Abstract: Bayesian Neural Networks (BayNNs) naturally provide uncertainty in their predictions, making them a suitable choice in safety-critical applications. Additionally, their realization using memristor-based in-memory computing (IMC) architectures enables them for resource-constrained edge applications. In addition to predictive uncertainty, however, the ability to be inherently robust to noise in comp… ▽ More Bayesian Neural Networks (BayNNs) naturally provide uncertainty in their predictions, making them a suitable choice in safety-critical applications. Additionally, their realization using memristor-based in-memory computing (IMC) architectures enables them for resource-constrained edge applications. In addition to predictive uncertainty, however, the ability to be inherently robust to noise in computation is also essential to ensure functional safety. In particular, memristor-based IMCs are susceptible to various sources of non-idealities such as manufacturing and runtime variations, drift, and failure, which can significantly reduce inference accuracy. In this paper, we propose a method to inherently enhance the robustness and inference accuracy of BayNNs deployed in IMC architectures. To achieve this, we introduce a novel normalization layer combined with stochastic affine transformations. Empirical results in various benchmark datasets show a graceful degradation in inference accuracy, with an improvement of up to $58.11\%$. △ Less

Submitted 22 January, 2024; originally announced January 2024.

arXiv:2401.07145 [pdf, ps, other]

Scalable and Efficient Methods for Uncertainty Estimation and Reduction in Deep Learning

Authors: Soyed Tuhin Ahmed

Abstract: Neural networks (NNs) can achieved high performance in various fields such as computer vision, and natural language processing. However, deploying NNs in resource-constrained safety-critical systems has challenges due to uncertainty in the prediction caused by out-of-distribution data, and hardware non-idealities. To address the challenges of deploying NNs in resource-constrained safety-critical s… ▽ More Neural networks (NNs) can achieved high performance in various fields such as computer vision, and natural language processing. However, deploying NNs in resource-constrained safety-critical systems has challenges due to uncertainty in the prediction caused by out-of-distribution data, and hardware non-idealities. To address the challenges of deploying NNs in resource-constrained safety-critical systems, this paper summarizes the (4th year) PhD thesis work that explores scalable and efficient methods for uncertainty estimation and reduction in deep learning, with a focus on Computation-in-Memory (CIM) using emerging resistive non-volatile memories. We tackle the inherent uncertainties arising from out-of-distribution inputs and hardware non-idealities, crucial in maintaining functional safety in automated decision-making systems. Our approach encompasses problem-aware training algorithms, novel NN topologies, and hardware co-design solutions, including dropout-based \emph{binary} Bayesian Neural Networks leveraging spintronic devices and variational inference techniques. These innovations significantly enhance OOD data detection, inference accuracy, and energy efficiency, thereby contributing to the reliability and robustness of NN implementations. △ Less

Submitted 13 January, 2024; originally announced January 2024.

arXiv:2401.06195 [pdf, other]

NeuSpin: Design of a Reliable Edge Neuromorphic System Based on Spintronics for Green AI

Authors: Soyed Tuhin Ahmed, Kamal Danouchi, Guillaume Prenat, Lorena Anghel, Mehdi B. Tahoori

Abstract: Internet of Things (IoT) and smart wearable devices for personalized healthcare will require storing and computing ever-increasing amounts of data. The key requirements for these devices are ultra-low-power, high-processing capabilities, autonomy at low cost, as well as reliability and accuracy to enable Green AI at the edge. Artificial Intelligence (AI) models, especially Bayesian Neural Networks… ▽ More Internet of Things (IoT) and smart wearable devices for personalized healthcare will require storing and computing ever-increasing amounts of data. The key requirements for these devices are ultra-low-power, high-processing capabilities, autonomy at low cost, as well as reliability and accuracy to enable Green AI at the edge. Artificial Intelligence (AI) models, especially Bayesian Neural Networks (BayNNs) are resource-intensive and face challenges with traditional computing architectures due to the memory wall problem. Computing-in-Memory (CIM) with emerging resistive memories offers a solution by combining memory blocks and computing units for higher efficiency and lower power consumption. However, implementing BayNNs on CIM hardware, particularly with spintronic technologies, presents technical challenges due to variability and manufacturing defects. The NeuSPIN project aims to address these challenges through full-stack hardware and software co-design, develo** novel algorithmic and circuit design approaches to enhance the performance, energy-efficiency and robustness of BayNNs on sprintronic-based CIM platforms. △ Less

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2401.04744 [pdf, other]

Testing Spintronics Implemented Monte Carlo Dropout-Based Bayesian Neural Networks

Authors: Soyed Tuhin Ahmed, Michael Hefenbrock, Guillaume Prenat, Lorena Anghel, Mehdi B. Tahoori

Abstract: Bayesian Neural Networks (BayNNs) can inherently estimate predictive uncertainty, facilitating informed decision-making. Dropout-based BayNNs are increasingly implemented in spintronics-based computation-in-memory architectures for resource-constrained yet high-performance safety-critical applications. Although uncertainty estimation is important, the reliability of Dropout generation and BayNN co… ▽ More Bayesian Neural Networks (BayNNs) can inherently estimate predictive uncertainty, facilitating informed decision-making. Dropout-based BayNNs are increasingly implemented in spintronics-based computation-in-memory architectures for resource-constrained yet high-performance safety-critical applications. Although uncertainty estimation is important, the reliability of Dropout generation and BayNN computation is equally important for target applications but is overlooked in existing works. However, testing BayNNs is significantly more challenging compared to conventional NNs, due to their stochastic nature. In this paper, we present for the first time the model of the non-idealities of the spintronics-based Dropout module and analyze their impact on uncertainty estimates and accuracy. Furthermore, we propose a testing framework based on repeatability ranking for Dropout-based BayNN with up to $100\%$ fault coverage while using only $0.2\%$ of training data as test vectors. △ Less

Submitted 9 January, 2024; originally announced January 2024.

arXiv:2401.01458 [pdf, other]

Concurrent Self-testing of Neural Networks Using Uncertainty Fingerprint

Authors: Soyed Tuhin Ahmed, Mehdi B. tahoori

Abstract: Neural networks (NNs) are increasingly used in always-on safety-critical applications deployed on hardware accelerators (NN-HAs) employing various memory technologies. Reliable continuous operation of NN is essential for safety-critical applications. During online operation, NNs are susceptible to single and multiple permanent and soft errors due to factors such as radiation, aging, and thermal ef… ▽ More Neural networks (NNs) are increasingly used in always-on safety-critical applications deployed on hardware accelerators (NN-HAs) employing various memory technologies. Reliable continuous operation of NN is essential for safety-critical applications. During online operation, NNs are susceptible to single and multiple permanent and soft errors due to factors such as radiation, aging, and thermal effects. Explicit NN-HA testing methods cannot detect transient faults during inference, are unsuitable for always-on applications, and require extensive test vector generation and storage. Therefore, in this paper, we propose the \emph{uncertainty fingerprint} approach representing the online fault status of NN. Furthermore, we propose a dual head NN topology specifically designed to produce uncertainty fingerprints and the primary prediction of the NN in \emph{a single shot}. During the online operation, by matching the uncertainty fingerprint, we can concurrently self-test NNs with up to $100\%$ coverage with a low false positive rate while maintaining a similar performance of the primary task. Compared to existing works, memory overhead is reduced by up to $243.7$ MB, multiply and accumulate (MAC) operation is reduced by up to $10000\times$, and false-positive rates are reduced by up to $89\%$. △ Less

Submitted 2 January, 2024; originally announced January 2024.

arXiv:2312.11883 [pdf, other]

EMU/GAMA: Radio detected galaxies are more obscured than optically selected galaxies

Authors: U. T. Ahmed, A. M. Hopkins, J. Ware, Y. A. Gordon, M. Bilicki, M. J. I. Brown, M. Cluver, G. Gürkan, Á. R. López-Sánchez, D. A. Leahy, L. Marchetti, S. Phillipps, I. Prandoni, N. Seymour, E. N. Taylor, E. Vardoulaki

Abstract: We demonstrate the importance of radio selection in probing heavily obscured galaxy populations. We combine Evolutionary Map of the Universe (EMU) Early Science data in the Galaxy and Mass Assembly (GAMA) G23 field with the GAMA data, providing optical photometry and spectral line measurements, together with Wide-field Infrared Survey Explorer (WISE) infrared (IR) photometry, providing IR luminosi… ▽ More We demonstrate the importance of radio selection in probing heavily obscured galaxy populations. We combine Evolutionary Map of the Universe (EMU) Early Science data in the Galaxy and Mass Assembly (GAMA) G23 field with the GAMA data, providing optical photometry and spectral line measurements, together with Wide-field Infrared Survey Explorer (WISE) infrared (IR) photometry, providing IR luminosities and colours. We investigate the degree of obscuration in star forming galaxies, based on the Balmer decrement (BD), and explore how this trend varies, over a redshift range of 0<z<0.345. We demonstrate that the radio detected population has on average higher levels of obscuration than the parent optical sample, arising through missing the lowest BD and lowest mass galaxies, which are also the lower star formation rate (SFR) and metallicity systems. We discuss possible explanations for this result, including speculation around whether it might arise from steeper stellar initial mass functions in low mass, low SFR galaxies. △ Less

Submitted 19 December, 2023; originally announced December 2023.

Comments: Accepted for publication in PASA, 17 pages, 14 figures, 3 tables

arXiv:2312.04004 [pdf, other]

Occlusion-based Detection of Trojan-triggering Inputs in Large Language Models of Code

Authors: Aftab Hussain, Md Rafiqul Islam Rabin, Toufique Ahmed, Mohammad Amin Alipour, Bowen Xu

Abstract: Large language models (LLMs) are becoming an integrated part of software development. These models are trained on large datasets for code, where it is hard to verify each data point. Therefore, a potential attack surface can be to inject poisonous data into the training data to make models vulnerable, aka trojaned. It can pose a significant threat by hiding manipulative behaviors inside models, le… ▽ More Large language models (LLMs) are becoming an integrated part of software development. These models are trained on large datasets for code, where it is hard to verify each data point. Therefore, a potential attack surface can be to inject poisonous data into the training data to make models vulnerable, aka trojaned. It can pose a significant threat by hiding manipulative behaviors inside models, leading to compromising the integrity of the models in downstream tasks. In this paper, we propose an occlusion-based human-in-the-loop technique, OSeql, to distinguish trojan-triggering inputs of code. The technique is based on the observation that trojaned neural models of code rely heavily on the triggering part of input; hence, its removal would change the confidence of the models in their prediction substantially. Our results suggest that OSeql can detect the triggering inputs with almost 100% recall. We discuss the problem of false positives and how to address them. These results provide a baseline for future studies in this field. △ Less

Submitted 10 December, 2023; v1 submitted 6 December, 2023; originally announced December 2023.

arXiv:2311.15816 [pdf, other]

Scale-Dropout: Estimating Uncertainty in Deep Neural Networks Using Stochastic Scale

Authors: Soyed Tuhin Ahmed, Kamal Danouchi, Michael Hefenbrock, Guillaume Prenat, Lorena Anghel, Mehdi B. Tahoori

Abstract: Uncertainty estimation in Neural Networks (NNs) is vital in improving reliability and confidence in predictions, particularly in safety-critical applications. Bayesian Neural Networks (BayNNs) with Dropout as an approximation offer a systematic approach to quantifying uncertainty, but they inherently suffer from high hardware overhead in terms of power, memory, and computation. Thus, the applicabi… ▽ More Uncertainty estimation in Neural Networks (NNs) is vital in improving reliability and confidence in predictions, particularly in safety-critical applications. Bayesian Neural Networks (BayNNs) with Dropout as an approximation offer a systematic approach to quantifying uncertainty, but they inherently suffer from high hardware overhead in terms of power, memory, and computation. Thus, the applicability of BayNNs to edge devices with limited resources or to high-performance applications is challenging. Some of the inherent costs of BayNNs can be reduced by accelerating them in hardware on a Computation-In-Memory (CIM) architecture with spintronic memories and binarizing their parameters. However, numerous stochastic units are required to implement conventional dropout-based BayNN. In this paper, we propose the Scale Dropout, a novel regularization technique for Binary Neural Networks (BNNs), and Monte Carlo-Scale Dropout (MC-Scale Dropout)-based BayNNs for efficient uncertainty estimation. Our approach requires only one stochastic unit for the entire model, irrespective of the model size, leading to a highly scalable Bayesian NN. Furthermore, we introduce a novel Spintronic memory-based CIM architecture for the proposed BayNN that achieves more than $100\times$ energy savings compared to the state-of-the-art. We validated our method to show up to a $1\%$ improvement in predictive performance and superior uncertainty estimates compared to related works. △ Less

Submitted 11 January, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

arXiv:2311.07984 [pdf]

Intrinsic defect engineering of CVD grown monolayer MoS$_2$ for tuneable functional nanodevices

Authors: Irfan H. Abidi, Sindhu Priya Giridhar, Jonathan O. Tollerud, Jake Limb, Aishani Mazumder, Edwin LH Mayes, Billy J. Murdoch, Chenglong Xu, Ankit Bhoriya, Abhishek Ranjan, Taimur Ahmed, Yongxiang Li, Jeffrey A. Davis, Cameron L. Bentley, Salvy P. Russo, Enrico Della Gaspera, Sumeet Walia

Abstract: Defects in atomically thin materials can drive new functionalities and expand applications to multifunctional systems that are monolithically integrated. An ability to control formation of defects during the synthesis process is an important capability to create practical deployment opportunities. Molybdenum disulfide (MoS$_2$), a two-dimensional (2D) semiconducting material harbors intrinsic defe… ▽ More Defects in atomically thin materials can drive new functionalities and expand applications to multifunctional systems that are monolithically integrated. An ability to control formation of defects during the synthesis process is an important capability to create practical deployment opportunities. Molybdenum disulfide (MoS$_2$), a two-dimensional (2D) semiconducting material harbors intrinsic defects that can be harnessed to achieve tuneable electronic, optoelectronic, and electrochemical devices. However, achieving precise control over defect formation within monolayer MoS$_2$, while maintaining the structural integrity of the crystals remains a notable challenge. Here, we present a one-step, in-situ defect engineering approach for monolayer MoS$_2$ using a pressure dependent chemical vapour deposition (CVD) process. Monolayer MoS$_2$ grown in low-pressure CVD conditions (LP-MoS$_2$) produces sulfur vacancy (Vs) induced defect rich crystals primarily attributed to the kinetics of the growth conditions. Conversely, atmospheric pressure CVD grown MoS$_2$ (AP-MoS$_2$) passivates these Vs defects with oxygen. This disparity in defect profiles profoundly impacts crucial functional properties and device performance. AP-MoS$_2$ shows a drastically enhanced photoluminescence, which is significantly quenched in LP-MoS$_2$ attributed to in-gap electron donor states induced by the Vs defects. However, the n-do** induced by the Vs defects in LP-MoS$_2$ generates enhanced photoresponsivity and detectivity in our fabricated photodetectors compared to the AP-MoS$_2$ based devices. Defect-rich LP-MoS$_2$ outperforms AP-MoS$_2$ as channel layers of field-effect transistors (FETs), as well as electrocatalytic material for hydrogen evolution reaction (HER). This work presents a single-step CVD approach for in-situ defect engineering in monolayer MoS$_2$ and presents a pathway to control defects in other monolayer material systems. △ Less

Submitted 14 November, 2023; originally announced November 2023.

Comments: 29 pages, 5 figures

arXiv:2310.03080 [pdf, other]

Theoretical prediction of giant Hall high harmonic generation in monolayer NbSe$_2$

Authors: Daniel A. Rehn, Towfiq Ahmed, **kyoung Yoo, Rohit Prasankumar, Jian-Xin Zhu

Abstract: High harmonic generation (HHG) is a powerful probe of electron dynamics on attosecond to femtosecond timescales and has been successfully used to detect electronic and structural changes in novel solid-state quantum materials, including transition metal dichalcogenides (TMDs). Among TMDs, bulk NbSe$_2$ exhibits charge density wave (CDW) order below 33 K and becomes superconducting below 7.3 K. Mon… ▽ More High harmonic generation (HHG) is a powerful probe of electron dynamics on attosecond to femtosecond timescales and has been successfully used to detect electronic and structural changes in novel solid-state quantum materials, including transition metal dichalcogenides (TMDs). Among TMDs, bulk NbSe$_2$ exhibits charge density wave (CDW) order below 33 K and becomes superconducting below 7.3 K. Monolayer NbSe$_2$ is therefore interesting as a material whose different structural and electronic properties could be probed via HHG. Here, we predict the HHG response of the pristine 2H and CDW phases of monolayer NbSe$_2$ using real-time time-dependent density functional theory under the application of a simulated laser pulse excitation. We find that due to the lack of inversion symmetry in both monolayer phases, it is possible to excite even harmonics and that, importantly, the even harmonics appear strictly as the transverse (Hall) components of the current response under excitations polarized along the zigzag direction of the monolayer, while odd harmonics arise from the longitudinal current response in all excitation directions. This suggests that the even and odd harmonic response can be controlled via the polarization of the probing field, opening a new avenue for potentially useful applications in opto-electronic devices. △ Less

Submitted 4 October, 2023; originally announced October 2023.

arXiv:2309.07325 [pdf, other]

Bicrystallography-informed Frenkel-Kontorova model for interlayer dislocations in strained 2D heterostructures

Authors: Md Tusher Ahmed, Chenhaoyue Wang, Amartya S. Banerjee, Nikhil Chandra Admal

Abstract: In recent years, van der Waals (vdW) heterostructures and homostructures, which consist of stacks of two-dimensional (2D) materials, have risen to prominence due to their association with exotic quantum phenomena. Atomistic scale relaxation effects play an extremely important role in the electronic scale quantum physics of these systems. We investigate such structural relaxation effects in this wo… ▽ More In recent years, van der Waals (vdW) heterostructures and homostructures, which consist of stacks of two-dimensional (2D) materials, have risen to prominence due to their association with exotic quantum phenomena. Atomistic scale relaxation effects play an extremely important role in the electronic scale quantum physics of these systems. We investigate such structural relaxation effects in this work using atomistic and mesoscale models, within the context of twisted bilayer graphene -- a well-known heterostructure system that features moire patterns arising from the lattices of the two graphene layers. For small twist angles, atomic relaxation effects in this system are associated with the natural emergence of interface dislocations or strain solitons, which result from the cyclic nature of the generalized stacking fault energy (GSFE), that measures the interface energy based on the relative movement of the two layers. In this work, we first demonstrate using atomistic simulations that atomic reconstruction in bilayer graphene under a large twist also results from interface dislocations, although the Burgers vectors of such dislocations are considerably smaller than those observed in small-twist systems. To reveal the translational invariance of the heterointerface responsible for the formation of such dislocations, we derive the translational symmetry of the GSFE of a 2D heterostructure using the notions of coincident site lattices (CSLs) and displacement shift complete lattices (DSCLs). The workhorse for this exercise is a recently developed Smith normal form bicrystallography framework. Next, we construct a bicrystallography-informed and frame-invariant Frenkel-Kontorova model, which can predict the formation of strain solitons in arbitrary 2D heterostructures, and apply it to study a heterostrained, large-twist bilayer graphene system. △ Less

Submitted 12 September, 2023; originally announced September 2023.

MSC Class: 74G65; 74K15

arXiv:2308.05028 [pdf, other]

doi 10.1007/JHEP01(2024)010

Two-loop Vertices with Vacuum Polarization Insertion

Authors: Taushif Ahmed, Giulio Crisanti, Federico Gasparotto, Syed Mehedi Hasan, Pierpaolo Mastrolia

Abstract: We present the analytic evaluation of the second-order corrections to the massive form factors, due to two-loop vertex diagrams with a vacuum polarization insertion, with exact dependence on the external and internal fermion masses, and on the squared momentum transfer. We consider vector, axial-vector, scalar and pseudoscalar interactions between the external fermion and the external field. After… ▽ More We present the analytic evaluation of the second-order corrections to the massive form factors, due to two-loop vertex diagrams with a vacuum polarization insertion, with exact dependence on the external and internal fermion masses, and on the squared momentum transfer. We consider vector, axial-vector, scalar and pseudoscalar interactions between the external fermion and the external field. After renormalization, the finite expressions of the form factors are expressed in terms of polylogarithms up to weight three. △ Less

Submitted 23 January, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

Comments: 20 pages, 3 figures, 2 ancillary files. v2: results.m ancillary file updated (constants in the expression "canonicalMasters" simplified, residual logarithms in the renormalisedFormFactorsGPLS converted to GPLS), final version

Report number: MITP-23-041

Journal ref: Ahmed, T., Crisanti, G., Gasparotto, F. et al. Two-loop vertices with vacuum polarization insertion. J. High Energ. Phys. 2024, 10 (2024)

arXiv:2306.14169 [pdf]

A Web-based Mpox Skin Lesion Detection System Using State-of-the-art Deep Learning Models Considering Racial Diversity

Authors: Shams Nafisa Ali, Md. Tazuddin Ahmed, Tasnim Jahan, Joydip Paul, S. M. Sakeef Sani, Nawsabah Noor, Anzirun Nahar Asma, Taufiq Hasan

Abstract: The recent 'Mpox' outbreak, formerly known as 'Monkeypox', has become a significant public health concern and has spread to over 110 countries globally. The challenge of clinically diagnosing mpox early on is due, in part, to its similarity to other types of rashes. Computer-aided screening tools have been proven valuable in cases where Polymerase Chain Reaction (PCR) based diagnosis is not immedi… ▽ More The recent 'Mpox' outbreak, formerly known as 'Monkeypox', has become a significant public health concern and has spread to over 110 countries globally. The challenge of clinically diagnosing mpox early on is due, in part, to its similarity to other types of rashes. Computer-aided screening tools have been proven valuable in cases where Polymerase Chain Reaction (PCR) based diagnosis is not immediately available. Deep learning methods are powerful in learning complex data representations, but their efficacy largely depends on adequate training data. To address this challenge, we present the "Mpox Skin Lesion Dataset Version 2.0 (MSLD v2.0)" as a follow-up to the previously released openly accessible dataset, one of the first datasets containing mpox lesion images. This dataset contains images of patients with mpox and five other non-mpox classes (chickenpox, measles, hand-foot-mouth disease, cowpox, and healthy). We benchmark the performance of several state-of-the-art deep learning models, including VGG16, ResNet50, DenseNet121, MobileNetV2, EfficientNetB3, InceptionV3, and Xception, to classify mpox and other infectious skin diseases. In order to reduce the impact of racial bias, we utilize a color space data augmentation method to increase skin color variability during training. Additionally, by leveraging transfer learning implemented with pre-trained weights generated from the HAM10000 dataset, an extensive collection of pigmented skin lesion images, we achieved the best overall accuracy of $83.59\pm2.11\%$. Finally, the developed models are incorporated within a prototype web application to analyze uploaded skin images by a user and determine whether a subject is a suspected mpox patient. △ Less

Submitted 25 June, 2023; originally announced June 2023.

arXiv:2306.11943 [pdf, other]

Towards Understanding What Code Language Models Learned

Authors: Toufique Ahmed, Dian Yu, Chengxuan Huang, Cathy Wang, Prem Devanbu, Kenji Sagae

Abstract: Pre-trained language models are effective in a variety of natural language tasks, but it has been argued their capabilities fall short of fully learning meaning or understanding language. To understand the extent to which language models can learn some form of meaning, we investigate their ability to capture semantics of code beyond superficial frequency and co-occurrence. In contrast to previous… ▽ More Pre-trained language models are effective in a variety of natural language tasks, but it has been argued their capabilities fall short of fully learning meaning or understanding language. To understand the extent to which language models can learn some form of meaning, we investigate their ability to capture semantics of code beyond superficial frequency and co-occurrence. In contrast to previous research on probing models for linguistic features, we study pre-trained models in a setting that allows for objective and straightforward evaluation of a model's ability to learn semantics. In this paper, we examine whether such models capture the semantics of code, which is precisely and formally defined. Through experiments involving the manipulation of code fragments, we show that code pre-trained models of code learn a robust representation of the computational semantics of code that goes beyond superficial features of form alone △ Less

Submitted 27 February, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

arXiv:2306.10185 [pdf, other]

Spatial-SpinDrop: Spatial Dropout-based Binary Bayesian Neural Network with Spintronics Implementation

Authors: Soyed Tuhin Ahmed, Kamal Danouchi, Michael Hefenbrock, Guillaume Prenat, Lorena Anghel, Mehdi B. Tahoori

Abstract: Recently, machine learning systems have gained prominence in real-time, critical decision-making domains, such as autonomous driving and industrial automation. Their implementations should avoid overconfident predictions through uncertainty estimation. Bayesian Neural Networks (BayNNs) are principled methods for estimating predictive uncertainty. However, their computational costs and power consum… ▽ More Recently, machine learning systems have gained prominence in real-time, critical decision-making domains, such as autonomous driving and industrial automation. Their implementations should avoid overconfident predictions through uncertainty estimation. Bayesian Neural Networks (BayNNs) are principled methods for estimating predictive uncertainty. However, their computational costs and power consumption hinder their widespread deployment in edge AI. Utilizing Dropout as an approximation of the posterior distribution, binarizing the parameters of BayNNs, and further to that implementing them in spintronics-based computation-in-memory (CiM) hardware arrays provide can be a viable solution. However, designing hardware Dropout modules for convolutional neural network (CNN) topologies is challenging and expensive, as they may require numerous Dropout modules and need to use spatial information to drop certain elements. In this paper, we introduce MC-SpatialDropout, a spatial dropout-based approximate BayNNs with spintronics emerging devices. Our method utilizes the inherent stochasticity of spintronic devices for efficient implementation of the spatial dropout module compared to existing implementations. Furthermore, the number of dropout modules per network layer is reduced by a factor of $9\times$ and energy consumption by a factor of $94.11\times$, while still achieving comparable predictive performance and uncertainty estimates compared to related works. △ Less

Submitted 16 June, 2023; originally announced June 2023.

arXiv:2306.09961 [pdf]

The Evolution theory of Learning: From Natural Selection to Reinforcement Learning

Authors: Taboubi Ahmed

Abstract: Evolution is a fundamental process that shapes the biological world we inhabit, and reinforcement learning is a powerful tool used in artificial intelligence to develop intelligent agents that learn from their environment. In recent years, researchers have explored the connections between these two seemingly distinct fields, and have found compelling evidence that they are more closely related tha… ▽ More Evolution is a fundamental process that shapes the biological world we inhabit, and reinforcement learning is a powerful tool used in artificial intelligence to develop intelligent agents that learn from their environment. In recent years, researchers have explored the connections between these two seemingly distinct fields, and have found compelling evidence that they are more closely related than previously thought. This paper examines these connections and their implications, highlighting the potential for reinforcement learning principles to enhance our understanding of evolution and the role of feedback in evolutionary systems. △ Less

Submitted 16 June, 2023; originally announced June 2023.

Report number: Bt01

arXiv:2306.00108 [pdf, other]

Better patching using LLM prompting, via Self-Consistency

Authors: Toufique Ahmed, Premkumar Devanbu

Abstract: Large Language models (LLMs) can be induced to solve non-trivial problems with "few-shot" prompts including illustrative problem-solution examples. Now if the few-shots also include "chain of thought" (CoT) explanations, which are of the form problem-explanation-solution, LLMs will generate a "explained" solution, and perform even better. Recently an exciting, substantially better technique, self-… ▽ More Large Language models (LLMs) can be induced to solve non-trivial problems with "few-shot" prompts including illustrative problem-solution examples. Now if the few-shots also include "chain of thought" (CoT) explanations, which are of the form problem-explanation-solution, LLMs will generate a "explained" solution, and perform even better. Recently an exciting, substantially better technique, self-consistency [1] (S-C) has emerged, based on the intuition that there are many plausible explanations for the right solution; when the LLM is sampled repeatedly to generate a pool of explanation-solution pairs, for a given problem, the most frequently occurring solutions in the pool (ignoring the explanations) tend to be even more likely to be correct! Unfortunately, the use of this highly-performant S-C (or even CoT) approach in software engineering settings is hampered by the lack of explanations; most software datasets lack explanations. In this paper, we describe an application of the S-C approach to program repair, using the commit log on the fix as the explanation, only in the illustrative few-shots. We achieve state-of-the art results, beating previous approaches to prompting-based program repair, on the MODIT dataset; we also find evidence suggesting that the correct commit messages are hel** the LLM learn to produce better patches. △ Less

Submitted 16 August, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

Comments: Accepted at ASE-NIER (2023) track

arXiv:2305.09348 [pdf, other]

One-Shot Online Testing of Deep Neural Networks Based on Distribution Shift Detection

Authors: Soyed Tuhin Ahmed, Mehdi B. Tahoori

Abstract: Neural networks (NNs) are capable of learning complex patterns and relationships in data to make predictions with high accuracy, making them useful for various tasks. However, NNs are both computation-intensive and memory-intensive methods, making them challenging for edge applications. To accelerate the most common operations (matrix-vector multiplication) in NNs, hardware accelerator architectur… ▽ More Neural networks (NNs) are capable of learning complex patterns and relationships in data to make predictions with high accuracy, making them useful for various tasks. However, NNs are both computation-intensive and memory-intensive methods, making them challenging for edge applications. To accelerate the most common operations (matrix-vector multiplication) in NNs, hardware accelerator architectures such as computation-in-memory (CiM) with non-volatile memristive crossbars are utilized. Although they offer benefits such as power efficiency, parallelism, and nonvolatility, they suffer from various faults and variations, both during manufacturing and lifetime operations. This can lead to faulty computations and, in turn, degradation of post-map** inference accuracy, which is unacceptable for many applications, including safety-critical applications. Therefore, proper testing of NN hardware accelerators is required. In this paper, we propose a \emph{one-shot} testing approach that can test NNs accelerated on memristive crossbars with only one test vector, making it very suitable for online testing applications. Our approach can consistently achieve $100\%$ fault coverage across several large topologies with up to $201$ layers and challenging tasks like semantic segmentation. Nevertheless, compared to existing methods, the fault coverage is improved by up to $24\%$, the memory overhead is only $0.0123$ MB, a reduction of up to $19980\times$ and the number of test vectors is reduced by $10000\times$. △ Less

Submitted 16 May, 2023; originally announced May 2023.

arXiv:2305.03803 [pdf, other]

A Survey of Trojans in Neural Models of Source Code: Taxonomy and Techniques

Authors: Aftab Hussain, Md Rafiqul Islam Rabin, Toufique Ahmed, Navid Ayoobi, Bowen Xu, Prem Devanbu, Mohammad Amin Alipour

Abstract: In this work, we study literature in Explainable AI and Safe AI to understand poisoning of neural models of code. In order to do so, we first establish a novel taxonomy for Trojan AI for code, and present a new aspect-based classification of triggers in neural models of code. Next, we highlight recent works that help us deepen our conception of how these models understand software code. Then we pi… ▽ More In this work, we study literature in Explainable AI and Safe AI to understand poisoning of neural models of code. In order to do so, we first establish a novel taxonomy for Trojan AI for code, and present a new aspect-based classification of triggers in neural models of code. Next, we highlight recent works that help us deepen our conception of how these models understand software code. Then we pick some of the recent, state-of-art poisoning strategies that can be used to manipulate such models. The insights we draw can potentially help to foster future research in the area of Trojan AI for code. △ Less

Submitted 18 April, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

arXiv:2304.06815 [pdf, other]

Automatic Semantic Augmentation of Language Model Prompts (for Code Summarization)

Authors: Toufique Ahmed, Kunal Suresh Pai, Premkumar Devanbu, Earl T. Barr

Abstract: Large Language Models (LLM) are a new class of computation engines, "programmed" via prompt engineering. We are still learning how to best "program" these LLMs to help developers. We start with the intuition that developers tend to consciously and unconsciously have a collection of semantics facts in mind when working on coding tasks. Mostly these are shallow, simple facts arising from a quick rea… ▽ More Large Language Models (LLM) are a new class of computation engines, "programmed" via prompt engineering. We are still learning how to best "program" these LLMs to help developers. We start with the intuition that developers tend to consciously and unconsciously have a collection of semantics facts in mind when working on coding tasks. Mostly these are shallow, simple facts arising from a quick read. For a function, examples of facts might include parameter and local variable names, return expressions, simple pre- and post-conditions, and basic control and data flow, etc. One might assume that the powerful multi-layer architecture of transformer-style LLMs makes them inherently capable of doing this simple level of "code analysis" and extracting such information, implicitly, while processing code: but are they, really? If they aren't, could explicitly adding this information help? Our goal here is to investigate this question, using the code summarization task and evaluate whether automatically augmenting an LLM's prompt with semantic facts explicitly, actually helps. Prior work shows that LLM performance on code summarization benefits from few-shot samples drawn either from the same-project or from examples found via information retrieval methods (such as BM25). While summarization performance has steadily increased since the early days, there is still room for improvement: LLM performance on code summarization still lags its performance on natural-language tasks like translation and text summarization. We find that adding semantic facts actually does help! This approach improves performance in several different settings suggested by prior work, including for two different Large Language Models. In most cases, improvement nears or exceeds 2 BLEU; for the PHP language in the challenging CodeSearchNet dataset, this augmentation actually yields performance surpassing 30 BLEU. △ Less

Submitted 11 January, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

Comments: Accepted at International Conference on Software Engineering (ICSE-2024)

arXiv:2304.06520 [pdf, other]

An Efficient Transfer Learning-based Approach for Apple Leaf Disease Classification

Authors: Md. Hamjajul Ashmafee, Tasnim Ahmed, Sabbir Ahmed, Md. Bakhtiar Hasan, Mst Nura Jahan, A. B. M. Ashikur Rahman

Abstract: Correct identification and categorization of plant diseases are crucial for ensuring the safety of the global food supply and the overall financial success of stakeholders. In this regard, a wide range of solutions has been made available by introducing deep learning-based classification systems for different staple crops. Despite being one of the most important commercial crops in many parts of t… ▽ More Correct identification and categorization of plant diseases are crucial for ensuring the safety of the global food supply and the overall financial success of stakeholders. In this regard, a wide range of solutions has been made available by introducing deep learning-based classification systems for different staple crops. Despite being one of the most important commercial crops in many parts of the globe, research proposing a smart solution for automatically classifying apple leaf diseases remains relatively unexplored. This study presents a technique for identifying apple leaf diseases based on transfer learning. The system extracts features using a pretrained EfficientNetV2S architecture and passes to a classifier block for effective prediction. The class imbalance issues are tackled by utilizing runtime data augmentation. The effect of various hyperparameters, such as input resolution, learning rate, number of epochs, etc., has been investigated carefully. The competence of the proposed pipeline has been evaluated on the apple leaf disease subset from the publicly available `PlantVillage' dataset, where it achieved an accuracy of 99.21%, outperforming the existing works. △ Less

Submitted 10 April, 2023; originally announced April 2023.

Comments: Accepted in ECCE 2023, 6 pages, 6 figures, 4 tables

arXiv:2304.04869 [pdf, other]

doi 10.1088/1538-3873/acd1b5

The James Webb Space Telescope Mission

Authors: Jonathan P. Gardner, John C. Mather, Randy Abbott, James S. Abell, Mark Abernathy, Faith E. Abney, John G. Abraham, Roberto Abraham, Yasin M. Abul-Huda, Scott Acton, Cynthia K. Adams, Evan Adams, David S. Adler, Maarten Adriaensen, Jonathan Albert Aguilar, Mansoor Ahmed, Nasif S. Ahmed, Tanjira Ahmed, Rüdeger Albat, Loïc Albert, Stacey Alberts, David Aldridge, Mary Marsha Allen, Shaune S. Allen, Martin Altenburg , et al. (983 additional authors not shown)

Abstract: Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astrono… ▽ More Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astronomers will celebrate their accomplishments for the life of the mission, potentially as long as 20 years, and beyond. This report and the scientific discoveries that follow are extended thank-you notes to the 20,000 team members. The telescope is working perfectly, with much better image quality than expected. In this and accompanying papers, we give a brief history, describe the observatory, outline its objectives and current observing program, and discuss the inventions and people who made it possible. We cite detailed reports on the design and the measured performance on orbit. △ Less

Submitted 10 April, 2023; originally announced April 2023.

Comments: Accepted by PASP for the special issue on The James Webb Space Telescope Overview, 29 pages, 4 figures

arXiv:2303.11455 [pdf, other]

Large Language Models and Simple, Stupid Bugs

Authors: Kevin Jesse, Toufique Ahmed, Premkumar T. Devanbu, Emily Morgan

Abstract: With the advent of powerful neural language models, AI-based systems to assist developers in coding tasks are becoming widely available; Copilot is one such system. Copilot uses Codex, a large language model (LLM), to complete code conditioned on a preceding "prompt". Codex, however, is trained on public GitHub repositories, viz., on code that may include bugs and vulnerabilities. Previous studies… ▽ More With the advent of powerful neural language models, AI-based systems to assist developers in coding tasks are becoming widely available; Copilot is one such system. Copilot uses Codex, a large language model (LLM), to complete code conditioned on a preceding "prompt". Codex, however, is trained on public GitHub repositories, viz., on code that may include bugs and vulnerabilities. Previous studies [1], [2] show Codex reproduces vulnerabilities seen in training. In this study, we examine how prone Codex is to generate an interesting bug category, single statement bugs, commonly referred to as simple, stupid bugs or SStuBs in the MSR community. We find that Codex and similar LLMs do help avoid some SStuBs, but do produce known, verbatim SStuBs as much as 2x as likely than known, verbatim correct code. We explore the consequences of the Codex generated SStuBs and propose avoidance strategies that suggest the possibility of reducing the production of known, verbatim SStubs, and increase the possibility of producing known, verbatim fixes. △ Less

Submitted 20 March, 2023; originally announced March 2023.

Comments: Accepted at International Conference on Mining Software Repositories (MSR-2023)

arXiv:2302.07394 [pdf, other]

doi 10.1103/PhysRevLett.130.122502

First Evidence of Axial Shape Asymmetry and Configuration Coexistence in $^{74}$Zn: Suggestion for a Northern Extension of the $N=40$ Island of Inversion

Authors: M. Rocchini, P. E. Garrett, M. Zielinska, S. M. Lenzi, D. D. Dao, F. Nowacki, V. Bildstein, A. D. MacLean, B. Olaizola, Z. T. Ahmed, C. Andreoiu, A. Babu, G. C. Ball, S. S. Bhattacharjee, H. Bidaman, C. Cheng, R. Coleman, I. Dillmann, A. B. Garnsworthy, S. Gillespie, C. J. Griffin, G. F. Grinyer, G. Hackman, M. Hanley, A. Illana , et al. (19 additional authors not shown)

Abstract: The excited states of $N=44$ $^{74}$Zn were investigated via $γ$-ray spectroscopy following $^{74}$Cu $β$ decay. By exploiting $γ$-$γ$ angular correlation analysis, the $2_2^+$, $3_1^+$, $0_2^+$ and $2_3^+$ states in $^{74}$Zn were firmly established. The $γ$-ray branching and $E2/M1$ mixing ratios for transitions de-exciting the $2_2^+$, $3_1^+$ and $2_3^+$ states were measured, allowing for the… ▽ More The excited states of $N=44$ $^{74}$Zn were investigated via $γ$-ray spectroscopy following $^{74}$Cu $β$ decay. By exploiting $γ$-$γ$ angular correlation analysis, the $2_2^+$, $3_1^+$, $0_2^+$ and $2_3^+$ states in $^{74}$Zn were firmly established. The $γ$-ray branching and $E2/M1$ mixing ratios for transitions de-exciting the $2_2^+$, $3_1^+$ and $2_3^+$ states were measured, allowing for the extraction of relative $B(E2)$ values. In particular, the $2_3^+ \to 0_2^+$ and $2_3^+ \to 4_1^+$ transitions were observed for the first time. The results show excellent agreement with new microscopic large-scale shell-model calculations, and are discussed in terms of underlying shapes, as well as the role of neutron excitations across the $N=40$ gap. Enhanced axial shape asymmetry (triaxiality) is suggested to characterize $^{74}$Zn in its ground state. Furthermore, an excited $K=0$ band with a significantly larger softness in its shape is identified. A shore of the $N=40$ ``island of inversion'' appears to manifest above $Z=26$, previously thought as its northern limit in the chart of the nuclides. △ Less

Submitted 14 February, 2023; originally announced February 2023.

arXiv:2301.10174 [pdf]

doi 10.1109/I2CT54291.2022.9825052

Analysis of Arrhythmia Classification on ECG Dataset

Authors: Taminul Islam, Arindom Kundu, Tanzim Ahmed, Nazmul Islam Khan

Abstract: The heart is one of the most vital organs in the human body. It supplies blood and nutrients in other parts of the body. Therefore, maintaining a healthy heart is essential. As a heart disorder, arrhythmia is a condition in which the heart's pum** mechanism becomes aberrant. The Electrocardiogram is used to analyze the arrhythmia problem from the ECG signals because of its fewer difficulties and… ▽ More The heart is one of the most vital organs in the human body. It supplies blood and nutrients in other parts of the body. Therefore, maintaining a healthy heart is essential. As a heart disorder, arrhythmia is a condition in which the heart's pum** mechanism becomes aberrant. The Electrocardiogram is used to analyze the arrhythmia problem from the ECG signals because of its fewer difficulties and cheapness. The heart peaks shown in the ECG graph are used to detect heart diseases, and the R peak is used to analyze arrhythmia disease. Arrhythmia is grouped into two groups - Tachycardia and Bradycardia for detection. In this paper, we discussed many different techniques such as Deep CNNs, LSTM, SVM, NN classifier, Wavelet, TQWT, etc., that have been used for detecting arrhythmia using various datasets throughout the previous decade. This work shows the analysis of some arrhythmia classification on the ECG dataset. Here, Data preprocessing, feature extraction, classification processes were applied on most research work and achieved better performance for classifying ECG signals to detect arrhythmia. Automatic arrhythmia detection can help cardiologists make the right decisions immediately to save human life. In addition, this research presents various previous research limitations with some challenges in detecting arrhythmia that will help in future research. △ Less

Submitted 10 January, 2023; originally announced January 2023.

Comments: 6 pages, 5 figures. This paper has been published to 2022 proceedings of IEEE 7th International conference for Convergence in Technology (I2CT), 07-09 April 2022, Mumbai, India

Journal ref: In 2022 IEEE 7th International conference for Convergence in Technology (I2CT) (pp. 1-6). IEEE

arXiv:2301.03797 [pdf, other]

Recommending Root-Cause and Mitigation Steps for Cloud Incidents using Large Language Models

Authors: Toufique Ahmed, Supriyo Ghosh, Chetan Bansal, Thomas Zimmermann, Xuchao Zhang, Saravan Rajmohan

Abstract: Incident management for cloud services is a complex process involving several steps and has a huge impact on both service health and developer productivity. On-call engineers require significant amount of domain knowledge and manual effort for root causing and mitigation of production incidents. Recent advances in artificial intelligence has resulted in state-of-the-art large language models like… ▽ More Incident management for cloud services is a complex process involving several steps and has a huge impact on both service health and developer productivity. On-call engineers require significant amount of domain knowledge and manual effort for root causing and mitigation of production incidents. Recent advances in artificial intelligence has resulted in state-of-the-art large language models like GPT-3.x (both GPT-3.0 and GPT-3.5), which have been used to solve a variety of problems ranging from question answering to text summarization. In this work, we do the first large-scale study to evaluate the effectiveness of these models for hel** engineers root cause and mitigate production incidents. We do a rigorous study at Microsoft, on more than 40,000 incidents and compare several large language models in zero-shot, fine-tuned and multi-task setting using semantic and lexical metrics. Lastly, our human evaluation with actual incident owners show the efficacy and future potential of using artificial intelligence for resolving cloud incidents. △ Less

Submitted 9 February, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

Comments: Accepted at International Conference on Software Engineering (ICSE-2023)

arXiv:2301.01701 [pdf, other]

Extending Source Code Pre-Trained Language Models to Summarise Decompiled Binaries

Authors: Ali Al-Kaswan, Toufique Ahmed, Maliheh Izadi, Anand Ashok Sawant, Premkumar Devanbu, Arie van Deursen

Abstract: Reverse engineering binaries is required to understand and analyse programs for which the source code is unavailable. Decompilers can transform the largely unreadable binaries into a more readable source code-like representation. However, reverse engineering is time-consuming, much of which is taken up by labelling the functions with semantic information. While the automated summarisation of dec… ▽ More Reverse engineering binaries is required to understand and analyse programs for which the source code is unavailable. Decompilers can transform the largely unreadable binaries into a more readable source code-like representation. However, reverse engineering is time-consuming, much of which is taken up by labelling the functions with semantic information. While the automated summarisation of decompiled code can help Reverse Engineers understand and analyse binaries, current work mainly focuses on summarising source code, and no suitable dataset exists for this task. In this work, we extend large pre-trained language models of source code to summarise decompiled binary functions. Furthermore, we investigate the impact of input and data properties on the performance of such models. Our approach consists of two main components; the data and the model. We first build CAPYBARA, a dataset of 214K decompiled function-documentation pairs across various compiler optimisations. We extend CAPYBARA further by generating synthetic datasets and deduplicating the data. Next, we fine-tune the CodeT5 base model with CAPYBARA to create BinT5. BinT5 achieves the state-of-the-art BLEU-4 score of 60.83, 58.82, and 44.21 for summarising source, decompiled, and synthetically stripped decompiled code, respectively. This indicates that these models can be extended to decompiled binaries successfully. Finally, we found that the performance of BinT5 is not heavily dependent on the dataset size and compiler optimisation level. We recommend future research to further investigate transferring knowledge when working with less expressive input formats such as stripped binaries. △ Less

Submitted 13 January, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

Comments: SANER 2023 Technical Track Camera Ready

arXiv:2212.08610 [pdf, other]

doi 10.1109/ICCIT57492.2022.10054769

Huruf: An Application for Arabic Handwritten Character Recognition Using Deep Learning

Authors: Minhaz Kamal, Fairuz Shaiara, Chowdhury Mohammad Abdullah, Sabbir Ahmed, Tasnim Ahmed, Md. Hasanul Kabir

Abstract: Handwriting Recognition has been a field of great interest in the Artificial Intelligence domain. Due to its broad use cases in real life, research has been conducted widely on it. Prominent work has been done in this field focusing mainly on Latin characters. However, the domain of Arabic handwritten character recognition is still relatively unexplored. The inherent cursive nature of the Arabic c… ▽ More Handwriting Recognition has been a field of great interest in the Artificial Intelligence domain. Due to its broad use cases in real life, research has been conducted widely on it. Prominent work has been done in this field focusing mainly on Latin characters. However, the domain of Arabic handwritten character recognition is still relatively unexplored. The inherent cursive nature of the Arabic characters and variations in writing styles across individuals makes the task even more challenging. We identified some probable reasons behind this and proposed a lightweight Convolutional Neural Network-based architecture for recognizing Arabic characters and digits. The proposed pipeline consists of a total of 18 layers containing four layers each for convolution, pooling, batch normalization, dropout, and finally one Global average pooling and a Dense layer. Furthermore, we thoroughly investigated the different choices of hyperparameters such as the choice of the optimizer, kernel initializer, activation function, etc. Evaluating the proposed architecture on the publicly available 'Arabic Handwritten Character Dataset (AHCD)' and 'Modified Arabic handwritten digits Database (MadBase)' datasets, the proposed model respectively achieved an accuracy of 96.93% and 99.35% which is comparable to the state-of-the-art and makes it a suitable solution for real-life end-level applications. △ Less

Submitted 24 December, 2022; v1 submitted 16 December, 2022; originally announced December 2022.

Comments: Accepted in 25th ICCIT (6 pages, 4 tables, 4 figures)

Report number: 10054769

Journal ref: 2022 25th International Conference on Computer and Information Technology (ICCIT)

Showing 1–50 of 239 results for author: Ahmed, T