Search | arXiv e-print repository

arXiv:2406.05519 [pdf, other]

Synergizing Deep Learning and Phase Change Materials for Four-state Broadband Multifunctional Metasurfaces in the Visible Range

Authors: Md. Ehsanul Karim, Md. Redwanul Karim, Sajid Muhaimin Choudhury

Abstract: In this article, we report, for the first time, broadband multifunctional metasurfaces with more than four distinct functionalities. The constituent meta-atoms combine two different phase change materials, $\mathrm{VO_2}$ and $\mathrm{Sb_2S_3}$ in a multi-stage configuration. FDTD simulations demonstrate a broadband reflection amplitude switching between the four states in visible range due to the… ▽ More In this article, we report, for the first time, broadband multifunctional metasurfaces with more than four distinct functionalities. The constituent meta-atoms combine two different phase change materials, $\mathrm{VO_2}$ and $\mathrm{Sb_2S_3}$ in a multi-stage configuration. FDTD simulations demonstrate a broadband reflection amplitude switching between the four states in visible range due to the enhanced cavity length modulation effect from the cascaded Fabry-Perot cavities, overcoming the inherent small optical contrast between the phase change material (PCM) states. This, along with the reflection phase control between the four states, allows us to incorporate both amplitude and phase-dependent properties in the same metasurface - achromatic deflection, wavelength beam splitting, achromatic focusing, and broadband absorption, overcoming the limitations of previous functionality switching mechanisms for the visible band. We have used a Tandem Neural network-based inverse design scheme to ensure the stringent requirements of different states are realized. We have used two forward networks for predicting the reflection amplitude and phase for a meta-atom within the pre-defined design space. The excellent prediction capability of these surrogate models is utilized to train the reverse network. The inverse design network, trained with a labeled data set, is capable of producing the optimized meta-units given the desired figure-of-merits in terms of reflection amplitude and phase for the four states. The optical characteristics of two inverse-designed metasurfaces have been evaluated as test cases for two different sets of design parameters in the four states. Both structures demonstrate the four desired broadband functionalities while closely matching the design requirements, suggesting their potential in visible-range portable medical imaging devices. △ Less

Submitted 8 June, 2024; originally announced June 2024.

arXiv:2403.04134 [pdf, other]

An Adaptable, Safe, and Portable Robot-Assisted Feeding System

Authors: Ethan Kroll Gordon, Rajat Kumar Jenamani, Amal Nanavati, Ziang Liu, Haya Bolotski, Raida Karim, Daniel Stabile, Atharva Kashyap, Bernie Hao Zhu, Xilai Dai, Tyler Schrenk, Jonathan Ko, Taylor Kessler Faulkner, Tapomayukh Bhattacharjee, Siddhartha Srinivasa

Abstract: We demonstrate a robot-assisted feeding system that enables people with mobility impairments to feed themselves. Our system design embodies Safety, Portability, and User Control, with comprehensive full-stack safety checks, the ability to be mounted on and powered by any powered wheelchair, and a custom web-app allowing care-recipients to leverage their own assistive devices for robot control. For… ▽ More We demonstrate a robot-assisted feeding system that enables people with mobility impairments to feed themselves. Our system design embodies Safety, Portability, and User Control, with comprehensive full-stack safety checks, the ability to be mounted on and powered by any powered wheelchair, and a custom web-app allowing care-recipients to leverage their own assistive devices for robot control. For bite acquisition, we leverage multi-modal online learning to tractably adapt to unseen food types. For bite transfer, we leverage real-time mouth perception and interaction-aware control. Co-designed with community researchers, our system has been validated through multiple end-user studies. △ Less

Submitted 6 March, 2024; originally announced March 2024.

Comments: HRI 2024 Demo; Corrected inaccurate author ordering in ACM DL which occurred due to formatting issues

arXiv:2403.02560 [pdf]

Impact of COVID-19 on Exchange rate volatility of Bangladesh: Evidence through GARCH model

Authors: Rizwanul Karim

Abstract: This study uses the GARCH (1,1) model to examine the impact of COVID-19 cases (log value) on the volatility of the Exchange rate return of Bangladeshi taka (BDT) over the US dollar (USD), Japanese Yen (JPY), and Swedish Krona (SEK). The result shows that an increase in the number of COVID-19-affected cases in Bangladesh has a significant and positive impact on the volatility of exchange rates BDT/… ▽ More This study uses the GARCH (1,1) model to examine the impact of COVID-19 cases (log value) on the volatility of the Exchange rate return of Bangladeshi taka (BDT) over the US dollar (USD), Japanese Yen (JPY), and Swedish Krona (SEK). The result shows that an increase in the number of COVID-19-affected cases in Bangladesh has a significant and positive impact on the volatility of exchange rates BDT/USD, BDT/JPY, and BDT/SEK. △ Less

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2401.06787 [pdf]

doi 10.33166/AETiC.2024.01.005

Deep Learning Based Cyberbullying Detection in Bangla Language

Authors: Sristy Shidul Nath, Razuan Karim, Mahdi H. Miraz

Abstract: The Internet is currently the largest platform for global communication including expressions of opinions, reviews, contents, images, videos and so forth. Moreover, social media has now become a very broad and highly engaging platform due to its immense popularity and swift adoption trend. Increased social networking, however, also has detrimental impacts on the society leading to a range of unwan… ▽ More The Internet is currently the largest platform for global communication including expressions of opinions, reviews, contents, images, videos and so forth. Moreover, social media has now become a very broad and highly engaging platform due to its immense popularity and swift adoption trend. Increased social networking, however, also has detrimental impacts on the society leading to a range of unwanted phenomena, such as online assault, intimidation, digital bullying, criminality and trolling. Hence, cyberbullying has become a pervasive and worrying problem that poses considerable psychological and emotional harm to the people, particularly amongst the teens and the young adults. In order to lessen its negative effects and provide victims with prompt support, a great deal of research to identify cyberbullying instances at various online platforms is emerging. In comparison to other languages, Bangla (also known as Bengali) has fewer research studies in this domain. This study demonstrates a deep learning strategy for identifying cyberbullying in Bengali, using a dataset of 12282 versatile comments from multiple social media sites. In this study, a two-layer bidirectional long short-term memory (Bi-LSTM) model has been built to identify cyberbullying, using a variety of optimisers as well as 5-fold cross validation. To evaluate the functionality and efficacy of the proposed system, rigorous assessment and validation procedures have been employed throughout the project. The results of this study reveals that the proposed model's accuracy, using momentum-based stochastic gradient descent (SGD) optimiser, is 94.46%. It also reflects a higher accuracy of 95.08% and a F1 score of 95.23% using Adam optimiser as well as a better accuracy of 94.31% in 5-fold cross validation. △ Less

Submitted 6 January, 2024; originally announced January 2024.

Journal ref: Annals of Emerging Technologies in Computing (AETiC), Print ISSN: 2516-0281, Online ISSN: 2516-029X, pp. 50-65, Vol. 8, No. 1, 1st January 2024, Available: http://aetic.theiaer.org/archive/v8/v8n1/p5.html

arXiv:2401.04670 [pdf, other]

Modified Levenberg-Marquardt Algorithm For Tensor CP Decomposition in Image Compression

Authors: Ramin Goudarzi Karim, Dipak Dulal, Carmeliza Navasca

Abstract: This paper explores a new version of the Levenberg-Marquardt algorithm used for Tensor Canonical Polyadic (CP) decomposition with an emphasis on image compression and reconstruction. Tensor computation, especially CP decomposition, holds significant applications in data compression and analysis. In this study, we formulate CP as a nonlinear least squares optimization problem. Then, we present an i… ▽ More This paper explores a new version of the Levenberg-Marquardt algorithm used for Tensor Canonical Polyadic (CP) decomposition with an emphasis on image compression and reconstruction. Tensor computation, especially CP decomposition, holds significant applications in data compression and analysis. In this study, we formulate CP as a nonlinear least squares optimization problem. Then, we present an iterative Levenberg-Marquardt (LM) based algorithm for computing the CP decomposition. Ultimately, we test the algorithm on various datasets, including randomly generated tensors and RGB images. The proposed method proves to be both efficient and effective, offering a reduced computational burden when compared to the traditional Levenberg-Marquardt technique. △ Less

Submitted 9 January, 2024; originally announced January 2024.

Comments: Accepted on (DCC 2024) 2024 Data Compression Conference

arXiv:2310.20478 [pdf, other]

doi 10.1007/978-3-031-44067-0_24

Unveiling Black-boxes: Explainable Deep Learning Models for Patent Classification

Authors: Md Shajalal, Sebastian Denef, Md. Rezaul Karim, Alexander Boden, Gunnar Stevens

Abstract: Recent technological advancements have led to a large number of patents in a diverse range of domains, making it challenging for human experts to analyze and manage. State-of-the-art methods for multi-label patent classification rely on deep neural networks (DNNs), which are complex and often considered black-boxes due to their opaque decision-making processes. In this paper, we propose a novel de… ▽ More Recent technological advancements have led to a large number of patents in a diverse range of domains, making it challenging for human experts to analyze and manage. State-of-the-art methods for multi-label patent classification rely on deep neural networks (DNNs), which are complex and often considered black-boxes due to their opaque decision-making processes. In this paper, we propose a novel deep explainable patent classification framework by introducing layer-wise relevance propagation (LRP) to provide human-understandable explanations for predictions. We train several DNN models, including Bi-LSTM, CNN, and CNN-BiLSTM, and propagate the predictions backward from the output layer up to the input layer of the model to identify the relevance of words for individual predictions. Considering the relevance score, we then generate explanations by visualizing relevant words for the predicted patent class. Experimental results on two datasets comprising two-million patent texts demonstrate high performance in terms of various evaluation measures. The explanations generated for each prediction highlight important relevant words that align with the predicted class, making the prediction more understandable. Explainable systems have the potential to facilitate the adoption of complex AI-enabled methods for patent classification in real-world applications. △ Less

Submitted 31 October, 2023; originally announced October 2023.

Comments: This is the pre-print of the submitted manuscript on the World Conference on eXplainable Artificial Intelligence (xAI2023), Lisbon, Portugal. The published manuscript can be found here https://doi.org/10.1007/978-3-031-44067-0_24

arXiv:2310.12296 [pdf, other]

Understanding Video Transformers for Segmentation: A Survey of Application and Interpretability

Authors: Rezaul Karim, Richard P. Wildes

Abstract: Video segmentation encompasses a wide range of categories of problem formulation, e.g., object, scene, actor-action and multimodal video segmentation, for delineating task-specific scene components with pixel-level masks. Recently, approaches in this research area shifted from concentrating on ConvNet-based to transformer-based models. In addition, various interpretability approaches have appeared… ▽ More Video segmentation encompasses a wide range of categories of problem formulation, e.g., object, scene, actor-action and multimodal video segmentation, for delineating task-specific scene components with pixel-level masks. Recently, approaches in this research area shifted from concentrating on ConvNet-based to transformer-based models. In addition, various interpretability approaches have appeared for transformer models and video temporal dynamics, motivated by the growing interest in basic scientific understanding, model diagnostics and societal implications of real-world deployment. Previous surveys mainly focused on ConvNet models on a subset of video segmentation tasks or transformers for classification tasks. Moreover, component-wise discussion of transformer-based video segmentation models has not yet received due focus. In addition, previous reviews of interpretability methods focused on transformers for classification, while analysis of video temporal dynamics modelling capabilities of video models received less attention. In this survey, we address the above with a thorough discussion of various categories of video segmentation, a component-wise discussion of the state-of-the-art transformer-based models, and a review of related interpretability methods. We first present an introduction to the different video segmentation task categories, their objectives, specific challenges and benchmark datasets. Next, we provide a component-wise review of recent transformer-based models and document the state of the art on different video segmentation tasks. Subsequently, we discuss post-hoc and ante-hoc interpretability methods for transformer models and interpretability methods for understanding the role of the temporal dimension in video models. Finally, we conclude our discussion with future research directions. △ Less

Submitted 18 October, 2023; originally announced October 2023.

arXiv:2310.10935 [pdf, other]

Intent Detection and Slot Filling for Home Assistants: Dataset and Analysis for Bangla and Sylheti

Authors: Fardin Ahsan Sakib, A H M Rezaul Karim, Saadat Hasan Khan, Md Mushfiqur Rahman

Abstract: As voice assistants cement their place in our technologically advanced society, there remains a need to cater to the diverse linguistic landscape, including colloquial forms of low-resource languages. Our study introduces the first-ever comprehensive dataset for intent detection and slot filling in formal Bangla, colloquial Bangla, and Sylheti languages, totaling 984 samples across 10 unique inten… ▽ More As voice assistants cement their place in our technologically advanced society, there remains a need to cater to the diverse linguistic landscape, including colloquial forms of low-resource languages. Our study introduces the first-ever comprehensive dataset for intent detection and slot filling in formal Bangla, colloquial Bangla, and Sylheti languages, totaling 984 samples across 10 unique intents. Our analysis reveals the robustness of large language models for tackling downstream tasks with inadequate data. The GPT-3.5 model achieves an impressive F1 score of 0.94 in intent detection and 0.51 in slot filling for colloquial Bangla. △ Less

Submitted 16 October, 2023; originally announced October 2023.

Comments: Accepted at the First Workshop on Bangla Language Processing, 2023

arXiv:2310.08365 [pdf, other]

From Large Language Models to Knowledge Graphs for Biomarker Discovery in Cancer

Authors: Md. Rezaul Karim, Lina Molinas Comet, Md Shajalal, Oya Deniz Beyan, Dietrich Rebholz-Schuhmann, Stefan Decker

Abstract: Domain experts often rely on most recent knowledge for apprehending and disseminating specific biological processes that help them design strategies for develo** prevention and therapeutic decision-making in various disease scenarios. A challenging scenarios for artificial intelligence (AI) is using biomedical data (e.g., texts, imaging, omics, and clinical) to provide diagnosis and treatment re… ▽ More Domain experts often rely on most recent knowledge for apprehending and disseminating specific biological processes that help them design strategies for develo** prevention and therapeutic decision-making in various disease scenarios. A challenging scenarios for artificial intelligence (AI) is using biomedical data (e.g., texts, imaging, omics, and clinical) to provide diagnosis and treatment recommendations for cancerous conditions.~Data and knowledge about biomedical entities like cancer, drugs, genes, proteins, and their mechanism is spread across structured (knowledge bases (KBs)) and unstructured (e.g., scientific articles) sources. A large-scale knowledge graph (KG) can be constructed by integrating and extracting facts about semantically interrelated entities and relations. Such a KG not only allows exploration and question answering (QA) but also enables domain experts to deduce new knowledge. However, exploring and querying large-scale KGs is tedious for non-domain users due to their lack of understanding of the data assets and semantic technologies. In this paper, we develop a domain KG to leverage cancer-specific biomarker discovery and interactive QA. For this, we constructed a domain ontology called OncoNet Ontology (ONO), which enables semantic reasoning for validating gene-disease (different types of cancer) relations. The KG is further enriched by harmonizing the ONO, metadata, controlled vocabularies, and biomedical concepts from scientific articles by employing BioBERT- and SciBERT-based information extractors. Further, since the biomedical domain is evolving, where new findings often replace old ones, without having access to up-to-date scientific findings, there is a high chance an AI system exhibits concept drift while providing diagnosis and treatment. Therefore, we fine-tune the KG using large language models (LLMs) based on more recent articles and KBs. △ Less

Submitted 19 November, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2302.04737

arXiv:2310.07438 [pdf, other]

DESTINE: Dynamic Goal Queries with Temporal Transductive Alignment for Trajectory Prediction

Authors: Rezaul Karim, Soheil Mohamad Alizadeh Shabestary, Amir Rasouli

Abstract: Predicting temporally consistent road users' trajectories in a multi-agent setting is a challenging task due to unknown characteristics of agents and their varying intentions. Besides using semantic map information and modeling interactions, it is important to build an effective mechanism capable of reasoning about behaviors at different levels of granularity. To this end, we propose Dynamic goal… ▽ More Predicting temporally consistent road users' trajectories in a multi-agent setting is a challenging task due to unknown characteristics of agents and their varying intentions. Besides using semantic map information and modeling interactions, it is important to build an effective mechanism capable of reasoning about behaviors at different levels of granularity. To this end, we propose Dynamic goal quErieS with temporal Transductive alIgNmEnt (DESTINE) method. Unlike past arts, our approach 1) dynamically predicts agents' goals irrespective of particular road structures, such as lanes, allowing the method to produce a more accurate estimation of destinations; 2) achieves map compliant predictions by generating future trajectories in a coarse-to-fine fashion, where the coarser predictions at a lower frame rate serve as intermediate goals; and 3) uses an attention module designed to temporally align predicted trajectories via masked attention. Using the common Argoverse benchmark dataset, we show that our method achieves state-of-the-art performance on various metrics, and further investigate the contributions of proposed modules via comprehensive ablation studies. △ Less

Submitted 11 October, 2023; originally announced October 2023.

Comments: 6 tables 4 figures

arXiv:2310.02939 [pdf, other]

Identifying physical structures in our Galaxy with Gaussian Mixture Models: An unsupervised machine learning technique

Authors: M. Tiwari, R. Kievit, S. Kabanovic, L. Bonne, F. Falasca, C. Guevara, R. Higgins, M. Justen, R. Karim, Ü. Kavak, C. Pabst, M. W. Pound, N. Schneider, R. Simon, J. Stutzki, M. Wolfire, A. G. G. M. Tielens

Abstract: We explore the potential of the Gaussian Mixture Model (GMM), an unsupervised machine learning method, to identify coherent physical structures in the ISM. The implementation we present can be used on any kind of spatially and spectrally resolved data set. We provide a step-by-step guide to use these models on different sources and data sets. Following the guide, we run the models on NGC 1977, RCW… ▽ More We explore the potential of the Gaussian Mixture Model (GMM), an unsupervised machine learning method, to identify coherent physical structures in the ISM. The implementation we present can be used on any kind of spatially and spectrally resolved data set. We provide a step-by-step guide to use these models on different sources and data sets. Following the guide, we run the models on NGC 1977, RCW 120 and RCW 49 using the [CII] 158 $μ$m map** observations from the SOFIA telescope. We find that the models identified 6, 4 and 5 velocity coherent physical structures in NGC 1977, RCW 120 and RCW 49, respectively, which are validated by analysing the observed spectra towards these structures and by comparison to earlier findings. In this work we demonstrate that GMM is a powerful tool that can better automate the process of spatial and spectral analysis to interpret map** observations. △ Less

Submitted 4 October, 2023; originally announced October 2023.

Comments: 19 pages, 14 figures

arXiv:2310.01539 [pdf]

Effect of Triangular Pre-Cracks on the Mechanical Behavior of 2D MoTe$_2$: A Molecular Dynamics Study

Authors: Md. Jobayer Aziz, Md Akibul Islam, Md. Rezwanul Karim, Arafat Ahmed Bhuiyan

Abstract: Among two-dimensional (2D) materials, transition metal dichalcogenides (TMDs) stand out for their remarkable electronic, optical, and chemical properties. In addition to being variable bandgap semiconductor materials, the atomic thinness provides flexibility to TMDs. Therefore, understanding the physical properties of TMDs for applications in flexible and wearable devices is crucial. Despite the g… ▽ More Among two-dimensional (2D) materials, transition metal dichalcogenides (TMDs) stand out for their remarkable electronic, optical, and chemical properties. In addition to being variable bandgap semiconductor materials, the atomic thinness provides flexibility to TMDs. Therefore, understanding the physical properties of TMDs for applications in flexible and wearable devices is crucial. Despite the growing enthusiasm surrounding two-dimensional transition metal dichalcogenides (TMDs), our understanding of the mechanical characteristics of molybdenum ditelluride (MoTe$_2$) remains limited. The mechanical properties of MoTe$_2$ deteriorate in the presence of pre-existing cracks or vacancy defects, which are very common in grown TMDs. In this study, the fracture properties and crack propagation of monolayer molybdenum ditelluride (MoTe$_2$) sheets containing pre-existing triangular cracks with various vertex angles are investigated by performing molecular dynamics (MD) simulations of uniaxial and biaxial tensile loading. Due to pre-crack length, angle, and perimeter variations, monolayer MoTe$_2$ with pre-existing cracks underwent considerable changes in Young's modulus, tensile strength, fracture toughness, and fracture strain values. We have found that the pre-cracked MoTe$_2$ is more brittle than its pristine counterpart. Regulated alteration of pre-crack angle under constant simulation conditions improved the uniaxial mechanical properties. Similarly, regulated alteration of the perimeter of the pre-crack resulted in improved biaxial mechanical properties. This study contributes to the foundational knowledge for advanced design strategies involving strain engineering in MoTe$_2$ and other similar transition metal dichalcogenides. △ Less

Submitted 2 October, 2023; originally announced October 2023.

Comments: Submitted to a peer-reviewed journal, 55 pages, 21 figures

arXiv:2309.14637 [pdf, other]

SOFIA FEEDBACK Survey: The Pillars of Creation in [C II] and Molecular Lines

Authors: Ramsey L. Karim, Marc W. Pound, Alexander G. G. M. Tielens, Maitraiyee Tiwari, Lars Bonne, Mark G. Wolfire, Nicola Schneider, Ümit Kavak, Lee G. Mundy, Robert Simon, Rolf Güsten, Jürgen Stutzki, Friedrich Wyrowski, Netty Honingh

Abstract: We investigate the physical structure and conditions of photodissociation regions (PDRs) and molecular gas within the Pillars of Creation in the Eagle Nebula using SOFIA FEEDBACK observations of the [C II] 158 micron line. These observations are velocity resolved to 0.5 km s$^{-1}$ and are analyzed alongside a collection of complimentary data with similar spatial and spectral resolution: the [O I]… ▽ More We investigate the physical structure and conditions of photodissociation regions (PDRs) and molecular gas within the Pillars of Creation in the Eagle Nebula using SOFIA FEEDBACK observations of the [C II] 158 micron line. These observations are velocity resolved to 0.5 km s$^{-1}$ and are analyzed alongside a collection of complimentary data with similar spatial and spectral resolution: the [O I] 63 micron line, also observed with SOFIA, and rotational lines of CO, HCN, HCO$^{+}$, CS, and N$_2$H$^{+}$. Using the superb spectral resolution of SOFIA, APEX, CARMA, and BIMA, we reveal the relationships between the warm PDR and cool molecular gas layers in context of the Pillars' kinematic structure. We assemble a geometric picture of the Pillars and their surroundings informed by illumination patterns and kinematic relationships and derive physical conditions in the PDRs associated with the Pillars. We estimate an average molecular gas density $n_{{\rm H}_2} \sim 1.3 \times 10^5$ cm$^{-3}$ and an average atomic gas density $n_{\rm H} \sim 1.8 \times 10^4$ cm$^{-3}$ and infer that the ionized, atomic, and molecular phases are in pressure equilibrium if the atomic gas is magnetically supported. We find pillar masses of 103, 78, 103, and 18 solar masses for P1a, P1b, P2, and P3 respectively, and evaporation times of $\sim$1-2 Myr. The dense clumps at the tops of the pillars are currently supported by the magnetic field. Our analysis suggests that ambipolar diffusion is rapid and these clumps are likely to collapse within their photoevaporation timescales. △ Less

Submitted 25 September, 2023; originally announced September 2023.

Comments: 42 pages, 16 figures. Accepted for publication in The Astronomical Journal

arXiv:2309.11646 [pdf, other]

An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder

Authors: Rownak Ara Rasul, Promy Saha, Diponkor Bala, S M Rakib Ul Karim, Md. Ibrahim Abdullah, Bishwajit Saha

Abstract: Autistic Spectrum Disorder (ASD) is a neurological disease characterized by difficulties with social interaction, communication, and repetitive activities. While its primary origin lies in genetics, early detection is crucial, and leveraging machine learning offers a promising avenue for a faster and more cost-effective diagnosis. This study employs diverse machine learning methods to identify cru… ▽ More Autistic Spectrum Disorder (ASD) is a neurological disease characterized by difficulties with social interaction, communication, and repetitive activities. While its primary origin lies in genetics, early detection is crucial, and leveraging machine learning offers a promising avenue for a faster and more cost-effective diagnosis. This study employs diverse machine learning methods to identify crucial ASD traits, aiming to enhance and automate the diagnostic process. We study eight state-of-the-art classification models to determine their effectiveness in ASD detection. We evaluate the models using accuracy, precision, recall, specificity, F1-score, area under the curve (AUC), kappa, and log loss metrics to find the best classifier for these binary datasets. Among all the classification models, for the children dataset, the SVM and LR models achieve the highest accuracy of 100% and for the adult dataset, the LR model produces the highest accuracy of 97.14%. Our proposed ANN model provides the highest accuracy of 94.24% for the new combined dataset when hyperparameters are precisely tuned for each model. As almost all classification models achieve high accuracy which utilize true labels, we become interested in delving into five popular clustering algorithms to understand model behavior in scenarios without true labels. We calculate Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and Silhouette Coefficient (SC) metrics to select the best clustering models. Our evaluation finds that spectral clustering outperforms all other benchmarking clustering models in terms of NMI and ARI metrics while demonstrating comparability to the optimal SC achieved by k-means. The implemented code is available at GitHub. △ Less

Submitted 28 December, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

Comments: 20 pages, 12 figures, 8 tables

arXiv:2307.08260 [pdf, other]

Extending the Frontier of ChatGPT: Code Generation and Debugging

Authors: Fardin Ahsan Sakib, Saadat Hasan Khan, A. H. M. Rezaul Karim

Abstract: Large-scale language models (LLMs) have emerged as a groundbreaking innovation in the realm of question-answering and conversational agents. These models, leveraging different deep learning architectures such as Transformers, are trained on vast corpora to predict sentences based on given queries. Among these LLMs, ChatGPT, developed by OpenAI, has ushered in a new era by utilizing artificial inte… ▽ More Large-scale language models (LLMs) have emerged as a groundbreaking innovation in the realm of question-answering and conversational agents. These models, leveraging different deep learning architectures such as Transformers, are trained on vast corpora to predict sentences based on given queries. Among these LLMs, ChatGPT, developed by OpenAI, has ushered in a new era by utilizing artificial intelligence (AI) to tackle diverse problem domains, ranging from composing essays and biographies to solving intricate mathematical integrals. The versatile applications enabled by ChatGPT offer immense value to users. However, assessing the performance of ChatGPT's output poses a challenge, particularly in scenarios where queries lack clear objective criteria for correctness. For instance, evaluating the quality of generated essays becomes arduous and relies heavily on manual labor, in stark contrast to evaluating solutions to well-defined, closed-ended questions such as mathematical problems. This research paper delves into the efficacy of ChatGPT in solving programming problems, examining both the correctness and the efficiency of its solution in terms of time and memory complexity. The research reveals a commendable overall success rate of 71.875\%, denoting the proportion of problems for which ChatGPT was able to provide correct solutions that successfully satisfied all the test cases present in Leetcode. It exhibits strengths in structured problems and shows a linear correlation between its success rate and problem acceptance rates. However, it struggles to improve solutions based on feedback, pointing to potential shortcomings in debugging tasks. These findings provide a compact yet insightful glimpse into ChatGPT's capabilities and areas for improvement. △ Less

Submitted 17 July, 2023; originally announced July 2023.

arXiv:2307.07812 [pdf, other]

Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation

Authors: Mennatullah Siam, Rezaul Karim, He Zhao, Richard Wildes

Abstract: Few-shot video segmentation is the task of delineating a specific novel class in a query video using few labelled support images. Typical approaches compare support and query features while limiting comparisons to a single feature layer and thereby ignore potentially valuable information. We present a meta-learned Multiscale Memory Comparator (MMC) for few-shot video segmentation that combines inf… ▽ More Few-shot video segmentation is the task of delineating a specific novel class in a query video using few labelled support images. Typical approaches compare support and query features while limiting comparisons to a single feature layer and thereby ignore potentially valuable information. We present a meta-learned Multiscale Memory Comparator (MMC) for few-shot video segmentation that combines information across scales within a transformer decoder. Typical multiscale transformer decoders for segmentation tasks learn a compressed representation, their queries, through information exchange across scales. Unlike previous work, we instead preserve the detailed feature maps during across scale information exchange via a multiscale memory transformer decoding to reduce confusion between the background and novel class. Integral to the approach, we investigate multiple forms of information exchange across scales in different tasks and provide insights with empirical evidence on which to use in each task. The overall comparisons among query and support features benefit from both rich semantics and precise localization. We demonstrate our approach primarily on few-shot video object segmentation and an adapted version on the fully supervised counterpart. In all cases, our approach outperforms the baseline and yields state-of-the-art performance. Our code is publicly available at https://github.com/MSiam/MMC-MultiscaleMemory. △ Less

Submitted 15 July, 2023; originally announced July 2023.

arXiv:2306.14733 [pdf]

Temperature Dependent Failure of Atomically Thin MoTe2

Authors: A S M Redwan Haider, Ahmad Fatehi Ali Mohammed Hezam, Md Akibul Islam, Yeasir Arafat, Mohammad Tanvirul Ferdaous, Sayedus Salehin, Md. Rezwanul Karim

Abstract: In this study, we systematically investigated the mechanical responses of monolayer molybdenum ditelluride (MoTe2) using molecular dynamics (MD) simulations. The tensile behavior of trigonal prismatic phase (2H phase) MoTe2 under uniaxial strain was simulated in the armchair and zigzag directions. We also investigated the crack formation and propagation in both armchair and zigzag directions at 10… ▽ More In this study, we systematically investigated the mechanical responses of monolayer molybdenum ditelluride (MoTe2) using molecular dynamics (MD) simulations. The tensile behavior of trigonal prismatic phase (2H phase) MoTe2 under uniaxial strain was simulated in the armchair and zigzag directions. We also investigated the crack formation and propagation in both armchair and zigzag directions at 10K and 300K to understand the fracture behavior of monolayer MoTe2 at different temperatures. The MD simulations show clean cleavage for the armchair direction, and the cracks were numerous and scattered in the case of the zigzag direction. Finally, we investigated the effect of temperature on Young's modulus and fracture stress of monolayer MoTe2. The results show that at a strain rate of 10^-4 ps^-1, the fracture strength of monolayer MoTe2 in the armchair and zigzag directions at 10K is 16.33 GPa (11.43 N/m) and 13.71 GPa (9.46 N/m) under a 24% and 18% fracture strain, respectively. The fracture strength of monolayer MoTe2 in the armchair and zigzag direction at 600K is 10.81 GPa (7.56 N/m) and 10.13 GPa (7.09 N/m) under a 12.5% and 12.47% fracture strain, respectively. △ Less

Submitted 20 January, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

Comments: 22 Pages, 7 Figures

arXiv:2304.05930 [pdf, other]

A Unified Multiscale Encoder-Decoder Transformer for Video Segmentation

Authors: Rezaul Karim, He Zhao, Richard P. Wildes, Mennatullah Siam

Abstract: In this paper, we present an end-to-end trainable unified multiscale encoder-decoder transformer that is focused on dense prediction tasks in video. The presented Multiscale Encoder-Decoder Video Transformer (MED-VT) uses multiscale representation throughout and employs an optional input beyond video (e.g., audio), when available, for multimodal processing (MED-VT++). Multiscale representation at… ▽ More In this paper, we present an end-to-end trainable unified multiscale encoder-decoder transformer that is focused on dense prediction tasks in video. The presented Multiscale Encoder-Decoder Video Transformer (MED-VT) uses multiscale representation throughout and employs an optional input beyond video (e.g., audio), when available, for multimodal processing (MED-VT++). Multiscale representation at both encoder and decoder yields three key benefits: (i) implicit extraction of spatiotemporal features at different levels of abstraction for capturing dynamics without reliance on input optical flow, (ii) temporal consistency at encoding and (iii) coarse-to-fine detection for high-level (e.g., object) semantics to guide precise localization at decoding. Moreover, we present a transductive learning scheme through many-to-many label propagation to provide temporally consistent video predictions. We showcase MED-VT/MED-VT++ on three unimodal video segmentation tasks (Automatic Video Object Segmentation (AVOS), actor-action segmentation and Video Semantic Segmentation (VSS)) as well as a multimodal segmentation task (Audio-Visual Segmentation (AVS)). Results show that the proposed architecture outperforms alternative state-of-the-art approaches on multiple benchmarks using only video (and optional audio) as input, without reliance on optical flow. Finally, to document details of the model's internal learned representations, we present a detailed interpretability study, encompassing both quantitative and qualitative analyses. △ Less

Submitted 26 February, 2024; v1 submitted 12 April, 2023; originally announced April 2023.

Comments: Extension of CVPR'23 paper for journal submission

arXiv:2302.11880 [pdf, other]

Catch Me If You Can: Semi-supervised Graph Learning for Spotting Money Laundering

Authors: Md. Rezaul Karim, Felix Hermsen, Sisay Adugna Chala, Paola de Perthuis, Avikarsha Mandal

Abstract: Money laundering is the process where criminals use financial services to move massive amounts of illegal money to untraceable destinations and integrate them into legitimate financial systems. It is very crucial to identify such activities accurately and reliably in order to enforce an anti-money laundering (AML). Despite tremendous efforts to AML only a tiny fraction of illicit activities are pr… ▽ More Money laundering is the process where criminals use financial services to move massive amounts of illegal money to untraceable destinations and integrate them into legitimate financial systems. It is very crucial to identify such activities accurately and reliably in order to enforce an anti-money laundering (AML). Despite tremendous efforts to AML only a tiny fraction of illicit activities are prevented. From a given graph of money transfers between accounts of a bank, existing approaches attempted to detect money laundering. In particular, some approaches employ structural and behavioural dynamics of dense subgraph detection thereby not taking into consideration that money laundering involves high-volume flows of funds through chains of bank accounts. Some approaches model the transactions in the form of multipartite graphs to detect the complete flow of money from source to destination. However, existing approaches yield lower detection accuracy, making them less reliable. In this paper, we employ semi-supervised graph learning techniques on graphs of financial transactions in order to identify nodes involved in potential money laundering. Experimental results suggest that our approach can sport money laundering from real and synthetic transaction graphs. △ Less

Submitted 24 February, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

arXiv:2302.04737 [pdf, other]

A Biomedical Knowledge Graph for Biomarker Discovery in Cancer

Authors: Md. Rezaul Karim, Lina Molinas Comet, Oya Beyan, Dietrich Rebholz-Schuhmann, Stefan Decker

Abstract: Structured and unstructured data and facts about drugs, genes, protein, viruses, and their mechanism are spread across a huge number of scientific articles. These articles are a large-scale knowledge source and can have a huge impact on disseminating knowledge about the mechanisms of certain biological processes. A domain-specific knowledge graph~(KG) is an explicit conceptualization of a specific… ▽ More Structured and unstructured data and facts about drugs, genes, protein, viruses, and their mechanism are spread across a huge number of scientific articles. These articles are a large-scale knowledge source and can have a huge impact on disseminating knowledge about the mechanisms of certain biological processes. A domain-specific knowledge graph~(KG) is an explicit conceptualization of a specific subject-matter domain represented w.r.t semantically interrelated entities and relations. A KG can be constructed by integrating such facts and data and be used for data integration, exploration, and federated queries. However, exploration and querying large-scale KGs is tedious for certain groups of users due to a lack of knowledge about underlying data assets or semantic technologies. Such a KG will not only allow deducing new knowledge and question answering(QA) but also allows domain experts to explore. Since cross-disciplinary explanations are important for accurate diagnosis, it is important to query the KG to provide interactive explanations about learned biomarkers. Inspired by these, we construct a domain-specific KG, particularly for cancer-specific biomarker discovery. The KG is constructed by integrating cancer-related knowledge and facts from multiple sources. First, we construct a domain-specific ontology, which we call OncoNet Ontology (ONO). The ONO ontology is developed to enable semantic reasoning for verification of the predictions for relations between diseases and genes. The KG is then developed and enriched by harmonizing the ONO, additional metadata schemas, ontologies, controlled vocabularies, and additional concepts from external sources using a BERT-based information extraction method. BioBERT and SciBERT are finetuned with the selected articles crawled from PubMed. We listed down some queries and some examples of QA and deducing knowledge based on the KG. △ Less

Submitted 23 February, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

arXiv:2212.14558 [pdf, other]

Covid-19 Analysis Using Tensor Methods

Authors: Dipak Dulal, Ramin Goudarzi Karim, Carmeliza Navasca

Abstract: In this paper, we use tensor models to analyze Covid-19 pandemic data. First, we use tensor models, canonical polyadic and higher-order Tucker decompositions, to extract patterns over multiple modes. Second, we implement a tensor completion algorithm using canonical polyadic tensor decomposition to predict spatiotemporal data from multiple spatial sources and to identify Covid-19 hotspots. We appl… ▽ More In this paper, we use tensor models to analyze Covid-19 pandemic data. First, we use tensor models, canonical polyadic and higher-order Tucker decompositions, to extract patterns over multiple modes. Second, we implement a tensor completion algorithm using canonical polyadic tensor decomposition to predict spatiotemporal data from multiple spatial sources and to identify Covid-19 hotspots. We apply a regularized iterative tensor completion technique with a practical regularization parameter estimator to predict the spread of Covid-19 cases and to find and identify hotspots. Our method can predict weekly and quarterly Covid-19 spreads with high accuracy. Third, we analyze Covid-19 data in the US using a novel sampling method for alternating least-squares. Moreover, we compare the algorithms with standard tensor decompositions in terms of their interpretability, visualization and cost analysis. Finally, we demonstrate the efficacy of the methods by applying the techniques to New Jersey's Covid-19 data. △ Less

Submitted 30 December, 2022; originally announced December 2022.

Comments: 22 pages, 22 figures

arXiv:2212.13261 [pdf, other]

Explainable AI for Bioinformatics: Methods, Tools, and Applications

Authors: Md. Rezaul Karim, Tanhim Islam, Oya Beyan, Christoph Lange, Michael Cochez, Dietrich Rebholz-Schuhmann, Stefan Decker

Abstract: Artificial intelligence (AI) systems utilizing deep neural networks (DNNs) and machine learning (ML) algorithms are widely used for solving important problems in bioinformatics, biomedical informatics, and precision medicine. However, complex DNNs or ML models, which are often perceived as opaque and black-box, can make it difficult to understand the reasoning behind their decisions. This lack of… ▽ More Artificial intelligence (AI) systems utilizing deep neural networks (DNNs) and machine learning (ML) algorithms are widely used for solving important problems in bioinformatics, biomedical informatics, and precision medicine. However, complex DNNs or ML models, which are often perceived as opaque and black-box, can make it difficult to understand the reasoning behind their decisions. This lack of transparency can be a challenge for both end-users and decision-makers, as well as AI developers. Additionally, in sensitive areas like healthcare, explainability and accountability are not only desirable but also legally required for AI systems that can have a significant impact on human lives. Fairness is another growing concern, as algorithmic decisions should not show bias or discrimination towards certain groups or individuals based on sensitive attributes. Explainable artificial intelligence (XAI) aims to overcome the opaqueness of black-box models and provide transparency in how AI systems make decisions. Interpretable ML models can explain how they make predictions and the factors that influence their outcomes. However, most state-of-the-art interpretable ML methods are domain-agnostic and evolved from fields like computer vision, automated reasoning, or statistics, making direct application to bioinformatics problems challenging without customization and domain-specific adaptation. In this paper, we discuss the importance of explainability in the context of bioinformatics, provide an overview of model-specific and model-agnostic interpretable ML methods and tools, and outline their potential caveats and drawbacks. Besides, we discuss how to customize existing interpretable ML methods for bioinformatics problems. Nevertheless, we demonstrate how XAI methods can improve transparency through case studies in bioimaging, cancer genomics, and text mining. △ Less

Submitted 23 February, 2023; v1 submitted 25 December, 2022; originally announced December 2022.

arXiv:2211.16848 [pdf, other]

Compound Multivariate Hawkes Processes: Large Deviations and Rare Event Simulation

Authors: Raviar S. Karim, Roger J. A. Laeven, Michel R. H. Mandjes

Abstract: In this paper, we establish a large deviations principle for a multivariate compound process induced by a multivariate Hawkes process with random marks. Our proof hinges on showing essential smoothness of the limiting cumulant of the multivariate compound process, resolving the inherent complication that this cumulant is implicitly characterized through a fixed-point representation. We employ the… ▽ More In this paper, we establish a large deviations principle for a multivariate compound process induced by a multivariate Hawkes process with random marks. Our proof hinges on showing essential smoothness of the limiting cumulant of the multivariate compound process, resolving the inherent complication that this cumulant is implicitly characterized through a fixed-point representation. We employ the large deviations principle to derive logarithmic asymptotic results on the marginal ruin probabilities of the associated multivariate risk process. We also show how to conduct rare event simulation in this multivariate setting using importance sampling and prove the asymptotic efficiency of our importance sampling based estimator. The paper is concluded with a systematic assessment of the performance of our rare event simulation procedure. △ Less

Submitted 27 June, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

arXiv:2211.15013 [pdf, other]

Enhancing Data Security for Cloud Computing Applications through Distributed Blockchain-based SDN Architecture in IoT Networks

Authors: Anichur Rahman, Md. Jahidul Islam, Rafiqul Islam, Ayesha Aziz, Dipanjali Kundu, Sadia Sazzad, Md. Razaul Karim, Mahedi Hasan, Ziaur Rahman, Said Elnaffar, Shahab S. Band

Abstract: Blockchain (BC) and Software Defined Networking (SDN) are some of the most prominent emerging technologies in recent research. These technologies provide security, integrity, as well as confidentiality in their respective applications. Cloud computing has also been a popular comprehensive technology for several years. Confidential information is often shared with the cloud infrastructure to give c… ▽ More Blockchain (BC) and Software Defined Networking (SDN) are some of the most prominent emerging technologies in recent research. These technologies provide security, integrity, as well as confidentiality in their respective applications. Cloud computing has also been a popular comprehensive technology for several years. Confidential information is often shared with the cloud infrastructure to give customers access to remote resources, such as computation and storage operations. However, cloud computing also presents substantial security threats, issues, and challenges. Therefore, to overcome these difficulties, we propose integrating Blockchain and SDN in the cloud computing platform. In this research, we introduce the architecture to better secure clouds. Moreover, we leverage a distributed Blockchain approach to convey security, confidentiality, privacy, integrity, adaptability, and scalability in the proposed architecture. BC provides a distributed or decentralized and efficient environment for users. Also, we present an SDN approach to improving the reliability, stability, and load balancing capabilities of the cloud infrastructure. Finally, we provide an experimental evaluation of the performance of our SDN and BC-based implementation using different parameters, also monitoring some attacks in the system and proving its efficacy. △ Less

Submitted 27 November, 2022; originally announced November 2022.

Comments: 12 Pages 16 Figures 3 Tables

ACM Class: E.3

arXiv:2210.09723 [pdf, other]

doi 10.1007/978-3-031-33231-9_12

Textual Entailment Recognition with Semantic Features from Empirical Text Representation

Authors: Md Shajalal, Md Atabuzzaman, Maksuda Bilkis Baby, Md Rezaul Karim, Alexander Boden

Abstract: Textual entailment recognition is one of the basic natural language understanding(NLU) tasks. Understanding the meaning of sentences is a prerequisite before applying any natural language processing(NLP) techniques to automatically recognize the textual entailment. A text entails a hypothesis if and only if the true value of the hypothesis follows the text. Classical approaches generally utilize t… ▽ More Textual entailment recognition is one of the basic natural language understanding(NLU) tasks. Understanding the meaning of sentences is a prerequisite before applying any natural language processing(NLP) techniques to automatically recognize the textual entailment. A text entails a hypothesis if and only if the true value of the hypothesis follows the text. Classical approaches generally utilize the feature value of each word from word embedding to represent the sentences. In this paper, we propose a novel approach to identifying the textual entailment relationship between text and hypothesis, thereby introducing a new semantic feature focusing on empirical threshold-based semantic text representation. We employ an element-wise Manhattan distance vector-based feature that can identify the semantic entailment relationship between the text-hypothesis pair. We carried out several experiments on a benchmark entailment classification(SICK-RTE) dataset. We train several machine learning(ML) algorithms applying both semantic and lexical features to classify the text-hypothesis pair as entailment, neutral, or contradiction. Our empirical sentence representation technique enriches the semantic information of the texts and hypotheses found to be more efficient than the classical ones. In the end, our approach significantly outperforms known methods in understanding the meaning of the sentences for the textual entailment classification task. △ Less

Submitted 19 June, 2023; v1 submitted 18 October, 2022; originally announced October 2022.

Comments: Pre-print for our paper at International Conference on Speech & Language Technology for Low-resource Languages (SPELLL'2022)

Journal ref: Pre-print for our paper at International Conference on Speech & Language Technology for Low-resource Languages (SPELLL'2022)

arXiv:2210.06469 [pdf, other]

Participatory Design for Mental Health Data Visualization on a Social Robot

Authors: Raida Karim, Edgar Lopez, Elin A. Björling, Maya Cakmak

Abstract: The intersection of data visualization and human-robot interaction (HRI) is a burgeoning field. Understanding, communicating, and processing different kinds of data for creating versatile visualizations can benefit HRI. Conversely, expressing different kinds of data generated from HRI through effective visualizations can provide interesting insights. Our work adds to the literature of this growing… ▽ More The intersection of data visualization and human-robot interaction (HRI) is a burgeoning field. Understanding, communicating, and processing different kinds of data for creating versatile visualizations can benefit HRI. Conversely, expressing different kinds of data generated from HRI through effective visualizations can provide interesting insights. Our work adds to the literature of this growing domain. In this paper, we present our exploratory work on visualizing mental health data on a social robot. Particularly, we discuss development of mental health data visualizations using a participatory design (PD) approach. As a first step with mental health data visualization on a social robot, this work paves the way for relevant further work and using social robots as data visualization tools. △ Less

Submitted 20 August, 2022; originally announced October 2022.

Comments: Accepted to workshop on participatory design (PD) in human-robot interaction in RO-MAN 2022

arXiv:2210.06040 [pdf, other]

Question Answering Over Biological Knowledge Graph via Amazon Alexa

Authors: Md. Rezaul Karim, Hussain Ali, Prinon Das, Mohamed Abdelwaheb, Stefan Decker

Abstract: Structured and unstructured data and facts about drugs, genes, protein, viruses, and their mechanism are spread across a huge number of scientific articles. These articles are a large-scale knowledge source and can have a huge impact on disseminating knowledge about the mechanisms of certain biological processes. A knowledge graph (KG) can be constructed by integrating such facts and data and be u… ▽ More Structured and unstructured data and facts about drugs, genes, protein, viruses, and their mechanism are spread across a huge number of scientific articles. These articles are a large-scale knowledge source and can have a huge impact on disseminating knowledge about the mechanisms of certain biological processes. A knowledge graph (KG) can be constructed by integrating such facts and data and be used for data integration, exploration, and federated queries. However, exploration and querying large-scale KGs is tedious for certain groups of users due to a lack of knowledge about underlying data assets or semantic technologies. A question-answering (QA) system allows the answer of natural language questions over KGs automatically using triples contained in a KG. Recently, the use and adaption of digital assistants are getting wider owing to their capability at enabling users to voice commands to control smart systems or devices. This paper is about using Amazon Alexa's voice-enabled interface for QA over KGs. As a proof-of-concept, we use the well-known DisgeNET KG, which contains knowledge covering 1.13 million gene-disease associations between 21,671 genes and 30,170 diseases, disorders, and clinical or abnormal human phenotypes. Our study shows how Alex could be of help to find facts about certain biological entities from large-scale knowledge bases. △ Less

Submitted 12 October, 2022; originally announced October 2022.

Comments: This paper is based on the Knowledge Graph Lab course (https://dbis.rwth-aachen.de/dbis/index.php/) offered at Computer Science 5 - Information Systems and Databases, RWTH Aachen University, Germany, and a joint collaboration with Osthus GmbH (https://www.osthus.com/), Aachen, Germany

arXiv:2208.13405 [pdf, other]

Interpreting Black-box Machine Learning Models for High Dimensional Datasets

Authors: Md. Rezaul Karim, Md. Shajalal, Alex Graß, Till Döhmen, Sisay Adugna Chala, Alexander Boden, Christian Beecks, Stefan Decker

Abstract: Deep neural networks (DNNs) have been shown to outperform traditional machine learning algorithms in a broad variety of application domains due to their effectiveness in modeling complex problems and handling high-dimensional datasets. Many real-life datasets, however, are of increasingly high dimensionality, where a large number of features may be irrelevant for both supervised and unsupervised l… ▽ More Deep neural networks (DNNs) have been shown to outperform traditional machine learning algorithms in a broad variety of application domains due to their effectiveness in modeling complex problems and handling high-dimensional datasets. Many real-life datasets, however, are of increasingly high dimensionality, where a large number of features may be irrelevant for both supervised and unsupervised learning tasks. The inclusion of such features would not only introduce unwanted noise but also increase computational complexity. Furthermore, due to high non-linearity and dependency among a large number of features, DNN models tend to be unavoidably opaque and perceived as black-box methods because of their not well-understood internal functioning. Their algorithmic complexity is often simply beyond the capacities of humans to understand the interplay among myriads of hyperparameters. A well-interpretable model can identify statistically significant features and explain the way they affect the model's outcome. In this paper, we propose an efficient method to improve the interpretability of black-box models for classification tasks in the case of high-dimensional datasets. First, we train a black-box model on a high-dimensional dataset to learn the embeddings on which the classification is performed. To decompose the inner working principles of the black-box model and to identify top-k important features, we employ different probing and perturbing techniques. We then approximate the behavior of the black-box model by means of an interpretable surrogate model on the top-k feature space. Finally, we derive decision rules and local explanations from the surrogate model to explain individual decisions. Our approach outperforms state-of-the-art methods like TabNet and XGboost when tested on different datasets with varying dimensionality between 50 and 20,000 w.r.t metrics and explainability. △ Less

Submitted 21 November, 2023; v1 submitted 29 August, 2022; originally announced August 2022.

Comments: This paper is currently under review in a journal

arXiv:2208.08174 [pdf, other]

doi 10.3847/1538-3881/ac8a44

SOFIA FEEDBACK survey: PDR diagnostics of stellar feedback in different regions of RCW 49

Authors: M. Tiwari, M. Wolfire, M. W. Pound, E. Tarantino, R. Karim, L. Bonne, C. Buchbender, R. Güsten, C. Guevara, S. Kabanovic, Ü. Kavak, M. Mertens, N. Schneider, R. Simon, J. Stutzki, A. G. G. M. Tielens

Abstract: We quantified the effects of stellar feedback in RCW 49 by determining the physical conditions in different regions using the [CII] 158 $μ$m and [OI] 63 $μ$m observations from SOFIA, the $^{12}$CO (3-2) observations from APEX and the H$_2$ line observations from Spitzer telescopes. Large maps of RCW 49 were observed with the SOFIA and APEX telescopes, while the Spitzer observations were only avail… ▽ More We quantified the effects of stellar feedback in RCW 49 by determining the physical conditions in different regions using the [CII] 158 $μ$m and [OI] 63 $μ$m observations from SOFIA, the $^{12}$CO (3-2) observations from APEX and the H$_2$ line observations from Spitzer telescopes. Large maps of RCW 49 were observed with the SOFIA and APEX telescopes, while the Spitzer observations were only available towards three small areas. From our qualitative analysis, we found that the H$_2$ 0-0 S(2) emission line probes denser gas compared to the H$_2$ 0-0 S(1) line. In four regions ("northern cloud", "pillar", "ridge", and "shell"), we compared our observations with the updated PDR Toolbox models and derived the integrated far-ultraviolet flux between 6-13.6 eV ($G_{\rm 0}$), H nucleus density ($n$), temperatures and pressures. We found the ridge to have the highest $G_{\rm 0}$ (2.4 $\times$ 10$^3$ Habing units), while the northern cloud has the lowest $G_{\rm 0}$ (5 $\times$ 10$^2$ Habing units). This is a direct consequence of the location of these regions with respect to the Wd2 cluster. The ridge also has a high density (6.4 $\times$ 10$^3$ cm$^{-3}$), which is consistent with its ongoing star formation. Among the Spitzer positions, we found the one closest to the Wd2 cluster to be the densest, suggesting an early phase of star formation. Furthermore, the Spitzer position that overlaps with the shell was found to have the highest $G_{\rm 0}$ and we expect this to be a result of its proximity to an O9V star. △ Less

Submitted 17 August, 2022; originally announced August 2022.

Comments: 18 pages, 12 figures

arXiv:2208.04389 [pdf, other]

Share with Me: A Study on a Social Robot Collecting Mental Health Data

Authors: Raida Karim, Edgar Lopez, Katelynn Oleson, Tony Li, Elin A. Björling, Maya Cakmak

Abstract: Social robots have been used to assist with mental well-being in various ways such as to help children with autism improve on their social skills and executive functioning such as joint attention and bodily awareness. They are also used to help older adults by reducing feelings of isolation and loneliness, as well as supporting mental well-being of teens and children. However, existing work in thi… ▽ More Social robots have been used to assist with mental well-being in various ways such as to help children with autism improve on their social skills and executive functioning such as joint attention and bodily awareness. They are also used to help older adults by reducing feelings of isolation and loneliness, as well as supporting mental well-being of teens and children. However, existing work in this sphere has only shown support for mental health through social robots by responding interactively to human activity to help them learn relevant skills. We hypothesize that humans can also get help from social robots in mental well-being by releasing or sharing their mental health data with the social robots. In this paper, we present a human-robot interaction (HRI) study to evaluate this hypothesis. During the five-day study, a total of fifty-five (n=55) participants shared their in-the-moment mood and stress levels with a social robot. We saw a majority of positive results indicating it is worth conducting future work in this direction, and the potential of social robots to largely support mental well-being. △ Less

Submitted 8 August, 2022; originally announced August 2022.

Comments: Submitted to ICSR 2022

arXiv:2208.02177 [pdf, other]

On the Integration of Blockchain and SDN: Overview, Applications, and Future Perspectives

Authors: Anichur Rahman, Antonio Montieri, Dipanjali Kundu, Md. Razaul Karim, Md. Jahidul Islam, Sara Umme, Alfredo Nascita, Antonio Pescapè

Abstract: Blockchain (BC) and Software-Defined Networking (SDN) are leading technologies which have recently found applications in several network-related scenarios and have consequently experienced a growing interest in the research community. Indeed, current networks connect a massive number of objects over the Internet and in this complex scenario, to ensure security, privacy, confidentiality, and progra… ▽ More Blockchain (BC) and Software-Defined Networking (SDN) are leading technologies which have recently found applications in several network-related scenarios and have consequently experienced a growing interest in the research community. Indeed, current networks connect a massive number of objects over the Internet and in this complex scenario, to ensure security, privacy, confidentiality, and programmability, the utilization of BC and SDN have been successfully proposed. In this work, we provide a comprehensive survey regarding these two recent research trends and review the related state-of-the-art literature. We first describe the main features of each technology and discuss their most common and used variants. Furthermore, we envision the integration of such technologies to jointly take advantage of these latter efficiently. Indeed, we consider their group-wise utilization -- named BC-SDN -- based on the need for stronger security and privacy. Additionally, we cover the application fields of these technologies both individually and combined. Finally, we discuss the open issues of reviewed research and describe potential directions for future avenues regarding the integration of BC and SDN. To summarize, the contribution of the present survey spans from an overview of the literature background on BC and SDN to the discussion of the benefits and limitations of BC-SDN integration in different fields, which also raises open challenges and possible future avenues examined herein. To the best of our knowledge, compared to existing surveys, this is the first work that analyzes the aforementioned aspects in light of a broad BC-SDN integration, with a specific focus on security and privacy issues in actual utilization scenarios. △ Less

Submitted 3 August, 2022; originally announced August 2022.

Comments: 42 pages, 14 figures, to be published in Journal of Network and Systems Management - Special Issue on Blockchains and Distributed Ledgers in Network and Service Management

ACM Class: C.2.3; C.2.4

arXiv:2207.06479 [pdf, other]

doi 10.3847/1538-4357/ac8052

The SOFIA FEEDBACK Legacy Survey: Dynamics and mass ejection in the bipolar HII region RCW 36

Authors: L. Bonne, N. Schneider, P. García, A. Bij, P. Broos, L. Fissel, R. Guesten, J. Jackson, R. Simon, L. Townsley, A. Zavagno, R. Aladro, C. Buchbender, C. Guevara, R. Higgins, A. M. Jacob, S. Kabanovic, R. Karim, A. Soam, J. Stutzki, M. Tiwari, F. Wyrowski, A. G. G. M. Tielens

Abstract: We present [CII] 158 $μ$m and [OI] 63 $μ$m observations of the bipolar HII region RCW 36 in the Vela C molecular cloud, obtained within the SOFIA legacy project FEEDBACK, which is complemented with APEX $^{12/13}$CO(3-2) and Chandra X-ray (0.5-7 keV) data. This shows that the molecular ring, forming the waist of the bipolar nebula, expands with a velocity of 1 - 1.9 km s$^{-1}$. We also observe an… ▽ More We present [CII] 158 $μ$m and [OI] 63 $μ$m observations of the bipolar HII region RCW 36 in the Vela C molecular cloud, obtained within the SOFIA legacy project FEEDBACK, which is complemented with APEX $^{12/13}$CO(3-2) and Chandra X-ray (0.5-7 keV) data. This shows that the molecular ring, forming the waist of the bipolar nebula, expands with a velocity of 1 - 1.9 km s$^{-1}$. We also observe an increased linewidth in the ring indicating that turbulence is driven by energy injection from the stellar feedback. The bipolar cavity hosts blue-shifted expanding [CII] shells at 5.2$\pm$0.5$\pm$0.5 km s$^{-1}$ (statistical and systematic uncertainty) which indicates that expansion out of the dense gas happens non-uniformly and that the observed bipolar phase might be relatively short ($\sim$0.2 Myr). The X-ray observations show diffuse emission that traces a hot plasma, created by stellar winds, in and around RCW 36. At least 50 \% of the stellar wind energy is missing in RCW 36. This is likely due to leakage which is clearing even larger cavities around the bipolar RCW 36 region. Lastly, the cavities host high-velocity wings in [CII] which indicates relatively high mass ejection rates ($\sim$5$\times$10$^{-4}$ M$_{\odot}$ yr$^{-1}$). This could be driven by stellar winds and/or radiation pressure, but remains difficult to constrain. This local mass ejection, which can remove all mass within 1 pc of RCW 36 in 1-2 Myr, and the large-scale clearing of ambient gas in the Vela C cloud indicates that stellar feedback plays a significant role in suppressing the star formation efficiency (SFE). △ Less

Submitted 13 July, 2022; originally announced July 2022.

Comments: 38 pages, 27 figures, 8 tables, accepted in ApJ

arXiv:2205.09193 [pdf, other]

doi 10.1051/0004-6361/202142596

Filamentary structures of ionized gas in Cygnus X

Authors: K. L. Emig, G. J. White, P. Salas, R. L. Karim, R. J. van Weeren, P. J. Teuben, A. Zavagno, P. Chiu, M. Haverkorn, J. B. R. Oonk, E. Orrú, I. M. Polderman, W. Reich, H. J. A. Röttgering, A. G. G. M. Tielens

Abstract: Ionized gas probes the influence of massive stars on their environment. The Cygnus X region (d~1.5 kpc) is one of the most massive star forming complexes in our Galaxy, in which the Cyg OB2 association (age of 3-5 Myr and stellar mass $2 \times 10^{4}$ M$_{\odot}$) has a dominant influence. We observe the Cygnus X region at 148 MHz using the Low Frequency Array (LOFAR) and take into account short-… ▽ More Ionized gas probes the influence of massive stars on their environment. The Cygnus X region (d~1.5 kpc) is one of the most massive star forming complexes in our Galaxy, in which the Cyg OB2 association (age of 3-5 Myr and stellar mass $2 \times 10^{4}$ M$_{\odot}$) has a dominant influence. We observe the Cygnus X region at 148 MHz using the Low Frequency Array (LOFAR) and take into account short-spacing information during image deconvolution. Together with data from the Canadian Galactic Plane Survey, we investigate the morphology, distribution, and physical conditions of low-density ionized gas in a $4^{\circ} \times 4^{\circ}$ (100 pc $\times$ 100 pc) region at a resolution of 2' (0.9 pc). The Galactic radio emission in the region analyzed is almost entirely thermal (free-free) at 148 MHz, with emission measures of $10^3 < EM~{\rm[pc~cm^{-6}]} < 10^6$. As filamentary structure is a prominent feature of the emission, we use DisPerSE and FilChap to identify filamentary ridges and characterize their radial ($EM$) profiles. The distribution of radial profiles has a characteristic width of 4.3 pc and a power-law distribution ($β= -1.8 \pm 0.1$) in peak $EM$ down to our completeness limit of 4200 pc cm$^{-6}$. The electron densities of the filamentary structure range from $10 < n_e~{\rm[cm^{-3}]} < 400$ with a median value of 35 cm$^{-3}$, remarkably similar to [N II] surveys of ionized gas. Cyg OB2 may ionize at most two-thirds of the total ionized gas and the ionized gas in filaments. More than half of the filamentary structures are likely photoevaporating surfaces flowing into a surrounding diffuse (~5 cm$^{-3}$) medium. However, this is likely not the case for all ionized gas ridges. A characteristic width in the distribution of ionized gas points to the stellar winds of Cyg OB2 creating a fraction of the ionized filaments through swept-up ionized gas or dissipated turbulence. △ Less

Submitted 18 May, 2022; originally announced May 2022.

Comments: 19 pages, 14 figures, 1 table; accepted for publication in A&A

Journal ref: A&A 664, A88 (2022)

arXiv:2204.10196 [pdf, other]

Multimodal Hate Speech Detection from Bengali Memes and Texts

Authors: Md. Rezaul Karim, Sumon Kanti Dey, Tanhim Islam, Md. Shajalal, Bharathi Raja Chakravarthi

Abstract: Numerous machine learning (ML) and deep learning (DL)-based approaches have been proposed to utilize textual data from social media for anti-social behavior analysis like cyberbullying, fake news detection, and identification of hate speech mainly for highly-resourced languages such as English. However, despite having a lot of diversity and millions of native speakers, some languages like Bengali… ▽ More Numerous machine learning (ML) and deep learning (DL)-based approaches have been proposed to utilize textual data from social media for anti-social behavior analysis like cyberbullying, fake news detection, and identification of hate speech mainly for highly-resourced languages such as English. However, despite having a lot of diversity and millions of native speakers, some languages like Bengali are under-resourced, which is due to a lack of computational resources for natural language processing (NLP). Similar to other languages, Bengali social media contents also include images along with texts (e.g., multimodal memes are posted by embedding short texts into images on Facebook). Therefore, only the textual data is not enough to judge them since images might give extra context to make a proper judgement. This paper is about hate speech detection from multimodal Bengali memes and texts. We prepared the only multimodal hate speech dataset for-a-kind of problem for Bengali, which we use to train state-of-the-art neural architectures (e.g., Bi-LSTM/Conv-LSTM with word embeddings, ConvNets + pre-trained language models, e.g., monolingual Bangla BERT, multilingual BERT-cased/uncased, and XLM-RoBERTa) to jointly analyze textual and visual information for hate speech detection. Conv-LSTM and XLM-RoBERTa models performed best for texts, yielding F1 scores of 0.78 and 0.82, respectively. As of memes, ResNet-152 and DenseNet-161 models yield F1 scores of 0.78 and 0.79, respectively. As for multimodal fusion, XLM-RoBERTa + DenseNet-161 performed the best, yielding an F1 score of 0.83. Our study suggests that text modality is most useful for hate speech detection, while memes are moderately useful. △ Less

Submitted 21 December, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

Comments: arXiv admin note: text overlap with arXiv:2107.00648 by other authors

Journal ref: Pre-print for our paper at International Conference on Speech & Language Technology for Low-resource Languages (SPELLL'2022)

arXiv:2111.06566 [pdf]

doi 10.11591/ijpeds.v12.i4.pp2209-2220

An analysis of voltage source inverter switches fault classification using short time Fourier transform

Authors: Mustafa Manap, Srete Nikolovski, Aleksandr Skamyin, Rony Karim, Tole Sutikno, Mohd Hatta Jopri

Abstract: The dependability of power electronics systems, such as three-phase inverters, is critical in a variety of applications. Different types of failures that occur in an inverter circuit might affect system operation and raise the entire cost of the manufacturing process. As a result, detecting and identifying inverter problems for such devices is critical in industry. This study presents the short-ti… ▽ More The dependability of power electronics systems, such as three-phase inverters, is critical in a variety of applications. Different types of failures that occur in an inverter circuit might affect system operation and raise the entire cost of the manufacturing process. As a result, detecting and identifying inverter problems for such devices is critical in industry. This study presents the short-time Fourier transform (STFT) for fault classification and identification in three-phase type, voltage source inverter (VSI) switches. Time-frequency representation (TFR) represents the signal analysis of STFT, which includes total harmonic distortion, instantaneous RMS current, RMS fundamental current, total non harmonic distortion, total waveform distortion and average current. The features of the faults are used with a rule-based classifier based on the signal parameters to categorise and detect the switch faults. The suggested method's performance is evaluated using 60 signals containing short and open circuit faults with varying characteristics for each switch in VSI. The classification results demonstrate the proposed technique is good to be implemented for VSI switches faults classification, with an accuracy classification rate of 98.3%. △ Less

Submitted 12 November, 2021; originally announced November 2021.

Comments: International Journal of Power Electronics and Drive Systems (IJPEDS) Vol. 12, No. 4, December 2021, pp. 2209~2220

arXiv:2106.03560 [pdf, other]

Exact and Asymptotic Analysis of General Multivariate Hawkes Processes and Induced Population Processes

Authors: Raviar Karim, Roger J. A. Laeven, Michel Mandjes

Abstract: This paper considers population processes in which general, not necessarily Markovian, multivariate Hawkes processes dictate the stochastic arrivals. We establish results to determine the corresponding time-dependent joint probability distribution, allowing for general intensity decay functions, general intensity jumps, and general sojourn times. We obtain an exact, full characterization of the ti… ▽ More This paper considers population processes in which general, not necessarily Markovian, multivariate Hawkes processes dictate the stochastic arrivals. We establish results to determine the corresponding time-dependent joint probability distribution, allowing for general intensity decay functions, general intensity jumps, and general sojourn times. We obtain an exact, full characterization of the time-dependent joint transform of the multivariate population process and its underlying intensity process in terms of a fixed-point representation and corresponding convergence results. We also derive the asymptotic tail behavior of the population process and its underlying intensity process in the setting of heavy-tailed intensity jumps. By exploiting the results we establish, arbitrary joint spatial-temporal moments and other distributional properties can now be readily evaluated using standard transform differentiation and inversion techniques, and we illustrate this in a few examples. △ Less

Submitted 7 June, 2021; originally announced June 2021.

MSC Class: 60G55; 60E10

arXiv:2104.04276 [pdf, other]

doi 10.3847/1538-4357/abf6ce

SOFIA FEEDBACK survey: exploring the dynamics of the stellar wind driven shell of RCW 49

Authors: M. Tiwari, R. Karim, M. W. Pound, M. Wolfire, A. Jacob, C. Buchbender, R. Güsten, C. Guevara, R. D. Higgins, S. Kabanovic, C. Pabst, O. Ricken, N. Schneider, R. Simon, J. Stutzki, A. G. G. M. Tielens

Abstract: We unveil the stellar wind driven shell of the luminous massive star-forming region of RCW 49 using SOFIA FEEDBACK observations of the [CII] 158 $μ$m line. The complementary dataset of the $^{12}$CO and $^{13}$CO J = 3 - 2 transitions is observed by the APEX telescope and probes the dense gas toward RCW 49. Using the spatial and spectral resolution provided by the SOFIA and APEX telescopes, we dis… ▽ More We unveil the stellar wind driven shell of the luminous massive star-forming region of RCW 49 using SOFIA FEEDBACK observations of the [CII] 158 $μ$m line. The complementary dataset of the $^{12}$CO and $^{13}$CO J = 3 - 2 transitions is observed by the APEX telescope and probes the dense gas toward RCW 49. Using the spatial and spectral resolution provided by the SOFIA and APEX telescopes, we disentangle the shell from a complex set of individual components of gas centered around RCW 49. We find that the shell of radius ~ 6 pc is expanding at a velocity of 13 km s$^{-1}$ toward the observer. Comparing our observed data with the ancillary data at X-Ray, infrared, sub-millimeter and radio wavelengths, we investigate the morphology of the region. The shell has a well defined eastern arc, while the western side is blown open and is venting plasma further into the west. Though the stellar cluster, which is ~ 2 Myr old gave rise to the shell, it only gained momentum relatively recently as we calculate the shell's expansion lifetime ~ 0.27 Myr, making the Wolf-Rayet star WR20a a likely candidate responsible for the shell's re-acceleration. △ Less

Submitted 9 April, 2021; originally announced April 2021.

Comments: 31 pages, 17 figures

arXiv:2103.00025 [pdf, ps, other]

TEC: Tensor Ensemble Classifier for Big Data

Authors: Peide Li, Rejaul Karim, Tapabrata Maiti

Abstract: Tensor (multidimensional array) classification problem has become very popular in modern applications such as image recognition and high dimensional spatio-temporal data analysis. Support Tensor Machine (STM) classifier, which is extended from the support vector machine, takes CANDECOMP / Parafac (CP) form of tensor data as input and predicts the data labels. The distribution-free and statisticall… ▽ More Tensor (multidimensional array) classification problem has become very popular in modern applications such as image recognition and high dimensional spatio-temporal data analysis. Support Tensor Machine (STM) classifier, which is extended from the support vector machine, takes CANDECOMP / Parafac (CP) form of tensor data as input and predicts the data labels. The distribution-free and statistically consistent properties of STM highlight its potential in successfully handling wide varieties of data applications. Training a STM can be computationally expensive with high-dimensional tensors. However, reducing the size of tensor with a random projection technique can reduce the computational time and cost, making it feasible to handle large size tensors on regular machines. We name an STM estimated with randomly projected tensor as Random Projection-based Support Tensor Machine (RPSTM). In this work, we propose a Tensor Ensemble Classifier (TEC), which aggregates multiple RPSTMs for big tensor classification. TEC utilizes the ensemble idea to minimize the excessive classification risk brought by random projection, providing statistically consistent predictions while taking the computational advantage of RPSTM. Since each RPSTM can be estimated independently, TEC can further take advantage of parallel computing techniques and be more computationally efficient. The theoretical and numerical results demonstrate the decent performance of TEC model in high-dimensional tensor classification problems. The model prediction is statistically consistent as its risk is shown to converge to the optimal Bayes risk. Besides, we highlight the trade-off between the computational cost and the prediction risk for TEC model. The method is validated by extensive simulation and a real data example. We prepare a python package for applying TEC, which is available at our GitHub. △ Less

Submitted 26 February, 2021; originally announced March 2021.

arXiv:2102.02389 [pdf]

Experimental determination of the valence band offsets of $ZnGeN_2$ and $ZnGe_{0.94}Ga_{0.12}N_2$ with $GaN$

Authors: Md Rezaul Karim, Brenton A. Noesges, Benthara Hewage Dinushi Jayatunga, Menglin Zhu, **woo Hwang, Walter R. L. Lambrecht, Leonard J. Brillson, Kathleen Kash, Hong** Zhao

Abstract: A predicted type-II staggered band alignment with an approximately $1.4 eV$ valence band offset at the $ZnGeN_2/GaN$ heterointerface has inspired novel band-engineered $III-N/ZnGeN_2$ heterostructure-based device designs for applications in high performance optoelectronics. We report on the determination of the valence band offset between metalorganic chemical vapor deposition grown… ▽ More A predicted type-II staggered band alignment with an approximately $1.4 eV$ valence band offset at the $ZnGeN_2/GaN$ heterointerface has inspired novel band-engineered $III-N/ZnGeN_2$ heterostructure-based device designs for applications in high performance optoelectronics. We report on the determination of the valence band offset between metalorganic chemical vapor deposition grown $(ZnGe)_{1-x}Ga_{2x}N_2$, for $x = 0$ and $0.06$, and $GaN$ using X-ray photoemission spectroscopy. The valence band of $ZnGeN_2$ was found to lie $1.45-1.65 eV$ above that of $GaN$. This result agrees well with the value predicted by first-principles density functional theory calculations using the local density approximation for the potential profile and quasiparticle self-consistent GW calculations of the band edge states relative to the potential. For $(ZnGe)_{0.94}Ga_{0.12}N_2$ the value was determined to be $1.29 eV$, $~10-20\%$ lower than that of $ZnGeN_2$. The experimental determination of the large band offset between $ZnGeN_2$ and $GaN$ provides promising alternative solutions to address challenges faced with pure III-nitride-based structures and devices. △ Less

Submitted 3 February, 2021; originally announced February 2021.

arXiv:2012.14353 [pdf, other]

DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language

Authors: Md. Rezaul Karim, Sumon Kanti Dey, Tanhim Islam, Sagor Sarker, Mehadi Hasan Menon, Kabir Hossain, Bharathi Raja Chakravarthi, Md. Azam Hossain, Stefan Decker

Abstract: The exponential growths of social media and micro-blogging sites not only provide platforms for empowering freedom of expressions and individual voices, but also enables people to express anti-social behaviour like online harassment, cyberbullying, and hate speech. Numerous works have been proposed to utilize textual data for social and anti-social behaviour analysis, by predicting the contexts mo… ▽ More The exponential growths of social media and micro-blogging sites not only provide platforms for empowering freedom of expressions and individual voices, but also enables people to express anti-social behaviour like online harassment, cyberbullying, and hate speech. Numerous works have been proposed to utilize textual data for social and anti-social behaviour analysis, by predicting the contexts mostly for highly-resourced languages like English. However, some languages are under-resourced, e.g., South Asian languages like Bengali, that lack computational resources for accurate natural language processing (NLP). In this paper, we propose an explainable approach for hate speech detection from the under-resourced Bengali language, which we called DeepHateExplainer. Bengali texts are first comprehensively preprocessed, before classifying them into political, personal, geopolitical, and religious hates using a neural ensemble method of transformer-based neural architectures (i.e., monolingual Bangla BERT-base, multilingual BERT-cased/uncased, and XLM-RoBERTa). Important(most and least) terms are then identified using sensitivity analysis and layer-wise relevance propagation(LRP), before providing human-interpretable explanations. Finally, we compute comprehensiveness and sufficiency scores to measure the quality of explanations w.r.t faithfulness. Evaluations against machine learning~(linear and tree-based models) and neural networks (i.e., CNN, Bi-LSTM, and Conv-LSTM with word embeddings) baselines yield F1-scores of 78%, 91%, 89%, and 84%, for political, personal, geopolitical, and religious hates, respectively, outperforming both ML and DNN baselines. △ Less

Submitted 6 August, 2021; v1 submitted 28 December, 2020; originally announced December 2020.

Comments: Proceeding of IEEE International Conference on Data Science and Advanced Analytics (DSAA'2021), October 6-9, 2021, Porto, Portugal

arXiv:2011.05805 [pdf]

doi 10.1109/IS.2018.8710564

Crime Prediction Using Multiple-ANFIS Architecture and Spatiotemporal Data

Authors: Mashnoon Islam, Redwanul Karim, Kalyan Roy, Saif Mahmood, Sadat Hossain, M. Rashedur Rahman

Abstract: Statistical values alone cannot bring the whole scenario of crime occurrences in the city of Dhaka. We need a better way to use these statistical values to predict crime occurrences and make the city a safer place to live. Proper decision-making for the future is key in reducing the rate of criminal offenses in an area or a city. If the law enforcement bodies can allocate their resources efficient… ▽ More Statistical values alone cannot bring the whole scenario of crime occurrences in the city of Dhaka. We need a better way to use these statistical values to predict crime occurrences and make the city a safer place to live. Proper decision-making for the future is key in reducing the rate of criminal offenses in an area or a city. If the law enforcement bodies can allocate their resources efficiently for the future, the rate of crime in Dhaka can be brought down to a minimum. In this work, we have made an initiative to provide an effective tool with which law enforcement officials and detectives can predict crime occurrences ahead of time and take better decisions easily and quickly. We have used several Fuzzy Inference Systems (FIS) and Adaptive Neuro-Fuzzy Inference Systems (ANFIS) to predict the type of crime that is highly likely to occur at a certain place and time. △ Less

Submitted 7 November, 2020; originally announced November 2020.

Comments: Accepted Version, 2018 IEEE International Conference on Intelligent Systems (IS) September 25-27, Funchal - Madeira, Portugal

MSC Class: 03B52; 68T07 ACM Class: I.2.3; I.2.6

arXiv:2009.08730 [pdf, other]

doi 10.1088/1538-3873/aba840

FEEDBACK: a SOFIA Legacy Program to Study Stellar Feedback in Regions of Massive Star Formation

Authors: N. Schneider, R. Simon, C. Guevara, C. Buchbender, R. D. Higgins, Y. Okada, J. Stutzki, R. Guesten, L. D. Anderson, J. Bally, H. Beuther, L. Bonne, S. Bontemps, E. Chambers, T. Csengeri, U. U. Graf, A. Gusdorf, K. Jacobs, S. Kabanovic, R. Karim, M. Luisi, K. Menten, M. Mertens, B. Mookerjea, V. Ossenkopf-Okada , et al. (15 additional authors not shown)

Abstract: FEEDBACK is a SOFIA legacy program dedicated to study the interaction of massive stars with their environment. It performs a survey of 11 galactic high mass star forming regions in the 158 $μ$m (1.9 THz) line of CII and the 63 $μ$m (4.7 THz) line of OI. We employ the 14 pixel LFA and 7 pixel HFA upGREAT instrument to spectrally resolve (0.24 MHz) these FIR structure lines. With an observing time o… ▽ More FEEDBACK is a SOFIA legacy program dedicated to study the interaction of massive stars with their environment. It performs a survey of 11 galactic high mass star forming regions in the 158 $μ$m (1.9 THz) line of CII and the 63 $μ$m (4.7 THz) line of OI. We employ the 14 pixel LFA and 7 pixel HFA upGREAT instrument to spectrally resolve (0.24 MHz) these FIR structure lines. With an observing time of 96h, we will cover $\sim$6700 arcmin$^2$ at 14.1$''$ angular resolution for the CII line and 6.3$''$ for the OI line. The observations started in spring 2019 (Cycle 7). Our aim is to understand the dynamics in regions dominated by different feedback processes from massive stars such as stellar winds, thermal expansion, and radiation pressure, and to quantify the mechanical energy injection and radiative heating efficiency. The CII line provides the kinematics of the gas and is one of the dominant cooling lines of gas for low to moderate densities and UV fields. The OI line traces warm and high-density gas, excited in photodissociations regions with a strong UV field or by shocks. The source sample spans a broad range in stellar characteristics from single OB stars, to small groups of O stars, to rich young stellar clusters, to ministarburst complexes. It contains well-known targets such as Aquila, the Cygnus X region, M16, M17, NGC7538, NGC6334, Vela, and W43 as well as a selection of HII region bubbles, namely RCW49, RCW79, and RCW120. These CII maps, together with the less explored OI 63 $μ$m line, provide an outstanding database for the community. They will be made publically available and will trigger further studies and follow-up observations. △ Less

Submitted 18 September, 2020; originally announced September 2020.

Journal ref: PASP 2020, Volume 132, Number 1016; https://iopscience.iop.org/article/10.1088/1538-3873/aba840

arXiv:2004.12314 [pdf]

A Global Benchmark of Algorithms for Segmenting Late Gadolinium-Enhanced Cardiac Magnetic Resonance Imaging

Authors: Zhaohan Xiong, Qing Xia, Zhiqiang Hu, Ning Huang, Cheng Bian, Yefeng Zheng, Sulaiman Vesal, Nishant Ravikumar, Andreas Maier, Xin Yang, Pheng-Ann Heng, Dong Ni, Caizi Li, Qianqian Tong, Weixin Si, Elodie Puybareau, Younes Khoudli, Thierry Geraud, Chen Chen, Wenjia Bai, Daniel Rueckert, Lingchao Xu, Xiahai Zhuang, Xinzhe Luo, Shuman Jia , et al. (19 additional authors not shown)

Abstract: Segmentation of cardiac images, particularly late gadolinium-enhanced magnetic resonance imaging (LGE-MRI) widely used for visualizing diseased cardiac structures, is a crucial first step for clinical diagnosis and treatment. However, direct segmentation of LGE-MRIs is challenging due to its attenuated contrast. Since most clinical studies have relied on manual and labor-intensive approaches, auto… ▽ More Segmentation of cardiac images, particularly late gadolinium-enhanced magnetic resonance imaging (LGE-MRI) widely used for visualizing diseased cardiac structures, is a crucial first step for clinical diagnosis and treatment. However, direct segmentation of LGE-MRIs is challenging due to its attenuated contrast. Since most clinical studies have relied on manual and labor-intensive approaches, automatic methods are of high interest, particularly optimized machine learning approaches. To address this, we organized the "2018 Left Atrium Segmentation Challenge" using 154 3D LGE-MRIs, currently the world's largest cardiac LGE-MRI dataset, and associated labels of the left atrium segmented by three medical experts, ultimately attracting the participation of 27 international teams. In this paper, extensive analysis of the submitted algorithms using technical and biological metrics was performed by undergoing subgroup analysis and conducting hyper-parameter analysis, offering an overall picture of the major design choices of convolutional neural networks (CNNs) and practical considerations for achieving state-of-the-art left atrium segmentation. Results show the top method achieved a dice score of 93.2% and a mean surface to a surface distance of 0.7 mm, significantly outperforming prior state-of-the-art. Particularly, our analysis demonstrated that double, sequentially used CNNs, in which a first CNN is used for automatic region-of-interest localization and a subsequent CNN is used for refined regional segmentation, achieved far superior results than traditional methods and pipelines containing single CNNs. This large-scale benchmarking study makes a significant step towards much-improved segmentation methods for cardiac LGE-MRIs, and will serve as an important benchmark for evaluating and comparing the future works in the field. △ Less

Submitted 7 May, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

arXiv:2004.07807 [pdf, other]

Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network

Authors: Md. Rezaul Karim, Bharathi Raja Chakravarthi, John P. McCrae, Michael Cochez

Abstract: Exponential growths of social media and micro-blogging sites not only provide platforms for empowering freedom of expressions and individual voices but also enables people to express anti-social behaviour like online harassment, cyberbullying, and hate speech. Numerous works have been proposed to utilize these data for social and anti-social behaviours analysis, document characterization, and sent… ▽ More Exponential growths of social media and micro-blogging sites not only provide platforms for empowering freedom of expressions and individual voices but also enables people to express anti-social behaviour like online harassment, cyberbullying, and hate speech. Numerous works have been proposed to utilize these data for social and anti-social behaviours analysis, document characterization, and sentiment analysis by predicting the contexts mostly for highly resourced languages such as English. However, there are languages that are under-resources, e.g., South Asian languages like Bengali, Tamil, Assamese, Telugu that lack of computational resources for the NLP tasks. In this paper, we provide several classification benchmarks for Bengali, an under-resourced language. We prepared three datasets of expressing hate, commonly used topics, and opinions for hate speech detection, document classification, and sentiment analysis, respectively. We built the largest Bengali word embedding models to date based on 250 million articles, which we call BengFastText. We perform three different experiments, covering document classification, sentiment analysis, and hate speech detection. We incorporate word embeddings into a Multichannel Convolutional-LSTM (MConv-LSTM) network for predicting different types of hate speech, document classification, and sentiment analysis. Experiments demonstrate that BengFastText can capture the semantics of words from respective contexts correctly. Evaluations against several baseline embedding models, e.g., Word2Vec and GloVe yield up to 92.30%, 82.25%, and 90.45% F1-scores in case of document classification, sentiment analysis, and hate speech detection, respectively during 5-fold cross-validation tests. △ Less

Submitted 19 April, 2020; v1 submitted 11 April, 2020; originally announced April 2020.

Comments: This paper is under review in the Journal of Natural Language Engineering

arXiv:2004.04582 [pdf, other]

DeepCOVIDExplainer: Explainable COVID-19 Diagnosis Based on Chest X-ray Images

Authors: Md. Rezaul Karim, Till Döhmen, Dietrich Rebholz-Schuhmann, Stefan Decker, Michael Cochez, Oya Beyan

Abstract: Amid the coronavirus disease(COVID-19) pandemic, humanity experiences a rapid increase in infection numbers across the world. Challenge hospitals are faced with, in the fight against the virus, is the effective screening of incoming patients. One methodology is the assessment of chest radiography(CXR) images, which usually requires expert radiologist's knowledge. In this paper, we propose an expla… ▽ More Amid the coronavirus disease(COVID-19) pandemic, humanity experiences a rapid increase in infection numbers across the world. Challenge hospitals are faced with, in the fight against the virus, is the effective screening of incoming patients. One methodology is the assessment of chest radiography(CXR) images, which usually requires expert radiologist's knowledge. In this paper, we propose an explainable deep neural networks(DNN)-based method for automatic detection of COVID-19 symptoms from CXR images, which we call DeepCOVIDExplainer. We used 15,959 CXR images of 15,854 patients, covering normal, pneumonia, and COVID-19 cases. CXR images are first comprehensively preprocessed, before being augmented and classified with a neural ensemble method, followed by highlighting class-discriminating regions using gradient-guided class activation maps(Grad-CAM++) and layer-wise relevance propagation(LRP). Further, we provide human-interpretable explanations of the predictions. Evaluation results based on hold-out data show that our approach can identify COVID-19 confidently with a positive predictive value(PPV) of 91.6%, 92.45%, and 96.12%; precision, recall, and F1 score of 94.6%, 94.3%, and 94.6%, respectively for normal, pneumonia, and COVID-19 cases, respectively, making it comparable or improved results over recent approaches. We hope that our findings will be a useful contribution to the fight against COVID-19 and, in more general, towards an increasing acceptance and adoption of AI-assisted applications in the clinical practice. △ Less

Submitted 6 June, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

arXiv:1909.12996 [pdf, other]

Distributed Iterative Gating Networks for Semantic Segmentation

Authors: Rezaul Karim, Md Amirul Islam, Neil D. B. Bruce

Abstract: In this paper, we present a canonical structure for controlling information flow in neural networks with an efficient feedback routing mechanism based on a strategy of Distributed Iterative Gating (DIGNet). The structure of this mechanism derives from a strong conceptual foundation and presents a light-weight mechanism for adaptive control of computation similar to recurrent convolutional neural n… ▽ More In this paper, we present a canonical structure for controlling information flow in neural networks with an efficient feedback routing mechanism based on a strategy of Distributed Iterative Gating (DIGNet). The structure of this mechanism derives from a strong conceptual foundation and presents a light-weight mechanism for adaptive control of computation similar to recurrent convolutional neural networks by integrating feedback signals with a feed-forward architecture. In contrast to other RNN formulations, DIGNet generates feedback signals in a cascaded manner that implicitly carries information from all the layers above. This cascaded feedback propagation by means of the propagator gates is found to be more effective compared to other feedback mechanisms that use feedback from the output of either the corresponding stage or from the previous stage. Experiments reveal the high degree of capability that this recurrent approach with cascaded feedback presents over feed-forward baselines and other recurrent models for pixel-wise labeling problems on three challenging datasets, PASCAL VOC 2012, COCO-Stuff, and ADE20K. △ Less

Submitted 27 September, 2019; originally announced September 2019.

Comments: WACV 2020

arXiv:1909.04169 [pdf, other]

OncoNetExplainer: Explainable Predictions of Cancer Types Based on Gene Expression Data

Authors: Md. Rezaul Karim, Michael Cochez, Oya Beyan, Stefan Decker, Christoph Lange

Abstract: The discovery of important biomarkers is a significant step towards understanding the molecular mechanisms of carcinogenesis; enabling accurate diagnosis for, and prognosis of, a certain cancer type. Before recommending any diagnosis, genomics data such as gene expressions(GE) and clinical outcomes need to be analyzed. However, complex nature, high dimensionality, and heterogeneity in genomics dat… ▽ More The discovery of important biomarkers is a significant step towards understanding the molecular mechanisms of carcinogenesis; enabling accurate diagnosis for, and prognosis of, a certain cancer type. Before recommending any diagnosis, genomics data such as gene expressions(GE) and clinical outcomes need to be analyzed. However, complex nature, high dimensionality, and heterogeneity in genomics data make the overall analysis challenging. Convolutional neural networks(CNN) have shown tremendous success in solving such problems. However, neural network models are perceived mostly as `black box' methods because of their not well-understood internal functioning. However, interpretability is important to provide insights on why a given cancer case has a certain type. Besides, finding the most important biomarkers can help in recommending more accurate treatments and drug repositioning. In this paper, we propose a new approach called OncoNetExplainer to make explainable predictions of cancer types based on GE data. We used genomics data about 9,074 cancer patients covering 33 different cancer types from the Pan-Cancer Atlas on which we trained CNN and VGG16 networks using guided-gradient class activation maps++(GradCAM++). Further, we generate class-specific heat maps to identify significant biomarkers and computed feature importance in terms of mean absolute impact to rank top genes across all the cancer types. Quantitative and qualitative analyses show that both models exhibit high confidence at predicting the cancer types correctly giving an average precision of 96.25%. To provide comparisons with the baselines, we identified top genes, and cancer-specific driver genes using gradient boosted trees and SHapley Additive exPlanations(SHAP). Finally, our findings were validated with the annotations provided by the TumorPortal. △ Less

Submitted 9 September, 2019; originally announced September 2019.

Comments: In proc. of 19th IEEE International Conference on Bioinformatics and Bioengineering(IEEE BIBE 2019)

Journal ref: IEEE International Conference on Bioinformatics and Bioengineering(IEEE BIBE 2019)

arXiv:1908.01288 [pdf, other]

Drug-Drug Interaction Prediction Based on Knowledge Graph Embeddings and Convolutional-LSTM Network

Authors: Md. Rezaul Karim, Michael Cochez, Joao Bosco Jares, Mamtaz Uddin, Oya Beyan, Stefan Decker

Abstract: Interference between pharmacological substances can cause serious medical injuries. Correctly predicting so-called drug-drug interactions (DDI) does not only reduce these cases but can also result in a reduction of drug development cost. Presently, most drug-related knowledge is the result of clinical evaluations and post-marketing surveillance; resulting in a limited amount of information. Existi… ▽ More Interference between pharmacological substances can cause serious medical injuries. Correctly predicting so-called drug-drug interactions (DDI) does not only reduce these cases but can also result in a reduction of drug development cost. Presently, most drug-related knowledge is the result of clinical evaluations and post-marketing surveillance; resulting in a limited amount of information. Existing data-driven prediction approaches for DDIs typically rely on a single source of information, while using information from multiple sources would help improve predictions. Machine learning (ML) techniques are used, but the techniques are often unable to deal with skewness in the data. Hence, we propose a new ML approach for predicting DDIs based on multiple data sources. For this task, we use 12,000 drug features from DrugBank, PharmGKB, and KEGG drugs, which are integrated using Knowledge Graphs (KGs). To train our prediction model, we first embed the nodes in the graph using various embedding approaches. We found that the best performing combination was a ComplEx embedding method creating using PyTorch-BigGraph (PBG) with a Convolutional-LSTM network and classic machine learning-based prediction models. The model averaging ensemble method of three best classifiers yields up to 0.94, 0.92, 0.80 for AUPR, F1-score, and MCC, respectively during 5-fold cross-validation tests. △ Less

Submitted 4 August, 2019; originally announced August 2019.

arXiv:1904.12375 [pdf, other]

Rank Approximation of a Tensor with Applications in Color Image and Video Processing

Authors: Ramin Goudarzi Karim, Carmeliza Navasca, Da Yan

Abstract: We propose a block coordinate descent type algorithm for estimating the rank of a given tensor. In addition, the algorithm provides the canonical polyadic decomposition of a tensor. In order to estimate the tensor rank we use sparse optimization method using $\ell_1$ norm. The algorithm is implemented on single moving object videos and color images for approximating the rank. We propose a block coordinate descent type algorithm for estimating the rank of a given tensor. In addition, the algorithm provides the canonical polyadic decomposition of a tensor. In order to estimate the tensor rank we use sparse optimization method using $\ell_1$ norm. The algorithm is implemented on single moving object videos and color images for approximating the rank. △ Less

Submitted 28 April, 2019; originally announced April 2019.

arXiv:1811.08043 [pdf, other]

Recurrent Iterative Gating Networks for Semantic Segmentation

Authors: Rezaul Karim, Md Amirul Islam, Neil D. B. Bruce

Abstract: In this paper, we present an approach for Recurrent Iterative Gating called RIGNet. The core elements of RIGNet involve recurrent connections that control the flow of information in neural networks in a top-down manner, and different variants on the core structure are considered. The iterative nature of this mechanism allows for gating to spread in both spatial extent and feature space. This is re… ▽ More In this paper, we present an approach for Recurrent Iterative Gating called RIGNet. The core elements of RIGNet involve recurrent connections that control the flow of information in neural networks in a top-down manner, and different variants on the core structure are considered. The iterative nature of this mechanism allows for gating to spread in both spatial extent and feature space. This is revealed to be a powerful mechanism with broad compatibility with common existing networks. Analysis shows how gating interacts with different network characteristics, and we also show that more shallow networks with gating may be made to perform better than much deeper networks that do not include RIGNet modules. △ Less

Submitted 19 November, 2018; originally announced November 2018.

Comments: WACV 2019

Showing 1–50 of 67 results for author: Karim, R