-
Combating Financial Crimes with Unsupervised Learning Techniques: Clustering and Dimensionality Reduction for Anti-Money Laundering
Authors:
Ahmed N. Bakry,
Almohammady S. Alsharkawy,
Mohamed S. Farag,
Kamal R. Raslan
Abstract:
Anti-Money Laundering (AML) is a crucial task in ensuring the integrity of financial systems. One key challenge in AML is identifying high-risk groups based on their behavior. Unsupervised learning, particularly clustering, is a promising solution for this task. However, the use of hundreds of features to describe behavior results in a high-dimensional dataset that negatively impacts clustering performance. In this paper, we investigate the effectiveness of combining agglomerative hierarchical clustering with four dimensionality reduction techniques, namely Independent Component Analysis (ICA), Kernel Principal Component Analysis (KPCA), Singular Value Decomposition (SVD), and Locality Preserving Projections (LPP), to overcome the issue of high dimensionality in AML data and improve clustering results. This study aims to provide insights into the most effective way of reducing the dimensionality of AML data and enhancing the accuracy of clustering-based AML systems. The experimental results demonstrate that KPCA outperforms the other dimensionality reduction techniques when combined with agglomerative hierarchical clustering. This superiority is observed in the majority of situations, as confirmed by three distinct validation indices.
Submitted 14 February, 2024;
originally announced March 2024.
-
Confident Naturalness Explanation (CNE): A Framework to Explain and Assess Patterns Forming Naturalness
Authors:
Ahmed Emam,
Mohamed Farag,
Ribana Roscher
Abstract:
Protected natural areas are regions that have been minimally affected by human activities such as urbanization, agriculture, and other human interventions. To better understand and map the naturalness of these areas, machine learning models can be used to analyze satellite imagery. Specifically, explainable machine learning methods show promise in uncovering patterns that contribute to the concept of naturalness within these protected environments. Additionally, addressing the uncertainty inherent in machine learning models is crucial for a comprehensive understanding of this concept. However, existing approaches have limitations. They either fail to provide explanations that are both valid and objective or struggle to offer a quantitative metric that accurately measures the contribution of specific patterns to naturalness, along with the associated confidence. In this paper, we propose a novel framework called the Confident Naturalness Explanation (CNE) framework. This framework combines explainable machine learning and uncertainty quantification to assess and explain naturalness. We introduce a new quantitative metric that describes the confident contribution of patterns to the concept of naturalness. Furthermore, we generate an uncertainty-aware segmentation mask for each input sample, highlighting areas where the model lacks knowledge. To demonstrate the effectiveness of our framework, we apply it to a study site in Fennoscandia using two open-source satellite datasets.
Submitted 16 February, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Environmental and Safety Impacts of Vehicle-to-Everything Enabled Applications: A Review of State-of-the-Art Studies
Authors:
Jianhe Du,
Kyoungho Ahn,
Mohamed Farag,
Hesham Rakha
Abstract:
With the rapid development of communication technology, connected vehicles (CVs) have the potential, through the sharing of data, to enhance vehicle safety and reduce vehicle energy consumption and emissions. Numerous research efforts are quantifying the impacts of CV applications, assuming instant and accurate communication among vehicles, devices, pedestrians, infrastructure, the network, the cloud, and the grid, collectively known as V2X (vehicle-to-everything). The use of cellular vehicle-to-everything (C-V2X) to share data is emerging as an efficient means to achieve this objective. C-V2X Releases 14 and 15 utilize 4G LTE technology, and Release 16 utilizes the 5G New Radio (NR) technology. C-V2X can function without network infrastructure coverage and has a better communication range, improved latency, and greater data rates compared to older technologies. Such highly efficient interchange of information among all participants in a CV environment will not only provide timely data to enhance the capacity of the transportation system but can also be used to develop applications that enhance vehicle safety and minimize negative environmental impacts. However, before the full benefits of CVs can be achieved, there is a need to thoroughly investigate the effectiveness, strengths, and weaknesses of different CV applications, the communication protocols, the varied results with different CV market penetration rates (MPRs), the interaction of CVs and human-driven vehicles, the integration of multiple applications, and the errors and latencies associated with data communication. This paper reviews existing literature on the environmental, mobility, and safety impacts of CV applications, identifies the gaps in current CV research, and recommends future research directions.
Submitted 7 December, 2021;
originally announced February 2022.
-
Multi-objective Eco-Routing Model Development and Evaluation for Battery Electric Vehicles
Authors:
Kyoungho Ahn,
Youssef Bichiou,
Mohamed Farag,
Hesham A. Rakha
Abstract:
This paper develops and investigates the impacts of multi-objective Nash optimum (user equilibrium) traffic assignment on a large-scale network for battery electric vehicles (BEVs) and internal combustion engine vehicles (ICEVs) in a microscopic traffic simulation environment. Eco-routing is a technique that finds the most energy-efficient route. ICEV and BEV energy consumption patterns are significantly different with regard to their sensitivity to driving cycles. Unlike ICEVs, BEVs are more energy efficient on low-speed arterial trips than on highway trips. Different energy consumption patterns require different eco-routing strategies for ICEVs and BEVs. This study found that eco-routing could reduce energy consumption for BEVs but also significantly increased their average travel time. The simulation study found that multi-objective routing could reduce the energy consumption of BEVs by 13.5, 14.2, 12.9, and 10.7 percent, as well as the fuel consumption of ICEVs by 0.1, 4.3, 3.4, and 10.6 percent for "not congested", "slightly congested", "moderately congested", and "highly congested" conditions, respectively. The study also found that multi-objective user equilibrium routing reduced the average vehicle travel time by up to 10.1 percent compared to the standard user equilibrium traffic assignment for the highly congested conditions, producing a solution closer to the system optimum traffic assignment. The results indicate that multi-objective eco-routing can effectively reduce fuel/energy consumption with minimal impact on travel times for both BEVs and ICEVs.
Submitted 10 August, 2020;
originally announced April 2021.
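As a rough illustration of the multi-objective routing idea in the abstract above, the following sketch runs Dijkstra's algorithm over a weighted sum of travel time and energy per link. It is not the authors' model: the paper evaluates user equilibrium assignment in a microscopic simulation, whereas this is a single-vehicle shortest-path search. The function `best_route`, the weights `w_time` and `w_energy`, and the link values in `network` are all hypothetical, and the weights are assumed to convert minutes and kWh into one common cost.

```python
import heapq

def best_route(graph, source, target, w_time=0.5, w_energy=0.5):
    """Dijkstra over a weighted sum of travel time and energy per link.

    graph: {node: [(neighbor, time_min, energy_kwh), ...]}
    Returns (cost, path); raises ValueError if the target is unreachable.
    """
    queue = [(0.0, source, [source])]
    settled = set()
    while queue:
        cost, node, path = heapq.heappop(queue)
        if node == target:
            return cost, path
        if node in settled:
            continue
        settled.add(node)
        for nxt, time_min, energy_kwh in graph.get(node, []):
            if nxt not in settled:
                step = w_time * time_min + w_energy * energy_kwh
                heapq.heappush(queue, (cost + step, nxt, path + [nxt]))
    raise ValueError(f"no route from {source} to {target}")

# Hypothetical network: a fast highway leg (via H) versus a slower
# arterial leg (via R) that uses less energy, as assumed for a BEV.
network = {
    "A": [("H", 5.0, 2.0), ("R", 8.0, 0.8)],
    "H": [("B", 5.0, 2.0)],
    "R": [("B", 8.0, 0.8)],
}

if __name__ == "__main__":
    print(best_route(network, "A", "B", w_time=1.0, w_energy=0.0))
    print(best_route(network, "A", "B", w_time=0.0, w_energy=1.0))
```

With these made-up link values, a time-only weighting selects the highway and an energy-only weighting selects the arterial; intermediate weights trade one objective against the other.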
-
Hard Decision Cooperative Spectrum Sensing Based on Estimating the Noise Uncertainty Factor
Authors:
Hossam M. Farag,
Ehab Mahmoud Mohamed
Abstract:
Spectrum Sensing (SS) is one of the most challenging issues in Cognitive Radio (CR) systems. Cooperative Spectrum Sensing (CSS) is proposed to enhance the detection reliability of a Primary User (PU) in fading environments. In this paper, we propose a hard-decision-based CSS algorithm using energy detection while taking into account the noise uncertainty effect. In the proposed algorithm, two dynamic thresholds are toggled based on predicting the current PU activity, which can be successfully predicted using a simple successive averaging process over time. Their values are evaluated using an estimated value of the noise uncertainty factor. These dynamic thresholds are used to compensate for the noise uncertainty effect, increasing the probability of detection and decreasing the probability of false alarm. Theoretical analysis is performed on the proposed algorithm to derive its enhanced false alarm and detection probabilities compared to the conventional hard decision CSS. Moreover, simulation analysis is used to confirm the theoretical claims and demonstrate the high performance of the proposed scheme compared to the conventional CSS using different fusion rules.
Submitted 13 October, 2015;
originally announced October 2015.
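The abstract above gives no formulas, so the following is only a minimal sketch of the general idea under stated assumptions: each sensor compares its measured energy against one of two thresholds, the choice is toggled by a running average of past global decisions, and the one-bit local decisions are fused with an OR, AND, or majority rule. The threshold forms `nominal / rho` and `nominal * rho`, the 0.5 prediction cut-off, and all function names are hypothetical and are not taken from the paper.

```python
from statistics import fmean

def energy(samples):
    """Average energy of the received samples (the energy-detector statistic)."""
    return fmean(x * x for x in samples)

def pick_threshold(nominal, rho, predicted_active):
    """Toggle between two thresholds built from a noise uncertainty factor rho >= 1.

    Assumption: the lower threshold (nominal / rho) favors detection when the PU
    is predicted active; the higher one (nominal * rho) suppresses false alarms.
    """
    return nominal / rho if predicted_active else nominal * rho

def fuse(decisions, rule="majority"):
    """Combine one-bit local decisions at the fusion center."""
    votes = sum(decisions)
    if rule == "or":
        return votes >= 1
    if rule == "and":
        return votes == len(decisions)
    return votes > len(decisions) / 2  # majority rule

def sense(observations, nominal, rho, history, rule="majority"):
    """One sensing round: predict PU activity from the running average of past
    global decisions, toggle the threshold, decide locally, then fuse."""
    predicted_active = bool(history) and fmean(history) > 0.5
    threshold = pick_threshold(nominal, rho, predicted_active)
    local = [energy(obs) > threshold for obs in observations]
    decision = fuse(local, rule)
    history.append(1 if decision else 0)  # feeds the next round's prediction
    return decision
```

For example, with three sensors whose energies are 2.25, 2.25, and 0.01, `nominal=2.0`, and `rho=1.5`, an empty history selects the higher threshold (3.0) and the round reports no PU, while a history of recent detections selects the lower threshold (about 1.33) and the majority rule reports a PU.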
-
Soft Decision Cooperative Spectrum Sensing Based Upon Noise Uncertainty Estimation
Authors:
Hossam M. Farag,
Ehab Mahmoud Mohamed
Abstract:
Spectrum Sensing (SS) constitutes the most critical task in Cognitive Radio (CR) systems for Primary User (PU) detection. Cooperative Spectrum Sensing (CSS) is introduced to enhance the detection reliability of the PU in fading environments. In this paper, we propose a soft-decision-based CSS algorithm using energy detection that takes into account the noise uncertainty effect. In the proposed algorithm, two threshold levels are utilized based on predicting the current PU activity, which can be successfully predicted using a simple successive averaging process over time. The two threshold levels are evaluated based on estimating the noise uncertainty factor. In addition, they are toggled dynamically to compensate for the noise uncertainty effect, increasing the probability of detection and decreasing the probability of false alarm. Theoretical analysis is performed on the proposed algorithm to evaluate its enhanced false alarm and detection probabilities over the conventional soft decision CSS using different combining schemes. In addition, simulation results show the high efficiency of the proposed scheme compared to the conventional soft decision CSS, with high computational complexity enhancements.
Submitted 20 May, 2015;
originally announced May 2015.
-
Multicore Dynamic Kernel Modules Attachment Technique for Kernel Performance Enhancement
Authors:
Mohamed Farag
Abstract:
Traditional monolithic kernels dominated kernel structures for a long time, alongside small kernels, few hardware companies, and limited kernel functionality. The monolithic kernel structure became impractical as the number of hardware companies increased and kernel services came to be consumed by different users for many purposes. One of the biggest disadvantages of monolithic kernels is their inflexibility: all available modules must be included in kernel compilation, which is highly time-consuming. Lately, a new kernel structure was introduced through multicore operating systems. Unfortunately, many multicore operating systems, such as Barrelfish and FOS, are experimental. This paper aims to simulate the performance of multicore hybrid kernels through customized attachment and detachment of dynamic kernel modules on multicore machines. In addition, this paper proposes a new technique for loading dynamic kernel modules based on user needs and machine capabilities.
Submitted 20 November, 2012;
originally announced November 2012.
-
An Insight View of Kernel Visual Debugger in System Boot up
Authors:
Mohamed Farag
Abstract:
For many years, developers could not figure out the mystery of OS kernels. The main source of this mystery is the interaction between operating systems and hardware during system boot-up and kernel initialization. In addition, operating system kernels differ in their behavior in many situations. For instance, kernels act differently in race conditions, kernel initialization, and process scheduling. For such operations, kernel debuggers were designed to help in tracing kernel behavior and solving many kernel bugs. The importance of kernel debuggers is not limited to kernel code tracing; they can also be used in verification and performance comparisons. However, developers had to be familiar with debugger commands, which introduced difficulties for non-expert programmers. Later, several visual kernel debuggers were presented to make it easier for programmers to trace their kernel code and analyze kernel behavior. Nowadays, several kernel debuggers exist for solving this mystery, but very few support line-by-line debugging at run-time. In this paper, a generic approach for operating system source code debugging in graphical mode with line-by-line tracing support is proposed. In the context of this approach, system boot-up and the evaluation of two operating system schedulers from several points of view will be discussed.
Submitted 20 November, 2012;
originally announced November 2012.